Scripts for processing and analyzing federal lobbyist disclosure data reporting contributions to political campaigns
palewire/sopr-contribs
master
Name already in use
Code
-
Clone
Use Git or checkout with SVN using the web URL.
Work fast with our official CLI. Learn more.
- Open with GitHub Desktop
- Download ZIP
Sign In Required
Please sign in to use Codespaces.
Launching GitHub Desktop
If nothing happens, download GitHub Desktop and try again.
Launching GitHub Desktop
If nothing happens, download GitHub Desktop and try again.
Launching Xcode
If nothing happens, download Xcode and try again.
Launching Visual Studio Code
Your codespace will open once ready.
There was a problem preparing your codespace, please try again.
A script that fetches, parses and archives the XML data dumps of lobbyist's political contributions published by The Senate Office of Public Records. Zips files containing the XML are: 1. Downloaded and unzipped. 2. Parsed out into flat text files and stored in a timestamped folder structure. 3. Imported to a SQLite database. The ultimate goal is for a series of SQL statements to scrub and cut the data to account for flaws in the reporting system first uncovered by Bill Allison and Anupama Narayanswamy of The Sunlight Foundation.
About
Scripts for processing and analyzing federal lobbyist disclosure data reporting contributions to political campaigns

Formed in 2009, the Archive Team (not to be confused with the archive.org Archive-It Team) is a rogue archivist collective dedicated to saving copies of rapidly dying or deleted websites for the sake of history and digital heritage. The group is 100% composed of volunteers and interested parties, and has expanded into a large amount of related projects for saving online and digital history.
