Skip to content

Commit

Permalink
add tse related data and script
Browse files Browse the repository at this point in the history
  • Loading branch information
rafonseca committed May 10, 2017
1 parent a1826c1 commit 3e2ad7d
Showing 1 changed file with 5 additions and 0 deletions.
5 changes: 5 additions & 0 deletions CONTRIBUTING.md
Original file line number Diff line number Diff line change
Expand Up @@ -147,6 +147,9 @@ fetch_latest_backup('data/')
- Agreements: `YYYY-MM-DD-agreements.xz`
- Amendments: `YYYY-MM-DD-amendments.xz`

##### Electoral information
1. `src/fetch_tse_data.py` downloads datasets files from TSE website and organize them in the dataset `data/YYYY-MM-DD-tse-candidates.xz`.

##### Companies and Non-Profit Entities with sanctions (CEIS, CEPIM and CNEP).

1. `src/fetch_federal_sanctions.py` downloads all three datasets files (CEIS, CEPIM and CNEP) from official source. The script gets the lastest version available for each dataset, unpacks, translates columns to english and saves them into `data/`. The files are named as follows:
Expand All @@ -165,6 +168,8 @@ All files are named with a [ISO 8601](https://en.wikipedia.org/wiki/ISO_8601) da
1. `data/YYYY-MM-DD-companies.xz`: Dataset with suppliers info containing all the fields offered in the [Federal Revenue alternative API](http://receitaws.com.br) and complemented with geolocation (latitude and longitude) gathered from Google Maps.
1. `data/YYYY-MM-DD-congressperson_relatives.xz` contains data on the relatives of congresspeople and the nature of their relationship.
1. `data/YYYY-MM-DD-congressperson_relatives_raw.xz` also contains data on the relatives of congresspeople, but is only created if `src/get_family_names.py` fails to handle some names.
1. `data/YYYY-MM-DD-tse-candidates.xz` contains information about politicians candidacy over the last years. Can be used to extract a list of all politicians in Brazil.


## Four moments

Expand Down

0 comments on commit 3e2ad7d

Please sign in to comment.