
United States of America (Senate): refresh pre-115 data #23769

Merged
tmtmtmtm merged 3 commits into master on Jan 4, 2017

Conversation

tmtmtmtm
Contributor

@tmtmtmtm tmtmtmtm commented Jan 4, 2017

Morph is currently failing on the primary scraper, so run the scraper
locally and then export all data except the 115th Congress (which will
be handled in a separate change).
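The "export everything except the 115th Congress" step could be sketched roughly as below. This is only an illustration, not the actual export tooling: the `term` column name, the `rows_excluding_term` helper, and the sample rows are all assumptions made for the example.

```python
import csv
import io


def rows_excluding_term(csv_text, excluded_term="115", term_column="term"):
    """Return all CSV rows whose term column differs from excluded_term.

    The column name "term" is an assumption about the scraper output.
    """
    reader = csv.DictReader(io.StringIO(csv_text))
    return [row for row in reader if row.get(term_column) != excluded_term]


# Hypothetical sample data, not taken from the real scraper output.
sample = "id,name,term\nA1,Example One,114\nA2,Example Two,115\n"
kept = rows_excluding_term(sample)
print([row["id"] for row in kept])
```

Filtering on the term column keeps the 115th Congress rows out of this change, so they can be added in the follow-up.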

Add memberships from sources/morph/data.csv
Merging with sources/morph/socialmedia.csv
Data Mismatches
* 442 of 542 unmatched
	{:id=>"D000612"}
	{:id=>"L000565"}
	{:id=>"G000563"}
	{:id=>"R000575"}
	{:id=>"M001189"}
	{:id=>"W000791"}
	{:id=>"C001059"}
	{:id=>"C000059"}
	{:id=>"L000563"}
	{:id=>"E000215"}
Merging with sources/morph/wikidata.csv
Data Mismatches
  ☁ Mismatch in birth_date for 40bd653f-fc86-4ed1-88fe-4950ff9ba8c4 (1952-07-21) vs 1952-06-21 (for Q720521)
  ☁ Mismatch in birth_date for 443ddc59-8344-49fe-bd7a-521427d275c5 (1909-01-01) vs 1909-01-02 (for Q319129)
  ☁ Mismatch in birth_date for b7e8d905-4681-450a-b129-dc1e79ab0338 (1955-09-28) vs 1955-09-29 (for Q1691395)
  ☁ Mismatch in instagram for 025093e4-e646-43d0-8de5-f1a88aca3a98 (senatorrandpaul) vs drrandpaul (for Q463557)
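For readers unfamiliar with these warnings, a mismatch check of this kind can be sketched as follows. This is a minimal illustration, not the project's merge code: the `find_mismatches` helper and the rule "report only when both sides have a value and they differ" are assumptions. The sample values are copied from the first warning above.

```python
def find_mismatches(existing, incoming, fields):
    """Return (field, existing_value, incoming_value) tuples for fields
    where both records have a value and the values differ."""
    mismatches = []
    for field in fields:
        old, new = existing.get(field), incoming.get(field)
        if old and new and old != new:
            mismatches.append((field, old, new))
    return mismatches


# Values taken from the first birth_date warning in the log above.
existing = {"id": "40bd653f-fc86-4ed1-88fe-4950ff9ba8c4", "birth_date": "1952-07-21"}
incoming = {"id": "Q720521", "birth_date": "1952-06-21"}

for field, old, new in find_mismatches(existing, incoming, ["birth_date", "instagram"]):
    print(f"Mismatch in {field} for {existing['id']} ({old}) vs {new} (for {incoming['id']})")
```

Under this rule a field missing on either side is not reported, which would explain why only conflicting values show up as warnings.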

Adding GenderBalance results from sources/gender-balance/results.csv
  ⚥ data for 569; 0 added

Adding OCD names from sources/ocd/divisions.csv

Top identifiers:
  310 x google_entity_id
  310 x thomas
  310 x bioguide
  310 x uscongress
  310 x wikipedia

Creating names.csv
Creating unstable/positions.csv
  Unknown position (x1): Q5405633 research assistant — e.g. Q6294
  Unknown position (x1): Q7140693 partner — e.g. Q6294
  Unknown position (x1): Q212071 rector — e.g. Q419976
  Unknown position (x2): Q23761019 intern — e.g. Q6294
  Unknown position (x2): Q140686 chairperson — e.g. Q6294
  Unknown position (x6): Q2824523 board member — e.g. Q6294
Persons matched to Wikidata: 310 ✓ 
Parties matched to Wikidata: 3 ✓ 

@everypoliticianbot
Member

Summary of changes in data/United_States_of_America/Senate/ep-popolo-v1.0.json:

People

Added

No people added

Removed

No people removed

Name Changes

  • b70474ad-0b14-4495-8447-be2902dfeda1: Gary Peters → Gary C. Peters

Additional Name Changes

  • b70474ad-0b14-4495-8447-be2902dfeda1 (Gary Peters): Added: Gary C. Peters.

Wikidata Changes

No changes

Organizations

Added

No organizations added

Removed

No organizations removed

Memberships

Added

term/101

  • Gordon Humphrey ( - 1990-12-04)

  • Bob Smith (1990-12-07 - )

term/102

  • Byron Dorgan (1992-12-14 - )

  • Albert Gore, Jr. ( - 1993-01-02)

term/103

  • Fred Thompson (1994-12-02 - )

term/104

  • Robert Dole ( - 1996-06-11)

  • Sheila Frahm (1996-06-11 - 1996-11-05)

term/106

  • Zell Miller (2000-07-24 - )

term/97

  • Nicholas Brady (1982-04-12 - 1982-12-27)

  • John Chafee

  • John Danforth

  • Frank R. Lautenberg (1982-12-27 - )

  • Howard Metzenbaum

  • Harrison Williams, Jr. ( - 1982-03-11)

  • Edward Zorinsky

term/99

  • James Broyhill (1986-07-14 - 1986-11-04)

  • John East ( - 1986-06-29)

  • James Sanford (1986-11-05 - )

Removed

term/101

  • Gordon Humphrey

  • Bob Smith (1990-01-01 - )

term/102

  • Byron Dorgan (1992-01-01 - )

  • Albert Gore, Jr.

term/103

  • Albert Gore, Jr. ( - 1993-12-31)

  • Fred Thompson (1994-01-01 - )

term/104

  • Robert Dole

  • Sheila Frahm (1996-01-01 - )

term/106

  • Zell Miller (2000-01-01 - )

term/97

  • Nicholas Brady (1982-01-01 - )

  • Frank R. Lautenberg (1982-01-01 - )

  • Harrison Williams, Jr.

term/99

  • James Broyhill (1986-07-14 - )

  • John East

  • James Sanford (1986-01-01 - )
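The Added/Removed lists above read like a set difference over membership records: a membership whose dates changed appears once under Removed (old dates) and once under Added (new dates). A rough sketch of that idea follows. The `diff_memberships` helper and the `(term, name, start, end)` tuple layout are assumptions for illustration, not the bot's actual implementation. The sample uses the Zell Miller entry from term/106.

```python
def diff_memberships(before, after):
    """Compare two lists of (term, name, start_date, end_date) tuples.

    Returns (added, removed), each sorted, using plain set difference.
    """
    before_set, after_set = set(before), set(after)
    added = sorted(after_set - before_set)
    removed = sorted(before_set - after_set)
    return added, removed


# Zell Miller's term/106 membership before and after this refresh.
before = [("term/106", "Zell Miller", "2000-01-01", "")]
after = [("term/106", "Zell Miller", "2000-07-24", "")]

added, removed = diff_memberships(before, after)
print("Added:", added)
print("Removed:", removed)
```

Seen this way, most of the changes in this pull request replace a `YYYY-01-01` start date or a missing end date with a precise one.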

Terms

Added

No terms added

Removed

No terms removed

Elections

Added

No elections added

Removed

No elections removed

@tmtmtmtm tmtmtmtm merged commit 5ea6aaa into master Jan 4, 2017
@tmtmtmtm tmtmtmtm removed the 3 - WIP label Jan 4, 2017
@chrismytton chrismytton deleted the us-senate-refresh-114 branch February 13, 2017 09:43