Converting output of NER Classifier to WANE Format #378

Merged
merged 7 commits into from Nov 14, 2019


SinghGursimran
Collaborator

Converts the output of the NER Classifier to WANE format.

#297

@ruebot
Member

ruebot commented Nov 14, 2019

Nice! Looks good to me.

{"timestamp":"20091027","url":"http://geocities.com/deaw_zx/image/small/araya/?D=A","named_entities":{"persons":[],"organizations":["Index of /"],"locations":[]},"digest":"sha1:7RXEVPRKK35YWNMM6GIM3JWECHJF35ZD"}
{"timestamp":"20091027","url":"http://geocities.com/illiton/second/?N=D","named_entities":{"persons":[],"organizations":[],"locations":[]},"digest":"sha1:LJ2YTLSFNPRRIB4WDD6DXX6Y33MWTUF6"}
{"timestamp":"20091027","url":"http://geocities.com/illiton/second/secretary.htm","named_entities":{"persons":["Gwyndlyn Caer Vyrddin m/k/a","Gwyn Krause"],"organizations":[],"locations":["Chillicothe"]},"digest":"sha1:F4MKYKCRHXDWUCAR2WWZVNMMGPVZBSGZ"}
{"timestamp":"20091027","url":"http://geocities.com/crowers_roost/heilenman.htm","named_entities":{"persons":["Ellen Keenan","Abram Heilenman","Earlene Napolean Smith","Rosina Heilenman Terrell","Mary Beth","John Frederick HEILENMAN","John Frederick","John Frederick","Mary F. STREET","John","Mary GILBERT","Mary F.","Rosina Shaw","John Frederick","Elizabeth","Alphus Conrad","Alphus Conrad","Abraham Street","Charles","Charles","Sarah","Sarah","Claude Lorraine","George Michael","George Michael","Rosina Shaw HEILENMAN","Rosina Shaw","Rosina Shaw","George C. TERRELL","George TERRELL","Margaret WHAN","George C.","Geraldine","Geraldine","Harry Gilbert","George Fletcher","Mary Florence","Mary Florence","Mary Florence","Albert D. SCOTT","Celena","Celena","John Versey","John Frederick HEILENMAN","John Frederick","John Frederick","Anna HOWELL","Anna","Anna May","Anna May","George HELLER","John Frederick","Sarah R. BROWN","Elizabeth HEILENMAN","Elizabeth","Elizabeth","John P. EDWARDS","Robert E.","Abraham Street HEILENMAN","Abraham Street","Abraham Street","Emily Fusilier PREVOST","James F.","James F.","Josephine Wesley","Alice Rose","Eben T.","Elizabeth P.","Claude Lorraine HEILENMAN","Claude Lorraine","Oliver H. Bair","Claude Lorraine","Lillian Boothrod MARTIN","Isaac MARTIN","Katherine JACKSON","Thomas E. Martindale","Lillian Boothrod","Claude Lorraine","Mabel R.","Mabel R.","Mary E. VALLON","Mary E.","Claude Lorraine","Minnie","Harry Gilbert TERRELL","Harry Gilbert","Harry Gilbert","Mamie M. CULLENEY","Gilbert Veasey","Florence Irene","Raymond Coursey","Mildred Elizabeth","Dorothea Oleta","Robert E. EDWARDS","Robert E.","Anna Marie CRAIG","Robert Marvin","Madeline Beckman","Irene Street HEILENMAN","Irene Street","Jules LAVIGNERE","Mary Louise Irene Street","Paul Rhinehart MULLER","Claude Lorraine HEILENMAN","Claude Lorraine","Spencer T. Videon","Claude Lorraine","Mary Agnes SPOTTS","Benjamin Franklin SPOTTS","Mary Amelia SAUTTER","Mary Agnes","Claude Lorraine","Benjamin Franklin","Charles Emory","Leslie Harry","Dorothy Agnes","Lillian Boothrod","Robert Kenton"],"organizations":["Majesty First Generation First Generation","Methodist Episcopal","Emory Street","Drexel Hill","Drexel Hill","Drexel Hill","Geocities"],"locations":["Stutgart","Wuertemberg","Germany","Stricklersville","Chester County","Flint Hill ME Church Cemetery","Stricklersville","London Britain Township","Chester County","Elkton","Cecil County","MD.","Wilmington","Pennsylvania","Philadelphia","PA.","Stricklersville","Chester County","PA.","Delaware","Philadelphia","PA.","Cherry Hill","Maryland","Cherry Hill","Maryland","Philadelphia","PA.","New Jersey","Near Newark","Delaware","Pennsylvania","Pennsylvania","N. Peach Street","Philadelphia","Montrose Cemetery","Upper Darby","Delaware County","PA.","W. Chestnut Street","Phila","Warren Street","Phila","PA.","Wilmington","New Castle County","Lancaster Avenue","Philadelphia","Wilmington","Delaware","Philadelphia","Delaware","Emory Street","New Jersey","Emory Street","Colwyn","Delaware County","Montrose Cemetery","Upper Darby","Delaware County","PA.","Cherry Hill","Maryland","Wilmington","New Castle County","Middletown","Delaware County","Arlington Cemetery","Shadeland Avenue","PA.","Phila","Frankford Avenue","Philadelphia","PA.","Albemarle Avenue","Delaware County"]},"digest":"sha1:IWQODRPWLJWVFTQU666QBOVMLRJAKDIH"}
{"timestamp":"20091027","url":"http://geocities.com/aas_andromeda/images/?M=A","named_entities":{"persons":[],"organizations":[],"locations":[]},"digest":"sha1:WM24LF6DFQYEQ7QLSG26EDOGFV72PJW6"}
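For anyone consuming this output downstream, here is a minimal sketch of reading it, assuming each record is one JSON object per line with the fields shown in the sample (`timestamp`, `url`, `named_entities`, `digest`). The helper names `parse_wane_lines` and `count_entities` are hypothetical and are not part of this PR or of the toolkit.

```python
import json
from collections import Counter

# Two records copied from the sample output in this thread, one JSON object per line.
SAMPLE = """\
{"timestamp":"20091027","url":"http://geocities.com/illiton/second/?N=D","named_entities":{"persons":[],"organizations":[],"locations":[]},"digest":"sha1:LJ2YTLSFNPRRIB4WDD6DXX6Y33MWTUF6"}
{"timestamp":"20091027","url":"http://geocities.com/illiton/second/secretary.htm","named_entities":{"persons":["Gwyndlyn Caer Vyrddin m/k/a","Gwyn Krause"],"organizations":[],"locations":["Chillicothe"]},"digest":"sha1:F4MKYKCRHXDWUCAR2WWZVNMMGPVZBSGZ"}
"""


def parse_wane_lines(text):
    """Yield one dict per non-empty line (hypothetical helper, not from the PR)."""
    for line in text.splitlines():
        if line.strip():
            yield json.loads(line)


def count_entities(records):
    """Count entity mentions per category (persons, organizations, locations)."""
    totals = Counter()
    for record in records:
        for category, names in record["named_entities"].items():
            totals[category] += len(names)
    return totals


records = list(parse_wane_lines(SAMPLE))
totals = count_entities(records)
for category in ("persons", "organizations", "locations"):
    print(category, totals[category])
```

The counts include repeated mentions, since the sample output lists the same name more than once within a record; deduplicate with `set(names)` if unique entities are wanted.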

@codecov

codecov bot commented Nov 14, 2019

Codecov Report

Merging #378 into master will decrease coverage by 0.21%.
The diff coverage is 0%.

@@            Coverage Diff             @@
##           master     #378      +/-   ##
==========================================
- Coverage   76.37%   76.16%   -0.22%     
==========================================
  Files          40       40              
  Lines        1414     1418       +4     
  Branches      268      268              
==========================================
  Hits         1080     1080              
- Misses        217      221       +4     
  Partials      117      117
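The figures in the report can be checked by hand. The sketch below assumes coverage is computed as hits divided by total lines, which is consistent with the table (hits + misses + partials equals the line count on both sides). It also suggests why the summary says 0.21% while the table says -0.22%: the first matches the difference of the two displayed percentages, the second matches the rounded exact difference. That reading is an inference from the numbers, not documented Codecov behaviour.

```python
from fractions import Fraction

# Values taken from the coverage diff above.
hits = 1080
lines_before, lines_after = 1414, 1418
misses_before, misses_after = 217, 221
partials = 117

# Sanity check: every line is a hit, a miss, or a partial.
assert hits + misses_before + partials == lines_before
assert hits + misses_after + partials == lines_after

# Assumed formula: coverage = hits / lines, expressed as a percentage.
cov_before = Fraction(hits, lines_before) * 100
cov_after = Fraction(hits, lines_after) * 100
delta = cov_after - cov_before

print(f"before: {float(cov_before):.4f}%")
print(f"after:  {float(cov_after):.4f}%")
print(f"delta:  {float(delta):.4f} percentage points")
```

The four added lines are all misses, which is what "the diff coverage is 0%" means: none of the new lines are exercised by the test suite.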

@ruebot ruebot merged commit f9ce826 into archivesunleashed:master Nov 14, 2019