New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
v0.8.0 #258
Conversation
…agen-corpus into quality_assesment_feature
…agen-corpus into dev
Four quick comments:
Also a question: Do we check diffs also in the csv-file? |
This comment was marked as resolved.
This comment was marked as resolved.
Curation 1880s
@ninpnin crashing tests |
Great! @ninpnin Do you rerun the wikidata fetching? Hopefully this should fix the test suite. |
FYI @ninpnin WD is cached a lot so sometime you need to wait some minutes.... |
31 correct 17 incorrect = 64.6% correct. |
@ninpnin Do we know what is the reason for the errors? should we remove the edits in the protocols for now and only update the mp database? Then file this edits of the protocols as a separate issue to fix? |
@MansMeg all errors are downstream from other things. |
Besides that, having a mismatch between the metadata and the corpus sounds like a terrible idea. |
|
|
…t_feature test: quality assesment feature
|
…ragraphs Sample paragraphs
…agen-corpus into curation-1870s
…1870s Curation 1870s
Fetch new metadata and rerun intro mapping.
Sample:
corpus/protocols/1910/prot-1910--ak--24.xml
Diff starting from line 188
corpus/protocols/1927/prot-1927--ak--20.xml
Diff starting from line 2837
corpus/protocols/1929/prot-1929--ak--11.xml
Diff starting from line 4392
corpus/protocols/1930/prot-1930--fk--46.xml
Diff starting from line 985
corpus/protocols/1932/prot-1932--ak--2.xml
Diff starting from line 166
corpus/protocols/1932/prot-1932--fk--33.xml
Diff starting from line 2837
corpus/protocols/1934/prot-1934--ak--47.xml
Diff starting from line 348
corpus/protocols/1936/prot-1936--fk--2.xml
Diff starting from line 436
corpus/protocols/1937/prot-1937--fk--22.xml
Diff starting from line 9897
corpus/protocols/1938/prot-1938--fk--23.xml
Diff starting from line 656
corpus/protocols/1939/prot-1939--ak--12.xml
Diff starting from line 5090
corpus/protocols/1940/prot-1940--fk--1.xml
Diff starting from line 344
corpus/protocols/1947/prot-1947--fk--1.xml
Diff starting from line 302
corpus/protocols/1948/prot-1948--ak--21.xml
Diff starting from line 3370
corpus/protocols/1948/prot-1948--fk--2.xml
Diff starting from line 2582
corpus/protocols/1949/prot-1949--fk--15.xml
Diff starting from line 1781
corpus/protocols/1951/prot-1951--fk--24.xml
Diff starting from line 9081
corpus/protocols/1952/prot-1952--fk--4.xml
Diff starting from line 1522
corpus/protocols/1954/prot-1954--fk--1.xml
Diff starting from line 2110
corpus/protocols/1955/prot-1955--ak--18.xml
Diff starting from line 18635
corpus/protocols/1956/prot-1956--ak--10.xml
Diff starting from line 12026
corpus/protocols/1956/prot-1956--fk--24.xml
Diff starting from line 5276
corpus/protocols/1959/prot-1959--ak--1.xml
Diff starting from line 436
corpus/protocols/1963/prot-1963--ak--26.xml
Diff starting from line 13718
corpus/protocols/1963/prot-1963--fk--16.xml
Diff starting from line 2307
corpus/protocols/1963/prot-1963--fk--20.xml
Diff starting from line 7381
corpus/protocols/1966/prot-1966--ak--27.xml
Diff starting from line 13023
corpus/protocols/1966/prot-1966--ak--8.xml
Diff starting from line 4909
corpus/protocols/1967/prot-1967--fk--22.xml
Diff starting from line 1696
corpus/protocols/1968/prot-1968--ak--26.xml
Diff starting from line 15967
corpus/protocols/1969/prot-1969--ak--16.xml
Diff starting from line 6292
corpus/protocols/1972/prot-1972--118.xml
Diff starting from line 1056
corpus/protocols/1973/prot-1973--112.xml
Diff starting from line 3774
corpus/protocols/1973/prot-1973--132.xml
Diff starting from line 950
corpus/protocols/1974/prot-1974--134.xml
Diff starting from line 2114
corpus/protocols/197576/prot-197576--114.xml
Diff starting from line 3904
corpus/protocols/197576/prot-197576--150.xml
Diff starting from line 6178
corpus/protocols/198384/prot-198384--129.xml
Diff starting from line 3511
corpus/protocols/198384/prot-198384--166.xml
Diff starting from line 8424
corpus/protocols/198485/prot-198485--113.xml
Diff starting from line 6262
corpus/protocols/198990/prot-198990--1.xml
Diff starting from line 488
corpus/protocols/199394/prot-199394--33.xml
Diff starting from line 3453
corpus/protocols/199798/prot-199798--118.xml
Diff starting from line 7905
corpus/protocols/199899/prot-199899--58.xml
Diff starting from line 2917
corpus/protocols/200102/prot-200102--102.xml
Diff starting from line 1888
corpus/protocols/201415/prot-201415--23.xml
Diff starting from line 9107
corpus/protocols/202021/prot-202021--4.xml
Diff starting from line 4761
Diff starting from line 4618