Spelling errors during Plazi extraction #10
Comments
Thanks @ajacsherman
@ajacsherman thanks for sharing your detailed feedback. Curious to see what @flsimoes comes up with.
Just from the list itself I can see them being OCR misreads.
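One way to surface likely OCR misreads like these is to compare each extracted name against a trusted name list and flag near misses. The following is only a minimal sketch: the function name `flag_possible_misreads`, the `cutoff` value, and the two short name lists are hypothetical and for illustration, and it assumes the intended spelling of the name fixed later in this thread is Micronycteris schmidtorum. It is not the process used to build hmw.csv.

```python
from difflib import get_close_matches


def flag_possible_misreads(extracted_names, reference_names, cutoff=0.85):
    """Map each extracted name missing from the reference list to its closest reference name."""
    reference = set(reference_names)
    suggestions = {}
    for name in extracted_names:
        if name in reference:
            continue  # exact match, nothing to flag
        # Keep only the single best candidate above the similarity cutoff.
        matches = get_close_matches(name, reference_names, n=1, cutoff=cutoff)
        if matches:
            suggestions[name] = matches[0]
    return suggestions


# Hypothetical inputs: a tiny reference list and two extracted names.
reference_names = ["Micronycteris schmidtorum", "Scotomanes ornatus"]
extracted_names = ["Mucronycteris schmidtorum", "Scotomanes ornatus"]

suggestions = flag_possible_misreads(extracted_names, reference_names)
for extracted, suggested in suggestions.items():
    print(f"{extracted} -> possibly {suggested}")
```

A check like this only proposes candidates. Names that differ because of a taxonomic update rather than a misread would still need manual review against the original document.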
I found another.
@ajacsherman for Lophostoma silvicolum, the HMW actually calls it silvicola, and GBIF lists it that way too (https://www.gbif.org/species/5706810). I understand that the name has since been updated, but we keep the name as written in the original document. EDIT: the same applies to Carollia brevicaudum.
@ajacsherman You listed Anthops ornatus, but the HMW calls it Scotomanes ornatus.
@flsimoes @ajacsherman very neat that we are getting into the details here. I am hoping to discuss ways to systematically capture the name relations, so that we can easily traverse them and infer the relations between the names. In this case, though, it appears that batnames knows about both names. @ajacsherman can you help figure out what is going on?
but
@ajacsherman Not sure what to do with Mirimiri acrodonta.
Found a few more...
Fixed Mucronycteris schmidtorum.
@flsimoes @ajacsherman thanks for all the hard work on this. Please let me know when it is a good time to generate a new version of hmw.csv.
I think all of the names that were listed here and that we could fix have been fixed.
As requested by @flsimoes, I've prepared a new version of hmw.csv and derived data products. Please review the latest versions at:
@ajacsherman @flsimoes I am assuming that you've reviewed version https://github.com/jhpoelen/hmw/releases/tag/0.4 and that all expected fixes are present. Closing issue. Please feel free to comment or re-open if issues remain.
They should all be fixed, yes (apart from those where we have missing pages).
@flsimoes thanks for confirming. @ajacsherman can you confirm that the issues you documented above have been addressed in v0.4 of https://github.com/jhpoelen/hmw ?
@jhpoelen @flsimoes
The following were spelling errors from the Plazi extraction process followed by their correct spelling and docID. Thanks for your help.