New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Parse nasty names with ambivalent specific epithets #53
Comments
created by @dimus at https://gitlab.com/gogna/gnparser/-/issues/43 mentioned in issue #86 |
created by @dimus at https://gitlab.com/gogna/gnparser/-/issues/44 changed the description |
created by @dimus at https://gitlab.com/gogna/gnparser/-/issues/45 changed the description |
created by @dimus at https://gitlab.com/gogna/gnparser/-/issues/46 Thanks for the kind words and more names for this ticket @joelnitta. I added them to the description of the issue. |
created by @dimus at https://gitlab.com/gogna/gnparser/-/issues/47 changed the description |
created by @joelnitta at https://gitlab.com/gogna/gnparser/-/issues/48 Thanks for the great program! This is a lifesaver for taxonomic workflows. Acrostichum nudum, Adiantum nudum, Aspidium nudum, Athyrium nudum, Blechnum nudum, Lycopodium nudum, Nephrodium nudum, Polypodium nudum, Polystichum nudum, Psilotum nudum |
created by @dimus at https://gitlab.com/gogna/gnparser/-/issues/49 Also from GlobalNamesArchitecture/gnparser#331 Looks like "le" is used as part of an author and as part of a specific epithet. I also have a suspision that names with "le" as specific epithet are really have epithet separated by a space! http://gni.globalnames.org/name_strings?commit=Search&page=2&search_term=sp%3Ale We probably should make a dictionary where it is an author and where it is a name, and normalize them accordingly... Big job. |
Hi Dima, Here are a few more names for your whitelist. These are all from the current version of the Catalogue of Life (COL-2021-06-10): Navicula bacterium (diatom) Also, names don’t get parsed if the generic name is too short, but there are a few two letter genera: Do holotrichius (beetle) Maybe these could get whitelisted, too? |
@KatjaSchulz thanks for more 'nasty' names, I am going to bump priority up for this issue |
Oh, I thought I have all two-letter genera accounted for: TwoLetterGenus <- ('Ca' / 'Ea' / 'Ge' / 'Ia' / 'Io' / 'Ix' / 'Lo' / 'Oa' / |
created by @dimus at https://gitlab.com/gogna/gnparser/-/issues/53
@diatomsRcool, @KatjaSchulz and @joelnitta found the following names:
We need to double check that they are 'real' and whitelist the real ones in rules O.o
The text was updated successfully, but these errors were encountered: