New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
DwC-A: cannot import occurrence data for existing species #2581
Comments
Problem seems to be that the importer is trying to locate I could probably change the way existing names are located when restricting is enabled @mjy? This would mean ignoring higher classification if the name cannot be found in the provided parents path (would also fix cases when the dataset does not agree with the existing classification in the database). This requires a bit significant algorithm change. |
I think that he doesn't want to restrict, he wants them to be created, correct @sergeitarasov? He is just missing an option somewhere? |
I can envision many cases where classification does not match. I would say, that the classification should be ignored. The name string should match. There could be some issues so. For example we have both Protonym and Combination with the same ScientificName. We may also have homonyms. In some cases a manual resolution would still be required. |
Following @mjy, my understanding, that @sergeitarasov specifically restricted creation of new names, he wants to link specimen records to the existing classification. That would be the requirements in most of the cases when we import data to 3i Auchenorrhyncha project as well. |
We should definitely maintain the mode where import only succeeds when the hierarchy fully matches as an option, and the default. Having an alternate mode where name matches OTU#name or Otu.taxon_name.cached only (and there is only one match), as you imply, is also useful. |
Yep, that's correct @proceps, I would like to link the records to the existing sp. This going to be the most frequent task for me to import with DwC-A. Do you have any idea of how I can fix it now? Adding a higher-level taxon on the csv? |
Yes, adding higher level will make it work. It must match all the way up. |
It is hard to envision managing classification in DwC. We have a table for 250 holotypes (just species name), updating the higher classification all the way up, this this would be a job comparable to creating collection objects manually using Comprehensive task. |
Current classification on TW: 'Parachorius thomsoni -> Parachorius-> Parachoriini-> Scarabaeinae-> Root' |
Not too hard. Pre-step is to build something like geographic name matcher service we have. You paste in one column of names, you get the higher matching names back, you paste those into your columns. Again, both modes are warranted, I'm not debating that, but people importing data from diverse datasets are going to want the strict mode as well. |
This is literally the challenge everyone wants to solve "trivially", which is anything but trivial when you want to make many decisions about your data. For your data @proceps you have already pre-validated it all, this is different from others bringing in data that they haven't looked at. |
I added 'higherCLassification': Scarabaeinae|Parachoriini|Parachorius |
Which project is this? Is it in production? |
Yep, in production 'Dung_Beetles' |
Getting production database into my development machine for testing. Will take me around 15 mins to setup. |
@sergeitarasov Sorry late reply, had some problems with the database and long meeting afterwards. I tried with the attached spreadsheet and it had no trouble handling the name, but complained that the repository referenced in Spreadsheet used: DwC_Parach.xlsx ( |
Thanks @LocoDelAssembly! |
@sergeitarasov right, sorry. On first try I forgot to use existing nomenclature, so it created the duplicate, but on second try I enabled and still succeeded. Investigating why is happening... |
Second try works for me too (with the restriction). TW adds the records to P. thomsoni that was previously imported with DwC-A but not to the original P. thomsoni. |
@sergeitarasov forgot to mention, this fix won't be available until we release 0.20.1. Please be sure to delete the duplicate name created by the importer to avoid confusion. |
Works for me now! Thanks for the help @LocoDelAssembly :)
|
Can we close this? |
As an experiment, I tried to import (many times with different modifications) one occurrence record using DwC-A (attached) with ‘restrict record to existing nmcl’ to match the species that is already in TW. But it does not work:
Protonym Parachorius not found with that name and/or classification. Importing new names is disabled by import settings.
I wonder how can I fix it? :)
DwC_Parach.xlsx
?
The text was updated successfully, but these errors were encountered: