Hi, This Magnify dataset shouldn't contain grasses. Something was mismatched #2248

Closed
gbif-portal opened this issue Oct 14, 2019 · 3 comments

Comments

@gbif-portal
Collaborator

Hi, This Magnify dataset shouldn't contain grasses. Something was mismatched


User: See in registry
System: Chrome 77.0.3865 / Mac OS X 10.14.6
Referer: https://www.gbif.org/occurrence/search?dataset_key=d596fccb-2319-42eb-b13b-986c932780ad&taxon_key=5289779
Window size: width 1757 - height 883
API log
Site log
System health at time of feedback: OPERATIONAL

@ManonGros

ManonGros commented Oct 14, 2019

Nor should it contain Rhinos: https://www.gbif.org/occurrence/search?publishing_org=ab733144-7043-4e88-bd4f-fca7bf858880&taxon_key=795
We need a flag for occurrence records that are based on sequences and have an Organism quantity of only 1 or 2 DNA sequence reads, regardless of whether a species is terrestrial or not.
https://data-blog.gbif.org/post/gbif-molecular-data-quality/

@thomasstjerne

I can verify that the mismatch isn't on our side. You will see Aegilops tauschii in the MGnify API here

I agree with @ManonGros that we need a way to flag and filter out occurrences from metagenomic datasets that are based on very few reads. Preferably the user would be able to select a minimum number of reads, or even better, a relative value based on Organism quantity and Sample size value.
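
To make the idea concrete, here is a rough sketch of what such a filter could look like. It assumes the relative value is simply Organism quantity divided by Sample size value, and that both are counted in the same unit (for example DNA sequence reads). The `Occurrence` class and the function names are hypothetical and only for illustration; this is not how GBIF implements anything.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Occurrence:
    # Hypothetical, minimal stand-in for an occurrence record.
    scientific_name: str
    organism_quantity: Optional[float]  # e.g. number of reads assigned to the taxon
    sample_size_value: Optional[float]  # e.g. total number of reads in the sample


def relative_quantity(occ: Occurrence) -> Optional[float]:
    """Return organism quantity / sample size, or None if it cannot be computed.

    Assumption: both values use the same unit, so the ratio is meaningful.
    """
    if occ.organism_quantity is None or not occ.sample_size_value:
        return None
    return occ.organism_quantity / occ.sample_size_value


def filter_by_reads(
    occurrences: List[Occurrence],
    min_reads: float = 0,
    min_relative: float = 0.0,
) -> List[Occurrence]:
    """Keep records that meet both the absolute and the relative threshold."""
    kept = []
    for occ in occurrences:
        rel = relative_quantity(occ)
        if rel is None:
            # No usable quantity information: the record cannot be assessed, so drop it.
            continue
        if occ.organism_quantity >= min_reads and rel >= min_relative:
            kept.append(occ)
    return kept
```

With something like this, a user could call `filter_by_reads(records, min_reads=3)` to drop records backed by only 1 or 2 reads, or `filter_by_reads(records, min_relative=0.00001)` to apply a relative cut-off instead. Whether records without quantity information should be dropped or kept is a design choice; the sketch drops them.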

@ManonGros

Closing since most vascular plants and vertebrates have been filtered out and we now have the relative organism quantity filter: https://www.gbif.org/occurrence/search?publishing_org=ab733144-7043-4e88-bd4f-fca7bf858880&advanced=1&relative_organism_quantity=0.00001,*
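
For anyone who wants to build such a search link with a different threshold, a small sketch follows. It only reuses the query parameters visible in the link above (`publishing_org`, `advanced`, `relative_organism_quantity`); reading `min,*` as "at least min" is an assumption based on that link, and the helper name `relative_quantity_search_url` is made up for this example.

```python
from urllib.parse import urlencode

SEARCH_URL = "https://www.gbif.org/occurrence/search"


def relative_quantity_search_url(publishing_org: str, minimum: str) -> str:
    """Build a portal search link with a lower bound on relative organism quantity.

    `minimum` is passed as a string so that small values such as "0.00001"
    are not rewritten in scientific notation.
    """
    params = {
        "publishing_org": publishing_org,
        "advanced": "1",
        # Open-ended range: lower bound, then "*" for no upper bound (assumed).
        "relative_organism_quantity": f"{minimum},*",
    }
    # Keep "," and "*" unescaped so the link matches the form used in this thread.
    return SEARCH_URL + "?" + urlencode(params, safe=",*")


url = relative_quantity_search_url("ab733144-7043-4e88-bd4f-fca7bf858880", "0.00001")
print(url)
```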
