Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

More lenient interpretation of individualCount NULL #94

Closed
timrobertson100 opened this issue Mar 20, 2019 · 4 comments
Closed

More lenient interpretation of individualCount NULL #94

timrobertson100 opened this issue Mar 20, 2019 · 4 comments
Assignees

Comments

@timrobertson100
Copy link
Member

timrobertson100 commented Mar 20, 2019

It is relatively common that numerical values that should be NULL are presented as 0.

A consequence is that GBIF will infer absence when individualCount is stated as 0 but not intended. When an explicit statement of presence is given in occurrenceStatus we should be more lenient e.g.:

If (individualCount == 0 AND occurrenceStatus == ‘present’) then individualCount = NULL

This would be implemented somewhere around here

@timrobertson100
Copy link
Member Author

@MattBlissett can you please provide your opinion or implement if you agree?

@tucotuco
Copy link

tucotuco commented Mar 20, 2019 via email

@MattBlissett
Copy link
Member

This affects about 20,000,000 records.

We don't yet interpret occurrenceStatus. That can wait for pipelines, so I will handle the existing cases: present, Present and Presente and individualCount being 0.

What did you mean by "GBIF will infer absence"?

John: I think we will work on improving the documentation around GBIF interpretation later this year, at the moment too many questions require me to read the source code to see what we do.

@MattBlissett
Copy link
Member

This was implemented in pipelines, resulting in an issue: https://api.gbif.org/v1/occurrence/search?issue=INDIVIDUAL_COUNT_CONFLICTS_WITH_OCCURRENCE_STATUS

The interpretation is according to this table: gbif/pipelines#268 (comment)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants