Get coordinate quality categories #23
Comments
Discovered that:
Here's a breakdown for my test dataset:
Decimals for valuable coordinates: 1 177 |
Note: I think we could almost use the count API for this, e.g. http://api.gbif.org/v1/occurrence/count?datasetKey=4ce8e3f9-2546-4af1-b28d-e2eadf05dfd4&issue=COUNTRY_COORDINATE_MISMATCH. The reason it won't work for all of them is that you can't search for [...]
I think we can actually do a lot more of this with the regular occurrence search API: http://api.gbif.org/v1/occurrence/search?datasetKey=4ce8e3f9-2546-4af1-b28d-e2eadf05dfd4&hasCoordinate=true&issue=COUNTRY_COORDINATE_MISMATCH => count = 50330.
The biggest question is whether we can use negations, to get all coordinates with NO issues and all coordinates with multiple issues. I'll ask Tim.
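For reference, a minimal Python sketch of the kind of query above. The endpoint and the `datasetKey`, `hasCoordinate`, and `issue` parameters come from the URL in this comment; the function names are hypothetical, and `limit=0` is an assumption on my part (we only need the total, not the records themselves).

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

SEARCH_URL = "http://api.gbif.org/v1/occurrence/search"


def build_search_url(dataset_key, issue=None, has_coordinate=True):
    """Build an occurrence search URL for one dataset, optionally for one issue."""
    params = {
        "datasetKey": dataset_key,
        "hasCoordinate": "true" if has_coordinate else "false",
    }
    if issue is not None:
        # Filters are combined with AND, so this narrows the result set.
        params["issue"] = issue
    # Assumption: limit=0 asks for the total only, without returning records.
    params["limit"] = 0
    return SEARCH_URL + "?" + urlencode(params)


def fetch_count(url):
    """Return the `count` field of the JSON response (needs network access)."""
    with urlopen(url) as response:
        return json.load(response)["count"]
```

Calling `fetch_count(build_search_url("4ce8e3f9-2546-4af1-b28d-e2eadf05dfd4", issue="COUNTRY_COORDINATE_MISMATCH"))` should correspond to the count quoted above, although the number may have changed since.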
Asked Tim: there is no OR or NOT operator in the API, only AND. I think that means we can't use it for this use case. :-(
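Since the API can't express "no issues" or "more than one issue", one option is to do that tally on our side, on records we already have. A rough Python sketch, assuming each record is a dict with an `issues` list (as in the occurrence JSON); the function name, the category labels, and the sample records are made up for illustration and are not the categories agreed on in this issue.

```python
from collections import Counter


def tally_by_issue_count(records):
    """Count records with no issues, exactly one issue, or several issues."""
    tally = Counter({"no_issues": 0, "one_issue": 0, "multiple_issues": 0})
    for record in records:
        # Treat a missing or empty `issues` field as "no issues".
        issues = record.get("issues") or []
        if not issues:
            tally["no_issues"] += 1
        elif len(issues) == 1:
            tally["one_issue"] += 1
        else:
            tally["multiple_issues"] += 1
    return tally


# Made-up sample records, only to show the expected input shape.
sample = [
    {"issues": []},
    {"issues": ["COUNTRY_COORDINATE_MISMATCH"]},
    {"issues": ["COUNTRY_COORDINATE_MISMATCH", "PRESUMED_SWAPPED_COORDINATE"]},
]
print(tally_by_issue_count(sample))
```

This is the negation the search API can't do for us: the "no issues" bucket is simply whatever is left after checking each record locally.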
Concerning the [...]: I think a record with these issues is ready for use, while records with the other issues we categorized as minor are not (if you don't know the geodetic datum, you'll need to do some work to figure that out first). So I think records with these [...]
However, I agree that this information would be valuable to the data provider. He should be informed that GBIF fixed his coordinates, but maybe we can provide that information somewhere else.
OK. We keep the categories as they are. We can discuss this in the documentation (#32). All 4 fields are now available in CartoDB.
@peterdesmet, @bartaelterman: I'm ready to implement. Is the algorithm above (in "Process") still valid? Are there adjustments to be made?
@niconoe, the algorithm described in the issue body is still valid. No adjustments needed for now.
Description
For a given dataset, I want to know how many records have coordinates. I also want to know how many of those are useful, have issues, and maybe what their precision is.
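As a rough idea of what "precision" could mean here, a small Python sketch that counts the decimals of each coordinate and tallies them. It assumes the coordinates are available as text exactly as provided (a float would lose trailing zeros) and that each value is a valid number; both function names are hypothetical.

```python
from collections import Counter
from decimal import Decimal


def decimal_places(coordinate_text):
    """Return the number of digits after the decimal point, e.g. "4.35" -> 2."""
    # Assumption: the text is a plain finite number, not empty and not NaN.
    exponent = Decimal(coordinate_text).as_tuple().exponent
    return max(0, -exponent)


def decimals_breakdown(coordinate_texts):
    """Map number of decimals -> how many coordinates have that many."""
    return Counter(decimal_places(text) for text in coordinate_texts)
```

Counting decimals is only a proxy for precision: a coordinate converted from degrees and minutes can show many decimals without being that precise.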
Outcome
Terms we need
Questions
PRESUMED_SWAPPED_COORDINATE, PRESUMED_NEGATED_LATITUDE, and PRESUMED_NEGATED_LONGITUDE could be useful to the provider as minor issues, but to the user, these are quite valuable. Where would you group them?
Process