New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
occ_data
/occ_search
: maybe not use species
field for main name field
#329
Comments
In result sets, I prefer to have as first column some kind of ID. For occurrences I would expect “occurrenceID” to be the first column. Constructing names from parts might lead to names that are not wellformed (e.g. plant names have “var.” in their name)... that is, if we care. I would expect scientificName. My 2 cents |
Not a fan of scientificName or acceptedScientificName because of the
authors, as Scott mentioned. It makes sense for the first column in a row
to be a unique identifier (so maybe key?), but perhaps the next columns be
genericName, specificEpithet, infraspecificEpithet. Constructing a column
from the parts seems fine, but also, you could leave it to the user to do
it.
My two cents as well.
…On Wed, Oct 24, 2018 at 2:58 AM Peter Desmet ***@***.***> wrote:
In result sets, I prefer to have as first column some kind of ID. For
occurrences I would expect “occurrenceID” to be the first column.
Constructing names from parts might lead to names that are not wellformed
(e.g. plant names have “var.” in their name)... that is, if we care. I
would expect scientificName.
My 2 cents
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#329 (comment)>, or mute
the thread
<https://github.com/notifications/unsubscribe-auth/ACARS6OPRROLUd1rcWxHSteCeEp-B0grks5uoA-TgaJpZM4X2yXx>
.
--
Kathryn G Turner, PhD
kathryngturner.com
https://profiles.impactstory.org/u/0000-0001-8982-0301
alienplantation.com
|
thanks, good idea to make first column the occurrence id. as it should be unique for each row. constructing a name from parts does open up the possibility for occassional errors and badly formed names as you said Peter. the 2nd column as scientificName makes sense to me |
I have to admit I work most of the time with keys and having a unique key as first column makes sense to me. I like scientific names way more than canonical or composed names (names from parts as you call them) because they are as much unique as possible. Name from parts are less informative and can induce misinterpretations of the returned data. |
thanks @damianooldoni for your feedback. |
…olumns in the dataframe fixed test accordingly for output changes cleaned up tests for each and put assertinos outside of vcr calls bump dev version
okay, pushed changes. considering this done. made the following changes:
|
right now we present in the first column in results from
occ_data
andocc_search
the species name - the value from the fieldspecies
in the occurrence route. For all taxa at species rank and above this is fine.species
name of course makes senseHere is where we select
species
column https://github.com/ropensci/rgbif/blob/master/R/zzz.r#L106-L107 and move it to the frontFor ranks below species, e.g., http://api.gbif.org/v1/occurrence/search?taxonKey=6163845 for Ursus arctos horribilis, the
species
value of Ursus arctos is accurate for it's species rank level name, but the user probably expects to see a subspecific name instead of the species rank name.I don't think we want to go with
scientificName
oracceptedScientificName
as they are complete names with authorities and years, or do we? Does anyone want this name format? Do we stick with what we have now except add in:species
+ the entry forinfraspecificEpithet
any feedback appreciated @kgturner @damianooldoni @MattBlissett @jwhalennds @poldham
The text was updated successfully, but these errors were encountered: