Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Only include species #35

Closed
peterdesmet opened this issue Feb 13, 2019 · 8 comments
Closed

Only include species #35

peterdesmet opened this issue Feb 13, 2019 · 8 comments

Comments

@peterdesmet
Copy link
Member

The checklist currently contains:

taxonRank taxa
GENUS 11
SPECIES 2399
SUBSPECIES 101
VARIETY 18

For a unified checklist, I think it makes sense to aggregate this information on SPECIES only, because:

  • This is likely the taxonomic unit used for reporting
  • This is likely the taxonomic unit used for analyses
  • This makes the unified checklist easier to work with (e.g. for querying occurrences)
  • This would be a documented step in the processing
  • The original subspecies ranks would still be present in the original checklists, so it is traceable

This would only affect 5% of the taxa (i.e. the non-SPECIES):

  • Lumping SUBSPECIES and VARIETY under their SPECIES: this can reduce the number of taxa if more than one SUBSPECIES or VARIETY is listed for a SPECIES
  • Dropping GENUS for which no species is listed on a checklist (8 taxa)

I've created a spreadsheet of the taxa that are affected.

@timadriaens @SoVDH @qgroom would you be OK with this choice?

@timadriaens
Copy link
Member

It is probably mostly relevant for plants as is clear from the spreadsheet, where cultivars/subspecies can differ in invasiveness.

As we discussed yesterday, I am ok with that provided it is performed in the flow to the unified (so kept in the original checklist) and it is documented. Genus level PRA's are indeed not acceptable in practice.

I checked all non-manual issues:

  • no problem to lump the Trachemis scripta and Gramtemys pseudogeographica subspecies.
  • For Leiobunum, I think the genus is in because of L. sp. A which does occur in Belgium. As it is not yet described, it will be difficult to map it onto the backbone...
  • no problem for the spider genera, they all have a representative species

@peterdesmet
Copy link
Member Author

Thanks Tim! Ludwigia (peploides), Olios (argelasius), Pholcus (opilionoides) have representative species, but not all (spider) genera. I created an issue for this in the ad hoc list: trias-project/ad-hoc-checklist#33

@SoVDH
Copy link

SoVDH commented Feb 14, 2019

Thinking about it ...
Am wondering whether it is better to aggregating variety/cultivar/subspecies to species level or rather discarding them from the unified checklist if you have a problem with keeping sub-species levels.
The reason for having sub-species levels for plants is that not all varieties behave the same way. This means if you aggregate up to species level, you mean all other potential varieties and subspecies are also present and/or established, which is sometimes wrong or sometimes unknown.
So, personally, I would either discard them for now from he unified checklist or keep them as sub-species but NOT aggregating to species level.

This is a feedback from Quentin and Sonia from Zagreb

@peterdesmet
Copy link
Member Author

Leaving subspecies in will be challenging, because the ideal way to search or filter for occurrences on GBIF is by speciesKey: that key includes all sublevels and all synonyms for a taxon, which is what we want. Example: Reynoutria japonica: https://www.gbif.org/species/2889173 also looks for 42 (!) synonyms and 2 varieties. It's very hard to do this otherwise.

Not aggregating them is a hard choice too, because it means they will never be considered for any analyses or emerging species, except those where the SPECIES is also listed in one of the source checklists (e.g. both Panicum miliaceum L. and Panicum miliaceum subsp. miliaceum are listed). It also throws away subspecies for animals.

I suggest we add SUBSPECIES to the list of taxa to verify, with the suggestion to lump under the species. There we can - for each taxon - decide if we want to lump them or not. By default (i.e. unverified) they will not be included. It then becomes a deliberate choice and we are aware of what we are throwing away.

Would that be OK?

@peterdesmet
Copy link
Member Author

It basically boils down to: how big is the net you want to use for searching occurrences. In the result set you will still be able to see what subspecies/synonyms are included.

@qgroom
Copy link

qgroom commented Feb 16, 2019

It's a bit more than that. At least for plants we will get completely the wrong results if we model using the species data, rather than the subspecies data.

All these have alien supspecies and common native subspecies. If we select at the species level most of the records will be for the native taxon.

Armeria maritima subsp. elongata
Equisetum hyemale
Cornus sanguinea subsp. australis
Reseda lutea subsp. vivantii
Hippophae rhamnoides subsp. caucasica Rousi
Sedum telephium subsp. maximum
Matricaria recutita var. coronata
Phalaris arundinacea var. picta L.
Aira caryophyllea subsp. plesiantha (Jord.) K.Richt.
Pastinaca sativa subsp. urens (Req. ex Godr.) Čelak.
Lamium galeobdolon subsp. argentatum (Smejkal) J.Duvigneaud

and many more I suspect

@damianooldoni
Copy link
Contributor

So, I think we solved this. The answer is: no, we need infraspecific ones in unified checklist. Issue to be closed?

@peterdesmet
Copy link
Member Author

Yes, thanks for cleaning up issues @damianooldoni.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Development

No branches or pull requests

5 participants