Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Create classifier for Costa Rica moths #28

Closed
JennaLawson opened this issue Sep 25, 2023 · 12 comments
Closed

Create classifier for Costa Rica moths #28

JennaLawson opened this issue Sep 25, 2023 · 12 comments
Assignees

Comments

@JennaLawson
Copy link

Provisional list for Costa Rica Lepidoptera
Lepidoptera of Costa Rica.xlsx

@LevanBokeria
Copy link
Contributor

Hi @JennaLawson and @DavidRoy. Just want to very quickly confirm this:

  • there are two rowz where "Subspecies" entry is corrupted I think. Row 7920 has "zyzyxoxyx" and row 4476 says "(aggr.). I will remove these Subspecies entries. Image
  • Whenever subspecies are specified, we should download images only for those subspecies, right? And not the whole species?

@DavidRoy
Copy link
Member

@LevanBokeria I recommend ignoring the subspecies and just using the list of names at the species level

@LevanBokeria
Copy link
Contributor

Thanks @DavidRoy.

I've uploaded the google sheet for Costa Rica list in this folder.

There are quite a few missing species on GBIF unfortunately. Please fine them by filtering the "status" column for "NotAvail".
Also, please check the fuzzy matches by filter the "match_type" column for "FUZZY".

I will meanwhile download images for thos species that are available on GBIF.

@LevanBokeria LevanBokeria moved this from 🥚 Todo to 🐛 In Progress in AMBER Oct 23, 2023
@JennaLawson
Copy link
Author

JennaLawson commented Oct 23, 2023 via email

@LevanBokeria
Copy link
Contributor

Hi @JennaLawson

You mean if they add images to GBIF, correct? I think it should be straightforward to download new images, yes. If they do upload anything, let us know and we'll download them (or I suppose @KatrionaGoldmann will :) )

@LevanBokeria
Copy link
Contributor

By the way, @JennaLawson GBIF has this neat online tool you can use to check for species in the GBIF backbone:
https://www.gbif.org/tools/species-lookup

Our code (adapted from Rolnick) just uses the API version of this. But if you yourself would like to dynamically check any given species checklist, that page is quite useful. It will highlight missing species, and even give you multiple alternative match options for species that couldn't be matched. Its quite good for exploration.

@JennaLawson
Copy link
Author

JennaLawson commented Oct 24, 2023 via email

@LevanBokeria LevanBokeria moved this from 🐛 In Progress to 🥚 Todo in AMBER Oct 30, 2023
@LevanBokeria
Copy link
Contributor

Downloading of images is paused until a potential taxonomic inconsistency is resolved for some of the species. See issue

@LevanBokeria
Copy link
Contributor

@JennaLawson I forgot to mention, that when checking the species in the google sheet list, please also check for match_type = HIGHERRANK. There are 3 such rows, which indicate that the exact match was not found on GBIF, and the API returned a GENUS match instead.

@JennaLawson
Copy link
Author

@LevanBokeria apologies but the list sent before had some butterflies in it. I have removed the butterfly families and am attaching here a list with just moth species.
Moths of Costa Rica_wo_butterflies .csv

@KatrionaGoldmann KatrionaGoldmann moved this from 🥚 Todo to 🐛 In Progress in AMBER Nov 24, 2023
@KatrionaGoldmann KatrionaGoldmann moved this from 🐛 In Progress to 🦋 Done in AMBER Mar 12, 2024
@KatrionaGoldmann
Copy link
Member

Completed in costarica_model.sh . Closing issue.

@mihow
Copy link

mihow commented Sep 24, 2024

Thanks @KatrionaGoldmann, do you have a version of this CSV with a column for the GBIF key that you matched do? The GBIF key you used to fetch training images for each species.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Status: 🦋 Done
Development

No branches or pull requests

5 participants