Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Write taxon IDs to imported/exceptions files #66

Closed
rjchallis opened this issue Apr 7, 2021 · 1 comment
Closed

Write taxon IDs to imported/exceptions files #66

rjchallis opened this issue Apr 7, 2021 · 1 comment
Assignees

Comments

@rjchallis
Copy link
Contributor

Storing a list of taxon_id/scientific_name matches alongside spellchecked or imported taxa will help speed up importing of updated files by creating lists that can be placed in a names directory to remove the need for taxon_id lookups.

@rjchallis rjchallis self-assigned this Apr 7, 2021
@rjchallis
Copy link
Contributor Author

exceptions spellcheck files now have 4 columns (plus extra columns to the right if multiple suggestions are found for a taxon):

exceptions/example.spellcheck.tsv

taxon_id	input	rank	suggested
190659	Actitis macularius	species	Actitis macularia
297805	Chlidonias hybrida	species	Chlidonias hybridus
54026	Thalassarche melanophris	species	Thalassarche melanophrys
216219	Sylvia ruppeli	species	Sylvia rueppelli

imported taxon ID files have 3 columns:

imported/example.taxon_ids.tsv

taxon_id	input	rank
100830	Tetrao urogallus	species
1233216	Lyrurus tetrix	species
64668	Lagopus muta	species
52650	Lagopus lagopus	species

These files do not contain rank information so name collisions in the taxon_ids file are automatically commented out.

These files can be copied and pasted into a names subdirectory with the same name as the original input file to allow them to be used as taxon_id lookup tables. Commented rows will be ignored.

names/example.tsv

taxon_id	input	rank	suggested
190659	Actitis macularius	species	Actitis macularia
297805	Chlidonias hybrida	species	Chlidonias hybridus
54026	Thalassarche melanophris	species	Thalassarche melanophrys
216219	Sylvia ruppeli	species	Sylvia rueppelli
100830	Tetrao urogallus	species
1233216	Lyrurus tetrix	species
64668	Lagopus muta	species
52650	Lagopus lagopus	species

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant