-
Notifications
You must be signed in to change notification settings - Fork 12
Publishing data
Once you are happy with the Darwin Core data you cooked together, you are ready to get them published. 👏 The easiest way to do so is by using the Integrated Publishing Toolkit (IPT).
If you don't have an IPT at your institution, navigate the map of existing IPT installations and contact the person administering one near you. Many IPT installations will host datasets for other institutions.
If you haven't used the IPT before, we advice to watch the introduction video or browse the IPT documentation. Once you know your way around, here is how to proceed:
From Manage Resources
create a new resource (= a checklist dataset in your case). Use your my_dataset_shortname
as shortname and choose Type: Checklist
. Don't upload any data yet.
Skip this step completely if you have created your checklist dataset on the IPT before and just want to update the data.
- Go to the overview page of your resource (it will look like this).
- Under Source data:
- Choose
URL
and provide the raw GitHub URL to your Darwin Core data file, e.g.https://raw.githubusercontent.com/trias-project/checklist-recipe/main/data/processed/taxon.csv
. The Source Name will be taken automatically from the name of your file. - Or, choose
File
and upload the Darwin Core data file from your computer. ClickYes
to overwrite a previously uploaded (and now outdated) file if applicable.
-
Save
the linked or uploaded file.
Repeat the process for any of the extension file(s) you have on your computer.
- Under Darwin Core Mapping select
Darwin Core Taxon
>Add
. - Select
taxon
from the dropdown menu. - Since your data is already mapped to Darwin Core, the IPT will automap all columns for you. 👍
- Inspect and click
Save
andBack
.
Repeat the process for any of the extensions you have uploaded, but make sure you select the correct extension (step 1) and file (step 2).
Note: if you overwrote existing source files on the IPT with more recent ones on your computer, be sure to verify your mapping again. E.g. a newly added column in your Darwin Core data will brake the mapping. Often it is easiest to click Edit
next to Darwin Core Taxon and then click Delete
to remove your whole mapping (including extensions) and start over. Since all of the mapping is automapping, redoing the mapping in the IPT shouldn't take long.
Under Metadata click Edit
to describe your dataset. See this section of the IPT manual for more information on metadata.
Note: with all the effort you put into documenting your mapping process, it is a good idea to refer to your repository (which contains your R Markdown file) in the Sampling Methods
section, ideally as step descriptions. These will end up in the Methodology
section of your dataset on GBIF. See for example the Methodology section of this checklist which followed the checklist recipe.
To publish your dataset:
- Under Visibility click
Public
. - Under Published versions click
Publish
. You won't be able to do so as long as your required metadata is not complete. - If this is the first time you published your dataset, register it with GBIF under Visibility
Register
.
For more information on publishing data using the IPT, see this section of the IPT manual.
Congratulations on publishing your checklist data in an open and repeatable way (and making it this far in the wiki)! 🎉
- Home
- Getting started
- Basics
- Ingredients: Source data
- Instructions: R Markdown
- Utensils: Tidyverse functions
- Dinner: Darwin Core data
- Mapping script
- Data preparation
- Mapping
- GitHub
- Publishing data
- Examples