Resolver to resolve material sample ids separately to the occurrence ids #19

rukayaj · 2022-01-17T12:06:36Z

We have a problem with the way we're publishing data currently: it's not possible to separately identify material samples vs occurrences for most of our records.

I don't think the resolver should issue identifiers to the data records, I think that we should be publishing the identifiers and the resolver should be resolving them.

We do have separate material sample IDs for DNA datasets from Corema, so step 1 could be to make the resolver resolve those separately.

dagendresen · 2022-01-17T12:30:46Z

(1) I think that we CAN unambiguously publish (real) Occurrences separately from voucher specimens and tissue samples -- using the IPT and DwC-A by using basisOfRecord. (However, I also think that the GBIF data portal and Artskart do not present these appropriately/correctly).

(2) Agree! The resolver should not issue PIDs, only resolve them.

(3) Agree! The MaterialSamples with materialSampleIDs should be resolved by separate endpoints from the corresponding Occurrences they are linked to. The respective occurrenceID should here be an attribute of the MaterialSample endpoint metadata ...

dagendresen · 2022-01-17T12:50:33Z

I think we should also extract and resolve organismIDs, eventIDs, taxonIDs, etc (when these IDs are following a reasonable name string syntax that we can trust will be persistent ... TODO: decide of a test for the PID name syntax)

Notice also that there exists nowhere yet, for the Norwegian GBIF-datasets, except from the resolver we are building, any end-point (machine-readable or not) for occurrenceID, materialSampleID, organismID, ... etc.

(The global GBIF portal sort of almost provides something that resembles an end-point for data-records, but obviously not for any of the other real-life object classes ...).

The envisioned workflow is for the data publisher to mint (create) a persistent identifier - for their MaterialSamples Occurrences, etc., and add these to their DwCA datasets. The envisioned agreement is next for GBIF-Norway to create the end-point for these publisher-provided persistent identifiers. Currently we support urn:uuid:UUID type identifiers (including the PURL form), but exploring other (more robust?) identifier types such as Handles and DOIs would be perfect.

I think that establishing these end-points is the important rationale for the resolver :-)

rukayaj added the resolver-annotater label Apr 11, 2022

rukayaj transferred this issue from gbif-norway/helpdesk May 13, 2022

rukayaj mentioned this issue Jun 20, 2022

Duplicate occurrenceID into materialSampleID for collections gbif-norway/helpdesk#29

Open

rukayaj mentioned this issue Jul 25, 2022

Resolve materialsampleids #27

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Resolver to resolve material sample ids separately to the occurrence ids #19

Resolver to resolve material sample ids separately to the occurrence ids #19

rukayaj commented Jan 17, 2022 •

edited

Loading

dagendresen commented Jan 17, 2022

dagendresen commented Jan 17, 2022 •

edited

Loading

Resolver to resolve material sample ids separately to the occurrence ids #19

Resolver to resolve material sample ids separately to the occurrence ids #19

Comments

rukayaj commented Jan 17, 2022 • edited Loading

dagendresen commented Jan 17, 2022

dagendresen commented Jan 17, 2022 • edited Loading

rukayaj commented Jan 17, 2022 •

edited

Loading

dagendresen commented Jan 17, 2022 •

edited

Loading