Skip to content

FAIRification Strategy

Mark Wilkinson edited this page Apr 6, 2020 · 1 revision

Proposal

  1. We initially focus on metadata capture - allowing the discovery of datasets in their native state.
  • for each dataset being used within the hackathon, we need:
    • citation
    • license
    • topic-list (this should be captured as ontological information where possible; if not, record it as "missing" and inform the Ontology working group)
  1. We will use the RO Crates model (https://researchobject.github.io/ro-crate/) (https://researchobject.github.io/ro-crate/1.0/)

  2. Initially, we will generate the RO Crates as zip files, with the RDF metadata file and data files contained in the Crate; in addition, the same metadata file should be left outside the Crate.

  1. I will push the crate, and its metadata file, into the LDP server (https://w3id.org/FAIR_COVID19/DAV/coronavirus/ro-crates/)

  2. The LDP server will index the metadata file, allowing the Crate to be discovered by SPARQL.

  3. I am also going to work on an LDP-native Crate structure during the hackathon, as an extension to the RO Crates spec.

Clone this wiki locally