Skip to content

RiverBench/dataset-nanopubs

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

20 Commits
 
 
 
 
 
 
 
 

Repository files navigation

.github/workflows/release.yaml

nanopubs (development version)

Nanopublications are small units of publishable information, used for scientific results and more. This dataset is based on a subset of a dump of all available nanopublications as of April 5, 2018. Only the first 5M of freely-licensed nanopubs were included. Each nanopub consists of several RDF graphs and thus is an RDF dataset. The included data is primarily from the biomedical domain. More information: paper, website.

This README is a snapshot of documentation for the latest development version of the dataset. Full documentation for all versions can be found on the website.

General information

Technical metadata

  • Has stream type usage:
    • RDF stream type usage (1)
    • RDF stream type usage (2)
      • Type: RDF stream type usage (stax:RdfStreamTypeUsage)
      • Comment: The dataset can be viewed as a stream of RDF datasets. Each RDF dataset corresponds to one nanopublication. (en)
      • Has stream type: RDF dataset stream (stax:datasetStream)
  • Has stream element count: 5,000,000
  • Has stream element split:
  • Uses vocabulary:
  • Conforms to W3C RDF 1.1 specification: yes
  • Conforms to W3C RDF-star draft specification as of December 17, 2021: yes
  • Uses generalized triples: no
  • Uses generalized RDF datasets: no
  • Uses RDF-star: no

Distributions

Full stream distribution

Full Jelly distribution

Full flat distribution

1M elements stream distribution

1M elements Jelly distribution

1M elements flat distribution

100K elements stream distribution

100K elements Jelly distribution

100K elements flat distribution

10K elements stream distribution

10K elements Jelly distribution

10K elements flat distribution