Data Release: 2017Q1

Michele Pasin edited this page Nov 9, 2017 · 5 revisions

graph-image

SpringerNature SciGraph

On this page you can find information about the February 2017 data release, now deprecated. For more recent releases please see the SciGraph explorer.

Table of Contents

Getting Started

The Downloads section contains a number of N-Triples files that are meant to be loaded into a triplestore. In order to make full use of the data we recommend to also load the following datasets into the triplestore:

Downloads

Dataset Description Things* Triples File size
Ontologies 42 1,401 12K (245K)
Articles 2016 351,018 34,511,256 828M (7.1G)
2015 335,276 32,147,098 766M (6.6G)
2014 354,006 31,660,946 758M (6.5G)
2013 499,717 29,157,570 688M (6.0G)
2012 305,431 26,399,143 636M (5.5G)
Grants 51,811 1,021,970 48M (342M)
Journals 4,721 94,693 2.0M (18M)
Subjects 2,750 37,777 420K (6.4M)
Total 1,904,772 155,031,854 3.7G (32G)

* Things refers to the primary type for each dataset.

Files are compressed using bzip2.

Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

Models and Mappings

You can also access the models and mappings to external vocabularies in the project source code.

Datasets at a Glance

scigraph-public-release-2017-02-15

Ontologies

This dataset contains the Springer Nature SciGraph Core Ontology which is the main model encoding the semantics of Springer Nature SciGraph data. It is implemented as an OWL 2 ontology, with a DL expressivity ALHIF(D), and consists of 45 classes and 206 properties. The ontology lives within its own SciGraph namespace (http://www.springernature.com/scigraph/ontologies/core/, prefix sg:). Conceptually this work is a logical extension of the previous nature.com core ontology and supersedes that.

Note that this version does not currently include mappings to external models. Such mappings will be released shortly.

Articles

This dataset provides bibliographic metadata for the complete set of Springer Nature journal articles for the past five years (2012-1016). Included also are abstracts, information about authors and affiliations (as strings). Within this dataset there are links to the SciGraph subjects dataset (see below) and to ANZSRC Field of Research codes. Furthermore in some cases affiliation strings have been disambiguated with GRID identifiers.

Grants

This dataset provides metadata about research grants (including abstracts) linked to Springer Nature articles. Research grants are categorized using ANZSRC Field of Research codes. Funding and recipient organizations are also marked with GRID identifiers.

Journals

This dataset provides metadata about the complete set of Springer Nature journals. Note that our model, similarly to BIBFRAME, maintains a fundamental distinction between product level journal info (as expressed by ISSNs) and journal brand level (the more abstract work).

Subjects

This dataset provides our custom-built categorization of scholarly subject areas. It includes subject terms of varying levels of specificity such as Biological sciences (top level), Cancer (level 2), or B-2 cells (level 7). In total there are more than 2500 subject terms, organized into a polyhierarchical tree using the SKOS vocabulary. Note that in this release only articles published on nature.com are indexed with these subjects.

Classes and Properties

Models for the main classes in the datasets are described below.

Class: Affiliation

Field Property Datatype Example
(Type) rdf:type sg:Affiliation sg:Affiliation
(ID) sg:scigraphId xsd:string "ee0bc4b9b663c28602d16fed10b8c867"
.
Name
Name sg:publishedName xsd:string "Org Name"
Order sg:order xsd:integer 1
.
Organization
Organization sg:hasOrganization rdfs:Resource http://www.grid.ac/...

Class: Article

Field Property Datatype Example
(Type) rdf:type sg:Article sg:Article
(ID) sg:scigraphId xsd:string "8ba89261543322df04dfe9aa9cf0b2dc"
.
Identity
DOI sg:doi xsd:string "10.1038/npjschz.2014.6"
DOI Link sg:doiLink rdfs:Resource http://dx.doi.org/...
.
Label
Language (Original) sg:language xsd:string "English"
Title sg:title xsd:string "Title ..."
Abstract sg:abstract xsd:string "Description ..."
Title (Translated) sg:titleTranslated rdf:langString "Title ..."@en
Abstract (Translated) sg:abstractTranslated rdf:langString "Description ..."@en
.
Contributor
Contributor sg:hasContribution sg:Contribution contributions:...
.
Publisher
Year sg:publicationYear xsd:gYear "2014"^^ xsd:gYear
Year Month sg:publicationYearMonth xsd:gYearMonth "2014-07"^^ xsd:gYearMonth
Date sg:publicationDate xsd:date "2014-07-21"^^xsd:date
Webpage sg:webpage rdfs:Resource http://...
.
Description
FOR Code sg:hasFieldOfResearchCode rdfs:Resource http://purl.org/au-research/vocabulary/anzsrc-for/...
Subject sg:hasSubject sg:Subject subjects:...
.
Source
Journal sg:hasJournal sg:Journal journals:...
Volume sg:volume xsd:string "12"
Issue sg:issue xsd:string "134"

Class: Contribution

Field Property Datatype Example
(Type) rdf:type sg:Contribution sg:Contribution
(ID) sg:scigraphId xsd:string "98b6a65500b47022e1f5a8b1b9d6cc68"
.
Name
Family Name sg:publishedFamilyName xsd:string "Family Name"
Given Name sg:publishedGivenName xsd:string "Given Name"
Name sg:publishedName xsd:string "Full Name"
.
Other
Corresponding? sg:isCorresponding xsd:boolean true
Order sg:order xsd:integer 1
Role sg:role xsd:string "editor"
.
Affiliation
Affiliation sg:hasAffiliation sg:Affiliation affiliations:...

Class: Grant

Field Property Datatype Example
(Type) rdf:type sg:Grant sg:Grant
(ID) sg:scigraphId xsd:string "81dc9bdb52d04dc20036dbd8313ed055"
.
Label
Language (Original) sg:language xsd:string "English"
Title sg:title xsd:string "Title ..."
Abstract sg:abstract xsd:string "Description ..."
Title (Translated) sg:titleTranslated rdf:langString "Title ..."@en
Abstract (Translated) sg:abstractTranslated rdf:langString "Description ..."@en
.
Contributor
PI sg:hasContribution sg:Contribution contributors:...
.
Funder
Funding Amount sg:fundingAmount xsd:decimal 13021970.0
Funding Currency sg:fundingCurrency xsd:string "USD"
Funder sg:hasFundingOrganization rdfs:Resource http://grid.ac/...
Recipient sg:hasRecipientOrganization rdfs:Resource http://grid.ac/...
Start Year sg:startYear xsd:gYear "2007"^^xsd:gYear
End Year sg:endYear xsd:gYear "2014"^^xsd:gYear
License sg:license xsd:string "..."
Web Page sg:webpage rdfs:Resource http://...
.
Description
FOR Code sg:hasFieldOfResearchCode rdfs:Resource http://purl.org/au-research/...
.
Publication
Publication sg:hasFundedPublication sg:Article articles:...

Class: Journal

Field Property Datatype Example
(Type) rdf:type sg:Journal sg:Journal
(ID) sg:scigraphId xsd:string "d41d8cd98f00b204e9800998ecf8427e"
.
Identity
ISSN sg:issn xsd:string "1234-5678"
DOI sg:doi xsd:string "10.1234/..."
.
Journal Brand
Journal Brand sg:hasJournalBrand sg:JournalBrand journal-brands:...
.
Format
Medium sg:medium xsd:string "Electronic"
.
Access
Active? sg:isActivePublication xsd:boolean true
.
History
Historical? sg:isHistoricalJournal xsd:boolean true

Class: JournalBrand

Field Property Datatype Example
(Type) rdf:type sg:JournalBrand sg:JournalBrand
(ID) sg:scigraphId xsd:string "d41d8cd98f00b204e9800998ecf8427e"
.
Label
Language sg:language xsd:string "English"
Title sg:title xsd: string "Title"
Short Title sg:shortTitle xsd:string "Short Title"
Subtitle sg:subtitle xsd: string "Subtitle"
.
Publisher
Imprint sg:imprint xsd: string "Imprint"
Publisher sg:publisher xsd:string "Publisher"
Homepage sg:webpage rdfs:Resource http://...
Copyright sg:copyrightHolder xsd:string "..."
.
Coverage
Title Added Year sg:titleAddedYear xsd:gYear "2005"^^xsd:gYear
Start Year sg:startYear xsd:gYear "1981"^^xsd:gYear
End Year sg:endYear xsd:gYear "1996"^^xsd:gYear
Start Volume sg:volumeStart xsd:string "1"
End Volume sg:volumeEnd xsd:string "43"
Total Volumes sg:volumeCount xsd:integer 43
.
Access
OpenAccess sg:openAccess xsd: string "Fully Open Access"

Class: Subject

Field Property* Datatype Example
(Type) rdf:type sg:Subject sg:Subject
rdf:type skos:Concept skos:Concept
(ID) sg:id xsd:string "biological-sciences"
.
Related sg:related sg:Subject subjects:...
Replaced By sg:replacedBy sg:Subject subjects:...
Replaces sg:replaces sg:Subject subjects:...
Webpage sg:webpage rdfs:Resource http://www.nature.com/subjects/...
.
.
Label
Label rdfs:label xsd:string "..."
Alt Label skos:altLabel rdf:langString "..."@en
Definition skos:definition rdf:langString "..."@en
Pref Label skos:prefLabel rdf:langString "..."@en
.
Structure
Broader skos:broader skos:Concept subjects:...
Narrower skos:narrower skos:Concept subjects:...
In Scheme skos:inScheme skos:ConceptScheme subjects:
Top Concept Of skos:topConceptOf skos:ConceptScheme subjects:
.
Note
Note skos:note xsd:string "..."

Further Info

This section contains links to tutorials, learning materials and other resources related to SciGraph.

  • The mappings folder contains an initial set of semantic links to other ontologies and datasets. We are continuously improving it, but feel free to submit a pull request if you think you could help!
  • The blog post Exploring SciGraph data using JSON-LD, Elastic Search and Kibana provides a step by step tutorial on how to download these datasets and start interrogating them using free-download software.
  • The shapes folder contains examples of the SHACL shapes we used for data extraction and validation.
Clone this wiki locally
You can’t perform that action at this time.
You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session.
Press h to open a hovercard with more details.