http://www.tdwg.org

Beginner's Guide to RDF

Date: (Created) 17 February 2012; (Last modified) 19 Feb 2014

Document Status: Not part of any standard (Type 3 document)

Permanent URL: http://code.google.com/p/tdwg-rdf/wiki/Beginners

TDWG Task Group: TDWG RDF/OWL Task Group

Contributors: Steve Baskauf (TDWG RDF/OWL Task Group)

Abstract: This document provides concise information about topics related to RDF and OWL in the context of the biodiversity informatics community. It is intended as an introduction for persons who are not already familiar with RDF and OWL and as a reference for persons who are familiar but would like organized access to additional reference material.

Licensed under Creative Commons Attribution 3.0 License

Table of Contents:

0. Introduction (this page)

1. Resources and URIs

2. What is RDF and what is it for?

3. RDF basics

4. Vocabularies, DCAM, and RDFS

5. Discovery, Transmission, and Storage of RDF data

6. Querying with SPARQL

7. Ontologies and OWL


Beginner's guide to RDF: 0. Introduction

Contents

0.1. Purpose

The purpose of this guide is to provide concise information about topics related to RDF and OWL which are significant to the work of the TDWG RDF/OWL Task Group (RDF/OWL TG) in particular and to the biodiversity informatics community in general. It is intended as an introduction for persons who are not already familiar with RDF and OWL and as a reference for persons who are familiar but would like organized access to additional reference material. The information is intended to be correct, but because this is an introduction, technical jargon and formal language have been kept to a minimum. For those who are interested in normative descriptions of the concepts and technologies discussed here, footnotes provide hyperlinks to more detailed information.

Note also that in almost every instance, the URIs used in the examples are "real" URIs from the wild (no fake "example.org" URIs), which in most cases will actually dereference (at least in the case of HTTP URIs and LSIDs). This does not imply that there is anything special about those URIs or that the examples they provide are a model of any sort; rather, it gives the reader an opportunity to try out the tools that have been provided for validating, describing, browsing, etc.

0.2. Format

The guide is laid out into sections, each of which deals with a major issue relevant to the work of the RDF/OWL TG. This allows users who wish to access the guide as a reference to jump quickly to the point of interest. However, the later sections presuppose knowledge of concepts discussed in earlier sections, so a beginner may have difficulty starting at a later section.

Within each section, subsections are laid out hierarchically and linked in a Table of Contents at the top of the section page. At the end of each section is a numbered reference list, which is linked from the Table of Contents as well as from the citations in the text.

A beginner who wants to gain a basic understanding of RDF and OWL should start with section 1 and work through the guide sequentially. However, it is not intended that this guide be a tutorial capable of enabling a beginner to write RDF or OWL. Consulting additional references will undoubtedly be required.

0.3. Significant documents, tools, and RDF implementations

The mention of any document or tool in this guide does not imply the endorsement of TDWG or the RDF/OWL TG. (The exception is ratified TDWG standards, which are official recommendations of TDWG.) The items referenced below are simply ones which have been found to be helpful. The lists are not exhaustive; resources may be added when discovered or removed if they cease to be available.

An important general reference is Architecture of the World Wide Web, a W3C Recommendation.

The document Linked Data: Evolving the Web into a Global Data Space provides a general overview of Linked Data issues and implementation strategies.

See also a list of design issues intended to avoid repetition of discussions once resolved.

The Pedantic Web group is concerned with the quality of RDF data and has created a list of Frequently Observed Problems on the Web of Data.

Information Standards Quarterly had a special issue (Vol. 24, No. 2/3, doi:10.3789/isqv24n2-3.2012) devoted to Linked Data in Libraries, Archives, and Museums.

A PowerPoint presentation for an RDF Primer session at TDWG 2013 is here. The background video for the session is here.

0.3.1. Related to Uniform Resource Identifier (URI) and GUID/Persistent Identifier

RFC 3986. Uniform Resource Identifier (URI): Generic Syntax (section 1.2.2. defining "dereference", "resolution", and "representation")

RFC 3987. Internationalized Resource Identifiers (IRIs)

Compact URI (CURIE) Syntax 1.0 W3C Working Group Note

Cool URIs for the Semantic Web

RFC 2616. Hypertext Transfer Protocol -- HTTP/1.1 (section 1.3 Terminology including definition of "resource" and "representation") (section 12 on Content Negotiation)

Apache mod_rewrite (URL rewriting)

Best Practice Recipes for Publishing RDF Vocabularies (content negotiation and URI dereferencing)

GUID Applicability Statement - Current TDWG Standard (click here for PDF viewable via browser)

A Beginner's Guide to Persistent Identifiers published by the Global Biodiversity Information Facility (GBIF)

Integrated Digital Biocollections (iDigBio) GUID Statement

Life Sciences Identifiers Applicability Statement - Current TDWG Standard (click here for PDF viewable via browser)

LSID Best Practices

LSID Specification

Jones et al. 2011. Identifying and relating biological concepts in the Catalogue of Life. Journal of Biomedical Semantics 2011,2:7. doi:10.1186/2041-1480-2-7

DOIs as HTTP URIs

To locate DOIs for titles (books, monographs, or whole journal runs) in the Biodiversity Heritage Library (BHL), locate the title through searching, then access the URL

http://www.biodiversitylibrary.org/bibliography/[title_number]

or

http://www.biodiversitylibrary.org/title/[title_number]

where [title_number] is the BHL-assigned identifying number for the title. Currently, BHL does not assign DOIs to individual articles unless they happen to correspond to a monograph.

DOIs can be exported en masse through BHL's data export at http://biodivlib.wikispaces.com/Data+Exports and are exposed through the BHL API and OpenURL resolver at http://biodivlib.wikispaces.com/Developer+Tools+and+API.

Thanks to Rod Page and Chris Freeland for this information.
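The two BHL URL patterns described above can be sketched in a few lines of Python. This is only an illustration of the string patterns given in the text: the function name `bhl_title_urls` and the title number `12345` are hypothetical, and the sketch does not contact BHL or check that a title with that number exists.

```python
# Minimal sketch: build the two BHL title URLs described in the text.
# The function name and the example title number are hypothetical.

BHL_BASE = "http://www.biodiversitylibrary.org"


def bhl_title_urls(title_number):
    """Return the 'bibliography' and 'title' URLs for a BHL title number."""
    number = int(title_number)  # raises ValueError for non-numeric input
    if number <= 0:
        raise ValueError("BHL title numbers are assumed to be positive integers")
    return [
        f"{BHL_BASE}/bibliography/{number}",
        f"{BHL_BASE}/title/{number}",
    ]


if __name__ == "__main__":
    for url in bhl_title_urls(12345):  # 12345 is a placeholder, not a real lookup
        print(url)
```

Either URL can then be opened in a browser to look for a DOI assigned to that title.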

An example of DOIs used with specimens

0.3.2. Related to the Resource Description Framework (RDF) model

RDF Primer (intended for beginners)

RDF Concepts and Abstract Syntax (normative description of namespace, datatypes, and abstract syntax)

RDF/XML Syntax Specification (normative)

Terse RDF Triple Language (Turtle)

RDFa Core 1.1

JSON-LD JSON-based Serialization for Linked Data

RDF Semantics (normative)

RDF Vocabulary Description Language 1.0: RDF Schema (RDFS)

RSS specification (view source to see RDF)

SPARQL 1.1 Query Language (communication protocol)

Page 2006 "Taxonomic names, metadata, and the Semantic Web" (modeling taxonomic metadata relationships in RDF)

W3C Interest Group note: Mapping and linking life science data using RDF (Thanks to Hilmar Lapp for providing the link.)

Early draft of expressing Darwin Core taxonomies in SKOS (Thomas Bandholtz)

Open Annotation Core Data Model (for creating associations between distant pieces of information)

Proposal for linking metadata to the web pages they describe

Information about the draft Shape Expressions language for validating RDF: Shape Expressions primer; Shape Expressions definition

0.3.3. Related to the Dublin Core Metadata Initiative (DCMI) Abstract Model

DCMI Abstract Model (DCAM) specification (Relationship between DCAM and RDFS)

Representing DCAM constructs using the RDF Model

Notes on DCMI specifications for Dublin Core metadata in RDF

Domains and Ranges for DCMI Properties

Darwin Core abstract model (vs. DCAM)

0.3.4. Related to Web Ontology Language (OWL)

OWL 2 Web Ontology Language Primer (for beginners)

OWL 2 Web Ontology Language Document Overview

OWL 2 Web Ontology Language RDF-Based Semantics

A Practical Guide To Building OWL Ontologies Using Protégé 4 and CO-ODE Tools

Allemang and Hendler Semantic Web for the Working Ontologist, Second Edition: Effective Modeling in RDFS and OWL (not available online)

Hogan et al. 2011 Scalable OWL 2 Reasoning for Linked Data

Link to Aidan Hogan's Ph.D. thesis "Exploiting RDFS and OWL for Integrating Heterogeneous, Large-Scale, Linked Data Corpora" and other OWL and RDF related links

Ontobee web server for ontologies. It can be set up with PURL redirection to dereference ontology terms. See this email for further information.

Some rule-based extensions to OWL/SPARQL/RDF

SWRL submission to W3C (stalled?)

SPIN submission to W3C (SPARQL-based)

W3C Rule Interchange Format (RIF) FAQ page

0.3.5. Software tools

rdfEditor (syntax highlighting, validates and converts)

jEdit (plugin for XML will validate, no RDF plugin)

Protégé ontology editor (guide)

SWOOP tool for creating, editing, and debugging OWL ontologies

Pellet OWL Reasoner (Pellet tutorial)

ELK reasoner (supports the OWL 2 EL profile)

See also http://www.w3.org/RDF/ for a listing of tools relevant to RDF

mx - a Ruby-based platform that consists of a Ruby on Rails application and various supporting gems/libraries developed as a collaborative web-based content management system for biodiversity informatics.

0.3.6. Web interfaces

Vapour Linked Data Validator for examining the dereferencing process for a URI

URI Debugger to dereference URIs and visualize the HTTP response of the server

Rod Page's LSID tester

TDWG LSID resolver

Rod Page's LSID resolver

Rod Page's OpenURL Resolver, which can be used to proxy some identifiers as HTTP URIs which return RDF. Paper: "bioGUID: resolving, discovering, and minting identifiers for biodiversity informatics" doi:10.1186/1471-2105-10-S14-S5

W3C RDF Validation Service (includes graphical display)

rdfabout.com RDF Validator and Converter (converts from XML to N3 and vice versa)

RSS validator (also XML validation)

Manchester OWL Validator

WonderWeb OWL Ontology Validator

Mindswap OWL Consistency Checker

Live OWL Documentation Environment generates human-readable versions of OWL ontologies (see this documentation)

Rod Page's "Status of Biodiversity Services" checker (takes a long time to run so be patient)

Marbles Linked Data browser

OpenLink Data Explorer Linked Data Browser

Zitgist Linked Data viewer

Disco Hyperdata Browser

Semantic Information Mashup (sig.ma) using Sindice's semantic web index

URIBurner test basic SPARQL endpoint

URIBurner enhanced SPARQL endpoint

0.3.7. Biodiversity-related and General vocabularies and ontologies

Audubon Core (AC) draft TDWG standard (draft term list)

Australian Plant Name Index (APNI) vocabulary (in RDF)

Bibliographic Ontology Specification (BIBO) (in RDF/N3)

Content-in-RDF10 (Representing Content in RDF 1.0 vocabulary)

Creative Commons Rights Expression Language (CC REL) (in RDF)

Darwin Core (DwC) TDWG Standard (homepage) (term quick reference guide) (term definitions in RDF) (type vocabulary) (type definitions in RDF)

darwin-sw (DSW) ontology (in RDF)

Dublin Core (DC) terms (in RDF) (type vocabulary) (in RDF)

FOAF Vocabulary Specification 0.98 (in RDF)

Basic Geo WGS84 lat/long (GEO) Vocabulary (in RDF)

GeoSciML Geologic Timescale model Ontology (in RDF)

Library of Congress Authorities and Vocabularies: ISO639 languages, MARC countries and geographic areas (Use view page source: ISO639, MARC geographic areas, etc. in RDF)

Linked Open Data of Ecology (LODE) vocabulary (Ecoinformatics Working Group of the TFRI)

MARC (Library of Congress) Relators list (terms are subPropertyOf dc:contributor and possibly other dc:terms); see also MARC Relator terms and Dublin Core

Natural Collections Descriptions (NCD) draft TDWG Standard (PDF non-normative document viewable in browser) (PDF normative document viewable in browser) (Collection in RDF) (Institution in RDF) (InstitutionType in RDF) (ContactDetails in RDF)

Open Annotation Core Data Model

OBO Foundry (http://www.obofoundry.org/) ontologies

PROV (Provenance) Model primer

PAV (Provenance, Authoring and Versioning) Ontology (a lightweight ontology that is a refinement of PROV)

Publishing Requirements for Industry Standard Metadata (PRISM) (RDF???)

SKOS Simple Knowledge Organization System Namespace Document (in RDF)

Taxon Meta-Ontology (TaxMeOn). Associated paper: "Making species checklists understandable to machines - a shift from relational databases to ontologies" http://dx.doi.org/10.1186/2041-1480-5-40

Taxonomic Concept Transfer Schema (TCS) (TDWG Standard) (Users Guide PDF viewable in browser) (XML schema details PDF viewable in browser) (TaxonConcept in RDF) (TaxonName in RDF) (Common in RDF). See also the Journal of Biomedical Semantics 2:7 article on the application of LSIDs and TCS at Catalogue of Life.

TaxonConcept Ontologies (in RDF)

TDWG Ontology (human readable) (in RDF)

vCard (in RDF)

VoID vocabulary (see also)

0.3.8. Biodiversity-related RDF in the wild

Atlas of Living Australia (ALA) National Species List (NSL) Linked Data Services; uses DC, TCS, APNI (example resource)

Biodiversity Collections Index; uses DC, vCard, and NCD (example resource)

Bioimages; uses DC, DwC, TCS, FOAF, AC, DSW, and others (example resource) (example resource)

DOIs dereferenced as RDF; uses DC, BIBO, PRISM (example resource URI is http://dx.doi.org/10.1525/auk.2009.09022 but RDF can't be viewed in a browser directly)

International Chronostratigraphic Chart (2012) (Example resource as RDF: Middle Jurassic; URI=http://resource.geosciml.org/classifier/ics/ischart/MiddleJurassic)

Linked Open Data of Ecology; uses DwC, FOAF, GEO, LODE (example resource)

TaxonConcept; uses DC, FOAF, BIBO, TaxonConcept, SKOS, and others (example resource)

uBio; uses DC and locally defined predicates (example resource)

Virtual International Authority File (VIAF); URIs for persons (example resource: http://viaf.org/viaf/27063124)
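As the DOI entry in the list above notes, the RDF behind a DOI can't be viewed directly in a browser; a client has to ask for it through HTTP content negotiation. The following is a minimal sketch using only the Python standard library. It assumes that the resolver honors an `Accept: application/rdf+xml` header for the DOI in question, which is not guaranteed for every DOI or registration agency. The function names are hypothetical.

```python
# Minimal sketch: request RDF for an HTTP URI via content negotiation.
# Assumption: the server returns RDF/XML when asked with an Accept header.
from urllib.request import Request, urlopen


def doi_to_http_uri(doi):
    """Turn 'doi:10.x/y' or a bare '10.x/y' into an HTTP URI."""
    if doi.lower().startswith("doi:"):
        doi = doi[4:]
    return "http://dx.doi.org/" + doi


def build_rdf_request(uri, media_type="application/rdf+xml"):
    """Create (but do not send) a request that asks for an RDF representation."""
    return Request(uri, headers={"Accept": media_type})


def fetch_rdf(uri, media_type="application/rdf+xml", timeout=30):
    """Send the request and return the response body as text.

    Redirects are followed automatically by urlopen. The body is decoded
    as UTF-8, which is an assumption about the server's response.
    """
    with urlopen(build_rdf_request(uri, media_type), timeout=timeout) as response:
        return response.read().decode("utf-8")
```

Calling `fetch_rdf(doi_to_http_uri("doi:10.1525/auk.2009.09022"))` would attempt the network request for the example resource listed above. Passing `media_type="text/turtle"` asks for Turtle instead, if the server offers it.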


Questions? Comments? Contact Steve Baskauf