Skip to content

DTUWindEnergy/EXTRACT-taxonomy

Repository files navigation

CI

Controlled vocabularies such as taxonomies allow an accurate and controlled approach in describing datasets. One of such controlled vocabulary is Wind Energy Taxonomy of External Conditions. This taxonomy is the result of EERA JP WIND IRPWind Open Data initiative that took place in 2017 in which wind energy experts generated the first version of the taxonomy as an input for defining and structuring wind energy metadata.

The report of this work is available at Zenodo: https://www.zenodo.org/record/1199489#.XSD6haeQ3RY

In 2018, the taxonomy of topics was improved during the internal project of DTU Wind Energy titled 'FAIR Digitalization': https://www.zenodo.org/record/1493874#.XSD7TaeQ3RY

In 2021, the taxonomy was converted into FAIR machine-actionable controlled vocabulary using sheet2rdf. The controlled vocabulary is served to humans and machines using an instace of OntoStack hosted by DTU Wind Energy. sheet2rdf and OntoStack are developed by Nikola Vasiljevic.

Additionally, in 2021, the persistent URL purl.org/extract has been registered for the taxonomy. This allows us to:

  1. If there is a needed to move the taxonomy to another domain (currently under data.windenergy.dtu.dk) URLs will be still resolvable and unchanged
  2. Use short URls as PIDs for the taxonomy terms and properties such as:

Tooling

This repository hosts automatic workflow, executed by means of Github actions, and underlying shell and python scripts which:

  • Fetches Google Sheet, containing the taxonomy terms and their defitions, from Google Drive and stores is at xlsx and csv files
  • Converts fetched sheet to machine-actionable and FAIR RDF vocabulary using xls2rdf
  • Tests the resulting RDF vocabulary using qSKOS
  • Commits conversion results and tests logs to this repository
  • and deploy RDF vocabulary to OntoStack to be served to humans and machines

OntoStack is a set of orchestrated micro-services configured and interfaced such that they can intake vocabularies and resolve their terms and RDF properties upon requests either by humans or machines.

Some of OntoStack micro-services are:

  • Jena Fuseki a graph database
  • SKOSMOS a web-based SKOS browser acting as a front-end for the vocabularies persisted by the graph database
  • Træfik an edge router responsible for proper serving of URL requests

NEAT is served by DTU Wind Energy instance of OntoStack: https://data.windenergy.dtu.dk/ontologies/view

Taxonomy implementation

The taxonomy is implemented in following services:

NEWA: New European Wind Atlas

Visualize taxonomy

If you are interested to visualize taxonomy follow steps described in this YouTube video.

Contribute

The taxonomy is intended to be used and further developed by the community. Therefore, we welcome collaborators willing to take part in the further development of the taxonomy. If you are one of them either request to become one of the taxonomy admins and/or post GitHub issues on what we can improve in the current taxonomy.

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published