Skip to content

cadmiumkitty/data-governance

Repository files navigation

Data Governance Schema and Taxonomy

Introduction

This scheme is based on the Data Beliefs I outlined in the LinkedIn article back in 2022. It formalises Concepts, Data Containers, Data Elements, Data Ownership, Transformations and Controls. It also provides a set of common hierarchies for things such as Data Quality Dimensions and Controls.

Usage

Import the schema into Confluence with Taxonomies for Confluence add-on and use it to govern your organisation's data.

Data Governance RDFS Schema in Taxonomies for Confluence

Data Governance SKOS Taxonomy in Taxonomies for Confluence

References

  1. Data and Information entries in the ISKO Encyclopedia of Knowledge Organization by Birger Hjørland.
  2. Data and Reality by William Kent.
  3. Metaphors We Live By by George Lakoff and Mark Johnson.
  4. Creating a Structured Vocabulary by Leonard Will.
  5. Data on the Outside vs. Data on the Inside by Pat Helland.
  6. Why not one big database? Principles for data ownership by Marshall Van Alstyne, Erik Brynjolfsson, and Stuart Madnick.

Design Decisions

  1. For common properties follow schema.org naming convention for Properties (e.g., classification rather than hasClassification) to increase readability and adoption.
  2. Only use "has" and "is" in property URIs and labels when it helps make the meaning clear (e.g., hasBroaderConcept rather than broader as in SKOS) to increase readability and adoption.
  3. Doing away with domain and range and using names of the classes in the UIRs of the properties and, in most cases, in the labels.
  4. TTL file combines the RDFS-based schema and the SKOS-based taxonomy.
  5. Do away with Term Class for the time being to simplify the scheme. Terms are only useful when we want to specify context where the term is used explicitly (e.g., in specific business unit).
  6. Do away with DataTransferContainer and DataStorageContainer classes. Data Containers are not intrinsically transfer or storage, e.g. Excel file can be used to enter, process, transfer and store data.
  7. No CriticalDataElement Class, rather just criticality taxonomy.
  8. Control Class is for Resources corresponding to control instances, taxonomy for types can be created to specify types of controls and their hierarchy.
  9. When documenting Data Elements on Confluence people typically assume one-to-one relationship between Data Container, Concept and Data Element. To simplify data entry, yet allow query and integration with other Data Governance tools, separate set of properties were created for each of the Classes (e.g., dataContainerCategory, controlCategory)

Attributions and Disclaimers

APRA

Part of this work, "Data Governance Taxonomy", is a derivative of Prudential Practice Guide CPG 235 - Managing Data Risk (September 2013) by Australian Prudential Regulation Authority (APRA) used under Creative Commons Attribution 3.0 Australia Licence (CCBY 3.0)

This prudential practice guide is not legal advice and users are encouraged to obtain professional advice about the application of any legislation or prudential standard relevant to their particular circumstances and to exercise their own skill and care in relation to any material contained in this guide.

APRA disclaims any liability for any loss or damage arising out of any use of this prudential practice guide.

APRA do not endorse this derivative work or its author.

World Wide Web Consortium

"The RDF Concepts Vocabulary (RDF)" and "The RDF Schema vocabulary (RDFS)" are distributed with the release without changes under W3C® Software and Document License.

The RDF Concepts Vocabulary (RDF): http://www.w3.org/2000/01/rdf-schema#
Copyright © 2004-2014 World Wide Web Consortium. All Rights Reserved. This work is distributed under the W3C® Software and Document License in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.

The RDF Concepts Vocabulary (RDF): http://www.w3.org/1999/02/22-rdf-syntax-ns#
Copyright © 2019 World Wide Web Consortium. All Rights Reserved. This work is distributed under the W3C® Software and Document License in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.

About

Data Governance Schema and Taxonomy exploring options for data ownership modelling.

Topics

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages