Skip to content

TEC-Toolkit/Data-Validation

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

17 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Data validation

This repository contains the data validation module for the TEC Toolkit.

Getting Started

These instructions explain how to run and customise the Data Validation module.

Prerequisites

In the following, we assume that it is installed and added to the PATH.

Running (RDFox)

The main entry point is the master script.

This data validation module reads the RDF files and checks if some constraints are verified.

How to execute it:

RDFox sandbox <root> master

where <root> is the "scripts" folder (. if you are inside it).

Functionalities

Check if the data are "valid"

  1. Run the data validation module
  2. Check that all queries have 0 answers

Add another validation check

  1. Add a file "check-CUSTOM_NAME-rules" with the rules to check
  2. Add a file "check-CUSTOM_NAME-queries" with the ASK queries
    • Note that the queries must return no answer if the check is passed
  3. Add a new command exec check CUSTOM_NAME in the "validate" file

License

This project is licensed under the MIT License - see the LICENSE file for details.

Acknowledgments

Some parts have been adapted from the PRObs system (by the UK FIRES project).


This tool is part of the TEC-Toolkit.

TEC-Toolkit Logo

Footnotes

  1. RDFox is a commercial product, but anyone with a corporate email address can request a free RDFox trial license. Moreover, Oxford Semantic Technologies offers research licenses for free to accredited entities.