Permalink
Branch: master
Find file Copy path
Fetching contributors…
Cannot retrieve contributors at this time
42 lines (29 sloc) 1.58 KB

goodtables.io: Continuous tabular data validation

Goodtables increases your confidence in your data by performing a number of checks on it. It is able to validate things like:

  • Is my CSV valid?
  • Do all rows have the same number of columns?
  • Are all dates valid?
  • Is there any invalid e-mail?

If any of the checks fail, goodtables generates a report telling you which and where the errors occurred.

Example report with failed validation

By adding goodtables to your data publishing workflow, you'll make sure your data is free from these types of errors.

Features

  • Structural checks: Ensure that there are no empty rows, no blank headers, etc.
  • Content checks: Ensure that the values have the correct types ("string", "number", "date", etc.), that their format is valid ("string must be an e-mail"), and that they respect the constraints ("age must be a number greater than 18").
  • Support for multiple tabular formats: CSV, Excel, LibreOffice, Data Package, etc.
  • Automatically validate data on every update: Support for data on GitHub and Amazon S3.

Table of Contents

.. toctree::
   :maxdepth: 2

   getting_started/index
   configuration/index
   configuration/goodtables_yml
   api/index