Schema and utilities for Google Dataset Publishing Language

Dataset Publishing Language

Introduction

DSPL stands for Dataset Publishing Language. It is a representation format for both the metadata (information about the dataset, such as its name and provider, as well as the concepts it contains and displays) and actual data (the numbers) of datasets. Datasets described in this format can be imported into the Google Public Data Explorer, a tool that allows for rich, visual exploration of the data.
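
The split between metadata and data described above can be illustrated with a small Python sketch. It is not a valid DSPL dataset: the XML fragment is a hypothetical, heavily simplified stand-in (real files are namespaced and must conform to the published schema), and the CSV contents, element paths, and function names are assumptions made for the example.

```python
import csv
import io
import xml.etree.ElementTree as ET

# Hypothetical, simplified metadata. A real DSPL file is namespaced and
# carries more structure (concepts, slices, tables) than shown here.
METADATA_XML = """\
<dspl>
  <info>
    <name><value>Example population dataset</value></name>
  </info>
  <provider>
    <name><value>Example provider</value></name>
  </provider>
</dspl>
"""

# Hypothetical data table: the actual numbers live outside the metadata.
DATA_CSV = """\
country,year,population
AA,2000,100
AA,2001,110
"""


def read_metadata(xml_text):
    """Return the dataset and provider names from the simplified XML."""
    root = ET.fromstring(xml_text)
    return {
        "dataset_name": root.findtext("info/name/value"),
        "provider_name": root.findtext("provider/name/value"),
    }


def read_rows(csv_text):
    """Return the data rows as a list of dicts keyed by column header."""
    return list(csv.DictReader(io.StringIO(csv_text)))


metadata = read_metadata(METADATA_XML)
rows = read_rows(DATA_CSV)
print(metadata["dataset_name"], "from", metadata["provider_name"])
print(len(rows), "data rows")
```

The point of the sketch is only that descriptive information and numeric values are kept apart and read separately; consult the schema in this repository for the actual structure.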

This repository hosts open source content (schemas, example files, and utilities) associated with the DSPL standard. See our documentation site for more details on what DSPL is and how to use it; the utilities in this repository are also documented there.

Build and install

To build the tools, install lxml, then run the setup.py script in tools/dspltools/. You can use pip to handle both steps:

pip install -r tools/dspltools/requirements.txt
pip install tools/dspltools
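
After running the commands above, a quick way to confirm that the lxml dependency is visible to your interpreter is a check along the following lines. This is a minimal sketch, not part of the repository: the helper name missing_modules is hypothetical, and it only tests whether a module can be found, not whether the tools themselves work.

```python
import importlib.util


def missing_modules(names):
    """Return the top-level module names that cannot be found for import."""
    return [name for name in names if importlib.util.find_spec(name) is None]


# lxml is the dependency named in the build instructions.
missing = missing_modules(["lxml"])
if missing:
    print("Not importable:", ", ".join(missing))
else:
    print("lxml is importable")
```

If lxml is reported as not importable, rerunning the first pip command in the same environment is the likely fix.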