Schema and utilities for Google Dataset Publishing Language

Dataset Publishing Language


DSPL stands for Dataset Publishing Language. It is a representation format for both the metadata (information about the dataset, such as its name and provider, as well as the concepts it contains and displays) and actual data (the numbers) of datasets. Datasets described in this format can be imported into the Google Public Data Explorer, a tool that allows for rich, visual exploration of the data.

This site hosts miscellaneous, open source content (i.e., schemas, example files, and utilities) associated with the DSPL standard. See our documentation site for more details on what DSPL is and how to use it. The utilities in this repository are documented at this site.

Build and install

To build the tools, install lxml, then use the script in tools/dspltools/. You can use pip for both steps:

pip install -r tools/dspltools/requirements.txt
pip install tools/dspltools
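
After running the commands above, it can be useful to confirm that the lxml prerequisite is visible to the Python interpreter you intend to use. The following is a minimal sketch, not part of the repository: the helper name `is_installed` is hypothetical, and it only checks whether a module can be found on the import path, not whether the DSPL tools themselves work.

```python
import importlib.util


def is_installed(module_name: str) -> bool:
    """Return True if a top-level module can be found on the current import path.

    Hypothetical helper for illustration; it locates the module without importing it.
    """
    return importlib.util.find_spec(module_name) is not None


if __name__ == "__main__":
    # lxml is the prerequisite named in the build instructions.
    print("lxml available:", is_installed("lxml"))
```

If the check reports that lxml is not available, the pip commands were most likely run against a different Python environment than the one executing this script.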


The draft of the DSPL 2 specification, which replaces the existing XML metadata format with markup, can be found at the DSPL GitHub page. The source for the specification is in docs/.

Some initial library and tool support is available in tools/dspl2.

Build and install

To build the tools, install the prerequisites, then use the script in tools/dspl2/. You can use pip for both steps:

pip install -r tools/dspl2/requirements.txt
pip install tools/dspl2