Skip to content
Switch branches/tags

Latest commit


Git stats


Failed to load latest commit information.
Latest commit message
Commit time

Dataset Publishing Language


DSPL stands for Dataset Publishing Language. It is a representation format for both the metadata (information about the dataset, such as its name and provider, as well as the concepts it contains and displays) and actual data (the numbers) of datasets. Datasets described in this format can be imported into the Google Public Data Explorer, a tool that allows for rich, visual exploration of the data.

This site hosts miscellaneous, open source content (i.e., schemas, example files, and utilities) associated with the DSPL standard. See our documentation site for more details on what DSPL is and how to use it. The utilities in this repository are documented at this site.

Build and install

To build the tools, install lxml, then use the script in tools/dspltools/. You can use pip to install these:

pip install -r tools/dspltools/requirements.txt
pip install tools/dspltools


The draft of the DSPL 2 specification, which replaces the existing XML metadata format with markup, can be found at the DSPL GitHub page. The source for the specification is at docs/

Some initial library and tool support is available in tools/dspl2

Build and install

To build the tools, install the prerequisites, then use the script in tools/dspl2/. You can use pip to install these:

pip install -r tools/dspl2/requirements.txt
pip install tools/dspl2