DSPL stands for Dataset Publishing Language. It is a representation format for both the metadata (information about the dataset, such as its name and provider, as well as the concepts it contains and displays) and actual data (the numbers) of datasets. Datasets described in this format can be imported into the Google Public Data Explorer, a tool that allows for rich, visual exploration of the data.
This site hosts miscellaneous, open source content (i.e., schemas, example files, and utilities) associated with the DSPL standard. See our documentation site for more details on what DSPL is and how to use it. The utilities in this repository are documented at this site.
To build the tools, install lxml
, then use the setup.py
script in
tools/dspltools/
. You can use pip to install these:
pip install -r tools/dspltools/requirements.txt
pip install tools/dspltools
The draft of the DSPL 2 specification, which replaces the existing XML metadata
format with schema.org markup, can be found at the DSPL GitHub
page. The source for the specification is at
docs/dspl2-spec.md
.
Some initial library and tool support is available in tools/dspl2
To build the tools, install the prerequisites, then use the setup.py
script in
tools/dspl2/
. You can use pip to install these:
pip install -r tools/dspl2/requirements.txt
pip install tools/dspl2