ckanext-dgu - extension

This is an extension to CKAN that provides customisations specifically for the data.gov.uk (DGU) project:

  • DGU's package form - includes a number of custom fields such as temporal_coverage and geographic_coverage.
  • the Form API - exposes in the API the form expressed as HTML for insertion in the Drupal front-end.
  • Harvest Source form, supplied by Form API to the Drupal front-end.
  • Harvest Object inserted into the CKAN package view page.
  • gov_daily - a script (for running daily) that saves the database dumps for end-users (JSON/CSV) and backups (SQL).
  • ons_loader - an import script for data from the Office for National Statistics.
  • cospread - an import script for packages listed in a standardised spreadsheet format.
  • various other command-line utilities


To install ckanext-dgu, CKAN and their dependencies into a Python virtual environment:

virtualenv pyenv
pip -E pyenv install -e git+
pip -E pyenv install -e git+
pip -E pyenv install -r pyenv/src/ckan/pip-requirements.txt
pip -E pyenv install -r pyenv/src/ckanext-dgu/pip-requirements.txt


Different parts of the DGU extension require options to be set in the [app:main] section of the CKAN configuration file (.ini).

To use the DGU form specify:

package_form = package_gov3

To enable the Form API:

ckan.plugins = dgu_form_api

For the Drupal RPC connection (for user data etc.) supply the hostname, and credentials for HTTP Basic Authentication (if necessary):

dgu.xmlrpc_domain =
dgu.xmlrpc_username = ckan
dgu.xmlrpc_password = letmein
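
As a quick sanity check of the options above, the following is a minimal sketch, not part of ckanext-dgu, that reads the text of an .ini file and reports which of these settings look wrong. The function name `dgu_config_problems` and the choice of checks are assumptions made for illustration; the username and password are not checked because they are only needed when HTTP Basic Authentication is in use.

```python
import configparser


def dgu_config_problems(ini_text):
    """Return a list of problems found in the [app:main] section.

    Hypothetical helper: it only checks the three options described above.
    """
    # Interpolation is disabled so values such as %(here)s are read verbatim.
    parser = configparser.ConfigParser(interpolation=None)
    parser.read_string(ini_text)

    if not parser.has_section("app:main"):
        return ["no [app:main] section"]

    main = parser["app:main"]
    problems = []
    if main.get("package_form", "").strip() != "package_gov3":
        problems.append("package_form is not package_gov3")
    # ckan.plugins is a whitespace-separated list of plugin names.
    if "dgu_form_api" not in main.get("ckan.plugins", "").split():
        problems.append("dgu_form_api missing from ckan.plugins")
    if not main.get("dgu.xmlrpc_domain", "").strip():
        problems.append("dgu.xmlrpc_domain is empty")
    return problems
```

An empty list means all three options are present; each string in a non-empty list names one setting to review.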


A front page is added to CKAN that describes the Catalogue APIs. The usual CKAN front page has been moved to /ckan/ .



There are a number of command-line scripts for processing data. To run one of these, activate the environment first. For example, to load some ONS data you might start like this:

. pyenv/bin/activate
ons_loader --help


To test the DGU extension you need the CKAN setup described above and a configured pyenv/src/ckan/development.ini (see ).

To run the tests:

cd {pyenv}/src/ckanext-dgu
nosetests --ckan ckanext/dgu/tests/

or run them from another directory by specifying the test.ini:

nosetests --ckan --with-pylons={pyenv}/src/ckanext-dgu/test.ini {pyenv}/src/ckanext-dgu/ckanext/dgu/tests/

You can run the tests either 'quick and dirty' with SQLite or more comprehensively with PostgreSQL. Set --with-pylons to point to the relevant configuration file, either test.ini or test-core.ini (both from the ckanext-dgu repo, not the ckan one). For more information, see .
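
To make the {pyenv} placeholder and the choice of configuration file concrete, here is a small sketch that assembles the nosetests command line described above. The helper name `nosetests_command` and its parameters are hypothetical; only the --ckan and --with-pylons flags and the directory layout come from this README.

```python
import posixpath


def nosetests_command(pyenv, config_name="test.ini"):
    """Build the nosetests argument list for the ckanext-dgu tests.

    pyenv is the path to the virtual environment; config_name is either
    "test.ini" or "test-core.ini" from the ckanext-dgu repo.
    """
    ext_dir = posixpath.join(pyenv, "src", "ckanext-dgu")
    return [
        "nosetests",
        "--ckan",
        "--with-pylons=" + posixpath.join(ext_dir, config_name),
        posixpath.join(ext_dir, "ckanext", "dgu", "tests"),
    ]
```

The resulting list can be passed to `subprocess.run` from any working directory, because every path in it is built from the pyenv location.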

Test issues

Address and Connection errors

  • socket.error: [Errno 98] Address already in use
  • error: [Errno 111] Connection refused

These errors usually mean that a previous run of the tests has not cleaned up the Mock Drupal process. You can verify this with:

$ ps a | grep mock_drupal
4748 pts/8    S      0:00 /home/dread/hgroot/pyenv-dgu/bin/python /home/dread/hgroot/pyenv-dgu/bin/paster --plugin=ckanext-dgu mock_drupal run -q

Now kill it before running the tests again:

$ kill 4748
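
The manual check and kill above could be automated along these lines. This is a sketch under assumptions: it expects `ps a` output in the five-column layout shown above (PID, TTY, STAT, TIME, COMMAND), and the function names are illustrative rather than part of ckanext-dgu.

```python
import os
import signal
import subprocess


def mock_drupal_pids(ps_output):
    """Extract PIDs of mock_drupal processes from `ps a` output."""
    pids = []
    for line in ps_output.splitlines():
        fields = line.split()
        # Skip blank lines and the header row, whose first field is not a PID.
        if not fields or not fields[0].isdigit():
            continue
        command = " ".join(fields[4:])
        # Ignore a `grep mock_drupal` process, which would match itself.
        if "mock_drupal" in command and not command.startswith("grep"):
            pids.append(int(fields[0]))
    return pids


def kill_stale_mock_drupal():
    """Send SIGTERM to every leftover mock_drupal process and return the PIDs."""
    result = subprocess.run(["ps", "a"], capture_output=True, text=True, check=True)
    pids = mock_drupal_pids(result.stdout)
    for pid in pids:
        os.kill(pid, signal.SIGTERM)
    return pids
```

Calling `kill_stale_mock_drupal()` before a test run returns the list of PIDs it signalled, or an empty list when nothing was left over.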

Config errors

  • DrupalXmlRpcSetupError: Drupal XMLRPC not configured.

The settings whose absence causes this error are defined in {pyenv}/src/ckanext-dgu/test-core.ini, which is also imported into {pyenv}/src/ckanext-dgu/test.ini, so make sure your nosetests --with-pylons parameter points to one of these config files.


DGU is an extension for CKAN:

This README file is part of the DGU Developer Documentation, stored in the ckanext-dgu repo at ckanext-dgu/doc.

The Developer Docs can be built using Sphinx:

python setup.py build_sphinx