Skip to content

HTTPS clone URL

Subversion checkout URL

You can clone with HTTPS or Subversion.

Download ZIP
tiny little project to harvest rdfa metadata from data.gov.uk
Python
branch: master

Fetching latest commit…

Cannot retrieve the latest commit at this time

Failed to load latest commit information.
.gitignore
README
crawl.py
data.ntriples
data.rdf
distributions.py
distributions.txt
subjects.py
subjects.txt

README

This is a rdfa crawler for the data.gov.uk site. It walks the 
complete listing of datasets and extracts the metadata from the 
HTML that is expressed as RDFa.

After you install python and rdflib you should be able to run it:

    % crawl.py

Comments, questions:

Ed Summers <ehs@pobox.com>
    
Something went wrong with that request. Please try again.