tiny little project to harvest rdfa metadata from data.gov.uk
Python
Switch branches/tags
Nothing to show
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Failed to load latest commit information.
.gitignore
README
crawl.py
data.ntriples
data.rdf
distributions.py
distributions.txt
subjects.py
subjects.txt

README

This is a rdfa crawler for the data.gov.uk site. It walks the 
complete listing of datasets and extracts the metadata from the 
HTML that is expressed as RDFa.

After you install python and rdflib you should be able to run it:

    % crawl.py

Comments, questions:

Ed Summers <ehs@pobox.com>