Skip to content

HTTPS clone URL

Subversion checkout URL

You can clone with
or
.
Download ZIP
This a module to extract RDF from an HTML5 page annotated with microdata. The module implements the algorithm defined and published by the W3C Semantic Web Interest Group task force, in March 2012. The module can be used to produce serialized versions of the extracted graph, or simply an RDFLib Graph Object.
Python CSS JavaScript Shell

Fetching latest commit…

Cannot retrieve the latest commit at this time

Failed to load latest commit information.
Doc-pyMicrodata
pyMicrodata
scripts
tests
LICENSE.txt
PKG-INFO
README.txt
epydoc.css
generatePyMicrodataDoc
setup.py

README.txt

pyMicrodata distiller/parser library. The distribution contains:

- ./pyMicrodata: the Python library. You should copy the directory
  somewhere into your PYTHONPATH. Alternatively, you can also run the

    python setup.py install

  script in the directory.

- ./scripts/CGI_microdata.py: can be used as a CGI script to invoke the library.
  It has to be adapted to the local server setup, namely in setting the right paths

- ./scripts/localMicrodata.py: script that can be run locally on to transform
  a file into RDF (on the standard output). Run the script with "-h" to
  get the available flags.

- ./Doc-pyMicrodata: (epydoc) documentation of the classes and functions


The package primarily depends on:
 - RDFLib<http://rdflib.net>. Version 3.2.0 or higher is strongly recommended.
 - html5lib<http://code.google.com/p/html5lib/> (in the additional packages folder)
    
The package has been tested on Python version 2.4 and higher. Python 2.6 is strongly recommended. The package
does not run with Python 3.

For the details on the conversion of Microdata to RDF, see:

http://www.w3.org/TR/2012/NOTE-microdata-rdf-20120308/


Release notes (starting from version 1.0)
=========================================

- this is the first fully released version

Something went wrong with that request. Please try again.