command line utility to dump records in an oai-pmh repository as xml in a pairtree
Python
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Failed to load latest commit information.
.gitignore
README.md
oai2pairtree.py
setup.py

README.md

oai2pairtree

oai2pairtree.py harvests records from an oai-pmh repository and stores them in a pairtree on the filesystem.

Usage

oai2pairtree.py http://www.pubmedcentral.nih.gov/oai/oai.cgi 

or if you want to limit to a particular set:

oai2pairtree.py http://www.pubmedcentral.nih.gov/oai/oai.cgi --set pmc-open

or if you want to also limit to a particular kind of record metadata:

oai2pairtree.py http://www.pubmedcentral.nih.gov/oai/oai.cgi --set pmc-open --metadata_prefix pmc

Installation

oai2pairtree requires that the lxml and ptree to run. The best way to get these is to:

easy_install oai2pairtree

or:

pip install oai2pairtree

or, if you prefer:

git clone https://github.com/edsu/oai2pairtree.git
cd oai2pairtree
python setup.py install

License

  • CC0