oai2pairtree harvests records from an OAI-PMH repository and stores them in a pairtree on the filesystem.
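
For readers unfamiliar with pairtrees, here is a minimal sketch of how an identifier maps to a directory path under the Pairtree specification. It is not taken from oai2pairtree or ptree: the function names `clean_id` and `id_to_ppath` are hypothetical, and the exact layout oai2pairtree writes may differ.

```python
# Illustrative only: hypothetical helpers, not the oai2pairtree or ptree API.

# Characters the Pairtree spec hex-encodes as ^hh before any other mapping.
_HEX_ENCODE = set('"*+,<=>?\\^|')


def clean_id(identifier: str) -> str:
    """Apply the two-step Pairtree identifier cleaning."""
    encoded = []
    for byte in identifier.encode("utf-8"):
        ch = chr(byte)
        # Step 1: hex-encode special and non-visible characters.
        if byte < 0x21 or byte > 0x7E or ch in _HEX_ENCODE:
            encoded.append("^%02x" % byte)
        else:
            encoded.append(ch)
    # Step 2: single-character substitutions for filesystem safety.
    return "".join(encoded).translate(str.maketrans("/:.", "=+,"))


def id_to_ppath(identifier: str) -> str:
    """Split the cleaned identifier into two-character path segments."""
    cleaned = clean_id(identifier)
    return "/".join(cleaned[i:i + 2] for i in range(0, len(cleaned), 2))


print(id_to_ppath("ark:/13030/xt12t3"))  # ar/k+/=1/30/30/=x/t1/2t/3
```

Short two-character directory names keep any single directory from accumulating too many entries, which is why the scheme suits large harvests.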


To limit the harvest to a particular set: --set pmc-open

To also limit it to a particular kind of record metadata: --set pmc-open --metadata_prefix pmc
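
As a rough illustration of what these options correspond to at the protocol level, the sketch below builds an OAI-PMH ListRecords request URL. It assumes that --set maps to the protocol's `set` argument and --metadata_prefix to `metadataPrefix`. The helper `list_records_url` and the base URL are hypothetical, and oai2pairtree may build its requests differently.

```python
from urllib.parse import urlencode


def list_records_url(base_url, metadata_prefix="oai_dc", set_spec=None):
    """Build an OAI-PMH ListRecords URL (hypothetical helper, not part of oai2pairtree)."""
    # verb and metadataPrefix are required by the protocol for ListRecords;
    # oai_dc is used as the default here because every repository must support it.
    params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    if set_spec is not None:
        # "set" is optional and restricts the harvest to one set.
        params["set"] = set_spec
    return base_url + "?" + urlencode(params)


# https://example.org/oai is a placeholder base URL, not a real repository.
print(list_records_url("https://example.org/oai", "pmc", "pmc-open"))
# https://example.org/oai?verb=ListRecords&metadataPrefix=pmc&set=pmc-open
```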


oai2pairtree requires lxml and ptree to run. The best way to get them is to run:

easy_install oai2pairtree

or:

pip install oai2pairtree

or, if you prefer:

git clone
cd oai2pairtree
python setup.py install


  • CC0