Open Access Media Importer

A tool for harvesting media files from Open Access articles for upload to Wikimedia Commons.

The aim of this project is to write a tool that would:
* regularly spider PubMed Central to locate audio and video files published in the supplementary materials of CC BY-licensed articles in the Open Access subset
* convert these files to OGG
* upload them to Wikimedia Commons, along with the respective metadata
* provide for easy extension to other CC BY sources, beyond PubMed Central
* (possibly) suggest Wikipedia articles for which the media files might be relevant
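
The selection step in the first bullet can be illustrated with a minimal sketch. It is not the importer's actual code: the record layout (`url`, `mimetype`, `license` keys), the license string `"CC BY"`, and the function names are all assumptions made for illustration.

```python
# Hypothetical record layout: each supplementary material is a dict with
# "url", "mimetype" and "license" keys. The importer's real data model
# may look quite different.
ACCEPTED_LICENSE = "CC BY"          # assumed license label
MEDIA_TYPES = ("audio", "video")    # major MIME types of interest


def is_candidate(material):
    """Return True for audio/video files from CC BY-licensed articles."""
    major_type = material.get("mimetype", "").split("/", 1)[0]
    return (
        material.get("license") == ACCEPTED_LICENSE
        and major_type in MEDIA_TYPES
    )


def select_media(materials):
    """Keep only the materials that pass is_candidate()."""
    return [m for m in materials if is_candidate(m)]


materials = [
    {"url": "https://example.org/s1.mov", "mimetype": "video/quicktime", "license": "CC BY"},
    {"url": "https://example.org/s2.pdf", "mimetype": "application/pdf", "license": "CC BY"},
    {"url": "https://example.org/s3.mp3", "mimetype": "audio/mpeg", "license": "CC BY-NC"},
]

for material in select_media(materials):
    print(material["url"])
```

Only the first record passes: the PDF is not audio or video, and the MP3 carries a license other than CC BY.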

Wiki page:

    oa-get [download-metadata|download-media] [dummy|pmc|pmc_doi]
    oa-cache [browse-database|clear-database|clear-media|convert-media|find-media|list-articles|stats] [dummy|pmc|pmc_doi]
    oa-put upload-media [dummy|pmc|pmc_doi]
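
As a rough illustration of how these commands might be chained for one source, the sketch below only builds the argument lists and prints them; it does not run anything. The ordering of the steps is an assumption inferred from the subcommand names, and `build_pipeline` is a hypothetical helper, not part of the tool.

```python
# Source names taken from the usage lines above.
SOURCES = ("dummy", "pmc", "pmc_doi")

# Assumed ordering of a full run, inferred from the subcommand names;
# the project itself may prescribe a different sequence.
STEPS = (
    ("oa-get", "download-metadata"),
    ("oa-cache", "find-media"),
    ("oa-get", "download-media"),
    ("oa-cache", "convert-media"),
    ("oa-put", "upload-media"),
)


def build_pipeline(source):
    """Return one argv list per step for the given source."""
    if source not in SOURCES:
        raise ValueError("unknown source: %r" % (source,))
    return [[tool, action, source] for tool, action in STEPS]


for argv in build_pipeline("dummy"):
    print(" ".join(argv))
```

Each printed line could then be handed to a shell, or each argv list to `subprocess.run`, once the sequence has been checked against the project's own instructions.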

    python-dateutil
    python-elixir
    python-gst0.10
    python-magic
    python-mutagen
    python-progressbar
    python-xdg
    python-werkzeug
    python-wikitools (imported into our tree and patched to ease deployment)

    sqlitebrowser

To use the upload feature of oa-put, copy the userconfig.example file to

A screencast showing usage can be played back with “ttyplay screencast”.

To plot mimetypes occurring in sources, install python-matplotlib and pipe the output of “oa-cache stats [source]” to the included plot-helper script.

The Open Access Media Importer is free software: 
you can redistribute it and/or modify it 
under the terms of the GNU General Public License 
as published by the Free Software Foundation, 
either version 3 of the License, 
or (at your option) any later version.
