A simple Python wrapper for the DocumentCloud API

  ____                                        _    ____ _                 _ 
 |  _ \  ___   ___ _   _ _ __ ___   ___ _ __ | |_ / ___| | ___  _   _  __| |
 | | | |/ _ \ / __| | | | '_ ` _ \ / _ \ '_ \| __| |   | |/ _ \| | | |/ _` |
 | |_| | (_) | (__| |_| | | | | | |  __/ | | | |_| |___| | (_) | |_| | (_| |
 |____/ \___/ \___|\__,_|_| |_| |_|\___|_| |_|\__|\____|_|\___/ \__,_|\__,_|/  

A simple Python wrapper for the DocumentCloud API.

Features

  • Retrieve and edit documents and projects, both public and private, from documentcloud.org
  • Upload PDFs into your documentcloud.org account and organize them into projects
  • Download text, images and entities extracted from your PDFs by DocumentCloud

Getting started

Documentation is available at http://datadesk.github.com/python-documentcloud/

Getting started is as easy as…

$ pip install python-documentcloud
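
Once the package is installed, a first search might look like the sketch below. It assumes the package exposes a DocumentCloud client class whose documents.search() method returns objects with a title attribute; confirm those names against the documentation linked above. The search_titles helper is hypothetical and exists only for illustration.

```python
# Assumption: the installed package provides documentcloud.DocumentCloud
# and client.documents.search(); verify both in the project documentation.
try:
    from documentcloud import DocumentCloud
except ImportError:
    # Package not installed; the helper below still works with any
    # object that offers the same documents.search() interface.
    DocumentCloud = None


def search_titles(client, query):
    """Return the title of every document the client finds for `query`.

    `search_titles` is a hypothetical helper, not part of the library.
    """
    return [doc.title for doc in client.documents.search(query)]
```

With the package installed, calling search_titles(DocumentCloud(), "budget") should list the titles of matching public documents; "budget" is only a placeholder query. Private documents would presumably require passing account credentials when creating the client.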