Python library to interact with https://pdftables.com API
Switch branches/tags
Nothing to show
Clone or download
Latest commit b818a25 Jun 28, 2018
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
pdftables_api Decode gzip content Sep 25, 2017
test Support eliding output name; return string or bytes Sep 25, 2017
.gitignore Initial commit Aug 24, 2016
.travis.yml Add Python 3.6 to Travis Apr 17, 2018
LICENSE Update to correct licence Sep 6, 2016
README.md Update installation notes Jun 28, 2018
setup.py Initial commit Aug 24, 2016

README.md

pdftables-api

Build Status

Python library to interact with the PDFTables.com API.

Installation

pip: (requires git installed)

pip install git+https://github.com/pdftables/python-pdftables-api.git

pip: (without git)

pip install https://github.com/pdftables/python-pdftables-api/archive/master.tar.gz

Locally:

python setup.py install

Usage

import pdftables_api

c = pdftables_api.Client('my-api-key')
c.xlsx('input.pdf', 'output.xlsx')

Test

python -m unittest test.test_pdftables_api

Configuring a timeout

If you are converting a large document (hundreds or thousands of pages), you may want to increase the timeout.

Here is an example of the sort of error that might be encountered:

ReadTimeout: HTTPSConnectionPool(host='pdftables.com', port=443): Read timed out. (read timeout=300)

The below example allows 60 seconds to connect to our server, and 1 hour to convert the document:

import pdftables_api

c = pdftables_api.Client('my-api-key', timeout=(60, 3600))
c.xlsx('input.pdf', 'output.xlsx')