Convert pdf files to text and html (eventually MS Word, too)
Python JavaScript HTML CSS
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Failed to load latest commit information.
fileupload
static
templates
LICENSE.txt
README.md
__init__.py
__init__.pyc
manage.py
settings.py
settings.pyc
urls.py
urls.pyc

README.md

Update

TODO Use the terrific library for pdf to html conversion: pdf2htmlEX, using ttfautohint as --external-hint-tool=ttfautohint

Conversion to Word can use pandoc

A web-based converter from pdf to html and eventually other formats (text and MSWord). The UI is based on a Django-backed app built on jQuery-File-Upload. That JQuery app was developed by Sebastian Tschan, with the source available on github. This was ported to Django by Sigurd Gartmann (sigurdga on github).

Ari Hershowitz has connected it to a pdf converter, based on the server. For a Django app to use JQuery-File-Upload, you should branch from here.

License

MIT, as the original project. See LICENSE.txt.