Docvert

Converts word processor files (e.g. .doc files) to OpenDocument, DocBook, and structured HTML.

This is Docvert for Python 2. For the Python 3 version, see http://github.com/holloway/docvert-python3/

Web Service

python2 ./docvert-web.py [-p PORT] [-H host]

Command Line

python2 ./docvert-cli.py

usage: docvert-cli.py [-h] [--version] --pipeline PIPELINE
    [--response {auto,path,stdout}]
    [--autopipeline {Break up over Heading 1.default,Nothing one long page}]
    [--url URL]
    [--list-pipelines]
    [--pipelinetype {tests,auto_pipelines,pipelines}]
    infile [infile ...]
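
As a rough illustration of the usage text above, the following sketch assembles a docvert-cli.py command line from Python. It uses only options that appear in that usage text (--pipeline, --response, and the infile arguments); the helper name docvert_cli_command, the default script path, and the placeholder pipeline name in the comment are assumptions made for illustration, not part of Docvert.

```python
def docvert_cli_command(pipeline, infiles, response="auto",
                        script="./docvert-cli.py"):
    """Build an argument list for docvert-cli.py (hypothetical helper).

    Only options shown in the usage text are emitted. The pipeline name
    is passed through unchecked; valid names depend on the installation.
    """
    # The usage text lists exactly these three choices for --response.
    if response not in ("auto", "path", "stdout"):
        raise ValueError("response must be one of: auto, path, stdout")
    # The usage text requires at least one input file.
    infiles = list(infiles)
    if not infiles:
        raise ValueError("at least one input file is required")
    return ["python2", script,
            "--pipeline", pipeline,
            "--response", response] + infiles

# Example (the pipeline name is a placeholder, not a real pipeline):
#   import subprocess
#   subprocess.check_call(docvert_cli_command("<pipeline name>", ["report.doc"]))
```

Returning a list rather than a single string avoids shell quoting problems when file names contain spaces.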

Community

http://lists.catalyst.net.nz/mailman/listinfo/docvert

Requirements

Python 2.6 or 2.7
libreoffice
python-uno
python-lxml
python-imaging
pdf2svg
librsvg2-2

Quickstart Guide

sudo apt-get install libreoffice python-uno python-lxml python-imaging pdf2svg librsvg2-2

/usr/bin/soffice --headless --norestore --nologo --nofirststartwizard --accept="socket,port=2002;urp;"

Then, in another terminal:

cd ~

git clone https://github.com/holloway/docvert.git

cd docvert

python2 ./docvert-web.py

Then browse to http://localhost:8080
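
If conversions fail right after startup, one thing worth checking is whether the soffice listener from the Quickstart is actually accepting connections. The sketch below polls a TCP port until it opens. The port number 2002 comes from the --accept option above; the function name wait_for_port and its timeout values are illustrative assumptions, and the check only confirms that something is listening, not that it is LibreOffice.

```python
import socket
import time

def wait_for_port(host="localhost", port=2002, timeout=30.0, interval=0.5):
    """Return True once a TCP connection to host:port succeeds,
    or False if none succeeds before the timeout (hypothetical helper)."""
    deadline = time.time() + timeout
    while True:
        try:
            # Try to open, then immediately close, a plain TCP connection.
            conn = socket.create_connection((host, port), timeout=interval)
            conn.close()
            return True
        except (socket.error, OSError):
            # Not listening yet: give up at the deadline, else retry.
            if time.time() >= deadline:
                return False
            time.sleep(interval)
```

Calling wait_for_port() with its defaults before launching docvert-web.py gives soffice up to 30 seconds to start listening.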

LICENCE

Released under the GPLv3; see the LICENCE file.