Skip to content

holloway/docvert

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Docvert

Converts Word Processor office files (e.g. .DOC files) to OpenDocument, DocBook, and structured HTML.

This is Docvert for Python 2. To find Docvert for Python 3 see http://github.com/holloway/docvert-python3/

Web Service

python2 ./docvert-web.py [-p PORT] [-H host]

Command Line

python2 ./docvert-cli.py

usage: docvert-cli.py [-h] [--version] --pipeline PIPELINE
    [--response {auto,path,stdout}]
    [--autopipeline {Break up over Heading 1.default,Nothing one long page}]
    [--url URL]
    [--list-pipelines]
    [--pipelinetype {tests,auto_pipelines,pipelines}]
    infile [infile ...]

Community

http://lists.catalyst.net.nz/mailman/listinfo/docvert

Requirements

Python 2.6 or 2.7
libreoffice
python-uno
python-lxml
python-imaging
pdf2svg
librsvg2-2

Quickstart Guide

sudo apt-get install libreoffice python-uno python-lxml python-imaging pdf2svg librsvg2-2

/usr/bin/soffice --headless --norestore --nologo --norestore --nofirststartwizard --accept="socket,port=2002;urp;"

then in another terminal

cd ~

git clone git://github.com/holloway/docvert.git

cd docvert

python2 ./docvert-web.py

and browse to http://localhost:8080

LICENCE

Released under the GPL3 see LICENCE

About

Docvert for Python 2: Converts Office files to DocBook and clean HTML, diagrams to SVG/PNG, etc.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published