Parsing docx files with Ruby and output them as HTML
Ruby
Permalink
Failed to load latest commit information.
bin Added option handling into command May 2, 2012
lib Merge branch 'fix-image-extraction' of https://github.com/jhubert/ydocx Jun 14, 2015
spec
.gitignore Add more specs Feb 19, 2014
.travis.yml Remove wrong script line Feb 18, 2014
Gemfile Updated to use rubyzip >= 1.0.0 Jul 7, 2014
Gemfile.lock
History.txt
LICENSE
Manifest.txt
README.txt
Rakefile
ydocx.gemspec Added a gemspec to able to reference ydocx in a Gemfile as :path => '… Feb 19, 2014

README.txt

== ydocx by ywesee GmbH

* https://github.com/zdavatz/ydocx
* Parsing docx files with Ruby and output them as HTML and XML.

== Supports

* Tables
* Uppercase letters, numbers
* Lowercase letters, numbers
* Umlaute
* bold, italic, underline
* Images 
 ** wmf requires imagemagick and is only partially supported due to imagemagick
 ** png files are copied 1:1

* works on Windows as well.

== Usage

* Usage: docx2html file [options]
    -f, --format    Format of style and chapter {(fi|fachinfo)|(pi|patinfo)|(pl|plain)|none}, default fachinfo.
    -h, --help      Display this help message.
    -l, --lang      Language option for templates {de|fr}
    -v, --version   Show version.

== Using the great libraries

* rubyzip (< 1.0.0)
* nokogiri
* htmlentities
* RMagick

Install them using bundle install.

Requires Ruby >= 1.9.2.

== TODO

For unknown reason the gem fails to  parse properly the Présentation for sinovial_fr.
Failing spec is: ydocx should convert sinovial_FR to xml
     Failure/Error: doc.xpath('//chapters/chapter[contains(heading, "Présentation")]').size.should > 0
     

== License GPLv3.0

* http://www.gnu.org/licenses/gpl.html