Python library for parsing .docx (Office Open XML) files


About Python-OOXML

Python-OOXML is a Python library for parsing Office Open XML files. At the moment it only supports HTML as output format. Strong emphasis is put on easy customization of the output. The library comes with an importer which is capable of splitting a document into separate chapters. It works both with documents which use Word styles, and documents where they are not used.

Python-OOXML is used in Booktype 2.0 from Sourcefabric.


Developer documentation for Python-OOXML can be found at Read the Docs.


Python-OOXML is licensed under the AGPL license.