Skip to content

cosmojg/doc2docx

 
 

Repository files navigation

doc2docx

PyPI

Convert doc to docx on Windows or macOS directly using Microsoft Word (must be installed).

On Windows, this is implemented via win32com while on macOS this is implemented via JXA (Javascript for Automation, aka AppleScript in JS).

Install

Via brew:

brew install cosmojg/tap/doc2docx

Via pipx:

pipx install doc2docx

Via pip:

pip install doc2docx

CLI

usage: doc2docx [-h] [--keep-active] [--version] input [output]

Example Usage:

Convert single doc file in-place from myfile.doc to myfile.docx:
    doc2docx myfile.doc

Batch convert doc folder in-place. Output docx files will go in the same folder:
    doc2docx myfolder/

Convert single doc file with explicit output filepath:
    doc2docx input.doc output.docx

Convert single doc file and output to a different explicit folder:
    doc2docx input.doc output_dir/

Batch convert doc folder. Output docx files will go to a different explicit folder:
    doc2docx input_dir/ output_dir/

positional arguments:
  input          input file or folder. batch converts entire folder or convert
                 single file
  output         output file or folder

optional arguments:
  -h, --help     show this help message and exit
  --keep-active  prevent closing word after conversion
  --version      display version and exit

Library

from doc2docx import convert

convert("input.doc")
convert("input.doc", "output.docx")
convert("my_doc_folder/")

See CLI docs above (or in doc2docx --help) for all the different invocations. It is the same for the CLI and python library.

Jupyter Notebook

If you are using this in the context of jupyter notebook, you will need ipywidgets for the tqdm progress bar to render properly.

pip install ipywidgets
jupyter nbextension enable --py widgetsnbextension

Acknowledgements

Many thanks to @AlJohri for the excellent docx2pdf upon which this is based!

About

A tool for converting older Word 97–2004 Documents (.doc) to modern Word 2007–365 Documents (.docx) based on the excellent docx2pdf

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 98.3%
  • Makefile 1.7%