ContextGem: Effortless LLM extraction from documents
Updated Jun 4, 2025 - Python
A simple, straightforward Python utility that converts a Microsoft Word document (`.docx`) to a Markdown file (`.md`) and vice versa. It supports multiple Markdown elements, including headings, bold and italic text, unordered and ordered lists, and more.
This project is a Python application that converts Word documents to Markdown files. It uses the python-docx library to read Word documents, parses the content into headings, paragraphs, lists, tables, and images, converts these elements to their Markdown equivalents, and writes the result to a Markdown file.
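The element-to-Markdown mapping described above can be sketched roughly as follows. This is not the project's actual code: it is a minimal illustration that assumes paragraphs have already been read into hypothetical `(style_name, text)` pairs, and it assumes the style names `Heading N`, `List Bullet`, and `List Number` used by python-docx's default template. Tables, images, and inline formatting are left out.

```python
import re

# Style names assumed to mark list items (python-docx default template names).
LIST_STYLES = {"List Bullet", "List Number"}


def paragraph_to_markdown(style_name, text, counter=1):
    """Map one (style name, text) pair to a single Markdown line."""
    heading = re.fullmatch(r"Heading (\d)", style_name)
    if heading:
        # "Heading 2" becomes "## text"
        return "#" * int(heading.group(1)) + " " + text
    if style_name == "List Bullet":
        return "- " + text
    if style_name == "List Number":
        return f"{counter}. {text}"
    # Anything else is treated as a plain paragraph.
    return text


def convert(paragraphs):
    """Convert an iterable of (style_name, text) pairs to a Markdown string."""
    out = []
    counter = 0
    prev_style = None
    for style_name, text in paragraphs:
        if not text.strip():
            continue  # skip empty paragraphs
        # Numbering restarts whenever a numbered list is interrupted.
        counter = counter + 1 if style_name == "List Number" else 0
        same_list = style_name in LIST_STYLES and style_name == prev_style
        if out and not same_list:
            out.append("")  # blank line between block elements
        out.append(paragraph_to_markdown(style_name, text, counter))
        prev_style = style_name
    return "\n".join(out)


sample = [
    ("Heading 1", "Title"),
    ("Normal", "Intro."),
    ("List Bullet", "a"),
    ("List Bullet", "b"),
    ("List Number", "x"),
    ("List Number", "y"),
]
markdown = convert(sample)
```

With python-docx installed, the input pairs could plausibly be built from a real file with `[(p.style.name, p.text) for p in Document(path).paragraphs]`, and the resulting string written out to a `.md` file.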