all2md v1.0.1 - Initial Release
Release Notes for v1.0.1 - Initial Release
all2md: The Universal Document Conversion Library
The first stable release of all2md, a comprehensive Python document conversion library for bidirectional transformation between 40+ file formats and Markdown.
Highlights
- Universal Conversion - Convert PDFs, Office files, HTML, emails, spreadsheets, and 40+ formats to clean Markdown — and back again
- AST-Based Architecture - Powerful document manipulation pipeline with built-in transforms
- MCP Server - Built-in Model Context Protocol server for direct AI assistant integration (Claude Desktop, etc.)
- Production-Ready CLI - Multi-file processing, parallel execution, directory watching, and rich terminal output
Supported Formats
Input: PDF, DOCX, PPTX, HTML, MHTML, EML, MBOX, MSG, PST/OST, EPUB, FB2, CHM, XLSX, ODT/ODP/ODS, CSV/TSV, Jupyter Notebooks, OpenAPI/Swagger, LaTeX, RST, AsciiDoc, Org-Mode, MediaWiki, Textile, ZIP archives, and 200+ source code languages
Output: Markdown (GFM, CommonMark), DOCX, PPTX, PDF, HTML, RTF, ODT/ODP, RST, AsciiDoc, Org-Mode, MediaWiki, Textile, LaTeX, CSV, Jupyter Notebooks, and custom Jinja2 templates
Key Features
- Advanced PDF Parsing - Table detection, multi-column layout analysis, header/footer removal, OCR support
- Transform System - 10 built-in transforms (remove-images, heading-offset, link-rewriter, etc.)
- Security-First - SSRF protection, ZIP bomb detection, path traversal prevention, HTML sanitization
- Plugin System - Extensible via Python entry points for custom formats and transforms
- Smart Dependencies - Core is dependency-free; install only what you need
- Powerful CLI - Convert documents in the terminal
all2md paper.md --out paper.docx
Installation
pip install all2md # Core
pip install "all2md[pdf]" # With PDF support
pip install "all2md[docx,pdf,ppt]" # Include the formats you need!
pip install "all2md[all]" # Everything