all2md v1.0.1 - Initial Release

thomas-villani released this 15 Dec 16:03 · 188 commits to main since this release

Release Notes for v1.0.1 - Initial Release

all2md: The Universal Document Conversion Library

The first stable release of all2md, a comprehensive Python document conversion library for bidirectional transformation between 40+ file formats and Markdown.

Highlights

  • Universal Conversion - Convert PDFs, Office files, HTML, emails, spreadsheets, and more (40+ formats in total) to clean Markdown, and back again
  • AST-Based Architecture - Powerful document manipulation pipeline with built-in transforms
  • MCP Server - Built-in Model Context Protocol server for direct AI assistant integration (Claude Desktop, etc.)
  • Production-Ready CLI - Multi-file processing, parallel execution, directory watching, and rich terminal output

Supported Formats

Input: PDF, DOCX, PPTX, HTML, MHTML, EML, MBOX, MSG, PST/OST, EPUB, FB2, CHM, XLSX, ODT/ODP/ODS, CSV/TSV, Jupyter Notebooks, OpenAPI/Swagger, LaTeX, RST, AsciiDoc, Org-Mode, MediaWiki, Textile, ZIP archives, and 200+ source code languages

Output: Markdown (GFM, CommonMark), DOCX, PPTX, PDF, HTML, RTF, ODT/ODP, RST, AsciiDoc, Org-Mode, MediaWiki, Textile, LaTeX, CSV, Jupyter Notebooks, and custom Jinja2 templates

Key Features

  • Advanced PDF Parsing - Table detection, multi-column layout analysis, header/footer removal, OCR support
  • Transform System - 10 built-in transforms (remove-images, heading-offset, link-rewriter, etc.)
  • Security-First - SSRF protection, ZIP bomb detection, path traversal prevention, HTML sanitization
  • Plugin System - Extensible via Python entry points for custom formats and transforms
  • Smart Dependencies - Core is dependency-free; install only what you need
  • Powerful CLI - Convert documents from the terminal, for example: all2md paper.md --out paper.docx

Installation

pip install all2md # Core
pip install "all2md[pdf]" # With PDF support
pip install "all2md[docx,pdf,ppt]" # Include the formats you need!
pip install "all2md[all]" # Everything

Links