Description
When passing Atlassian Confluence pages or exported Confluence content to MarkItDown, there is currently no support for converting them to Markdown. Confluence uses its own storage format (XHTML-based) with custom macros, which standard HTML converters don't handle well.
Proposal
Add native support for converting Confluence documents to Markdown, leveraging the existing open-source tool confluence-markdown-exporter as a reference or integration point.
The confluence-markdown-exporter already supports:
- Converting Confluence pages to Markdown format using the Atlassian API
- Handling various Confluence elements: headings, paragraphs, lists, tables, code blocks, tasks, and alerts
- Retaining formatting such as bold, italic, and underline
- Converting Confluence macros to equivalent Markdown syntax where possible
- Handling images and attachments by linking them appropriately in the Markdown output
- Supporting extended Markdown features like front matter and Mermaid diagrams
- Supporting Confluence add-ons such as draw.io, PlantUML, and Markdown Extensions
Use Case
Many teams store documentation, design specs, and knowledge bases in Confluence. Being able to convert these documents to clean Markdown would enable:
- Migrating documentation from Confluence to Markdown-based platforms (e.g., Obsidian, Azure DevOps Wikis, GitHub)
- Feeding Confluence content into LLM pipelines that expect Markdown input
- Archiving Confluence pages in a portable, version-control-friendly format
Suggested Approach
- Integrate or reference the confluence-markdown-exporter (pip install confluence-markdown-exporter) as a converter backend
- Support both Confluence Cloud and Confluence Server formats
- Handle Confluence-specific macros (info panels, code blocks, status labels, etc.) gracefully
Environment
- markitdown version: latest (pip install)
- Python 3.13
- macOS
Description
When passing Atlassian Confluence pages or exported Confluence content to MarkItDown, there is currently no support for converting them to Markdown. Confluence uses its own storage format (XHTML-based) with custom macros, which standard HTML converters don't handle well.
Proposal
Add native support for converting Confluence documents to Markdown, leveraging the existing open-source tool confluence-markdown-exporter as a reference or integration point.
The
confluence-markdown-exporteralready supports:Use Case
Many teams store documentation, design specs, and knowledge bases in Confluence. Being able to convert these documents to clean Markdown would enable:
Suggested Approach
Environment