Supported Formats

Hyacinthe edited this page Jun 10, 2026 · 1 revision

Format selection – document, image, audio, video tabs


📄 Documents

| Input | Output | Engine chain |
|-------|--------|--------------|
| TXT | PDF | ReportLab → pypandoc (if Pandoc installed) |
| TXT | DOCX | Word COM → pypandoc → python-docx native |
| RTF | PDF | ReportLab native (full fidelity, no Office needed) |
| RTF | DOCX | Word COM → pypandoc → striprtf + python-docx |
| CSV | JSON | Native |
| JSON | CSV | Native |
| XLSX | PDF | Microsoft Excel COM → LibreOffice CLI → ReportLab (auto portrait/landscape, multi-sheet) |
| XLSX | JSON | openpyxl native |
| XLSX | CSV | openpyxl native |
| PPTX | PDF | Microsoft PowerPoint COM → LibreOffice CLI → python-pptx + ReportLab + Matplotlib |
| HTML | PDF | pdfkit (wkhtmltopdf) → WeasyPrint → ReportLab |
| PDF | HTML | PyMuPDF dict-mode (flow layout, base64-embedded images) |
| EPUB | PDF | pypandoc → spine-order native (images embedded) |
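
The CSV ↔ JSON rows are marked "Native", which suggests they need no external engine. A minimal sketch of what such a conversion could look like with only the Python standard library follows; the function names `csv_to_json` and `json_to_csv` are hypothetical and are not taken from the tool's source.

```python
import csv
import io
import json


def csv_to_json(csv_text: str) -> str:
    """Turn CSV text (first row = header) into a JSON array of objects."""
    rows = list(csv.DictReader(io.StringIO(csv_text)))
    return json.dumps(rows, indent=2, ensure_ascii=False)


def json_to_csv(json_text: str) -> str:
    """Turn a JSON array of flat objects into CSV text."""
    records = json.loads(json_text)
    # Collect column names in first-seen order so rows with missing keys still fit.
    fieldnames = list(dict.fromkeys(key for rec in records for key in rec))
    out = io.StringIO()
    writer = csv.DictWriter(out, fieldnames=fieldnames, lineterminator="\n")
    writer.writeheader()
    writer.writerows(records)
    return out.getvalue()
```

Note that CSV carries no type information, so every value comes back as a string after a round trip; nested JSON objects would need flattening first, which this sketch does not attempt.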

PDF → DOCX has its own dedicated pipeline: pdf2docx (layout-preserving) → text-only fallback → image+text hybrid. Quality depends on the source PDF: scanned PDFs without an embedded text layer are only partially recoverable.
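
The engine chains in the table and the PDF → DOCX pipeline all read as "try the first engine, fall back to the next on failure". The sketch below illustrates that pattern under the assumption that each engine is a callable taking a source and destination path; `convert_with_fallback` and the three stand-in engines are hypothetical, not the tool's actual code.

```python
from typing import Callable, List, Sequence, Tuple

Engine = Tuple[str, Callable[[str, str], None]]


def convert_with_fallback(engines: Sequence[Engine], src: str, dst: str) -> str:
    """Try each engine in order; return the name of the first one that succeeds."""
    errors: List[str] = []
    for name, run in engines:
        try:
            run(src, dst)
            return name
        except Exception as exc:  # a real tool would likely catch narrower errors
            errors.append(f"{name}: {exc}")
    raise RuntimeError("all engines failed: " + "; ".join(errors))


# Hypothetical stand-ins for the TXT -> DOCX chain (Word COM, pypandoc, python-docx).
def word_com(src: str, dst: str) -> None:
    raise OSError("Microsoft Word is not installed")


def pandoc(src: str, dst: str) -> None:
    raise OSError("pandoc not found")


def python_docx_native(src: str, dst: str) -> None:
    pass  # pretend the conversion worked


chain = [("Word COM", word_com), ("pypandoc", pandoc), ("python-docx", python_docx_native)]
```

Calling `convert_with_fallback(chain, "notes.txt", "notes.docx")` with these stand-ins returns `"python-docx"`, because the first two engines raise. Keeping the collected errors makes it possible to report why every engine failed when none succeeds.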

.ppt (legacy binary format): requires Microsoft Office or LibreOffice, because python-pptx reads only .pptx and cannot open old .ppt files natively.


🖼️ Images

| Input | Output | Notes |
|-------|--------|-------|
| Any image | PNG | EXIF preserved |
| Any image | JPEG | Max quality, EXIF preserved |
| Any image | JPG | |
| Any image | BMP | |
| Any image | WEBP | |
| Any image | HEIC | via pillow-heif |
| Any image | TIFF | |
| Any image | PSD | via psd-tools or ImageMagick |
| Any image | SVG | |
| Any image | AVIF | |
| Any image | J2K | |
| Any image | DNG | via rawpy and ImageMagick |
| Any image | ICO | Multi-resolution icon generation |

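All image conversions use Pillow at maximum quality and preserve EXIF metadata, with the helper libraries noted in the table (pillow-heif, psd-tools, rawpy, ImageMagick) handling HEIC, PSD, and DNG.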
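
As an illustration of "maximum quality with EXIF preserved", here is a minimal sketch of how a Pillow-based conversion might carry EXIF bytes across and request several icon sizes for ICO. The helper names, the `ICON_SIZES` list, and the quality value are assumptions for this example; the tool's real settings are not documented on this page.

```python
from typing import Optional

ICON_SIZES = [(16, 16), (32, 32), (48, 48), (256, 256)]  # assumed size set


def save_options(fmt: str, exif: Optional[bytes]) -> dict:
    """Build keyword arguments for Pillow's Image.save() for a target format."""
    fmt = fmt.upper()
    opts: dict = {}
    if fmt == "JPEG":
        opts["quality"] = 95  # Pillow's docs advise against values above 95
    elif fmt == "ICO":
        opts["sizes"] = ICON_SIZES  # one .ico file holding several resolutions
    if exif and fmt in {"JPEG", "PNG", "WEBP"}:
        opts["exif"] = exif  # pass the source EXIF block through unchanged
    return opts


def convert_image(src: str, dst: str, fmt: str) -> None:
    """Convert src to dst in the given format (requires Pillow)."""
    from PIL import Image  # imported lazily so save_options works without Pillow

    with Image.open(src) as img:
        exif = img.info.get("exif")
        if fmt.upper() == "JPEG" and img.mode not in ("RGB", "L", "CMYK"):
            img = img.convert("RGB")  # JPEG cannot store alpha or palette modes
        img.save(dst, format=fmt.upper(), **save_options(fmt, exif))
```

For example, `convert_image("photo.png", "photo.jpg", "JPEG")` would write a JPEG that reuses whatever EXIF block the PNG carried, if any.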


🔊 Audio

| Input | Output |
|-------|--------|
| Audio | MP3 |
| Audio | WAV |
| Audio | AAC |
| Audio | OGG |
| Audio | FLAC |
| Audio | M4A |
| Video | MP3 |
| Video | WAV |
| Video | AAC |
| Video | FLAC |

Audio input covers: .mp3, .wav, .aac, .ogg, .flac, .m4a


🎬 Video

| Input | Output |
|-------|--------|
| Video | MP4 |
| Video | WEBM |
| Video | MOV |
| Video | MKV |
| Video | AVI |

Video input covers: .mp4, .webm, .mov, .mkv, .avi
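
The extension lists and the audio and video tables together determine which outputs are valid for a given input. A small sketch of that lookup follows; the extension sets come straight from this page, while the function names `media_kind` and `allowed_outputs` are hypothetical.

```python
from pathlib import Path
from typing import Optional, Set

AUDIO_EXTS = {".mp3", ".wav", ".aac", ".ogg", ".flac", ".m4a"}
VIDEO_EXTS = {".mp4", ".webm", ".mov", ".mkv", ".avi"}

AUDIO_OUTPUTS = {"mp3", "wav", "aac", "ogg", "flac", "m4a"}
VIDEO_OUTPUTS = {"mp4", "webm", "mov", "mkv", "avi"}
AUDIO_FROM_VIDEO = {"mp3", "wav", "aac", "flac"}  # per the audio table


def media_kind(path: str) -> Optional[str]:
    """Classify a file as 'audio' or 'video' by its extension (case-insensitive)."""
    ext = Path(path).suffix.lower()
    if ext in AUDIO_EXTS:
        return "audio"
    if ext in VIDEO_EXTS:
        return "video"
    return None


def allowed_outputs(path: str) -> Set[str]:
    """Return the output formats the tables list for this kind of input."""
    kind = media_kind(path)
    if kind == "audio":
        return set(AUDIO_OUTPUTS)
    if kind == "video":
        return VIDEO_OUTPUTS | AUDIO_FROM_VIDEO
    return set()
```

With these tables, a video input can be re-encoded to any listed video container or have its audio track extracted to MP3, WAV, AAC, or FLAC, but not to OGG or M4A.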

All audio and video conversions use a bundled ffmpeg binary, which is located automatically at runtime, and apply a quality preset for each format.
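
To make the ffmpeg step concrete, here is a rough sketch of locating a binary and building a command that extracts audio from a video. The encoder arguments in `AUDIO_ARGS` are assumed values chosen for illustration, not the tool's actual presets, and falling back to `shutil.which` is only one plausible way to locate the binary.

```python
import shutil
from typing import List, Optional

# Assumed per-format encoder settings; the real presets are not documented here.
AUDIO_ARGS = {
    "mp3": ["-c:a", "libmp3lame", "-q:a", "2"],
    "wav": ["-c:a", "pcm_s16le"],
    "aac": ["-c:a", "aac", "-b:a", "192k"],
    "flac": ["-c:a", "flac"],
}


def find_ffmpeg(bundled: Optional[str] = None) -> Optional[str]:
    """Prefer an explicitly supplied bundled path, else search PATH."""
    return bundled or shutil.which("ffmpeg")


def extract_audio_cmd(ffmpeg: str, src: str, dst: str) -> List[str]:
    """Build an ffmpeg command that drops the video stream and encodes the audio."""
    fmt = dst.rsplit(".", 1)[-1].lower()
    # -y overwrites the output, -vn disables video, the rest selects the encoder.
    return [ffmpeg, "-y", "-i", src, "-vn", *AUDIO_ARGS[fmt], dst]
```

The resulting list can be passed to `subprocess.run(cmd, check=True)`. Building the command as a list rather than a shell string avoids quoting problems with file names that contain spaces.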
