MDFlux

Turn any document into clean, AI-ready Markdown. Local-first · reads scanned PDFs · up to 6× fewer tokens than vision models

Works with PDF · DOCX · PPTX · XLSX · EPUB · HTML · CSV · JSON · XML · images · audio

⬇️ Download for Windows · Website · The proof · How it compares · Report a bug · ❤️ Sponsor

Drop a document, get clean Markdown in seconds

Why MDFlux?

Getting a document into a shape an LLM can use is more annoying than it should be. You either dump the raw text and lose every heading, table, and list, or you ship your pages to a cloud vision model as images, which means your documents leave your machine and you pay by the page to read your own files. And scanned PDFs? Plain text extractors just hand you back nothing. The text is right there, and the tool acts like the page is blank.

MDFlux is what I wanted instead. Drop in a file or a folder and get back clean, structured Markdown, with OCR for those "blank" scanned pages, batch processing for a whole directory, and an optional cleanup pass to tidy up messy extraction. It runs entirely on your machine. It's built on Microsoft's MarkItDown, with everything around it that makes the engine actually usable day to day.

The proof: fewer tokens, lower cost

Every time a document gets read by an LLM, you pay for it in tokens. The common way to feed a document to a model is to send its pages as images to a vision model, and images are an expensive way to spend tokens. MDFlux hands the model clean Markdown instead, which is far cheaper to read and reuse.

Why clean Markdown costs fewer tokens than feeding documents to AI:

You pay for pixels, not words. A page sent to a vision model as an image costs a fixed chunk of tokens (often well over a thousand) no matter how little text is on it. The same page as Markdown is usually a few hundred.
Plain text tokenizes efficiently. Markdown is just text, with no image data, no markup bloat, no base64 blobs.
Clean beats raw. MDFlux strips the broken layout, repeated whitespace, and junk characters that pad out messy extractions, so you don't spend tokens on noise.
Structure stays compact. Headings, tables, and lists carry the document's meaning in very few tokens, so the model understands more for less.
It compounds. The saving lands on every single call that reads the document, so it multiplies across a pipeline or a batch instead of happening once.

About 2 to 6 times fewer tokens than vision. For ordinary documents, MDFlux Markdown runs around 4 times lighter than sending the pages as images, and up to 5.7 times lighter on scanned pages. That saving lands on every single call that reads the document, so it compounds quickly across a pipeline or a large batch.

Two to six times fewer tokens than a vision model

And it reads what other tools return empty. Point a plain extractor at a scanned, image-only PDF and you get zero usable text. MDFlux's OCR recovers it, and even then it stays lighter than the vision route:

Scanned, image-only PDF	Usable tokens of text
Plain text extractor	0
Vision model (page as an image)	10,731
MDFlux (OCR to Markdown)	1,893

That's the full text recovered in about 5.7 times fewer tokens than the vision model, which still has to OCR the image on its end anyway.

⬇️ Download for Windows · Free & MIT-licensed · No account, no cloud

Key features


💸 Fewer tokens, lower cost	Clean Markdown costs about 2 to 6 times fewer tokens than sending pages to a vision model, so every LLM call that reads the document is cheaper.
🔒 Local and private	Your documents never leave your machine. No cloud, no API key, no account.
🔍 Reads scanned PDFs	Built-in OCR recovers text that plain extractors return as zero characters.
🧱 Real structure	Proper Markdown with headings, tables, and lists intact. Readable, greppable, diff-able.
🖥️ No terminal needed	Portable app. Unzip, run, click through a one-time setup. Done.
📦 Many formats	PDF, DOCX, PPTX, XLSX, EPUB, HTML, CSV, JSON, XML, images, and audio.
🔁 Batch a whole folder	Convert everything at once, with progress, cancellation, and per-file diagnostics.
🧹 Optional cleanup	Off, rule-based, or an AI pass (local or API) to tidy up messy extractions.

Who it's for

🤖 AI and RAG builders: feed clean, structured source documents to any model instead of raw text or pricey vision tokens

🔬 Researchers: batch-convert papers, reports, and scanned archives into searchable Markdown

🧑‍💻 Developers: get diff-able, version-controllable text out of binary document formats

📝 Writers and analysts: pull clean copy out of PDFs and Office files without the formatting mess

🔒 Privacy-conscious users: convert sensitive contracts, records, and decks with nothing ever uploaded

How it works

1. Drop a file or folder   →   PDF, Office, EPUB, scans, audio, and more
2. Pick a cleanup mode      →   Off, rule-based, or AI (local or API)
3. Get clean Markdown       →   Preview, copy, or save as .md. 100% offline.

The first launch sets up a private, self-contained Python environment (one time, needs internet). Every conversion after that runs fully offline.

MDFlux cleanup mode selector: Off, Rule-based, Local AI, and API

See it in action

01 · Select. Grab a folder of mixed files: Word, PowerPoint, Excel, PDFs, scans, web pages, anything. No conversion settings to pick.

02 · Convert. One drop turns the whole batch into AI-ready Markdown, locally. OCR recovers text from scans that other tools read as empty.

03 · Stays healthy. Dependency health and diagnostics run on-device, so you can see exactly what's installed and working.

How it compares to Microsoft MarkItDown

MDFlux is built on Microsoft's MarkItDown, which is a genuinely great conversion library. What MDFlux adds is everything around it: the OCR for scans, the desktop app, the batching, the reliability, and the privacy-by-default packaging that lets anyone run it against a folder of files without touching a command line.

	Microsoft MarkItDown	MDFlux
Core conversion engine	yes	yes (uses MarkItDown)
Scanned / image-only PDFs	returns roughly 0 characters	built-in OCR recovers the text
Install and run	`pip install` plus a terminal	portable app, no terminal
Dependency setup	manual (pip, ffmpeg, OCR extras)	sets itself up on first launch
Batch a whole folder	write your own script	built in, runs concurrently with progress
Timeouts and cancel	can hang with no feedback	every job streams progress and can be cancelled
Cleanup modes	raw output	Off, rule-based, or AI cleanup (local or API)
Preview and diagnostics	none	rendered preview plus a health panel
Audio transcription	plugin or Azure	local, built in
Privacy	local if you wire it up	local by default

On already-clean files the output is close to identical, because under the hood it is MarkItDown. The point isn't to beat the engine. It's to make that engine genuinely usable.

_{If MDFlux sounds useful, consider ⭐ starring the repo. It helps others find it.}

Getting started

Requirements: Windows 10 or 11 (x64) with the WebView2 runtime, which ships with current Windows. The first launch needs internet to set up a local Python environment; everything after runs offline. macOS is on the roadmap.

1. Download and run. Get the portable zip from Releases, extract it anywhere, and double-click MDFlux.exe. No installer, no admin rights.

SmartScreen warning? The build is open source and unsigned. Click "More info" then "Run anyway". You'll need the WebView2 runtime, already on current Windows 10/11.

2. First launch (one-time, internet required). MDFlux sets up a private, self-contained Python environment. This happens once. After that, it runs fully offline.

3. Convert. Drop a document onto the window, pick a cleanup mode, and click "Convert to AI-Ready Markdown". Preview it, then copy straight from the preview or save as .md. For a whole folder, you pick an output folder up front and every converted file lands there.

To verify your download, check the SHA-256 posted on the release page.

Supported formats

Documents	Office	Web and data	Other
PDF (including scanned, via OCR)	DOCX	HTML	Audio to transcript (MP3, WAV, M4A, OGG, FLAC, AAC)
EPUB	PPTX	CSV, JSON, XML	Images via OCR (PNG, JPG, GIF, WEBP, TIFF, BMP)
TXT, Markdown	XLSX, XLS		OCR on images embedded in documents

Troubleshooting

"Windows protected your PC": That's SmartScreen reacting to an unsigned build. Click "More info" then "Run anyway". The build is open source; code signing is on the roadmap.

The first launch is downloading for a while: That's the one-time setup of the local Python environment. It only happens once, and every launch after is instant and offline.

A conversion finished with a warning or looks empty: Open the diagnostics panel. It tells you what's installed and healthy and what went wrong, so you get a clear next step instead of a silent empty file.

MDFlux diagnostics panel: dependency health and per-file status

Where are my converted files? In the output folder shown in the app. For batch jobs you pick the folder up front.

Anything else: Open an issue. Bug reports genuinely help.

Roadmap

MCP server, so Claude Code and other agents can convert documents through MDFlux directly
CLI for scripted, headless conversion in pipelines and CI
macOS build (arm64 and Intel)
Code signing, to remove the SmartScreen warning
More OCR languages and tuning presets

The full list lives in ROADMAP.md. Open an issue if you want to shape it.

For developers

MDFlux is a Tauri 2 (Rust) shell around a Python sidecar (MarkItDown + OCR + audio), with a Svelte 5 front end. You'll need Node, Rust (stable), and the Tauri prerequisites for your OS; then clone it, run npm install, and npm run tauri dev. See CONTRIBUTING.md for the full build, run, and test steps, the project layout, and the tech stack.

Contributing

Contributions are genuinely welcome. Honestly, it's the main reason I'm open-sourcing this. Bug reports, ideas, code, and testing on different hardware all help.

Start with a good first issue, and see CONTRIBUTING.md to get a dev build running. Commits use a DCO sign-off (git commit -s). Be kind; we follow a Code of Conduct.

Support the project

MDFlux is free and MIT-licensed. If it saves you time, supporting it goes straight into the roadmap above: the macOS build, code signing, and the MCP server and CLI.

And starring the repo is free, which helps more than you'd think.

License

Built on open-source foundations: MarkItDown · Tauri · RapidOCR · pypdfium2

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
.github		.github
app		app
docs/media		docs/media
scripts		scripts
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
NOTICE		NOTICE
README.md		README.md
ROADMAP.md		ROADMAP.md
SECURITY.md		SECURITY.md
THIRD-PARTY-LICENSES.md		THIRD-PARTY-LICENSES.md
index.html		index.html

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

MDFlux

Why MDFlux?

The proof: fewer tokens, lower cost

Key features

Who it's for

How it works

See it in action

How it compares to Microsoft MarkItDown

Getting started

Supported formats

Troubleshooting

Roadmap

For developers

Contributing

Support the project

License

About

Uh oh!

Releases 1

Sponsor this project

Uh oh!

Contributors

Uh oh!

Languages

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

MDFlux

Why MDFlux?

The proof: fewer tokens, lower cost

Key features

Who it's for

How it works

See it in action

How it compares to Microsoft MarkItDown

Getting started

Supported formats

Troubleshooting

Roadmap

For developers

Contributing

Support the project

License

About

Topics

Resources

License

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases 1

Sponsor this project

Uh oh!

Contributors

Uh oh!

Languages