Skip to content

arlse/Markalldown

Repository files navigation

Markalldown

Convert documents (PDF, DOCX, Excel, HTML) to Markdown. Available as a web app and MCP server.

MIT License npm version

English | 中文

Features

  • PDF - Extract text with heading detection, table recognition
  • Word (.docx, .doc) - Preserve formatting, lists, tables
  • Excel (.xlsx, .xls) - Convert sheets to Markdown tables
  • HTML - Clean conversion with Turndown

Online Service

Try it online: markalldown.com

Self-Hosting

Web App

# Install dependencies
npm install

# Development
npm run dev

# Build
npm run build

# Start
npm start

Open http://localhost:3000

MCP Server

For AI agents (Claude Code, etc.), install the MCP server:

npm install -g markalldown-mcp

See mcp-server/README.md for configuration details.

Project Structure

markalldown/
├── app/                  # Next.js web app
├── components/           # React components
├── lib/                  # Browser-side parsers
├── mcp-server/           # MCP server (Node.js)
│   ├── src/parsers/      # Node.js parsers
│   └── README.md
└── public/               # Static assets

License

MIT License - see LICENSE for details.

Note: The markalldown logo and brand assets are not covered by the MIT License.

Contributing

Contributions are welcome! Please open an issue or submit a pull request.

About

This is a tool for converting PDF files into Markdown format.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors