File MCP

A Model Context Protocol (MCP) server for document conversion using Pandoc. This server provides tools to create and convert files between various document formats through the FastMCP framework.

Features

Create files from text content: Convert markdown or HTML content to various document formats
Convert existing files: Transform documents between different formats
Multiple format support: Handle 10+ document formats including PDF, DOCX, HTML, Markdown, LaTeX, and more
Advanced Pandoc features: Support for reference documents, custom filters, and defaults files
Intelligent path resolution: Automatic filter path resolution from multiple locations

Supported Formats

Text: .txt, .md (Markdown)
Web: .html, .htm
Office: .docx, .doc, .odt
PDF: .pdf (requires TeX Live with xelatex)
Publishing: .epub, .latex, .tex, .rst
Notebooks: .ipynb (Jupyter Notebook)

Installation

This project uses uv for dependency management.

Prerequisites

Python 3.13 or higher
Pandoc must be installed and available in your PATH
For PDF generation: TeX Live or similar LaTeX distribution

Setup

# Clone the repository
git clone <repository-url>
cd file-mcp

# Install dependencies with uv
uv sync

# Or manually install dependencies
uv add fastmcp pypandoc-binary pyyaml

Usage

Running the Server

# Using uv
uv run server.py

# Or with Python directly
python server.py

Available Tools

1. `create_file`

Create a new file from text content (markdown or HTML).

Parameters:

content (str): The text content to convert (markdown or HTML)
output_file (str): Complete path where to save the file (including extension)
input_format (str): Source format of the content (markdown or html)
reference_doc (str, optional): Path to a reference DOCX file for styling (docx output only)
filters (list[str], optional): List of Pandoc filter paths to apply
defaults_file (str, optional): Path to a Pandoc defaults YAML file

Example:

# Convert markdown content to PDF
create_file(
    content="# Hello World\n\nThis is a test document.",
    output_file="D:/documents/hello.pdf",
    input_format="markdown"
)

# Convert HTML to DOCX with custom styling
create_file(
    content="<h1>Report</h1><p>Content here</p>",
    output_file="/home/user/report.docx",
    input_format="html",
    reference_doc="/templates/corporate.docx"
)

2. `convert_file`

Convert an existing file from one format to another.

Parameters:

input_file (str): Complete path to the input file to convert (including extension)
output_file (str): Complete path where to save the converted file (including extension)
reference_doc (str, optional): Path to a reference DOCX file for styling (docx output only)
filters (list[str], optional): List of Pandoc filter paths to apply
defaults_file (str, optional): Path to a Pandoc defaults YAML file

Example:

# Convert DOCX to PDF
convert_file(
    input_file="D:/documents/report.docx",
    input_format="docx",
    output_file="D:/documents/report.pdf"
)

# Convert Markdown to HTML with custom filters
convert_file(
    input_file="/home/user/notes.md",
    input_format="markdown",
    output_file="/home/user/notes.html",
    filters=["custom-filter.py"],
    defaults_file="/config/pandoc-defaults.yaml"
)

Advanced Features

Pandoc Filters

The server supports custom Pandoc filters for advanced document processing. Filters are searched in multiple locations:

Relative to the current working directory
Relative to the defaults file directory (if provided)
In ~/.pandoc/filters/ directory

Filters must be executable. The server will attempt to make them executable automatically if needed.

Defaults Files

You can provide a Pandoc defaults YAML file to configure conversion options. The defaults file should be a valid YAML dictionary containing Pandoc options.

Example defaults file:

variables:
  geometry: margin=1in
  fontsize: 12pt
pdf-engine: xelatex
number-sections: true

Reference Documents

For DOCX output, you can provide a reference document that defines styles, fonts, and formatting. This allows you to maintain consistent branding and styling across generated documents.

PDF Generation

PDF generation requires a LaTeX engine (xelatex is used by default). The server automatically adds:

--pdf-engine=xelatex
-V geometry:margin=1in

Ensure you have TeX Live or a similar LaTeX distribution installed.

Development

Project Structure

file-mcp/
├── server.py    # Main MCP server with conversion tools
├── pyproject.toml       # Project dependencies and metadata
├── ruff.toml           # Ruff linter configuration
└── README.md           # This file

Dependencies

fastmcp: Framework for building MCP servers
pypandoc-binary: Python wrapper for Pandoc (includes Pandoc binary)
pyyaml: YAML parser for defaults files

Development Tools

# Install development dependencies
uv add --dev ruff

# Run linter
uv run ruff check .

# Format code
uv run ruff format .

Error Handling

The server provides detailed error messages for common issues:

Missing files: Clear messages when input files or reference documents don't exist
Invalid formats: Validation of input/output format compatibility
Filter errors: Helpful messages when filters are not found or not executable
Pandoc errors: Detection and reporting of Pandoc-specific issues
Defaults file errors: Validation of YAML structure and content

Environment Variables

PANDOC_OUTPUT_DIR: Automatically set to the output file directory during conversion

License

[Add your license here]

Contributing

[Add contribution guidelines here]

Support

For issues and questions:

Check that Pandoc is installed: pandoc --version
For PDF issues, verify TeX Live installation: xelatex --version
Review Pandoc documentation: https://pandoc.org/MANUAL.html

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
.github/workflows		.github/workflows
.gitignore		.gitignore
README.md		README.md
pyproject.toml		pyproject.toml
ruff.toml		ruff.toml
server.py		server.py
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

File MCP

Features

Supported Formats

Installation

Prerequisites

Setup

Usage

Running the Server

Available Tools

1. `create_file`

2. `convert_file`

Advanced Features

Pandoc Filters

Defaults Files

Reference Documents

PDF Generation

Development

Project Structure

Dependencies

Development Tools

Error Handling

Environment Variables

License

Contributing

Support

About

Uh oh!

Releases

Packages

Languages

lequan310/file-mcp-python

Folders and files

Latest commit

History

Repository files navigation

File MCP

Features

Supported Formats

Installation

Prerequisites

Setup

Usage

Running the Server

Available Tools

1. create_file

2. convert_file

Advanced Features

Pandoc Filters

Defaults Files

Reference Documents

PDF Generation

Development

Project Structure

Dependencies

Development Tools

Error Handling

Environment Variables

License

Contributing

Support

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

1. `create_file`

2. `convert_file`

Packages