Local PDF Reader MCP

A Model Context Protocol (MCP) server that enables Claude to read and analyze local PDF files. Supports text extraction, figure extraction, and intelligent navigation for large documents.

Features

Full Text Extraction - Extract all text content from PDFs
Figure Extraction - Extract images and vector graphics with captions
Smart Navigation - Get PDF structure/TOC and read specific sections
Large PDF Support - Auto-chunking for sections over 10k words

Installation

Prerequisites

pip install pymupdf pypdf mcp

Install MCP

From local file:

claude mcp add /path/to/local-pdf-reader.mcpb

From GitHub release:

claude mcp add https://github.com/YOUR_USERNAME/local-pdf-reader-mcp/releases/download/v1.0/local-pdf-reader.mcpb

Tools

`read_pdf_text`

Extract all text from a PDF file.

read_pdf_text(file_path="/path/to/document.pdf")

`read_pdf_figures`

Extract figures and images with captions.

read_pdf_figures(file_path="/path/to/document.pdf")

`get_pdf_structure`

Get the table of contents with page ranges and word counts. Recommended for large PDFs.

get_pdf_structure(file_path="/path/to/document.pdf")

Returns:

Total pages and word count
Section hierarchy with IDs, titles, page ranges
Chunking info for large sections (>10k words)
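
The README does not show how this structure is computed, so the following is only a minimal sketch of one way to derive page ranges, word counts, and the chunking flag from a flat table of contents. It assumes TOC entries shaped like those returned by PyMuPDF's `Document.get_toc()` (level, title, 1-based start page) and a per-page word count list. The function name `build_structure` and every field name except `needs_chunking` are hypothetical and may not match the server's actual output.

```python
from typing import Dict, List, Tuple

CHUNK_THRESHOLD = 10_000  # words; mirrors the ">10k words" figure in this README


def build_structure(
    toc: List[Tuple[int, str, int]],
    page_word_counts: List[int],
    threshold: int = CHUNK_THRESHOLD,
) -> Dict:
    """Turn a flat TOC into section records with page ranges and word counts.

    toc entries are (level, title, start_page) with 1-based pages, in document order.
    page_word_counts[i] is the number of words on page i + 1.
    """
    total_pages = len(page_word_counts)
    sections = []
    for i, (level, title, start) in enumerate(toc):
        # A section ends just before the next entry at the same or a shallower level.
        end = total_pages
        for next_level, _, next_start in toc[i + 1:]:
            if next_level <= level:
                end = max(start, next_start - 1)
                break
        words = sum(page_word_counts[start - 1:end])
        sections.append({
            "id": i,  # hypothetical: IDs assigned in document order
            "level": level,
            "title": title,
            "start_page": start,
            "end_page": end,
            "word_count": words,
            "needs_chunking": words > threshold,
        })
    return {
        "total_pages": total_pages,
        "total_words": sum(page_word_counts),
        "sections": sections,
    }
```

In this sketch a parent section's range includes its subsections, so a long chapter can be flagged for chunking even when each of its subsections is short.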

`read_pdf_section`

Read a specific section by ID or title.

# By section ID
read_pdf_section(file_path="/path/to/doc.pdf", section_id=0)

# By title (fuzzy match)
read_pdf_section(file_path="/path/to/doc.pdf", section_title="Introduction")

# Read chunk 2 of a large section
read_pdf_section(file_path="/path/to/doc.pdf", section_id=5, chunk=2)
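
The calls above rely on two behaviors this README names but does not specify: fuzzy title matching and splitting a large section into chunks. As a rough illustration only, the sketch below uses the standard library's `difflib` for the matching and a plain word-count split for the chunks. The helper names `match_title` and `split_into_chunks`, the 0.6 similarity cutoff, and the substring-first rule are assumptions, not the server's implementation. Whether the server numbers chunks from 0 or 1 is also not stated here.

```python
import difflib
from typing import List, Optional


def match_title(query: str, titles: List[str], cutoff: float = 0.6) -> Optional[int]:
    """Return the index of the title closest to query, or None if nothing is close."""
    lowered = [t.lower() for t in titles]
    q = query.lower().strip()
    # Prefer a case-insensitive substring hit, then fall back to similarity scoring.
    for i, title in enumerate(lowered):
        if q and q in title:
            return i
    close = difflib.get_close_matches(q, lowered, n=1, cutoff=cutoff)
    return lowered.index(close[0]) if close else None


def split_into_chunks(text: str, max_words: int = 10_000) -> List[str]:
    """Split text into consecutive chunks of at most max_words words."""
    words = text.split()
    return [
        " ".join(words[i:i + max_words])
        for i in range(0, len(words), max_words)
    ]
```

Returning `None` for a poor match lets the caller report an unknown section instead of silently reading the wrong one.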

Usage Guide

Small PDFs (20 pages or fewer): Use read_pdf_text directly.

Large PDFs (> 20 pages):

1. Call get_pdf_structure to get the TOC.
2. Use read_pdf_section to read the relevant sections.
3. For sections marked needs_chunking: true, pass the chunk parameter to read_pdf_section and read the section one chunk at a time.

Extract figures: Use read_pdf_figures to get all images and charts.
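
The workflow in this guide can be sketched as a small driver function. This is an illustration under stated assumptions rather than a client shipped with the server: `call_tool` stands in for whatever mechanism invokes an MCP tool by name, and the response fields `total_pages`, `sections`, `id`, `title`, and `num_chunks` are hypothetical. Only `needs_chunking` and the tool and parameter names come from this README. The sketch also assumes chunks are numbered from 1, which the README does not confirm.

```python
from typing import Any, Callable, Dict, List

ToolCaller = Callable[..., Any]  # hypothetical: call_tool(tool_name, **arguments)


def read_pdf(call_tool: ToolCaller, file_path: str, wanted_titles: List[str],
             small_pdf_pages: int = 20) -> Dict[str, str]:
    """Follow the usage guide: whole-text read for small PDFs, section reads otherwise."""
    structure = call_tool("get_pdf_structure", file_path=file_path)
    if structure["total_pages"] <= small_pdf_pages:
        return {"full_text": call_tool("read_pdf_text", file_path=file_path)}

    wanted = {t.lower() for t in wanted_titles}
    result: Dict[str, str] = {}
    for section in structure["sections"]:
        if section["title"].lower() not in wanted:
            continue
        if section.get("needs_chunking"):
            # Assumed: chunks are numbered from 1 and the count is in "num_chunks".
            parts = [
                call_tool("read_pdf_section", file_path=file_path,
                          section_id=section["id"], chunk=n)
                for n in range(1, section["num_chunks"] + 1)
            ]
            result[section["title"]] = "\n".join(parts)
        else:
            result[section["title"]] = call_tool(
                "read_pdf_section", file_path=file_path, section_id=section["id"])
    return result
```

Calling get_pdf_structure first costs one extra request for small files, but it lets the same function decide between the two paths without knowing the page count in advance.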

License

MIT
