
Ariadne Core — Personal Edition

Ariadne Core works with any agentic system — Claude Code, Open Brain, OpenClaw, Cursor, Gemini, OpenAI Desktop, and more — to dramatically reduce document ingestion tokens. Our tool directly addresses the token cost problem Nate is talking about in this video (article version), and more. The document ingestion deep-dive starts a bit after 4:20. We'll let him make the case — we'll just explain how we address the problem and how to set it up for you and your agents to use.

Your Claude Sessions Cost 10x What They Should

Install the plugin

Ariadne Core is distributed as a plugin — a packaged bundle of skills, metadata, and (optionally) an MCP server. A plugin is just an advanced form of a skill: where a standalone skill is a single SKILL.md that teaches an agent how to do one thing, a plugin bundles multiple skills together with a manifest so they can be discovered, installed, and updated as a unit.

This plugin includes six skills — a router, onboarding, install, deploy, build, and document intelligence.

Claude Code (recommended)

Two commands — add the marketplace, then install the plugin:

# Add the marketplace (one-time)
/plugin marketplace add denson/ariadne-core

# Install the plugin
/plugin install ariadne-core@ariadne-core

Choose a scope when prompted:

  • User scope — available in every project (recommended for personal use)
  • Project scope — available to all collaborators on the current repo
  • Local scope — only for you, only in the current repo

After installing, reload plugins to activate:

/reload-plugins

Note: Slash command availability varies between Claude Code (terminal) and Claude Desktop. If /reload-plugins isn't recognized in your environment, start a new session instead — that always works. The plugin system is evolving rapidly; if any command here doesn't match your experience, ask your AI assistant what's changed.

Then say "run the install skill" to deploy and connect, or "run the onboarding skill" for a visual walkthrough.

You can also install from the CLI outside a session:

claude plugin marketplace add denson/ariadne-core
claude plugin install ariadne-core@ariadne-core --scope user

Cowork (Claude Desktop)

Cowork discovers plugins installed via Claude Code — install the plugin in Claude Code first (see above), and the skills will be available in both environments. Plugins installed at user scope are shared across Claude Code and Cowork.

You can also browse and manage plugins directly in Cowork through Customize > Plugins in the sidebar.

Private repos only: If the plugin repo is private, Cowork needs the Claude GitHub App installed on the repo to access it. Install the app, authorize it for the repo, and Cowork will be able to sync the plugin. You can also install the app from Claude Code with /install-github-app. Public repos don't need this.

Note: The Claude Desktop UI and plugin system are evolving rapidly. If the steps above don't match what you see, ask your AI assistant how plugin management currently works — it will know the latest flow.

Updating the plugin

When a new version is released, update from either environment:

Claude Code:

/plugin marketplace update ariadne-core
/plugin install ariadne-core@ariadne-core

Then run /reload-plugins or start a new session to pick up the changes.

Cowork (Claude Desktop):

Go to Customize > Plugins, find Ariadne Core, and use the update option. Start a new conversation or use /reload-plugins if available.

Note: Plugin updates are keyed by version number. If the version hasn't been bumped, the cache won't refresh. If you suspect stale skills, uninstall and reinstall. The plugin system and available slash commands are evolving — terminal and desktop environments may support different commands at any given time.

Uninstalling

/plugin uninstall ariadne-core

Run skills directly from the repo

If you'd rather skip the plugin system, you can run skills directly from a cloned copy of this repo:

Want to understand what this is? Open this repo in Claude Desktop (setup guide) and say "run the onboarding skill." It walks you through everything visually — one concept at a time, with illustrations — and adapts to whether you're a developer, an agent builder, or evaluating the tool.

Want to get it running? Open this repo in Claude Code and say "run the install skill." It handles deployment and connection step by step. Or point your agentic system (OpenClaw, Open Brain, or any agent with terminal access) at the install skill — the AI path is designed for autonomous execution without human intervention.

What it does

It takes documents in (PDF, DOCX, PPTX, XLSX, HTML, CSV, and 20+ other formats), extracts them into clean Markdown, chunks and embeds them for search, and stores everything in Postgres with pgvector. When your agent needs information, it searches and gets back 500 tokens of exactly what's relevant — not 100,000 tokens of raw PDF. That's 100-200x more efficient.

  1. Extracts documents to clean Markdown using MarkItDown (Microsoft, MIT license, free)
  2. Deduplicates by fingerprinting content before any expensive processing (free)
  3. Enriches images by sending them to a vision API for descriptions (optional — skip if your docs don't have images worth describing)
  4. Chunks intelligently based on document type (headings for reports, pages for slide decks, fixed windows for logs) (free)
  5. Embeds chunks via any OpenAI-compatible embedding API (required for search)
  6. Stores everything in Postgres + pgvector for semantic search with SQL filtering
  7. Tracks provenance on every interaction: which agent, which model, when, what action

Basic extraction is free via MarkItDown, but the full pipeline needs API keys. The only costs are cheap embedding and vision API calls (pennies per document). The expensive model never touches raw documents.
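As a rough illustration of the heading-based strategy in step 4, here's a minimal sketch. It is not the real chunker (the actual by_title strategy also handles token budgets, overlap, and nesting); it just shows the core idea of splitting at headings:

```shell
# Minimal sketch of heading-based chunking: start a new chunk file
# at every level-1 or level-2 Markdown heading. Hypothetical helper,
# not part of the Ariadne Core codebase.
split_by_heading() {
  awk '
    /^##? /  { n++ }                          # new chunk at each # or ## heading
    { print > sprintf("chunk_%03d.md", n) }   # append line to the current chunk
  ' "$1"
}
```

Content before the first heading lands in chunk_000.md; each heading starts a new file.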

This is the Personal Edition — open source, you deploy and manage it yourself. Managed editions (Managed, Team, Enterprise) are coming for those who'd rather not deal with infrastructure and security at scale, or who need this organization-wide.

Only need extraction? You don't need Ariadne Core for that — MarkItDown is open source and handles extraction on its own. Claude Code or any coding agent can install it for you. Ariadne Core adds dedup, chunking, embedding, search, and provenance on top.

Supported formats

Over 20 formats, including PDF, DOCX, PPTX, XLSX, CSV, HTML, XML, JSON, TXT, Markdown, RTF, EPUB, EML, MSG, ZIP (recursive extraction), Jupyter notebooks, WAV, MP3, and more. MarkItDown handles all conversion. Extraction quality improves with API keys for vision-dependent content.

Images (JPG, PNG, GIF) are supported but require a vision API key (VISION_API_KEY) for content extraction. Without it, image files are accepted but produce empty output.

Architecture

Ariadne Core runs as a hosted service. One deployment serves all clients over HTTPS. No local installation required for end users.

Railway / Fly.io / VPS
┌──────────────────────────┐
│  ariadne-core            │
│  ├── MCP Server          │
│  ├── REST API            │
│  ├── Postgres + pgvec    │
│  ├── MarkItDown          │
│  └── Chunking/Embed      │
└──────────────────────────┘
  MCP Server
     ▲  ▲  ▲  ▲
     │  │  │  └── Claude Cowork (Managed edition or roll your own OAuth)
     │  │  └───── OpenClaw
     │  └──────── Open Brain
     └─────────── Claude Code

Authentication is by API key for the Personal edition and OAuth for Managed and higher editions. You can also set up your own OAuth for the Personal edition.

All endpoints require API key authentication via X-API-Key header (except /api/health).

Manual setup (if you prefer doing it yourself)

Prerequisites

  • A Railway account — railway.com. Free tier works for personal use.
  • An API key from any OpenAI-compatible provider — for embeddings and image descriptions. OpenAI, Google Gemini, Groq, DeepSeek, Together AI, Mistral, or any local model server (Ollama, LM Studio) all work. See Compatible providers below.

Step 1: Deploy to Railway

railway login
cd ariadne-core
railway init
railway add --plugin postgresql
railway up

Step 2: Set environment variables

In the Railway dashboard or via CLI:

railway variables set EMBEDDING_API_KEY=your-provider-api-key
railway variables set VISION_API_KEY=your-provider-api-key
railway variables set EMBEDDING_MODEL=text-embedding-3-small
railway variables set VISION_MODEL=gpt-4o-mini
railway variables set ARIADNE_API_KEY=your-secret-api-key

EMBEDDING_API_KEY and VISION_API_KEY work with any OpenAI-compatible provider — they don't have to be OpenAI. Both can use the same key if you use the same provider for both. If you use a non-OpenAI provider, also set EMBEDDING_BASE_URL and/or VISION_BASE_URL to match (see Compatible providers).

ARIADNE_API_KEY is the key clients use to authenticate — pick any strong secret.

Railway provides DATABASE_URL automatically via the Postgres plugin.

Step 3: Get your URL

railway domain

This gives you a public HTTPS URL like https://ariadne-core-production.up.railway.app.

Step 4: Verify

curl https://your-url.up.railway.app/api/health

You should see {"status": "healthy"}.

Step 5: Connect Claude Code

claude mcp add ariadne-core https://your-url.up.railway.app/mcp \
  --transport http --scope user \
  --header "X-API-Key:your-api-key"

Restart Claude Code. The six Ariadne Core tools should appear. Verify with claude mcp list.

Step 6: Connect other clients

All clients (Open Brain, OpenClaw, Cursor, etc.) connect via MCP the same way as Claude Code. The REST API is also available for scripts and automation:

curl -X POST https://your-url.up.railway.app/api/search \
  -H "Content-Type: application/json" \
  -H "X-API-Key: your-api-key" \
  -d '{"query": "quarterly revenue trends", "top_k": 5}'

Hosting options

| Option | Cost | Notes |
|---|---|---|
| Railway | Free tier / ~$5/mo | Simplest deployment |
| Fly.io | Free tier / ~$5/mo | Scale to zero when idle |
| Hetzner VPS | ~$4.50/mo | Best value for always-on |
| Any Docker host | Varies | `docker compose up -d` with Dockerfile |

All options use the same Dockerfile and environment variables. See deploy skill for platform-specific instructions.
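For the generic Docker host option, a sketch along these lines should work — assuming the repo's Dockerfile serves the REST API on port 8000 and reads the same environment variables as the Railway deployment (all values below are placeholders; adjust to your image and database):

```shell
# Build and run on any Docker host. Hypothetical values; env var
# names follow the Railway setup described above.
docker build -t ariadne-core .
docker run -d --name ariadne-core -p 8000:8000 \
  -e DATABASE_URL="postgres://user:pass@db-host:5432/ariadne" \
  -e ARIADNE_API_KEY="your-secret-api-key" \
  -e EMBEDDING_API_KEY="your-provider-api-key" \
  -e EMBEDDING_MODEL="text-embedding-3-small" \
  ariadne-core
```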

MCP tools

Six tools are available to any connected MCP client. See SPEC.md for full parameter details.

| Tool | What it does |
|---|---|
| convert_document | Convert a document to Markdown. Chunks, embeds, and stores automatically. Handles dedup. |
| search | Semantic search over your document store. Filters by collection, source file, file type, or tags. Returns ranked chunks with provenance. |
| get_document | Retrieve the full Markdown content, chunks, and interaction history for a document by ID. |
| list_documents | Browse documents by collection or file type. Returns metadata for pagination. |
| list_collections | List all collections with document counts. |
| ingest | Batch-ingest a directory of documents. Processes all supported files and returns a summary. |

All tools accept caller metadata (agent_id, agent_type, model, initiated_by, agent_notes, agent_metadata) for provenance tracking. Every call creates an interaction record, even dedup skips.
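As an illustration, a search call carrying this metadata might look like the following over REST — assuming the REST endpoints accept the same caller fields as the MCP tools (the field values here are made up):

```shell
# Hypothetical: semantic search with caller metadata for provenance.
curl -X POST https://your-url.up.railway.app/api/search \
  -H "Content-Type: application/json" \
  -H "X-API-Key: your-api-key" \
  -d '{
        "query": "quarterly revenue trends",
        "top_k": 5,
        "agent_id": "research-agent-01",
        "agent_type": "claude-code",
        "model": "claude-sonnet",
        "initiated_by": "human",
        "agent_notes": "weekly revenue digest"
      }'
```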

REST API

The REST API mirrors MCP tool functionality and adds collection management and stats. All endpoints require X-API-Key header except /api/health.

| Method | Endpoint | Description |
|---|---|---|
| POST | /api/upload | Upload a file and get a server-side path for use with convert_document |
| POST | /api/documents | Convert and store a document (same as MCP convert_document) |
| GET | /api/documents | List documents with optional collection filter, page, per_page |
| GET | /api/documents/{id} | Get full document by ID (same as MCP get_document) |
| POST | /api/ingest | Batch directory ingestion (same as MCP ingest) |
| POST | /api/search | Semantic search (same as MCP search) |
| GET | /api/collections | List all collections with document counts |
| POST | /api/collections | Create a new collection |
| GET | /api/stats | System statistics (document count, chunk count, collections) |
| GET | /api/health | Health check (no auth required) |

# Upload a file
curl -X POST https://your-url/api/upload \
  -H "X-API-Key: your-api-key" \
  -F "file=@report.pdf"

# Convert and store
curl -X POST https://your-url/api/documents \
  -H "Content-Type: application/json" \
  -H "X-API-Key: your-api-key" \
  -d '{"uri": "https://example.com/report.pdf", "collection": "research"}'

# Search
curl -X POST https://your-url/api/search \
  -H "Content-Type: application/json" \
  -H "X-API-Key: your-api-key" \
  -d '{"query": "quarterly revenue trends", "top_k": 5}'

# Health check (no auth)
curl https://your-url/api/health

How dedup works

Every document is fingerprinted (SHA-256 on normalized text) before any expensive processing. If the fingerprint already exists in the target collection:

  1. Extraction, chunking, and embedding are skipped
  2. A document_interactions row is still created (recording who asked, when, and why)
  3. The existing document is returned to the caller

This means you can point multiple agents at the same document store without wasting compute. The force flag on convert_document and ingest overrides this when you know a document has changed.
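The fingerprinting idea can be sketched in a couple of lines. This is a hypothetical normalization, not the pipeline's exact rules — the real system hashes the extracted text, and its normalization may differ:

```shell
# Sketch: normalize case and whitespace, then SHA-256 the result.
# Two files with the same words but different spacing or casing
# yield the same fingerprint, so reprocessing can be skipped.
fingerprint() {
  tr '[:upper:]' '[:lower:]' < "$1" | tr -s '[:space:]' ' ' | sha256sum | awk '{print $1}'
}
```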

How provenance works

Every agent call creates a record, even dedup skips. When you search and get a result, you also get the full history of who has touched that document:

  • processing_chain (on the document) tells you how the content was processed: which extraction tool, which enrichment steps, which embedding model, with timestamps
  • document_interactions (separate table) tells you who touched it: which agent, which model, when, what action, whether it was a dedup skip, plus agent_notes and agent_metadata

Development

For local development, run Postgres in Docker and the app on the host:

docker compose up -d          # start Postgres
pip install -e src/           # install the app
ariadne-core serve          # start MCP (:8081) + REST API (:8000)

Project structure

ariadne-core/
├── config/ariadne.yaml       # Main config
├── Dockerfile                # Production container
├── docker-compose.yml        # Postgres for local dev
├── .env.example              # Environment variables template
├── migrations/               # Postgres schema (auto-runs on first start)
├── src/pipeline/             # All source code
│   ├── __main__.py           # CLI entrypoint (serve command)
│   ├── mcp_server.py         # MCP tool definitions
│   ├── api/                  # FastAPI REST endpoints + auth
│   ├── extraction/           # MarkItDown wrapper
│   ├── enrichment/           # Vision API image descriptions
│   ├── chunking/             # by_title, by_page, fixed_size
│   ├── embedding/            # OpenAI-compatible embedding client
│   └── storage/              # Vector store (pgvector default)
├── tests/
├── benchmarks/
└── docs/

Documentation

Skills (included in the plugin)

| Skill | Runtime | What it does |
|---|---|---|
| Router | Any | Routes to the right skill and serves as onboarding entry point |
| Onboarding | Cowork | Standalone visual walkthrough with illustrations |
| Install | Claude Code / terminal agent | Deploy and connect |
| Deploy | Claude Code / terminal agent | Platform-specific deployment |
| Document Intelligence | Any | Best practices for using the tools |
| Build | Claude Code / terminal agent | Developing on the codebase |

Compatible providers

Ariadne Core works with any OpenAI-compatible API for embeddings and vision. You are not locked to OpenAI — use whichever provider fits your needs and budget.

Providers with native OpenAI compatibility

| Provider | Base URL | Good models for this task |
|---|---|---|
| OpenAI | https://api.openai.com/v1 (default) | text-embedding-3-small, gpt-4o-mini |
| Google Gemini | https://generativelanguage.googleapis.com/v1beta/openai/ | gemini-3-flash-preview, gemini-2.5-pro |
| Groq | https://api.groq.com/openai/v1 | llama-4-70b, mixtral-8x22b |
| DeepSeek | https://api.deepseek.com/v1 | deepseek-chat |
| Together AI | https://api.together.xyz/v1 | BAAI/bge-large-en-v1.5 (embeddings) |
| Mistral AI | https://api.mistral.ai/v1 | mistral-large-2512 |
| Perplexity | https://api.perplexity.ai/v1 | sonar-huge-online |

Local models (on-device)

| Runtime | Base URL |
|---|---|
| Ollama | http://localhost:11434/v1 |
| LM Studio | http://localhost:1234/v1 |
| vLLM | http://localhost:8000/v1 |
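For example, a hypothetical local-development .env pointing embeddings at Ollama — the model name is illustrative, not a shipped default; use whichever embedding model you've pulled:

```shell
# Local development: embeddings via Ollama instead of a cloud provider.
EMBEDDING_BASE_URL=http://localhost:11434/v1
EMBEDDING_MODEL=nomic-embed-text
EMBEDDING_API_KEY=ollama    # placeholder; Ollama ignores the key
```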

Anthropic Claude

Anthropic uses a proprietary API format, not OpenAI-compatible. If you want to use Claude models for vision or embeddings, route through an aggregator:

  • OpenRouter: https://openrouter.ai/api/v1 — supports anthropic/claude-4.6-opus, anthropic/claude-4.6-sonnet
  • Bifrost: https://api.bifrost.ai/v1 — enterprise gateway option

Configuration

To use a non-default provider, set the base URL and model alongside the API key:

railway variables set EMBEDDING_BASE_URL=https://api.together.xyz/v1
railway variables set EMBEDDING_MODEL=BAAI/bge-large-en-v1.5
railway variables set EMBEDDING_API_KEY=your-together-key

Google Gemini and OpenAI models work particularly well for this task. We plan to deploy custom open models in the future.

Tip: Some providers accept both https://api.example.com and https://api.example.com/v1. If you get a 404, try adding or removing the /v1 suffix.

Cost

Basic extraction is free — no API calls needed. But for full performance (image descriptions, embeddings for search), you need API keys from any OpenAI-compatible provider. No local GPU required. Costs vary by provider — here are OpenAI's rates as a baseline:

| What | Model | Cost |
|---|---|---|
| Embedding | text-embedding-3-small | ~$0.02 per million tokens |
| Image description | gpt-4o-mini | ~$0.15 per million input tokens |

Many providers (Groq, DeepSeek, Together AI) are significantly cheaper or offer free tiers. Local models (Ollama, LM Studio) cost nothing beyond hardware.

For a typical document, this works out to less than $0.01 per document. A personal document library of thousands of documents costs a few dollars to process.
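A back-of-envelope check using the OpenAI baseline rates above — the document size and per-image token count are made-up inputs, not measurements:

```shell
# Rough per-document cost at the baseline rates above:
# embeddings at $0.02/M tokens, image descriptions at $0.15/M input tokens.
doc_cost() {  # args: text tokens, image count (~1k input tokens each, assumed)
  awk -v toks="$1" -v imgs="$2" 'BEGIN {
    printf "%.6f\n", toks / 1e6 * 0.02 + imgs * 1000 / 1e6 * 0.15
  }'
}
doc_cost 20000 5    # a 20k-token document with 5 images, in dollars
```

That comes out to roughly a tenth of a cent, consistent with the "less than $0.01 per document" figure.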

Platform costs

Railway, Fly.io, and similar platforms charge for compute, storage, and data egress. For personal use this is minimal — a few dollars a month. Storage costs scale with your Postgres database size; egress costs scale with how often clients query the API.

If your other systems (OpenClaw, Open Brain, etc.) run on the same platform, egress between services is typically free or negligible — the data stays on the same network. This is the recommended setup for keeping costs low.

Limitations

Database scaling

The Personal Edition uses Postgres with pgvector for vector storage. This works well for personal document libraries — thousands of documents, hundreds of thousands of chunks. But you will eventually outgrow a single Postgres instance if your corpus grows large enough.

We're currently testing more robust vector database options for the Managed, Team, and Enterprise editions, including Pinecone, Snowflake, Qdrant, and Weaviate Cloud. We'll share our findings and recommendations for personal/professional users as we evaluate them.

Format limitations

  • Scanned PDFs with no text layer produce poor results — MarkItDown can't do OCR
  • Legacy Office formats (.doc, .ppt) are not supported
  • Complex table layouts with merged cells use heuristics, not ML layout detection
  • BMP, TIFF, HEIC images are not supported

The Managed and Team editions will fill these gaps with enhanced extraction capabilities beyond what MarkItDown provides.

Editions

This is the Personal Edition — open source, self-hosted, deploy it yourself.

| Edition | Status | What you get |
|---|---|---|
| Personal | Available now | Open source, self-hosted, full pipeline, you manage everything |
| Managed | Coming soon | Managed hosting, enhanced security, priority support |
| Team | Coming soon | Managed hosting, team features, SSO, audit logging |
| Enterprise | Coming soon | Dedicated infrastructure, custom integrations, SLA |

Higher tiers are fully managed — for those who'd rather not deal with infrastructure and security at scale, or who need this organization-wide.

Interested in Managed, Team, or Enterprise? Get in touch.

License

Apache 2.0. See LICENSE.

All dependencies are Apache 2.0 or MIT compatible.

About

Open-source document extraction and retrieval pipeline. Converts 20+ formats into clean Markdown and stores them as searchable vector embeddings via MCP.
