RAG system for research papers. Feed it PDFs, ask it questions, get answers with source references.
pnpm install
docker compose up -d # starts QdrantYou'll need a .env file:
REPLICATE_API_TOKEN=your_token
# Extract markdown from PDF
pnpm --filter @paper/main dev pdf paper.pdf
# Ingest into vector store
pnpm --filter @paper/main dev ingest paper.md
# Search chunks
pnpm --filter @paper/main dev search "attention mechanism"
# Ask questions
pnpm --filter @paper/main dev ask "how does the transformer handle long sequences?"- PDF → Markdown via Marker on Replicate
- Chunk with LangChain's MarkdownTextSplitter
- Embed with Qwen3 (1024 dims) on Replicate
- Store in Qdrant
- Generate answers with Claude
- TypeScript
- pnpm workspaces
- Qdrant (vector store)
- Replicate (embeddings, PDF extraction)
- Biome (lint/format)
- Zod (validation)