Paper2Skill

An intelligent Agent Skill Builder that transforms academic papers and technical documents into actionable Skill.md files designed for AI systems.

Overview

Paper2Skill uses a multi-agent system built on LangChain/LangGraph to analyze documents and extract what is useful - the core theory, algorithm, model, or idea that can be built or implemented. The output is a self-contained Skill.md file that provides step-by-step implementation guidance, not just a paper review.

Key Philosophy: The goal is to generate actionable skills. For example, from "Attention is All You Need," the system extracts how to build a Transformer module, not just summarize the paper.

Features

✅ Actionable Output: Generates implementation guides with steps, tools, and validation criteria
✅ Value Extraction: Identifies the core useful thing (algorithm, model, architecture) from papers
✅ Multi-Format Support: Process PDF, Word (.docx), PowerPoint (.pptx), and Markdown documents
✅ LangChain/LangGraph Ecosystem: Built on industry-standard AI frameworks
✅ Multi-Agent Analysis: 5 specialized agents for understanding, extraction, and implementation guidance
✅ Tool Recognition: Identifies required tools and libraries
✅ Self-Contained Output: Generates comprehensive Skill.md files ready for AI consumption
✅ Fallback Mode: Works without LLM for basic processing

Installation

Using pip

pip install -e .

Using requirements.txt

pip install -r requirements.txt

Dependencies

Python >= 3.9
LangChain >= 0.1.0
LangGraph >= 0.0.20
pypdf >= 3.17.0 (for PDF support)
python-docx >= 1.0.0 (for Word support)
python-pptx >= 0.6.21 (for PowerPoint support)

Quick Start

Basic Usage

# Process a document (with LLM)
paper2skill examples/sample_paper.md

# Process without LLM (fallback mode)
paper2skill document.pdf --no-llm

# Specify custom output path
paper2skill paper.docx -o custom_output.md

Using as a Library

from paper2skill.loaders import MultiFormatLoader
from paper2skill.agents import SkillBuilderWorkflow
from paper2skill.generators import SkillMarkdownGenerator
from paper2skill.utils import get_llm, setup_environment

# Setup
setup_environment()

# Load document
document_text = MultiFormatLoader.load("paper.pdf")

# Run multi-agent workflow
llm = get_llm()  # Optional: None for fallback mode
workflow = SkillBuilderWorkflow(llm=llm)
state = workflow.run(document_text, "paper.pdf")

# Generate Skill.md
markdown = SkillMarkdownGenerator.generate(state, "output.skill.md")

Architecture

Multi-Agent System

The system uses a LangGraph-based workflow with 5 specialized agents:

Document Understanding Agent: Analyzes overall document structure and content
Concept Extraction Agent: Identifies main concepts, theorems, and results
Tool Identification Agent: Recognizes tools, methods, and techniques
Value Extraction Agent: Identifies what is USEFUL from the paper (core algorithm, model, architecture)
Implementation Guide Agent: Generates actionable steps to build/implement the useful thing

Workflow

Document Input → Document Loader → Multi-Agent Analysis → Value Extraction → Implementation Guide → Skill.md Generation

Configuration

Configuration File

Create a config.yaml file to configure multiple AI models. Copy config.example.yaml to get started:

cp config.example.yaml config.yaml

Example configuration:

# Select which model to use by default
default_model: openai

# Model configurations
models:
  openai:
    provider: openai
    model_name: gpt-3.5-turbo
    temperature: 0
    # api_key: your-api-key-here  # Or use environment variable

  anthropic:
    provider: anthropic
    model_name: claude-3-sonnet-20240229
    temperature: 0

  azure:
    provider: azure
    deployment_name: your-deployment-name
    azure_endpoint: https://your-resource.openai.azure.com/
    api_version: "2024-02-15-preview"

  ollama:
    provider: ollama
    model_name: llama2
    base_url: http://localhost:11434

Switching Models

Use the --model flag to switch between configured models:

# Use the default model (from config.yaml)
paper2skill document.pdf

# Use a specific model
paper2skill document.pdf --model anthropic
paper2skill document.pdf --model ollama

# Use a custom config file
paper2skill document.pdf --config /path/to/config.yaml

Environment Variables

Create a .env file for API keys:

# API Keys (environment variables take precedence)
OPENAI_API_KEY=your_openai_api_key_here
ANTHROPIC_API_KEY=your_anthropic_api_key_here
AZURE_OPENAI_API_KEY=your_azure_api_key_here

# Override default model selection
PAPER2SKILL_MODEL=anthropic

Without an API key, the system runs in fallback mode with rule-based extraction.

Output Format

The generated Skill.md is action-oriented and includes:

What You Will Build: Description of the core useful thing
Why This Is Useful: Value proposition and use cases
Prerequisites: Required knowledge and background
Core Principles: Key concepts that make it work
Implementation Guide: Step-by-step instructions to build it
Required Tools: Libraries, frameworks, and dependencies
External Resources: References and additional materials
Validation Criteria: How to verify the implementation works
Theoretical Foundation: Supporting theorems and propositions
Expected Results: Documented outcomes and benchmarks

Examples

See the examples/ directory for sample documents:

sample_paper.md: Example research paper on distributed algorithms

Running Examples

# Process the sample paper
paper2skill examples/sample_paper.md

# Output will be saved as sample_paper.skill.md

Development

Running Tests

# Install development dependencies
pip install -e ".[dev]"

# Run tests
pytest tests/

# Run specific test file
pytest tests/test_loaders.py

Project Structure

paper2skill/
├── agents/          # Multi-agent system
│   ├── nodes.py     # Agent implementations
│   ├── state.py     # Shared state definitions
│   └── workflow.py  # LangGraph workflow
├── loaders/         # Document loaders
│   └── document_loader.py
├── generators/      # Skill.md generation
│   └── skill_generator.py
├── utils/           # Utilities
│   └── llm.py
└── main.py          # CLI entry point

Requirements Met

This implementation fulfills all specified requirements:

✅ LangChain/LangGraph Ecosystem: Uses LangGraph for multi-agent orchestration
✅ Multiple Input Types: Supports PDF, Word, Markdown, PowerPoint
✅ Value Extraction: Identifies what is useful from papers (core algorithm, model, or idea)
✅ Implementation Guidance: Generates actionable steps to build the useful thing
✅ Tool Handling: Recognizes required tools and libraries
✅ Self-Contained Skill.md: Generates comprehensive, actionable skill documents

Contributing

Contributions are welcome! Please feel free to submit issues or pull requests.

License

MIT License - See LICENSE file for details

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
examples		examples
paper2skill		paper2skill
tests		tests
.env.example		.env.example
.gitignore		.gitignore
ARCHITECTURE.md		ARCHITECTURE.md
IMPLEMENTATION_SUMMARY.md		IMPLEMENTATION_SUMMARY.md
README.md		README.md
USAGE.md		USAGE.md
config.example.yaml		config.example.yaml
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Paper2Skill

Overview

Features

Installation

Using pip

Using requirements.txt

Dependencies

Quick Start

Basic Usage

Using as a Library

Architecture

Multi-Agent System

Workflow

Configuration

Configuration File

Switching Models

Environment Variables

Output Format

Examples

Running Examples

Development

Running Tests

Project Structure

Requirements Met

Contributing

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Paper2Skill

Overview

Features

Installation

Using pip

Using requirements.txt

Dependencies

Quick Start

Basic Usage

Using as a Library

Architecture

Multi-Agent System

Workflow

Configuration

Configuration File

Switching Models

Environment Variables

Output Format

Examples

Running Examples

Development

Running Tests

Project Structure

Requirements Met

Contributing

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages