🎯 SlideForge

🚀 Automagically transform your documents into beautiful PowerPoint presentations using AI

SlideForge is a multi-agent AI system that automatically generates professional PowerPoint presentations from various document formats (PDF, Word, TXT). It analyzes your documents, extracts key information, creates well-structured slides, and applies appropriate styling - all without manual intervention.

✨ Features

📄 Multi-format Support: Process PDF, DOCX, and TXT files
📏 Large Document Support: Efficiently handles documents of 100+ pages with intelligent chunking
🧠 AI-Powered Content Extraction: Intelligently extract and synthesize key information
📊 Smart Slide Generation: Create well-structured slides with proper hierarchy
🎨 Automatic Styling: Apply context-appropriate visual designs
🔄 Processing Pipeline: Track job status from upload to completion
🔒 User Authentication: Secure access with JWT authentication
📱 RESTful API: Clean API for integration with any client

🏗️ Architecture

SlideForge uses a modular, multi-agent architecture:

┌───────────────┐     ┌───────────────┐     ┌───────────────┐
│  Extraction   │     │  Generation   │     │ Optimization  │
│     Agent     │────►│     Agent     │────►│     Agent     │
└───────────────┘     └───────────────┘     └───────────────┘
        │                     │                     │
        └─────────────┬──────┴─────────────┬───────┘
                      ▼                    ▼
              ┌───────────────┐    ┌───────────────┐
              │   Database    │    │  File Storage │
              └───────────────┘    └───────────────┘
                      ▲
                      │
              ┌───────────────┐
              │  FastAPI      │
              │  Backend      │
              └───────────────┘
                      ▲
                      │
              ┌───────────────┐
              │    Client     │
              │  Application  │
              └───────────────┘

Three-Agent System

🔍 Extraction & Synthesis Agent
- Processes uploaded documents using PyPDF for PDF and python-docx for DOCX files
- Handles large documents (100+ pages) using intelligent chunking and strategic extraction
- Extracts text, structure, and metadata
- Analyzes content using OpenAI o3-mini and Anthropic Claude 3.7 Sonnet via LangChain
- Generates summaries, extracts keywords, and structures content
- Creates a presentation-ready data structure
📝 Slide Generation Agent
- Creates slide structure
- Organizes content hierarchically
- Generates PPTX files
- Creates appropriate sections and summaries
✨ Graphic Optimization Agent
- Analyzes content context
- Selects appropriate visual styles
- Enhances typography and layout
- Applies consistent design principles

🛠️ Tech Stack

Backend: FastAPI
Database: SQLAlchemy with SQLite/PostgreSQL
Authentication: JWT
AI/ML: LangChain with OpenAI o3-mini and Anthropic Claude 3.7 Sonnet
Document Processing: PyPDF, python-docx
Presentation Generation: python-pptx
Task Processing: Async processing
Storage: Local filesystem (expandable to S3)

📋 Prerequisites

Python 3.10+
OpenAI API key
Anthropic API key
PostgreSQL (optional, for production)

🚀 Installation

Clone the repository

git clone https://github.com/yourusername/slideforge.git
cd slideforge

Install dependencies

pip install -r requirements.txt

Set up environment variables

Create a .env file in the project root:

DEBUG=true
SECRET_KEY=your_secret_key
# LLM API Keys
OPENAI_API_KEY=your_openai_api_key
ANTHROPIC_API_KEY=your_anthropic_api_key
# Uncomment for PostgreSQL
# DATABASE_URI=postgresql://postgres:postgres@localhost/slideforge

Initialize the database and create a superuser

python setup.py

🏃‍♂️ Running the Application

Start the development server:

python run.py

The API will be available at http://localhost:8000.

API documentation is available at:

Swagger UI: http://localhost:8000/docs
ReDoc: http://localhost:8000/redoc

📘 Usage

1. Register and get a token

curl -X POST "http://localhost:8000/api/auth/register" \
     -H "Content-Type: application/json" \
     -d '{"email": "user@example.com", "password": "SecurePassword123", "full_name": "John Doe"}'

curl -X POST "http://localhost:8000/api/auth/login" \
     -H "Content-Type: application/json" \
     -d '{"username": "user@example.com", "password": "SecurePassword123"}'

2. Upload a document

curl -X POST "http://localhost:8000/api/documents" \
     -H "Authorization: Bearer YOUR_TOKEN" \
     -F "file=@path/to/your/document.pdf"

3. Create a presentation job

curl -X POST "http://localhost:8000/api/jobs" \
     -H "Authorization: Bearer YOUR_TOKEN" \
     -H "Content-Type: application/json" \
     -d '{"document_id": 1, "settings": {"style": "corporate"}}'

4. Check job status

curl -X GET "http://localhost:8000/api/jobs/1" \
     -H "Authorization: Bearer YOUR_TOKEN"

5. Download the presentation

curl -X GET "http://localhost:8000/api/presentations/1/download" \
     -H "Authorization: Bearer YOUR_TOKEN" \
     --output presentation.pptx

📊 API Endpoints

Authentication

POST /api/auth/register - Register a new user
POST /api/auth/login - Log in and get JWT token
GET /api/auth/me - Get current user info

Documents

POST /api/documents - Upload a document
GET /api/documents - List documents
GET /api/documents/{id} - Get document details
DELETE /api/documents/{id} - Delete a document

Jobs

POST /api/jobs - Create a presentation job
GET /api/jobs - List jobs
GET /api/jobs/{id} - Get job status
DELETE /api/jobs/{id} - Cancel a job

Presentations

GET /api/presentations - List presentations
GET /api/presentations/{id} - Get presentation details
GET /api/presentations/{id}/download - Download presentation
GET /api/presentations/{id}/thumbnail - Get presentation thumbnail
DELETE /api/presentations/{id} - Delete a presentation

🧠 LLM Integration

SlideForge uses state-of-the-art LLMs from OpenAI and Anthropic to process documents:

Text Summarization: Uses Anthropic Claude 3.7 Sonnet with step-by-step thinking for comprehensive document summarization. The system prompt instructs Claude to think through its reasoning process in detail before providing a summary.
Keyword Extraction: Uses OpenAI o3-mini for efficient and accurate keyword identification, balancing quality and cost-effectiveness.
Content Structuring: Uses Anthropic Claude 3.7 Sonnet with thinking to analyze document structure and organize content into a coherent presentation format, with clear sections and priority points.

The LLM integration is managed through LangChain, providing:

Structured output parsing with Pydantic models
Context management for accurate processing
Model fallbacks for reliability
Special system prompts that enhance Claude's reasoning capabilities

📄 Large Document Processing

SlideForge implements intelligent strategies to handle large documents:

PDF Processing: For large PDFs (30+ pages), the system extracts the table of contents, introduction, conclusion, and strategically distributed content samples to create a comprehensive representation of the document.
DOCX Processing: For large Word documents (500+ paragraphs), the system analyzes the document structure, extracts headings, and samples content from key sections to maintain context while keeping processing manageable.
TXT Processing: For large text files (1MB+), the system extracts the beginning, end, and strategically distributed chunks from throughout the file.

This approach enables the system to:

Process arbitrarily large documents without running into token limits
Capture the most important information from each document
Maintain context and coherence despite not processing every word
Optimize LLM usage by focusing on the most relevant content

🔭 Future Development

🖥️ Web-based user interface
📱 Mobile app integration
🧩 Custom template system
🔗 Integration with cloud storage services
📊 More chart and diagram types
🧠 Enhanced AI content extraction
🔄 Real-time collaboration
🌐 Multi-language support

🤝 Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.

🙏 Acknowledgements

LangChain for AI orchestration
FastAPI for the web framework
python-pptx for presentation generation
SQLAlchemy for database ORM
OpenAI and Anthropic for LLM APIs

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
memory-bank		memory-bank
migrations		migrations
slideforge		slideforge
.clinerules-architect		.clinerules-architect
.clinerules-ask		.clinerules-ask
.clinerules-code		.clinerules-code
.clinerules-debug		.clinerules-debug
.gitignore		.gitignore
README.md		README.md
alembic.ini		alembic.ini
projectBrief.md		projectBrief.md
requirements.txt		requirements.txt
run.py		run.py
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🎯 SlideForge

✨ Features

🏗️ Architecture

Three-Agent System

🛠️ Tech Stack

📋 Prerequisites

🚀 Installation

🏃‍♂️ Running the Application

📘 Usage

1. Register and get a token

2. Upload a document

3. Create a presentation job

4. Check job status

5. Download the presentation

📊 API Endpoints

Authentication

Documents

Jobs

Presentations

🧠 LLM Integration

📄 Large Document Processing

🔭 Future Development

🤝 Contributing

📄 License

🙏 Acknowledgements

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

🎯 SlideForge

✨ Features

🏗️ Architecture

Three-Agent System

🛠️ Tech Stack

📋 Prerequisites

🚀 Installation

🏃‍♂️ Running the Application

📘 Usage

1. Register and get a token

2. Upload a document

3. Create a presentation job

4. Check job status

5. Download the presentation

📊 API Endpoints

Authentication

Documents

Jobs

Presentations

🧠 LLM Integration

📄 Large Document Processing

🔭 Future Development

🤝 Contributing

📄 License

🙏 Acknowledgements

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages