This project is a FastAPI-based application for processing YouTube video transcripts, generating stories from those transcripts, and managing content through various APIs. The system integrates with YouTube for video data, OpenAI for text generation, and Firebase for data storage.
- YouTube video transcript extraction and processing
- Automatic content categorization using AI
- Story generation from video transcripts
- Semantic search capabilities
- Content management with categories
- Multiple API integrations (YouTube, OpenAI, Firebase)
- Python 3.8+
- Firebase account and project
- OpenAI API access
- YouTube API access
First, clone the repository and create a virtual environment:
git clone <repository-url>
cd blackprince001-scripter-tool-system
pip install uv
uv venv
source .venv/bin/activate  # On Windows use: .venv\Scripts\activate
uv pip install -r requirements.txt
Create a `.env` file in the root directory with the following variables:
FIREBASE_CONFIG_FILE=path/to/your/firebase-config.json
OPENAI_API_KEY=your_openai_api_key
YOUTUBE_API_KEY=your_youtube_api_key
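These variables can be validated at startup so a misconfigured deployment fails fast. A minimal sketch using only the standard library (the variable names above are the only assumptions; the project's actual settings loading may differ):

```python
import os


def load_settings() -> dict:
    """Read the required configuration from environment variables.

    Raises RuntimeError listing every missing variable, so a
    misconfigured deployment fails at startup rather than mid-request.
    """
    required = ["FIREBASE_CONFIG_FILE", "OPENAI_API_KEY", "YOUTUBE_API_KEY"]
    missing = [name for name in required if not os.environ.get(name)]
    if missing:
        raise RuntimeError(f"Missing environment variables: {', '.join(missing)}")
    return {name: os.environ[name] for name in required}
```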
- Go to the Firebase Console
- Create a new project
- Navigate to Project Settings > Service Accounts
- Generate a new private key
- Save the JSON file as `firebase-config.json` in your project
- Update the `FIREBASE_CONFIG_FILE` path in your `.env`
- Visit OpenAI's platform
- Create an account or sign in
- Navigate to API keys
- Generate a new API key
- Add it to your `.env` file as `OPENAI_API_KEY`
- Go to the Google Cloud Console
- Create a new project or select an existing one
- Enable the YouTube Data API v3
- Create credentials (API key)
- Add it to your `.env` file as `YOUTUBE_API_KEY`
Start the FastAPI server:
uvicorn main:app --reload --host 0.0.0.0 --port 8000
The API will be available at http://localhost:8000
POST /transcripts/process
Process a YouTube video and extract its transcript.
Parameters:
- `url`: YouTube video URL
- `category`: (optional) Category for the transcript
- `auto_categorize`: (optional) Enable AI category generation
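A call to this endpoint can be sketched with the standard library. The field names mirror the parameters above; the exact request body accepted by the server is an assumption:

```python
import json
import urllib.request
from typing import Optional

BASE_URL = "http://localhost:8000"  # local dev server from the Usage section


def build_process_request(url: str,
                          category: Optional[str] = None,
                          auto_categorize: bool = False) -> urllib.request.Request:
    """Build the POST request for /transcripts/process."""
    payload = {"url": url, "auto_categorize": auto_categorize}
    if category is not None:
        payload["category"] = category
    return urllib.request.Request(
        f"{BASE_URL}/transcripts/process",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


# To send it (requires the server to be running):
# with urllib.request.urlopen(build_process_request("https://youtu.be/VIDEO_ID")) as resp:
#     print(json.loads(resp.read()))
```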
POST /generate/story
Generate stories from categorized transcripts.
Parameters:
- `category_weights`: List of categories and their weights
- `variations_count`: Number of story variations to generate
- `style`: Story style (casual/professional/creative)
- `length`: Desired story length
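A request body for this endpoint, mirroring the parameters above, might look like the following. The inner shape of each weight entry and the `length` values are assumptions, not documented behavior:

```python
import json

# Hypothetical request body for POST /generate/story.
story_request = {
    "category_weights": [
        {"category": "technology", "weight": 0.7},
        {"category": "science", "weight": 0.3},
    ],
    "variations_count": 3,
    "style": "casual",   # one of: casual / professional / creative
    "length": "medium",
}

print(json.dumps(story_request, indent=2))
```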
GET /categories/
Retrieve all available categories.
blackprince001-scripter-tool-system/
├── app/
│ ├── core/ # Core functionality and API clients
│ ├── models/ # Data models
│ ├── router/ # API routes
│ ├── schemas/ # Pydantic schemas
│ └── utils/ # Utility functions
├── tests/ # Test files
└── main.py # Application entry point
The application includes custom error handling for various scenarios:
- Invalid YouTube URLs
- Missing transcripts
- API failures
- Database errors
All errors return structured responses with:
- `error_code`: Specific error identifier
- `message`: Human-readable error message
- `details`: Additional error information
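A payload with these three fields could be built like this. The helper and the example `error_code` value are illustrative sketches, not the project's actual code:

```python
from typing import Any, Dict, Optional


def error_response(error_code: str, message: str,
                   details: Optional[Dict[str, Any]] = None) -> Dict[str, Any]:
    """Build the structured error body described above."""
    return {
        "error_code": error_code,
        "message": message,
        "details": details or {},
    }


# Example shape for an invalid YouTube URL (error code is hypothetical):
body = error_response(
    "INVALID_YOUTUBE_URL",
    "The provided URL is not a valid YouTube video link.",
    {"url": "https://example.com/not-youtube"},
)
```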
Run the test suite using pytest:
pytest
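A new test might look like the following. `extract_video_id` is a hypothetical helper used only to illustrate the test shape; it is not a function from this codebase:

```python
from urllib.parse import urlparse, parse_qs


def extract_video_id(url: str) -> str:
    """Pull the video ID from common YouTube URL shapes (illustrative)."""
    parsed = urlparse(url)
    if parsed.hostname in ("www.youtube.com", "youtube.com"):
        ids = parse_qs(parsed.query).get("v", [])
        if ids:
            return ids[0]
    if parsed.hostname == "youtu.be" and len(parsed.path) > 1:
        return parsed.path.lstrip("/")
    raise ValueError(f"Not a recognizable YouTube URL: {url}")


def test_extract_video_id():
    assert extract_video_id("https://www.youtube.com/watch?v=dQw4w9WgXcQ") == "dQw4w9WgXcQ"
    assert extract_video_id("https://youtu.be/dQw4w9WgXcQ") == "dQw4w9WgXcQ"
```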
The application uses the following Firestore collections:
- `transcripts`: Main collection for all transcripts
- `transcripts_{category}`: Category-specific transcript collections
- `categories`: Available content categories
- `stories`: Generated stories
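The per-category collections follow a simple naming scheme; a client might resolve the right collection name like this. The sanitization step is an assumption, since the README does not specify how category names are normalized:

```python
import re


def collection_for(category: str = "") -> str:
    """Map a category to its Firestore collection name.

    No category -> the main ``transcripts`` collection; otherwise
    ``transcripts_{category}``, lowercased with runs of
    non-alphanumerics collapsed to underscores (an assumption).
    """
    if not category:
        return "transcripts"
    safe = re.sub(r"[^a-z0-9]+", "_", category.lower()).strip("_")
    return f"transcripts_{safe}"
```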
- Use the provided models and schemas for data validation
- Follow the existing error handling patterns
- Add tests for new functionality
- Update documentation for API changes
- API keys are managed through environment variables
- CORS is configured for specific origins
- Firebase security rules should be properly configured
- Rate limiting should be implemented for production use
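For the rate-limiting point above, a minimal in-memory token bucket is one option. This is a sketch only; it resets on restart and is per-process, so production deployments typically back this with a shared store such as Redis:

```python
import time
from collections import defaultdict


class TokenBucket:
    """Per-client token bucket: refills `rate` tokens/second up to `capacity`."""

    def __init__(self, rate: float, capacity: int):
        self.rate = rate
        self.capacity = capacity
        self._tokens = defaultdict(lambda: float(capacity))
        self._last = defaultdict(time.monotonic)

    def allow(self, client_id: str) -> bool:
        """Return True and consume a token if the client is under its limit."""
        now = time.monotonic()
        elapsed = now - self._last[client_id]
        self._last[client_id] = now
        self._tokens[client_id] = min(
            float(self.capacity),
            self._tokens[client_id] + elapsed * self.rate,
        )
        if self._tokens[client_id] >= 1.0:
            self._tokens[client_id] -= 1.0
            return True
        return False
```

In a FastAPI app this could be called from a middleware or dependency, keyed on the client's IP or API key.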
The application implements several optimization techniques:
- LRU caching for frequently accessed data
- Efficient transcript processing
- Optimized database queries
- Semantic search capabilities
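The LRU-caching point can be illustrated with `functools.lru_cache`; the cached function below is hypothetical, standing in for a Firestore read:

```python
from functools import lru_cache


@lru_cache(maxsize=256)
def get_category_names(collection: str) -> tuple:
    """Hypothetical category lookup, cached per collection name.

    Arguments must be hashable for lru_cache; a tuple is returned so
    callers cannot mutate the cached value. In the real app this would
    wrap a database query.
    """
    # Placeholder for an expensive database read:
    return ("technology", "science", "lifestyle")


get_category_names("categories")  # miss: executes the body
get_category_names("categories")  # hit: served from the cache
info = get_category_names.cache_info()  # hits=1, misses=1
```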
Common issues and solutions:
- **Firebase Connection Issues**
  - Verify the Firebase configuration file path
  - Check Firebase project permissions
  - Ensure proper network connectivity
- **YouTube API Errors**
  - Verify API key validity
  - Check daily quota limits
  - Ensure video URLs are properly formatted
- **OpenAI API Issues**
  - Verify the API key
  - Check rate limits
  - Monitor token usage
- Fork the repository
- Create a feature branch
- Commit changes
- Push to the branch
- Create a pull request
[Add License Information]