Skip to content

MattSegal/feed-plan

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 
 
 

Repository files navigation

Content Aggregation System - Build Guide

A blueprint for building a content aggregation platform that transforms YouTube channels, podcasts, and blogs into a unified feed of AI-generated Twitter-style threads.

What This Is

This repository contains documentation and code snippets for building a system that:

  • Aggregates content from YouTube, podcasts (RSS), and blogs (Substack, WordPress, RSS)
  • Transcribes audio/video using Whisper on Modal's serverless GPU infrastructure
  • Generates personas using Claude Opus to capture each source's writing voice
  • Creates threads using Claude Sonnet to produce Twitter-style summaries
  • Displays everything in a paginated feed via a REST API

Documentation

Read these in order to understand and build the system:

Tech Stack

Layer Technology
Backend Django, Django REST Framework, Django-Q
Database PostgreSQL
Transcription Modal (serverless GPU) + OpenAI Whisper
AI Generation Anthropic Claude (Opus for personas, Sonnet for threads)
Frontend React (not covered in this guide)

Getting Started

  1. Read the Overview to understand the architecture
  2. Set up your Data Models
  3. Implement scrapers for your desired content sources (docs 02-04)
  4. Set up Transcription if handling audio/video
  5. Implement Persona and Thread generation
  6. Wire it together with Background Tasks
  7. Expose via REST API

Required API Keys

  • YOUTUBE_API_KEY — YouTube Data API v3
  • ANTHROPIC_API_KEY — Claude API for persona and thread generation
  • AWS_ACCESS_KEY_ID / AWS_SECRET_ACCESS_KEY — S3 for temporary audio storage
  • Modal account — For serverless Whisper transcription

About

A plan on how you would go about writing Feed

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published