Clip Learner

Clip Learner is a SvelteKit app for learning English from real video clips. The current version is built around YouTube clips and transcript-driven study: you paste a clip URL, the app downloads the audio, transcribes it with Whisper, breaks the transcript into study chunks, asks an LLM to explain jokes/slang/cultural references, and lets you save vocabulary to a notebook.

The product started from learning English through British comedy, especially Taskmaster, but the codebase is already aimed at a more general "learn from clips" workflow.

Current Features

Paste a YouTube URL and create a study item
Download YouTube audio with yt-dlp
Transcribe audio with a Whisper API or local whisper CLI
Parse SRT output into transcript segments
Analyze transcript lines for humor, slang, idioms, and cultural references
Show scene breakdowns and vocabulary suggestions
Hover or select words in the transcript to get instant definitions
Save vocabulary into a notebook
Download clips for offline playback
Generate a simple quiz from annotations and saved vocabulary

Stack

SvelteKit + Svelte 5
TypeScript
SQLite via Node's built-in node:sqlite
OpenAI-compatible chat API client for transcript analysis and word explanation
yt-dlp for YouTube metadata, audio download, and offline video download
ffmpeg / ffprobe for audio conversion and duration detection

Project Structure

src/routes/
  +page.svelte                 Home page and clip list
  episode/[id]/+page.svelte    Study view
  episode/[id]/quiz/+page.svelte
  notebook/+page.svelte        Saved vocabulary
  api/process/+server.ts       Clip ingest + transcription/analysis
  api/explain/+server.ts       Segment explanation + word lookup
  api/download/+server.ts      Offline video download
  api/notebook/+server.ts      Vocabulary notebook API

src/lib/server/
  db.ts                        SQLite schema
  whisper.ts                   Audio download + Whisper transcription
  ytdlp.ts                     YouTube metadata helpers
  subtitles.ts                 SRT parsing and segment merging
  analysis.ts                  Transcript processing pipeline
  claude.ts                    LLM prompts and response parsing

Requirements

Node.js 22.5+ (required for node:sqlite)
npm
yt-dlp available on PATH
ffmpeg and ffprobe available on PATH
Whisper transcription, either:
- WHISPER_API_KEY configured for an OpenAI-compatible Whisper endpoint, or
- local whisper CLI available on PATH
LLM analysis, either configured per user in Settings or via .env fallback

Example .env:

# LLM analysis fallback
ANTHROPIC_API_KEY=your-api-key
ANTHROPIC_BASE_URL=

# Whisper transcription fallback
WHISPER_API_KEY=
WHISPER_BASE_URL=https://api.openai.com/v1
WHISPER_MODEL=whisper-1
WHISPER_LOCAL_MODEL=tiny.en

Local Development

Install dependencies:

npm install

Start the app:

npm run dev

Quality checks:

npm run check
npm run build

Data Storage

SQLite database: data/app.db
Downloaded media: media/<episode_id>/

The database schema is created automatically on startup.

Current Limitations

The app is tightly coupled to YouTube URLs today
Transcript quality depends on Whisper and source audio quality
Vocabulary suggestions from analysis are shown to the user, but users save notebook words manually
The app now uses local user accounts and per-user settings, but there is no third-party auth or email recovery flow yet
The UI and naming still mix "Taskmaster" history with the broader "Clip Learner" direction

Product Direction

This repo is already close to becoming a reusable language-learning platform for:

comedy clips
interviews
talk shows
anime or drama clips with subtitles
English news/articles with word lookup, phrase detection, and vocabulary saving
any video workflow where the learner wants instant explanation without leaving the player

Name		Name	Last commit message	Last commit date
Latest commit History 118 Commits
.claude		.claude
.vscode		.vscode
scripts		scripts
src		src
static		static
.env.example		.env.example
.gitignore		.gitignore
.npmrc		.npmrc
AGENTS.md		AGENTS.md
CLAUDE.md		CLAUDE.md
COOKIE-SYNC-SETUP.md		COOKIE-SYNC-SETUP.md
DEPLOY.md		DEPLOY.md
README.md		README.md
ecosystem.config.cjs		ecosystem.config.cjs
package-lock.json		package-lock.json
package.json		package.json
svelte.config.js		svelte.config.js
sync-cookies.sh		sync-cookies.sh
tsconfig.json		tsconfig.json
vite.config.ts		vite.config.ts

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Clip Learner

Current Features

Stack

Project Structure

Requirements

Local Development

Data Storage

Current Limitations

Product Direction

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Clip Learner

Current Features

Stack

Project Structure

Requirements

Local Development

Data Storage

Current Limitations

Product Direction

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages