hearsay v0.1.0
crawl4ai for video & audio. One command turns any YouTube video, podcast episode, or local recording into clean, timestamped, chunked, LLM-ready markdown.
uv tool install hearsay # or: pipx install hearsay
hearsay "https://www.youtube.com/watch?v=VIDEO_ID"

Highlights:
- Captions-first (fast, no download) with automatic local Whisper fallback when a video has no captions; `--transcribe` to force it, `--no-vad` for music.
- Readable paragraph grouping with real timestamps; chapters → sections.
- Podcasts & YouTube playlists in batch; `--json` sidecar with a stable schema for RAG.
- MCP server (`hearsay mcp`) so AI agents can ingest media themselves.
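
For scripting, the flags listed above can be assembled into a command line from Python. This is a minimal sketch, not part of hearsay itself: `build_hearsay_command` is a hypothetical helper, and it assumes `--transcribe`, `--no-vad`, and `--json` are plain on/off switches that can be combined and placed after the source argument.

```python
import shlex


def build_hearsay_command(source, *, transcribe=False, no_vad=False, json_sidecar=False):
    """Build an argv list for the hearsay CLI (hypothetical helper).

    Assumption: each flag is a boolean switch that takes no value.
    """
    argv = ["hearsay", source]
    if transcribe:
        argv.append("--transcribe")  # force local Whisper transcription
    if no_vad:
        argv.append("--no-vad")      # suggested in the notes for music
    if json_sidecar:
        argv.append("--json")        # also write the JSON sidecar
    return argv


if __name__ == "__main__":
    cmd = build_hearsay_command(
        "https://www.youtube.com/watch?v=VIDEO_ID",
        transcribe=True,
        json_sidecar=True,
    )
    # Print a shell-quoted form for inspection; nothing is executed here.
    print(shlex.join(cmd))
```

To run the command, the list can be passed to `subprocess.run(cmd, check=True)`; keeping it as a list avoids shell-quoting problems with URLs.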
Requires Python 3.11+ and ffmpeg. MIT licensed.
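
Both requirements can be checked before installing. The sketch below is illustrative only: `missing_prerequisites` is a hypothetical helper, and it assumes that having an `ffmpeg` executable on `PATH` is what the tool needs.

```python
import shutil
import sys


def missing_prerequisites(version_info=None, which=shutil.which):
    """Return a list of human-readable problems; empty means all good.

    The parameters exist so the check can be exercised with fake values.
    """
    problems = []
    # Compare only (major, minor) against the stated 3.11 minimum.
    version = tuple(version_info or sys.version_info)[:2]
    if version < (3, 11):
        problems.append(f"Python 3.11+ required, found {version[0]}.{version[1]}")
    # shutil.which returns None when the executable is not on PATH.
    if which("ffmpeg") is None:
        problems.append("ffmpeg not found on PATH")
    return problems


if __name__ == "__main__":
    for problem in missing_prerequisites():
        print(problem)
```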