PodAI is a web application that gives your voice a new personality. It transforms raw audio recordings into professional podcast segments, witty monologues, or structured narratives using Google's Gemini AI.
- Smart Transcription: Converts raw audio (MP3, WAV, M4A) into text using Gemini 2.5 Flash.
- Persona Transformation: Rewrites your content into specific styles using Gemini 3 Pro:
  - The Stand-Up: Comedy host style.
  - The Analyst: Tech reviewer style.
  - The Narrator: NPR-style storytelling.
  - The Provocateur: High-energy debate.
  - The Essentialist: Minimalist productivity guru.
  - The Futurist: Sci-fi visionary.
- Interactive Chat: Chat directly with the selected persona about the transcript.
- Local History: Saves your generations locally so you can revisit them later.
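The Local History feature could be built on the browser's Web Storage API. The following is a minimal sketch, not PodAI's actual implementation: the `Generation` fields, the `podai.history` key, the 50-entry cap, and the helper names are all assumptions made for illustration.

```typescript
// Hypothetical shape of one saved generation; the field names are illustrative.
interface Generation {
  id: string;
  persona: string;
  transcript: string;
  script: string;
  createdAt: number; // Unix epoch milliseconds
}

// Minimal subset of the Web Storage API, so the sketch also runs outside a browser.
interface KeyValueStore {
  getItem(key: string): string | null;
  setItem(key: string, value: string): void;
}

const HISTORY_KEY = "podai.history"; // assumed storage key
const MAX_ENTRIES = 50; // assumed cap to keep stored data small

// Reads the saved list, falling back to an empty history on missing or bad data.
function loadHistory(store: KeyValueStore): Generation[] {
  const raw = store.getItem(HISTORY_KEY);
  if (raw === null) return [];
  try {
    const parsed: unknown = JSON.parse(raw);
    return Array.isArray(parsed) ? (parsed as Generation[]) : [];
  } catch {
    return []; // corrupted JSON: start over with an empty history
  }
}

// Stores an entry newest-first, replacing any entry with the same id.
function saveGeneration(store: KeyValueStore, entry: Generation): Generation[] {
  const rest = loadHistory(store).filter((g) => g.id !== entry.id);
  const next = [entry, ...rest].slice(0, MAX_ENTRIES);
  store.setItem(HISTORY_KEY, JSON.stringify(next));
  return next;
}

// In-memory stand-in for window.localStorage, useful in tests.
function createMemoryStore(): KeyValueStore {
  const data = new Map<string, string>();
  return {
    getItem: (key) => data.get(key) ?? null,
    setItem: (key, value) => {
      data.set(key, value);
    },
  };
}
```

In a browser, `window.localStorage` satisfies the `KeyValueStore` interface and can be passed in directly, for example `saveGeneration(window.localStorage, entry)`.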
- Frontend: React, Vite, TypeScript
- Styling: Tailwind CSS
- AI: Google GenAI SDK (@google/genai)
- Icons: Lucide React
- Home page
- Upload Audio and Choose a Persona: Select how you want your content to be reimagined.
- Generated Script: View the original transcript side-by-side with the transformed persona edition.
- Interactive Chat: Have a conversation with the selected persona about the content of your recording.
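One way to keep a chat in character is to build a system instruction from the chosen persona and the transcript before each conversation. The sketch below is an assumption about how this might be done, not PodAI's actual prompts: the persona names and styles come from the feature list, while the ids, the instruction wording, and the helper names are hypothetical. The call to the model itself is left out.

```typescript
// Persona names and styles are taken from the feature list; the ids are assumed.
const PERSONAS = {
  standUp: { name: "The Stand-Up", style: "Comedy host style" },
  analyst: { name: "The Analyst", style: "Tech reviewer style" },
  narrator: { name: "The Narrator", style: "NPR-style storytelling" },
  provocateur: { name: "The Provocateur", style: "High-energy debate" },
  essentialist: { name: "The Essentialist", style: "Minimalist productivity guru" },
  futurist: { name: "The Futurist", style: "Sci-fi visionary" },
} as const;

type PersonaId = keyof typeof PERSONAS;

// One message in the conversation; "user" and "model" mirror Gemini's role names.
interface ChatTurn {
  role: "user" | "model";
  text: string;
}

// Builds an instruction that keeps the model in character and tied to the transcript.
// The wording is illustrative only.
function buildSystemInstruction(id: PersonaId, transcript: string): string {
  const { name, style } = PERSONAS[id];
  return [
    `You are ${name}. Speak in this style: ${style}.`,
    "Answer questions only about the transcript below.",
    "If the transcript does not cover a question, say so.",
    "--- TRANSCRIPT ---",
    transcript.trim(),
  ].join("\n");
}

// Appends a user message, returning a new array and leaving the input untouched.
function appendUserTurn(history: readonly ChatTurn[], text: string): ChatTurn[] {
  return [...history, { role: "user", text: text.trim() }];
}
```

The resulting instruction string and turn list would then be handed to whichever model call the app uses. Keeping them as plain data makes the persona logic easy to test without network access.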
