An AI-powered YouTube video summarization and question-answering app built with Gradio, LangChain, FAISS, and IBM watsonx.ai. This application extracts YouTube transcripts, summarizes long videos, and allows users to ask contextual questions grounded strictly in the video content.
- 📜 Automatic YouTube Transcript Extraction
- ✂️ Smart Text Chunking using LangChain splitters
- 🧠 Semantic Search with FAISS
- 🧾 Concise Video Summarization
- ❓ Context-aware Question Answering
- 🔗 IBM Granite LLM (watsonx.ai)
- 🌐 Gradio Web UI
YouTube URL
↓
Transcript Extraction (youtube-transcript-api)
↓
Preprocessing & Chunking (LangChain)
↓
Embeddings (IBM Slate-30M)
↓
FAISS Vector Store
↓
LLM (IBM Granite 3.2)
↓
Summary / Q&A Output
| Layer | Technology |
|---|---|
| UI | Gradio |
| LLM | IBM Granite 3.2 (watsonx.ai) |
| Embeddings | IBM Slate-30M |
| Framework | LangChain |
| Vector DB | FAISS |
| Transcript | youtube-transcript-api |
| Language | Python 3.10+ |
git clone https://github.com/PRONGS-CHIRAG/Vidintel.git
cd Vidintelpython -m venv venv
source venv/bin/activate # macOS/Linuxpip install -r requirements.txtYou must have access to IBM watsonx.ai.
- IBM Cloud Account
- watsonx.ai project
- Project ID
The app currently uses:
project_id = "skills-network"
url = "https://us-south.ml.cloud.ibm.com"
⚠️ For production use, store credentials securely using environment variables.
python app.pyThen open:
http://localhost:7860
- Paste a YouTube video URL
- Click “Summarize Video”
- Receive a concise AI-generated summary
- Enter a question about the video
- Click “Ask a Question”
- Get an answer grounded in the transcript
.
├── app.py # Main Gradio application
├── requirements.txt # Python dependencies
├── README.md # Project documentation
- ibm/granite-3-2-8b-instruct
- ibm/slate-30m-english-rtrvr-v2
- ❌ Works only with videos that have English transcripts
- ❌ Private / restricted videos are not supported
⚠️ Transcript fetching depends on YouTube availability⚠️ Cold start latency due to LLM initialization
- ⏱️ Timestamp-aware answers
- 🌍 Multi-language support
- 💾 Persistent vector storage
- 🔄 Streaming responses
- 🔐 Secure credential handling
- 🚀 Deployment on Hugging Face Spaces
Contributions are welcome!
- Fork the repository
- Create a feature branch
- Commit your changes
- Open a Pull Request
MIT License Feel free to use, modify, and distribute.
- IBM watsonx.ai
- LangChain
- Gradio
- FAISS
- YouTube Transcript API
If you’re building AI, RAG, or agentic systems and want to collaborate:
Chirag 🔗 LinkedIn - https://www.linkedin.com/in/chiragnvijay/ 💡 Open to research, startups, and applied AI projects