Skip to content

Latest commit

 

History

History
44 lines (30 loc) · 1.67 KB

README.md

File metadata and controls

44 lines (30 loc) · 1.67 KB

Streamlit Multimodal RAG

This Streamlit application implements a multimodal Question Answering (QA) system using the LangChain library.

Key Features

  • Interactive Streamlit UI for file uploads, DB build, and QA
  • Accepts input files in PDF, audio (WAV, MP3, opus), and text formats
  • Transcribes audio to text using HuggingFace DistilWhisper models
  • Audio transcription runs in close to real-time on CPU
  • Background loading of models takes time, notice top-right running indicator
  • Requires HuggingFace API key
  • Docker container exposes port 8001, access UI with browser localhost:8001

Flowchart

Models Used

Installation

docker build -t streamlit-app .
docker run -p 8001:8001 --rm streamlit-app

GUI access localhost:8001

Note

Please be aware that this is only a Proof of Concept system and may contain bugs or unfinished features.

Resources