A Retrieval-Augmented Generation (RAG) system using Pinecone, Ollama, and Streamlit.
- 📄 Multi-document upload (PDF, TXT, MD, etc.)
- 🔍 Semantic search with vector embeddings
- 💬 Chat with your documents using a local LLM
- 🎨 Beautiful Streamlit UI
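
To illustrate what semantic search with vector embeddings means, here is a minimal sketch, not the app's actual code. It assumes each text chunk has already been turned into an embedding vector; in the app the embeddings would presumably come from the embedding model pulled through Ollama, with Pinecone handling the search. The toy 3-dimensional vectors and the names `cosine_similarity` and `top_k` are hypothetical.

```python
import math

def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # treat a zero vector as similar to nothing
    return dot / (norm_a * norm_b)

def top_k(query_vec, indexed, k=2):
    """Return the k (score, text) pairs most similar to query_vec."""
    scored = [(cosine_similarity(query_vec, vec), text) for text, vec in indexed]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return scored[:k]

# Toy embeddings, made up for illustration; real ones have hundreds of dimensions.
indexed = [
    ("Pinecone stores vectors", [0.9, 0.1, 0.0]),
    ("Streamlit renders the UI", [0.0, 0.2, 0.9]),
    ("Ollama runs local models", [0.1, 0.9, 0.1]),
]
query = [0.8, 0.2, 0.0]  # pretend embedding of "where are vectors stored?"
results = top_k(query, indexed, k=2)
for score, text in results:
    print(f"{score:.3f}  {text}")
```

The chunk whose vector points in nearly the same direction as the query ranks first, regardless of whether the two share any exact words.
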
- Install dependencies:
  `uv sync`
- Set up Pinecone:
  - Sign up at https://pinecone.io
  - Get your API key
  - Create `.streamlit/secrets.toml`:
    `PINECONE_API_KEY = "your_key_here"`
- Install Ollama models:
  `ollama pull hf.co/CompendiumLabs/bge-base-en-v1.5-gguf`
  `ollama pull hf.co/bartowski/Llama-3.2-1B-Instruct-GGUF`
- Run the app:
  `streamlit run main.py`

Last updated: Sunday, 11 Jan 2026 at 9:47 PM.
Still in progress. Testing and running right now.
- Upload documents via sidebar
- Click "Process Documents"
- Start chatting!
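
The steps above can be pictured as two stages: processing splits each uploaded document into chunks to be embedded and indexed, and chatting places the retrieved chunks into the prompt sent to the local LLM. The following is a minimal sketch under those assumptions; `chunk_text`, `build_prompt`, the chunk size, the overlap, and the prompt wording are all hypothetical rather than taken from `main.py`.

```python
def chunk_text(text, size=200, overlap=50):
    """Split text into character windows of `size` that share `overlap` characters."""
    if size <= 0 or not 0 <= overlap < size:
        raise ValueError("need size > 0 and 0 <= overlap < size")
    step = size - overlap
    chunks = []
    for start in range(0, len(text), step):
        chunks.append(text[start:start + size])
        if start + size >= len(text):
            break  # the window already reached the end of the text
    return chunks

def build_prompt(question, context_chunks):
    """Number the retrieved chunks and place them ahead of the question."""
    context = "\n\n".join(f"[{i + 1}] {c}" for i, c in enumerate(context_chunks))
    return (
        "Answer using only the context below.\n\n"
        f"Context:\n{context}\n\n"
        f"Question: {question}\nAnswer:"
    )

document = "abcdefghij" * 3  # 30 characters standing in for an uploaded file
chunks = chunk_text(document, size=12, overlap=4)
prompt = build_prompt("What does the document say?", chunks[:2])
print(chunks)
print(prompt)
```

Overlapping windows reduce the chance that a sentence split across a chunk boundary is lost to retrieval; the sizes here are tiny only so the output is easy to read.
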