Complete Sinhala chatbot system using Hybrid RAG (JSON + Text), Streamlit UI, FAISS retrieval, and Ollama local inference.
- Fully offline-capable at runtime
- Ollama local LLM inference (http://localhost:11434/api/generate)
- Streamlit chat interface
- Sinhala Unicode input/output
- In-session memory for recent conversation turns (last 10)
- Hybrid retrieval flow:
- JSON semantic retrieval (Top 2)
- Text semantic retrieval from selected topics (Top 3)
- Context merge and grounded generation
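The hybrid flow above can be sketched as a small merge step. This is a minimal illustration with hypothetical helper names (`json_hits` / `text_hits` stand in for results from the FAISS retrievers in `json_retriever.py` and `text_retriever.py`); the real project's merge logic may differ:

```python
# Sketch of the hybrid context merge: top-2 JSON hits plus top-3 text hits,
# deduplicated in order, joined into one grounding context for the prompt.
def merge_context(json_hits, text_hits):
    parts = [hit["text"] for hit in json_hits[:2]]
    parts += [hit["text"] for hit in text_hits[:3]]
    seen, merged = set(), []
    for p in parts:
        if p not in seen:          # keep first occurrence only
            seen.add(p)
            merged.append(p)
    return "\n\n".join(merged)
```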
project/
├── app.py
├── chatbot/
│ ├── hybrid_retriever.py
│ ├── json_retriever.py
│ ├── text_retriever.py
│ ├── embeddings.py
│ ├── prompt.py
│ ├── ollama.py
│ ├── memory.py
│ ├── build_indexes.py
│ └── test_queries.py
├── data/
│ ├── knowledge.json
│ └── documents/
│ ├── headache.txt
│ ├── stress.txt
│ └── ...
├── vectorstore/
│ ├── json_index.faiss
│ └── text_index.faiss
├── requirements.txt
└── README.md
- Ollama must be installed locally.
- Gemma model must already exist locally.
- Sentence-transformers embedding model must be locally available.
Runtime code uses local-only embedding loading by default (EMBEDDING_LOCAL_ONLY=1).
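One way `EMBEDDING_LOCAL_ONLY=1` can be honored is by switching the Hugging Face libraries into offline mode before any model is loaded. This is an assumed implementation sketch (the env vars `HF_HUB_OFFLINE` and `TRANSFORMERS_OFFLINE` are real Hugging Face settings; the gating function name is hypothetical):

```python
import os

def configure_offline_embeddings():
    # If EMBEDDING_LOCAL_ONLY=1 (the default here), force Hugging Face
    # libraries into offline mode so no download is attempted at runtime.
    if os.environ.get("EMBEDDING_LOCAL_ONLY", "1") == "1":
        os.environ["HF_HUB_OFFLINE"] = "1"
        os.environ["TRANSFORMERS_OFFLINE"] = "1"
    # Report whether offline mode is active.
    return os.environ.get("HF_HUB_OFFLINE") == "1"
```

Call this once at startup, before constructing the sentence-transformers model, so a missing local model fails fast instead of triggering a network fetch.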
pip install -r requirements.txt
ollama serve
ollama pull gemma

After the model has been pulled once, usage is fully local/offline.
python -m chatbot.build_indexes

This generates:
vectorstore/json_index.faiss
vectorstore/text_index.faiss
streamlit run app.py --server.port 8501

Open: http://localhost:8501
The exact assignment prompt is used in chatbot/prompt.py with a strict fallback response:
"මට ඒ පිළිබඳ ප්රමාණවත් තොරතුරු නොමැත" ("I do not have sufficient information about that")
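A grounded prompt with this strict fallback might be assembled as below. This is a hypothetical sketch only; the exact assignment prompt lives in `chatbot/prompt.py`, and the function name and wording here are illustrative:

```python
# Sketch of grounded prompt construction with the strict fallback sentence
# and the last-10-turn in-session memory described above.
FALLBACK = "මට ඒ පිළිබඳ ප්රමාණවත් තොරතුරු නොමැත"

def build_prompt(context: str, history: list, question: str) -> str:
    # Keep only the 10 most recent (role, text) conversation turns.
    turns = "\n".join(f"{role}: {text}" for role, text in history[-10:])
    return (
        "Answer in Sinhala using ONLY the context below. "
        f"If the context is insufficient, reply exactly: {FALLBACK}\n\n"
        f"Context:\n{context}\n\nHistory:\n{turns}\n\n"
        f"Question: {question}\nAnswer:"
    )
```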
Run retrieval tests:
python -m chatbot.test_queries

The script includes 20 Sinhala test cases and checks retrieval correctness against each query's expected topic.
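The expected-topic check can be sketched as a simple scoring loop. The names here (`evaluate`, `retrieve_topic`) are hypothetical stand-ins for what `test_queries.py` actually does:

```python
# Sketch of retrieval-correctness checking: each case pairs a Sinhala query
# with the topic the retriever is expected to surface.
def evaluate(cases, retrieve_topic):
    """cases: list of (query, expected_topic). Returns (passed, total)."""
    passed = 0
    for query, expected in cases:
        if retrieve_topic(query) == expected:
            passed += 1
    return passed, len(cases)
```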
- No keyword matching is used for retrieval.
- The full dataset is never sent to the model.
- Only the retrieved context is sent to Ollama.
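The call to the local endpoint can be sketched as follows, using only the Python standard library. Function names are hypothetical (the real logic is in `chatbot/ollama.py`); the endpoint and the non-streaming `/api/generate` response shape (`"response"` field) follow Ollama's REST API:

```python
import json
import urllib.request

def build_payload(model: str, prompt: str) -> dict:
    # stream=False makes Ollama return one JSON object instead of a stream.
    return {"model": model, "prompt": prompt, "stream": False}

def generate(prompt: str, model: str = "gemma",
             url: str = "http://localhost:11434/api/generate") -> str:
    # Only the prompt built from the retrieved context is sent, never the
    # full dataset.
    data = json.dumps(build_payload(model, prompt)).encode("utf-8")
    req = urllib.request.Request(
        url, data=data, headers={"Content-Type": "application/json"})
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())["response"]
```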