Local AI chatbot using Ollama with persistent conversation memory stored in SQLite.
Designed for low-resource devices (CPU-only laptops) without GPU.
- Local LLM using Ollama
- Persistent chat memory with SQLite
- Session-based conversations
- Lightweight design (CPU friendly)
- Simple CLI interface
User → Python CLI → SQLite Memory → Ollama → Response
Install dependencies:
pip install ollamaMake sure Ollama is installed and running:
Pull a lightweight model:
ollama pull llama3.2:1bEdit config.json:
{
"model": "llama3.2:1b",
"system_prompt": "You are Jajang, a helpful AI assistant. Answer clearly and concisely.",
"stream": false
}Run chatbot:
python app.pyYou: halo selamat malam
AI: Selamat malam! Ada yang bisa saya bantu?
You: nama kamu siapa
AI: Saya Jajang, asisten AI kamu.
Chat history is stored in SQLite database:
- Persistent across sessions
- Session-based isolation
- Automatically loaded into model context
- Uses small model (llama3.2:1b)
- Running on CPU only may cause slower responses
- Possible hallucination due to model size
- Limited context window
This project is designed for:
- Learning LLM integration
- Local AI experimentation
- Lightweight chatbot systems
- Embedded / low-resource AI applications
Not intended for production-scale deployment.
MIT License