🚀 Built as a hands-on exploration of AI engineering using Spring AI and Retrieval-Augmented Generation (RAG) patterns.
This is an actively evolving project focused on learning and implementing real-world AI engineering patterns.
AI Assistant is a backend application that enables querying and extracting insights from documents using retrieval-based techniques.
The system is designed to simulate real-world LLM-powered applications while running fully locally, without requiring paid APIs.
This project was built to:
- Move beyond theoretical AI learning into hands-on implementation
- Understand how real-world AI systems are structured (chat, retrieval, services)
- Explore Retrieval-Augmented Generation (RAG) patterns in a practical setup
- Build a cost-efficient AI system that runs without external dependencies
- 📄 Document ingestion and processing
- 🔍 Context-aware retrieval
- 💬 Chat-based query interface
- 🧩 Modular architecture (Chat + RAG layers)
- 💻 Fully local execution (no paid LLM APIs)
- Backend: Java, Spring Boot
- AI Framework: Spring AI
- Architecture: Layered (Controller → Service → AI components)
- Configuration: YAML
The application is structured into clear, modular components:
Handles user interaction and query processing.
Responsible for:
- Document ingestion
- Context retrieval
Encapsulates business logic and orchestration.
- User submits a query via the chat endpoint
- System retrieves relevant document context
- Context is used to generate a response
- Response is returned to the user
This follows a simplified Retrieval-Augmented Generation (RAG) workflow.