A local-first AI developer assistant for understanding, documenting, analyzing, and improving software repositories using free AI models and modern retrieval pipelines.
Built as a production-style AI engineering portfolio project focused on developer tooling, repository intelligence, semantic retrieval, and AI-assisted maintenance workflows.
- Safe local repository scanning
- Repository structure visualization
- Architecture layer detection
- Dependency/import analysis
- Language and file-type detection
- Repository metrics and statistics
- AI repository summaries
- Repository Q&A using semantic retrieval
- AI file explanations
- README generation
- Refactor/improvement suggestions
- Safe patch preview generation
- Suggested fix detection
- TODO / FIXME extraction
- Potential issue detection
- Suggested improvements
- Architecture insights
- Maintainability analysis
- Multi-provider AI support
- Ollama
- Groq
- Gemini
- Semantic vector retrieval
- Persistent FAISS vector indexes
- Embedding-based repository search
- Local-first caching system
Frontend (Streamlit)
│
▼
FastAPI Backend API Layer
│
├── Repository Scanner
├── Analysis Engine
├── AI Provider Layer
├── Retrieval Pipeline
├── Vector Search (FAISS)
├── Caching System
└── Refactor / Patch Engine
- Python
- FastAPI
- SQLite
- Streamlit
- Plotly
- Ollama
- Groq API
- Gemini API
- sentence-transformers
- FAISS
- Local JSON caching
- Modular API routing
- Repository-safe scanning rules
backend/
├── analysis/
├── api/
├── core/
├── ingestion/
├── llm/
├── retrieval/
├── scanner/
frontend/
├── streamlit_app.py
data/
├── cache/
├── indexes/
git clone <your-repository-url>
cd ai-codebase-assistantpython -m venv .venvActivate:
.venv\Scripts\activatesource .venv/bin/activatecd backend
pip install -r requirements.txtuvicorn app.main:appOpen another terminal:
cd frontend
streamlit run streamlit_app.pyInstall Ollama:
Example local model:
ollama pull gemma2:2bSet environment variable:
GROQ_API_KEY=your_key_hereSet environment variable:
GEMINI_API_KEY=your_key_here- Local repository scanning
- Persistent vector search indexes
- AI-assisted repository understanding
- AI-powered developer Q&A
- Dependency mapping
- README generation
- Refactor suggestion generation
- Patch preview generation
- Multi-provider AI routing
- Local caching system
- Tree-sitter AST parsing
- Monaco-style code viewer
- True patch application engine
- Dockerized deployment
- GitHub repository ingestion
- Multi-language parsing
- Async/background processing
- Session/project history
- Advanced architecture graphing
This project was designed to:
- simulate real AI developer tooling platforms
- demonstrate AI engineering architecture skills
- showcase retrieval-augmented workflows
- demonstrate scalable backend organization
- remain fully usable with free/local AI models
MIT License








