Refactor to production-grade Jarvis AI assistant with FastAPI backend by askmy-stack · Pull Request #1 · askmy-stack/interactive-chatbot

askmy-stack · 2026-04-08T19:40:09Z

Summary

Completely refactored the codebase from a simple Streamlit chatbot into a production-grade personal AI assistant called Jarvis. The new architecture separates concerns into a FastAPI backend with a tool-calling LangChain agent and a Streamlit frontend that streams responses token-by-token.

Key Changes

Backend Architecture: Created a FastAPI backend (backend/main.py) that exposes /chat/stream for streaming responses, /health for liveness checks, and session management endpoints
LangChain Agent: Implemented a tool-calling agent (backend/agent.py) using gpt-4o-mini with four integrated tools:
- Web search via DuckDuckGo (no API key required)
- Live weather via Open-Meteo (no API key required)
- System resource monitoring via psutil
- Smart home device control via Home Assistant REST API
Vector Memory: Added persistent ChromaDB-backed memory (backend/memory.py) that stores conversation exchanges and retrieves semantically similar past interactions to inject context into the agent
Streaming: Implemented token-by-token streaming via AgentExecutor.astream_events(version="v2") so users see text appearing in real time as the LLM generates it
Configuration: Introduced typed, validated settings via pydantic-settings (backend/config.py) that reads from .env with sensible defaults
Streamlit UI: Completely rewrote app.py to:
- Connect to the FastAPI backend via HTTP
- Display chat history with proper message roles
- Stream responses with a visual cursor indicator
- Show backend health status and example prompts in sidebar
- Support session management with UUID-based session IDs
Testing: Added comprehensive test suite:
- tests/test_tools.py — unit tests for all tools (no LLM calls)
- tests/test_api.py — FastAPI endpoint tests with mocked agent
- tests/conftest.py — shared fixtures for CI compatibility
DevOps & Documentation:
- Added Dockerfile and docker-compose.yml for one-command deployment
- Created comprehensive README.md with architecture diagram, quick start, and configuration guide
- Added CLAUDE.md for AI assistant guidance on the codebase
- Added GitHub Actions CI workflow (ci.yml) for linting, type-checking, testing, and secret scanning
- Added .gitignore and .env.example
Project Metadata: Created pyproject.toml with all dependencies, dev extras, and tool configurations (ruff, mypy, pytest)

Notable Implementation Details

Security: Device control tool uses strict allow-lists for both entity domains and actions to prevent LLM from executing arbitrary Home Assistant API calls
Observability: Integrated structlog for structured JSON logging and optional LangSmith tracing support
Session Management: In-memory session store with per-session chat history; ready to swap for Redis for horizontal scaling
Lazy Initialization: Agent executor is built lazily on first request, making imports test-safe
Error Handling: Graceful fallbacks for missing optional services (Home Assistant, LangSmith) and clear user-facing error messages

https://claude.ai/code/session_01BsGXyhfA6BWuSjaA23KmET

Documents project structure, tech stack, setup instructions, key conventions, known issues, and LangChain patterns for use by AI coding assistants. https://claude.ai/code/session_01BsGXyhfA6BWuSjaA23KmET

Complete rewrite implementing the full modernization roadmap: Architecture: - FastAPI async backend (backend/main.py) with /chat/stream SSE endpoint - Streamlit frontend (app.py) calls backend, renders streaming tokens in real time - LangChain v0.3 tool-calling agent (backend/agent.py) with astream_events v2 - ChromaDB persistent vector memory (backend/memory.py) — survives restarts - pydantic-settings typed config (backend/config.py) — fails fast on missing keys Four Jarvis tools: - get_system_info — psutil CPU/memory/disk (no API key) - web_search — DuckDuckGo (no API key) - get_weather — Open-Meteo free API (no API key) - control_device — Home Assistant REST API (optional, allow-listed) Production readiness: - .gitignore, .env.example, pyproject.toml with uv support - Dockerfile + docker-compose.yml (one-command deploy, chroma_db volume) - GitHub Actions CI: ruff lint, mypy type-check, pytest, gitleaks secret scan - structlog structured logging throughout - LangSmith tracing support (opt-in via LANGCHAIN_TRACING_V2=true) Tests: - tests/test_tools.py — tool unit tests, allow-list enforcement, no LLM calls - tests/test_api.py — FastAPI endpoint tests with mocked agent - tests/conftest.py — fake API key fixture for CI Fixes all pre-existing bugs: - Logic flaw: model no longer called on every keystroke - Deprecated langchain.chat_models / langchain.llms imports updated to v0.3 - Deprecated chat(messages) call replaced with astream_events - input builtin no longer shadowed https://claude.ai/code/session_01BsGXyhfA6BWuSjaA23KmET

Documents: - What has been completed (architecture redesign, tools, streaming, memory, etc.) - What remains to do (revoke API keys, set up .env, run tests, deploy) - Step-by-step execution for local development, Docker deployment, and CI/CD - Architecture explained in simple terms - Troubleshooting common issues - Complete checklist for deployment https://claude.ai/code/session_01BsGXyhfA6BWuSjaA23KmET

claude added 3 commits April 6, 2026 22:58

Add CLAUDE.md with codebase documentation for AI assistants

38eb03f

Documents project structure, tech stack, setup instructions, key conventions, known issues, and LangChain patterns for use by AI coding assistants. https://claude.ai/code/session_01BsGXyhfA6BWuSjaA23KmET

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Refactor to production-grade Jarvis AI assistant with FastAPI backend#1

Refactor to production-grade Jarvis AI assistant with FastAPI backend#1
askmy-stack wants to merge 3 commits into
mainfrom
claude/add-claude-documentation-GUacQ

askmy-stack commented Apr 8, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

askmy-stack commented Apr 8, 2026

Summary

Key Changes

Notable Implementation Details

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants