A production-ready template to help you kickstart and organize your Generative AI projects with clarity and scalability in mind.
Designed to reduce chaos in early development and support long-term maintainability with proven structure and practices.
A production-ready template for building scalable Generative AI apps β structured, maintainable, and built on real-world best practices.
π config/ β YAML config for models, prompts, logging
π data/ β Prompts, embeddings, and other dynamic content
π examples/ β Minimal scripts to test key features
π notebooks/ β Quick experiments and prototyping
π tests/ β Unit, integration, and end-to-end tests
π src/ β The core engine β all logic lives here:
βββ agents/ β Agent classes: planner, executor, base agent
βββ memory/ β Short-term and long-term memory modules
βββ pipelines/ β Chat flows, doc processing, and task routing
βββ retrieval/ β Vector search and document lookup
βββ skills/ β Extra abilities: web search, code execution
βββ vision_audio/ β Multimodal processing: image and audio
βββ prompt_engineering/β Prompt chaining, templates, few-shot logic
βββ llm/ β OpenAI, Anthropic, and custom LLM routing
βββ fallback/ β Recovery logic when LLMs fail
βββ guardrails/ β PII filters, output validation, safety checks
βββ handlers/ β Input/output processing and error management
βββ utils/ β Logging, caching, rate limiting, token counting
- Track prompt versions and results
- Separate configs using YAML files
- Structure code by clear module boundaries
- Cache responses to reduce latency and cost
- Handle errors with custom exceptions
- Use notebooks for rapid testing and iteration
- Monitor API usage and set rate limits
- Keep code and docs in sync
- Clone the repo
- Install via
requirements.txt - Set up model configs
- Check sample code
- Begin in notebooks
- Use modular structure
- Test components early
- Track with version control
- Keep datasets fresh
- Monitor API usage
requirements.txtβ Package dependenciesREADME.mdβ Project overview and usageDockerfileβ Container build instructions
This project is licensed under the Apache 2.0 License.
You are free to use, modify, and distribute with minimal restriction.