AI Systems & LLM Backend Engineer
Building scalable AI architectures, production-ready agent systems, and high-performance backend infrastructures.
I specialize in designing and engineering robust AI-powered systems, from Retrieval-Augmented Generation (RAG) pipelines to advanced LLM orchestration and distributed backend architectures.
- Designing AI Agent Architectures (tool-calling, multi-agent workflows, orchestration)
- Building Production-Grade RAG Pipelines (vector search, hybrid retrieval, reranking)
- LLM Infrastructure (vLLM, inference optimization, prompt engineering)
- Scalable Backend Systems for AI applications
- Experimenting with fine-tuning strategies & model optimization
I focus on understanding systems at a fundamental level, because once the core mechanics are clear, you can build anything on top of them.
Python • LangChain • LLM APIs • RAG • Agents • Vector Databases • Embeddings • vLLM • Prompt Engineering
Node.js • NestJS • Express • TypeScript • REST APIs • Kafka • Docker • AWS • Microservices
MongoDB • PostgreSQL • Cloud Basics (AWS) • GitHub Actions
React (MERN): building UI layers for AI-powered applications
- AI-powered backend systems
- Agent-driven automation workflows
- Scalable APIs for LLM-based products
- Full-stack AI applications
- Clean, understandable educational AI implementations
- Advanced Agent Orchestration
- Streaming LLM systems
- High-performance inference optimization
- Distributed AI architectures
