AI Engineer | LLM Training & Post-Training | Agentic AI | RAG | GenAI
Building, training, and orchestrating intelligent systems, from data and gradients to autonomous agents
- 🧪 Working on LLM training & post-training: reinforcement learning gyms for tool use, GRPO and policy-optimization pipelines, reward modeling, and SFT
- 🤖 Building agentic systems with LangChain & LangGraph: multi-step reasoning, tool calling, and stateful multi-agent orchestration
- 🚀 Shipping production-grade GenAI applications with FastAPI, vector stores, and enterprise-grade RAG architectures
- 🎤 Hosting webinars & workshops on prompt engineering, agent-based LLM systems, and LLM evaluation
| Project | Description | Tech |
|---|---|---|
| 🏋️ RL Gym for Tool Use | Custom RL environment for training LLMs on multi-tool, multi-turn agent tasks, with reward shaping, trajectory rollouts, and verifier-based scoring | NVIDIA NeMo-Gym, PyTorch, vLLM |
| 🎯 GRPO Training Pipeline | Group Relative Policy Optimization pipeline for post-training LLMs on tool-use and reasoning traces, with reference-model KL control and reward aggregation | GRPO, PyTorch, Hugging Face TRL |
| 🕸️ LangGraph Multi-Agent Orchestrator | Stateful agent graphs with planner / executor / critic loops, tool routing, memory, and human-in-the-loop checkpoints | LangGraph, LangChain, FastAPI |
| LLM Eval & Reward Modeling | Scenario-based evaluation harness with pass@k, tool-call verifiers, and reward-model training on preference data | Python, LangSmith, TRL |
| 🧾 Legal Contract AI Reviewer | Agentic AI bot that reviews contracts for risks using LangGraph and Azure OpenAI | LangChain, RAG, Azure |
| 🤖 Job Application Automator | Automated ATS with resume parsing, answer generation, and a Power BI dashboard | FastAPI, PostgreSQL, GPT |
| Price Intelligence Bot | AI-based price extraction and matching across competitor e-commerce sites | GPT-4o Search, Crawler, Python |
| Prompt Engineering Toolkit | Reusable, tested prompt templates with LangSmith evaluation logs | PromptFlow, LangChain, FastAPI |
I recently moved to this GitHub account. My earlier projects (68+ public repos across GenAI, RAG, agentic apps, and more) live on my previous profile. Feel free to browse them there:
🧠 “Great AI systems don’t just respond; they act, adapt, and evolve.”



