I'm a recent grad who got into AI because I wanted to understand how LLMs actually work, not just call an API and move on.
Most of my projects start with a question I couldn't find a good answer to. Why does RAG fail silently? Why does the top-ranked result from hybrid search feel wrong once you re-read the original prompt? Why does Spotify know your listening history but not what you were going through when you hit repeat?
That curiosity tends to turn into code.
Python · HuggingFace Transformers · LangChain · FAISS · BM25 · FastAPI · React · Three.js
Smart Bollywood Song Recommender You describe your exact moment. One line. It finds the song where a specific lyric proves it understood you, not your mood, not your genre, your moment. Hybrid search, dual LLM pipeline, 3D constellation UI. Live on HuggingFace.
Explainable RAG System RAG that shows its work. Full debug UI so you can see why each chunk was retrieved, how it ranked, and where the answer actually came from. Built because black-box retrieval is a real problem, not a hypothetical one.
LLM Visualizer Peek inside GPT-2. Tokens, attention maps, embedding space, next-token probabilities. All local. Built this to understand what was actually happening inside the model. Turns out it helps others too.
Error Clustering System Takes thousands of noisy logs, clusters them into meaningful failure groups, names the root causes, tells you where to look first. Log noise is a real engineering problem.
AI Debate Arena Two agents argue opposite sides of any topic. A judge scores each round on logic, evidence, and rebuttal. What happens when LLMs have to persuade each other.
Digging into agent architectures and mechanistic interpretability. Looking for a full-time AI/ML role, onsite or hybrid.
- LinkedIn: https://www.linkedin.com/in/ilaa-chenjeri
- Email: ilaa.chenjeri@gmail.com
If you're working on something interesting in the LLM space, I'm always up for a conversation.