Full-Stack Developer | AI/ML Engineer | Building data-efficient, interpretable AI solutions
Welcome to my GitHub where I build AI systems that work with real-world constraints (limited data, need for transparency, immediate deployment).
A production-grade quality inspection system demonstrating interpretable ML without massive datasets. This project showcases cutting-edge thinking about agentic AI and why transparency matters in high-stakes domains.
What makes this special:
- β Agentic Architecture β Dynamic tool selection & decision-making workflows
- β LLM Integration β OpenAI, Google Gemini, Claude, with graceful fallbacks
- β Data Efficiency β Works with minimal samples; no training data required
- β Full Interpretability β Every decision is traceable to specific measurements
- β Production Ready β Deployed on Streamlit with comprehensive documentation
Tech Stack: Python, PyTorch, Streamlit, LLMs (OpenAI/Claude/Gemini), scikit-image
Real-world Impact: Shows how to build AI for agricultural & manufacturing quality control when data is scarce
A production-grade RAG pipeline designed for deep analysis of academic research. This tool bridges the gap between raw PDF data and actionable insights using a sophisticated hybrid retrieval architecture.
What makes this special:
- β Hybrid Retrieval β Combines FAISS semantic search with BM25 keyword matching for maximum recall.
- β
Cross-Encoder Reranking β Prioritizes the most relevant paper segments using
ms-marco-MiniLMto ensure high-quality context. - β
Dual AI Engine β Flexibility between local
Flan-T5for privacy andGemini 1.5 Flashfor high-reasoning synthesis. - β Metadata Anchoring β Enforces strict grounding in paper Abstracts and Titles to minimize hallucinations.
- β Optimized Workflow β Features parse-and-purge memory management and adaptive context windowing.
Tech Stack: Python, Streamlit, Google Gemini 1.5, FAISS, BAAI Embeddings, PyMuPDF4LLM
Real-world Impact: A powerful, privacy-focused RAG pipeline that bridges the gap between raw research PDFs and actionable scientific insights using hybrid search and dual AI engines.
π€ AI Projects β Collection of production AI applications:
- π§ OmniConvert β All-in-one media converter (audio, text, images, video)
- π Course Generator β Personalized course outlines with GPT-4 | Demonstrates prompt engineering & API integration
- π€ FAQ Chatbot β Production chatbot with FastAPI & OpenAI GPT-3.5 | Built conversation history management
- π Gift Recommendation System β AI-powered personalization engine
π§ Machine Learning Projects β Diverse ML applications:
- π©Ί Diabetes Prediction β Health data classification with scikit-learn
- π΅ Lyrics Generator β Generative model using LSTM networks
- 𧬠MBTI Personality Prediction β NLP-based personality classification
ποΈ Computer Vision β OCR application with Streamlit & Flask
Languages: Python, JavaScript
ML/AI: PyTorch, TensorFlow, scikit-learn, LLM APIs (OpenAI, Claude, Gemini)
Frameworks: FastAPI, Streamlit, Flask, Spring Boot
Tools: Git, Jupyter, Docker, Figma, Regex101
Resources for mastering key technologies:
- π¦ Spring Boot Learning Path β My roadmap for Java Springboot
- β‘ FastAPI Hands-On β Quick tutorial for API beginners
- π’οΈ SQL Tutorial β Interactive SQL fundamentals
Design & UI: Canva | Figma | CodePen
Development: Regex101
- Open source contributions in ML/AI and interpretable AI
- Collaborations on data-efficient, production-grade AI systems
- Roles combining full-stack development with ML engineering
I'd love to discuss AI, engineering challenges, or collaboration opportunities:
- πΌ LinkedIn β Let's connect professionally
- π Portfolio β See my work
- π§ Email: moumitabasu597@gmail.com
Building AI systems that are fast, interpretable, and production-ready. π