Skip to content
View MoumitaBasu's full-sized avatar

Block or report MoumitaBasu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
MoumitaBasu/README.md

πŸ‘‹ Hi there, I'm Moumita!

Full-Stack Developer | AI/ML Engineer | Building data-efficient, interpretable AI solutions

Welcome to my GitHub where I build AI systems that work with real-world constraints (limited data, need for transparency, immediate deployment).


🌟 Featured Projects

πŸ”¬ Synthetic MRI Inspector β€” Data-Efficient Medical Imaging AI

Repository | Live Demo

A production-grade quality inspection system demonstrating interpretable ML without massive datasets. This project showcases cutting-edge thinking about agentic AI and why transparency matters in high-stakes domains.

What makes this special:

  • βœ… Agentic Architecture β€” Dynamic tool selection & decision-making workflows
  • βœ… LLM Integration β€” OpenAI, Google Gemini, Claude, with graceful fallbacks
  • βœ… Data Efficiency β€” Works with minimal samples; no training data required
  • βœ… Full Interpretability β€” Every decision is traceable to specific measurements
  • βœ… Production Ready β€” Deployed on Streamlit with comprehensive documentation

Tech Stack: Python, PyTorch, Streamlit, LLMs (OpenAI/Claude/Gemini), scikit-image
Real-world Impact: Shows how to build AI for agricultural & manufacturing quality control when data is scarce

πŸ”¬ Research Paper Discovery AI β€” Intelligent RAG for Academic Analysis

Repository | Live Demo

A production-grade RAG pipeline designed for deep analysis of academic research. This tool bridges the gap between raw PDF data and actionable insights using a sophisticated hybrid retrieval architecture.

What makes this special:

  • βœ… Hybrid Retrieval β€” Combines FAISS semantic search with BM25 keyword matching for maximum recall.
  • βœ… Cross-Encoder Reranking β€” Prioritizes the most relevant paper segments using ms-marco-MiniLM to ensure high-quality context.
  • βœ… Dual AI Engine β€” Flexibility between local Flan-T5 for privacy and Gemini 1.5 Flash for high-reasoning synthesis.
  • βœ… Metadata Anchoring β€” Enforces strict grounding in paper Abstracts and Titles to minimize hallucinations.
  • βœ… Optimized Workflow β€” Features parse-and-purge memory management and adaptive context windowing.

Tech Stack: Python, Streamlit, Google Gemini 1.5, FAISS, BAAI Embeddings, PyMuPDF4LLM
Real-world Impact: A powerful, privacy-focused RAG pipeline that bridges the gap between raw research PDFs and actionable scientific insights using hybrid search and dual AI engines.


πŸš€ All Projects

AI & Deep Learning

πŸ€– AI Projects β€” Collection of production AI applications:

  • 🎧 OmniConvert β€” All-in-one media converter (audio, text, images, video)
  • πŸ“š Course Generator β€” Personalized course outlines with GPT-4 | Demonstrates prompt engineering & API integration
  • πŸ€– FAQ Chatbot β€” Production chatbot with FastAPI & OpenAI GPT-3.5 | Built conversation history management
  • 🎁 Gift Recommendation System β€” AI-powered personalization engine

🧠 Machine Learning Projects β€” Diverse ML applications:

πŸ‘οΈ Computer Vision β€” OCR application with Streamlit & Flask


πŸ› οΈ Tech Stack & Skills

Languages: Python, JavaScript
ML/AI: PyTorch, TensorFlow, scikit-learn, LLM APIs (OpenAI, Claude, Gemini)
Frameworks: FastAPI, Streamlit, Flask, Spring Boot
Tools: Git, Jupyter, Docker, Figma, Regex101


πŸ“š Learning Resources I Curate

Resources for mastering key technologies:


🎨 Useful Tools & Resources

Design & UI: Canva | Figma | CodePen
Development: Regex101


🎯 What I'm Looking For

  • Open source contributions in ML/AI and interpretable AI
  • Collaborations on data-efficient, production-grade AI systems
  • Roles combining full-stack development with ML engineering

πŸ“« Let's Connect!

I'd love to discuss AI, engineering challenges, or collaboration opportunities:


πŸ“Š GitHub Stats

GitHub Stats


Building AI systems that are fast, interpretable, and production-ready. πŸš€

Pinned Loading

  1. faq-chatbot faq-chatbot Public

    Python

  2. Computer-Vision Computer-Vision Public

    Python

  3. MachineLearningProjects MachineLearningProjects Public

    Jupyter Notebook

  4. portfolio portfolio Public