Skip to content
View shubhrat12's full-sized avatar

Block or report shubhrat12

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse
shubhrat12/README.md

๐Ÿ‘‹ Hi, I'm Ashruj Gautam

๐ŸŽ“ Master's in CS @ University at Buffalo ย ย |ย ย  ๐Ÿค– AI/ML Enthusiast & SWE Explorer
๐Ÿ”— Email โ€ข LinkedIn

I build intelligent systems โ€” from scientific RAG pipelines to medical diagnostics,
and love turning deep learning + Software engineering into real-world applications.


๐Ÿง  About Me

  • ๐Ÿš€ I enjoy building production-grade AI systems and developer-focused tools with measurable real-world impact
  • ๐Ÿ’ก Passionate about the intersection of LLMs, RAG, multimodal learning, and software engineering
  • ๐Ÿงฐ Equally confident in training deep models or scaling fast APIs and databases
  • ๐Ÿ’ผ Open to roles in: AI/ML Engineering, Fullstack Development, Software Engineering, and Applied Research
  • ๐Ÿ“ซ Reach out:

๐Ÿš€ Highlighted Projects

Built an advanced Retrieval-Augmented Generation system for scientific PDFs โ€” integrating section classification, citation graph analytics, figure detection, and local LLM generation (Mistral 7B).

  • ๐Ÿ’ก Used Grobid, LayoutLM, ChromaDB, and fine-tuned BERT
  • ๐Ÿ”— Full hybrid vector + keyword retrieval system with prompt optimization
  • ๐Ÿ“š F1 Score: 99.54% (Reference Parsing), 79.94% (Section Classification)

AI diagnostic system combining X-ray imaging, clinical NLP, and tabular risk analytics into a unified medical assistant with prescription guidance.

  • ๐Ÿฉป DenseNet121 (CheXpert) + BioBERT + MLP Ensemble (Kidney, Heart, Diabetes)
  • ๐Ÿค– Integrated with LLaMA for contextual recommendations
  • ๐Ÿ’ป Packaged with PyQt5 GUI and local data fusion

Production-grade PostgreSQL analytics engine with stored procedures, triggers, indexing, and planned LLM augmentation for feedback analysis.

  • ๐Ÿง  11+ table schema, optimized queries, and stored functions in PL/pgSQL
  • ๐Ÿ“ˆ Real-time business metrics: customer behavior, delays, seller ranks
  • ๐Ÿง  Extending with Zephyr LLM to auto-summarize trends & complaints

Custom YOLO variant optimized for occluded fruit detection in retail scenarios. Compared performance against SSD, Faster R-CNN.

  • ๐Ÿ–ผ๏ธ Trained on MinneApple + COCO with resolution-aware tuning
  • ๐Ÿš€ Real-time object detection via optimized bounding box anchors

Built a semantic image search engine that lets users query with text or images and returns top-matching results using precomputed embeddings.

  • ๐Ÿง  Uses OpenAI CLIP + FAISS for vector similarity
  • ๐Ÿ–ผ๏ธ Visual & text-to-image search in real-time

๐Ÿ› ๏ธ Languages & Tools

Languages:
Python C++ SQL Java

Frameworks & APIs:
FastAPI Flask Streamlit React Node.js

AI & ML:
PyTorch TensorFlow HuggingFace LangChain ChromaDB

Data & Infra:
PostgreSQL MongoDB Docker Git GitHub


๐Ÿ“Š GitHub Stats

Ashruj's GitHub Stats Top Languages


โ€œCode is the closest thing we have to magic โ€” I just make sure mine solves the right problems.โ€

Pinned Loading

  1. Pratheek-Tirunagari-and-Ashruj-Gautam/ResearchRAG-End-to-End-Scientific-Research-Assistant-using-RAG Pratheek-Tirunagari-and-Ashruj-Gautam/ResearchRAG-End-to-End-Scientific-Research-Assistant-using-RAG Public

    A modular RAG pipeline for scientific QA, featuring structured PDF parsing, custom NLP models, citation analytics, and hybrid retrieval over 1000+ chunks. Powered by local Mistral 7B for fast, offlโ€ฆ

    Python

  2. Pratheek-Tirunagari-and-Ashruj-Gautam/Multi-Modal-Health-Insights-Platform Pratheek-Tirunagari-and-Ashruj-Gautam/Multi-Modal-Health-Insights-Platform Public

    A medical diagnostic AI system combining chest X-ray analysis (DenseNet121: 0.95 F1, 0.97 AUC), medical text interpretation (BioBERT: 0.87 F1, 0.99 AUC), and specialized disease classifiers (Kidneyโ€ฆ

    Python

  3. Pratheek-Tirunagari-and-Ashruj-Gautam/InsightEdge-AI-Powered-E-Commerce-Intelligence-Platform Pratheek-Tirunagari-and-Ashruj-Gautam/InsightEdge-AI-Powered-E-Commerce-Intelligence-Platform Public

    InsightEdge is a PostgreSQL-based e-commerce analytics system enhanced with LLM-powered feedback analysis. It combines SQL, PL/pgSQL, and fine-tuned LLMs to extract insights from structured data anโ€ฆ

    PLpgSQL

  4. Pratheek-Tirunagari-and-Ashruj-Gautam/Occlusion_Aware_Apple_Detection_Using_Custom_YOLO_Architecture Pratheek-Tirunagari-and-Ashruj-Gautam/Occlusion_Aware_Apple_Detection_Using_Custom_YOLO_Architecture Public

    An Android app for real-time apple detection using a custom YOLO architecture. Compared SSD, Faster R-CNN, and YOLO; enhanced YOLO for occlusion handling and small object detection using COCO and Mโ€ฆ

    Python

  5. Image-search-engine Image-search-engine Public

    A semantic image search engine built with CLIP and FAISS that allows searching by text descriptions or visual similarity.

    Python 2