Skip to content
View Pratham-Jain-3903's full-sized avatar
๐Ÿ’ญ
Working...
๐Ÿ’ญ
Working...

Block or report Pratham-Jain-3903

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse
Pratham-Jain-3903/README.md

๐Ÿ‘‹ Pratham Jain - Data Engineer/ SWE - AI and Data

Profile Views

Welcome to my GitHub profile! Iโ€™m Pratham Jain, a dedicated Data Scientist with a strong foundation in data engineering, machine learning, and AI-driven solutions. My work is centered around transforming data into actionable insights that drive business growth and innovation. Browse through my projects and see my approach to tackling real-world problems using data science and engineering.


๐Ÿ“ง Contact Information

Email LinkedIn GitHub HackerRank


๐ŸŒŸ Professional Summary

I'm a Software Engineer and Data Scientist with a passion for using technology to solve real-world problems. Proficient in Python, SQL, and machine learning, I am driven by a desire to extract valuable insights from data and build innovative solutions for efficiency and growth. I aim to work with advanced technologies in data science and engineering, enhancing both personal and team performance.


๐Ÿ† Achievements

  • Amazon ML Challenge 2024: Ranked 84 / 74,830
  • Luminous TechnoX 2024 First Runner Up (Rank 2/ 9600 teams)
  • Flipkart Grid 6.0: Level 2 in Robotic Track & SDE Track
  • AWS DeepRacer Competition (Asia-Pacific 2023): Top 20
  • Numerai Competition: Top 8% consistently
  • Kimo.ai AI and Machine Learning Hackathon: 1st Place

๐Ÿ’ผ Key Skills

Languages & Frameworks

Python C++ Java

Data Engineering & Analysis

Apache Spark Pandas Numpy

Cloud & Database

AWS GCP MongoDB PostgreSQL

DevOps & CI/CD

Docker Kubernetes Jenkins

Machine Learning & AI

TensorFlow PyTorch Scikit-Learn


๐Ÿ’ป Tech Stack

Python SQL Tableau AWS Docker


๐Ÿ“ Experience Highlights

Data Engineer

R&D Department, Luminous India (Schneider Electric Group)
Feb 2025 โ€“ Aug 2025

  • API Deployment & Optimization: Deployed and optimized APIs on serverless Azure Functions and VMWare AI Foundry instances using PM2. Engineered a backend pipeline with Gunicorn, Postman, and Azure Virtual Machines to process user queries from mobile Customer Support systems, ensuring high scalability, low latency, and seamless interactions for the Cache Augmented Generative (CAG) chatbot.
  • Security & Guardrailing: Integrated guardrailing and validator agents to inspect and sanitize API responses, effectively mitigating prompt and SQL injection vulnerabilities.
  • Local-First Architecture: Designed a Local-First architecture with an in-memory database synchronized via an event-driven backend to Azure CosmosDB, reducing query latency by 78% for 10K customers. Developed peripheral APIs for real-time document updates and automated reindexing in the vector store, ensuring accurate and efficient retrieval within the CAG framework.
  • AI Agent Orchestration: Built adaptive AI agents that dynamically adjust token limits and orchestrated specialized agents to handle queries for both Luminous and Amaze, delivering tailored responses at 30% lower cost, a 7% increase in customer satisfaction, and an 88% reduction in delays.

Data Engineer

Raichur Institute of Medical Sciences
Aug 2024 โ€“ Present

  • Hybrid Cloud Storage Management: Optimized patient record storage for 2,000+ patients yearly using AWS S3 & EC2.
  • Data Access Solution: Built a MongoDB-backed access portal, streamlining patient data retrieval for healthcare providers.

Research Assistant

Bosch Global Software Technologies | HVAC Systems - AI-Driven HVAC Efficiency Optimization
Mar 2024 โ€“ Mar 2025

  • Real-Time Data Pipelines: Developed scalable software solutions for real-time data pipelines in HVAC systems using Python, processing 300+GB of IoT signals.
  • Fault-Tolerant Systems: Engineered distributed fault-tolerant systems for STM32 microcontrollers, enhancing real-time processing and system reliability.
  • Cloud Deployment: Deployed solutions on AWS (S3, EC2, Lambda), ensuring seamless access and optimized resource management.
  • Energy Optimization: Increased energy savings by 15.5% (p<0.00003) through improved model performance and system optimization.

Research Assistant

Bosch Global Software Technologies & Medical Institutions | Multimodal AI-Powered Breast Cancer Screening System
Jan 2024 โ€“ Nov 2024

  • Distributed Backend Architecture: Designed and deployed a distributed backend architecture on AWS EC2 for a multimodal screening system, achieving 96.8% diagnostic accuracy (p<0.05).
  • API & Docker Integration: Implemented APIs for real-time data ingestion and integration with Dockerized services, enabling scalable production deployments.
  • System Validation: Conducted system validation on FNAC datasets with 500+ medical images, achieving 97.02% accuracy.

๐ŸŽ“ Education

B.Tech, Computer Science & Engineering
Indian Institute of Information Technology, Raichur
Expected Graduation: December 2025
CGPA: 8.1


Recent Blog Posts

Thank you for exploring my GitHub profile! Feel free to check out my repositories and reach out if you'd like to collaborate on projects or discuss data science and engineering solutions.

Pinned Loading

  1. CRM_Mini_Project Public

    GenAI Credit Platform

    JavaScript 1

  2. AmazonMLChallenge24 Public

    Our team ranked 84th out of 74,830 teams in the Amazon ML Challenge 2024. The challenge involved building a scalable machine learning solution without using external APIs or gateways, leveraging onโ€ฆ

    Python

  3. BGSW-CAD-BreastCancerPrediction Public

    This repository hosts the open-source implementation of a Breast Cancer Prediction System developed under the collaboration between Bosch Global Software Technologies (BGSW) and the Indian Institutโ€ฆ

    Jupyter Notebook

  4. Flipkart_Grid_6.0 Public

    Python

  5. Luminous-TechnoX-Hackathon-Submission-2024 Public

    Luminous TechnoX Hackathon Submission 2024

    Jupyter Notebook 2

  6. AGN_regression_research_paper Public

    Python