- - - - - Resumes: https://github.com/shirsh10mall/Resume-Shirsh - - - - -
- Aspiring Data Scientist π | MSc. Physics π
- Email: shirsh10malll@gmail.com , shirshmall10work@gmail.com | LinkedIn Profile: www.linkedin.com/in/shirsh10mall
- Master of Science (MSc) in Physics, Indian Institute of Technology (IIT Delhi) ποΈ, 2021-2023
- Bachelor of Science (BSc) in Physics Honours, Dyal Singh College, Delhi University π«, 2018-2021
- Programming Languages: Python π β’ Elementary C++
- Database: MySQL.
- Machine Learning π: Supervised Learning β’ Unsupervised Learning β’ Model Deployment π β’ Model Optimization βοΈ β’ Hyperparameter Tuning β’ Model Interpretability (SHAP) π§ β’ AutoML (H2O) π€ β’ Scikit-Learn β’ Classical Algorithms β’ Tree-Based Algorithms π² β’ Ensemble-based Algorithms.
- Natural Language Processing (NLP) π: Recurrent Neural Network (RNN) β’ Hugging Face Transformers π€ β’ Text Classification β’ Text Generation β’ Text Summarization π β’ Topic Modeling β’ Machine Translation π β’ Question Answering π¬ β’ NLP Data Preprocessing π§Ή β’ Generative AI with LLMs β’ Image-Text Multi-models πΌοΈπ β’ Semantic Search π β’ Vector Database β’ Named Entity Extraction.
- Computer Vision πΌοΈ: Image Processing β’ Convolutional Neural Networks (CNN) π§ π· β’ Image Segmentation β’ Object Detection π΅οΈ β’ Image Classification β’ Transfer Learning π β’ Feature Extraction β’ Image Semantic Search ππΌοΈ.
- Model Deployment Tools π: Streamlit β’ Flask β’ FastAPI π β’ Gradio π‘ β’ Heroku β’ Docker Containerization π³ β’ Elementary HTML and CSS π¨.
- Additional Skills : Time Series Forecasting β³πβ’ Web Scraping (BeautifulSoup, Selenium, Requests) πΈοΈ β’ Linux.
- Deep Learning Tools/Framework : NumPy β’ Pandas πΌ β’ Scikit-Learn β’ TensorFlow β’ PyTorch β’ Keras β’ Hugging Face Transformers π€ β’ NLTK β’ SpaCy β’ Gensim β’ Word2Vec β’ GloVe .
- Data Analysis and Visualization π: Data Wrangling π§Ή β’ Tableau β’ Plotly β’ Seaborn β’ Matplotlib .
Dedicated to securing a data science role where I can apply my skills and contribute to data-driven decision-making.
- Initial Years: Seeking to master data science skills, achieve continuous learning, and gain hands-on experience through collaborative projects.
- Future Aspiration: Aspiring to secure a Data Science Management position, leveraging my technical expertise and leadership skills.
Team Player π€ | Collaborative π€
I transitioned from a background in Physics to Data Science due to my strong affinity for computational skills, coding, and numerical analysis. Encouraged by professors' endorsements for my computational abilities during graduation and master's academic years, I discovered my passion for data science and machine learning, ultimately leading me to pursue a career in this dynamic field.
I have diligently undertaken a range of data science courses and successfully executed diverse projects. My journey began with foundational linear regression and simple machine learning projects, eventually progressing to creating interactive dashboards, advanced machine learning and deep learning endeavours, and foraying into intricate realms such as Computer Vision and advanced Natural Language Processing.
-
Road Traffic Accidents Severity Prediction
- Objective: Develop a machine learning model to predict the severity of road traffic accidents using multi-class classification algorithms.
- Link: https://github.com/shirsh10mall/Road-Traffic-Severity-Classification
-
Patient Survival Prediction
- Objective: Develop a predictive model for patient survival using medical data to aid urgent healthcare decisions.
- Link: https://github.com/shirsh10mall/Patient-Survival-Prediction
-
Site Energy Intensity Prediction
- Objective: Develop a machine learning model, primarily using CatBoost, to predict Site Energy Usage Intensity (Site EUI) for buildings, aiming to reduce energy consumption.
- Link: https://github.com/shirsh10mall/Site_Energy_Intensity_Prediction
-
Particle Precipitate Flux-Prediction Mesoscale
- Objective: Enhance electron particle precipitation prediction accuracy using curated DMSP data and advanced deep learning models.
- Link: https://github.com/shirsh10mall/Defense-Meteorological-Satellite-Program-DMSP---Particle-Precipitate-Flux-Prediction-Mesoscale-
-
Flu Shot Learning: Predict H1N1 and Seasonal Flu Vaccines
- Objective: Develop a predictive model to determine the likelihood of individuals receiving H1N1 and seasonal flu vaccines based on demographic, opinion, and health behaviour data.
- Link: https://github.com/shirsh10mall/Flu-Shot-Learning---Classification-Data-Science-Projects
-
Revenue Forecasting in Dynamic Price System:
- Objective: Develop a revenue forecasting model for dynamic pricing optimization in e-commerce by analyzing historical data to predict revenue per user action and enable dynamic price adjustments.
- Link: https://github.com/shirsh10mall/Revenue-Forecast-for-Dynamic-Pricing
-
Object Detection for Traffic Vehicle Detection:
- Objective: Develop an accurate object detection system for classifying various vehicle types in road traffic using the IITM-HeTra dataset, with a focus on enhancing traffic management and safety.
- Link: https://github.com/shirsh10mall/Object-Detection-for-Traffic-Vehicle-Detection
-
Cotton Disease Prediction - Image Classification
- Objective: Develop a robust image classification model using Convolutional Neural Networks (CNNs) to accurately predict diseases in cotton plants.
- Link: https://github.com/shirsh10mall/Cotton-Disease-Prediction-Image-Classification-
-
Water Bodies Segmentation in Satellite Images
- Objective: Precisely segment water bodies in Sentinel-2 satellite images using binary image segmentation techniques.
- Link: https://github.com/shirsh10mall/Water-body-segmentation-in-Satellite-Images-Image-Segmentation-
-
Semantic Search-based News Recommendation and Wikipedia Articles Question
- Objective: Develop an innovative Semantic Search-based News Recommendation and Wikipedia Articles Question Answering System to enable efficient exploration of news articles and accurate information retrieval.
- Link: https://github.com/shirsh10mall/Semantic-News-Search-and-Question-Answering-Wiki-Articles
-
Fine-Tuning Google Flan T5 Large LLM for Instruction-Based Question Answering (Instruct QA) - LLM QLoRA PEFT - Open Orca Dataset
- Objective: Fine-tune the Flan T5 Large model to answer questions from the Open Orca dataset with task-specific instructions, employing PEFT and Transfer Learning for enhanced performance.
- Link: https://github.com/shirsh10mall/Instruct-QnA-Fine-Tuning-Google-Flan-T5-Large-LLM-QLoRA-PEFT-Open-Orca-Dataset
-
English-Hindi Neural Machine Translation
- Objective: Develop a neural machine translation system to translate English sentences into Hindi using an Encoder-Decoder architecture based on Bidirectional GRU with Multi-head Attention Layers and Fine-tuned pre-trained model.
- Link: https://github.com/shirsh10mall/English-to-Hindi-Text-Translation-
-
Topic Modelling on Customer Complaints
- Objective: Apply topic modelling techniques (LSA, LDA, and BERTopic) to cluster customer complaints, revealing underlying topics to enhance understanding of consumer feedback and improve financial products and services.
- Link: https://github.com/shirsh10mall/Topic-Modeling-BERTopic-LSI-LDA-Customer-Complaints
-
Text Classification - Cyberbullying Detection and E-commerce Product Classification:
- Objective: Develop accurate text classification models to identify instances of cyberbullying content and classify E-commerce products based on their descriptions into multiple categories.
- Link: https://github.com/shirsh10mall/Text-Classification-NLP-Binary-Multiclass
- Image-Text Multi-Modal Deep Learning
- Objective: Develop a multi-modal deep learning solution for Image Captioning (ViT-BERT) and Image Retrieval (CLIP Model).
- Link: https://github.com/shirsh10mall/Image-Text-Multi-Models
- Stocks and Financial Analysis Dashboard
- Objective: Create an interactive Tableau dashboard for analyzing historical stocks and financial data of Infosys and TCS, providing investors with insights to make informed investment decisions.
- Link: https://github.com/shirsh10mall/Stocks-and-Financial-Analysis-Dashboard
-
MSc Thesis - Advancing Computational Particle Physics through Data and Deep Learning
- Objective: Employ data analysis, simulation, and machine learning to enhance our understanding of particle collisions at the LHC and contribute to advancements in particle physics beyond the Standard Model.
- Link: https://github.com/shirsh10mall/MSc-Project---Studies-at-Large-Hadron-Collider
-
Course Assignments
-
Link: https://github.com/shirsh10mall/Deep-Learning-Course-IIT-Delhi-APL745-
-
Non-Linear Regression and Projectile Motion: Implemented numpy and scipy-based univariate and multivariate non-linear regression models for projectile motion prediction.
-
Classification Algorithms and Optimization Techniques: Developed Logistic Regression and explored techniques like Gradient Descent, Stochastic Gradient Descent, and Mini-batch Gradient Descent for binary and multi-class classification.
-
Artificial Neural Network (ANN) for Classification: Built numpy-based ANN, performed binary/multi-class tasks on MNIST data, grasped feedforward, backpropagation, and gradient descent.
-
Convolutional Neural Network (CNN) for Image Classification: Created numpy-powered CNN, trained on MNIST images, understood convolution, pooling, and fully connected layers.
-
Recurrent Neural Network (RNN) for Time Series Data: Implemented basic RNN from scratch, applied on time series, and grasped sequential data handling.
-
Solving Differential Equations with PINN: Used PyTorch-based PINN to solve static bar problems' differential equations under specific loads.
-
Elasticity Partial Differential Equations using PINN: Applied PINN to solve elasticity-related partial differential equations for 2D planes, merging physics with neural networks.
-
DeepONet Architecture for Integration Operator: Implemented DeepONet, trained to learn integration, showcasing neural network's mathematical concept mastery.
-