Master's in Data Science at Stevens Institute of Technology
Data Science | AI/ML | Analytics | Big Data
I am a Master's student in Data Science at Stevens Institute of Technology with interests in machine learning, predictive modeling, time series, financial analytics, and large-scale data systems.
I like building projects that take messy data through a full workflow: cleaning, feature engineering, modeling, evaluation, and communication. This GitHub is being organized as a portfolio of practical data science and analytics work across forecasting, statistical learning, visualization, climate analytics, and distributed machine learning.
- M.S. in Data Science, Stevens Institute of Technology, 2024-2026
- GPA: 4.0/4.0
- Presidential Merit Scholarship
- Graduate Teaching Assistant for Mathematical Department
- Building end-to-end machine learning and analytics projects with cleaner documentation
- Strengthening work in forecasting, model evaluation, and feature engineering
- Exploring big data workflows with Spark and cloud-based pipelines
- Publishing more applied projects in forecasting, analytics, and machine learning
Languages and querying
Machine learning and analytics
Data engineering and cloud
| Project | Focus |
|---|---|
| U.S. Airline Departure Delay Prediction at Scale | Distributed PySpark and Spark MLlib pipeline with feature engineering, clustering, anomaly monitoring, and Dataproc scaling analysis |
| Climate Data Analysis and Rainfall Prediction | PCA, SVD, clustering, and machine learning on multi-decade climate data |
| Time Series Forecasting and Risk Analysis | ARIMA and SARIMA modeling, stationarity testing, residual diagnostics, and forecasting |
| Spotify Track Popularity Prediction with Statistical Learning | Statistical testing, regression modeling, and feature-driven popularity analysis |
| Analyzing EV Adoption: Mapping the Future of Clean Mobility | Tableau-based data visualization and storytelling around EV adoption trends |
- S&P 500 sector classification using unsupervised learning and financial data
- Synthetic fraud data generation and fraud detection modeling
- Additional academic and applied projects being cleaned and published one by one
- Databricks Certified Data Analyst Associate
- AWS Certified Data Engineer Associate