This repository contains a collection of my Data Science projects, focused specifically on Machine Learning, including Supervised, Unsupervised and Deep Learning methods. The projects use a variety of technologies and algorithms and are organized accordingly. The content is open source and can be used as a study or reference aid by anyone implementing their own prediction models.
| Objective | Method | Library/Frameworks | Data | About | Link |
|---|---|---|---|---|---|
| Regression | Univariate Linear Regression | Sci-Kit Learn, Pandas, Matplotlib, SciPy & NumPy | 2015 NBA Player Data | Linear Regression Model for NBA Player Weights. | Nbviewer |
| Regression | Multivariate Linear Regression (Ridge Regression/Polynomial Regression) | Sci-Kit Learn, Pandas, Matplotlib, SciPy & NumPy | UCI Auto MPG Data | A comparison of Ridge Regression and Polynomial Regression on MPG Data. | Nbviewer |
| Regression | Multivariate Polynomial Regression (Ridge/Lasso/Elastic-Net) | Sci-Kit Learn, Pandas, Matplotlib & NumPy | CalCOFI | Comparative analysis of various Regularization techniques. | Nbviewer |
| Classification | KNN/Decision Tree | Sci-Kit Learn, Pandas, Matplotlib & NumPy | UCI Audit Data | An assessment of a KNN classification model vs. a Decision Tree classifier. | Nbviewer |
| Ensemble (Classification) | Random Forest/KNN | Sci-Kit Learn, Pandas & Matplotlib | stats.nba.com | An All-Star Classifier based on NBA Player Per Game Averages. | Nbviewer |
| Multi-Class Classification | Random Forest/KNN | Sci-Kit Learn, Pandas, SciPy & Matplotlib | Rapid-API.com - Hotel API | A Multi-Class classification project for Hotel Star-Ratings. | Nbviewer |
| Classification | SVM/AdaBoost | Sci-Kit Learn, Pandas & Matplotlib | Spotify API | A Binary-Class classification project for Metal and Classical tracks. | Nbviewer |
| Regression | Lasso/AdaBoost/GradientBoosting | Sci-Kit Learn, Pandas, Seaborn & Matplotlib | Teleport API | Predict Life Expectancies in Urban Areas around the World. | Nbviewer |
| Regression | XGBoost | Sci-Kit Learn, BaseMap, XGBoost, Pandas, Seaborn & Matplotlib | Mashvisor Airbnb API | Predict Property Listing Prices in San Francisco/Bay Area. | Nbviewer |

| Objective | Method | Library/Frameworks | Data | About | Link |
|---|---|---|---|---|---|
| Clustering | K-Means/t-SNE | Sci-Kit Learn, Pandas, Matplotlib & NumPy | Alpha Vantage API | Apply K-Means Clustering to Index Fund Closing Price Movements to see whether the algorithm can find insights related to fund types based on Closing Prices. | Nbviewer |
| Dimension Reduction | PCA | Sci-Kit Learn, Pandas, Matplotlib, SciPy & NumPy | Chicago City Data Portal API | A Data Engineering and Dimension Reduction project with Chicago City Block Data. | Nbviewer |
| Clustering | Hierarchical Clustering | SciPy, Pandas & Matplotlib | Spotify API | A Data Engineering and Clustering project with songs from The Beatles. | Nbviewer |

| Objective | Method | Library/Frameworks | Data | About | Link |
|---|---|---|---|---|---|
| Regression | Neural Network | Sci-Kit Learn, Pandas, Matplotlib, NumPy & Keras | Beijing Multi-Site Air-Quality Data Set | A Deep Learning model that predicts SO2 (sulphur dioxide) concentration levels in Beijing based on data from air quality monitoring sites. | Nbviewer |
| Classification (Multi-Class) | Neural Network | Sci-Kit Learn, Pandas, Matplotlib, NumPy & Keras | OpenML.org Letter Data Set | A Deep Learning model that classifies Letters. | Nbviewer |

| Library/Frameworks | About | Link |
|---|---|---|
| Beautiful Soup & Pandas | Scraping NBA player data from a webpage | Nbviewer |
| Requests/Pandas/NumPy | Hotel API Data Engineering Project | Nbviewer |