Skip to content

TivoK/DataScienceRepo.ai

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

110 Commits
 
 

Repository files navigation

DataScienceRepo.ai

This repository contains a collection of my Data Science related projects; specifically focused on Machine Learning - including Supervised, Unsupervised and Deep Learning methods. The assortment of projects contained in the repository leverage a variety of technologies/algorithms; and are organized as such. Please note that the content found here is open-source and can be utilized as a study or reference aid for the general public in implementing their own prediction models.

Supervised Learning

Objective Method Library/Frameworks Data About Link
Regression Univariate Linear Regression Sci-Kit Learn, Pandas, Matplotlib, SciPy & NumPy 2015 NBA Player Data Linear Regression Model for NBA Player Weights. Nbviewer
Regression Multivariate Linear Regression (Ridge Regression/Polynomial Regression) Sci-Kit Learn, Pandas, Matplotlib, SciPy & NumPy UCI Auto MPG Data A comparison of Ridge Regression and Polynomial Regression on MPG Data. Nbviewer
Regression Multivariate Polynomial Regression (Ridge /Lasso /Elastic-Net) Sci-Kit Learn, Pandas, Matplotlib & NumPy CalcoFi Comparative analysis of various Regularization techniques. Nbviewer
Classification KNN/Decision Tree Sci-Kit Learn, Pandas, Matplotlib & NumPy UCI Audit Data An assessment of a KNN classification model vs. a Decision Tree classifier. Nbviewer
Ensemble (Classification) Random Forest/KNN Sci-Kit Learn, Pandas & Matplotlib stats.nba.com An All-Star Classifier based on NBA Player Per Game Averages. Nbviewer
Multi-Class Classification Random Forest/KNN Sci-Kit Learn, Pandas, Sci-Py & Matplotlib Rapid-API.com -Hotel API A Multi-Class classfication project for Hotel Star-Ratings. Nbviewer
Classification SVM/AdaBoost Sci-Kit Learn, Pandas, & Matplotlib Spotify API A Binary-Class classfication project for Metal and Classical tracks. Nbviewer
Regression Lasso/AdaBoost/
GradientBoosting
Sci-Kit Learn, Pandas, Seaborn, & Matplotlib Teleport API Predict Life Expectancies in Urban Areas around the World. features. Nbviewer
Regression XGBoost Sci-Kit Learn,BaseMap,XGBoost, Pandas, Seaborn, & Matplotlib Mashvisor AirBnb API Predict Property Listing Prices in San Fransciso/Bay Area. features. Nbviewer

Unsupervised Learning

Objective Method Library/Frameworks Data About Link
Clustering K-Means / T-SNE Sci-Kit Learn, Pandas, Matplotlib, & NumPy Alpha Vantage API Perform K-Means Clustering on Index Fund Closing Price Movements. We'll see if the K-Means Algorithm can find insights related to fund types based on Closing Prices. Nbviewer
Dimension Reduction PCA Sci-Kit Learn, Pandas, Matplotlib, SciPy & NumPy Chicago City Data Portal API A Data Engineering and Dimension Reduction project with Chicago City Block Data. Nbviewer
Clustering Hierarchical Clustering Scipy, Pandas, Matplotlib Spotify API A Data Engineering and Clustering project with songs from The Beatles. Nbviewer

Deep Learning

Objective Method Library/Frameworks Data About Link
Regression Neural Network Sci-Kit Learn, Pandas, Matplotlib, & NumPy, Keras Beijing Multi-Site Air-Quality Data Data Set A Deep Learning that predicts S02 (sulphur dioxide) concentration levels in Bejing based on air quality monitoring sites. Nbviewer
Classification (Multi-Class) Neural Network Sci-Kit Learn, Pandas, Matplotlib, & NumPy, Keras OpenML.org Letter Data Set A Deep Learning model that classify Letters. Nbviewer

Web Scraping/Data Wrangling & Data Cleaning Projects

Library/Frameworks About Link
Beautiful Soup & Pandas Scraping NBA player data from a webpage Nbviewer
Requests/Pandas/Numpy Hotel API Data Engineering Project Nbviewer

About

An Assortment of Data Science Projects

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors