Skip to content
View dsmoove's full-sized avatar

Block or report dsmoove

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
dsmoove/README.md

Hi there 👋

About Me.

I am a data scientist currently looking for work. I have a background in Supply Chain and a passion for learning.

Projects

This project aims to create a model that will capture any observations on the field including game situations and offense, that will help to make a better decision on defensive side of the ball. The model looks at the positioning of the players on offense on the field and the different types personnel on the field (WR/TE/RB). Teams employ different offensive personnel breakdowns (WR/TE/RB). These positions will be explained in more detail later. The model also looks at the game situation of the down situation and the yards needed for a first down. The goal of this model is to address the issue of making decisions analytically and quickly by calculating the EPA of the next play and recommending the coverage that predicts the lowest EPA. EPA is short for Expected Points Added, which will be the target variable for our model.

I created a model that used the King County data set that predicted the housing prices in that region. In specific, I built a linear regression model and looked to isolate the top variables that affect price while also generating additional features to better predict prices. Polynominal models were also generated in order to generate a model of better fit. The model has its shortcomings as there can be many factors that are missing from the dataset.

My partner and I built a various Multi-Classification models (Logistic Regression, KNN, Decision Trees, Random Forest, and XGBoost) to prove that music can be classified by the time period they originate from by their musical attributes. New artist discovery through genre classification not only benefits users, but also artists and Spotify. Unknown artists benefit from more methods of discovery and Spotify potentially gains more revenue and more data.

Pinned Loading

  1. KingCountyHousing KingCountyHousing Public

    Jupyter Notebook

  2. pigskinplaycaller pigskinplaycaller Public

    Football Predictor

    Jupyter Notebook

  3. khyateed/song-classification-project khyateed/song-classification-project Public

    Multi-label classification of Spotify rock songs into musical eras, using Logistic Regression, KNN, Decision Trees, Random Forest, and XGBoost (Flatiron Project 3)

    Jupyter Notebook 1