Skip to content

nazstephen/Data-Science

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

294 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Projects

Compilation of all projects I've completed throughout TripleTen's Data Science Bootcamp

NBA-Archetypes

Name Description Libraries
Python Project In this project, you will work with data from the entertainment industry. You will study a dataset with records on movies and shows. The research will focus on the "Golden Age" of television, which began in 1999 with the release of The Sopranos and is still ongoing. The aim of this project is to investigate how the number of votes a title receives impacts its ratings. pandas
EDA Project Instacart is a grocery delivery platform where customers can place a grocery order and have it delivered to them, similar to how Uber Eats and Door Dash work. Your mission is to clean up the data and prepare a report that gives insight into the shopping habits of Instacart customers. matplotlib and pandas
SDA Project You work as an analyst for the telecom operator Megaline. The company offers its clients two prepaid plans, Surf and Ultimate. The commercial department wants to know which of the plans brings in more revenue in order to adjust the advertising budget. You are going to carry out a preliminary analysis of the plans based on a relatively small client selection. matplotlib, numpy, pandas and scipy
Software Development Tools Project This project is an interactive web application built with Streamlit, designed for analyzing a car advertisement dataset. It provides users with a platform to explore various attributes of cars, including condition, model year, price, and mileage. Through features like histograms and scatter plots, users can gain insights into relationships between these attributes, such as understanding the distribution of car conditions across model years and the impact of mileage on car prices. pandas, plotly and streamlit
Integrated Project 1 You work for the online store Ice, which sells video games all over the world. User and expert reviews, genres, platforms (e.g. Xbox or PlayStation), and historical data on game sales are available from open sources. You need to identify patterns that determine whether a game succeeds or not. This will allow you to spot potential big winners and plan advertising campaigns. matplotlib, numpy, pandas and scipy
SQL Project You're working as an analyst for Zuber, a new ride-sharing company that's launching in Chicago. Your task is to find patterns in the available information. You want to understand passenger preferences and the impact of external factors on rides. BeautifulSoup, matplotlib, pandas, requests and scipy
Machine Learning Project Mobile carrier Megaline has found out that many of their subscribers use legacy plans. They want to develop a model that would analyze subscribers' behavior and recommend one of Megaline's newer plans: Smart or Ultra. Develop a model with the highest possible accuracy. pandas and sklearn
Supervised Learning Project Beta Bank customers are leaving: little by little, chipping away every month. The bankers figured out it’s cheaper to save the existing customers rather than to attract new ones. Predict whether a customer will leave the bank soon by building a machine learning model with the maximum possible F1 score. matplotlib, numpy, pandas and sklearn
Machine Learning in Business Project You work for the OilyGiant mining company. Your task is to find the best place for a new well. You have data on oil samples from three regions. Parameters of each oil well in the region are already known. Build a model that will help to pick the region with the highest profit margin. Analyze potential profit and risks using the Bootstrapping technique. numpy, pandas, scipy and sklearn
Integrated Project 2 Prepare a prototype of a machine learning model for Zyfra. The company develops efficiency solutions for heavy industry. The model should predict the amount of gold recovered from gold ore. You have the data on extraction and purification. The model will help to optimize the production and eliminate unprofitable parameters. matplotlib, numpy, pandas, seaborn and sklearn
Linear Algebra Project The Sure Tomorrow insurance company wants to solve several tasks with the help of Machine Learning and you are asked to evaluate that possibility. Protect clients' personal data by developing a data transformation algorithm that would make it hard to recover personal information if the data fell into the wrong hands. numpy, pandas, seaborn and sklearn
Numerical Methods Project Develop a predictive model for Rusty Bargain that accurately estimates car values based on historical data while optimizing for prediction accuracy, speed, and training time. catboost, lightgbm, matplotlib, numpy, pandas, seaborn, sklearn, time and xgboost
Time Series Project Sweet Lift Taxi company has collected historical data on taxi orders at airports. To attract more drivers during peak hours, we need to predict the amount of taxi orders for the next hour. Build a model for such a prediction. lightgbm, matplotlib, numpy, pandas, pmdarima, sklearn and statsmodels
Machine Learning for Texts Project The Film Junky Union, a new edgy community for classic movie enthusiasts, is developing a system for filtering and categorizing movie reviews. The goal is to train a model to automatically detect negative reviews. You'll be using a dataset of IMBD movie reviews with polarity labelling to build a model for classifying positive and negative reviews. lightgbm, matplotlib, nltk, numpy, pandas, seaborn, sklearn, spacy and tqdm
Computer Vision Project The supermarket chain Good Seed would like to explore whether Data Science can help them adhere to alcohol laws by making sure they do not sell alcohol to people underage. You are asked to conduct that evaluation. Utilized Computer vision methods to determine age of a person from a photo. matplotlib, pandas and tensorflow
Unsupervised Learning Project This project uses K-means clustering, an unsupervised learning technique, to analyze NBA player data and derive distinct player archetypes based on performance metrics. By grouping players into clusters, I identified key archetypes that highlight different playing styles and roles within the game. matplotlib, numpy, pandas, seaborn and sklearn
Final Project Develop a machine learning framework to predict client churn for Interconnect, a telecom operator. Interconnect's goal is to implement targeted marketing strategies that reduce churn rates and enhance customer loyalty. Employ strategies such as data preprocessing, exploratory data analysis, and the application of various classification models to determine the most effective approach for churn prediction. catboost, functools, lightgbm, matplotlib, numpy, pandas, seaborn, sklearn and xgboost

About

TripleTen Bootcamp Projects

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published