Data Analysis projects I completed when going through the Data Analysis training course in Practicum by Yandex.

yyeusiyevich/data_analysis_projects

Yandex.Practicum Data Analysis Projects

This repository contains a portfolio of data analysis projects I completed during the training courses at Yandex.Practicum.

Programming languages: Python, SQL.

IDE: Jupyter Notebook.


| Project | Description | Tags |
|---|---|---|
| Yandex Music | Based on real data provided by the Yandex.Music platform, we analyze the weekly activity and musical preferences of users from two big cities: Moscow and St. Petersburg. | Advertising platforms, Internet services, Internet shops, Digital-agencies / Marketing / PR / SEO, Startups, Tech-company, Business services [b2b]. |
| Credit Score Analysis | Based on clients' solvency statistics, we find out whether a customer's marital status, age, education, number of children and some other factors affect whether they will default on a loan. | Banking / FinTech, Startups, Tech-company, Business services [b2b], Lending, Financial analyst. |
| Real Estate Market Analysis | Based on real data provided by the Yandex.Realty platform, we determine the market value and define the parameters of real estate properties in St. Petersburg, Russia, and the surrounding areas. | Internet services, Advertising platforms, Services for business [b2b] (outsourcing consulting audit), Startups, Tech-company, Marketing analyst, Fraud-analyst, Fraud monitoring. |
| Film Market Analysis | Based on real data provided by the Kinopoisk platform and the Department of Culture's web portal, we identify the main trends in the film industry in general and in the Russian segment in particular. | Internet services, Streaming platforms, Cinema, Offline, Advertising platforms, Marketing analyst, Data Analyst, Financial Analyst, Data Visualization. |
| Startup Investments | Hundreds of new companies are founded each day, and venture capital has become a substantial asset class. Using SQL queries, we analyze a database that contains information about the startup ecosystem. | Startups, Investments, Data Analyst, Financial Analyst, Reporting Analyst. |
| Catering Market Analysis | Using open data, we analyze the catering market in Moscow to support the opening of a new restaurant chain. Based on the research, we prepare a presentation for investors. | Startups, Investments, Catering, Tech-company, Marketing analyst, Data Analyst, Offline, Data Visualization. |
| Yandex Zen | In this project we automate the analysis of user interaction with article cards by creating an interactive dashboard (Tableau) based on the pipeline developed for us by data engineers. | Advertising platforms, Internet services, Startups, Tech-company, Offline, Marketing analyst, Data Analyst, BI-analyst, Data Visualization, Tableau. |
| NPS (Net Promoter Score) | A large telecommunications company needs to determine its current level of customer loyalty, or NPS (Net Promoter Score). We prepare an interactive dashboard with the results of this survey. | Tableau, Data Visualization, SQL, SQLAlchemy, Telecommunications, Marketing analyst, Data Analyst, BI-analyst, Reporting Analyst, Product Analyst. |
| Procrastinate Pro+ | This project focuses on calculating and analyzing standard business metrics such as retention rate, conversion rate, lifetime value (LTV), customer acquisition cost (CAC), and return on marketing investment (ROMI). We try to understand the causes of the company's losses and help the business start making money. The data are server logs of user visits, the number of orders, and marketing expenses for the given period. The project concludes with recommendations based on the analysis. | Data processing, Retention Rate (RR), Conversion Rate, LTV, Online services, Marketing analyst, Product analyst. |
| Stack Overflow | Stack Overflow is a professional community for developers. In this project we use SQL queries to analyze a database that contains data on posts for 2008. The project has two parts: a Jupyter notebook (connecting to the database with SQLAlchemy and displaying the output as dataframes) and an SQL file. Some tasks in the notebook are visualized (pie chart, heatmap). | SQLAlchemy, Window Functions, Subqueries, Join, Common Table Expression (CTE), Retention Rate, Pandas, Seaborn, Internet Services, Data Analyst, Reporting Analyst, Product Analyst. |
| Telecom | The telecom operator Megaline offers its clients two prepaid plans, Smart and Ultra. The commercial department wants to know which plan brings in more revenue in order to adjust the advertising budget. As data analysts working for the operator, we carry out a preliminary analysis of the plans based on a relatively small sample of 500 clients. | Telecommunication, Startups, Business services [b2b], Scipy, Histogram, T-test, Hypothesis Testing, Marketing analyst, Data analyst, Financial Analyst. |
| A/B-test | In this project we work as analysts at a big online store. Together with the marketing department, we've compiled a list of hypotheses that may help boost revenue. We need to prioritize these hypotheses, launch an A/B test, and analyze the results. | A/B-testing, Internet Services, Scipy stats, A/B-test, ICE, RICE, Hypothesis testing and prioritization, Null and alternate hypotheses, P-value, Exploratory Data Analysis, Marketing analyst, Product Analyst. |
| Gym (ML) | A gym chain is developing a customer interaction strategy based on analytical data. One of the most common problems gyms and other services face is customer churn. For a gym, it makes sense to say a customer has left if they don't come for a month. To fight churn, the gym chain has digitized a number of its customer profiles. Our task is to analyze them and come up with a customer retention strategy. | Dendrogram, K-means, Normalization, Standardization, Accuracy, Precision, ROC-AUC, Logistic Regression, Random Forest Classifier, Exploratory Data Analysis, Marketing analyst, Product Analyst. |
| Games | User and expert reviews, genres, platforms (e.g. Xbox or PlayStation), and historical data on game sales are available from open sources. As analysts at a store that sells games, we need to identify patterns that determine whether a game succeeds or not. This will allow us to spot potential big winners and plan advertising campaigns. We have data from 2016. Let's imagine that it's December 2016 and we're planning a campaign for 2017. | Games, Internet Services, Scipy stats, Hypothesis testing, Null and alternate hypotheses, P-value, Histogram, Barplot, Data Analyst. |
| Space Brothers (Final) | Many games make money through advertising, and they all face the same trade-off: users get angry and leave if ads start too early, but the later ads start, the less money the creators make. We help the business choose the optimal time to start showing ads. Knowing the cost of promotion, we can calculate its payback under different scenarios. | Monetization, Video Games, Internet Services, Scipy stats, Hypothesis testing, Null and alternate hypotheses, P-value, Histogram, Barplot, Data Analyst. |
| SQL (Final) | This project analyzed the database of an online reading application, with data about books, publishers, authors, and user reviews of books. The goal was to answer questions about the number of books, reviews and ratings, and the publishers and authors associated with them. | Python, SQL, SQLAlchemy, Database, Database Management System, Database Query, Database Programming, Books, Reading. |
| A/B-test (Final) | This project focused on analyzing the results of an A/B test. To evaluate whether the test was run correctly, we used the z-test (z-criterion) and checked whether the test audience overlapped with that of a competing test and whether the test period coincided with marketing events. The project also included data analysis, results evaluation, and hypothesis testing. | A/B Testing, User Actions, Technical Task, Data Analysis, Results Evaluation, Z-Criterion, Test Audience, Marketing Events, Data Analytics, Analysis Tools, Data Visualization, Statistical Analysis, Hypothesis Testing, KPI Analysis, Monitoring Metrics, AB Testing Strategies, Randomization Testing, Experiment Design. |
| SQL Adventure Works (additional) | The main goal of the project is to analyze the database using SQL queries (PostgreSQL). Adventure Works is a Microsoft sample database for online transaction processing (OLTP). It models a fictitious multinational manufacturing company called Adventure Works Cycles. | Exploratory Data Analysis, SQL, PostgreSQL, DML, Window Functions, Window Frames, Filtering, Grouping, Profit, Data Analysis, Data Analyst, Conversion, Retention Rate, Business Metrics. |
| SQL Northwind (additional) | The main goal of the project is to analyze the database using SQL queries (PostgreSQL). Northwind is a sample database originally created by Microsoft and used for decades as the basis for tutorials across a variety of its database products. It contains the sales data for a fictitious company called "Northwind Traders", which imports and exports specialty foods from around the world. | Exploratory Data Analysis, SQL, PostgreSQL, DML, Window Functions, Window Frames, Filtering, Grouping, Profit, Data Analysis, Data Analyst, Conversion, Retention Rate, Business Metrics. |
| ProCharity (additional) | ProCharity is an online platform that connects skilled volunteers with non-profit organizations to address specialized issues. Through the platform, NGOs can find helpers, volunteers can contribute their knowledge and skills, and partner companies can increase employee loyalty and promote philanthropy. Our project analyzes volunteer behavior and activity on ProCharity between 2020 and 2022, focusing on key metrics such as registration, moderation, and retention. | Exploratory Data Analysis, Data Visualization, Marketing Analyst, Reporting Analyst, Finance, Startups, Tech-company, Charity, Business Metrics, Retention Rate, Conversion, Business Funnels. |
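
Several of the projects above rely on standard scoring formulas: NPS in the telecom survey project, and ICE/RICE for hypothesis prioritization in the A/B-test project. The sketch below is not taken from the project notebooks; it only illustrates the common textbook definitions. The function names and sample numbers are hypothetical, and ICE is assumed here to be impact × confidence / effort (some sources multiply by an "ease" score instead).

```python
def nps(scores):
    """Net Promoter Score, in percentage points, from 0-10 survey answers.

    Assumes the usual convention: 9-10 are promoters, 0-6 are detractors.
    """
    if not scores:
        raise ValueError("scores must not be empty")
    promoters = sum(1 for s in scores if s >= 9)
    detractors = sum(1 for s in scores if s <= 6)
    return 100 * (promoters - detractors) / len(scores)


def ice(impact, confidence, effort):
    """ICE priority score (assumed form: impact * confidence / effort)."""
    return impact * confidence / effort


def rice(reach, impact, confidence, effort):
    """RICE priority score: ICE scaled by how many users the change reaches."""
    return reach * impact * confidence / effort


# Hypothetical sample data, for illustration only.
survey = [10, 9, 8, 7, 6, 3, 10, 9, 5, 10]
print(nps(survey))            # 5 promoters, 3 detractors out of 10 answers
print(ice(8, 5, 4))           # priority without reach
print(rice(1000, 2, 0.5, 4))  # priority with reach
```

Because RICE multiplies by reach, a hypothesis that ranks high under ICE can drop once the number of affected users is taken into account, which is why the two rankings are usually compared side by side.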
