This repo contains all the projects completed as part of the Post Graduate Program in Data Science and Data Management Systems, a professional certification from the University of Texas at Austin.
A food aggregator company stores the orders placed by registered customers on its online portal. It wants to analyze this data to understand the demand for different restaurants, which will help it improve the customer experience. The Data Science team has shared some key questions that need to be answered. Perform the data analysis to answer these questions and help the company improve its business.
Web Scraping, Pandas, Dataframes, Seaborn, Exploratory Data Analysis, NumPy, Descriptive Statistics, RegEx
New-Wheels sales have been dipping steadily over the past year, and critical customer feedback and ratings online have led to a drop in new customers every quarter, which is concerning. The data is not organized: it is dumped as flat files that are used only occasionally. Create a pipeline to organize and maintain this data in a SQL database so that questions become easy to answer. Then use the data to answer the questions posed and create a Quarterly Business Report for the CEO.
MySQL, Normalizing Data Schemas with DDL, Querying the Data with DML, Tables, Views, and Functions, Automating Data Transformation with Stored Procedures, Creating Business Presentations
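The flat-file-to-database step described for the New-Wheels project can be sketched as follows. This is a minimal illustration, not the project's actual pipeline: it uses Python's built-in `sqlite3` module as a stand-in for MySQL, and the `feedback` table, its columns, and the sample rows are all hypothetical.

```python
import csv
import io
import sqlite3

# Hypothetical flat-file contents; the column names and values are
# assumptions made for illustration, not the project's real data.
FLAT_FILE = """customer_id,quarter,rating
1,Q1,5
2,Q1,3
3,Q2,2
4,Q2,4
5,Q2,1
"""


def load_flat_file(conn, text):
    """Create a typed table (DDL) and load the CSV rows into it."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS feedback ("
        "customer_id INTEGER PRIMARY KEY, "
        "quarter TEXT NOT NULL, "
        "rating INTEGER NOT NULL)"
    )
    rows = [
        (int(r["customer_id"]), r["quarter"], int(r["rating"]))
        for r in csv.DictReader(io.StringIO(text))
    ]
    conn.executemany("INSERT INTO feedback VALUES (?, ?, ?)", rows)
    conn.commit()
    return len(rows)


def average_rating_by_quarter(conn):
    """Answer one business question with a query (DML)."""
    cur = conn.execute(
        "SELECT quarter, AVG(rating) FROM feedback "
        "GROUP BY quarter ORDER BY quarter"
    )
    return cur.fetchall()


conn = sqlite3.connect(":memory:")  # in-memory database, sketch only
loaded = load_flat_file(conn, FLAT_FILE)
result = average_rating_by_quarter(conn)
print(loaded, result)
```

Once the data sits in typed tables, each question in the quarterly report becomes a single query rather than an ad hoc pass over the flat files.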
Gamers' Arena is a website that provides information about video games. As an analytics lead, you must design an interactive dashboard that helps the director of the company decide on a new sales model (a subscription model) that will attract more gamers to the platform. The dashboard is to be built in Tableau.
Tableau, Stories, Dashboarding, Views, Storyboarding, KPI Controls, Blends
Dx-diagnostics, an online medical health tracker startup, offers an application that lets users measure their health indicators at regular intervals. As an analytics engineer, you'll design an Airflow DAG on the cloud to serve as the prototype's backend data architecture. The DAG should calculate summary statistics of the patient's heart rate and O2 levels every 15 minutes and send a report to a Slack channel. Anomalies in these metrics need to be flagged and saved separately.
GCP, Cloud Composer, MySQL Cloud DB, Airflow UI, Airflow DAG, Airflow Operators, XComs, Slack Webhook
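The per-window logic of the Dx-diagnostics DAG (summarize heart rate and O2, then set anomalies aside) can be sketched in plain Python. This is only an illustration under stated assumptions: the reading format, the function names, and the threshold ranges are hypothetical and are not clinical guidance. The Airflow scheduling, database access, and Slack delivery are deliberately left out; a function like `build_report` could be called from a task that runs every 15 minutes.

```python
from statistics import mean

# Hypothetical acceptable ranges, chosen only for illustration.
HEART_RATE_RANGE = (50, 120)  # beats per minute
O2_RANGE = (92, 100)          # percent saturation


def summarize(values):
    """Return min, max, and mean for one metric."""
    return {"min": min(values), "max": max(values), "mean": mean(values)}


def split_anomalies(readings):
    """Separate readings whose metrics fall outside the assumed ranges."""
    normal, anomalies = [], []
    for r in readings:
        hr_ok = HEART_RATE_RANGE[0] <= r["heart_rate"] <= HEART_RATE_RANGE[1]
        o2_ok = O2_RANGE[0] <= r["o2"] <= O2_RANGE[1]
        (normal if hr_ok and o2_ok else anomalies).append(r)
    return normal, anomalies


def build_report(readings):
    """Summarize one window of readings and return the flagged anomalies."""
    _, anomalies = split_anomalies(readings)
    report = {
        "heart_rate": summarize([r["heart_rate"] for r in readings]),
        "o2": summarize([r["o2"] for r in readings]),
        "anomaly_count": len(anomalies),
    }
    return report, anomalies


# Hypothetical readings for a single 15-minute window.
window = [
    {"heart_rate": 72, "o2": 98},
    {"heart_rate": 80, "o2": 97},
    {"heart_rate": 130, "o2": 95},
    {"heart_rate": 70, "o2": 88},
]
report, anomalies = build_report(window)
print(report)
print(anomalies)
```

Returning the anomalies separately from the report keeps the two outputs independent, so one can go to the Slack message and the other to its own storage.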