sparkify
Here are 12 public repositories matching this topic...
Data Analysis in Spark to Identify Customer Churn for a fictional music service.
-
Updated
Nov 25, 2019 - Jupyter Notebook
Sparkify project for predicting customer loyality.
-
Updated
Nov 3, 2019 - HTML
An ETL model designed using Postgres SQL for Sparkify database 🗄, modeling user activity data to create a database and ETL pipeline🔀 for a music streaming app 🎼.
-
Updated
Jun 2, 2020 - Jupyter Notebook
Udacity Data Engineer Nanodegree: Project Data Lake
-
Updated
Aug 21, 2019 - Python
Project: Data Modeling with Cassandra
-
Updated
May 19, 2019 - Jupyter Notebook
This Git repo showcases my analysis of Sparkify dataset with PySpark on Apache Spark cluster mode and JupyterLab on Docker. The goal was to identify at-risk customers and develop retention strategies. The analysis tested multiple machine learning models and uncovered insights into customer behavior and churn patterns.
-
Updated
Feb 15, 2023 - Jupyter Notebook
Students will build an ETL pipeline that extracts data from S3, stages them in Redshift, and transforms data into a set of dimensional tables for their analytics team.
-
Updated
Jun 4, 2019 - Python
Cloud Data Warehouse of Sparkify Data using Redshift
-
Updated
Jun 16, 2020 - Python
Churn Prediction using PySpark
-
Updated
Jan 29, 2021 - HTML
This is the final project for the Data Scientist Nanodegree, where our goal is to predict churn for a fictional streaming service called Sparkify.
-
Updated
Jul 6, 2023 - HTML
Improve this page
Add a description, image, and links to the sparkify topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the sparkify topic, visit your repo's landing page and select "manage topics."