sparkify

An ETL model built with PostgreSQL for the Sparkify database 🗄. It models user activity data to create a database and ETL pipeline 🔀 for a music streaming app 🎼.

database etl postgresql data-modeling datamodel etl-pipeline sparkify

Updated Jun 2, 2020
Jupyter Notebook

fpcarneiro / Data-Modeling-with-Cassandra

Project: Data Modeling with Cassandra

udacity cassandra etl-pipeline sparkify

Updated May 19, 2019
Jupyter Notebook

Guli-Y / Sparkify-s3-Spark-s3

ETL script that reads data from S3, processes it with Spark, and loads it back to S3 for the data analysis team

emr spark etl s3 sparkify

Updated May 21, 2021
Python

SimplifyData / Cloud-Data-Warehouse-with-Redshift-AWS

Cloud Data Warehouse of Sparkify Data using Redshift

database data-engineering data-lake redshift data-modeling music-database aws-redshift dimension-tables etl-pipeline staging-tables sparkify data-warehouses analytics-tables redshift-aws

Updated Jun 16, 2020
Python

abduygur / churn-prediction-using-spark

Churn Prediction using PySpark

data-science machine-learning pyspark churn-prediction sparkify

Updated Jan 29, 2021
HTML

alessiococchieri / BDA-project-sparkify

This Git repo showcases my analysis of the Sparkify dataset with PySpark on Apache Spark in cluster mode and JupyterLab on Docker. The goal was to identify at-risk customers and develop retention strategies. The analysis tested multiple machine learning models and uncovered insights into customer behavior and churn patterns.

machine-learning big-data spark apache-spark pyspark churn-prediction big-data-analytics big-data-processing churn-analysis sparkify

Updated Feb 15, 2023
Jupyter Notebook

fpcarneiro / Data-Warehouse

Students will build an ETL pipeline that extracts data from S3, stages it in Redshift, and transforms it into a set of dimensional tables for their analytics team.

udacity redshift data-engineer etl-pipeline sparkify data-warehouses

Updated Jun 4, 2019
Python

fpcarneiro / data-lake

Udacity Data Engineer Nanodegree: Project Data Lake

udacity spark data-lake data-engineer etl-pipeline sparkify

Updated Aug 21, 2019
Python

Here are 12 public repositories matching this topic...

brunowdev / sparkify

Guli-Y / SparkifyRedshift

Mcamin / User-Churn-Prediction

cdumen / Sparkify_Churn_Prediction

pratikwatwani / ETL-pipeline-for-Sparkify

fpcarneiro / Data-Modeling-with-Cassandra

Guli-Y / Sparkify-s3-Spark-s3

SimplifyData / Cloud-Data-Warehouse-with-Redshift-AWS

abduygur / churn-prediction-using-spark

alessiococchieri / BDA-project-sparkify

fpcarneiro / Data-Warehouse

fpcarneiro / data-lake
