Data Modeling with Apache Cassandra

A project completed with the Udacity Nanodegree Program in Data Engineering.

This project performs datamodelling on a denormalized NoSQL database. For comparison, similar queries are found with Relational databases as found my repository on Data Modeling with PostgreSQL.

Purpose

Sparkify are a new music streaming app. They've been collecting data on the songs in their catalogue as well as user activity. The data are currently found in a directory of JSON logs on user activity, as well as a directory with JSON metadata on the songs in their app. The analytics team wish to understand what songs users are listening to, but there's currently no straightforward and simply way to query the data in its current form.

Schema

As this is a NoSQL database, which are optimized for reads over writes, a unique table is modelled for each query. Using Apache Cassandra, the database is created in denormalized form which implies that we will have multiple copies of the data. In Cassandra, there are no duplicates, thus a unique PRIMARY KEY must be created for each table. Composite primary keys are a good way to achieve this. The schema uses the framework of 'partition' and 'clustering' keys when creating a PRIMARY KEY. The partition key determines the column on which the data will be partitioned - having equal sized partitions is preferred.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
event_data		event_data
images		images
Project_1B_ Project_Template.ipynb		Project_1B_ Project_Template.ipynb
README.md		README.md
event_datafile_new.csv		event_datafile_new.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Data Modeling with Apache Cassandra

Purpose

Schema

About

Uh oh!

Releases

Packages

Languages

mlnotebook/data_modeling_with_cassandra

Folders and files

Latest commit

History

Repository files navigation

Data Modeling with Apache Cassandra

Purpose

Schema

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages