Shazam

When anyone wants to watch a movie and doesn’t know what to watch, they would want to check the trailers of some trending movies around them or the top rated movies of all time. Shazam is a web portal similar to Netflix where users can watch trailers of various movies. Users can also rate the youtube trailers. Users will be given a choice to watch the trending videos and highly rated movies through filters across genres. Shazam is deployed on AWS and comprises of various technologies like EC2, DynamoDB, ElasticSearch, Lambdas, SQS, Python, Node.js, PySpark.

Architecture

Commands to package and deploy SAM templates

sam package --template-file template.yml --s3-bucket shazam-sam-templates --output-template-file output-template.yml sam deploy --template-file output-template.yml --stack-name shazam --capabilities CAPABILITY_IAM sam delete-stack --stack-name shazam

Command to run Spark job on EMR

spark-submit --master local[*] --conf "spark.mongodb.input.uri=mongodb+srv://nikhil:nikhil@shazamdb-ci1rz.mongodb.net/" --conf "spark.mongodb.output.uri=mongodb+srv://nikhil:nikhil@shazamdb-ci1rz.mongodb.net/" --packages org.mongodb.spark:mongo-spark-connector_2.11:2.4.0 trending.py

Screenshots

Homescreen:

Trailer thumbnail:

You can check screens for the Shazam:

https://chinmay609410.invisionapp.com/prototype/ck3uz6xaa004g6g01yvn0wr2h/play

Application

For demo of the application, visit the YouTube link:

https://www.youtube.com/watch?v=qJ7a99oaO8Y&t=6s

Check out the Medium post for the entire application published in Towards Data Science here:

https://towardsdatascience.com/shazam-699a95d640f9

Dataset

For movies and ratings, we are using dataset from grouplens.org 'MovieLens 20M' dataset.
For youtube trailers, we will be using the grouplens.org MovieLens 20M Youtube Trailers dataset.
We will use a separate dataset from Kaggle to fetch movie posters.

The links for the datasets are given below:

https://grouplens.org/datasets/movielens/20m
https://grouplens.org/datasets/movielens/20m-youtube
https://www.kaggle.com/neha1703/movie-genre-from-its-poster

Name		Name	Last commit message	Last commit date
Latest commit History 43 Commits
dataset		dataset
images		images
lambda-functions		lambda-functions
scripts		scripts
views		views
.DS_Store		.DS_Store
.gitignore		.gitignore
README.md		README.md
Shazam - Report.pdf		Shazam - Report.pdf
Shazam.pptx		Shazam.pptx
output-template.yml		output-template.yml
requirements.txt		requirements.txt
spark_invoker.py		spark_invoker.py
template.yml		template.yml
trending.py		trending.py

NikhilNar/Shazam

Folders and files

Latest commit

History

Repository files navigation

Shazam

Architecture

Commands to package and deploy SAM templates

Command to run Spark job on EMR

Screenshots

Homescreen:

Trailer thumbnail:

Application

Dataset

Team

About

Resources

Stars

Watchers

Forks

Languages