YouTube Data Analysis

This project analyses YouTube data and reports the most popular genres on YouTube based on views and uploads.

Structure

GBvideos.csv (Dataset)
YouTube Data Analysis (MapReduce implementation that finds the most popular genre on YouTube based on uploads)
Top Viewed Categories (MapReduce implementation that finds the most popular genre on YouTube based on views)
Top Categories Output (Output files)
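
The two jobs listed above can be illustrated with a small local sketch in plain Python. It is not the code in this repository (which runs on Hadoop as a .jar); it only shows the map, group and sum pattern under stated assumptions. The column names `category_id` and `views`, the sample rows, and the helper names `map_uploads`, `map_views` and `run_job` are all hypothetical.

```python
import csv
import io
from collections import defaultdict

# Hypothetical sample rows. The column names are an assumption about
# the layout of GBvideos.csv, not taken from the repository.
SAMPLE_CSV = """video_id,category_id,views
a1,10,1000
b2,24,500
c3,10,250
d4,24,4000
e5,22,100
f6,10,50
"""


def map_uploads(row):
    """Map step: emit (category_id, 1) for each video record."""
    yield row["category_id"], 1


def map_views(row):
    """Map step: emit (category_id, views) for each video record."""
    yield row["category_id"], int(row["views"])


def run_job(rows, mapper):
    """Shuffle and reduce: group mapper output by key and sum the values."""
    totals = defaultdict(int)
    for row in rows:
        for key, value in mapper(row):
            totals[key] += value
    # Sort by total, descending, so the most popular category comes first.
    return sorted(totals.items(), key=lambda kv: kv[1], reverse=True)


rows = list(csv.DictReader(io.StringIO(SAMPLE_CSV)))
top_by_uploads = run_job(rows, map_uploads)
top_by_views = run_job(rows, map_views)

print("By uploads:", top_by_uploads)
print("By views:  ", top_by_views)
```

On the sample rows, category 10 leads by uploads while category 24 leads by views, which is why the project keeps the two rankings as separate jobs.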

To produce the output, package the implementation as a .jar file and run the following commands in a Linux terminal:

hdfs dfs -mkdir /YouTubeInput

hdfs dfs -put /Downloads/YouTubeDataAnalysis/GBvideos.csv /YouTubeInput

hadoop jar /home/hadoop/TopViewedCategories.jar TopCategoryDriver /YouTubeInput /YouTubeOutput

hdfs dfs -cat /YouTubeOutput/*

hdfs dfs -get /YouTubeOutput/* /Downloads/YouTubeAnalysis/TopCategoryOutput
