tmtsmrsl/TrendingYoutubeVideos

TrendingYoutubeVideos

An analysis of roughly 47k trending video appearances in Indonesia, from the beginning of 2022 up to 13 Sep 2022 (around 200 trending videos are listed each day). You can view the full report here. Through this analysis I answered the following questions:

  • What are the most and least popular video categories in 2022?
  • When is the optimal day/time to publish YouTube videos?
  • Which videos and channels appear the most on the trending page?
  • What are the most popular keywords for titles and tags?
  • How do video definition and caption availability affect the chance of a video becoming trending?
  • How often do videos in other languages (besides Indonesian) appear on the trending page?
  • How does the engagement rate (like and comment count per view) differ between categories?
  • Etc.
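As a rough illustration of the engagement-rate question, the sketch below computes likes plus comments per view for each category. It is a minimal sketch, not the code used in the report: the function name `engagement_by_category`, the row keys (`category`, `view_count`, `like_count`, `comment_count`), and the sample numbers are all made up for illustration, and aggregating per category (rather than averaging per-video rates) is an assumption.

```python
from collections import defaultdict


def engagement_by_category(rows):
    """Return {category: (likes + comments) / views}, aggregated per category.

    Hypothetical helper: the key names are illustrative, not the columns
    of the actual dataset.
    """
    totals = defaultdict(lambda: {"views": 0, "interactions": 0})
    for row in rows:
        views = row.get("view_count") or 0
        if views <= 0:
            # Skip rows without views to avoid dividing by zero.
            continue
        # Assumption: missing like/comment counts are treated as 0.
        likes = row.get("like_count") or 0
        comments = row.get("comment_count") or 0
        totals[row["category"]]["views"] += views
        totals[row["category"]]["interactions"] += likes + comments
    return {cat: t["interactions"] / t["views"] for cat, t in totals.items()}


# Made-up sample rows, only to show the shape of the input.
sample_rows = [
    {"category": "Music", "view_count": 1000, "like_count": 80, "comment_count": 20},
    {"category": "Music", "view_count": 3000, "like_count": 90, "comment_count": 10},
    {"category": "Gaming", "view_count": 500, "like_count": 40, "comment_count": 10},
    {"category": "Gaming", "view_count": 0, "like_count": 0, "comment_count": 0},
]
rates = engagement_by_category(sample_rows)
for category, rate in sorted(rates.items()):
    print(f"{category}: {rate:.3f}")
```

Aggregating totals weights popular videos more heavily; averaging the per-video rates instead would give every video equal weight and may lead to different rankings.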

The project was done in Python and includes the following steps. First, I built an ETL pipeline that requests data from the YouTube API, transforms it from JSON into a dataframe, and loads it into a BigQuery table. Since it is not possible to request trending video data for past dates, I resorted to a dataset that is updated daily by Syahrul B Hamdani. Next, I performed data wrangling to remove unnecessary data, convert data types, and handle missing values. Finally, I did the EDA along with the visualization, which involved image manipulation with PIL, requesting additional data (YouTube channel info and video languages) from the YouTube API, making a waffle chart using a scatter plot, and a bit of NLP. trending_2022_1.csv is the cleaned data used for the EDA. If you are interested in the code, you can visit my notebook here.
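The transform step of the ETL pipeline described above can be sketched roughly as follows. This is a minimal sketch under stated assumptions, not the notebook's actual code: it assumes the response has the shape returned by the YouTube Data API v3 `videos.list` endpoint with the `snippet`, `statistics`, and `contentDetails` parts, and the function names `flatten_video_items` and `_to_int`, the output column names, and the sample response are hypothetical. The request and BigQuery load steps are left out.

```python
from typing import Any, Optional


def _to_int(value: Optional[str]) -> Optional[int]:
    """The API returns counts as strings; missing counts stay None."""
    return int(value) if value is not None else None


def flatten_video_items(response: dict[str, Any], trending_date: str) -> list[dict[str, Any]]:
    """Turn a videos.list-style JSON response into flat rows, one per video.

    Hypothetical helper: output column names are illustrative only.
    """
    rows = []
    for item in response.get("items", []):
        snippet = item.get("snippet", {})
        stats = item.get("statistics", {})
        details = item.get("contentDetails", {})
        rows.append({
            "trending_date": trending_date,
            "video_id": item.get("id"),
            "title": snippet.get("title"),
            "channel_title": snippet.get("channelTitle"),
            "category_id": snippet.get("categoryId"),
            "publish_time": snippet.get("publishedAt"),
            "tags": snippet.get("tags", []),
            "view_count": _to_int(stats.get("viewCount")),
            # likeCount / commentCount can be absent (hidden or disabled).
            "like_count": _to_int(stats.get("likeCount")),
            "comment_count": _to_int(stats.get("commentCount")),
            "definition": details.get("definition"),
            # contentDetails.caption is the string "true" or "false".
            "caption": details.get("caption") == "true",
        })
    return rows


# Made-up response fragment, only to show the expected structure.
sample_response = {
    "items": [
        {
            "id": "abc123",
            "snippet": {
                "title": "Example video",
                "channelTitle": "Example channel",
                "categoryId": "10",
                "publishedAt": "2022-09-12T10:00:00Z",
            },
            "statistics": {"viewCount": "1000", "likeCount": "50"},
            "contentDetails": {"definition": "hd", "caption": "false"},
        }
    ]
}
rows = flatten_video_items(sample_response, "2022-09-13")
print(rows[0])
```

With pandas installed, `pandas.DataFrame(rows)` turns these rows into a dataframe. Keeping missing counts as `None` rather than 0 leaves the decision about how to handle missing values to the data wrangling step.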

About

An analysis of trending YouTube videos in Indonesia from the beginning of 2022 up to 13 Sep 2022.
