# Exploratory Data Analysis of Mr Beast's YouTube Channel

### Intro
YouTube gives content creators a platform to showcase their work, engage with audiences, and build communities. One of its most prominent creators is Mr Beast, known for his philanthropic stunts, attention-grabbing challenges, and entertaining content.

In this notebook, we explore Mr Beast's YouTube channel data. The dataset includes key metrics for each video: title, description, duration, view count, like count, comment count, publish date, and publish time. Our goal is to answer the following questions about Mr Beast's content:

1. What is the average duration of Mr Beast's videos in seconds?
2. Which video has the highest view count?
3. What is the correlation between the number of views and the number of likes on Mr Beast's videos?
4. On which day of the week are Mr Beast's videos most commonly published?
5. Is there any correlation between video duration and the number of comments received?
6. What does the distribution of video durations on Mr Beast's channel look like (e.g., as a histogram or boxplot)?
7. Are there any specific words or phrases that commonly appear in video titles or descriptions that could be linked to higher view counts?
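
As a rough sketch of how the first three questions might be approached, the snippet below builds a small hand-made frame from four of the video rows that appear in the data preview later in this notebook. The `sample` frame and the variable names are illustrative assumptions, not the notebook's own code, and four rows are far too few to draw conclusions from.

```python
import pandas as pd

# Four video rows copied from the notebook's data preview (illustrative subset only)
sample = pd.DataFrame({
    "id": ["TQHEJj68Jew", "00NgUctWoLQ", "ayXxwJJId_c", "cExLQ1o2pDw"],
    "duration_seconds": [861, 729, 709, 482],
    "viewCount": [84717282, 32090178, 101745632, 50008942],
    "likeCount": [2876493, 2125183, 3110824, 2359606],
})

# Question 1: average duration in seconds
mean_duration = sample["duration_seconds"].mean()

# Question 2: the row with the highest view count
top_video = sample.loc[sample["viewCount"].idxmax()]

# Question 3: Pearson correlation between views and likes
views_likes_corr = sample["viewCount"].corr(sample["likeCount"])

print(mean_duration)
print(top_video["id"])
print(round(views_likes_corr, 3))
```

The same three calls should carry over to the full dataset once the non-video row and missing values have been dealt with.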

### 1. Import Libraries

In [1]:
import pandas as pd
import numpy as np

### 2. Data Input and Wrangling

In [2]:
data = pd.read_csv("/kaggle/input/mr-beast-youtube-video-statistics/MrBeast_youtube_stats.csv")
data.head(5)

Unnamed: 0,id,title,description,publishTime,kind_stats,duration_seconds,viewCount,likeCount,commentCount,thumbnails.default.url,...,thumbnails.high.width,thumbnails.high.height,contentDetails.duration,contentDetails.dimension,topicDetails.topicCategories,snippet.defaultLanguage,localizations.en.title,localizations.en.description,snippet.tags,contentDetails.contentRating.ytRating
0,TQHEJj68Jew,I Got Hunted By A Real Bounty Hunter,"Sign up for Current w/ my Creator Code ""BEAST""...",2021-04-24 20:00:00+00:00,youtube#video,861,84717282.0,2876493.0,128922.0,https://i.ytimg.com/vi/TQHEJj68Jew/default.jpg,...,480.0,360.0,PT14M21S,2d,['https://en.wikipedia.org/wiki/Lifestyle_(soc...,,,,,
1,00NgUctWoLQ,"Extreme $1,000,000 Hide And Seek",I didn't expect that to happen at the end I wa...,2021-12-18 21:00:00+00:00,youtube#video,729,32090178.0,2125183.0,73593.0,https://i.ytimg.com/vi/00NgUctWoLQ/default.jpg,...,480.0,360.0,PT12M9S,2d,['https://en.wikipedia.org/wiki/Lifestyle_(soc...,en,"Extreme $1,000,000 Hide And Seek",I didn't expect that to happen at the end I wa...,,
2,,MrBeast,Accomplishments - Raised $20000000 To Plant 20...,2012-02-20 00:43:50+00:00,,0,,,,https://yt3.ggpht.com/ytc/AKedOLTctGKJ32CdDLiS...,...,,,,,,,,,,
3,ayXxwJJId_c,I Bought The World&#39;s Largest Mystery Box! ...,I cant believe I spent over $500000 on mystery...,2021-04-03 20:00:01+00:00,youtube#video,709,101745632.0,3110824.0,162796.0,https://i.ytimg.com/vi/ayXxwJJId_c/default.jpg,...,480.0,360.0,PT11M49S,2d,['https://en.wikipedia.org/wiki/Lifestyle_(soc...,,,,,
4,cExLQ1o2pDw,"First To Rob Bank Wins $100,000",I didnt think he would actually rob the bank.....,2021-09-26 20:00:06+00:00,youtube#video,482,50008942.0,2359606.0,120621.0,https://i.ytimg.com/vi/cExLQ1o2pDw/default.jpg,...,480.0,360.0,PT8M2S,2d,['https://en.wikipedia.org/wiki/Lifestyle_(soc...,,,,,


In [3]:
data.shape

(247, 26)

In [4]:
data.info()

<class 'pandas.core.frame.DataFrame'>
RangeIndex: 247 entries, 0 to 246
Data columns (total 26 columns):
 #   Column                                 Non-Null Count  Dtype  
---  ------                                 --------------  -----  
 0   id                                     246 non-null    object 
 1   title                                  247 non-null    object 
 2   description                            247 non-null    object 
 3   publishTime                            247 non-null    object 
 4   kind_stats                             246 non-null    object 
 5   duration_seconds                       247 non-null    int64  
 6   viewCount                              246 non-null    float64
 7   likeCount                              244 non-null    float64
 8   commentCount                           245 non-null    float64
 9   thumbnails.default.url                 247 non-null    object 
 10  thumbnails.default.width               246 non-null    float64
 11  thumbn

In [5]:
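# Parse the publish-time strings into timezone-aware (UTC) timestamps
data['publishTime'] = pd.to_datetime(data['publishTime'])
# Split each timestamp into a calendar date and a time of day (both in UTC)
data['publishDate'] = data['publishTime'].dt.date
data['publishTimestamp'] = data['publishTime'].dt.time
data.head(5)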

Unnamed: 0,id,title,description,publishTime,kind_stats,duration_seconds,viewCount,likeCount,commentCount,thumbnails.default.url,...,contentDetails.duration,contentDetails.dimension,topicDetails.topicCategories,snippet.defaultLanguage,localizations.en.title,localizations.en.description,snippet.tags,contentDetails.contentRating.ytRating,publishDate,publishTimestamp
0,TQHEJj68Jew,I Got Hunted By A Real Bounty Hunter,"Sign up for Current w/ my Creator Code ""BEAST""...",2021-04-24 20:00:00+00:00,youtube#video,861,84717282.0,2876493.0,128922.0,https://i.ytimg.com/vi/TQHEJj68Jew/default.jpg,...,PT14M21S,2d,['https://en.wikipedia.org/wiki/Lifestyle_(soc...,,,,,,2021-04-24,20:00:00
1,00NgUctWoLQ,"Extreme $1,000,000 Hide And Seek",I didn't expect that to happen at the end I wa...,2021-12-18 21:00:00+00:00,youtube#video,729,32090178.0,2125183.0,73593.0,https://i.ytimg.com/vi/00NgUctWoLQ/default.jpg,...,PT12M9S,2d,['https://en.wikipedia.org/wiki/Lifestyle_(soc...,en,"Extreme $1,000,000 Hide And Seek",I didn't expect that to happen at the end I wa...,,,2021-12-18,21:00:00
2,,MrBeast,Accomplishments - Raised $20000000 To Plant 20...,2012-02-20 00:43:50+00:00,,0,,,,https://yt3.ggpht.com/ytc/AKedOLTctGKJ32CdDLiS...,...,,,,,,,,,2012-02-20,00:43:50
3,ayXxwJJId_c,I Bought The World&#39;s Largest Mystery Box! ...,I cant believe I spent over $500000 on mystery...,2021-04-03 20:00:01+00:00,youtube#video,709,101745632.0,3110824.0,162796.0,https://i.ytimg.com/vi/ayXxwJJId_c/default.jpg,...,PT11M49S,2d,['https://en.wikipedia.org/wiki/Lifestyle_(soc...,,,,,,2021-04-03,20:00:01
4,cExLQ1o2pDw,"First To Rob Bank Wins $100,000",I didnt think he would actually rob the bank.....,2021-09-26 20:00:06+00:00,youtube#video,482,50008942.0,2359606.0,120621.0,https://i.ytimg.com/vi/cExLQ1o2pDw/default.jpg,...,PT8M2S,2d,['https://en.wikipedia.org/wiki/Lifestyle_(soc...,,,,,,2021-09-26,20:00:06
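
Once `publishTime` is a datetime column, the weekday needed for question 4 can be read off with the `.dt` accessor. The sketch below is a minimal illustration on four timestamps copied from the preview above; the `sample` frame and the `publishDay` column name are assumptions made for the example. The timestamps are in UTC, so the weekday in the channel's local time zone may differ.

```python
import pandas as pd

# Publish times copied from four video rows in the preview (UTC)
sample = pd.DataFrame({
    "publishTime": [
        "2021-04-24 20:00:00+00:00",
        "2021-12-18 21:00:00+00:00",
        "2021-04-03 20:00:01+00:00",
        "2021-09-26 20:00:06+00:00",
    ]
})
sample["publishTime"] = pd.to_datetime(sample["publishTime"], utc=True)

# Weekday name for each upload, then a count per weekday
sample["publishDay"] = sample["publishTime"].dt.day_name()
day_counts = sample["publishDay"].value_counts()

print(day_counts)
```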


In [6]:
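# Keep only the columns needed for the analysis; .copy() avoids chained-assignment warnings on later edits
keep_cols = ['id','title','description','kind_stats','duration_seconds','viewCount','likeCount','commentCount','thumbnails.high.width','thumbnails.high.height','publishDate','publishTimestamp']
data = data[keep_cols].copy()
data.head(5)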

Unnamed: 0,id,title,description,kind_stats,duration_seconds,viewCount,likeCount,commentCount,thumbnails.high.width,thumbnails.high.height,publishDate,publishTimestamp
0,TQHEJj68Jew,I Got Hunted By A Real Bounty Hunter,"Sign up for Current w/ my Creator Code ""BEAST""...",youtube#video,861,84717282.0,2876493.0,128922.0,480.0,360.0,2021-04-24,20:00:00
1,00NgUctWoLQ,"Extreme $1,000,000 Hide And Seek",I didn't expect that to happen at the end I wa...,youtube#video,729,32090178.0,2125183.0,73593.0,480.0,360.0,2021-12-18,21:00:00
2,,MrBeast,Accomplishments - Raised $20000000 To Plant 20...,,0,,,,,,2012-02-20,00:43:50
3,ayXxwJJId_c,I Bought The World&#39;s Largest Mystery Box! ...,I cant believe I spent over $500000 on mystery...,youtube#video,709,101745632.0,3110824.0,162796.0,480.0,360.0,2021-04-03,20:00:01
4,cExLQ1o2pDw,"First To Rob Bank Wins $100,000",I didnt think he would actually rob the bank.....,youtube#video,482,50008942.0,2359606.0,120621.0,480.0,360.0,2021-09-26,20:00:06


In [7]:
data.shape

(247, 12)

In [8]:
data.isnull().sum()

id                        1
title                     0
description               0
kind_stats                1
duration_seconds          0
viewCount                 1
likeCount                 3
commentCount              2
thumbnails.high.width     1
thumbnails.high.height    1
publishDate               0
publishTimestamp          0
dtype: int64
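
The single missing `id` lines up with row 2 in the preview, which has the title "MrBeast", a duration of 0, and no statistics, so it appears to describe the channel itself rather than a video. One possible way to handle it, sketched here on a hand-built frame rather than the real data, is to drop rows without an `id` before computing any statistics. The `sample` and `videos` names are hypothetical, and whether to drop or impute the remaining missing like and comment counts is a separate judgment call.

```python
import numpy as np
import pandas as pd

# Three video rows plus the channel-like row, mirroring the preview
sample = pd.DataFrame({
    "id": ["TQHEJj68Jew", "00NgUctWoLQ", np.nan, "ayXxwJJId_c"],
    "duration_seconds": [861, 729, 0, 709],
    "viewCount": [84717282.0, 32090178.0, np.nan, 101745632.0],
})

# Drop rows with no video id, then renumber the index
videos = sample.dropna(subset=["id"]).reset_index(drop=True)

# With no NaN left in viewCount, the column can be stored as integers
videos["viewCount"] = videos["viewCount"].astype("int64")

print(videos)
```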

In [9]:
data.info()

<class 'pandas.core.frame.DataFrame'>
RangeIndex: 247 entries, 0 to 246
Data columns (total 12 columns):
 #   Column                  Non-Null Count  Dtype  
---  ------                  --------------  -----  
 0   id                      246 non-null    object 
 1   title                   247 non-null    object 
 2   description             247 non-null    object 
 3   kind_stats              246 non-null    object 
 4   duration_seconds        247 non-null    int64  
 5   viewCount               246 non-null    float64
 6   likeCount               244 non-null    float64
 7   commentCount            245 non-null    float64
 8   thumbnails.high.width   246 non-null    float64
 9   thumbnails.high.height  246 non-null    float64
 10  publishDate             247 non-null    object 
 11  publishTimestamp        247 non-null    object 
dtypes: float64(5), int64(1), object(6)
memory usage: 23.3+ KB
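
Note that `publishDate` and `publishTimestamp` show up as `object` columns: `.dt.date` and `.dt.time` return plain Python `date` and `time` objects, which lose access to the `.dt` accessor. If datetime operations are needed later, one alternative (an assumption on our part, not what this notebook does) is to keep a datetime column truncated to midnight with `.dt.normalize()`. A minimal comparison:

```python
import pandas as pd

# Two publish times copied from the preview (UTC)
ts = pd.to_datetime(
    pd.Series(["2021-04-24 20:00:00+00:00", "2021-12-18 21:00:00+00:00"]),
    utc=True,
)

as_objects = ts.dt.date         # Python date objects, stored with object dtype
as_datetimes = ts.dt.normalize()  # still a datetime column, time set to 00:00

print(as_objects.dtype)
print(as_datetimes.dtype)
```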
