# Analyzing Regular Show data

One of my favorite TV shows is the Cartoon Network classic Regular Show. Beyond just watching it, I would like to dig into the data behind it. This is a fun way to hone my programming and data analysis skills while picking up some interesting insights along the way. So let's begin!

**Warning: this analysis will contain spoilers**

# Research Questions

In this analysis I will explore the Regular Show episode dataset. Apart from exploratory analysis to see whether I chance upon anything interesting, I will also try to answer the following questions:

1. What are the most underrated Regular Show episodes, where underrated means the highest ratio of IMDb score to viewership (highly rated episodes that few people watched)?
2. What are the most overrated Regular Show episodes, meaning the lowest ratio of IMDb score to viewership (poorly rated episodes that everybody watched)?
3. Was there any correlation between the episode writer(s) and the IMDb score (which would suggest that certain writers wrote better episodes)?
4. Which episodes were responsible, even if only partially, for renewed interest in the series as a whole?
5. Was the meta-analysis of the Seer from one of the very last episodes accurate?
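
As a rough sketch of how questions 1 and 2 could be approached, the ratio can be stored in a new column and the episodes sorted by it. The column names `imdb_rating` and `us_viewers_mil` come from the dataset, and the five rows are copied from the Kaggle file. The column name `rating_per_mil_viewers` is my own placeholder, and this is only an illustration on a tiny sample, not the final analysis.

```python
import pandas as pd

# Tiny sample: five episodes copied from the Kaggle file
sample = pd.DataFrame({
    "episode_name": [
        "The Power",
        "Just Set Up the Chairs",
        "Caffeinated Coffee Tickets",
        "Death Punchies",
        "Free Cake",
    ],
    "us_viewers_mil": [2.1, 1.9, 1.72, 1.98, 2.1],
    "imdb_rating": [8.8, 8.8, 8.5, 8.9, 8.5],
})

# Placeholder column name: IMDb score per million US viewers
sample["rating_per_mil_viewers"] = sample["imdb_rating"] / sample["us_viewers_mil"]

# Highest ratio first: the top is a candidate "underrated" episode,
# the bottom a candidate "overrated" one
ranked = sample.sort_values("rating_per_mil_viewers", ascending=False)
print(ranked[["episode_name", "rating_per_mil_viewers"]])
```

A plain ratio is sensitive to episodes with very low viewership, so it may be worth comparing it against other definitions later on.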

In [4]:
import pandas as pd

# Import the data file that I downloaded from Kaggle as a csv
reg_show = pd.read_csv("./regular_show.csv")

# Get an idea of what the dataset looks like
reg_show.head()

Unnamed: 0,number_overall,season,number_in_season,episode_name,animation_direction_by,written_and_storyboarded_by,original_air_date,us_viewers_mil,imdb_rating
0,1,1,1,The Power,"Robert Alvarez, Brian Sheesley",J. G. Quintel,2010-09-06,2.1,8.8
1,2,1,2,Just Set Up the Chairs,"Robert Alvarez, Brian Sheesley","Sean Szeles, Shion Takeuchi",2010-09-13,1.9,8.8
2,3,1,3,Caffeinated Coffee Tickets,"Robert Alvarez, Brian Sheesley","J. G. Quintel, Mike Roth",2010-09-20,1.72,8.5
3,4,1,4,Death Punchies,"Robert Alvarez, Brian Sheesley","J. G. Quintel, Mike Roth, Jake Armstrong",2010-09-27,1.98,8.9
4,5,1,5,Free Cake,"Robert Alvarez, Brian Sheesley","Kat Morris, Paul Scarlata",2010-10-04,2.1,8.5


While the writers clearly vary from episode to episode right off the bat, is there similar variety in the animation direction? I'd better check, because it would be fun to analyze whether animation direction affects the IMDb rating too.

In [5]:
reg_show.tail()

Unnamed: 0,number_overall,season,number_in_season,episode_name,animation_direction_by,written_and_storyboarded_by,original_air_date,us_viewers_mil,imdb_rating
256,257,8,27,Meet the Seer,"Robert Alvarez, Jeff Hall","Benton Connor, Sam Spina",2017-01-14,1.08,9.6
257,258,8,28,Cheer Up Pops,"Robert Alvarez, Jeff Hall","Casey Crowe, Gideon Chase",2017-01-16,1.33,9.5
258,259,8,29,A Regular Epic Final Battle,"Robert Alvarez, Richard Collado, Jeff Hall","J.G. Quintel, Madeline Queripel, Alex Cline, M...",2017-01-16,1.33,9.9
259,260,8,30,A Regular Epic Final Battle,"Robert Alvarez, Richard Collado, Jeff Hall","J.G. Quintel, Madeline Queripel, Alex Cline, M...",2017-01-16,1.37,9.9
260,261,8,31,A Regular Epic Final Battle,"Robert Alvarez, Richard Collado, Jeff Hall","J.G. Quintel, Madeline Queripel, Alex Cline, M...",2017-01-16,1.37,9.9
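
Eyeballing `head()` and `tail()` only covers ten rows, so a more direct check would be to count the distinct values in `animation_direction_by`. The sketch below does this on the ten credit strings shown in the two outputs. The variable names are my own, and on the full dataset the same calls could be applied to `reg_show["animation_direction_by"]`.

```python
import pandas as pd

# Director credits copied from the head() and tail() outputs
directors = pd.Series(
    ["Robert Alvarez, Brian Sheesley"] * 5
    + ["Robert Alvarez, Jeff Hall"] * 2
    + ["Robert Alvarez, Richard Collado, Jeff Hall"] * 3,
    name="animation_direction_by",
)

# How often each exact credit string (team of directors) appears
combo_counts = directors.value_counts()

# Split each credit into individual names, one row per name, then count
individual_counts = directors.str.split(", ").explode().value_counts()

print(combo_counts)
print(individual_counts)
```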


The later episodes show that the animation directors also change, so that becomes another variable I can analyze in the future. For now, just to get my bearings, I'd like to see the trend in viewership over time.
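
One simple way to get a first look at that trend is to parse the air dates and average viewership per season. This is a minimal sketch that assumes only the ten rows shown in the `head()` and `tail()` outputs, so the numbers are not real season averages (they cover just the first five and last five episodes). The variable names `episodes` and `season_avg` are my own.

```python
import pandas as pd

# First five and last five episodes, copied from the outputs above
episodes = pd.DataFrame({
    "season": [1, 1, 1, 1, 1, 8, 8, 8, 8, 8],
    "original_air_date": [
        "2010-09-06", "2010-09-13", "2010-09-20", "2010-09-27", "2010-10-04",
        "2017-01-14", "2017-01-16", "2017-01-16", "2017-01-16", "2017-01-16",
    ],
    "us_viewers_mil": [2.1, 1.9, 1.72, 1.98, 2.1, 1.08, 1.33, 1.33, 1.37, 1.37],
})

# Convert the date strings to real datetimes and keep rows in air order
episodes["original_air_date"] = pd.to_datetime(episodes["original_air_date"])
episodes = episodes.sort_values("original_air_date")

# Mean US viewership (in millions) per season, for this sample only
season_avg = episodes.groupby("season")["us_viewers_mil"].mean()
print(season_avg)
```

On the full dataset the same grouping could be applied to `reg_show` directly, and the resulting series is easy to pass to a line plot.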