# Project 1: Explanatory Data Analysis & Data Presentation (Movies Dataset)

# Project Brief for Self-Coders

Here you'll have the opportunity to code major parts of Project 1 on your own. If you need any help or inspiration, have a look at the videos or the Jupyter Notebook with the full code. <br> <br>
Keep in mind that it's all about __getting the right results/conclusions__, not about producing identical code. Things can be coded in many different ways. Even if you reach the same conclusions, it's very unlikely that we will have written the very same code.

## Data Import and first Inspection

In [1]:
import pandas as pd
import numpy as np

1. __Import__ the movies dataset from the CSV file "movies_complete.csv". __Inspect__ the data.

__Some additional information on Features/Columns__:

* **id:** The ID of the movie (clear/unique identifier).
* **title:** The Official Title of the movie.
* **tagline:** The tagline of the movie.
* **release_date:** Theatrical Release Date of the movie.
* **genres:** Genres associated with the movie.
* **belongs_to_collection:** Gives information on the movie series/franchise the particular film belongs to.
* **original_language:** The language in which the movie was originally shot.
* **budget_musd:** The budget of the movie in million dollars.
* **revenue_musd:** The total revenue of the movie in million dollars.
* **production_companies:** Production companies involved with the making of the movie.
* **production_countries:** Countries where the movie was shot/produced.
* **vote_count:** The number of votes by users, as counted by TMDB.
* **vote_average:** The average rating of the movie.
* **popularity:** The Popularity Score assigned by TMDB.
* **runtime:** The runtime of the movie in minutes.
* **overview:** A brief blurb of the movie.
* **spoken_languages:** Spoken languages in the film.
* **poster_path:** The URL of the poster image.
* **cast:** (Main) Actors appearing in the movie.
* **cast_size:** Number of actors appearing in the movie.
* **director:** Director of the movie.
* **crew_size:** Size of the film crew (incl. director, excl. actors).

In [2]:
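# Load the dataset and use the unique movie ID as the row index
movies = pd.read_csv('movies_complete.csv', index_col='id')
# Show the first five rows for a first inspection
movies.head()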

id,title,tagline,release_date,genres,belongs_to_collection,original_language,budget_musd,revenue_musd,production_companies,production_countries,...,vote_average,popularity,runtime,overview,spoken_languages,poster_path,cast,cast_size,crew_size,director
862,Toy Story,,1995-10-30,Animation|Comedy|Family,Toy Story Collection,en,30.0,373.554033,Pixar Animation Studios,United States of America,...,7.7,21.946943,81.0,"Led by Woody, Andy's toys live happily in his ...",English,<img src='http://image.tmdb.org/t/p/w185//uXDf...,Tom Hanks|Tim Allen|Don Rickles|Jim Varney|Wal...,13,106,John Lasseter
8844,Jumanji,Roll the dice and unleash the excitement!,1995-12-15,Adventure|Fantasy|Family,,en,65.0,262.797249,TriStar Pictures|Teitler Film|Interscope Commu...,United States of America,...,6.9,17.015539,104.0,When siblings Judy and Peter discover an encha...,English|Français,<img src='http://image.tmdb.org/t/p/w185//vgpX...,Robin Williams|Jonathan Hyde|Kirsten Dunst|Bra...,26,16,Joe Johnston
15602,Grumpier Old Men,Still Yelling. Still Fighting. Still Ready for...,1995-12-22,Romance|Comedy,Grumpy Old Men Collection,en,,,Warner Bros.|Lancaster Gate,United States of America,...,6.5,11.7129,101.0,A family wedding reignites the ancient feud be...,English,<img src='http://image.tmdb.org/t/p/w185//1FSX...,Walter Matthau|Jack Lemmon|Ann-Margret|Sophia ...,7,4,Howard Deutch
31357,Waiting to Exhale,Friends are the people who let you be yourself...,1995-12-22,Comedy|Drama|Romance,,en,16.0,81.452156,Twentieth Century Fox Film Corporation,United States of America,...,6.1,3.859495,127.0,"Cheated on, mistreated and stepped on, the wom...",English,<img src='http://image.tmdb.org/t/p/w185//4wjG...,Whitney Houston|Angela Bassett|Loretta Devine|...,10,10,Forest Whitaker
11862,Father of the Bride Part II,Just When His World Is Back To Normal... He's ...,1995-02-10,Comedy,Father of the Bride Collection,en,,76.578911,Sandollar Productions|Touchstone Pictures,United States of America,...,5.7,8.387519,106.0,Just when George Banks has recovered from his ...,English,<img src='http://image.tmdb.org/t/p/w185//lf9R...,Steve Martin|Diane Keaton|Martin Short|Kimberl...,12,7,Charles Shyer


## The best and the worst movies...

2. __Filter__ the Dataset and __find the best/worst n Movies__ with the

- Highest Revenue
- Highest Budget
- Highest Profit (=Revenue - Budget)
- Lowest Profit (=Revenue - Budget)
- Highest Return on Investment (=Revenue / Budget) (only movies with Budget >= 10) 
- Lowest Return on Investment (=Revenue / Budget) (only movies with Budget >= 10)
- Highest number of Votes
- Highest Rating (only movies with 10 or more Ratings)
- Lowest Rating (only movies with 10 or more Ratings)
- Highest Popularity

__Define__ an appropriate __user-defined function__ to reuse code.
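
One way to approach this is a small helper that applies the minimum-budget and minimum-votes conditions and then sorts by the chosen column. The following is a minimal sketch, not the reference solution: the function name `best_worst`, its parameters, and the toy DataFrame are assumptions made for illustration; only the column names come from the dataset.

```python
import pandas as pd


def best_worst(df, n, by, ascending=False, min_bud=0, min_votes=0):
    """Return the n best (or worst) movies of df ranked by column `by`.

    Hypothetical helper: name and parameters are illustrative assumptions.
    """
    # Start with all rows, then narrow down by the optional thresholds.
    mask = pd.Series(True, index=df.index)
    if min_bud > 0:
        # Rows with a missing budget compare as False and are dropped.
        mask = mask & (df["budget_musd"] >= min_bud)
    if min_votes > 0:
        mask = mask & (df["vote_count"] >= min_votes)
    subset = df.loc[mask, ["title", by]]
    # Missing values in `by` are sorted to the end by default.
    return subset.sort_values(by=by, ascending=ascending).head(n)


# Toy data standing in for the movies DataFrame (values are made up).
toy = pd.DataFrame(
    {
        "title": ["A", "B", "C", "D"],
        "budget_musd": [5.0, 20.0, 50.0, None],
        "revenue_musd": [50.0, 30.0, 200.0, 10.0],
        "vote_count": [100, 8, 2000, 50],
    },
    index=pd.Index([1, 2, 3, 4], name="id"),
)
toy["roi"] = toy["revenue_musd"] / toy["budget_musd"]

top_revenue = best_worst(toy, n=2, by="revenue_musd")
top_roi = best_worst(toy, n=2, by="roi", min_bud=10)
print(top_revenue)
print(top_roi)
```

Derived columns such as profit or ROI have to be added to the DataFrame before the helper can rank by them.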

__Movies Top 5 - Highest Revenue__

In [14]:
movies.sort_values(by='revenue_musd', ascending = False)[['title','revenue_musd']].head(5)

id,title,revenue_musd
19995,Avatar,2787.965087
140607,Star Wars: The Force Awakens,2068.223624
597,Titanic,1845.034188
24428,The Avengers,1519.55791
135397,Jurassic World,1513.52881


__Movies Top 5 - Highest Budget__

In [16]:
movies.nlargest(5,'budget_musd')[['title','budget_musd']]

id,title,budget_musd
1865,Pirates of the Caribbean: On Stranger Tides,380.0
285,Pirates of the Caribbean: At World's End,300.0
99861,Avengers: Age of Ultron,280.0
1452,Superman Returns,270.0
38757,Tangled,260.0


__Movies Top 5 - Highest Profit__

In [34]:
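# Profit = revenue - budget (both in million USD); NaN if either value is missing
movies['profit_musd'] = movies['revenue_musd'] - movies['budget_musd']
# A cell displays only its last expression, so this top-5 ranking is not shown
# in the output; remove the head() call to display it instead
movies.nlargest(5, 'profit_musd')[['title', 'profit_musd']]
# Displayed output: first five rows, confirming the new profit_musd column
movies.head()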

id,title,tagline,release_date,genres,belongs_to_collection,original_language,budget_musd,revenue_musd,production_companies,production_countries,...,popularity,runtime,overview,spoken_languages,poster_path,cast,cast_size,crew_size,director,profit_musd
862,Toy Story,,1995-10-30,Animation|Comedy|Family,Toy Story Collection,en,30.0,373.554033,Pixar Animation Studios,United States of America,...,21.946943,81.0,"Led by Woody, Andy's toys live happily in his ...",English,<img src='http://image.tmdb.org/t/p/w185//uXDf...,Tom Hanks|Tim Allen|Don Rickles|Jim Varney|Wal...,13,106,John Lasseter,343.554033
8844,Jumanji,Roll the dice and unleash the excitement!,1995-12-15,Adventure|Fantasy|Family,,en,65.0,262.797249,TriStar Pictures|Teitler Film|Interscope Commu...,United States of America,...,17.015539,104.0,When siblings Judy and Peter discover an encha...,English|Français,<img src='http://image.tmdb.org/t/p/w185//vgpX...,Robin Williams|Jonathan Hyde|Kirsten Dunst|Bra...,26,16,Joe Johnston,197.797249
15602,Grumpier Old Men,Still Yelling. Still Fighting. Still Ready for...,1995-12-22,Romance|Comedy,Grumpy Old Men Collection,en,,,Warner Bros.|Lancaster Gate,United States of America,...,11.7129,101.0,A family wedding reignites the ancient feud be...,English,<img src='http://image.tmdb.org/t/p/w185//1FSX...,Walter Matthau|Jack Lemmon|Ann-Margret|Sophia ...,7,4,Howard Deutch,
31357,Waiting to Exhale,Friends are the people who let you be yourself...,1995-12-22,Comedy|Drama|Romance,,en,16.0,81.452156,Twentieth Century Fox Film Corporation,United States of America,...,3.859495,127.0,"Cheated on, mistreated and stepped on, the wom...",English,<img src='http://image.tmdb.org/t/p/w185//4wjG...,Whitney Houston|Angela Bassett|Loretta Devine|...,10,10,Forest Whitaker,65.452156
11862,Father of the Bride Part II,Just When His World Is Back To Normal... He's ...,1995-02-10,Comedy,Father of the Bride Collection,en,,76.578911,Sandollar Productions|Touchstone Pictures,United States of America,...,8.387519,106.0,Just when George Banks has recovered from his ...,English,<img src='http://image.tmdb.org/t/p/w185//lf9R...,Steve Martin|Diane Keaton|Martin Short|Kimberl...,12,7,Charles Shyer,


__Movies Top 5 - Lowest Profit__

In [35]:
# profit_musd was already computed in the Highest Profit cell above
movies.nsmallest(5, 'profit_musd')[['title', 'profit_musd']]

id,title,profit_musd
57201,The Lone Ranger,-165.71009
10733,The Alamo,-119.180039
50321,Mars Needs Moms,-111.007242
339964,Valerian and the City of a Thousand Planets,-107.447384
1911,The 13th Warrior,-98.301101


__Movies Top 5 - Highest ROI__

In [43]:
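# Keep only movies with a budget of at least 10 million USD; copy() after
# filtering so the new column is added to an independent DataFrame
movies10 = movies[movies['budget_musd'] >= 10].copy()
# Return on investment = revenue / budget
movies10['ROI'] = movies10['revenue_musd'] / movies10['budget_musd']
movies10.nlargest(5, 'ROI')[['title', 'ROI']]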

id,title,ROI
601,E.T. the Extra-Terrestrial,75.520507
11,Star Wars,70.490728
114,Pretty Woman,33.071429
77338,The Intouchables,32.806221
1891,The Empire Strikes Back,29.911111


__Movies Top 5 - Lowest ROI__

In [45]:
movies10.nsmallest(5,'ROI')[['title','ROI']]

id,title,ROI
14844,Chasing Liberty,5.217391e-07
18475,The Cookout,7.5e-07
33927,Deadfall,1.8e-06
10944,In the Cut,1.916667e-06
98339,The Samaritan,0.0002100833


__Movies Top 5 - Most Votes__

In [47]:
movies.nlargest(5,'vote_count')[['title','vote_count']]

id,title,vote_count
27205,Inception,14075.0
155,The Dark Knight,12269.0
19995,Avatar,12114.0
24428,The Avengers,12000.0
293660,Deadpool,11444.0


__Movies Top 5 - Highest Rating__

In [58]:
# Only consider movies with at least 10 votes, then rank by average rating
movies[movies['vote_count'] >= 10].nlargest(5, 'vote_average')[['title', 'vote_average']]

id,title,vote_average
130824,As I Was Moving Ahead Occasionally I Saw Brief...,9.5
420714,Planet Earth II,9.5
26397,The Civil War,9.2
19404,Dilwale Dulhania Le Jayenge,9.1
409926,Cosmos,9.1


__Movies Top 5 - Lowest Rating__

In [59]:
movies[movies['vote_count']>=10].nsmallest(5,'vote_average')[['title','vote_average']]

id,title,vote_average
279988,Extinction: Nature Has Evolved,0.0
398818,Call Me by Your Name,0.0
341689,How to Talk to Girls at Parties,0.0
22727,The Beast of Yucca Flats,1.6
13383,Santa Claus,1.6


__Movies Top 5 - Most Popular__

In [60]:
movies.nlargest(5,'popularity')[['title','popularity']]

id,title,popularity
211672,Minions,547.488298
297762,Wonder Woman,294.337037
321612,Beauty and the Beast,287.253654
339403,Baby Driver,228.032744
177572,Big Hero 6,213.849907


## Find your next Movie

3. __Filter__ the Dataset for movies that meet the following conditions:

In [61]:
movies.columns

Index(['title', 'tagline', 'release_date', 'genres', 'belongs_to_collection',
       'original_language', 'budget_musd', 'revenue_musd',
       'production_companies', 'production_countries', 'vote_count',
       'vote_average', 'popularity', 'runtime', 'overview', 'spoken_languages',
       'poster_path', 'cast', 'cast_size', 'crew_size', 'director',
       'profit_musd'],
      dtype='object')

__Search 1: Science Fiction Action Movie with Bruce Willis (sorted from high to low Rating)__

In [126]:
movies1 = movies.copy()
# drop rows without genre or cast information
movies1 = movies1.dropna(subset=['genres', 'cast'])
# keep only movies tagged as both Science Fiction and Action
movies1 = movies1[movies1['genres'].apply(lambda genres: {'Science Fiction', 'Action'} <= set(genres.split('|')))]
# keep only movies with Bruce Willis in the cast
movies1 = movies1[movies1['cast'].apply(lambda cast: 'Bruce Willis' in cast.split('|'))]
# sort from high to low rating
movies1.sort_values(by='vote_average', ascending=False)[['title', 'genres', 'cast', 'vote_average']]

id,title,genres,cast,vote_average
18,The Fifth Element,Adventure|Fantasy|Action|Thriller|Science Fiction,Bruce Willis|Gary Oldman|Ian Holm|Milla Jovovi...,7.3
59967,Looper,Action|Thriller|Science Fiction,Joseph Gordon-Levitt|Bruce Willis|Emily Blunt|...,6.6
95,Armageddon,Action|Thriller|Science Fiction|Adventure,Bruce Willis|Billy Bob Thornton|Ben Affleck|Li...,6.5
19959,Surrogates,Action|Science Fiction|Thriller,Bruce Willis|Radha Mitchell|Rosamund Pike|Jame...,5.9
72559,G.I. Joe: Retaliation,Adventure|Action|Science Fiction|Thriller,Dwayne Johnson|D.J. Cotrona|Adrianne Palicki|B...,5.4
307663,Vice,Thriller|Science Fiction|Action|Adventure,Ambyr Childers|Thomas Jane|Bryan Greenberg|Bru...,4.1


__Search 2: Movies with Uma Thurman and directed by Quentin Tarantino (sorted from short to long runtime)__

In [154]:
movies2 = movies.copy()
# remove null values
movies2.dropna(subset = ['cast', 'director'], how = 'any', inplace = True)
movies2 = movies2[movies2['cast'].apply(lambda cast: 'Uma Thurman' in cast.split('|'))]
movies2 = movies2[movies2['director'].apply(lambda director: 'Quentin Tarantino' in director.split('|'))]
movies2.sort_values(by='runtime')

id,title,tagline,release_date,genres,belongs_to_collection,original_language,budget_musd,revenue_musd,production_companies,production_countries,...,popularity,runtime,overview,spoken_languages,poster_path,cast,cast_size,crew_size,director,profit_musd
24,Kill Bill: Vol. 1,Go for the kill.,2003-10-10,Action|Crime,Kill Bill Collection,en,30.0,180.949,Miramax Films|A Band Apart|Super Cool ManChu,United States of America,...,25.261865,111.0,An assassin is shot at the altar by her ruthle...,English|日本語|Français,<img src='http://image.tmdb.org/t/p/w185//v7Ta...,Uma Thurman|Lucy Liu|Vivica A. Fox|Daryl Hanna...,36,161,Quentin Tarantino,150.949
393,Kill Bill: Vol. 2,The bride is back for the final cut.,2004-04-16,Action|Crime|Thriller,Kill Bill Collection,en,30.0,152.159461,Miramax Films|A Band Apart|Super Cool ManChu,United States of America,...,21.533072,136.0,The Bride unwaveringly continues on her roarin...,English|普通话|Español|广州话 / 廣州話,<img src='http://image.tmdb.org/t/p/w185//2yhg...,Uma Thurman|David Carradine|Daryl Hannah|Micha...,27,130,Quentin Tarantino,122.159461
680,Pulp Fiction,Just because you are a character doesn't mean ...,1994-09-10,Thriller|Crime,,en,8.0,213.928762,Miramax Films|A Band Apart|Jersey Films,United States of America,...,140.950236,154.0,"A burger-loving hit man, his philosophical par...",English|Español|Français,<img src='http://image.tmdb.org/t/p/w185//d5iI...,John Travolta|Samuel L. Jackson|Uma Thurman|Br...,54,87,Quentin Tarantino,205.928762


__Search 3: Most Successful Pixar Studio Movies between 2010 and 2015 (sorted from high to low Revenue)__

In [181]:
movies3 = movies.copy()
# remove null values
movies3.dropna(subset = ['production_companies','release_date'], how = 'any', inplace = True)
movies3 = movies3[movies3['production_companies'].apply(lambda x: 'Pixar Animation Studios' in x.split('|'))]
movies3['release_yr'] = movies3['release_date'].apply(lambda date: int(date.split('-')[0]))
movies3 = movies3[(movies3['release_yr'] >= 2010) & (movies3['release_yr'] <= 2015)]
movies3.sort_values(by = 'revenue_musd', ascending = False)[['title','production_companies','release_date','revenue_musd']]

id,title,production_companies,release_date,revenue_musd
10193,Toy Story 3,Walt Disney Pictures|Pixar Animation Studios,2010-06-16,1066.969703
150540,Inside Out,Walt Disney Pictures|Pixar Animation Studios,2015-06-09,857.611174
62211,Monsters University,Walt Disney Pictures|Pixar Animation Studios,2013-06-20,743.559607
49013,Cars 2,Walt Disney Pictures|Pixar Animation Studios,2011-06-11,559.852396
62177,Brave,Walt Disney Pictures|Pixar Animation Studios,2012-06-21,538.983207
105864,The Good Dinosaur,Walt Disney Pictures|Pixar Animation Studios,2015-11-14,331.926147
40619,Day & Night,Walt Disney Pictures|Pixar Animation Studios,2010-06-17,
200481,The Blue Umbrella,Pixar Animation Studios,2013-02-12,
213121,Toy Story of Terror!,Walt Disney Pictures|Pixar Animation Studios,2013-10-15,
83564,La luna,Pixar Animation Studios,2011-01-01,


__Search 4: Action or Thriller Movie with original language English and minimum Rating of 7.5 (most recent movies first)__

In [188]:
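movies4 = movies.copy()
# remove rows without spoken-language information
movies4.dropna(subset = ['spoken_languages'], how = 'any', inplace = True)
# keep movies in which English is spoken (this checks spoken_languages, not original_language)
movies4 = movies4[movies4['spoken_languages'].apply(lambda lang: 'English' in lang.split('|'))]
# apply the rating threshold and show the most recent movies first;
# the genre condition (Action or Thriller) is not applied in this cell
movies4[movies4['vote_average'] >= 7.5].sort_values(by = 'release_date', ascending = False)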

id,title,tagline,release_date,genres,belongs_to_collection,original_language,budget_musd,revenue_musd,production_companies,production_countries,...,popularity,runtime,overview,spoken_languages,poster_path,cast,cast_size,crew_size,director,profit_musd
374471,Porto,,2017-09-14,Romance|Drama,,en,,,,Poland|Portugal|France|United States of America,...,2.152189,75.0,Jake and Mati are two outsiders in the norther...,English|Français|Português,<img src='http://image.tmdb.org/t/p/w185//2jsN...,Lucie Lucas|Anton Yelchin|Paulo Calatré|Franço...,5,17,Gabe Klinger,
460135,LEGO DC Super Hero Girls: Brain Drain,,2017-08-30,Animation,DC Super Hero Girls Collection,en,,,Warner Bros. Animation,United States of America,...,8.413734,,"When Supergirl, Wonder Woman, Batgirl, Bumbleb...",English|Polski|Português,<img src='http://image.tmdb.org/t/p/w185//8wz6...,Grey Griffin|Tara Strong|Anais Fairweather|Tea...,17,6,Todd Grimes,
411741,Ingrid Goes West,She'll follow you.,2017-08-11,Drama|Comedy,,en,,,Star Thrower Entertainment|141 Entertainment|Neon,United States of America,...,3.647596,97.0,Ingrid becomes obsessed with a social network ...,English,<img src='http://image.tmdb.org/t/p/w185//3LEy...,Aubrey Plaza|Elizabeth Olsen|O'Shea Jackson Jr...,21,19,Matt Spicer,
374720,Dunkirk,The event that shaped our world,2017-07-19,Action|Drama|History|Thriller|War,,en,100.000000,519.876949,Canal+|Studio Canal|Warner Bros.|Syncopy|RatPa...,Netherlands|France|United Kingdom|United State...,...,30.938854,107.0,The miraculous evacuation of Allied soldiers f...,English|Français|Deutsch,<img src='http://image.tmdb.org/t/p/w185//ebSn...,Fionn Whitehead|Tom Glynn-Carney|Jack Lowden|H...,66,214,Christopher Nolan,419.876949
416477,The Big Sick,An awkward true story.,2017-06-23,Comedy|Drama|Romance,,en,,52.620184,FilmNation Entertainment|Apatow Productions|Am...,United States of America,...,23.424794,120.0,Pakistan-born comedian Kumail Nanjiani and gra...,English|اردو,<img src='http://image.tmdb.org/t/p/w185//qquE...,Kumail Nanjiani|Zoe Kazan|Holly Hunter|Ray Rom...,44,129,Michael Showalter,
461634,Rory Scovel Tries Stand-Up for the First Time,,2017-06-20,Comedy,,en,,,Netflix,United States of America,...,0.593041,66.0,Comedian Rory Scovel storms the stage in Atlan...,English,<img src='http://image.tmdb.org/t/p/w185//z4pp...,Rory Scovel,1,2,Scott Moran,
382614,The Book of Henry,Never leave things undone.,2017-06-16,Thriller|Drama|Crime,,en,10.000000,4.219536,Sidney Kimmel Entertainment|Double Nickel Ente...,United States of America,...,24.553725,105.0,"Naomi Watts stars as Susan, a single mother of...",English,<img src='http://image.tmdb.org/t/p/w185//suLF...,Naomi Watts|Jaeden Lieberher|Jacob Tremblay|Sa...,27,27,Colin Trevorrow,-5.780464
433471,Lou,,2017-06-16,Animation,,en,,,Pixar Animation Studios,,...,1.770297,6.0,A Pixar short about a lost-and-found box and t...,English,<img src='http://image.tmdb.org/t/p/w185//zAQm...,,0,1,Dave Mullins,
382127,Manifesto,ALL CURRENT ART IS FAKE,2017-06-15,Drama,,en,0.010000,,Bayerischer Rundfunk|Schiwago Film,Australia|Germany,...,2.022670,130.0,"Manifesto draws on the writings of Futurists, ...",English|Italiano,<img src='http://image.tmdb.org/t/p/w185//eszm...,Cate Blanchett|Erika Bauer|Ruby Bustamante|Car...,12,2,Julian Rosefeldt,
461805,The Putin Interviews,Know Your Enemy,2017-06-12,Documentary,,en,,,Ixtlan Productions|Showtime Documentary Films|...,United States of America,...,0.642527,240.0,"Academy Award-winning filmmaker, Oliver Stone ...",English|Pусский,<img src='http://image.tmdb.org/t/p/w185//irsz...,Vladimir Putin|Oliver Stone,2,2,Oliver Stone,
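
The three conditions named in the Search 4 heading (genre Action or Thriller, original language English, rating of at least 7.5) can be combined as boolean masks. The following is a minimal sketch on a made-up toy DataFrame; the variable names (`wanted`, `has_genre`, `is_english`, `well_rated`) and all values are assumptions for illustration, while the column names follow the dataset.

```python
import pandas as pd

# Toy data standing in for the movies DataFrame (values are made up).
toy = pd.DataFrame(
    {
        "title": ["A", "B", "C", "D", "E", "F"],
        "genres": ["Action|Drama", "Comedy", "Thriller", "Action", "Thriller|Crime", None],
        "original_language": ["en", "en", "fr", "en", "en", "en"],
        "vote_average": [8.0, 9.0, 8.5, 7.0, 7.5, 8.8],
        "release_date": ["2010-05-01", "2015-01-01", "2012-03-03",
                         "2017-07-07", "2014-02-02", "2016-06-06"],
    }
)
toy["release_date"] = pd.to_datetime(toy["release_date"])

# Genre condition: at least one of the wanted genres appears in the
# pipe-separated genre string; a missing genre entry counts as no match.
wanted = {"Action", "Thriller"}
has_genre = toy["genres"].fillna("").apply(lambda g: bool(wanted & set(g.split("|"))))

# Language condition uses original_language, not spoken_languages.
is_english = toy["original_language"] == "en"

# Rating condition: minimum average rating of 7.5.
well_rated = toy["vote_average"] >= 7.5

# Combine all three masks and show the most recent movies first.
result = toy[has_genre & is_english & well_rated].sort_values(
    "release_date", ascending=False
)
print(result[["title", "genres", "vote_average", "release_date"]])
```

Building each condition as a named mask keeps the final filter readable and makes it easy to check one condition at a time.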


## Are Franchises more successful?

4. __Analyze__ the Dataset and __find out whether Franchises (Movies that belong to a collection) are more successful than stand-alone movies__ in terms of:

- mean revenue
- median Return on Investment
- mean budget raised
- mean popularity
- mean rating

hint: use groupby()

__Franchise vs. Stand-alone: Average Revenue__

In [5]:
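movies1 = movies.copy()
# Label each movie: no collection entry means stand-alone, otherwise franchise
movies1['franchise'] = np.where(movies1['belongs_to_collection'].isnull(), 'Stand_alone', 'Franchise')
# Mean revenue (million USD) per group; missing revenues are ignored
movies1.groupby(by='franchise')['revenue_musd'].mean()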

franchise
Franchise      165.708193
Stand_alone     44.742814
Name: revenue_musd, dtype: float64

__Franchise vs. Stand-alone: Return on Investment / Profitability (median)__

__Franchise vs. Stand-alone: Average Budget__

__Franchise vs. Stand-alone: Average Popularity__

__Franchise vs. Stand-alone: Average Rating__
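
The remaining comparisons follow the same `groupby()` pattern as the average revenue. As a minimal sketch, assuming a toy DataFrame with made-up values and a hypothetical `roi` helper column, all five metrics can be computed in one named aggregation:

```python
import numpy as np
import pandas as pd

# Toy data standing in for the movies DataFrame (values are made up).
toy = pd.DataFrame(
    {
        "belongs_to_collection": ["X Collection", "X Collection", None, None, None],
        "budget_musd": [100.0, 50.0, 10.0, 20.0, np.nan],
        "revenue_musd": [400.0, 100.0, 10.0, 60.0, 5.0],
        "popularity": [20.0, 10.0, 2.0, 4.0, 3.0],
        "vote_average": [7.0, 6.0, 5.0, 8.0, 5.0],
    }
)

# Same labelling rule as in the average revenue cell.
toy["franchise"] = np.where(
    toy["belongs_to_collection"].isna(), "Stand_alone", "Franchise"
)
# ROI = revenue / budget; NaN where the budget is missing.
toy["roi"] = toy["revenue_musd"] / toy["budget_musd"]

# One row per group, one column per metric; NaN values are skipped.
summary = toy.groupby("franchise").agg(
    mean_revenue=("revenue_musd", "mean"),
    median_roi=("roi", "median"),
    mean_budget=("budget_musd", "mean"),
    mean_popularity=("popularity", "mean"),
    mean_rating=("vote_average", "mean"),
)
print(summary)
```

The median is used for ROI because a few movies with tiny budgets can produce extreme ratios that would dominate a mean.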

## Most Successful Franchises

5. __Find__ the __most successful Franchises__ in terms of

- __total number of movies__
- __total & mean budget__
- __total & mean revenue__
- __mean rating__
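
A possible starting point is to group by `belongs_to_collection` and aggregate several columns at once. The sketch below uses a made-up toy DataFrame; the output column names (`n_movies`, `total_budget`, and so on) are assumptions chosen for readability.

```python
import pandas as pd

# Toy data standing in for the movies DataFrame (values are made up).
toy = pd.DataFrame(
    {
        "title": ["X1", "X2", "Y1", "Solo"],
        "belongs_to_collection": ["X Collection", "X Collection", "Y Collection", None],
        "budget_musd": [100.0, 50.0, 30.0, 10.0],
        "revenue_musd": [400.0, 100.0, 90.0, 20.0],
        "vote_average": [7.0, 6.0, 8.0, 5.0],
    }
)

# groupby() drops rows whose key is missing, so stand-alone movies
# are excluded automatically.
franchises = toy.groupby("belongs_to_collection").agg(
    n_movies=("title", "count"),
    total_budget=("budget_musd", "sum"),
    mean_budget=("budget_musd", "mean"),
    total_revenue=("revenue_musd", "sum"),
    mean_revenue=("revenue_musd", "mean"),
    mean_rating=("vote_average", "mean"),
)

# Rank by any of the metrics, here by total revenue.
franchises = franchises.sort_values("total_revenue", ascending=False)
print(franchises)
```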

## Most Successful Directors

6. __Find__ the __most successful Directors__ in terms of

- __total number of movies__
- __total revenue__
- __mean rating__
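
The director ranking can be sketched the same way. Because a mean rating based on a single movie is not very informative, the example below also applies a minimum number of movies before ranking by rating; the threshold `min_movies = 2`, the variable names, and the toy data are assumptions for illustration only.

```python
import pandas as pd

# Toy data standing in for the movies DataFrame (values are made up).
toy = pd.DataFrame(
    {
        "title": ["M1", "M2", "M3", "M4", "M5", "M6"],
        "director": ["P", "P", "P", "Q", "R", "R"],
        "revenue_musd": [100.0, 200.0, None, 10.0, 50.0, 50.0],
        "vote_average": [7.0, 8.0, 6.0, 9.5, 8.0, 9.0],
    }
)

# One row per director; sum() and mean() skip missing values.
directors = toy.groupby("director").agg(
    n_movies=("title", "count"),
    total_revenue=("revenue_musd", "sum"),
    mean_rating=("vote_average", "mean"),
)

# Hypothetical threshold: rank by rating only directors with enough movies.
min_movies = 2
by_rating = directors[directors["n_movies"] >= min_movies].sort_values(
    "mean_rating", ascending=False
)
print(directors.sort_values("total_revenue", ascending=False))
print(by_rating)
```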