This Python code analyses Netflix movie data by performing the following tasks:
- Importing necessary libraries: pandas for data manipulation and matplotlib for visualisation.
- Read the Netflix data from a CSV file into a DataFrame.
- Filtering the DataFrame to include only movies.
- Selecting relevant columns such as title, country, genre, release year, and duration.
- Filtering movies with a duration shorter than 60 minutes.
- Assigning colors to movies based on their genres (Children, Documentaries, Stand-Up, and others).
- Creating a scatter plot to visualize the relationship between movie duration and release year.
- Answering the question "Are movies getting shorter?" with a simple "no".
- Incorporate more advanced statistical analysis techniques.
- Enhance visualization by exploring different plot types and styles.
- Expand the analysis to include TV shows and other content types available on Netflix.
- Data Source (https://www.kaggle.com/datasets/shivamb/netflix-shows)