Skip to content

Salma-Mamdoh/Investigating-Netflix-Movies-Project

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Investigating-Netflix-Movies-Project 📺🍿

Overview

This project involves analyzing Netflix movie data to determine whether movie durations are getting shorter over time. The project uses Python's pandas, seaborn, and matplotlib libraries for data analysis and visualization.

Data Source

The data is sourced from a CSV file named netflix_data.csv which contains information about Netflix shows and movies, including columns like title, genre, release year, and duration.

Analysis Steps

  1. Data Loading: Load the dataset using pandas.
  2. Data Cleaning: Remove missing values and filter movies from the dataset.
  3. Data Exploration: Visualize movie durations over the years using scatter plots.
  4. Genre Color Mapping: Assign specific colors to different genres.
  5. Visualization: Create scatter plots with genre-based coloring to analyze movie durations over the years.

Findings

Based on the analysis, it appears that the average duration of movies has been declining, with a noticeable variation across different genres.

Code Usage

  • The code for data loading, cleaning, exploration, and visualization is provided in Python using libraries like pandas, seaborn, and matplotlib.
  • The analysis code is available in the provided Jupyter Notebook.

How to Run

  1. Ensure you have Python and the required libraries installed.
  2. Download the netflix_data.csv dataset and place it in the same directory.
  3. Open the Jupyter Notebook or Python script and run the code cells to perform the analysis.

Acknowledgements

The project uses the Netflix movie dataset and leverages the power of pandas, seaborn, and matplotlib for data analysis and visualization.

About

My Project to learn the Basics of Analysis on DataCamp

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published