This project is a data analysis portfolio project where I explored and analyzed IMDB movie data using Python. The dataset contains information about movies, their ratings, revenue, genres, directors, and other attributes.
IMDB-movie-project/
β
βββ data/
β βββ IMDB-Movie-Data.csv
β
βββ images/
β βββ Screenshot_1.png
β βββ Screenshot_2.png
β βββ Screenshot_3.png
β βββ Screenshot_4.png
β
βββ notebook/
β βββ IMDB_movie_project.ipynb
β
βββ README.md
- Python π
- Jupyter Notebook
- Libraries Used:
- Pandas
- Numpy
- Matplotlib
- Seaborn
-
Data Loading & Cleaning
- Imported dataset into Pandas DataFrame
- Checked for missing values, duplicates, and data consistency
-
Exploratory Data Analysis (EDA)
- Summary statistics
- Genre distribution
- Correlation between rating, votes, and revenue
-
Data Visualization
- Bar plots, histograms, heatmaps, and scatter plots
- Insights on movie popularity, genre trends, and revenue patterns
-
Key Insights
- Higher rated movies often generate higher revenue
- Action and Adventure genres dominate in revenue
- Directors with consistent high ratings attract more votes
- Strong correlation between vote count and revenue
-
Clone the repository:
git clone https://github.com/your-username/IMDB-movie-project.git cd IMDB-movie-project
-
Install required libraries: pip install pandas numpy matplotlib seaborn jupyter
-
Open the notebook: jupyter notebook notebook/IMDB_movie_project.ipynb
π Result & Learning
- Learned how to perform end-to-end EDA on real-world data
- Improved skills in Python, Pandas, and Seaborn visualizations
- Gained insights into movie trends and audience preferences
- This project showcases analytical and visualization skills for data analyst roles
π About Me I am a fresher aspiring for data analyst roles with skills in:
- Python
- SQL
- Excel
- Power BI
- Data Visualization
- Statistical Analysis -Statistical Analysis