# 🎬 Movie Database Analysis - Project Report  
### **Group 19**
**Abhijeet (055002), Rohan Jha (055057)**


##  1. Project Information & Data Source

### **Project Name:**  
**Movie Database Analysis using MongoDB Compass & Atlas Charts**  

### **Data Source:**  
- The movie dataset is sourced from **public movie databases** such as IMDb, TMDb, or Kaggle datasets.  
- The dataset consists of **structured JSON documents** stored in **MongoDB Atlas**.  
- **Key Fields in the Dataset:**  
  - `title` (Movie name)  
  - `year` (Release year)  
  - `genre` (List of genres)  
  - `rating` (IMDb rating)  
  - `actors` (List of actors)  
  - `director` (Director name)  
  - `box_office` (Revenue in USD)  
  - `runtime` (Movie duration in minutes)  



## 2. Problem Statements

#### **1. Understanding Movie Success Factors**
The movie industry is highly competitive, and understanding what makes a movie successful is crucial.  
Factors like **IMDb ratings, genre, cast, director, and box office revenue** play a key role in determining success.  
How can we **identify key attributes** that contribute to a movie's success?

#### **2. Genre Popularity & Trends**
Different movie genres have different audience appeal.  
Some genres are more **critically acclaimed**, while others generate **higher box office revenue**.  
How has **genre popularity changed over time**, and which genres perform best?

#### **3. Movie Ratings vs. Box Office Revenue**
A high IMDb rating does not always translate to high box office earnings.  
Which **genres and types of movies** achieve both **high ratings and strong financial performance**?  
Is there a **correlation between IMDb rating and revenue**?

#### **4. Actor & Director Impact on Movie Performance**
Certain actors and directors consistently deliver **high-rated or high-grossing** movies.  
Which **actors and directors have the most influence** on a movie's success?  
Do some actors perform better in specific genres?

#### **5. Evolution of Movie Production Over Time**
The **number of movies produced annually** has significantly increased.  
What are the key **historical trends in movie releases**?  
How has the **runtime, genre preference, and movie format** changed over decades?

#### **6. Optimal Movie Runtime for Audience Engagement**
Longer movies allow for **more storytelling depth**, but **shorter movies retain audience attention better**.  
What is the **ideal movie runtime**, and does it vary by genre?  
How does runtime affect **IMDb ratings and box office performance**?

#### **7. The Rise of Streaming & Impact on Movie Trends**
Streaming platforms like **Netflix, Amazon Prime, and Disney+** have changed how movies are produced and consumed.  
How has the **growth of streaming services** impacted **movie genres, runtimes, and audience preferences**?

#### **8. Awards & Critical Acclaim**
Some genres win **more awards** than others, affecting their prestige and marketability.  
Which **genres and types of movies** have historically won the most **awards**?  
How does **award recognition correlate with IMDb ratings and revenue**?

#### **9. Recommendations for Movie Studios & Streaming Platforms**
Using data-driven insights, **how can production houses and streaming services**:  
- Optimize **genre selection** for audience engagement?  
- Improve **casting and director choices**?  
- Balance **box office success with critical acclaim**?   

##  Objective  

The goal of this project is to:  
- **Analyze trends in movie data** using MongoDB queries & visualizations.  
- **Find top-performing movies** based on ratings, genres, and revenue.  
- **Identify patterns in movie releases** (year-wise, genre popularity, actor collaborations).  
- **Understand factors affecting movie success** (rating vs. revenue, genre impact).  
- **Provide recommendations** for movie production strategies.  



##  3. Analysis & Queries  

###  **Query 1: Top 5 Highest-Rated Movies**
```json
[
  { "$sort": { "rating": -1 } },
  { "$limit": 5 },
  { "$project": { "_id": 0, "title": 1, "rating": 1 } }
]
```

###  **Query 2: Number of Movies per Genre**
```json
[
  { "$unwind": "$genre" },
  { "$group": { "_id": "$genre", "count": { "$sum": 1 } } },
  { "$sort": { "count": -1 } }
]
```

###  **Query 3: Number of Movies Released per Year**
```json
[
  { "$group": { "_id": "$year", "count": { "$sum": 1 } } },
  { "$sort": { "_id": 1 } }
]
```

###  **Query 4: Top 10 Highest-Grossing Movies**
```json
[
  { "$sort": { "box_office": -1 } },
  { "$limit": 10 },
  { "$project": { "_id": 0, "title": 1, "box_office": 1 } }
]
```

###  **Query 5: Top 5 Most Frequently Appearing Actors**
```json
[
  { "$unwind": "$actors" },
  { "$group": { "_id": "$actors", "count": { "$sum": 1 } } },
  { "$sort": { "count": -1 } },
  { "$limit": 5 }
]
```

![Image](0.png)

![Image](1.png)

![Image](2.png)

![Image](3.png)

![Image](4.png)

![Image](5.png)

![Image](6.png)

![Image](7.png)

![Image](8.png)

![Image](9.png)

![Image](Movie%20Analytics%20Dashboard.png)

##  4. Observations  

 **Genre Trends:**  
- **Action, Comedy, and Drama** are the most common genres.  
- **Sci-Fi & Fantasy movies** tend to have **higher IMDb ratings** on average.  

 **Yearly Movie Production Trends:**  
- A **gradual increase** in the number of movies released post-2000.  
- **2020 shows a decline** (likely due to COVID-19 pandemic effects).  

 **Top-Performing Movies:**  
- High IMDb ratings **do not always** translate to high box-office earnings.  
- **Superhero movies** dominate the highest-grossing category.  
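
One way to check the rating-versus-revenue observation is to group movies into rating bands and compare average revenue per band. The pipeline below is a minimal sketch in the same style as the queries in Section 3. It assumes `rating` and `box_office` are stored as numbers. The band boundaries are illustrative choices, not values taken from the dashboard. The upper boundary of `11` is exclusive, so a rating of 10 still falls in the last band.

```json
[
  { "$match": { "rating": { "$type": "number" }, "box_office": { "$type": "number" } } },
  {
    "$bucket": {
      "groupBy": "$rating",
      "boundaries": [0, 5, 6, 7, 8, 9, 11],
      "default": "other",
      "output": {
        "movies": { "$sum": 1 },
        "avg_box_office": { "$avg": "$box_office" }
      }
    }
  }
]
```

If average revenue does not rise steadily from one band to the next, that supports the point that high ratings alone do not guarantee high earnings.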

 **Actor & Director Insights:**  
- Certain actors appear **frequently in blockbuster movies** (e.g., Leonardo DiCaprio, Robert Downey Jr.).  
- Directors like **Christopher Nolan & Steven Spielberg** consistently produce **high-rated & high-grossing** films. 
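
A director-level summary along these lines could be produced with a single `$group` stage. This is a sketch, assuming `director` holds one name per movie and that `rating` and `box_office` are numeric. The minimum of 3 movies per director is an arbitrary cutoff added here to filter out one-off results.

```json
[
  {
    "$group": {
      "_id": "$director",
      "movies": { "$sum": 1 },
      "avg_rating": { "$avg": "$rating" },
      "total_box_office": { "$sum": "$box_office" }
    }
  },
  { "$match": { "movies": { "$gte": 3 } } },
  { "$sort": { "avg_rating": -1 } },
  { "$limit": 10 }
]
```

Sorting on `total_box_office` instead of `avg_rating` would rank directors by revenue rather than by critical reception.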

##  Observations from Movie Analytics Dashboard

##### **1. Genre Popularity Over Time**
- The **number of movies produced** has increased significantly over the years, especially after **1980**.
- In more recent years, the **rise of digital streaming platforms** has likely contributed to the growing number of movie releases.
- **Action, Drama, and Comedy** continue to dominate the annual release trends.

##### **2. Annual Movie Releases by Genre**
- **Action, Drama, and Comedy** have the highest number of releases.
- **Fantasy and Sci-Fi movies** have seen a steady increase in recent decades.
- **Documentaries and Independent films** are gaining popularity.

##### **3. Average IMDb Ratings by Genre**
- **Documentary, War, and Biography** movies tend to have **higher IMDb ratings**.
- **Action and Horror movies** generally receive lower ratings.
- **Critically acclaimed movies** are often from niche genres.
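
The average rating per genre can be sketched as the pipeline below, which reuses the `$unwind` pattern from the genre-count query. It assumes `genre` is an array and `rating` is numeric. The threshold of 10 movies per genre is a hypothetical value chosen to keep very small genres from dominating the ranking.

```json
[
  { "$unwind": "$genre" },
  {
    "$group": {
      "_id": "$genre",
      "avg_rating": { "$avg": "$rating" },
      "movies": { "$sum": 1 }
    }
  },
  { "$match": { "movies": { "$gte": 10 } } },
  { "$sort": { "avg_rating": -1 } }
]
```

Replacing `"$rating"` with `"$runtime"` gives the corresponding average runtime per genre.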

##### **4. Actor Appearance Frequency by Genre**
- Some actors frequently appear in **specific genres**, especially **Romance, Drama, and Action**.
- **A-list actors** tend to dominate box-office performance in high-budget movies.
- **Character actors** often specialize in specific genres like Horror or Sci-Fi.
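
Actor frequency per genre can be approximated by unwinding both array fields and grouping on the actor and genre pair. This is a sketch, assuming `actors` and `genre` are both arrays. The limit of 20 rows is an arbitrary choice for readability.

```json
[
  { "$unwind": "$actors" },
  { "$unwind": "$genre" },
  {
    "$group": {
      "_id": { "actor": "$actors", "genre": "$genre" },
      "appearances": { "$sum": 1 }
    }
  },
  { "$sort": { "appearances": -1 } },
  { "$limit": 20 }
]
```

Because a movie with several genres is counted once per genre, the totals here can exceed an actor's actual number of films.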

##### **5. Top-Rated Action Movies**
- The **highest-rated Action movies** include well-known classics.
- There appears to be a **strong association** between a movie's **director, cast, and IMDb rating**.
- Well-directed action films tend to perform well both critically and commercially.
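
A list of top-rated Action movies can be retrieved with a filter on the `genre` array followed by a sort. The sketch below assumes the genre label is stored exactly as `"Action"`. The limit of 10 is illustrative.

```json
[
  { "$match": { "genre": "Action" } },
  { "$sort": { "rating": -1 } },
  { "$limit": 10 },
  { "$project": { "_id": 0, "title": 1, "director": 1, "rating": 1 } }
]
```

Matching a string against an array field selects every document whose array contains that value, so movies tagged with several genres are included.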

##### **6. Evolution of Movie Genres Over the Years**
- **Sci-Fi, Fantasy, and Animation genres** have grown significantly over time.
- **Western movies**, once dominant, have declined in popularity.
- Advances in **CGI & VFX technology** have boosted genres like Sci-Fi and Superhero films.

##### **7. Average Movie Runtime by Genre**
- **History, Biography, and Musical movies** have the longest runtimes.
- **Animation and Horror movies** tend to be shorter in duration.
- **Movies with longer runtimes** are often **critically acclaimed** but may perform worse at the box office.

##### **8. Average Ratings Breakdown by Movie Genre**
- **Film-Noir and History genres** have some of the highest IMDb ratings.
- **Comedy and Horror movies** have more varied ratings, with both **hits and flops**.
- **Drama films** tend to be the most awarded and critically analyzed.

##### **9. Trends in Average Movie Runtime Over the Years**
- The **average runtime of movies** has remained stable between **100-130 minutes**.
- **Older movies (pre-1980s)** had longer runtimes, while **modern audiences** prefer slightly shorter movies.
- Streaming platforms may be influencing **shorter runtimes** for better engagement.
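
The runtime trend can be examined per decade rather than per year to smooth out noisy years. The pipeline below is a sketch, assuming `year` and `runtime` are numeric. The decade is derived by subtracting `year mod 10` from `year`, so 1994 becomes 1990.

```json
[
  { "$match": { "year": { "$type": "number" }, "runtime": { "$type": "number" } } },
  {
    "$group": {
      "_id": { "$subtract": ["$year", { "$mod": ["$year", 10] }] },
      "avg_runtime": { "$avg": "$runtime" },
      "movies": { "$sum": 1 }
    }
  },
  { "$sort": { "_id": 1 } }
]
```

The `movies` count is kept alongside the average so that decades with very few titles can be read with caution.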

##### **10. Awards and Recognitions by Genre**
- **Drama and Romance movies** win the most awards.
- **Action and Comedy movies** perform well commercially but receive fewer awards.
- **Historical and War movies** often win technical and screenplay awards.



##  5. Insights & Recommendations  

##### **1. For Movie Producers**
- **Invest in Sci-Fi and Action genres** for higher audience engagement and box-office success.  
- **Consider producing Biography and Documentary films** for higher IMDb ratings and award recognition.  
- **Leverage franchise potential** (e.g., Superhero, Sci-Fi, and Fantasy) as they consistently perform well.  
- **Ensure a strong script and storytelling** for longer movies (History, Biography) to maintain audience engagement.  

---

##### **2. For Streaming Platforms (Netflix, Amazon Prime, Disney+)**
- **Prioritize critically acclaimed genres** such as Documentary, Biography, and Drama.  
- **Increase investment in Sci-Fi & Animation**, as their popularity is growing rapidly.  
- **Optimize movie runtimes** to stay within the ideal range of **100-130 minutes** for better audience retention.  
- **Analyze actor-director trends** to secure exclusive content deals with high-performing teams.  

---

##### **3. For Marketing & Distribution Teams**
- **Use IMDb ratings and genre trends** in promotional campaigns to highlight audience interest.  
- **Leverage popular actors and directors** to maximize audience reach and pre-release hype.  
- **Promote movies based on storytelling depth** in genres like Drama and Romance for awards and critical acclaim.  
- **Target regional markets** by analyzing which genres perform well in different demographics.  

---

##### **4. For Film Festival & Award Recognition**
- **Focus on Drama, War, and Biography movies**, as they have the highest award-winning potential.  
- **Develop strong screenplays and character-driven narratives**, which are more likely to receive critical acclaim.  
- **Optimize movie releases for award seasons** to increase visibility and impact.  

---

##### **5. For Future Industry Trends**
- **Monitor the impact of streaming platforms** on movie trends and audience preferences.  
- **Invest in data-driven decision-making** to analyze box-office performance before movie releases.  
- **Leverage AI & Big Data** to predict audience demand and optimize production strategies.  
- **Analyze decade-based trends** to determine which genres and storytelling techniques remain timeless.  

##  6. Conclusion & Next Steps  
This project provided **data-driven insights** into the movie industry using **MongoDB Compass & Atlas Charts**.  

 **Next Steps:**  
 - **Expand dataset** by adding user reviews and streaming performance data.  
 - **Analyze international movie trends** beyond Hollywood.  
 - **Apply Machine Learning** for **predicting movie success** based on historical data.  

The **movie industry is evolving**, and **data-driven insights** can help:  
- Identify **key success factors** for movies.  
- Track **trends in audience preferences**.  
- Provide **actionable insights** for production studios, streaming platforms, and marketers.  

🎬 **With these insights and recommendations, stakeholders can make more informed decisions in movie production, marketing, and streaming services to maximize both financial success and critical acclaim!** 
