# Pro Wrestling Events Data Analysis

**Author:** _Bryan Sauka_  
**Date:** _2025-06-14_

---

## Table of Contents
1. [Introduction](#introduction)  
2. [Dataset Overview](#dataset-overview)  
3. [Data Cleaning](#data-cleaning)  
4. [Exploratory Data Analysis](#exploratory-data-analysis)  
5. [Key Findings](#key-findings)  
6. [Conclusion](#conclusion)  

---

## Introduction <a id="introduction"></a>

This analysis explores data from professional wrestling events to identify trends in event frequency, attendance, and promotion activity over time. The data includes detailed information on event dates, promotions, locations, match types, outcomes, and attendance, providing a comprehensive view of the professional wrestling landscape. By examining these variables, the analysis aims to uncover patterns in how events are organized, broadcast, and received by fans.


---

## Dataset Overview <a id="dataset-overview"></a>

- **Source:** Raw event and match data was scraped from cagematch.net using Python, loaded into BigQuery, then cleaned and modeled with dbt (data build tool)
- **Data Range:** 1/2021 through 6/2025
- **Filters:** Limited to three event types (TV-Shows, Pay Per Views, and Premium Live Events) for both promotions, AEW and WWE. Matches with 0 event attendance were removed
- **Size:** 5,150 records after filtering; each record represents one wrestling match
- **Columns:**
    - calendar_date
    - promotion
    - event_name
    - event_type
    - event_show_name
    - event_show_number
    - event_broadcast_type
    - event_broadcast_network
    - event_arena_name
    - event_city
    - event_state
    - event_country
    - match_type
    - is_match_stipulation
    - is_title_match
    - match_winner
    - match_loser
    - match_ended_by
    - match_time_limit
    - match_title_change
    - match_duration_seconds
    - event_attendance
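
The filters described above can be sketched in plain Python. This is only an illustration, not the dbt logic used for the report: the sample rows are made up, and the exact `event_type` label strings are assumptions.

```python
# Assumed spellings of the three event types kept in the dataset.
KEPT_EVENT_TYPES = {"TV-Show", "Pay Per View", "Premium Live Event"}

# Made-up rows that reuse column names from the list above.
matches = [
    {"promotion": "AEW", "event_type": "TV-Show", "event_attendance": 4200},
    {"promotion": "WWE", "event_type": "Premium Live Event", "event_attendance": 0},
    {"promotion": "WWE", "event_type": "House Show", "event_attendance": 6100},
    {"promotion": "WWE", "event_type": "TV-Show", "event_attendance": 9800},
]


def keep_match(row):
    """Keep matches from the selected event types that have a non-zero attendance."""
    attendance = row.get("event_attendance") or 0  # treat missing attendance as 0
    return row["event_type"] in KEPT_EVENT_TYPES and attendance > 0


filtered = [row for row in matches if keep_match(row)]
print(len(filtered), "of", len(matches), "sample matches kept")
```

Treating a missing attendance the same as 0 is a choice made for this sketch; the report only states that matches with 0 attendance were removed.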
      
---

## Data Cleaning <a id="data-cleaning"></a>

The data cleaning and modeling process was performed using dbt, following a structured approach to ensure data quality and usability:

- **Staging Layer:**  
  Raw data from cagematch.net was loaded and cleaned: duplicates were removed, missing values were handled, and formats were standardized for dates and categorical fields

- **Dimension and Fact Tables:**  
  Cleaned data was modeled into dimension tables (e.g., arenas, events, locations, matches) and fact tables (e.g., match durations, event attendance) to support analysis and reporting

- **Reporting Layer:**  
  Final reporting tables were created by joining fact and dimension tables, providing a user-friendly and analysis-ready dataset

- **Validation:**  
  Data was validated at each step to ensure accuracy, consistency, and completeness

This layered dbt workflow ensures a transparent, reproducible, and well-organized data pipeline, resulting in a high-quality dataset for analysis.
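
The staging steps were built in dbt, but the same ideas (deduplication, missing-value handling, date standardization) can be illustrated with a short Python sketch. The function name, the raw date format, and the sample rows are all assumptions made for this example.

```python
from datetime import datetime


def clean_staging_rows(raw_rows, date_format="%d.%m.%Y"):
    """Drop duplicate events, standardize dates to ISO 8601, and turn blanks into None.

    date_format is an assumed raw format, not one confirmed by the report.
    """
    seen = set()
    cleaned = []
    for row in raw_rows:
        iso_date = datetime.strptime(row["calendar_date"].strip(), date_format).date().isoformat()
        promotion = row["promotion"].strip().upper()
        event_name = row["event_name"].strip()
        attendance_text = (row.get("event_attendance") or "").replace(",", "").strip()
        attendance = int(attendance_text) if attendance_text else None

        key = (iso_date, promotion, event_name)
        if key in seen:
            continue  # skip rows that repeat an event already kept
        seen.add(key)
        cleaned.append({
            "calendar_date": iso_date,
            "promotion": promotion,
            "event_name": event_name,
            "event_attendance": attendance,
        })
    return cleaned


# Made-up raw rows for illustration only.
raw_rows = [
    {"calendar_date": "05.03.2022", "promotion": " aew ", "event_name": "Example Show", "event_attendance": "4,500"},
    {"calendar_date": "05.03.2022", "promotion": "AEW", "event_name": "Example Show ", "event_attendance": "4,500"},
    {"calendar_date": "12.03.2022", "promotion": "wwe", "event_name": "Sample Event", "event_attendance": ""},
]
staged = clean_staging_rows(raw_rows)
print(staged)
```

Keeping a missing attendance as `None` rather than 0 lets later steps tell "unknown" apart from "zero".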

---

## Exploratory Data Analysis <a id="exploratory-data-analysis"></a>

### 1. Executive Summary  
**Observations**
- Avg attendance is moving in opposite directions for the two promotions
  - AEW has been on the decline YoY since 2021
  - WWE has been on the rise YoY since 2021

Thoughts: CM Punk's arrival in AEW in 9/2021, his first return to action after a 7-year absence, created buzz and added star power to the promotion. This buzz likely contributed to AEW's higher avg attendance during that period. CM Punk departed AEW in 9/2023 and later returned to WWE. WWE also joined the TKO umbrella in 9/2023, which opened up more opportunities and reach for the promotion

**Avg Attendance Over Time**
![avg attendance by month](images/avg_attendance_by_month.png)
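
Because each record is a match, attendance repeats on every match of the same event. One way to compute a monthly average without weighting events by their match count is to collapse to one row per event first. The sketch below assumes ISO-formatted `calendar_date` strings and uses made-up rows; it is not necessarily how the chart above was built.

```python
from collections import defaultdict
from statistics import mean


def avg_attendance_by_month(match_rows):
    """Average event attendance per (promotion, 'YYYY-MM'), counting each event once."""
    events = {}
    for row in match_rows:
        key = (row["promotion"], row["calendar_date"], row["event_name"])
        events[key] = row["event_attendance"]  # same value on every match of an event

    by_month = defaultdict(list)
    for (promotion, date, _event_name), attendance in events.items():
        by_month[(promotion, date[:7])].append(attendance)
    return {key: mean(values) for key, values in sorted(by_month.items())}


# Made-up rows: one event with two matches, one event with a single match.
sample_matches = [
    {"promotion": "AEW", "calendar_date": "2022-03-05", "event_name": "Example Show", "event_attendance": 5000},
    {"promotion": "AEW", "calendar_date": "2022-03-05", "event_name": "Example Show", "event_attendance": 5000},
    {"promotion": "AEW", "calendar_date": "2022-03-12", "event_name": "Sample Event", "event_attendance": 3000},
]
monthly = avg_attendance_by_month(sample_matches)
print(monthly)
```

In this sample the event-level average is 4,000, while a plain mean over the three match rows would be about 4,333 because the two-match event is counted twice.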


- Event counts for WWE remain steady YoY, while AEW continues to add a large number of events each year

Thoughts: AEW could be burning out its fan base by continuing to add events and over-exposing its stars

**Events & Attendance Summary**
![summary table of events and attendance by year](images/summary_table_by_year_events_attendance.png)
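
The YoY comparisons in this section come down to a percent change between consecutive years. A minimal sketch of that calculation follows; the function name is hypothetical and the yearly values are placeholders, not figures from the summary table.

```python
def yoy_pct_change(value_by_year):
    """Return {year: percent change vs. the prior year} for consecutive years only."""
    changes = {}
    years = sorted(value_by_year)
    for prev_year, year in zip(years, years[1:]):
        previous = value_by_year[prev_year]
        if year - prev_year != 1 or not previous:
            continue  # skip gaps in the series and division by zero
        changes[year] = round(100 * (value_by_year[year] - previous) / previous, 1)
    return changes


# Placeholder yearly event counts, for illustration only.
event_counts = {2021: 100, 2022: 120, 2023: 90}
changes = yoy_pct_change(event_counts)
print(changes)
```

The same function works for avg attendance by year, so both columns of a yearly summary can be compared on the same footing.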


### 2. Geographic Distribution  
**Observations**
- AEW had success in London, England: the 2 events held there drew 6x the avg attendance of AEW's next-highest location. AEW has not matched that avg attendance anywhere outside of London
- WWE draws well wherever it goes: avg attendance is steady across its top locations

Thoughts: This likely reflects WWE's much stronger brand recognition

**Avg Attendance by Location**
![avg attendance by location, AEW](images/avg_attendance_by_location_AEW.png)
![avg attendance by location, WWE](images/avg_attendance_by_location_WWE.png)
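
A comparison such as "the top location drew N times the next-highest location" can be derived from per-city averages. The sketch below assumes event-level rows (one row per event) and uses made-up cities and attendance values; both function names are hypothetical.

```python
from collections import defaultdict
from statistics import mean


def avg_attendance_by_city(events):
    """Average attendance per city from event-level rows."""
    by_city = defaultdict(list)
    for event in events:
        by_city[event["event_city"]].append(event["event_attendance"])
    return {city: mean(values) for city, values in by_city.items()}


def top_to_runner_up_ratio(avg_by_city):
    """Ratio of the highest city average to the second highest, or None if undefined."""
    ranked = sorted(avg_by_city.values(), reverse=True)
    if len(ranked) < 2 or ranked[1] == 0:
        return None
    return ranked[0] / ranked[1]


# Made-up event-level rows for illustration only.
events = [
    {"event_city": "City A", "event_attendance": 12000},
    {"event_city": "City A", "event_attendance": 14000},
    {"event_city": "City B", "event_attendance": 6500},
    {"event_city": "City C", "event_attendance": 4000},
]
city_avgs = avg_attendance_by_city(events)
ratio = top_to_runner_up_ratio(city_avgs)
print(city_avgs, ratio)
```

With only a couple of events in the top city, a ratio like this is sensitive to a single large show, so the event count per city is worth reading alongside it.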


- AEW hotspots include Chicago, New York, and Jacksonville
- WWE hotspots include Orlando, New York, Chicago, and Philadelphia

**Event Count by Location**
![event count by location, AEW](images/event_count_by_location_AEW.png)
![event count by location, WWE](images/event_count_by_location_WWE.png)


### 3. Event Distribution  
**Observations**
- The biggest events attract the largest audiences for both promotions: Pay Per Views for AEW and Premium Live Events for WWE
- The charts below list the largest events for each promotion by avg attendance


**Avg Attendance by Event**
![avg attendance by event, AEW](images/avg_attendance_by_event_name_AEW.png)
![avg attendance by event, WWE](images/avg_attendance_by_event_name_WWE.png)


### 4. Broadcast Distribution  
**Observations**
- WWE focuses on producing live events and has more contracts with larger networks than AEW


**Event Count by Broadcast Type**
![event count by broadcast type, AEW](images/event_count_by_broadcast_type_AEW.png)
![event count by broadcast type, WWE](images/event_count_by_broadcast_type_WWE.png)


**Event Count by Broadcast Network**
![event count by broadcast network, AEW](images/event_count_by_broadcast_network_AEW.png)
![event count by broadcast network, WWE](images/event_count_by_broadcast_network_WWE.png)


### 5. Match Distribution  
**Observations**
- Both promotions follow a similar match duration distribution, with a large % of matches lasting 10-15 min
- Wrestlers compete in 20-30 matches per year (on avg)
- The "working titles" are the titles most often competed for (AEW: TNT & TBS titles; WWE: US & Intercontinental titles). The World title is rarely defended in either promotion

Thoughts: Wrestlers usually appear and compete only on their respective weekly shows, so they have the opportunity to compete 52 weeks per year plus about 10-12 PPV/PLE events per year (roughly 62-64 matches total per year). The Matches by Wrestler by Year table can be used as a wrestler health gauge: the more matches a wrestler has, the healthier they are to compete. The limited number of matches per wrestler may also reflect an effort to limit exposure, so that fans do not tire of seeing the same thing over and over again.


**Match Duration Distribution**
![match duration distribution, AEW](images/match_duration_distribution_AEW.png)
![match duration distribution, WWE](images/match_duration_distribution_WWE.png)
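
A duration distribution like the ones above can be approximated by bucketing `match_duration_seconds` into fixed-width bins. The 5-minute bucket width, the function name, and the sample durations are assumptions for this sketch; the charts may use different bins.

```python
from collections import Counter


def duration_distribution(durations_seconds, bucket_minutes=5):
    """Percent of matches per duration bucket, keyed by labels such as '10-15 min'.

    Buckets are half-open: a match of exactly 15:00 falls in '15-20 min'.
    """
    counts = Counter()
    for seconds in durations_seconds:
        if seconds is None:
            continue  # skip matches without a recorded duration
        start = int(seconds // 60 // bucket_minutes) * bucket_minutes
        counts[f"{start}-{start + bucket_minutes} min"] += 1
    total = sum(counts.values())
    return {label: round(100 * n / total, 1) for label, n in counts.items()}


# Made-up durations in seconds, for illustration only.
sample_durations = [300, 620, 700, 899, 1500, None]
distribution = duration_distribution(sample_durations)
print(distribution)
```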


**Matches by Wrestler by Year** (top 20 wrestlers by total matches)
![summary table of matches by wrestler by year](images/summary_table_matches_by_wrestler.png)
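
Counting matches per wrestler requires combining the `match_winner` and `match_loser` columns, since a wrestler takes part in a match on either side. The sketch below assumes ISO-formatted dates and that multi-person sides are joined with `" & "`; that separator is a guess about the data, not a documented format, and the rows are made up.

```python
from collections import Counter


def matches_per_wrestler_year(match_rows, separator=" & "):
    """Count matches per (wrestler, year) across both the winning and losing side."""
    counts = Counter()
    for row in match_rows:
        year = row["calendar_date"][:4]
        for side in ("match_winner", "match_loser"):
            for name in (row.get(side) or "").split(separator):
                name = name.strip()
                if name:
                    counts[(name, year)] += 1
    return counts


# Made-up rows for illustration only.
rows = [
    {"calendar_date": "2023-01-04", "match_winner": "Wrestler A", "match_loser": "Wrestler B"},
    {"calendar_date": "2023-02-01", "match_winner": "Wrestler A & Wrestler C", "match_loser": "Wrestler B & Wrestler D"},
    {"calendar_date": "2024-01-03", "match_winner": "Wrestler B", "match_loser": "Wrestler A"},
]
wrestler_counts = matches_per_wrestler_year(rows)
print(wrestler_counts.most_common(3))
```

Dividing a wrestler's yearly count by the roughly 62-64 available dates gives a simple utilization share to compare across the roster.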


**Matches by Title** 
![title match count by title, AEW](images/title_match_count_by_match_type_AEW.png)
![title match count by title, WWE](images/title_match_count_by_match_type_WWE.png)

---

## Conclusion <a id="conclusion"></a>

- WWE is growing avg attendance while keeping its event count steady YoY
- AEW is shrinking avg attendance while expanding its number of events YoY

---

## References

- Source data: https://www.cagematch.net/
- AI help used: https://www.perplexity.ai/ 

---

*End of Report*
