Summary

In this project I will take face value ticket prices from TicketMaster for concerts in the largest US markets, as well as prices for tickets on resale markets (TicketMaster resale, SeatGeek), along with data on artist and genre popularity to predict markups on secondhand concert tickets.

Question

What determines the price of a concert ticket in the secondary market? Understanding this question can help artists and tour promoters more accurately price tickets and gage demand for tickets. Consumers may also find such a tool helpful in determining when ahd how to buy and sell tickets

Solution

1. Data Gathering

I gather data from 4 sources:

Ticketmaster API (Face value price information)
SeatGeek API (Resale value price information)
Stubhub API (Resale value price information)
Spotify (Popularity information on artists)

The concerts data is matched between sources based on datetime and venue name, under the assumption that a venue only has a single event at a given time. I include a few notes on how the venues were matched in the 'Venue Matching Notes' notebook.

2. Data Exploration and Feature Engineering

1. Import data, initial cleaning and feature engineering
2. Feature Summaries
  1. Continuous variables
  2. Categorical Variables
  3. Category Consolidation
3. Data Exploration
  1. Scatterplots (Spotify info, ticket listings, time information)
  2. Venue State
  3. Day of Week
  4. Genre and Subgenre
  5. Artist Count
  6. Promoter
  7. Resale Ticket Source

3. Statistical Analyses

1. Continuous Variables - Pearson & Spearman Correlation
2. Categorical Variables - T-tests, ANOVA F-tests, and Pair Tukey tests

See the "Statistical Analyses" report for visualizations and findings

4. Machine Learning

Data Preprocessing
Linear Regression
1. Standard Linear Regression
2. ElasticNet
3. Lasso
4. Comparison of Methods
Classification
1. Variable & Function Setup
2. Classifier Evaluation with 3 & 4 Bins
3. Learning Curves
4. Feature Importance Rankings
5. Hyperparameter Tuning
6. Model Ensembling and Comparison on Test Set
7. Neural Net with Keras and Tensorflow

Name		Name	Last commit message	Last commit date
Latest commit History 71 Commits
Data		Data
Graphs		Graphs
Pickles		Pickles
.DS_Store		.DS_Store
.gitignore		.gitignore
1.1Venue_Matching_Notes.ipynb		1.1Venue_Matching_Notes.ipynb
1.Data_Gathering.ipynb		1.Data_Gathering.ipynb
2.Data_Exploration.ipynb		2.Data_Exploration.ipynb
2.Data_Wrangling_Steps.md		2.Data_Wrangling_Steps.md
3.1Milestone_Report.md		3.1Milestone_Report.md
3.Statistical_Analyses.md		3.Statistical_Analyses.md
3.Stats_Tests.ipynb		3.Stats_Tests.ipynb
4.MachineLearning.ipynb		4.MachineLearning.ipynb
Cap1 Final Report.pdf		Cap1 Final Report.pdf
README.md		README.md

yiaktan/Secondhand_Concert_Tickets

Folders and files

Latest commit

History

Repository files navigation

Summary

Question

Solution

1. Data Gathering

2. Data Exploration and Feature Engineering

3. Statistical Analyses

4. Machine Learning

5. Final Analysis

About

Resources

Stars

Watchers

Forks

Languages