Class Example 03/03/18
-
Project Title: "Evaluation of Salaries vs. Shortstop Statistical Performance in the MLB since 1985"
-
Team Members: Nick Titus, Javier Lopez, Bryan Fawcett, Jordan Tata
-
Project Description/Outline: Using datasets from data.world that have collected historical statistical and salary information, we will explore the relationship between pay and performance in Major League Baseball. This analysis will attempt to find correlations between statistical performance and monetary reward at the shortstop position, and determine infer whether salary at the shortstop position is related to team success.
-
Research questions to answer: a. Is salary of the short stop is it related to performance? -Jordan
batting average home runs Wins above Replacement - reference wrestling class example. This will need to create columns for each year per player.b. Does individual salary/team payroll correlate to individual/team success? -Javier
Possibly look into merging team and player data to see if the player lead the team to a playoffs. Another analysis is to look if the salary of the player actually makes a difference by comparing the top 5 paid players against the bottom five paid players and see how many games were won. The question is is it worth paying such high salaries?c. Does salary in short stop contribute proportionally to statistical contribution? -Nick
Idea, divide average salary of the player for their career and divide by homeruns. Compare this value with other players and compare how many times the team made it to the playoffs and world series. Come up with other statistical measures.
d. How has salary increased over time, adjusting for inflation? -Bryan
import via API or csv file for the years -
Data sources to be used: Two at this time (additional data to be added potentially), both from Data.World a. The Lahman Baseball Data Set b. Baseball Salaries Data Set sourced from USA Today
-
APIs to be consumed: Data.World
-
Rough Breakdown of Tasks: TBD
Preso Notes: