Data and replication code for "Global warming, home runs, and the future of America's pastime," published in the Bulletin of the American Meteorological Society by Christopher Callahan, Nate Dominy, Jerry DeSilva, and Justin Mankin. The paper is available here.
The repository is organized into Scripts/, Figures/, and Data/ folders.
-
Scripts/: All code required to reproduce the findings of our work is included in this folder.
-
Figures/: The final figures from the main text are included in this folder.
-
Data/: This folder includes intermediate and processed summary data that enable replication of most the figures and numbers cited in the text. Some of the files for the climate model projections are quite large, so they are not provided here. Should you desire any of this underlying data, feel free to contact me at Christopher.W.Callahan.GR (at) dartmouth (dot) edu and I will be happy to organize a mass data transfer.
Finally, this repository includes game log and event file data from Retrosheet as well as data on individual batted balls from Statcast. Please credit these organizations if you use any of this data for any reason.
Each script performs a component of the analysis as follows:
Process_Park_HadISD_Data.py
assembles park-level time series of weather variables from the HadISD weather station data.Process_Retrosheet_Temp.ipynb
extracts gametime temperature data from the Retrosheet event files.Construct_Baseball_Panel.py
combines the baseball and weather data into a panel dataset for the regression analysis.Process_Statcast_Distance_Data.ipynb
performs a similar function for the more recent Statcast batted-ball-level data.Temp_HR_Regression.R
performs the core regression analysis andStatcast_HR_Regression.R
performs the supplemental Statcast-based regression.CMIP6_Historical_ParkTemp.py
andCMIP6_Future_ParkTemp.py
construct past and future park-level time series of temperature from the CMIP6 climate models.CMIP6_GMST.py
calculates from global mean temperature from the same models.Generate_Future_Seasons.py
produces a set of randomly generated plausible baseball seasons for the future projections.CMIP6_HR_Attribution.py
performs the attribution of home runs to historical global warming andCMIP6_HR_Projections.py
performs the future projections of home runs over the 21st century.Fig1.ipynb
,Fig2.ipynb
, andFig3.ipynb
plot the main text figures.Plot_TimePeriod_Fig.ipynb
andPlot_RH_Density.ipynb
plot the supplementary figures.