Here we provide the data and pipeline for: Mortality in Puerto Rico after Hurricane Maria
Kishore N, Marqués D, Mahmud A, et al. Mortality in Puerto Rico after Hurricane Maria. N Engl J Med. DOI: 10.1056/NEJMsa1803972
Repository at time of publication
This repository is constantly being updated in response to feedback and inquiries; however, all code will remain entirely reproducible at any point in the commit history.
For full transparency, we wanted to note what the repository looked like before we made additional changes. Thus, the
paper release is the version of the repository that existed at the original time of publication. You can get this release by downloading it or using
git checkout paper in your local repository.
Background: Quantifying the effect on society of natural disasters is critical for recovery of public health services and infrastructure. The death toll can be difficult to assess in the aftermath of a major disaster. In September 2017, Hurricane Maria caused massive infrastructural damage to Puerto Rico, but its effect on mortality remains contentious.
Methods: Using a representative, stratified sample, we surveyed 3299 randomly chosen households across Puerto Rico to produce an independent estimate of all-cause mortality after the hurricane. Respondents were asked about displacement, infrastructure loss, and causes of death. We calculated excess deaths by comparing our estimated post-hurricane mortality rate with official rates for the same period in 2016.
Results: From the survey data, we estimated a mortality rate of 14.3 deaths (95% confidence interval [CI], 9.8 to 18.9) per 1000 persons from September 20 through December 31, 2017. This rate yielded a total of 4645 excess deaths during this period (95% CI, 793 to 8498), equivalent to a 62% increase in the mortality rate as compared with the same period in 2016. However, this number is likely to be an underestimate because of survivor bias. The mortality rate remained high through the end of December 2017, and one third of the deaths were attributed to delayed or interrupted health care. Hurricane-related migration was substantial.
Conclusions: The official estimate of 64 deaths attributed to Hurricane Maria in Puerto Rico is a substantial underestimate. This survey, based on community sampling, indicated that the number of excess deaths is likely to be more than 70 times the official estimate. (Funded by the Harvard T.H. Chan School of Public Health and others.)
a) A comparison of excess death estimates from official reports, press/academic reports, and our survey. b) Reported deaths per month in the survey, categorized by reported cause of death. Two individuals who died of similar causes are superimposed in December who died at the same age resulting in a count of 37 points representing 38 deaths after the hurricane.
code— Scripts and output for figures included in the manuscript and supplement
data— Initial data resources and survey data (more information in the folder
figures— Final figures included in manuscript
ref— pipelines used to clean raw information and generate
.RDSfiles found in
misc— Folder with a Spanish version of the paper as well as an
xmlfile with the survey instrument used to gather data.
- We have tagged the version of this repository at the time of publication under the
- Feel free to create a new branch for further incorporation and analysis
- All geospatial data has been stripped; by using this dataset, you agree to not undertake any steps to identify any respondents or their families
- More information in
For any issues with anonymization or major issues with the functionality of the script please create an issue.
The data collected and presented is licensed under the Creative Commons Attribution 3.0 license, and the underlying code used to format, analyze and display that content is licensed under the MIT license.