View this project at the following link.
This informatics (INFO 498F) project is based on a dataset of death rates for the leading causes of death in the United States. The dataset covers mortality patterns from 1999 through 2013, with figures by state of residence for each selected year and cause of death.
The data was collected and published by the CDC (Centers for Disease Control and Prevention) and accessed through data.gov.
The dataset can be found through this data.gov link.
During our project, we sought to answer four major questions:
- What have been the leading causes of death in order from 1999 to 2013?
- What causes of death have grown the most in the past 14 years from 1999 to 2013?
- What causes of death have declined the most in the past 14 years from 1999 to 2013?
- What is the most recent proportional breakdown of causes of death, and how has it changed in the past 14 years from 1999 to 2013?
We answered these questions with multiple data visualizations. In total from 1999 to 2013, the leading causes of death were Diseases of Heart (9.7M) and Cancer (8.5M), while the least common of the listed causes were Pneumonitis (256.3k) and Homicide (261.7k).
The full list of total deaths by cause from 1999 to 2013 is:
- Diseases of Heart: 9.69M
- Cancer: 8.46M
- Stroke: 2.17M
- CLRD: 1.98M
- Unintentional Injuries: 1.73M
- Diabetes: 1.08M
- Alzheimer's disease: 1.05M
- Influenza and pneumonia: 875.1k
- Kidney Disease: 659.8k
- Suicide: 517.9k
- Septicemia: 514.8k
- Chronic liver disease and cirrhosis: 443.3k
- Essential hypertension and hypertensive renal disease: 358.1k
- Parkinson’s disease: 294.1k
- Homicide: 261.7k
- Pneumonitis due to solids and liquids: 256.3k
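Totals like these can be produced by summing deaths per cause across all years. The following is a minimal sketch of that idea, not the project's actual code: the column names (`Year`, `Cause`, `Deaths`), the helper `format_count`, and every number in the toy table are assumptions made for illustration.

```r
# Toy stand-in for the CDC table. Column names and all numbers are
# illustrative placeholders, not values from the real dataset.
deaths <- data.frame(
  Year   = c(1999, 1999, 2000, 2000, 2001, 2001),
  Cause  = c("Cancer", "Stroke", "Cancer", "Stroke", "Cancer", "Stroke"),
  Deaths = c(100, 40, 110, 35, 120, 30)
)

# Sum deaths per cause across all years, then sort from largest to smallest.
totals <- aggregate(Deaths ~ Cause, data = deaths, FUN = sum)
totals <- totals[order(-totals$Deaths), ]

# Abbreviate counts in the style of the list above (millions or thousands).
# format_count is a hypothetical helper, not a function from this project.
format_count <- function(n) {
  ifelse(n >= 1e6,
         paste0(round(n / 1e6, 2), "M"),
         paste0(round(n / 1e3, 1), "k"))
}

print(totals)
print(format_count(c(9690000, 659800)))
```

Base R's `aggregate` keeps the sketch free of package dependencies; a `dplyr` pipeline with `group_by` and `summarise` would work equally well.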
From 1999 to 2013, Alzheimer's disease grew the most, with 40,231 more deaths in 2013 than in 1999; Diseases of Heart declined the most, with 114,087 fewer deaths in 2013 than in 1999.
The full list of growth or decline in the number of deaths by cause from 1999 to 2013 is:
- Alzheimer's disease: 40,231
- Cancer: 35,043
- Unintentional Injuries: 32,697
- CLRD: 25,024
- Essential hypertension and hypertensive renal disease: 13,802
- Suicide: 11,950
- Kidney Disease: 11,587
- Parkinson's disease: 10,603
- Chronic liver disease and cirrhosis: 10,168
- Septicemia: 7,476
- Diabetes: 7,179
- Pneumonitis due to solids and liquids: 3,311
- Homicide: -768 (decline)
- Influenza and pneumonia: -6,751 (decline)
- Stroke: -38,388 (decline)
- Diseases of Heart: -114,087 (decline)
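The changes above are the 2013 count minus the 1999 count for each cause. One way to compute them, and to pick out the largest growth and the largest decline, might look like the sketch below. The function name `death_change`, the column names, and the placeholder numbers are all assumptions for illustration.

```r
# Placeholder counts for two causes in the first and last year.
# Column names and numbers are illustrative, not taken from the CDC data.
deaths <- data.frame(
  Year   = c(1999, 1999, 2013, 2013),
  Cause  = c("Cause A", "Cause B", "Cause A", "Cause B"),
  Deaths = c(500, 300, 450, 380)
)

# Change in deaths per cause between two years (positive means growth).
# death_change is a hypothetical helper, not a function from this project.
death_change <- function(df, from, to) {
  start  <- df[df$Year == from, ]
  end    <- df[df$Year == to, ]
  merged <- merge(start, end, by = "Cause", suffixes = c(".from", ".to"))
  change <- merged$Deaths.to - merged$Deaths.from
  names(change) <- merged$Cause
  change
}

change <- death_change(deaths, 1999, 2013)
print(change)
print(names(change)[which.max(change)])  # cause that grew the most
print(names(change)[which.min(change)])  # cause that declined the most
```

Merging on `Cause` rather than relying on row order keeps the subtraction aligned even if the two years list causes in a different order.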
In 1999, Cancer and Diseases of Heart together accounted for over 60% of deaths (27.1% and 35.8% respectively). In 2013, the two causes still accounted for over 57% of deaths (28.2% and 29.4% respectively).
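The combined shares follow directly from the quoted percentages, as the short check below shows. It also includes a hypothetical helper, `to_share`, that sketches how raw counts could be turned into percentage shares; the counts passed to it are placeholders.

```r
# Shares quoted above, in percent.
share_1999 <- c("Cancer" = 27.1, "Diseases of Heart" = 35.8)
share_2013 <- c("Cancer" = 28.2, "Diseases of Heart" = 29.4)

print(sum(share_1999))  # combined share in 1999
print(sum(share_2013))  # combined share in 2013

# Percentage-point change per cause between the two years.
print(share_2013 - share_1999)

# Turn raw counts into percentage shares (the counts are placeholders).
# to_share is a hypothetical helper, not a function from this project.
to_share <- function(counts) round(100 * counts / sum(counts), 1)
print(to_share(c(a = 30, b = 50, c = 20)))
```

The combined shares come to 62.9% in 1999 and 57.6% in 2013. Cancer's share rose by 1.1 percentage points, while the share for Diseases of Heart fell by 6.4.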
This project was written entirely in R using RStudio. The code is organized as follows:
- Creates two line plots: number of deaths and age-adjusted death rates by location and year.
- Shows the distribution of the death rates by year for specific causes.
- Shows the proportion of the death rate per given year in the U.S. and a bar graph of total deaths by year.
- Contains functions that calculate or create:
  - the total number of deaths from the top causes from 1999 to 2013
  - the unique death causes
  - the difference in death totals from 1999 to 2013 by cause of death
  - the cause of death with the maximum growth from 1999 to 2013
  - the cause of death with the minimum growth from 1999 to 2013
  - a bar graph of the total number of deaths from the top causes by given year
  - a pie chart of the total number of deaths from the top causes by given year
  - a bar graph of total deaths per cause from 1999 to 2013
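As a rough illustration of the per-year helpers listed above, the sketch below finds the unique causes, totals deaths for one year, and draws the bar graph and pie chart with base R graphics. The function names `unique_causes` and `totals_for_year`, the column names, and the placeholder numbers are assumptions and do not come from the project's source.

```r
# Placeholder table; column names and numbers are illustrative only.
deaths <- data.frame(
  Year   = c(1999, 1999, 2013, 2013),
  Cause  = c("Cause A", "Cause B", "Cause A", "Cause B"),
  Deaths = c(500, 300, 450, 380)
)

# The distinct causes present in the data (hypothetical helper).
unique_causes <- function(df) sort(unique(as.character(df$Cause)))

# Named vector of deaths per cause for one year, largest first
# (hypothetical helper).
totals_for_year <- function(df, year) {
  rows   <- df[df$Year == year, ]
  agg    <- aggregate(Deaths ~ Cause, data = rows, FUN = sum)
  totals <- setNames(agg$Deaths, as.character(agg$Cause))
  sort(totals, decreasing = TRUE)
}

print(unique_causes(deaths))
totals_2013 <- totals_for_year(deaths, 2013)
print(totals_2013)

# Draw only in an interactive session such as RStudio.
if (interactive()) {
  barplot(totals_2013, main = "Deaths by cause, 2013",
          ylab = "Deaths", las = 2)
  pie(totals_2013, main = "Share of deaths by cause, 2013")
}
```

Keeping the data preparation separate from the plotting calls means the same named vector can feed both the bar graph and the pie chart.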