UCI Data Analytics Bootcamp | Project 3
Nico Barzotti | Jake Boden | Justine Pile
This dataset was created by combining different datasets already available independently but not combined before. In this dataset, 5 heart datasets are combined with over 11 common features which makes it the largest heart disease dataset available so far for research purposes. The five datasets used for its curation are:
- Cleveland: 303 observations
- Hungarian: 294 observations
- Switzerland: 123 observations
- Long Beach VA: 200 observations
- Stalog (Heart) Data Set: 270 observations
Total: 1190 observations
Duplicated: 272 observations
Final Data: 918 observations
Every dataset used can be found under the Index of heart disease datasets from UCI Machine Learning Repository on the following link: https://archive.ics.uci.edu/ml/machine-learning-databases/heart-disease/
fedesoriano. (September 2021). Heart Failure Prediction Dataset. Retrieved 04/10/2023 from https://www.kaggle.com/fedesoriano/heart-failure-prediction
[
{
"age": 40,
"ageGroup": "40-49",
"chestPainType": "ATA",
"cholesterol": 289,
"exerciseAngina": "N",
"fastingBS": 0,
"heartDisease": 0,
"maxHR": 172,
"oldPeak": 0.0,
"restingBP": 140,
"restingECG": "Normal",
"sex": "M",
"stSlope": "Up"
},
{
"age": 49,
"ageGroup": "40-49",
"chestPainType": "NAP",
"cholesterol": 180,
"exerciseAngina": "N",
"fastingBS": 0,
"heartDisease": 1,
"maxHR": 156,
"oldPeak": 1.0,
"restingBP": 160,
"restingECG": "Normal",
"sex": "F",
"stSlope": "Flat"
},
...