YamanAlBochi/HeartDiseaseAnalysis

HeartDiseaseAnalysis

The data set consists of four databases: Cleveland, Hungary, Switzerland, and Long Beach V. It contains 76 attributes, including the predicted attribute, but all published experiments use a subset of 14 of them. The "target" field indicates the presence of heart disease in the patient. It is integer-valued: 0 = no disease and 1 = disease.

Heart disease is the number one cause of death globally. Hypertension, diabetes, excess weight, and unhealthy lifestyles all contribute to it. This project covers manual exploratory data analysis using pandas in a Jupyter Notebook. It works through the following questions:

  1. Import The Libraries And Dataset
  2. Display Top 5 Rows of The Dataset
  3. Check The Last 5 Rows of The Dataset
  4. Find Shape of Our Dataset (Number of Rows And Number of Columns)
  5. Get Information About Our Dataset Like Total Number of Rows, Total Number of Columns, Datatypes of Each Column And Memory Requirement
  6. Check Null Values In The Dataset
  7. Check For Duplicate Data and Drop Them
  8. Get Overall Statistics About The Dataset
  9. Draw Correlation Matrix
  10. How Many People Have Heart Disease, And How Many Don't Have Heart Disease In This Dataset?
  11. Find Count of Male & Female In This Dataset
  12. Find Gender Distribution According to The Target Variable
  13. Check Age Distribution In The Dataset
  14. Check Chest Pain Type
  15. Show The Chest Pain Distribution As Per Target Variable
  16. Show Fasting Blood Sugar Distribution According To Target Variable
  17. Check Resting Blood Pressure Distribution
  18. Compare Resting Blood Pressure As Per Sex Column
  19. Show Distribution of Serum Cholesterol
  20. Plot Continuous Variables
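
Several of the questions above can be sketched with plain pandas calls. The block below is a minimal illustration, not the notebook's actual code. Only the `target` column is named in this README. The other column names (`age`, `sex`, `cp`, `trestbps`, `chol`, `fbs`), the coding of `sex` as 0/1, the file name `heart.csv`, and the tiny `demo` table are assumptions made for the example.

```python
import pandas as pd


def load_heart_data(path="heart.csv"):
    """Question 1: read the dataset (the file name is an assumption)."""
    return pd.read_csv(path)


def basic_overview(df):
    """Questions 2-8: inspect the table and drop duplicate rows."""
    print(df.head())           # top 5 rows
    print(df.tail())           # last 5 rows
    print(df.shape)            # (number of rows, number of columns)
    df.info()                  # dtypes, non-null counts, memory usage
    print(df.isnull().sum())   # null values per column
    deduped = df.drop_duplicates()
    print(deduped.describe())  # overall statistics
    return deduped


def summarize(df):
    """Questions 9-12 and 18: correlation matrix and grouped counts."""
    return {
        # Assumes every column is numeric, as in the demo table below.
        "correlation": df.corr(),
        "target_counts": df["target"].value_counts(),
        "sex_counts": df["sex"].value_counts(),
        "sex_by_target": pd.crosstab(df["sex"], df["target"]),
        "mean_bp_by_sex": df.groupby("sex")["trestbps"].mean(),
    }


# Hypothetical rows for illustration only; these are not real patient records.
demo = pd.DataFrame({
    "age":      [63, 45, 63, 52, 58, 41],
    "sex":      [1, 0, 1, 1, 0, 1],
    "cp":       [3, 1, 3, 0, 2, 1],
    "trestbps": [145, 130, 145, 120, 136, 110],
    "chol":     [233, 250, 233, 210, 319, 204],
    "fbs":      [1, 0, 1, 0, 1, 0],
    "target":   [1, 1, 1, 0, 0, 1],
})

clean = basic_overview(demo)  # the third row duplicates the first and is dropped
summary = summarize(clean)
print(summary["target_counts"])
print(summary["sex_by_target"])
```

With the real file, `load_heart_data()` would take the place of `demo`. The distribution questions (13, 17, 19, 20) are usually answered with plots, for example `clean["age"].hist()`, which additionally requires matplotlib.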
