Skip to content

ankygupta9999/coronary-heart-disease-prediction

Repository files navigation

coronary-heart-disease-prediction

Problem Statement:

A healthcare organization together with a couple of government hospitals in a city has collected information about the vitals that would reveal if the person might have a coronary heart disease in the next ten years or not. This study is useful in early identification of disease and have medical intervention if necessary. This would help not only in improving the health conditions but also the economy as it has been identified that health performance and economic performance are interlinked.
As a data scientist, you are required to construct a classification model based on the available data and evaluate its efficacy. Your activities should include - performing various activities pertaining to the data such as, preparing the dataset for analysis; checking for any correlations; creating a model; evaluating the performance of the classification model. Visualizations would be a value add.

Important Points to be considered

  • Missing Data may be represented by either NAs, Blanks or values such as -999/-99 etc. Please check for various possibilities.
  • Results can be varying from team to team. But, you should be able to justify your result.

EDA (Exploratory Data Analysis)

- Target Column: Visualizing Distribution.
- Missing Data: Check on features with Missing data.
- Univariate Analysis: Plot the features distribution to identify skewness.
- Bivariate Analysis: Plot the features against Target to visualize the relation.
- Multicolinearity check: Identify if any features which are correlated with other features.

Data Preprocessing.

- Data Imputation: Imputing the missing values.
- Data scaling/normalization: Normalizing/ Scaling the data features.

Select Train/Test data.

- Train-Test split: Splitting the data into training set and testing set. Split ratio: 70 (training):30 (testing)

Training the model.

- Training Model: Training differnt models on training set.

Model Evaluation.

- Model Evaluation: Evaluating the performance of the model and finalizing the model.

• Any interesting observations • Challenges faced and how you mitigated the challenges • Assumptions if any

Suggestions: DM me.

About

Classification problem: Solved via Decision Tree

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published