This project predicts the likelihood of a stroke using machine learning and deep learning techniques. It includes exploratory data analysis (EDA), data preprocessing, and model training with traditional and ensemble learning methods.
- Analyzing feature distributions
- Correlation analysis with stroke occurrence
- Handling missing values
- Feature scaling and encoding
- SMOTE for dataset balancing
- KNN, Logistic Regression, Decision Tree, Naive Bayes, SVM
- Voting Classifier, Bagging Classifier, Stacking Classifier
- Neural Networks (MLP) with dropout & early stopping
- Recurrent Neural Networks (RNN) with hyperparameter tuning
pandas,numpy,seaborn,matplotlib
sklearn,imblearn
tensorflow,keras
- Clone the repository and install dependencies:
git clone <repo_link> cd stroke_prediction pip install -r requirements.txt
- Run the Jupyter Notebook for data analysis and model training.
- Evaluate models using accuracy, confusion matrix, and classification reports.
- Multiple models are evaluated to determine the best-performing classifier.
- Stacking Classifier and Neural Networks achieve the highest accuracy.