Readmitted-Rate

Introduction

This project predicts whether a patient will be readmitted, which is a supervised learning classification problem.

It applies a combination of models (KNN, SVM, Decision Tree, Perceptron and Naïve Bayes) to a real-world dataset from the UCI Machine Learning Repository.

The workflow includes:

EDA
Data Cleaning
Feature Selection
Model Tuning
Ensemble

The goal is to get the best accuracy when predicting readmission.
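
As an illustration of the ensemble step, the sketch below combines the label predictions of several classifiers by majority vote. The function name `majority_vote` and the three prediction lists are hypothetical, and the project's scripts may combine models differently (for example with weighted or probability-based voting).

```python
from collections import Counter


def majority_vote(predictions):
    """Combine per-model label lists into one list by majority vote.

    predictions: list of lists, one inner list of labels per model,
    all of the same length.
    """
    if not predictions:
        raise ValueError("need predictions from at least one model")
    n_samples = len(predictions[0])
    if any(len(p) != n_samples for p in predictions):
        raise ValueError("all models must predict the same number of samples")

    combined = []
    for sample_votes in zip(*predictions):
        # Pick the label that most models voted for on this sample
        combined.append(Counter(sample_votes).most_common(1)[0][0])
    return combined


# Hypothetical predictions from three models (1 = readmitted, 0 = not)
knn_pred = [1, 0, 1, 0]
svm_pred = [1, 1, 0, 0]
tree_pred = [0, 1, 1, 0]

ensemble_pred = majority_vote([knn_pred, svm_pred, tree_pred])
print(ensemble_pred)  # [1, 1, 1, 0]
```

Using an odd number of models avoids ties for a binary label.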

Implementation:

Put the following files in the same directory:

Smote_BackwardWrapper_Pipeline.py
ForwardWrapper_Smote.py
Results for the test set.py
GridSearch Main Version Updated.py
Base_Scoring_Algorithm (Corr, Chi2, Mutual_Info, C4_5 Importance).py
New_train_set.csv
New_test_set.csv

Filter Method:

Base_Scoring_Algorithm (Corr, Chi2, Mutual_Info, C4_5 Importance).py:

Used for feature selection. It calculates the correlation, chi-square and mutual information scores between each variable and the target label.
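
To make the three filter scores concrete, here is a minimal from-scratch sketch for discrete columns. The function names and the toy columns are assumptions for illustration. The chi-square version is the classical contingency-table statistic, which may not match what the script computes (library implementations often differ in how they treat feature values).

```python
import math
from collections import Counter


def pearson_corr(x, y):
    """Pearson correlation between two numeric columns."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        return 0.0  # a constant column carries no linear signal
    return cov / math.sqrt(var_x * var_y)


def chi2_stat(feature, target):
    """Chi-square statistic of independence from the contingency table."""
    n = len(feature)
    joint = Counter(zip(feature, target))
    f_counts, t_counts = Counter(feature), Counter(target)
    stat = 0.0
    for f, fc in f_counts.items():
        for t, tc in t_counts.items():
            expected = fc * tc / n
            observed = joint.get((f, t), 0)
            stat += (observed - expected) ** 2 / expected
    return stat


def mutual_info(feature, target):
    """Mutual information (in nats) between two discrete columns."""
    n = len(feature)
    joint = Counter(zip(feature, target))
    f_counts, t_counts = Counter(feature), Counter(target)
    return sum(
        (c / n) * math.log(c * n / (f_counts[f] * t_counts[t]))
        for (f, t), c in joint.items()
    )


# Toy columns: one feature identical to the label, one independent of it
label = [0, 0, 1, 1]
informative = [0, 0, 1, 1]
uninformative = [0, 1, 0, 1]

for name, col in [("informative", informative), ("uninformative", uninformative)]:
    print(name, pearson_corr(col, label), chi2_stat(col, label), mutual_info(col, label))
```

All three scores are high for the informative column and zero for the independent one, which is the property a filter method relies on when ranking features.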

GridSearch Main Version Updated.py:

Uses the scoring results to select features with the filter method, then tunes the different models on the selected features using grid search.
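
The two steps described here, keeping the top-scoring features and then searching a parameter grid, can be sketched without any library as follows. Everything named below (`top_k_features`, `grid_search`, `toy_score`, the feature names `f1` to `f4`, the scores and the grid) is hypothetical. In the real script the evaluation function would be a cross-validated model score rather than the stand-in used here.

```python
from itertools import product


def top_k_features(scores, k):
    """Return the k feature names with the highest filter scores."""
    ranked = sorted(scores, key=scores.get, reverse=True)
    return ranked[:k]


def grid_search(param_grid, evaluate):
    """Try every parameter combination and keep the best-scoring one.

    param_grid: dict mapping parameter name -> list of candidate values.
    evaluate: callable taking a params dict and returning a score
              (higher is better).
    """
    names = list(param_grid)
    best_params, best_score = None, float("-inf")
    for values in product(*(param_grid[name] for name in names)):
        params = dict(zip(names, values))
        score = evaluate(params)
        if score > best_score:
            best_params, best_score = params, score
    return best_params, best_score


# Hypothetical filter scores for four placeholder features
scores = {"f1": 0.12, "f2": 0.30, "f3": 0.05, "f4": 0.41}
selected = top_k_features(scores, k=2)


def toy_score(params):
    """Stand-in for a cross-validated accuracy; peaks at n_neighbors=7."""
    bonus = 0.02 if params["weights"] == "distance" else 0.0
    return 0.6 - 0.01 * abs(params["n_neighbors"] - 7) + bonus


best_params, best_score = grid_search(
    {"n_neighbors": [3, 5, 7, 9], "weights": ["uniform", "distance"]},
    toy_score,
)
print(selected)     # ['f4', 'f2']
print(best_params)  # {'n_neighbors': 7, 'weights': 'distance'}
```

An exhaustive grid grows multiplicatively with each added parameter, so keeping the candidate lists short matters when several models are tuned.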

Results for the test set.py:

Computes different accuracy scores on the test set with the best tuned parameters.
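
The source does not list which scores are computed. As a sketch, assuming the usual binary-classification set (accuracy, precision, recall and F1), they can be derived from the confusion counts as shown below. The function name and the example labels are hypothetical.

```python
def classification_scores(y_true, y_pred, positive=1):
    """Accuracy, precision, recall and F1 for a binary label."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    tn = sum(1 for t, p in pairs if t != positive and p != positive)

    accuracy = (tp + tn) / len(pairs)
    # Guard against division by zero when a class is never predicted or present
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if precision + recall else 0.0)
    return {"accuracy": accuracy, "precision": precision,
            "recall": recall, "f1": f1}


# Hypothetical test-set labels and predictions (1 = readmitted)
y_true = [1, 0, 1, 1, 0, 0]
y_pred = [1, 0, 0, 1, 1, 0]
test_scores = classification_scores(y_true, y_pred)
print(test_scores)
```

On an imbalanced label such as readmission, accuracy alone can look good for a model that rarely predicts the minority class, which is why recall and F1 are worth reporting next to it.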

Wrapper Method:

Forward wrapper to search for the globally best collection of features.

ForwardWrapper_Smote.py:

Takes the training set data as input and returns the selected features as a CSV file.
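
A forward wrapper is typically a greedy loop: start with no features, repeatedly add the one that improves the model score most, and stop when nothing helps. The sketch below shows that loop under stated assumptions: `forward_selection`, `toy_score` and the feature names are hypothetical, and `score_fn` stands in for whatever validation score the script uses. Being greedy, the search is a heuristic and does not guarantee the best possible subset.

```python
def forward_selection(features, score_fn, min_gain=0.0):
    """Greedy forward wrapper.

    features: list of candidate feature names.
    score_fn: callable taking a list of feature names and returning a
              score (higher is better), e.g. cross-validated accuracy.
    min_gain: stop when the best addition improves the score by no more
              than this amount.
    """
    selected = []
    remaining = list(features)
    best_score = float("-inf")

    while remaining:
        # Score every candidate when added to the current subset
        candidates = [(score_fn(selected + [f]), f) for f in remaining]
        top_score, top_feature = max(candidates, key=lambda pair: pair[0])
        if top_score - best_score <= min_gain:
            break  # no candidate improves the score enough
        selected.append(top_feature)
        remaining.remove(top_feature)
        best_score = top_score

    return selected, best_score


# Toy score: features "a" and "b" help, every feature costs a small penalty
gains = {"a": 0.3, "b": 0.2}


def toy_score(subset):
    return sum(gains.get(f, 0.0) for f in subset) - 0.01 * len(subset)


chosen, score = forward_selection(["a", "b", "c", "d"], toy_score)
print(chosen)  # ['a', 'b']
```

The returned list could then be written out with the standard `csv` module to match the script's CSV output.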

Smote_BackwardWrapper_Pipeline.py:

Uses a pipeline to perform SMOTE, backward wrapper selection and tuning.

Computes different accuracy scores on the test set with the best tuned parameters.

Saves the best parameters, test scores and selected features as CSV.
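
For readers unfamiliar with SMOTE, the core idea is to create synthetic minority-class samples by interpolating between a minority point and one of its nearest minority neighbours. The from-scratch sketch below illustrates only that idea. It is not the implementation the script uses, and `smote_oversample`, its parameters and the toy points are all hypothetical.

```python
import random


def smote_oversample(minority, n_new, k=3, seed=0):
    """Generate n_new synthetic points from a list of minority samples.

    minority: list of equal-length numeric tuples.
    k: number of nearest minority neighbours to interpolate towards.
    """
    if len(minority) < 2:
        raise ValueError("need at least two minority samples")
    rng = random.Random(seed)
    k = min(k, len(minority) - 1)

    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority)
        # k nearest minority neighbours of the base point (excluding itself)
        neighbours = sorted(
            (p for p in minority if p is not base),
            key=lambda p: sum((a - b) ** 2 for a, b in zip(base, p)),
        )[:k]
        neighbour = rng.choice(neighbours)
        gap = rng.random()  # position along the segment base -> neighbour
        synthetic.append(
            tuple(a + gap * (b - a) for a, b in zip(base, neighbour))
        )
    return synthetic


# Toy minority class: four points on the corners of a unit square
minority_points = [(1.0, 1.0), (2.0, 1.0), (1.0, 2.0), (2.0, 2.0)]
new_points = smote_oversample(minority_points, n_new=5)
print(len(new_points))  # 5
```

Running the oversampling inside a pipeline means it is applied only to the training folds during tuning, so synthetic samples do not leak into the validation or test data.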

Latest commit: Marco Wang, "update", bda00b5, Jun 9, 2019 (46 commits)

Name	Last commit message	Last commit date
Data_Cleaning_and_Engineering	Delete Placeholder.txt	Dec 3, 2018
Feature_Selection	Add files via upload	Dec 2, 2018
Graphes	Add files via upload	Nov 16, 2018
.gitignore	update	Jun 9, 2019
Base_Scoring_Algorithm (Corr, Chi2, Mutual_Info, C4_5 Importance).py	update	Jun 9, 2019
Feature Importantance and Ensemble Learning in Patient Readmission (Working Title).docx	update	Jun 9, 2019
Final_project.py	Update Final_project.py	Nov 28, 2018
Final_project_data_cleaning.py	update	Jun 9, 2019
ForwardWrapper_Smote.py	update	Jun 9, 2019
GridSearch Main Version Updated.py	update	Jun 9, 2019
New_test_set.csv	update	Jun 9, 2019
New_train_set.csv	update	Jun 9, 2019
README.md	update	Jun 9, 2019
ReadMe.txt	update	Jun 9, 2019
Results for the test set.py	update	Jun 9, 2019
Smote_BackwardWrapper_Pipeline.py	update	Jun 9, 2019
diabetic_data.csv	Add files via upload	Nov 7, 2018

MarcoXM/Readmittedrate
