Skip to content

This Python script shows how to use decision tree classification method to predict university types. It provides detailed steps to preprocess data, consolidate variables, bin variables, apply classification, create confusion matrix, plot ROC and calculate AUC, recall rate, precision rate, etc.

manqi11/Decision_Tree_Classification_University_Admission

Repository files navigation

Decision Tree Classification of the University Admission Data

In this Pythomn script, I process the admission to Teaching vs. Research Universities data. The dataset contains missing values for some of the variables. I will preprocess the data, remove the outliers, and replace the missing values.

Additionaly, I performed the consolidation and bin a categorical data.

Then, I will pick a binomial variable that presents university type to conduct some supervised learning analysis.

In the end, I will generate the Confusion Matrix, calculate the accuracy rate, precision rate, recall rate, error rate, and F1 score to examine how well my model fitted. I will also plot the ROC and calculated the AUC to visualize the results.

More details are decribed for each step.

About

This Python script shows how to use decision tree classification method to predict university types. It provides detailed steps to preprocess data, consolidate variables, bin variables, apply classification, create confusion matrix, plot ROC and calculate AUC, recall rate, precision rate, etc.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published