Skip to content

IRIS Dataset is used for Exploratory Analysis and Decision Tree classifier is used to classify, experimenting with different parameters for optimization and performance analysis.

Notifications You must be signed in to change notification settings

srilakshmi-thota/IRIS-DATASET-ANALYSIS-DECISION-TREE-CLASSIFIER

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 
 
 

Repository files navigation

IRIS-DATASET-ANALYSIS-DECISION-TREE-CLASSIFIER

IRIS Dataset is used for Exploratory Analysis and Decision Tree classifier is used to classify, experimenting with different parameters like depth and number of leaf nodes of the tree.

Libraries used:
sklearn for DecisionTreeClassifier
pandas for reading the train_data and test_data

Inputs:
iris_train_data.csv
iris_test_data.csv

Outputs:
Depth of learnt tree
Number of leaf nodes of learnt tree
Training accuracy of classifier
Test accuracy using classifier

Pruning results for :
case1: reducing max_depth
Test Accuracy for Max_depth = 4
Test Accuracy for Max_depth = 3
Test Accuracy for Max_depth = 2
Test Accuracy for Max_depth = 1

Pruning results for :
case2: reducing max_leaf_nodes
Test Accuracy for Max_leaf_nodes=8
Test Accuracy for Max_leaf_nodes=7
Test Accuracy for Max_leaf_nodes=6
Test Accuracy for Max_leaf_nodes=5
Test Accuracy for Max_leaf_nodes=4
Test Accuracy for Max_leaf_nodes=3
Test Accuracy for Max_leaf_nodes=2

Functions used:
1.accuracy:
inputs: y_true y_predict
counted the number of correctly classified examples and divided it with the total number of examples and multiplied it with 100 to get the accuracy percentage.

2.pruning_by_max_leaf_nodes
input:number of leaf nodes of the classifier without pruning
Reduced the number of leaf nodes by 1 in each step by giving the max_leaf_nodes parameter to DecisionTreeClassifier and calculated the accuracy in each case and printed it accordingly

3.pruning_by_max_depth
input:depth of the classifier without pruning
Reduced the max_depth by 1 in each step by giving the max_depth parameter to DecisionTreeClassifier and calculated the accuracy in each case and printed it accordingly

About

IRIS Dataset is used for Exploratory Analysis and Decision Tree classifier is used to classify, experimenting with different parameters for optimization and performance analysis.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages