Skip to content

A classifier that can identify the species of iris flowers based on the length and width of their leaves

License

Notifications You must be signed in to change notification settings

MdTanvirHossainTusher/Iris-Flower-Classification

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Iris-Flower-Classification

Overview

A multilabel-classifier that can identify the species of iris flowers based on the length and width of their leaves

Data Collection

Data has already available here. The dataset contains only 150 observations with 3 species. Each species have 50 observations.

Data Preprocessing

Data has no NULL, Duplicate value. No imbalance issue in the dataset.

Model Training

Datasets is trained using Logistic Regression, Decision Tree, Support Vector Machine (SVM) and Random Forest classification models.

Result Analysis

In the table we see the beseline models accuracy, training and testing accuracy of the respective models.

Model Baseline Accuracy Training Accuracy Testing Accuracy
Logistic Regression 0.341 0.967 0.967
Decision Tree 0.341 1.000 0.967
SVM 0.341 0.967 0.967
Random Forest 0.341 0.967 0.967

All the models beat the baseline model and their performance all almost similar. Due to Decision Tree's flexibility it may overfit sometimes.

Features Distribution Property

From the image, we can see that, there is a Normal Distribution in Sepal Length & Width (first two images) but Bi-Directional in Petal Length & Width (last two images).

sepal length
sepal width
petal length
petal width