Skip to content

IRIS Dataset The Iris flower data set is a multivariate data set introduced by the British statistician and biologist Ronald Fisher in his 1936 paper The use of multiple measurements in taxonomic problems. It is sometimes called Anderson's Iris data set because Edgar Anderson collected the data to quantify the morphologic variation of Iris flowe…

Notifications You must be signed in to change notification settings

antrixsh/iris-data

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 

Repository files navigation

iris-data

IRIS Dataset The Iris flower data set is a multivariate data set introduced by the British statistician and biologist Ronald Fisher in his 1936 paper The use of multiple measurements in taxonomic problems. It is sometimes called Anderson's Iris data set because Edgar Anderson collected the data to quantify the morphologic variation of Iris flowers of three related species. The data set consists of 50 samples from each of three species of Iris (Iris Setosa, Iris virginica, and Iris versicolor). Four features were measured from each sample: the length and the width of the sepals and petals, in centimeters.

Problem Statement

  1. Importing libraries
  2. Reading Dataset
  3. Data Visualization
  4. Classification
  5. Prediction

Libreary Used in the code and why?

  1. pandas #For reading the dataset
  2. numpy #For ndarrays
  3. matplotlib.pyplot #For plotting of histogram graphs
  4. seaborn #For plotting violin graphs and pair plot
  5. sklearn.linear_model #For performing logistic Regression on the dataset
  6. sklearn.model_selection #For splitting dataset for training a model
  7. sklearn.preprocessing # For scaling the data
  8. sklearn.model_selection # For scoring
  9. sklearn.neighbors # For applying KNN classification algorithm
  10. sklearn.ensemble # For applying Random Forest algorithm

About

IRIS Dataset The Iris flower data set is a multivariate data set introduced by the British statistician and biologist Ronald Fisher in his 1936 paper The use of multiple measurements in taxonomic problems. It is sometimes called Anderson's Iris data set because Edgar Anderson collected the data to quantify the morphologic variation of Iris flowe…

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published