This project includes some studies on data science with Python. I did my studies using the Iris dataset.
You must install the following libraries.
- Scikit-learn
- Pandas
- Numpy
- Matplotlib
- Finding minimum, maximum, average and standard deviation values of all features and printing them on the screen.
- Plotting scatter charts for these attributes and saving them as png.
- Sepal Length vs. Sepal Width
- Petal Length vs. Petal Width
- Sepal Length and Sepal Width vs. Petal Length and Petal Width
- Creation of a decision tree for classification.
- All attributes were used when creating a decision tree.
- 80% of the data were randomly selected for education. The remainder was used for testing.
- Confusion matrix printed.
- Precision and recall criteria were printed.
- The generated decision tree was graphically drawn, displayed, and saved as a file in png.