This project (write a data science blog post) is a part of Udacity Data Scientist Nanodegree Program. In this project, I have analyzed data publicly available about Car Evaluation and was downloaded from the UC Irvine Machine Learning Repository. also I have used supervised machine learning algorithms Random Forest. in this project, I have tried to analyze and answer the following questions:
-
Is the very good car expensive?
-
Is the safety will be high on the expensive cars only?
-
Is the price of cars affect the price of maintenance?
The dataset consists of 1728 rows and 7 columns and you can find more information on the link below:
https://archive.ics.uci.edu/ml/datasets/Car+Evaluation
Class Values:
- unacc, acc, good, vgood
Attributes:
- buying: vhigh, high, med, low.
- maint: vhigh, high, med, low.
- doors: 2, 3, 4, 5more.
- persons: 2, 4, more.
- lug_boot: small, med, big.
- safety: low, med, high.
- numpy
- pandas
- matplotlib
- sklearn
you can see my finds in the blog on the link below: