Project: To identify clusters of passengers that have similar characteristics for the purpose of targeting different segments for different types of mileage offers.
Exploring Insights/Inferences by performing EDA on the given data. Relevant graphs were plotted to get some insights on data using seaborn package. Model fitting via K-means Clustering by Importing sklearn package.
- Pandas
- Numpy
- Matplotlib
- Seaborn
- Scikit learn
- Joblib
-
Data copying and cleaning:
- Read the csv file
- copy the data
- check for null values and other informations
- drop the duplicate values
-
Exploratory Data Analysis:
- Conduct all the necessary EDA using various graphs on the dataset
- interpret the graphs
- check for outliers and correlation among the coloumns
- perform one hot encoding in case of categorical columns
-
Sampling of data:
- Scaling the data using StandardScaler and Normalizer
-
Modelling of data:
- import K-means clustering and initialize it
- determine the k value using elbow method
- fit the model
- predict the model
Still Learning,
So feel free, Anything You wanna contirubute.