Adult_Census_Income_Project

Repo for the Adult Census Income Project

Project Overview

Exploratory Data Analysis, outliers identification and data cleaning.
Modelling using KNeighbors, Logistic Regression, Random Forest, CatBoost amd XGBoost classifiers.
Hyperparameters tuning usings RandomizedSearchCV amd GridSearchCV.
Metrics evaluation and Feature Importance.

Code and Resourses used

Python Version: 3.8.2

Packages: Pandas, Numpy, Matplotlib, Seaborn, SKlearn, XGBoost, CatBoost

EDA: Exploratory Data Analysis

The EDA shows distribution of data and relation between different features' Below are few highlights from the graphs:

Data Cleaning

Create a preprocess_data(df) function that performs transformations on the DataFrame given as parameter and returns its converted version. Below the changes function makes:

Fill missing numerical values with feature median
Convert Object data into numerical

Model Building

Split Data into train and test data
Create fit_and_score(model) function to instantiate and compare accuracy from different estimators simultaneously.
Initially 5 different models: KNeighbors Classifier Logistic Regression Random Forest Classifier XGBoost Classifier CatBoost Classifier
Hyperparameter tuning using RandomizedSearchCV and GridSearchCV for the two best performant classifiers.

Model Performance

Metrics evaluation using Cross Validation (Precision, Recall and F1 scores), ROC curve and AUC, Confusion Matrix and Classification Report
Feature Importance

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
Pictures		Pictures
1. Adult Census Income- Intro, EDA and Data Cleaning.ipynb		1. Adult Census Income- Intro, EDA and Data Cleaning.ipynb
2. Adult Census Income - Modelling and Hyperparameters tuning .ipynb		2. Adult Census Income - Modelling and Hyperparameters tuning .ipynb
3. Adult Census Income - Evaluation Metrics and Feature Importance.ipynb		3. Adult Census Income - Evaluation Metrics and Feature Importance.ipynb
Adult Census Income - End to end project.ipynb		Adult Census Income - End to end project.ipynb
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Adult_Census_Income_Project

Project Overview

Code and Resourses used

EDA: Exploratory Data Analysis

Data Cleaning

Model Building

Model Performance

About

Releases

Packages

Languages

davideragone/Adult_Census_Income_Project

Folders and files

Latest commit

History

Repository files navigation

Adult_Census_Income_Project

Project Overview

Code and Resourses used

EDA: Exploratory Data Analysis

Data Cleaning

Model Building

Model Performance

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages