Multiclass Obesity Prediction

Overview

This repository contains models that predict the obesity level of patients from their eating and lifestyle habits and their physical condition. The target is a multiclass variable with seven levels: Insufficient Weight, Normal Weight, Overweight Level I, Overweight Level II, Obesity Type I, Obesity Type II and Obesity Type III.
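As a minimal sketch of how the seven levels could be turned into integer labels for a classifier, the snippet below uses an ordered pandas categorical. The label strings are taken from the list above and the sample values are hypothetical; the dataset itself may spell the class names differently.

```python
import pandas as pd

# Class names as listed in the overview, ordered from lowest to highest weight category.
# Assumption: the actual dataset may use different spellings for these labels.
OBESITY_LEVELS = [
    "Insufficient Weight",
    "Normal Weight",
    "Overweight Level I",
    "Overweight Level II",
    "Obesity Type I",
    "Obesity Type II",
    "Obesity Type III",
]

# Hypothetical sample of target values, for illustration only.
sample = pd.Series(["Normal Weight", "Obesity Type III", "Insufficient Weight"])

# Each label is mapped to its position in OBESITY_LEVELS.
encoded = pd.Categorical(sample, categories=OBESITY_LEVELS, ordered=True).codes
print(encoded.tolist())
```

Keeping the categories in an explicit list makes the integer codes stable across runs and preserves the natural ordering of the classes.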

Dataset For Analysis

The original version of the dataset used for this project can be found at the UCI Machine Learning Repository. A synthetically expanded version is available on Kaggle as a competition dataset. Both sources share the same features, but Kaggle offers a way to check model performance on a larger data sample. The original dataset has 2111 observations, while the synthetically generated version has 34598 observations.

The data covers demographics, eating habits and physical condition of individuals from Colombia, Peru and Mexico. It contains 16 features and the target variable. The attributes related to eating habits are Frequent consumption of high caloric food, Frequency of consumption of vegetables, Number of main meals, Consumption of food between meals, Consumption of water daily, and Consumption of alcohol. The attributes related to physical condition are Calories consumption monitoring, Physical activity frequency, Time using technology devices, and Transportation used. Demographic attributes such as Gender, Age, Height and Weight were also recorded.
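The attribute groups described above can be kept in a small lookup table, which makes it easy to check that a loaded file has the expected columns. This is only a sketch: the column abbreviations (FAVC, FCVC, NCP and so on) are assumed from the public UCI release and should be verified against your copy of the data, and `missing_columns` is a hypothetical helper, not part of this repository.

```python
import pandas as pd

# Assumed column abbreviations for the attributes named above; adjust to your copy.
FEATURE_GROUPS = {
    "eating_habits": ["FAVC", "FCVC", "NCP", "CAEC", "CH2O", "CALC"],
    "physical_condition": ["SCC", "FAF", "TUE", "MTRANS"],
    "demographic": ["Gender", "Age", "Height", "Weight"],
}


def missing_columns(df: pd.DataFrame) -> list[str]:
    """Return the expected feature columns that are absent from df."""
    expected = [col for cols in FEATURE_GROUPS.values() for col in cols]
    return [col for col in expected if col not in df.columns]


# Hypothetical frame holding only a few of the columns, for illustration.
toy = pd.DataFrame({"Gender": ["Male"], "Age": [21.0], "FAVC": ["yes"]})
print(missing_columns(toy))
```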

Python Packages and Modules Needed

Pandas and NumPy are used for data manipulation and wrangling. Seaborn and Matplotlib are used for the visualizations. The machine learning models and supporting utilities are imported from Scikit-Learn and XGBoost.

Chronology

Import necessary libraries and datasets.
Identify datatypes present and make appropriate conversions.
Preprocess the dataset: deal with missing values, scale numeric features and label-encode categorical features.
Split data into the features and target.
Split data into training and test datasets (not necessary for Kaggle dataset).
Train machine learning models on the training dataset and check training accuracy.
Tune hyperparameters of the model (if necessary).
Use the trained model on the test dataset and make predictions.
Save predictions to a desired file format.
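The steps above can be sketched end to end with Scikit-Learn. This is an illustrative outline under stated assumptions, not the code from this repository: the toy data, the column names, the made-up four-class target, the choice to drop rows with missing values, the tree depth and the output path are all hypothetical.

```python
import tempfile
from pathlib import Path

import numpy as np
import pandas as pd
from sklearn.compose import ColumnTransformer
from sklearn.model_selection import train_test_split
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import OrdinalEncoder, StandardScaler
from sklearn.tree import DecisionTreeClassifier

# Hypothetical toy data standing in for the real dataset.
rng = np.random.default_rng(0)
n = 200
df = pd.DataFrame({
    "Age": rng.integers(16, 60, n).astype(float),
    "Height": rng.normal(1.70, 0.10, n).clip(1.40, 2.00),
    "Weight": rng.normal(80, 20, n).clip(40, 160),
    "Gender": rng.choice(["Female", "Male"], n),
    "MTRANS": rng.choice(["Walking", "Automobile", "Public_Transportation"], n),
})
# Made-up target derived from BMI so the toy problem is learnable.
bmi = df["Weight"] / df["Height"] ** 2
df["target"] = pd.cut(
    bmi, bins=[0, 18.5, 25, 30, np.inf],
    labels=["Insufficient", "Normal", "Overweight", "Obesity"],
).astype(str)

# Deal with missing values (dropping rows is one simple option).
df = df.dropna()

# Split into features and target, then identify numeric and categorical columns.
X, y = df.drop(columns="target"), df["target"]
num_cols = X.select_dtypes(include="number").columns.tolist()
cat_cols = [c for c in X.columns if c not in num_cols]

# Split into training and test sets.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=0
)

# Scale numeric features and integer-encode categorical features.
preprocess = ColumnTransformer([
    ("num", StandardScaler(), num_cols),
    ("cat", OrdinalEncoder(handle_unknown="use_encoded_value", unknown_value=-1), cat_cols),
])
model = Pipeline([
    ("preprocess", preprocess),
    ("clf", DecisionTreeClassifier(max_depth=5, random_state=0)),
])

# Train, check accuracy, and predict on the held-out data.
model.fit(X_train, y_train)
train_acc = model.score(X_train, y_train)
test_acc = model.score(X_test, y_test)
predictions = model.predict(X_test)
print(f"train accuracy: {train_acc:.4f}, test accuracy: {test_acc:.4f}")

# Save predictions (hypothetical output path).
out_path = Path(tempfile.gettempdir()) / "obesity_predictions.csv"
pd.DataFrame({"prediction": predictions}).to_csv(out_path, index=False)
```

Wrapping the preprocessing and the classifier in one pipeline means the scaler and encoder are fitted on the training split only, so no information from the test split leaks into training. Hyperparameter tuning, where needed, could be layered on top of the same pipeline.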

Model Results

Decision Tree Classifier: The Decision Tree Classifier model attained an 89.82% accuracy on the training dataset and an 86.77% accuracy on the test dataset.

XGBoost Classifier: The XGBoost Classifier model attained a 90.5% accuracy on the training dataset and an 88.18% accuracy on the test dataset.

Simple Neural Network: The 2-layer neural network model attained an 89.57% accuracy on the training dataset and an 86.88% accuracy on the test dataset.
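To compare the three models at a glance, the reported figures can be tabulated and the gap between training and test accuracy computed. The snippet below only does arithmetic on the numbers stated above; it does not retrain or re-evaluate anything.

```python
# (training accuracy %, test accuracy %) as reported above.
results = {
    "Decision Tree": (89.82, 86.77),
    "XGBoost": (90.5, 88.18),
    "Simple Neural Network": (89.57, 86.88),
}

# Gap in percentage points between training and test accuracy.
gaps = {name: round(train - test, 2) for name, (train, test) in results.items()}

# Model with the highest reported test accuracy.
best = max(results, key=lambda name: results[name][1])

for name, (train, test) in results.items():
    print(f"{name}: train {train:.2f}%, test {test:.2f}%, gap {gaps[name]:.2f} points")
print("Highest test accuracy:", best)
```

On these figures XGBoost has both the highest test accuracy and the smallest gap between training and test accuracy (2.32 points, against 3.05 for the decision tree and 2.69 for the neural network).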

Author(s)

Abraham Ajibade (LinkedIn)
Boluwtife Olayinka (LinkedIn)
