MBTI Personality Prediction using Machine Learning

Introduction

The Myers Briggs Type Indicator (MBTI) is a personality type system that divides everyone into 16 distinct personalities based on four dimensions, namely: Introversion (I) - Extroversion (E), Intuition (N) - Sensing (S), Thinking (T) - Feeling (F), Judging (J) - Perceiving (P). In this project, I have developed a MBTI personality classifier that uses machine learning models to predict a person’s personality based on the social media posts per user as input.

Methodology

Exploring the Dataset

The dataset used has:

8675 rows
2 columns
- type
- posts.

The data in column ‘post’ contains 50 recent social media posts for each user. There are 16 unique labels in column ‘type’ with no null values, each representing 16 MBTI type indicators. The post column had paragraphs which required some natural language processing in order to perform the task of model training.

(Distribution of personality type in dataset)

Preprocessing

This is performed in order to reduced the inconsistency in the data by removing terms which do not contribute much to the person's personality.

Converting data in post column to lowercase so that 2 identical words written in different letter cases can be interpreted as similar.
Removing URLs and links
Removing special characters like ' , ', ' | ', ' - ' etc. and numbers
Removing extra spaces
Removing stopwords such as ‘for’, ‘them’, ‘you’ etc. using the nltk library.
Perform word Lemmatization i.e. grouping of words with the same purpose together (e.g. gone, going, went to go).

Models implemented

There are various classification algorithms present out of which I have implemented the following:

Multinomial Naive Bayes
Random Forest Classifier
LightGBM Classifier
Logistic Regression

Result & Analysis

Multinomial Naïve Bayes performed worst because of it’s poor assumption. Further , we can see ensemble model like LGBM perform the best. Accuracies of models can be increased by required hyperparameter tuning.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
README.md		README.md
Report.pdf		Report.pdf
main.ipynb		main.ipynb
personality_prediction.py		personality_prediction.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MBTI Personality Prediction using Machine Learning

Introduction

Methodology

Exploring the Dataset

Preprocessing

Models implemented

Result & Analysis

About

Releases

Packages

Languages

Aayush-Gangwar/Personality-Prediction

Folders and files

Latest commit

History

Repository files navigation

MBTI Personality Prediction using Machine Learning

Introduction

Methodology

Exploring the Dataset

Preprocessing

Models implemented

Result & Analysis

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages