Skip to content

6shun/mbti-classifier

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

17 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Myers-Briggs Personality Type Prediction with Text Classification

Semester Project of course Introduction to Machine Learning (CompSci 289A) at UC Berkeley with Prof. Jonathan Shewchuk, Spring 2022

Documents:

Pipeline Visualization

Results:

Metrics of classifiers on test data

Confusion Matrix of best classifier, LDA

Dataset:

Kaggle dataset (MBTI) Myers-Briggs Personality Type Dataset, originally collected through PersonalityCafe forum.

This repo is structured as following:

.
├── data                            
│    ├── cleaned_mbti_train.csv           # Cleaned train data
│    ├── cleaned_mbti_test.csv            # Cleaned test data
│    └── class16_mbti_map.npy           
├── docs                                 
│    ├── Final_Project_Report.pdf         # Final Project Report
│    └── Final_Presentation_Slides.pdf    # Project Presentation Slides
├── notebooks                             # Collection of notebooks
│    ├── bestmodels.ipynb                 # Best model of each classifier, final parameter
│    └── bestmodels_include_tuning.ipynb  # Best model of each classifier, tuning process
├── result                                # results: scores, cm, plot
└── README.md