Skip to content

BERT classification of Myers-Brigg personality types based on Twitter tweets in four different European languages.

Notifications You must be signed in to change notification settings

wesleykwong/Myers-Brigg-Classification

Repository files navigation

Myers-Brigg Personality Classification with Twitter Feed

This is the repo for Wesley Kwong's 5th Year MIDS W266 Natural Language Processing with Deep Learning final project.

The aim of this project was to analyze how various monolingual and multilingual BERT models performed on classifying the Myers-Brigg Personality Test (MBTI) classes based on Twitter tweets. The dataset used for this project was the TwiSty dataset. The tweets are in German (DE), Spanish (ES), Italian (IT), and Dutch (NL).

Dataset: https://www.uantwerpen.be/en/research-groups/clips/research/datasets/

  • Verhoeven, B., Daelemans, W., & Plank, B. (2016) TwiSty: a multilingual Twitter Stylometry corpus for gender and personality profiling. In: Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC 2016). Portorož, Slovenia.

fastText's Aligned Word Vectors: https://fasttext.cc/docs/en/aligned-vectors.html

  • A. Joulin, P. Bojanowski, T. Mikolov, H. Jegou, E. Grave, Loss in Translation: Learning Bilingual Word Mapping with a Retrieval Criterion
  • P. Bojanowski*, E. Grave*, A. Joulin, T. Mikolov, Enriching Word Vectors with Subword Information

BERT Models:

About

BERT classification of Myers-Brigg personality types based on Twitter tweets in four different European languages.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages