Skip to content

foukonana/multi_emotions

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

14 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Multi-label classification

Often, we may encouter data that can be classified into more than one categories (for example movie genre, items in an image).
However, typical classification tasks involve predicting a single label, as they treat classes as being mutually exclusive.

Multi-Label Classification is the supervised learning problem where an instance may be associated with multiple labels. This is opposed to the traditional task of single-label classification (i.e., multi-class, or binary) where each instance is only associated with a single class label.

Techniques

There are two main categorizations of methods that can be used to solve for the multi-label classification problem

  • problem transformation methods and
  • algorithm adaptation methods

In the first case the learning task is transformed into more or single-label classification tasks. In the second, the algorithms are adapted so that they can handle multi-label data.


The dataset used here is the GoEmotions.
This is a dataset released from Google and it containes the emotions detected in those texts.
It is the largest manually annotated dataset of 58K English Reddit comments, labeled for 27 emotion categories or neutral.
Find the paper on arXiv.org


Set up this project

You will need:

  • Python 3.6 or higher

Create a python virtual environment, activate it and install requirements:

python3 -m venv multiemotions_env
source multiemotions_env/bin/activate
pip install -r requirements.txt

About

Baseline multi-label classification techniques on GoEmotions dataset

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published