Skip to content

fusical/emotiw

Repository files navigation

Fusical: Multimodal Fusion for Video Sentiment

Introduction

Despite cultural diversity, emotions are universal. We will undertake the EmotiW 2020 challenge, doing group-level sentiment recognition on videos from across the globe. Given short clips, the goal is to predict whether the video sentiment is positive, neutral, or negative. This problem is interesting because audio-visual sentiment analysis has implications in psychology and mental health.

Our paper was accepted at the 22nd ACM International Conference on Multimodal Interaction: https://dl.acm.org/doi/10.1145/3382507.3417966

Dataset

We worked with the EmotiW 2020 dataset.

Sample image

Architecture

We ensembled models from four modalities: overall scene, pose, audio, and facial.

Ensemble Architecture

Getting Started

To start, please check out our presentation and slide deck.

Code Layout

The code is organized as follows:

  • src/ - preprocessing, generation, and classification code
  • notebooks/ - notebooks for training and prediction

Try it out

Run this notebook to see how our model works on the dataset.

Model Emporium

We provide many of the models we trained here.

Results

Final Ensemble

Best Submission Confusion Matrix

Ablation Study

Ablation Confusion Matrices

Table - Individual Modalities

Results as reported based on the EmotiW 2020 validation dataset.

Modality Accuracy F1-Score
Scene 0.546 0.541
Pose 0.486 0.489
Audio 0.577 0.577
Face 0.4 0.348
Image Captioning 0.505 0.506

Table - Final Ensemble

Dataset Accuracy
Validation 0.640
Test 0.639

"Saliency" Map

Ablation Confusion Matrices

The Team

Boyang Tom Jin, Leila Abdelrahman, Cong Kevin Chen, Amil Khanzada
CS231n - Stanford University

Citing Our Work

@misc{2020fusical,
  author =       {Boyang Tom Jin and Leila Abdelrahman and Cong Kevin Chen and
                  Amil Khanzada},
  title =        {Fusical},
  howpublished = {\url{https://github.com/kevincong95/cs231n-emotiw}},
  year =         {2020}
}

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 5