emorecom

ICDAR2021 Competition Multimodal Emotion Recognition on Comics scenes

Repo strucutre

train.py - training module
preprocess.py - module for concatenating image, transcripts, and label for efficient loading
dataset - data folder
download_warmup_dataset.sh - bash script for downloading warmup data
EDA.ipynb - notebook for EDA
emorecom - core folder consisting of model, data, and utilities

Setup and install datasts

This repo assumed that Tensorflow is installed successfully and run smoothly on your system (support Tensorflow >= 2.0.0).
Initialize settings

pip3 install gdown
pip3 install -r requirements.txt

Install datasets (warm-up, full)

bash download_warmup_dataset.sh
bash download_full_datast.sh

Run preprocessing to concat image-paths, labels, and transcripts into a single TFRecord file for efficient loading

# for training dataset
python3 preprocess.py --test-size 0.2 --training --image warm-up-train/train \
--transcript warm-up-train/train_transcriptions.json \
--lable warm-up-train/train_emotion_labels.csv \
--output train.tfrecords --val-output val.tfrecords

# for testing dataset
python3 preprocess.py --image warm-up-test/test \
--transcript warm-up-test/test_transcriptions.json \
--output test.tfrecords

Install Glove Word-Embeddings

bash download_twitter_glove_we.sh

Training

# remember to preprocess training and validation data as above

# check train.sh for additional arguments
bash train.sh

Inference

# remember to preprocess inference data as above

# make predictions
bash predict.py
# or (assume that all trained models ared saved in /saved_models folder
python3 train.py --experiment-name model_1_resnet_lstm_early_fusion

Dataset details

Warm-up dataset:

Warm-up data is provided with 800 training images (with transcriptions and labels) and 100 test images (with transcriptions)

Full dataset: Full dataset is provied with 8000 training images (with transcriptsion and labels) and 2000 examples (with transcriptions).

Data format

Labels: 8 emotion classes including: 0=Angry, 1=Disgust, 2=Fear, 3=Happy, 4=Sad, 5=Surprise, 6=Neutral, 7=Others.
Each instance includes 10 fields as follows:
- id: id of the image in the corresponding set (train or test)
- image_id: image_id associated with the image name
- emotion0_score: a manually annotated score for emotion0.
- emotion1_score: a manually annotated score for emotion1.
- emotion2_score: a manually annotated score for emotion2.
- emotion3_score: a manually annotated score for emotion3.
- emotion4_score: a manually annotated score for emotion4.
- emotion5_score: a manually annotated score for emotion5. - emotion6_score: a manually annotated score for emotion6.
- emotion7_score: a manually annotated score for emotion7.

References

@InProceedings{Iyyer:Manjunatha-Comics2017, Title = {The Amazing Mysteries of the Gutter: Drawing Inferences Between Panels in Comic Book Narratives}, Booktitle = {IEEE Conference on Computer Vision and Pattern Recognition}, Author = {Mohit Iyyer and Varun Manjunatha and Anupam Guha and Yogarshi Vyas and Jordan Boyd-Graber and Hal {Daum'{e} III} and Larry Davis}, Year = {2017}}

Name		Name	Last commit message	Last commit date
Latest commit History 160 Commits
.github/ISSUE_TEMPLATE		.github/ISSUE_TEMPLATE
checkpoints		checkpoints
emorecom		emorecom
logs		logs
saved_models		saved_models
.gitignore		.gitignore
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
EDA.ipynb		EDA.ipynb
LICENSE		LICENSE
README.md		README.md
download_full_dataset.sh		download_full_dataset.sh
download_twitter_glove_we.sh		download_twitter_glove_we.sh
download_warmup_dataset.sh		download_warmup_dataset.sh
predict.py		predict.py
predict.sh		predict.sh
preprocess.py		preprocess.py
requirements.txt		requirements.txt
train.py		train.py
train.sh		train.sh

License

aisutd/emorecom

Folders and files

Latest commit

History

Repository files navigation

emorecom

Repo strucutre

Setup and install datasts

Dataset details

Data format

References

Links:

About

Topics

Resources

License

Code of conduct

Stars

Watchers

Forks

Languages