Skip to content

liyingxuan1012/Manga109Dialog

Repository files navigation

Manga109Dialog: A Large-scale Dialogue Dataset for Comics Speaker Detection

Official repository of Manga109Dialog (ICME 2024) | Paper | Dataset

Prerequisites

Data preprocessing

Check README.md.

Environment setup

Check INSTALL.md for installation instructions.

How to run

# Training
bash comic_sgg.sh

# Test
bash comic_sgg_test.sh

Evaluation

# PredCls / SGCls
python evaluation_and_visualization/eval_original.py

# SGDet
python evaluation_and_visualization/eval_original_sgdet.py

Visualization

The visualization tools for predictions can be found in evaluation_and_visualization/.

  • 1.visualize_PredCls_and_SGCls.ipynb
  • 2.visualize_SGDet.ipynb
  • 3.visualize_SGDet.ipynb
  • 4.visualize_custom_SGDet.ipynb

Demo

Citation

When using annotations of Manga109Dialog, please cite our paper.

@inproceedings{li2024manga109dialog,
  title={Manga109Dialog: A Large-scale Dialogue Dataset for Comics Speaker Detection},
  author={Li, Yingxuan and Aizawa, Kiyoharu and Matsui, Yusuke},
  booktitle={2024 IEEE International Conference on Multimedia and Expo (ICME)},
  year={2024},
  organization={IEEE}
}

About

Official repository of Manga109Dialog (ICME 2024)

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published