Official repository of Manga109Dialog (ICME 2024) | Paper | Dataset
- Manga109 dataset
- Download from http://www.manga109.org/en/download.html
- Manga109Dialog annotation
- Download from https://github.com/manga109/public-annotations
Check INSTALL.md for installation instructions.
Convert the annotations from Manga109 into a format suitable for the scene graph generation (SGG) models. For more details, check README.md.
This is the core part of our model. For details on how to detect characters and texts in comics and predict the speaker based on visual information, check README.md.
In addition to conventional metrics for evaluating SGG models, we have introduced a new metric tailored for comics: Recall@(#text).
# PredCls / SGCls
python eval_and_vis/eval_original.py
# SGDet
python eval_and_vis/eval_original_sgdet.py
You can find details on conventional evaluation metrics in METRICS.md.
The visualization tools for predictions can be found in eval_and_vis/
.
- 1.visualize_PredCls_and_SGCls.ipynb
- 2.visualize_SGDet.ipynb
- 3.visualize_SGDet.ipynb
- 4.visualize_custom_SGDet.ipynb
When using annotations of Manga109Dialog, please cite our paper.
@inproceedings{li2024manga109dialog,
title={Manga109Dialog: A Large-scale Dialogue Dataset for Comics Speaker Detection},
author={Li, Yingxuan and Aizawa, Kiyoharu and Matsui, Yusuke},
booktitle={2024 IEEE International Conference on Multimedia and Expo (ICME)},
year={2024},
organization={IEEE}
}