This is the code of our solution for the ACM MM2021 Robust Logo Detection Grand Challenge. The competition is hold on TIANCHI platform and requires the detection of logos (515 categories) in e-commerce images. It is challenged by long-tail distribution, small objects, and different types of noises. To overcome these challenges, we built a highly optimized and robust detector. More detailed can be found in our paper. We ranked 5/36489 in the competition.
Our code is based on mmdetection code base. Here is a document for installation.
The dataset should be organized as follows for training and testing:
- data
- tianchi
- openbrand_train.json
- train
- xxx.jpg
...
- testA_imgList.json
- val
- xxx.jpg
...
- testB_imgList.json
- testB
- xxx.jpg
...
Our configure files for the preliminary and final round are shown below.
Round | Config | AP_test |
---|---|---|
preliminary | preliminary_round.py | 64.64* |
final | final_round.py | 61.26 |
*: trained 20 epochs due to time constraints.
Some data augmentations used in the final round model are quite slow, and we adopt an offline implementation. They are calculated by the following scripts in advance and randomly replace the original images during training.
*: In order not to violate the rules, we adopt an optimization-based implementation. The style images are sampled from the training set.
If you find this project useful in your research, please consider citing:
@inproceedings{chen2021robust,
title={Robust Logo Detection in E-Commerce Images by Data Augmentation},
author={Chen, Hang and Li, Xiao and Wang, Zefan and Hu, Xiaolin},
booktitle={Proceedings of the 29th ACM International Conference on Multimedia},
pages={4789--4793},
year={2021}
}