This is the repository for the YoutubeFish-35 dataset, introduced in the paper "Temporal Transformers for Indiscernible Object Counting from Underwater Videos" [arxiv].
Please download the dataset through this link: [GoogleDrive].
The original dataset was collected from YouTube videos released under the Creative Commons license. We are incredibly grateful to the original authors for their contribution; they are truly awesome. If you do not want your footage to be included in the dataset, please let us know. We will respect your request and remove the relevant content as soon as possible.
If you find our work or dataset useful for your research, please consider citing:
@inproceedings{yang2024density,
title={A Density-Guided Temporal Attention Transformer for Indiscernible Object Counting in Underwater Videos},
author={Yang, Cheng-Yen and Huang, Hsiang-Wei and Jiang, Zhongyu and Wang, Hao and Wallace, Farron and Hwang, Jenq-Neng},
booktitle={ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)},
pages={5075--5079},
year={2024},
organization={IEEE}
}