Reproducing Camera-Ready Improved Numbers #72
Comments
We have fixed one issue in this commit, which improves performance on Kinetics-400 by about 0.5%.
The results on Kinetics-400 can be reproduced successfully by MMAction2.
Hm, with the latest code I am getting 70.3% on SSv2 and 81.3% on K400 with the 2400- and 1600-epoch off-the-shelf pretrained weights provided in this repo. I am using this script and this script, except that I changed the batch size to meet my memory constraints. I am using 64 GPUs.
Hi @dfan! I think
The NeurIPS camera-ready version (v3 on arXiv) reports significantly higher results than the previous version of the paper (v2 on arXiv). For example, for ViT-B pretrained on K400 for 1600 epochs, performance on K400 jumps from 80.9% to 81.5%. For ViT-B pretrained on SSv2 for 2400 epochs, performance on SSv2 jumps from 70.6% to 70.8%. Could the authors share the updated finetuning code and configs? I am unable to reproduce the new results; my results are close to what is reported in v2 of the paper.