Could the authors share the configs used to produce the AVA v2.2 results in the Masked Autoencoders As Spatiotemporal Learners paper?
Throughout the repo I cannot find any related configs for ViT. The hyperparameters mentioned in the paper (https://arxiv.org/pdf/2205.09113.pdf, Appendix A, Table 6) seem unreasonable to me: with batch size 128, the learning rate is 7.2 for ViT-L with the SGD optimizer.
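For context, unusually large SGD learning rates in video-classification configs often come from the linear scaling rule (lr = base_lr × batch_size / 256). A minimal sketch of that rule, using a hypothetical base_lr (the paper's Table 6 may simply report the already-scaled value):

```python
def scaled_lr(base_lr: float, batch_size: int, base_batch: int = 256) -> float:
    """Effective learning rate under the linear scaling rule
    (lr = base_lr * batch_size / base_batch)."""
    return base_lr * batch_size / base_batch

# Hypothetical example: a base_lr of 14.4 at batch size 128 would give
# the reported lr of 7.2. Whether the paper uses this rule is an assumption.
print(scaled_lr(14.4, 128))  # 7.2
```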
Thanks