Skip to content
This repository has been archived by the owner on Mar 15, 2024. It is now read-only.

tiny model accuracy #32

Closed
pawopawo opened this issue Jan 10, 2021 · 2 comments
Closed

tiny model accuracy #32

pawopawo opened this issue Jan 10, 2021 · 2 comments
Labels
question Further information is requested

Comments

@pawopawo
Copy link

Accuracy of the network on the 50000 test images: 71.9%
Max accuracy: 71.95%
Training time 1 day, 15:01:41

hi, the accuracy of the tiny model I trained is 71.95, which cannot reach 72.2

@Yuxin-CV
Copy link

For me, the result of DeiT-Ti is:

Accuracy of the network on the 50000 test images: 72.7%
Max accuracy: 72.78%
Training time 1 day, 11:20:59

which is around 0.5% higher than the result reported in the paper.

@fmassa
Copy link
Contributor

fmassa commented Jan 11, 2021

Hi,

There is some small variation that can appear when running on different PyTorch / CUDA versions.
The code we released is a refactored version of the code used for the experiments of the paper, which should give the same results within noise (~0.2).

As @Yuxin-CV was able to reproduce (and even surpass) our reported results using our codebase, I'm closing this issue as normal noise from training, but let us know if you have further questions.

@fmassa fmassa closed this as completed Jan 11, 2021
@fmassa fmassa added the question Further information is requested label Jan 11, 2021
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
question Further information is requested
Projects
None yet
Development

No branches or pull requests

3 participants