Loss stuck at ~6.90 #12

Closed
bobi461993 opened this issue Apr 9, 2020 · 3 comments

Comments

@bobi461993

I am trying to train MoCo V2 on a machine with 2 GPUs using the hyperparameters recommended in this repo. However, the loss gets stuck at a value of about 6.90. Is this behaviour normal, or should I try a different set of hyperparameters? I see that you used a machine with 8 GPUs. Could this explain the difference? Thanks!

@amsword

amsword commented Apr 9, 2020

I had similar loss values, and the fine-tuned performance is 66.9 (vs. 67.5 as shown in the README). So a loss value of 6.9 does not look that bad.

@bobi461993
Author

Thank you for the prompt response!

@KaimingHe
Contributor

I suggest you finish training and check the final result. The loss should keep decreasing if you monitor it for longer. 6.9 is not a special number here, as the random-guess loss is log(65536), not log(1000).
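
For reference, a quick sketch of the arithmetic behind this comment, assuming the default queue size K = 65536 (`--moco-k`) and the standard cross-entropy formulation of the contrastive loss:

```python
import math

# MoCo's contrastive loss is a cross-entropy over 1 positive and K
# queued negatives, so a random-guess model scores roughly log(K + 1).
K = 65536  # default queue size assumed here (--moco-k)

print(f"random-guess loss with K = {K}: {math.log(K + 1):.2f}")  # ~11.09
print(f"log(1000), i.e. 1000-way chance: {math.log(1000):.2f}")  # ~6.91
```

So a loss hovering around 6.9 is already well below the random-guess value of ~11.09; it only coincidentally matches log(1000).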
