Open
Hello,
Thank you for your work. I was reading the training hyperparameters, and I noticed that the learning rate for DINOv2 is greater than the learning rate for the decoder. In previous DETR models, the backbone usually has a smaller learning rate than the other parameters. Why did you choose a learning rate of 0.2 for DINOv2?
Lines 55 to 57 in ed54776
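For reference, the convention I have in mind looks roughly like the sketch below. The original DETR defaults train the transformer at 1e-4 and the backbone at 1e-5. The helper name `build_param_groups`, the parameter names, and the `"backbone"` substring match are my own assumptions for illustration, not taken from this repository.

```python
def build_param_groups(named_params, base_lr=1e-4, backbone_lr=1e-5,
                       backbone_key="backbone"):
    """Split (name, param) pairs into two optimizer groups with separate learning rates.

    Assumption: backbone parameters can be recognized by a substring in their name.
    """
    backbone, rest = [], []
    for name, param in named_params:
        # Route each parameter to the backbone group or to everything else.
        (backbone if backbone_key in name else rest).append(param)
    return [
        {"params": rest, "lr": base_lr},          # decoder, heads, etc.
        {"params": backbone, "lr": backbone_lr},  # pretrained backbone, smaller lr
    ]


# Hypothetical parameter names; the strings stand in for real tensors.
named_params = [
    ("backbone.encoder.layer0.weight", "w0"),
    ("decoder.layers.0.self_attn.weight", "w1"),
    ("class_head.weight", "w2"),
]
groups = build_param_groups(named_params)
```

With PyTorch, I would expect to pass `model.named_parameters()` to such a helper and hand the resulting list to an optimizer such as `torch.optim.AdamW`, which accepts per-group `lr` values.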