
Fine-tuning based on the DETR-architecture code, but the validation metrics are all 0 #69

Closed
Flyooofly opened this issue Nov 18, 2022 · 1 comment

Comments

@Flyooofly commented Nov 18, 2022

Thanks for your work. I noticed that you open-sourced the DETReg variant built on the DETR architecture, so I tried to fine-tune the ImageNet-pretrained model you provide on my custom dataset. However, all the validation metrics are still 0 after more than fifty epochs of fine-tuning. I have followed the tips in the related DETR issues (https://github.com/facebookresearch/detr/issues?page=1&q=zero) and modified num_classes. Many people mention that DETR needs a large amount of training data, or fine-tuning. I am already fine-tuning, and my dataset has only about one thousand images, yet the results are still very poor. May I ask why? Training works normally for me with the Deformable-DETR architecture.
[screenshot attached]
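
For reference, a minimal sketch of the usual head-replacement step when fine-tuning with a different num_classes (the checkpoint path, the `"model"` key, and the `class_embed` prefix follow the DETR/Deformable-DETR code layout and are assumptions, not taken from this thread):

```python
import torch

# Hypothetical values for illustration; adjust to your setup.
CKPT_PATH = "checkpoint.pth"   # ImageNet-pretrained DETReg checkpoint
NUM_CLASSES = 2                # your custom dataset's class count

checkpoint = torch.load(CKPT_PATH, map_location="cpu")
state_dict = checkpoint.get("model", checkpoint)

# Drop the classification head: its shape no longer matches the new
# num_classes, so it must be re-initialized rather than loaded.
filtered = {k: v for k, v in state_dict.items()
            if not k.startswith("class_embed")}

# model = build_model(args)  # built with the new num_classes
# missing, unexpected = model.load_state_dict(filtered, strict=False)
# print("missing:", missing, "unexpected:", unexpected)
```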

@amirbar (Owner) commented Jul 18, 2023

Apologies for the delay in responding. What is the size of your dataset? For reference, ImageNet has about 1M images, so 50 epochs may be enough to train over it. If your dataset is significantly smaller, you would need to increase the number of epochs proportionally. Also, make sure you don't drop the learning rate too early (this is mostly relevant for Deformable-DETR).
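
A back-of-envelope sketch of this scaling argument (the batch size and dataset sizes below are illustrative assumptions, not values from this thread):

```python
# An "epoch" over a small dataset is very few optimizer steps, so epoch
# counts tuned for ImageNet-scale data do not transfer directly.
batch_size = 2

imagenet_images = 1_000_000
custom_images = 1_000          # the ~1k-image dataset from the question

iters_per_epoch_imagenet = imagenet_images // batch_size   # 500,000 steps/epoch
iters_per_epoch_custom = custom_images // batch_size       # 500 steps/epoch

epochs = 50
print(epochs * iters_per_epoch_imagenet)  # 25,000,000 steps
print(epochs * iters_per_epoch_custom)    # 25,000 steps -> ~1000x fewer
# Scaling the number of epochs (and the LR-drop epoch) up roughly in
# proportion to the size difference keeps the schedule comparable.
```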

amirbar closed this as completed Jul 18, 2023