About training strategy #36

Vincent-luo · 2023-10-17T12:42:44Z

Hi, I'm now training the reltr model on VG dataset and I find the training time is quite long. It takes ~2.5 days to train for 150 epochs on 4*3090 with batchsize 4. Im not sure whether I'm doing something wrong or it does need much time to train from scratch.

And I want to ask if you have tried other training strategies like multiple stage. For example, in the first stage just train the model for object detection, the in the second stage only train the triplet decoder and freeze the encoder and entity decoder(or updating with a low leaning rate). That sounds more practical and will reduce the training time in theory.

yrcong · 2023-10-20T05:35:07Z

Hi,
We didn't do it since the end-to-end training is our objective. But the multiple-stage training sounds reasonable in practice.

You can even try to load the pretrained DETR.

yrcong closed this as completed Nov 10, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

About training strategy #36

About training strategy #36

Vincent-luo commented Oct 17, 2023

yrcong commented Oct 20, 2023

About training strategy #36

About training strategy #36

Comments

Vincent-luo commented Oct 17, 2023

yrcong commented Oct 20, 2023