
Reproducing loss #3

Closed
Luvata opened this issue Oct 10, 2021 · 2 comments

Luvata commented Oct 10, 2021

Hi @rmokady, what a clever approach!

I'm trying this approach on my custom dataset and managed to get it to start training. I'm figuring out a way to add evaluation code to better manage the training, but in the meantime, I wonder what your loss was when you stopped training the model in each mode: training only the prefix, and training both the prefix and GPT?
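
For reference, by "evaluation code" I mean a held-out loss computed the same way as the training loss, so the two numbers stay comparable. A minimal sketch, assuming a forward signature like `model(tokens, prefix, mask)` as in the training loop and that 0 is the pad token id; the batch unpacking would need to be adapted to the dataset:

```python
import torch
import torch.nn.functional as F

@torch.no_grad()
def evaluate(model, val_loader, prefix_length, device):
    """Average cross-entropy on a held-out split, mirroring the training loss."""
    model.eval()
    total_loss = 0.0
    for tokens, mask, prefix in val_loader:  # adapt unpacking to your dataset
        tokens, mask, prefix = tokens.to(device), mask.to(device), prefix.to(device)
        outputs = model(tokens, prefix, mask)
        # Drop the prefix positions so each logit lines up with the caption
        # token it predicts (logit at position i predicts token i + 1).
        logits = outputs.logits[:, prefix_length - 1:-1]
        loss = F.cross_entropy(logits.reshape(-1, logits.shape[-1]),
                               tokens.flatten(), ignore_index=0)  # 0 = pad id here
        total_loss += loss.item()
    model.train()
    return total_loss / len(val_loader)
```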


rmokady commented Oct 10, 2021

Hi @Luvata, for COCO, where we train both the prefix and GPT-2, the loss got down to 1.47.
Hope this is helpful.


Luvata commented Oct 11, 2021

Thank you @rmokady, that was very helpful.

Luvata closed this as completed Oct 11, 2021