The dropout of the GPT model in the GPTConfig class is set to 0.0. This means there won't be any dropout during training, correct?
Yes, by default.
Much recent research on LLMs has shown that it's fine to skip dropout during pretraining, but you normally want dropout during fine-tuning to avoid overfitting.
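For example, you could raise the dropout value only for the fine-tuning run. Below is a minimal sketch, assuming a `GPTConfig` dataclass with fields like nanoGPT's (the stand-in class here is illustrative, not the repo's actual code):

```python
from dataclasses import dataclass

import torch
import torch.nn as nn

# Illustrative stand-in for the GPTConfig dataclass discussed above;
# the field names mirror nanoGPT's, but this is a sketch, not the repo code.
@dataclass
class GPTConfig:
    n_embd: int = 768
    dropout: float = 0.0  # p=0.0 makes nn.Dropout a no-op, even in train mode

pretrain_cfg = GPTConfig()             # dropout=0.0: no regularization while pretraining
finetune_cfg = GPTConfig(dropout=0.1)  # enable dropout when fine-tuning

drop = nn.Dropout(finetune_cfg.dropout)
drop.train()  # dropout is only active in training mode
x = torch.randn(2, finetune_cfg.n_embd)
y = drop(x)   # ~10% of activations zeroed, the rest scaled by 1/(1 - p)
```

With `p=0.0`, `nn.Dropout` passes activations through unchanged regardless of train/eval mode, which is why the default config applies no dropout during training.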
Yes, absolutely correct: a dropout of 0.0 means no dropout is applied during training.