You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Is it possible to use the intermediate layer outputs and generate text ignoring the layers on top?
Basically, I want to check quality of generations as we keep on adding more layers. What modifications in the src/sample.py script would I have to make for the same? Thanks.
The text was updated successfully, but these errors were encountered:
I did experiment once with changing amount of heads and layers. For 345M the optimal seems to be the model set amount. I think it was a multiply of 2 but I'm not sure anymore as I don't remember. Just overwrite the data read from model. Hint: search for hparams.n_ctx in the code and check model configuration file.
In my opinion, however, it's better to just play with temperature or logits (diversity).
Is it possible to use the intermediate layer outputs and generate text ignoring the layers on top?
Basically, I want to check quality of generations as we keep on adding more layers. What modifications in the
src/sample.py
script would I have to make for the same? Thanks.The text was updated successfully, but these errors were encountered: