Intermediate layer output #148

Open
bakszero opened this issue Jun 19, 2019 · 1 comment

Comments

bakszero commented Jun 19, 2019

Is it possible to take the output of an intermediate layer and generate text from it, ignoring the layers above?
Basically, I want to check how the quality of the generations changes as more layers are added. What changes would I need to make to the src/sample.py script to do this? Thanks.

@MrKrzYch00

I once experimented with changing the number of heads and layers. For 345M, the values set in the model seem to be optimal. I think the value had to be a multiple of 2, but I no longer remember for sure. Just overwrite the values read from the model. Hint: search for hparams.n_ctx in the code and check the model configuration file.
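
To make the "overwrite the values read from the model" idea concrete, here is a minimal sketch. It assumes the model configuration is loaded as a dictionary that has an `n_layer` entry next to `n_ctx`; the helper name `truncate_hparams` and the example values are illustrative assumptions, not taken from the repository. Whether the model code then builds only the first `n_layer` blocks, and whether the truncated model produces sensible text without retraining, depends on the implementation.

```python
import json


def truncate_hparams(hparams, n_layer):
    """Return a copy of the hparams dict with a smaller layer count.

    Hypothetical helper: it only edits the configuration and does not
    touch any model weights.
    """
    full = hparams["n_layer"]
    if not 1 <= n_layer <= full:
        raise ValueError("n_layer must be between 1 and %d" % full)
    shallow = dict(hparams)  # copy, so the original config stays intact
    shallow["n_layer"] = n_layer
    return shallow


# Illustrative values only; in practice, read them from the model's
# configuration file instead of hard-coding them.
example = json.loads('{"n_ctx": 1024, "n_embd": 1024, "n_head": 16, "n_layer": 24}')
shallow = truncate_hparams(example, 12)
```

The returned dictionary would then be passed wherever the sampling script applies the configuration it read from disk, so that generation can be compared across different values of `n_layer`.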

In my opinion, however, it's better to just play with the temperature or the logits (diversity).
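
For reference, temperature is commonly applied by dividing the logits by a positive constant before the softmax: values below 1 sharpen the distribution, values above 1 flatten it and give more diverse samples. The pure-Python sketch below shows that general technique; the function name `softmax_with_temperature` is illustrative and is not claimed to match what src/sample.py does internally.

```python
import math


def softmax_with_temperature(logits, temperature=1.0):
    """Convert logits to probabilities after scaling by 1/temperature."""
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


logits = [2.0, 1.0, 0.0]
sharp = softmax_with_temperature(logits, temperature=0.5)  # more peaked
flat = softmax_with_temperature(logits, temperature=2.0)   # more uniform
```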
