Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Question regarding the output #5

Closed
jzhang38 opened this issue Dec 9, 2021 · 3 comments
Closed

Question regarding the output #5

jzhang38 opened this issue Dec 9, 2021 · 3 comments

Comments

@jzhang38
Copy link

jzhang38 commented Dec 9, 2021

Hi,

Thanks for your solid work and for sharing the code!

May I ask why do you choose to predict the label index (like if the masked token has three possible values, then you will output the index 0 to 2 instead of outputting the actual word id corresponding to the label ) when you generate the output? Have you tried to predict the actual word instead of the index?

Thank you!

@jzhang38
Copy link
Author

jzhang38 commented Dec 9, 2021

To add on, since the model is classifying the index instead of generating the label, can this method still be considered as prompt-tuning? Because It does not actually use the semantics of all those labels. It is more like a multiclass classification.

@THUCSTHanxu13
Copy link
Member

Our model is based on the probability distribution over actual tokens. The use of index is only for the convenience of code implementation. When we implement the model, we first predict the energy scores of all tokens, and then select the energy scores of those words used for label words to renormalize the probability.

@jzhang38 jzhang38 closed this as completed Dec 9, 2021
@jzhang38
Copy link
Author

jzhang38 commented Dec 9, 2021

Thank you so much!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants