Thanks for your solid work and for sharing the code!
May I ask why you chose to predict the label index (e.g., if the masked token has three possible values, you output an index from 0 to 2 instead of the actual word ID corresponding to the label) when generating the output? Have you tried predicting the actual word instead of the index?
Thank you!
To add on: since the model classifies the index instead of generating the label, can this method still be considered prompt-tuning? It does not actually use the semantics of the label words; it looks more like plain multiclass classification.
Our model is based on the probability distribution over actual tokens; the index is used only for convenience of implementation. In the code, we first predict the energy scores of all tokens in the vocabulary, then select the scores of the tokens used as label words and renormalize them into a probability distribution.
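A minimal sketch of that renormalization step, assuming hypothetical token IDs and a random tensor standing in for the model's output at the [MASK] position:

```python
import torch
import torch.nn.functional as F

# Hypothetical setup: the model produces a score for every token in the
# vocabulary at the [MASK] position.
vocab_size = 30522                       # e.g. BERT's vocabulary size
mask_logits = torch.randn(vocab_size)    # stand-in for the model's output

# Suppose the three labels verbalize to these (hypothetical) token IDs.
label_word_ids = torch.tensor([2204, 2919, 8699])

# Select the scores of the label words and renormalize with softmax,
# giving a probability distribution over the label words only.
label_logits = mask_logits[label_word_ids]
label_probs = F.softmax(label_logits, dim=-1)

# The predicted "index" (0..2) is just the argmax over the label words;
# it maps directly back to an actual token ID.
pred_index = label_probs.argmax().item()
pred_token_id = label_word_ids[pred_index].item()
```

So the index the code outputs is a position into `label_word_ids`, not a replacement for the token itself: the distribution is still defined over real vocabulary tokens.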