
Some questions about constrained decoding #20

Closed
rela0426 opened this issue Jun 26, 2022 · 4 comments

Comments

@rela0426

rela0426 commented Jun 26, 2022

Hello, Mr. Lu. In the constrained decoding algorithm, there is a check that is not clear to me. Could you help explain it?

def check_state(self, tgt_generated):
    # Treat a trailing pad token as the signal that decoding is just starting.
    if tgt_generated[-1] == self.tokenizer.pad_token_id:
        return 'start', -1

Here, tgt_generated[-1] == self.tokenizer.pad_token_id means 'start'. Why? Can we substitute decoder_start_token_id for self.tokenizer.pad_token_id, or just use the value 0?

In my opinion, if tgt_generated[-1] == self.tokenizer.pad_token_id, the last token is a pad token, so generation should be entering the end phase rather than the start phase. Would it therefore be better to detect the start of generation with decoder_start_token_id? Is that right?
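For concreteness, the variant I have in mind would look something like this (just a sketch; I am assuming the decoder start id is available, e.g. from the model config):

def check_state(self, tgt_generated):
    # Hypothetical variant: treat the configured decoder start token,
    # not the pad token, as the marker for the start of generation.
    if tgt_generated[-1] == self.decoder_start_token_id:
        return 'start', -1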

@luyaojie
Owner

luyaojie commented Jun 26, 2022

Hi,

This is because T5 uses pad_token_id as the start token when generating decoder_input_ids, and eos_token as the end token.
For other tokenizers, decoder_start_token_id is the better choice.
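You can confirm this quickly with HuggingFace transformers (illustrative snippet):

from transformers import T5ForConditionalGeneration

model = T5ForConditionalGeneration.from_pretrained("t5-base")
# T5 sets decoder_start_token_id == pad_token_id == 0, so the pad token
# doubles as the decoder start token.
print(model.config.decoder_start_token_id)  # 0
print(model.config.pad_token_id)            # 0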

@rela0426
Author

> Hi,
>
> This is because T5 uses pad_token_id as the start token when generating decoder_input_ids. For other tokenizers, decoder_start_token_id is the better choice.

I use XLMRobertaTokenizer; it takes cls_token as the start token, sep_token as the end token, and pad_token as the padding token. To adapt it to T5Model, I added:

# Map the XLM-R special tokens onto the T5 config
config.eos_token_id = tokenizer.eos_token_id
config.pad_token_id = tokenizer.pad_token_id

Does that mean my check should be tgt_generated[-1] == self.tokenizer.cls_token_id?

At present, the program runs without the constrained decoding algorithm and the results are OK; with the constrained decoding algorithm, the F1 score is 0. What could be the problem?

@luyaojie
Owner

I think there is no need to add config.eos_token_id = tokenizer.eos_token_id.

You can rewrite the constrained decoding based on the XLMRobertaTokenizer since, as you stated, cls_token is the start token and sep_token is the end token.
Modifying the special symbols in the original constrained-decoding code should work.
For example, change type_start/type_end/pad_token_id/eos_token_id according to the generation state and the XLMRobertaTokenizer, as in the sketch below.
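A rough sketch of that substitution (the XLM-R side is my assumption; match the variable names to the actual constrained-decoding code):

# Hypothetical remapping of the special ids used by the decoding state machine.
# T5: pad_token_id starts decoding, eos_token_id ends it.
# XLM-R: cls_token_id starts decoding, sep_token_id ends it.
start_token_id = tokenizer.cls_token_id  # replaces T5's pad_token_id
end_token_id = tokenizer.sep_token_id    # replaces T5's eos_token_id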

For the F=0 problem, it is better to analyze the generated content.
For example, one possibility is that all generations are empty (no event): <extra_id_0> <extra_id_1>.
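One way to inspect them (a sketch, assuming a HuggingFace tokenizer and a batch of generated ids):

# Hypothetical debugging helper: decode the predictions and count how many
# collapse to the empty structure "<extra_id_0> <extra_id_1>".
# (Spacing after decode may vary; normalize before matching if needed.)
def count_empty_generations(tokenizer, generated_ids):
    decoded = tokenizer.batch_decode(generated_ids, skip_special_tokens=False)
    empty = sum("<extra_id_0> <extra_id_1>" in text for text in decoded)
    print(f"{empty} / {len(decoded)} generations are empty")
    return decoded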

@rela0426
Copy link
Author

> I think there is no need to add config.eos_token_id = tokenizer.eos_token_id.
>
> You can rewrite the constrained decoding based on the XLMRobertaTokenizer since, as you stated, cls_token is the start token and sep_token is the end token. Modifying the special symbols in the original constrained-decoding code should work. For example, change type_start/type_end/pad_token_id/eos_token_id according to the generation state and the XLMRobertaTokenizer.
>
> For the F=0 problem, it is better to analyze the generated content. For example, one possibility is that all generations are empty (no event): <extra_id_0> <extra_id_1>.

Thank you for your analysis. I think I have some ideas now!
Wish you every success in your work!
