Conversation

ADITYADAS1999 (Contributor) commented Mar 14, 2023

Modify generate() method in GPT2CausalLM to support chatbot #844

The gap we have is about the end_token_id. In a chatbot system like DialoGPT, the user prompt needs to have an end_token appended before generation; otherwise the model will just generate an end_token and stop. One simple fix is to add an argument to the generate() method, e.g., append_end_token=False, and when it is True the prompt will have an end_token appended before calling our sampler.
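For illustration, a minimal sketch of how such a flag could be reflected after tokenization, assuming the prompt is available as a dense [batch, length] integer tensor of token ids and the tokenizer exposes an end_token_id; the helper name and signature here are hypothetical, not the actual KerasNLP internals:

```python
import tensorflow as tf


def append_end_token_to_prompt(token_ids, end_token_id, append_end_token=False):
    """Optionally append an end token to each tokenized prompt.

    `token_ids` is assumed to be a dense [batch, length] integer tensor
    produced by tokenization, before it is handed to the sampler.
    """
    if append_end_token:
        # Build a [batch, 1] column filled with the end token id, matching the
        # dtype of the prompt tensor, and append it along the length axis.
        batch_size = tf.shape(token_ids)[0]
        end_column = tf.fill([batch_size, 1], tf.cast(end_token_id, token_ids.dtype))
        token_ids = tf.concat([token_ids, end_column], axis=-1)
    return token_ids
```

For a DialoGPT-style chatbot built on GPT-2, the appended id would correspond to the <|endoftext|> token, which DialoGPT uses to separate conversation turns.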

cc: @chenmoneygithub

chenmoneygithub (Contributor)

@ADITYADAS1999 Sorry, what does this PR do...? We need to reflect this argument, append_end_token, in the code to add the end_token after tokenization. We are doing some refactoring work in #804, which will affect this PR, so please stay tuned. Thanks!

ADITYADAS1999 (Contributor, Author) commented Mar 15, 2023

Thanks for the info! Actually, I am going to try solving this issue.

chenmoneygithub (Contributor)

@ADITYADAS1999 So to resolve #853, we need to reflect this argument append_end_token in the code; simply adding the token is not enough.

ADITYADAS1999 (Contributor, Author)

Thanks @chenmoneygithub.

Is the work to reflect this argument already in progress by the team?

mattdangerw (Member)

I will go ahead and close this, as there is no implementation here, and this is something that will be taken on by @chenmoneygithub or me, I think!
