
why keep "###" before instruct text? #44

Closed
Cescfangs opened this issue Jun 27, 2023 · 6 comments

@Cescfangs
```python
for tokenized_len, speaker in zip(tokenized_lens, speakers):
    if speaker == "human":
        target[cur_idx + 2 : cur_idx + tokenized_len] = IGNORE_INDEX
```

I was reading _mask_targets(). I guess this function uses the mask to ignore the loss on the instruction text, but why do you deliberately keep [cur_idx : cur_idx + 2], which is the "###" before the actual instruction text?

@hangzhang-nlp (Collaborator)

The assistant will learn to generate "###" when it wants to end the current round, so "###" can be understood as the EOS flag of each round.

@Cescfangs (Author)

> The assistant will learn to generate "###" if it wants to end the current round. So "###" can be understood as the EOS flag of each session.

Thanks for the reply. I understand that "###" works like an EOS here, and I agree the "###" in the assistant text acts like an EOS. However, why do we want the assistant to learn to emit an EOS before the instruction text?

@hangzhang-nlp (Collaborator)

During inference, we stop generating tokens once the assistant outputs "###".
Suppose the training data is " ###Human: Hi. ###Assistant: Hi, can I help you. ###Human: yes".
The content on which the loss is calculated is "Hi, can I help you. ###", so the model learns to generate "###" as the end flag of its reply.
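The supervised span described above can be illustrated with a toy character-level helper (not the repository's code; the tag strings and function name are made up for this example). It finds each assistant reply and extends it through the closing "###", which is exactly the region that carries loss:

```python
def loss_span_chars(conv, assistant_tag="###Assistant:", sep="###"):
    """Return (start, end) character spans of `conv` that receive loss:
    each assistant reply plus its terminating '###' separator."""
    spans = []
    idx = 0
    while True:
        a = conv.find(assistant_tag, idx)
        if a == -1:
            break
        start = a + len(assistant_tag)   # reply begins right after the tag
        nxt = conv.find(sep, start)      # the next "###" ends the reply
        end = (nxt + len(sep)) if nxt != -1 else len(conv)
        spans.append((start, end))       # include the closing "###"
        idx = end
    return spans


conv = " ###Human: Hi. ###Assistant: Hi, can I help you. ###Human: yes"
spans = loss_span_chars(conv)
print([conv[s:e] for s, e in spans])  # -> [' Hi, can I help you. ###']
```

Everything outside these spans (the human turns and their prompts) would be masked with `IGNORE_INDEX` during training.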

@Cescfangs (Author)

> During inference, we will stop generating tokens once the assistant outputs "###". Suppose that the training data is " ###Human: Hi. ###Assistant: Hi, can I help you. ###Human: yes". The content which needs to calculate loss is "Hi, can I help you. ###". So the model can learn to generate "###" as the end flag of his reply.

I mean that you also keep the first "###" (the one before "Human: Hi...") unmasked.

@hangzhang-nlp (Collaborator)

Oh, I see. That is just for convenience; you could add a check to also mask the first "###".

@Cescfangs (Author)

Okay, thanks for the confirmation.
