multi-turn dialog format #17

Closed · LHolten opened this issue Nov 22, 2019 · 7 comments

@LHolten commented Nov 22, 2019

Section 3.1 of the paper states that dialog turns of the same session are concatenated into a long text, ended by the end-of-text token.

Does this mean that there are no special tokens in between dialog turns?

How do I separate dialog turns?

@dreasysnail (Contributor)
There ARE special tokens (<|endoftext|>, id = 50256) between dialogue turns in the multi-turn setup. Your input format should look like this:

Turn1 <|endoftext|> Turn2 <|endoftext|> ... TurnN

Let us know if you have any further concerns.
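For concreteness, here is a minimal sketch (not code from this repo) of how such an input could be assembled with the Hugging Face transformers tokenizer; the checkpoint name and the example turns are placeholders.

# Minimal sketch; "microsoft/DialoGPT-medium" and the example turns are placeholders.
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("microsoft/DialoGPT-medium")

turns = ["Does money buy happiness?",
         "Depends how much money you spend on it.",
         "What is the best way to buy happiness?"]

# Build: Turn1 <|endoftext|> Turn2 <|endoftext|> ... TurnN
input_ids = []
for turn in turns[:-1]:
    input_ids += tokenizer.encode(turn) + [tokenizer.eos_token_id]  # eos id is 50256
input_ids += tokenizer.encode(turns[-1])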

@LHolten (Author) commented Nov 23, 2019

Thanks for the clarification, that makes sense.

@LHolten (Author) commented Nov 23, 2019

What are these token_type_ids for?

token_type_ids += [i] * (len(s) + 1)

It looks like they are different for the different turns in a session and get passed to the model during training.
This would be very strange, because the GPT-2 model from the transformers library looks up token_type_ids in the same embedding table as regular input_ids, so the turn indices would reuse ordinary token embeddings.
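To illustrate the concern, a small sketch (not repo code): in the transformers implementation, GPT-2 embeds token_type_ids with the word-token embedding table (model.wte), so turn indices 0, 1, 2, ... pick up the embeddings of the tokens that happen to have those ids.

import torch
from transformers import GPT2Model

model = GPT2Model.from_pretrained("gpt2")

input_ids = torch.tensor([[50256, 318, 428, 257, 1332]])  # arbitrary token ids
turn_ids = torch.tensor([[0, 0, 1, 1, 1]])                # per-token turn index

with torch.no_grad():
    out_with = model(input_ids, token_type_ids=turn_ids).last_hidden_state
    out_plain = model(input_ids).last_hidden_state

# The outputs differ, because the turn indices are embedded with the same
# matrix (model.wte) that embeds the input tokens.
print(torch.allclose(out_with, out_plain))  # False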

@intersun (Contributor)
Yes, they are different for different turns in the data we prepared. I believe we ran some experiments on this and ended up using the default token type ids (by setting token_type_ids = None) during training.

@LHolten (Author) commented Nov 25, 2019

I see it now:

DialoGPT/LSP_train.py

Lines 279 to 280 in 5688864

if args.no_token_id:
    token_ids = None
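Putting the two quoted snippets together, a rough self-contained sketch (the variable names and example token ids are hypothetical, not taken from the repo):

turns = [[3666, 1110, 373, 1049], [10248, 284, 3285]]  # token ids per turn (made-up values)
no_token_id = True                                     # stands in for args.no_token_id

eos = 50256
input_ids, token_type_ids = [], []
for i, s in enumerate(turns):
    input_ids += s + [eos]                  # turns separated by <|endoftext|>
    token_type_ids += [i] * (len(s) + 1)    # one turn index per token, including the EOS

if no_token_id:
    token_type_ids = None                   # fall back to the library default during training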

@abisee commented Oct 20, 2020

Turn1 <|endoftext|> Turn2 <|endoftext|> ... TurnN

@dreasysnail: should there be spaces around the <|endoftext|> or not?

@abisee commented Oct 20, 2020

@dreasysnail, was DialoGPT trained with any kind of padding (e.g., if the entire dialogue doesn't fill up the max length)? Or did the multi-turn dialogue always fill up the entire max length (as in GPT2 training)?
