
Allow EOS token for finetuning #1199

Closed

wants to merge 3 commits

Conversation

@jimwu6 commented May 14, 2024

This is needed to allow the finetuning dataset to be constructed correctly.
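For context, a minimal sketch of what "constructed correctly" means here, assuming a Hugging Face tokenizer; the `tokenize_example` helper below is hypothetical and not the repo's actual API:

```python
# Illustrative sketch only: terminate each finetuning target with the
# tokenizer's EOS token so the model learns where responses end.
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained('EleutherAI/gpt-neox-20b')

def tokenize_example(prompt: str, response: str) -> dict:
    # Hypothetical helper for illustration.
    prompt_ids = tokenizer(prompt, add_special_tokens=False)['input_ids']
    response_ids = tokenizer(response, add_special_tokens=False)['input_ids']
    # Append eos_token_id to mark the end of the target sequence.
    response_ids = response_ids + [tokenizer.eos_token_id]
    return {
        'input_ids': prompt_ids + response_ids,
        # Mask the prompt tokens out of the loss with -100.
        'labels': [-100] * len(prompt_ids) + response_ids,
    }
```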

jimwu6 requested review from milocress and dakinggg on May 14, 2024 01:10
@dakinggg (Collaborator)

Where do you see this needed? I'm pretty sure finetuning just uses the eos from the tokenizer.

@milocress (Contributor)

> Where do you see this needed? I'm pretty sure finetuning just uses the eos from the tokenizer.

It looks like it's one of the things **-ed (keyword-unpacked) into the superclass; I think there are some cases where omitting it causes an error, e.g.:

[rank2]: ValueError: sequence_id is a required argument when MPT is configured with attn_uses_sequence_id=True and the model is in train mode.
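For reference, a minimal sketch of the kind of guard that raises this error, simplified from the message above; the function and its signature are illustrative, not MPT's actual code:

```python
from typing import Optional

import torch

def check_sequence_id(sequence_id: Optional[torch.Tensor],
                      attn_uses_sequence_id: bool,
                      training: bool) -> None:
    # When attention masking is derived from sequence_id, training
    # batches must supply it explicitly or the forward pass fails.
    if attn_uses_sequence_id and training and sequence_id is None:
        raise ValueError(
            'sequence_id is a required argument when MPT is configured '
            'with attn_uses_sequence_id=True and the model is in train mode.')
```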

@dakinggg (Collaborator) commented Jun 3, 2024

@milocress that should only be for pretraining style. Finetuning style handles packing and sequence_id on its own, e.g.:

trim_example['sequence_id'] = torch.zeros_like(trim_example['input_ids'])
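As a rough illustration of that point, here is a sketch of how a packing collator can assign its own per-example sequence ids; `pack_with_sequence_ids` is hypothetical and not the repo's actual collator:

```python
import torch

def pack_with_sequence_ids(examples: list) -> dict:
    # Illustrative only: concatenate several tokenized examples into one
    # packed row and tag each token with the index of its source example,
    # so attention can be confined to each sub-sequence.
    input_ids, sequence_id = [], []
    for idx, ex in enumerate(examples):
        input_ids.append(ex['input_ids'])
        sequence_id.append(torch.full_like(ex['input_ids'], idx))
    return {
        'input_ids': torch.cat(input_ids),
        'sequence_id': torch.cat(sequence_id),
    }

# Example: two examples packed into one row; sequence_id = [0, 0, 0, 1, 1].
packed = pack_with_sequence_ids([
    {'input_ids': torch.tensor([5, 6, 7])},
    {'input_ids': torch.tensor([8, 9])},
])
```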

milocress requested a review from a team as a code owner on June 4, 2024 14:02
dakinggg closed this on Jun 6, 2024