Add SFT params random_offset_probability, label_masking #2005

andreaskoepf · 2023-03-07T12:39:35Z

Added SFT training parameters:

- `random_offset_probability` (float, default: 0.5): probability of starting at a random offset into a conversation when the conversation is longer than `max_length`
- `label_masking` (bool, default: true): if true, the loss is calculated only for tokens of assistant replies; otherwise it is calculated for all tokens (including prompter tokens)
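
To illustrate what the two parameters control, here is a minimal sketch in plain Python. It is not the code from this PR: the helper names `pick_start` and `build_labels`, the fallback to offset 0 when no random offset is drawn, and the use of `-100` as the ignored label value (the default `ignore_index` of PyTorch's cross-entropy loss) are all assumptions made for the example.

```python
import random

# Assumption: -100 is used as the "ignore" label, matching the default
# ignore_index of PyTorch's CrossEntropyLoss.
IGNORE_INDEX = -100


def pick_start(num_tokens, max_length, random_offset_probability, rng=random):
    """Hypothetical helper: choose where a window of max_length tokens starts."""
    if num_tokens <= max_length:
        # The conversation fits, so no offset is needed.
        return 0
    if rng.random() < random_offset_probability:
        # Random offset such that the window still ends inside the conversation.
        return rng.randint(0, num_tokens - max_length)
    # Assumed fallback: keep the beginning of the conversation.
    return 0


def build_labels(token_ids, is_assistant, label_masking=True):
    """Hypothetical helper: copy token ids to labels, masking non-assistant tokens."""
    if not label_masking:
        # Loss is computed on all tokens, including the prompter's.
        return list(token_ids)
    return [t if a else IGNORE_INDEX for t, a in zip(token_ids, is_assistant)]
```

With `label_masking=True`, `build_labels([11, 12, 13], [False, True, True])` keeps only the two assistant tokens as training targets and replaces the prompter token with the ignore value.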

sanagno · 2023-03-07T14:23:31Z

model/model_training/custom_datasets/dialogue_collator.py

@@ -54,13 +64,20 @@ def __call__(self, features):
                    list(map(lambda x: x[1], flatten_message["offset_mapping"])),


This should not be required now.

Could you make a suggested change (ctrl+g)?

sanagno · 2023-03-07T14:24:27Z

model/model_training/custom_datasets/dialogue_collator.py

+
+            # append eos token to each message
+            assert self.tokenizer.eos_token
+            messages = [m + self.tokenizer.eos_token for m in messages]


Did we converge on this? Using the same token for the end of a prompt or reply as for EOS? Not totally against it if everyone agrees.

EOS at the end of each message simplifies decoding. TRLX also relies on EOS tokens (otherwise it would either have to be patched, or EOS could be set to `<human>`). The format is a compromise between framing format v3 and the old `<bot>` `<user>` format v2.
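
A rough sketch of why a per-message EOS simplifies decoding, under stated assumptions: `join_messages` and `first_reply` are hypothetical helpers, the `"</s>"` token in the usage note is only an example value, and real code would work on token ids from the tokenizer rather than on strings.

```python
def join_messages(messages, eos_token):
    """Hypothetical helper: append the EOS token to every message and concatenate."""
    if not eos_token:
        # Mirrors the assert in the diff: a tokenizer without EOS cannot be used.
        raise ValueError("tokenizer has no eos_token")
    return "".join(m + eos_token for m in messages)


def first_reply(generated, eos_token):
    """Hypothetical helper: cut generated text at the first EOS token."""
    # Because every message ends with EOS, the first EOS marks the end of the reply.
    return generated.split(eos_token, 1)[0]
```

For example, `join_messages(["hi", "hello"], "</s>")` yields `"hi</s>hello</s>"`, and `first_reply("hello</s>next", "</s>")` returns `"hello"`, so generation can simply stop at the first EOS instead of searching for a separate end-of-turn marker.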

Add SFT params random_offset_probability, label_masking

c45e65c

andreaskoepf requested review from theblackcat102 and sanagno as code owners March 7, 2023 12:39

Merge branch 'main' into sft_random_message_offsets

dcdfaea

sanagno reviewed Mar 7, 2023

sanagno approved these changes Mar 7, 2023

sanagno merged commit 9d64bb8 into main Mar 7, 2023
1 check passed

sanagno deleted the sft_random_message_offsets branch March 7, 2023 15:29
