Add new column for sft fine tuning with `prepare_dataset` #309

plaguss · 2024-01-30T08:46:14Z

Description

This PR adds functionality on the prepare_dataset function to generate a new column to be used for SFT:

from distilabel.utils import prepare_dataset

dataset = load_dataset("argilla/distilabel-intel-orca-dpo-pairs", split="train")
dataset.task = JudgeLMTask()
dataset_binarized_random = prepare_dataset(dataset, strategy="random", keep_ties=True, sft=True). # 'sft' defaults to False

This dataset will include a new column messages based on the content of the chosen response.
Initially it's not well documented as it just offers a simple way to extend the functionality just on the preference datasets using the same function, but this could be more general. Once refactored, we can use a special section for the functionality.

Example of use of this column from the alignment handbook

Closes #276.

Add new column for sft fine tuning

805cfbd

plaguss requested review from gabrielmbmb, dvsrepo and alvarobartt January 30, 2024 08:46

plaguss self-assigned this Jan 31, 2024

plaguss added this to the 0.5.0 milestone Jan 31, 2024

plaguss merged commit 5bd84bc into main Jan 31, 2024
4 checks passed

plaguss deleted the feat/prepare-dataset-sft branch January 31, 2024 11:12

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add new column for sft fine tuning with `prepare_dataset` #309

Add new column for sft fine tuning with `prepare_dataset` #309

plaguss commented Jan 30, 2024 •

edited

Add new column for sft fine tuning with prepare_dataset #309

Add new column for sft fine tuning with prepare_dataset #309

Conversation

plaguss commented Jan 30, 2024 • edited

Description

Add new column for sft fine tuning with `prepare_dataset` #309

Add new column for sft fine tuning with `prepare_dataset` #309

plaguss commented Jan 30, 2024 •

edited