Implement KTO into OpenRLHF #201

Dylancer1998 · 2024-01-26T15:18:55Z

Referenced the implementation of HALOs, the KTO algorithm has been integrated into this branch. It supports both balanced (referred to as the vanilla version) and unbalanced (referred to as the non-vanilla version) scenarios for handling positive and negative samples in a batch. The vanilla version ensures that the number of positive and negative samples is consistent within each batch, while the non-vanilla version does not require this consistency.

A lightweight dataset was selected for algorithm validation, where the effects of DPO, vanilla KTO, non-vanilla KTO, and the baseline were compared. The dataset and the results are as follows:

dataset

--dataset Anthropic/hh-rlhf,tasksource/oasst1_pairwise_rlhf_reward,openai/webgpt_comparisons
--dataset_probs 0.72,0.14,0.14

performance

model	Writing	Roleplay	Reasoning	Math	Coding	Extraction	STEM	Humanities	Average
baseline	7.125	7.425	4.05	2.6	2.85	4.475	7.475	8.475	5.559
DPO	7.4	7.39	3.9	3.05	2.475	4.875	7.2	9.075	5.670
KTO_with_vanilla_loss	7.225	7.325	4.025	2.3	3.475	5.525	7.184	9.075	5.715
KTO	7.145	7.273	4.112	2.666	2.790	5.212	8.315	8.479	5.799

* baseline model is "OpenLLMAI/Llama-2-7b-sft-model-ocra-500k"

for more information, see https://pre-commit.ci

hijkzzz · 2024-01-27T00:11:34Z

Thank you for your contribution and we will review it as soon as possible

hijkzzz · 2024-01-27T02:27:40Z

openrlhf/datasets/preference_dataset.py

+        labels = np.array(self.dataset.labels)
+        unique_labels = np.unique(labels)
+        self.label_to_indices = {label: np.where(labels == label)[0] for label in unique_labels}
+        for label in self.label_to_indices:


Seems like we should add a condition here

if self.shuffle: xxxxx

Good suggestion, I'll include it in the next PR submission.

Dylancer1998 and others added 3 commits January 26, 2024 22:10

[feat]: support two difference versions of KTO

71029d5

Merge branch 'OpenLLMAI:main' into support_KTO

0ae5876

[pre-commit.ci] auto fixes from pre-commit.com hooks

5b33923

for more information, see https://pre-commit.ci

hijkzzz requested review from hijkzzz and wuxibin89 January 27, 2024 00:11

hijkzzz reviewed Jan 27, 2024

View reviewed changes

hijkzzz mentioned this pull request Jan 27, 2024

Support KTO #161

Closed

hijkzzz merged commit a581794 into OpenLLMAI:main Jan 27, 2024
1 check passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement KTO into OpenRLHF #201

Implement KTO into OpenRLHF #201

Dylancer1998 commented Jan 26, 2024

hijkzzz commented Jan 27, 2024

hijkzzz Jan 27, 2024

Dylancer1998 Jan 27, 2024

Implement KTO into OpenRLHF #201

Implement KTO into OpenRLHF #201

Conversation

Dylancer1998 commented Jan 26, 2024

hijkzzz commented Jan 27, 2024

hijkzzz Jan 27, 2024

Choose a reason for hiding this comment

Dylancer1998 Jan 27, 2024

Choose a reason for hiding this comment