enable multiple eval datasets by peter-sk · Pull Request #1052 · huggingface/trl

peter-sk · 2023-12-02T08:25:47Z

The standard Trainer class from the transformers library (and the documentation of SFTTrainer) allow for multiple validation datasets to be passed as a dictionary from dataset name to dataset.

This does not work however in current SFTTrainer code. This PR fixes this.

peter-sk · 2023-12-03T08:10:31Z

@younesbelkada
Does this make sense?

pop-srw · 2023-12-04T08:39:46Z

I have the same problem. When multiple eval datasets were pass as dict, It cause an error in dataset.map

trl/trl/trainer/sft_trainer.py

Line 379 in a60ceef

tokenized_dataset = dataset.map(

I think this PR is really useful.

peter-sk · 2023-12-04T09:39:09Z

I have the same problem. When multiple eval datasets were pass as dict, It cause an error in dataset.map

trl/trl/trainer/sft_trainer.py

Line 379 in a60ceef

tokenized_dataset = dataset.map(

I think this PR is really useful.

Yes, exactly this. And this error occurs naturally both when using packing and when not.

lvwerra

Looks good to me but I'll let @younesbelkada have a look as well!

lvwerra · 2023-12-04T18:35:28Z

If you could add a test for this that would be awesome!

HuggingFaceDocBuilderDev · 2023-12-04T18:39:20Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint.

peter-sk · 2023-12-04T19:07:37Z

@lvwerra
Sure, I can add a test. Are there any guidelines for this? I have tried it for transformers, but not for trl. Any hints are appreciated!

…r-sk/trl into feat/multiple-eval-datasets

peter-sk · 2023-12-05T06:10:58Z

@lvwerra
I added a test.

peter-sk · 2023-12-06T11:11:16Z

@lvwerra @younesbelkada
Just a careful push :-)

younesbelkada

Looks great to me, thanks for this contribution!

younesbelkada · 2023-12-06T17:41:01Z

@peter-sk the tests are taking forever, I think that something went wrong with the test you designed, can you please have a quick look?

peter-sk · 2023-12-06T19:14:33Z

@younesbelkada
I guess the training dataset was getting iterated (for hundreds of thousands of iterations).
Now, I am using twice the evaluation dataset. This test works fine in my machine.
I also downsized the test.

younesbelkada

Thanks again!

* enable multiple eval datasets * added test * try to avoid infinite computation * make sure eval set is not infinite * downsizing the test

enable multiple eval datasets

38759eb

lvwerra approved these changes Dec 4, 2023

View reviewed changes

peter-sk added 3 commits December 4, 2023 20:17

Merge branch 'main' into feat/multiple-eval-datasets

9b39833

added test

adced52

Merge branch 'feat/multiple-eval-datasets' of https://github.com/pete…

6dd5244

…r-sk/trl into feat/multiple-eval-datasets

younesbelkada approved these changes Dec 6, 2023

View reviewed changes

peter-sk added 3 commits December 6, 2023 19:52

try to avoid infinite computation

29ae72f

make sure eval set is not infinite

5da74d8

downsizing the test

1b0a9da

younesbelkada approved these changes Dec 6, 2023

View reviewed changes

younesbelkada merged commit 5a23354 into huggingface:main Dec 6, 2023

lapp0 pushed a commit to lapp0/trl that referenced this pull request May 10, 2024

enable multiple eval datasets (huggingface#1052)

a43912d

* enable multiple eval datasets * added test * try to avoid infinite computation * make sure eval set is not infinite * downsizing the test

Conversation

peter-sk commented Dec 2, 2023

Uh oh!

peter-sk commented Dec 3, 2023

Uh oh!

pop-srw commented Dec 4, 2023

Uh oh!

peter-sk commented Dec 4, 2023

Uh oh!

lvwerra left a comment

Choose a reason for hiding this comment

Uh oh!

lvwerra commented Dec 4, 2023

Uh oh!

HuggingFaceDocBuilderDev commented Dec 4, 2023

Uh oh!

peter-sk commented Dec 4, 2023

Uh oh!

peter-sk commented Dec 5, 2023

Uh oh!

peter-sk commented Dec 6, 2023

Uh oh!

younesbelkada left a comment

Choose a reason for hiding this comment

Uh oh!

younesbelkada commented Dec 6, 2023

Uh oh!

peter-sk commented Dec 6, 2023

Uh oh!

younesbelkada left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants