Skip to content

[SFTTrainer] Add warning for wrong padding_side#550

Merged
younesbelkada merged 4 commits intomainfrom
add-sft-warning
Jul 22, 2023
Merged

[SFTTrainer] Add warning for wrong padding_side#550
younesbelkada merged 4 commits intomainfrom
add-sft-warning

Conversation

@younesbelkada
Copy link
Copy Markdown
Contributor

While debugging why some users got 0 loss when fine-tuning Llama-2 on Guanaco, it seems the issue was caused by padding tokens being appended on the left.

The fix for that model seems to be to force tokenizer.padding_side = "right" to successfully fine-tune Llama-2. Therefore we advise users to do that for their training.

Related: https://gist.github.com/younesbelkada/9f7f75c94bdc1981c8ca5cc937d4a4da?permalink_comment_id=4636728#gistcomment-4636728

cc @lvwerra

@HuggingFaceDocBuilderDev
Copy link
Copy Markdown

HuggingFaceDocBuilderDev commented Jul 21, 2023

The documentation is not available anymore as the PR was closed or merged.

@younesbelkada younesbelkada merged commit 170d58f into main Jul 22, 2023
@younesbelkada younesbelkada deleted the add-sft-warning branch July 22, 2023 08:53
yxliu-TAMU pushed a commit to mincheolseong/ECEN743-GRPO-Project-Proposal that referenced this pull request Apr 20, 2025
* add warning for wrong padding_side

* add warning

* revert

* oops
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants