
Fix correct gradient accumulation #407

Merged
vwxyzjn merged 5 commits into main from fix-grad-acc-bug on Jun 14, 2023

Conversation

@younesbelkada
Contributor

What does this PR do?

This PR fixes gradient accumulation, which was first introduced in #220.

Fixes #321

cc @lvwerra
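For context, the standard gradient-accumulation pattern scales each mini-batch loss by the number of accumulation steps and runs the optimizer once per accumulation window. The sketch below is illustrative only, with hypothetical toy data; the actual PR changes live in the diff (in TRL, the backward pass was routed through accelerate's `accumulate` context manager rather than handled manually):

```python
import torch

# Illustrative sketch of the manual gradient-accumulation pattern
# (NOT the PR's actual code; toy data and hyperparameters are made up).
torch.manual_seed(0)
model = torch.nn.Linear(1, 1)
optimizer = torch.optim.SGD(model.parameters(), lr=0.1)
criterion = torch.nn.MSELoss()
accumulation_steps = 4

# eight toy mini-batches of two samples each
data = [(torch.randn(2, 1), torch.randn(2, 1)) for _ in range(8)]

steps_taken = 0
for i, (inputs, labels) in enumerate(data):
    # scale so the accumulated gradient matches the full-batch mean gradient
    loss = criterion(model(inputs), labels) / accumulation_steps
    loss.backward()  # gradients accumulate into .grad across iterations
    if (i + 1) % accumulation_steps == 0:
        optimizer.step()       # one optimizer step per accumulation window
        optimizer.zero_grad()  # reset gradients only after stepping
        steps_taken += 1
```

Dividing the loss by `accumulation_steps` is the key detail: without it, the accumulated gradient is the sum, not the mean, over the window.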

@HuggingFaceDocBuilderDev

HuggingFaceDocBuilderDev commented Jun 6, 2023

The documentation is not available anymore as the PR was closed or merged.

Contributor

@vwxyzjn vwxyzjn left a comment


The PR makes sense! Would it be possible to verify that it in fact reproduces the exact same gradient with different numbers of gradient accumulation steps?

Here is a snippet:

import torch
from torch.utils.data import TensorDataset, DataLoader
import copy

# seed
torch.manual_seed(0)

# define toy inputs and labels
x = torch.tensor([1., 2., 3., 4.])
y = torch.tensor([2., 4., 6., 8.])

# define dataset and dataloader
dataset = TensorDataset(x, y)
dataloader = DataLoader(dataset, batch_size=2)

# define model, optimizer and loss function
model = torch.nn.Linear(1, 1)

# clone the model
model_clone = copy.deepcopy(model)
criterion = torch.nn.MSELoss()
accumulation_steps = 2

# loop over batches
for i, (inputs, labels) in enumerate(dataloader):
    # reshape inputs and labels
    inputs = inputs.view(-1, 1)
    labels = labels.view(-1, 1)
    # forward pass
    outputs = model(inputs)
    loss = criterion(outputs, labels) / accumulation_steps
    # backward pass
    loss.backward()
    # check if accumulation is done
    if (i + 1) % accumulation_steps == 0:
        print("w/ accumulation, the final model grad is", model.weight.grad) 
        break

loss = criterion(model_clone(x.view(-1, 1)), y.view(-1, 1))
loss.backward()
print("w/o accumulation, the final model grad is", model_clone.weight.grad)
Output:

w/ accumulation, the final model grad is tensor([[-27.4301]])
w/o accumulation, the final model grad is tensor([[-27.4301]])
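The printed equality can also be checked programmatically. The following is a minimal variant of the snippet above (same setup, same seed) that asserts the accumulated gradient matches the full-batch gradient with `torch.allclose` instead of eyeballing the printout:

```python
import copy
import torch
from torch.utils.data import TensorDataset, DataLoader

torch.manual_seed(0)
x = torch.tensor([1., 2., 3., 4.])
y = torch.tensor([2., 4., 6., 8.])
dataloader = DataLoader(TensorDataset(x, y), batch_size=2)

model = torch.nn.Linear(1, 1)
model_clone = copy.deepcopy(model)
criterion = torch.nn.MSELoss()
accumulation_steps = 2

# accumulate gradients over two mini-batches, scaling each loss
for inputs, labels in dataloader:
    loss = criterion(model(inputs.view(-1, 1)), labels.view(-1, 1)) / accumulation_steps
    loss.backward()

# single full-batch backward pass on the untouched clone
criterion(model_clone(x.view(-1, 1)), y.view(-1, 1)).backward()

assert torch.allclose(model.weight.grad, model_clone.weight.grad)
print("gradients match")
```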

Contributor Author

@younesbelkada younesbelkada left a comment


Yes, definitely! Will add a test for that, thanks a lot for the snippet and the pointer!
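Such a test might look like the following hypothetical sketch (function and variable names are illustrative, not the actual test added in the PR): it checks that the accumulated gradient matches the single full-batch gradient for several accumulation-step counts.

```python
import copy
import torch

def grad_with_accumulation(model, x, y, accumulation_steps):
    """Return the weight gradient after accumulating over equal-sized chunks.
    Hypothetical helper for illustration; deep-copies the model so the
    reference model is left untouched."""
    model = copy.deepcopy(model)
    criterion = torch.nn.MSELoss()
    for xb, yb in zip(x.chunk(accumulation_steps), y.chunk(accumulation_steps)):
        (criterion(model(xb), yb) / accumulation_steps).backward()
    return model.weight.grad.clone()

torch.manual_seed(0)
base = torch.nn.Linear(1, 1)
x = torch.randn(8, 1)
y = torch.randn(8, 1)

# accumulation_steps=1 is the plain full-batch gradient
reference = grad_with_accumulation(base, x, y, 1)
for steps in (2, 4, 8):
    assert torch.allclose(reference, grad_with_accumulation(base, x, y, steps))
print("all accumulation step counts match")
```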

@younesbelkada younesbelkada requested a review from vwxyzjn June 14, 2023 11:15
Contributor

@vwxyzjn vwxyzjn left a comment


LGTM! Thanks so much @younesbelkada

@vwxyzjn vwxyzjn merged commit 61af5f2 into main Jun 14, 2023
@vwxyzjn vwxyzjn deleted the fix-grad-acc-bug branch June 14, 2023 12:43
yxliu-TAMU pushed a commit to mincheolseong/ECEN743-GRPO-Project-Proposal that referenced this pull request Apr 20, 2025
* add correct grad acc

* add some tests but they fail

* test should pass

* style

* fix


Development

Successfully merging this pull request may close these issues.

Why is the backward step in ppo_trainer not handled by accelerate's accumulate?