
Assigning parameter to tensor doesn't pass gradient info #77565

Closed
InfProbSciX opened this issue May 16, 2022 · 4 comments
Labels
module: autograd (Related to torch.autograd, and the autograd engine in general)
triaged (This issue has been looked at by a team member, and triaged and prioritized into an appropriate module)

Comments


InfProbSciX commented May 16, 2022

Issue description

In the example below, assigning a parameter to a tensor doesn't seem to pass gradient information, unless a list is used for indexing.

Code example

import torch
torch.__version__  # '1.11.0'
z = torch.zeros(3, 2)

z[:, 0] = torch.nn.Parameter(torch.tensor(1.0))
z

# tensor([[1., 0.],
#         [1., 0.],
#         [1., 0.]])  # <------------

z[:, [0]] = torch.nn.Parameter(torch.tensor(1.0))
z

# tensor([[1., 0.],
#         [1., 0.],
#         [1., 0.]], grad_fn=<CopySlices>)  # <------------

... is this behaviour expected?

cc @ezyang @albanD @zou3519 @gqchen @pearu @nikitaved @soulitzer @lezcano @Varal7

@mikaylagawarecki added the module: autograd and triaged labels May 17, 2022

ezyang commented May 17, 2022

No; non-advanced indexing should propagate gradients, so this looks like a bug.
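As a minimal sketch of the expected behaviour (assuming a PyTorch build where this bug is fixed, i.e. newer than 1.11): basic, non-advanced slice assignment from a gradient-requiring scalar tensor should record autograd history, and gradients should flow back to the source.

```python
import torch

# Assumes a PyTorch version that includes the fix for this issue.
p = torch.tensor(1.0, requires_grad=True)
z = torch.zeros(3, 2)

z[:, 0] = p                   # basic (non-advanced) indexing, 0-d source
assert z.grad_fn is not None  # autograd history recorded (CopySlices)

z.sum().backward()
print(p.grad.item())          # 3 copies of p in z, each contributes 1
```

On the affected 1.11 release the first assertion fails, which is exactly the bug reported above.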


albanD commented May 17, 2022

Note that it doesn't require using Parameter; the same thing happens with plain Tensors that require gradients.
Also, this only happens with a scalar (0-d) Tensor, so I guess there is some conversion to a plain Python number in the advanced indexing code?

Note that as a workaround, you can use torch.nn.Parameter(torch.tensor((1.0,))), which is a 1D Tensor with a single value.
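A short sketch of the workaround on affected versions (e.g. 1.11): wrapping the value in a length-1 1-D tensor instead of a 0-d scalar keeps the autograd graph through the indexed assignment.

```python
import torch

# Workaround: a 1-D tensor with a single element, not a 0-d scalar.
p = torch.nn.Parameter(torch.tensor((1.0,)))  # shape (1,)
z = torch.zeros(3, 2)

z[:, 0] = p            # broadcasts the length-1 value over the column
assert z.grad_fn is not None

z.sum().backward()
print(p.grad)          # gradient reaches the parameter
```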

@yuguo68 yuguo68 self-assigned this May 25, 2022

yuguo68 commented May 25, 2022

Thank you @InfProbSciX for reporting the bug and the reproducing code. It seems the root cause is here:

    } else if (src.sizes().size() == 0 && src.device().type() == at::kCPU) {
      dst.fill_(src.item());
      return;

Indeed, it is "some conversion to plain python Number" as suggested by @albanD: src.item() converts the scalar tensor to a plain Python number, which drops the gradient information. Also, if the scalar tensor is on GPU, we have the gradient info. I will have a PR to fix.
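The mechanism can be illustrated from Python (a sketch of the idea, not the exact C++ path): filling with a plain Python number, which is what src.item() produces, cannot carry autograd history, while filling with the 0-d tensor itself can.

```python
import torch

p = torch.tensor(2.0, requires_grad=True)

z = torch.zeros(3)
z.fill_(p.item())             # plain float: the graph is broken
assert z.grad_fn is None

z2 = torch.zeros(3)
z2.fill_(p)                   # 0-d tensor: the graph is kept
assert z2.grad_fn is not None

z2.sum().backward()
print(p.grad.item())          # 3 filled elements, each with d/dp = 1
```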

@albanD
Copy link
Collaborator

albanD commented Jun 1, 2022

Fixed in master now!
