return value of backward in class Sum #2

Closed
ifoyooo opened this issue Oct 3, 2021 · 2 comments
ifoyooo commented Oct 3, 2021

From my perspective, the shape of 'grad_output' should be broadcast to 'a_shape'.
Although the code below still passes task3_1, I'm not sure it stays correct in more complex situations.

    @staticmethod
    def backward(ctx, grad_output):
        a_shape, dim = ctx.saved_values

        if dim is None:
            out = grad_output.zeros(a_shape)
            out._tensor._storage[:] = grad_output[0]
            return out
        else:
            # START Code Update
            return grad_output  # should be replaced by add_zip(grad_output, zeros(a_shape))
            # END Code Update
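
As a rough illustration of the concern, here is a minimal NumPy sketch (NumPy is a stand-in here, not minitorch's API; 'np.broadcast_to' plays the role the proposed 'add_zip' with a zeros tensor would play). Summing over a dimension shrinks it to size 1, so the incoming gradient is smaller than the input, and the correct input gradient is that gradient broadcast back up to the input's shape:

    import numpy as np

    # Summing over a dimension shrinks it to size 1, so grad_output
    # has a smaller shape than the original input.
    a = np.ones((2, 3))
    out = a.sum(axis=1, keepdims=True)      # shape (2, 1)
    grad_output = np.ones_like(out)         # shape (2, 1), not (2, 3)

    # d(sum)/d(a) is 1 for every element, so the input gradient is
    # grad_output broadcast back up to a's shape.
    grad_a = np.broadcast_to(grad_output, a.shape)
    print(grad_a.shape)                     # (2, 3)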

ifoyooo commented Oct 3, 2021

I just realized that Case 2 in the 'expand' function exists exactly to handle the inconsistency between the original shape and the gradient shape that this return value causes.

    # Case 2: Backward is smaller than self. Broadcast up.
    true_shape = TensorData.shape_broadcast(self.shape, other.shape)
    buf = self.zeros(true_shape)
    self.backend._id_map(other, out=buf)
    if self.shape == true_shape:
        return buf
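
In other words, an identity map into a zero buffer of the broadcast shape performs the broadcast automatically. A rough NumPy sketch of what Case 2 accomplishes (the names here are illustrative stand-ins, not minitorch's backend):

    import numpy as np

    self_shape = (2, 3)                  # shape of the original tensor
    other = np.ones((2, 1))              # the smaller backward gradient

    # Compute the broadcast target shape, then copy `other` into a zero
    # buffer of that shape; the assignment broadcasts it up.
    true_shape = np.broadcast_shapes(self_shape, other.shape)   # (2, 3)
    buf = np.zeros(true_shape)
    buf[:] = other                       # broadcasts (2, 1) -> (2, 3)
    assert buf.shape == self_shape       # gradient now matches the input shape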

I'm sorry to trouble you.

ifoyooo closed this as completed Oct 3, 2021

srush commented Oct 4, 2021

Thanks. Yes, this is not well documented. I just kind of snuck it in to fix this issue.
