zero for second residual grad #33

aijianiula0601 · 2022-11-23T10:17:55Z

Thanks for your work. While checking the code, we found that the residual layers from the second layer onward receive no gradient. Could you please confirm?

We changed the line: residual = residual - quantized ---> residual = residual - quantized.detach()

Here is the verification we ran:

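    import torch
    from vector_quantize_pytorch import ResidualVQ  # assumed import; not shown in the original post

    if __name__ == "__main__":
        quantizer = ResidualVQ(
            num_quantizers=4, dim=256, codebook_size=16,
            kmeans_init=True, kmeans_iters=10, threshold_ema_dead_code=2, channel_last=False,
        )

        # Backpropagate each quantizer layer's loss in turn and inspect the input gradient.
        for i in range(4):
            input = torch.rand((2, 256, 30), requires_grad=True)
            quantized, indices, losses = quantizer(input)
            print(quantized.shape, indices.shape, losses.shape)

            losses[0, i].backward()
            print(input.grad)  # a zero gradient means layer i's loss never reaches the input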
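
To make the reported effect easier to reproduce without the library, here is a minimal toy sketch in plain PyTorch. It assumes each quantizer layer returns a straight-through output of the form x + (codes - x).detach() and uses an MSE commitment loss; the helper names `toy_quantize` and `second_layer_input_grad`, the codebook sizes, and the loss form are hypothetical and are not taken from the repository. Under those assumptions, subtracting the non-detached output cancels the gradient path back to the input, while subtracting the detached output keeps it.

```python
import torch
import torch.nn.functional as F


def toy_quantize(x, codebook):
    """Hypothetical VQ layer: nearest-code lookup with a straight-through output."""
    # Squared Euclidean distance from every input row to every codebook row.
    dists = (x[:, None, :] - codebook[None, :, :]).pow(2).sum(dim=-1)
    codes = codebook[dists.argmin(dim=-1)]
    # Forward value equals `codes`; the gradient flows to `x` as an identity.
    return x + (codes - x).detach()


def second_layer_input_grad(detach_quantized):
    """Gradient of the second layer's loss with respect to the original input."""
    torch.manual_seed(0)
    x = torch.randn(8, 4, requires_grad=True)
    codebooks = [torch.randn(16, 4) for _ in range(2)]

    residual = x
    losses = []
    for codebook in codebooks:
        quantized = toy_quantize(residual, codebook)
        # Assumed commitment-style loss: pull the layer input toward its code.
        losses.append(F.mse_loss(quantized.detach(), residual))
        if detach_quantized:
            residual = residual - quantized.detach()  # the change proposed in this issue
        else:
            residual = residual - quantized           # the original update

    losses[1].backward()
    return x.grad


grad_before = second_layer_input_grad(detach_quantized=False)
grad_after = second_layer_input_grad(detach_quantized=True)

print("without detach:", grad_before.abs().sum().item())
print("with detach:   ", grad_after.abs().sum().item())
```

In this toy setup the first sum is zero because the residual becomes x - (x + constant), so the two gradient contributions to x cancel; with the detach, the residual is x minus a constant and the second layer's loss reaches the input.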

lucidrains · 2022-11-23T21:20:16Z

@aijianiula0601 i do believe you are correct! thank you for spotting this! 🙏

lucidrains added a commit that referenced this issue Nov 23, 2022

address #33

ecf2f7c

lucidrains closed this as completed Nov 26, 2022

npuichigo mentioned this issue Dec 26, 2022

Zero grad in residual vq facebookresearch/encodec#25

Open

liangcl0928 mentioned this issue Jun 13, 2023

Crash on Mac M1/M2 chip when using MPS support #55

Closed
