
Bug about Dropout CUDA Kernel #68909

Open
MARD1NO opened this issue Nov 25, 2021 · 4 comments
Labels
module: cuda Related to torch.cuda, and CUDA support in general · module: nn Related to torch.nn · triaged This issue has been looked at by a team member, and triaged and prioritized into an appropriate module

Comments

@MARD1NO

MARD1NO commented Nov 25, 2021

In Dropout.cu (https://github.com/pytorch/pytorch/blob/master/aten/src/ATen/native/cuda/Dropout.cu#L35)

On line 56, you set bool gridxvec_loop_state = 0.

When gridxvec_loop_state is 0, you generate 4 random numbers; otherwise, you want to reuse the last 2 random values from the previous iteration:

    if ((VEC == 4) || (gridxvec_loop_state == 0)) {
      rand = curand_uniform4(&state);
    } else {
      // sets up the last two values we generated last iteration to be used this iteration.
      rand.x = rand.z;
      rand.y = rand.w;
      gridxvec_loop_state ^= 1;
    }

Since loop_state is initialized to 0, execution enters the if branch and generates 4 random numbers, but loop_state is never changed, so the else branch is never entered and the last 2 random values are never reused.

In my opinion, it should be like this:

    if ((VEC == 4) || (gridxvec_loop_state == 0)) {
      rand = curand_uniform4(&state);
      gridxvec_loop_state ^= 1;
    } else {
      // sets up the last two values we generated last iteration to be used this iteration.
      rand.x = rand.z;
      rand.y = rand.w;
      gridxvec_loop_state ^= 1;
    }

cc @albanD @mruberry @jbschlosser @walterddr @ngimel

@eqy
Collaborator

eqy commented Nov 29, 2021

CC'ing @mcarilli who might be interested

@soulitzer soulitzer added module: cuda Related to torch.cuda, and CUDA support in general · module: nn Related to torch.nn · triaged This issue has been looked at by a team member, and triaged and prioritized into an appropriate module labels Nov 30, 2021
@eqy
Collaborator

eqy commented Dec 9, 2021

CC @ngimel

@ngimel
Collaborator

ngimel commented Dec 9, 2021

We'll accept a fix for this, but Dropout with vec_size=2 has probably never been used (and even if it has, the bug is not catastrophic; we just don't advance the random state as far as we should, which is fine).

@mcarilli
Collaborator

mcarilli commented Dec 9, 2021

I think you're right about the bug and the proposed fix, good catch.
