RuntimeError: copy_if failed to synchronize: device-side assert triggered #658
Comments
According to #275, a learning rate that is too large may be the cause, although the error message does not seem related at all. I am still verifying this.
A learning rate that is too large is indeed the problem. Lowering the learning rate solves it.
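For anyone trying to confirm the same diagnosis: a learning rate that is too large commonly makes the loss diverge to inf or NaN a few steps before a crash, so a finiteness check can surface the problem with a readable message. The following is only a minimal sketch, not code from this repository. `check_finite_loss` is a hypothetical helper, the loss values are made up for illustration, and it assumes the loss is available as a plain Python float (in PyTorch, for example, via `loss.item()`).

```python
import math


def check_finite_loss(loss_value: float, step: int) -> None:
    """Raise a readable error as soon as the loss is inf or NaN.

    Hypothetical helper: call it once per training step with the
    scalar loss converted to a Python float.
    """
    if not math.isfinite(loss_value):
        raise FloatingPointError(
            f"loss became {loss_value} at step {step}; "
            "consider lowering the learning rate"
        )


# Made-up loss values standing in for what a training loop would log.
losses = [2.31, 1.87, 5.42, float("inf"), float("nan")]

first_bad_step = None
for step, value in enumerate(losses):
    try:
        check_finite_loss(value, step)
    except FloatingPointError:
        # Remember the first step at which the loss stopped being finite.
        first_bad_step = step
        break
```

If the check fires early in training, that is at least consistent with the learning-rate explanation above. If it never fires, the assert may have a different cause.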
I have the same issue. Can you tell me how you reduced your learning rate? Is it just a matter of experience, lowering the value a little from the starting point? Thanks.
A batch size that is too large can also cause this issue.
Hello, I have hit the same issue. I reduced the learning rate, but that did not resolve it. Could you help me resolve it? Thanks! The error is below:
🐛 Bug
This may be similar to #229, but the message is slightly different. #229 is "an illegal memory access was encountered", but what I met is "device-side assert triggered". I have changed the NUM_CLASSES as well.
To Reproduce
Steps to reproduce the behavior:
Run training code
Expected behavior
No error
Environment