New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[WIP] FIX Unit tests on Windows #85
Conversation
I suspect both have something to do with numpy integers being 64-bit on Linux but not (apparently) on Windows. |
It seems we are very efficiently duplicating each other's work :) |
@maciejkula yes, I just realised this is a duplicate of your PR #84. I can remove the redundant parts, up to you how to split this work, I wouldn't mind helping a bit :) |
OK, I like your set up better, so let's roll with what's here and I'll close the other PR. With respect to the int/long issue: I think I would prefer to track down the root issue (why do we have ints anywhere at all?) and leave the layers without casting. Is there a way you could roll back the embedding layer changes only so that I could see where it fails in Appveyor? |
Great!
Yes, sure, done.
I don't get it either, but I'm also not used to debugging PyTorch code. The function deeper in the traceback that raises the exception (e.g. |
Great! I'm of limited use because I don't have access to a Windows machine, so can't easily make it fail without going to Appveyor all the time. I think the key will be dropping into a debugger where it fails and going up the stack until we hit the source of the problem. |
In hindsight, so obvious. Thanks a lot for the fixes! Are you happy for me to merge? If so, I'll push a new version out to Conda tonight. |
Yes, everything appears to be due to |
🎉 (I have seen similar things with |
This PR aims to fix #82
It includes,
randint
overflow issueIntTensor
andLongTensor
by casting them in forward as suggested here embeddings layer with IntTensor / cuda.IntTensor inputs pytorch/pytorch#145 (comment) . I must be missing something though as I still don't understand why this not an issue on Linux but only on Windows..The latest Appveyor output can be seen here some of the failures will go away once #83 is merged..