-
Notifications
You must be signed in to change notification settings - Fork 1.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
hotfix: cpp softmax not impl #572
Conversation
Hi @chrishkchris , thank you for pointing this out. could you please help on testing at your side? |
thanks. I will test soon |
@dcslin I tested the examples/autograd/mlp.py (multilayer perception) and there is such error:
However if I use softmax_cross_entropy instead of softmax + cross_entropy it will be totally okay, i.e.
|
@dcslin Another observation: in mnist_cnn.py, if I change softmax_cross_entropy to softmax + cross_entropy, it fails no matter what the axis is selected for SoftMax
|
@dcslin
|
To sum up, this PR fixed the cpp SoftMax not implemented error. After checking the result with pytorch, it is okay. |
Fix mlp.py
Hi @chrishkchris , thank you the fix on mlp.py |
moved
softmax(..., axis)
implementation one level up, from cuda to tensor. After the change,softmax(..., axis)
implemenation could serve for both cuda and cpp. Tested is added accordingly.