Fix softmax_cross_entropy #8254

kevinthesun · 2017-10-13T07:24:03Z

Description

Fix softmax_cross_entropy operator "Not enough argument to call operator softmax_cross_entropy" issue #6874

Checklist

Essentials

Passed code style checking (make lint)
Changes are complete (i.e. I finished coding on this PR)
All changes have test coverage
For user-facing API changes, API doc string has been updated.
To my best knowledge, examples are either not affected by this change, or have been fixed to be compatible with this change

Changes

Fix "Not enough argument to call operator softmax_cross_entropy" issue for softmax_cross_entropy operator.
Change output to be shape (batch_size,), which is consistent with gluon softmax loss.

piiswrong · 2017-10-13T17:33:22Z

softmax_cross_entropy's output currently has shape (1,)
It should behave like gluon.loss.SoftmaxCrossEntropyLoss instead.
Since this is currently broken and no one is using it, we should take the chance to fix it

kevinthesun · 2017-10-13T18:19:38Z

Got it. I'll try to fix it.

piiswrong · 2017-12-12T22:07:09Z

closing due to inactive

eric-haibin-lin · 2017-12-15T18:10:51Z

This issue breaks existing examples #6874
We should fix this

eric-haibin-lin · 2017-12-15T18:13:20Z

@kevinthesun you can remove the sumall_except_dim at https://github.com/apache/incubator-mxnet/blob/master/src/operator/loss_binary_op-inl.h#L80 to make sure it doesn't return scalar result.
Also update the infer shape function

Roshrini · 2017-12-15T18:26:15Z

+1 Currently https://github.com/apache/incubator-mxnet/tree/master/example/model-parallel/lstm example breaks because of this issue.

kevinthesun · 2017-12-15T20:12:18Z

@eric-haibin-lin Will work on this today.

kevinthesun · 2017-12-16T11:17:12Z

@piiswrong @eric-haibin-lin @Roshrini

ehsanmok · 2018-01-15T22:56:24Z

Any update for this fix?

eric-haibin-lin · 2018-01-22T20:37:35Z

@kevinthesun any update?

kevinthesun · 2018-01-22T22:00:04Z

Already fixed.

CircleXing001 · 2018-05-18T07:16:06Z

@kevinthesun why it reproduce

tao-sun · 2018-06-22T01:17:31Z

I use docker python:1.1.0_gpu_cuda8 and the same error is reproduced.

piiswrong closed this Dec 12, 2017

eric-haibin-lin reopened this Dec 15, 2017

eric-haibin-lin mentioned this pull request Dec 15, 2017

Fix softmax_cross_entropy list input names #6766

Closed

kevinthesun and others added 2 commits December 15, 2017 12:17

Fix softmax_cross_entropy

4512c6f

Change softmax_cross_entropy output shape to be (batch_size,)

8e47b14

kevinthesun force-pushed the FixSoftmaxCE branch from 28a3cb5 to 8e47b14 Compare December 16, 2017 00:14

Wang added 2 commits December 16, 2017 03:05

Fix and change test case

692b4d1

Fix test

da48543

piiswrong closed this Jan 31, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix softmax_cross_entropy #8254

Fix softmax_cross_entropy #8254

kevinthesun commented Oct 13, 2017 •

edited

Loading

piiswrong commented Oct 13, 2017

kevinthesun commented Oct 13, 2017

piiswrong commented Dec 12, 2017

eric-haibin-lin commented Dec 15, 2017

eric-haibin-lin commented Dec 15, 2017

Roshrini commented Dec 15, 2017

kevinthesun commented Dec 15, 2017

kevinthesun commented Dec 16, 2017

ehsanmok commented Jan 15, 2018

eric-haibin-lin commented Jan 22, 2018

kevinthesun commented Jan 22, 2018

CircleXing001 commented May 18, 2018

tao-sun commented Jun 22, 2018

Fix softmax_cross_entropy #8254

Fix softmax_cross_entropy #8254

Conversation

kevinthesun commented Oct 13, 2017 • edited Loading

Description

Checklist

Essentials

Changes

piiswrong commented Oct 13, 2017

kevinthesun commented Oct 13, 2017

piiswrong commented Dec 12, 2017

eric-haibin-lin commented Dec 15, 2017

eric-haibin-lin commented Dec 15, 2017

Roshrini commented Dec 15, 2017

kevinthesun commented Dec 15, 2017

kevinthesun commented Dec 16, 2017

ehsanmok commented Jan 15, 2018

eric-haibin-lin commented Jan 22, 2018

kevinthesun commented Jan 22, 2018

CircleXing001 commented May 18, 2018

tao-sun commented Jun 22, 2018

kevinthesun commented Oct 13, 2017 •

edited

Loading