This repository has been archived by the owner on Nov 17, 2023. It is now read-only.

[scala] Make accuracy independent of output size (fix #8226) #8297

Merged
merged 1 commit into apache:master on Nov 22, 2017

Conversation

@benqua (Contributor) commented Oct 16, 2017

Description

This PR changes EvalMetric.sumMetric from Float to Double.

Fix #8226
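For context on the bug: a Float accumulator stops registering increments once its magnitude is large enough, because Float carries only about 7 significant decimal digits. A minimal, self-contained Scala sketch (illustrative only, not code from this PR) showing the saturation:

object FloatSaturation {
  def main(args: Array[String]): Unit = {
    // Float has ~7 significant decimal digits; at 1e8 the spacing between
    // adjacent representable Floats is 8, so adding 1.0f is lost to rounding.
    var sumFloat: Float = 1e8f
    sumFloat += 1.0f
    println(sumFloat == 1e8f) // true: the update vanished

    // Double has ~15-16 significant digits, so the same update survives.
    var sumDouble: Double = 1e8
    sumDouble += 1.0
    println(sumDouble == 1e8) // false: the update was recorded
  }
}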

@yzhliu (Member) commented Oct 18, 2017

Thanks @benqua, I think a better way is to change sumMetric to Double. The fix here changes the definition of Acc.

@benqua (Contributor, Author) commented Oct 18, 2017

@Javelinjs, you're right, it changes the definition of accuracy for output.size > 1.
What is the exact definition of Accuracy? I couldn't find a clear definition.

This change provides a definition of accuracy that matches the one from Wikipedia for binary classification, which says:
the accuracy is the proportion of true results (both true positives and true negatives) among the total number of cases examined
(https://en.wikipedia.org/wiki/Accuracy_and_precision#In_binary_classification).

It seems weird (at least to me :) ) that the accuracy depends on the output dimension and can grow to very large numbers. By dividing by the label dimension, we keep the accuracy between 0 and 1, which is the expected range of a "proportion".
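To make the "proportion" reading concrete, here is a minimal Scala sketch of that definition (a hypothetical helper over plain arrays of class indices, not the actual EvalMetric code):

object AccuracyAsProportion {
  // labels and preds are assumed to be flat arrays of class indices,
  // one entry per example, regardless of the network's output size.
  def accuracy(labels: Array[Int], preds: Array[Int]): Double = {
    require(labels.length == preds.length, "labels and predictions must align")
    val correct = labels.zip(preds).count { case (l, p) => l == p }
    correct.toDouble / labels.length // always in [0, 1], as a proportion should be
  }

  def main(args: Array[String]): Unit =
    println(accuracy(Array(0, 1, 2, 2), Array(0, 1, 1, 2))) // 0.75
}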

If we change sumMetric to Double, should we do it only for the value stored internally and keep Float in the EvalMetric API?

@yzhliu (Member) commented Oct 19, 2017

We should keep it the same as the other language bindings, especially Python. What if we make it Double in the EvalMetric API?

@benqua (Contributor, Author) commented Oct 19, 2017

It can be done, but it would break the API for anyone calling EvalMetric.get (https://github.com/apache/incubator-mxnet/blob/master/scala-package/core/src/main/scala/ml/dmlc/mxnet/EvalMetric.scala#L52).

@yzhliu (Member) commented Oct 19, 2017

We can convert it back to Float when calling EvalMetric.get
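A minimal sketch of that compromise (a simplified stand-in for EvalMetric with a hypothetical update signature, not the real class): accumulate in Double internally and convert to Float only at the API boundary, so existing callers of get keep compiling:

// Simplified stand-in: the accumulator is a Double, but get still
// exposes Float so the public API is unchanged.
class SimpleAccuracy(val name: String = "accuracy") {
  private var sumMetric: Double = 0.0
  private var numInst: Long = 0L

  // Hypothetical update over plain arrays of class indices.
  def update(labels: Array[Int], preds: Array[Int]): Unit = {
    sumMetric += labels.zip(preds).count { case (l, p) => l == p }
    numInst += labels.length
  }

  // The Double-to-Float conversion happens here, once, at the boundary.
  def get: (Array[String], Array[Float]) =
    (Array(name), Array((sumMetric / numInst).toFloat))
}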

@benqua (Contributor, Author) commented Oct 19, 2017

@Javelinjs, OK, I updated the PR as you suggested.
(Maybe I should have opened a new one, since the comment thread is now difficult to follow.)
What do you think?

@benqua (Contributor, Author) commented Oct 20, 2017

Oh, it seems there is a build issue. I am not sure it is related to my code (something about the Windows GPU build), but I will check this weekend.

@yzhliu (Member) commented Oct 20, 2017

LGTM.
The CI seems to have some problems these days. Let's try again this weekend.

@yzhliu (Member) commented Oct 27, 2017

@benqua could you rebase the PR and re-trigger the CI build?

@benqua (Contributor, Author) commented Oct 27, 2017

Rebased. Waiting for the CI to complete.

@yzhliu (Member) commented Oct 29, 2017

@piiswrong Could you help to do a force merge?

@benqua (Contributor, Author) commented Nov 2, 2017

I tried again, still no luck.
The CI output doesn't make it clear whether this is still a CI issue or a problem with the PR.

@yzhliu (Member) commented Nov 3, 2017

Sorry to see that. Please rebase again, I think the CI is OK now.

@benqua (Contributor, Author) commented Nov 3, 2017

Done. Let's see...

@benqua (Contributor, Author) commented Nov 4, 2017

It fails with:

FAIL: test_operator_gpu.test_svmoutput_with_type
----------------------------------------------------------------------
Traceback (most recent call last):
  File "C:\Anaconda3\envs\py2\lib\site-packages\nose\case.py", line 197, in runTest
    self.test(*self.arg)
  File "c:\jenkins_slave\workspace\ut-python-gpu\tests\python\gpu\test_operator_gpu.py", line 999, in test_svmoutput_with_type
    check_consistency(sym, ctx_list)
  File "c:\jenkins_slave\workspace\ut-python-gpu\pkg_vc14_gpu\python\mxnet\test_utils.py", line 1338, in check_consistency
    raise e
AssertionError:
Items are not equal:
Error 5.000000 exceeds tolerance rtol=0.100000, atol=0.100000.  Location of maximum error:(18, 3), a=0.000000, b=1.000000
 a: array([[-1.,  1.,  1., ...,  1.,  1.,  1.],
       [-1.,  1.,  0., ...,  0.,  1.,  1.],
       [-1.,  1.,  1., ...,  1.,  1.,  1.],...
 b: array([[-1.,  1.,  1., ...,  1.,  1.,  1.],
       [-1.,  1.,  0., ...,  0.,  1.,  1.],
       [-1.,  1.,  1., ...,  1.,  1.,  1.],...
-------------------- >> begin captured stdout << ---------------------
Train Err: ctx 2 vs ctx 0 at svmoutput_data
--------------------- >> end captured stdout << ----------------------
----------------------------------------------------------------------
Ran 243 tests in 1392.034s

FAILED (SKIP=5, failures=1)

(py2) c:\jenkins_slave\workspace\ut-python-gpu>IF 1 NEQ 0 exit /b 1
script returned exit code 1

I will investigate when I find time. Ideas and help are welcome :)

@benqua force-pushed the issue-8226 branch 4 times, most recently from 98c2b13 to 28dc2ee on November 16, 2017 at 10:55
@benqua force-pushed the issue-8226 branch 2 times, most recently from b19ad48 to 0873ecb on November 20, 2017 at 20:15
@marcoabreu (Contributor) commented Nov 21, 2017

I've encountered the same error with test_operator_gpu.test_svmoutput_with_type on my setup, based on the release branch v0.12.0. Please check our internal wiki, @KellenSunderland @mbaijal @larroy

@marcoabreu (Contributor) commented Nov 21, 2017

https://builds.apache.org/blue/organizations/jenkins/incubator-mxnet/detail/PR-8297/13/pipeline

line 452

[ERROR] /workspace/scala-package/core/src/main/scala/ml/dmlc/mxnet/EvalMetric.scala:115: error: not found: value la
[INFO] this.sumMetric += la.zip(predLabel.toArray)

That’s clearly not a CI issue
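The failing line reads like a leftover from a rename: presumably la was meant to be the label array that gets zipped against the predictions. A hedged guess at the intended logic, simplified to plain arrays (the names and signature here are assumptions, not the actual EvalMetric code):

// Presumed intent of the broken line: count the positions where label and
// prediction agree and add that count to the running sumMetric.
def updatedSum(sumMetric: Double, label: Array[Float], predLabel: Array[Float]): Double =
  sumMetric + label.zip(predLabel).count { case (l, p) => l == p }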

When the difference in magnitude between the total accuracy and 1 becomes too big, accuracy is not updated anymore due to the low precision of float numbers.
@benqua (Contributor, Author) commented Nov 21, 2017

@Javelinjs, the PR finally passes all tests.
Do you think it can be merged?

@yzhliu merged commit 8df20a2 into apache:master Nov 22, 2017
@yzhliu (Member) commented Nov 22, 2017

Thanks.

eric-haibin-lin pushed a commit to eric-haibin-lin/mxnet that referenced this pull request Dec 3, 2017
KellenSunderland pushed a commit to KellenSunderland/incubator-mxnet that referenced this pull request Dec 3, 2017
zhreshold pushed a commit to zhreshold/mxnet that referenced this pull request Dec 14, 2017
rahul003 pushed a commit to rahul003/mxnet that referenced this pull request Jun 4, 2018