Fix pearson correlation.py #3101

JackKuo666 · 2019-07-27T06:19:29Z

Fixes #3102. The input tensor (for example, a batch of labels) may be constant, such as tensor([[0.,0.,0.,0.], [0.,0.,0.,0.]]). In that case math.sqrt(predictions_variance) or math.sqrt(labels_variance) is zero, so a check is added to keep the denominator from being zero.

The first version of this fix assigned the denominator a value of 1 when it was zero. The current version assigns pearson_r a value of 0 instead.
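As a rough illustration of the check described above, here is a minimal standalone sketch in plain Python. It is not the code in pearson_correlation.py: the function signature and the list-based inputs are assumptions made for the example, and only the names predictions_variance, labels_variance, and pearson_r are taken from this PR.

```python
import math


def pearson_r(predictions, labels):
    """Pearson correlation of two equal-length sequences.

    Returns 0.0 when either sequence is constant, mirroring the
    zero-denominator check described in this PR (illustrative sketch only).
    """
    n = len(predictions)
    mean_p = sum(predictions) / n
    mean_l = sum(labels) / n

    # Population covariance and variances.
    covariance = sum(
        (p - mean_p) * (l - mean_l) for p, l in zip(predictions, labels)
    ) / n
    predictions_variance = sum((p - mean_p) ** 2 for p in predictions) / n
    labels_variance = sum((l - mean_l) ** 2 for l in labels) / n

    denominator = math.sqrt(predictions_variance) * math.sqrt(labels_variance)
    if denominator == 0:
        # A constant input has zero variance, so the ratio is undefined.
        # Report 0 instead of dividing by zero.
        return 0.0
    return covariance / denominator
```

With this sketch, a constant batch such as [0., 0., 0., 0.] yields 0.0 rather than raising an error, while non-constant inputs go through the usual formula.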

matt-gardner

Thanks for the bug fix! Does the test you added fail without the changes that you made?

There are some minor pylint things to fix (you can see them here: http://build.allennlp.org/viewLog.html?buildId=18000&buildTypeId=AllenNLP_AllenNLPPullRequests&tab=buildLog). After that, this should be good to merge.

allennlp/tests/training/metrics/pearson_correlation_test.py

fix some pylint things

JackKuo666

This bug occurs because, in some input data, all of a batch's predictions or all of its labels may be exactly the same. When pearson_correlation.py is then called, the predictions_variance or labels_variance computed for that batch is 0, resulting in:

ZeroDivisionError: float division by zero

The two tests are separate because most data looks like predictions_1, but in a few cases, such as predictions_2, every value in the batch is exactly the same.
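To make the two cases concrete, the sketch below reproduces the failure in plain Python. The values given for predictions_1, predictions_2, and labels are hypothetical stand-ins (the actual test tensors are not shown in this thread), and the helper names variance and unguarded_division are invented for the example.

```python
import math


def variance(values):
    """Population variance of a sequence of floats."""
    mean = sum(values) / len(values)
    return sum((v - mean) ** 2 for v in values) / len(values)


# Hypothetical stand-ins for the two kinds of batch described above.
predictions_1 = [0.1, 0.4, 0.3, 0.9]  # typical batch: values differ
predictions_2 = [0.0, 0.0, 0.0, 0.0]  # rare batch: every value identical
labels = [1.0, 2.0, 3.0, 4.0]


def unguarded_division(predictions, labels):
    """Divide by the Pearson denominator with no zero check."""
    denominator = math.sqrt(variance(predictions)) * math.sqrt(variance(labels))
    return 1.0 / denominator


# The typical batch has a positive variance, so the division succeeds.
typical_result = unguarded_division(predictions_1, labels)

# The constant batch has zero variance, so the division fails.
try:
    unguarded_division(predictions_2, labels)
    raised = False
except ZeroDivisionError:
    raised = True
```

Keeping the constant batch in its own test makes it clear which input exercises the zero-denominator path.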

JackKuo666 · 2019-08-01T02:48:06Z

@matt-gardner please review

DeNeutoy · 2019-08-16T22:33:34Z

@matt-gardner can you take another look at this?

matt-gardner

Thanks, LGTM! Sorry to be super slow with this, I've been traveling for the last few weeks.

* fix bug: ZeroDivisionError: float division by zero
  Since the input tensor may be, for example, a tensor ([[0.,0.,0.,0.], [0.,0.,0.,0.]]), there will be a case where (math.sqrt(predictions_variance) or math.sqrt(labels_variance)) is zero, so a judgment is added here to prevent the denominator from being zero. If it is zero, the denominator is assigned a value of 1.
* fix bug: ZeroDivisionError: float division by zero
  Since the input tensor may be, for example, a tensor ([[0.,0.,0.,0.], [0.,0.,0.,0.]]), there will be a case where (math.sqrt(predictions_variance) or math.sqrt(labels_variance)) is zero, so a judgment is added here to prevent the denominator from being zero. If it is zero, the denominator is assigned a value of 1.
* fix bug: ZeroDivisionError: float division by zero
  Since the input tensor may be, for example, a tensor ([[0.,0.,0.,0.], [0.,0.,0.,0.]]), there will be a case where (math.sqrt(predictions_variance) or math.sqrt(labels_variance)) is zero, so a judgment is added here to prevent the denominator from being zero. If it is zero, the pearson_r is assigned a value of 0.
* fix bug: ZeroDivisionError: float division by zero
  Since the input tensor may be, for example, a tensor ([[0.,0.,0.,0.], [0.,0.,0.,0.]]), there will be a case where (math.sqrt(predictions_variance) or math.sqrt(labels_variance)) is zero, so a judgment is added here to prevent the denominator from being zero. If it is zero, the pearson_r is assigned a value of 0.
* fix some pylint things
* Update pearson_correlation.py
* Update pearson_correlation_test.py
* Update pearson_correlation_test.py

JackKuo666 added 2 commits July 27, 2019 14:12

JackKuo666 mentioned this pull request Jul 27, 2019

ZeroDivisionError: float division by zero [pearson_correlation.py] #3102

Closed

JackKuo666 added 2 commits July 29, 2019 21:27

matt-gardner reviewed Jul 29, 2019

allennlp/tests/training/metrics/pearson_correlation_test.py

JackKuo666 added 4 commits July 29, 2019 23:42

fix some pylint things

b9faa6c

fix some pylint things

Merge branch 'master' into fix_pearson_correlation.py

38c8465

Update pearson_correlation.py

bc5351a

Update pearson_correlation_test.py

14d0e69

JackKuo666 commented Jul 30, 2019

Update pearson_correlation_test.py

2df0d2a

matt-gardner approved these changes Aug 20, 2019

matt-gardner merged commit 18daa29 into allenai:master Aug 20, 2019
