Add Covariance and PearsonCorrelation metrics #1684

nelson-liu · 2018-08-29T07:36:21Z

This PR implements an online algorithm for calculating Covariance and the sample Pearson correlation coefficient.

This was actually nontrivial, I mostly referenced the tensorflow streaming_covariance metric in implementing this. Their implementation is a vectorized version of the weighted algorithm on this wikipedia page

The tests simply ensure that the streaming Covariance and PearsonCorrelation match up with what numpy would calculate, which I believe is a reasonable correctness check.

joelgrus · 2018-08-29T14:06:16Z

allennlp/training/metrics/covariance.py

+    def __init__(self) -> None:
+        self._total_prediction_mean = 0.0
+        self._total_label_mean = 0.0
+        self._total_comoment = 0.0


nit: it's very hard for my brain not to look at comoment and feel like someone made a typo in comment, which is jarring. would you consider co_moment instead? 😀

(feel free to tell me I'm being a crazy person)

This PR implements an online algorithm for calculating Covariance and the sample Pearson correlation coefficient. This was actually nontrivial, I mostly referenced the tensorflow [streaming_covariance metric](https://github.com/tensorflow/tensorflow/blob/4dcfddc5d12018a5a0fdca652b9221ed95e9eb23/tensorflow/contrib/metrics/python/ops/metric_ops.py#L3127-L3264) in implementing this. Their implementation is a vectorized version of the weighted algorithm [on this wikipedia page](https://en.wikipedia.org/wiki/Algorithms_for_calculating_variance#Online) The tests simply ensure that the streaming Covariance and PearsonCorrelation match up with what numpy would calculate, which I believe is a reasonable correctness check.

nelson-liu added 5 commits August 28, 2018 23:47

Add Covariance metric and a test for it

dd5996b

Fix covariance returns docstring

2ea3f01

Move FloatTensor call to variable definition

afd430b

Fix lint

e92db94

Add PearsonCorrelation and tests for it

f4e42c4

nelson-liu requested review from joelgrus and matt-gardner August 29, 2018 07:36

nelson-liu added 4 commits August 29, 2018 00:50

Add more docs

f817f9d

Fix long lines in docstring

064fbf6

Test Covariance and PearsonCorrelation reset

ebe2aca

Test PearsonCorrelation reset with masked inputs

243e037

joelgrus approved these changes Aug 29, 2018

View reviewed changes

s/comoment/co_moment/ in variable names

36089b6

nelson-liu merged commit 2a45f44 into allenai:master Aug 29, 2018

nelson-liu deleted the pearson_correlation_metric branch August 29, 2018 15:44

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add Covariance and PearsonCorrelation metrics #1684

Add Covariance and PearsonCorrelation metrics #1684

nelson-liu commented Aug 29, 2018

joelgrus Aug 29, 2018

joelgrus Aug 29, 2018

Add Covariance and PearsonCorrelation metrics #1684

Add Covariance and PearsonCorrelation metrics #1684

Conversation

nelson-liu commented Aug 29, 2018

joelgrus Aug 29, 2018

Choose a reason for hiding this comment

joelgrus Aug 29, 2018

Choose a reason for hiding this comment