Introduce Kendall's Tau computation. #1147

sorensenjs · 2020-10-29T17:13:51Z

Migrated and updated https://github.com/tensorflow/addons/pull/2169/files

davmre

This looks great; thanks a lot for the contribution! I've gone ahead and made a first pass of comments, mostly focused on TFP-specific requirements and style. Happy to discuss further.

tensorflow_probability/python/stats/kendalls_tau.py

tensorflow_probability/python/stats/kendalls_tau_test.py

tensorflow_probability/python/stats/kendalls_tau.py

davmre · 2020-10-29T18:15:16Z

tensorflow_probability/python/stats/kendalls_tau.py

+  exchanges = 0
+  num = tf.size(y)
+  k = tf.constant(1, tf.int32)
+  while tf.less(k, num):


This is a bit annoying, but I'm going to ask that you write this loop, and all loops in this file, using TF graph ops (tf.while_loop or tf.scan) in place of Python while or for loop constructs. Since we don't know the contexts in which TFP code will be called, we need the whole library to work in both eager and graph modes (these days you can read 'graph mode' as equivalent to 'inside a @tf.function tracing context'), and in particular when shape information is not statically available. That means we can't assume that conditions like tf.less(k, num) will be concrete values available to the Python interpreter, so the control flow has to occur at the TF level.

It's usually pretty mechanical to do this conversion, though it does force you to think explicitly about the state carried through the loop. In general if you can frame a Python loop in the form

loop_state = initial_loop_state while condition(loop_state): loop_state = loop_body(loop_state)

for some structure of Tensors loop_state, then the translation is just:

final_loop_state = tf.while_loop( condition, loop_body, initial_loop_vars)

and you can use TensorArrays to store any values accumulated during the loop (or tf.scan, which is just a thin wrapper around while_loop + TensorArray).

The test for whether everything is working is to trace the code with unknown inputs of unknown shape

traced_kendalls_tau = tf.function( kendalls_tau, autograph=False).get_concrete_function( y_true=tf.TensorSpec(shape=None, dtype=tf.float32), y_pred=tf.TensorSpec(shape=None, dtype=tf.float32))

and verify that the traced_kendalls_tau behaves identically to the original kendalls_tau function.

The @test_util.test_all_tf_execution_regimes decorator in the unit test file should be doing something like this (though with known shapes). It'd surprise me if those tests are all passing currently? In any case, the above check is the gold standard.

We have a lot of experience writing graph-mode control flow (it's a pain, but you get used to it), so feel free to ask for help if you get stuck.

My test file was missing the main so the tests were not executing.

davmre · 2020-10-29T18:21:36Z

tensorflow_probability/python/stats/kendalls_tau.py

+      while tf.less(i, rght) and tf.less(j, rend):
+        permij = aperm.gather([i, j])
+        yij = tf.gather(y, permij)
+        if tf.less_equal(yij[0], yij[1]):


Similarly to the point above about loop control flow, we also need to replace

if condition_fn(): result = do_one_thing() else: result = do_another_thing()

with the equivalent graph op

result = tf.cond( condition_fn, do_one_thing, do_another_thing)

to handle the case where the condition can't be statically evaluated.

(here and elsewhere)

tensorflow_probability/python/stats/kendalls_tau.py

davmre · 2020-10-29T19:11:34Z

tensorflow_probability/python/stats/kendalls_tau.py

+  v += ((n - first) * (n - first - 1)) // 2
+
+  tot = (n * (n - 1)) // 2
+  if tf.equal(tot, u) or tf.equal(tot, v):


As above---prefer raising an error (assert_util.assert_none_equal(tot, y)) to returning NaN.

tensorflow_probability/python/stats/kendalls_tau.py

tensorflow_probability/python/stats/kendalls_tau_test.py

Adds conversion of arguments via convert_to_tensor, plus additional paramters for type and name.

sorensenjs · 2021-03-04T19:44:38Z

Working on integrating via tensorflow probability.

bhack · 2021-03-04T19:50:25Z

Is there another PR?

sorensenjs · 2021-03-04T19:59:35Z

Not yet, but the code has diverged a lot from this PR so thought I should close. Will ping this thread when it's made part of a future release.

Introduce Kendall's Tau computation.

11dbb9d

googlebot added the cla: yes Declares that the user has signed CLA label Oct 29, 2020

sorensenjs mentioned this pull request Oct 29, 2020

Kendall's Tau metric, based loosely on scipy. tensorflow/addons#2169

Closed

20 tasks

sorensenjs added 2 commits October 29, 2020 14:32

Fix dependencies, adding kendall's tau to stats.

bcdc37c

Add kendall's tau library target.

1409974

davmre reviewed Oct 29, 2020

View reviewed changes

sorensenjs added 11 commits November 2, 2020 08:55

Use TestCase assertion methods.

3f4ab59

Use self.evaluate() instead of .numpy()

5727e96

Update copywrite to specify TFP authors.

d3af694

Add _all_ annotation, make iterative_mergesort a non-private member.

38eea1e

Avoid tf constant.

c31d7dd

Remove pytype annotations.

14f5044

Add tf.name scopes.

c8b63c5

Multiple changes to comply with probability style guide.

9983759

Fixes mergesort unit tests with correct assertions.

aa94925

Adds conversion of arguments via convert_to_tensor, plus additional paramters for type and name.

Remove reference to scipy code - this code only shares procedural order.

5c735ec

Align with William Knight's notation.

3662454

sorensenjs closed this Mar 4, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Introduce Kendall's Tau computation. #1147

Introduce Kendall's Tau computation. #1147

sorensenjs commented Oct 29, 2020

davmre left a comment

davmre Oct 29, 2020

sorensenjs Nov 2, 2020

davmre Oct 29, 2020

davmre Oct 29, 2020

sorensenjs commented Mar 4, 2021

bhack commented Mar 4, 2021

sorensenjs commented Mar 4, 2021

Introduce Kendall's Tau computation. #1147

Introduce Kendall's Tau computation. #1147

Conversation

sorensenjs commented Oct 29, 2020

davmre left a comment

Choose a reason for hiding this comment

davmre Oct 29, 2020

Choose a reason for hiding this comment

sorensenjs Nov 2, 2020

Choose a reason for hiding this comment

davmre Oct 29, 2020

Choose a reason for hiding this comment

davmre Oct 29, 2020

Choose a reason for hiding this comment

sorensenjs commented Mar 4, 2021

bhack commented Mar 4, 2021

sorensenjs commented Mar 4, 2021