
Invalid behavior for ops.take with torch #18449

Open
fchollet opened this issue Jul 2, 2023 · 17 comments

Comments

@fchollet
Member

fchollet commented Jul 2, 2023

It appears that ops.take(embed_matrix, ids) behaves differently in torch compared to the other two backends.

@asingh9530
Contributor

@fchollet if this issue is still valid, can I pick it up? Also, could you share sample code that reproduces the unexpected behaviour?

@fchollet
Member Author

Yes, it's still valid -- you're welcome to take it on!

You can start by writing a unit test for it -- if I recall correctly, the issue has to do with using take() with multi-dimensional IDs: for example, a 2D embed_matrix with 2D or 3D (batched) IDs.
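As a reference for the expected semantics (a sketch using NumPy, which the other backends follow; shapes here are hypothetical, chosen for illustration), take with a 2D matrix and 2D batched IDs along axis=0 gathers rows, with the ID dimensions replacing axis 0 in the output shape:

```python
import numpy as np

# Hypothetical shapes for illustration: a (vocab=4, dim=3)
# embedding matrix and a 2D batch of IDs.
embed_matrix = np.arange(12.0).reshape(4, 3)
ids = np.array([[0, 2], [3, 1]])

out = np.take(embed_matrix, ids, axis=0)
print(out.shape)  # (2, 2, 3): the ID dims replace axis 0
```

A torch implementation that matches this for any ID rank is what the unit test should pin down.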

@asingh9530
Contributor

@fchollet got it.

@asingh9530
Contributor

asingh9530 commented Jul 13, 2023

@fchollet are we open to writing test cases in pytest, or only using unittest? Also, the test cases should go here, right?

@ariG23498
Contributor

> @fchollet are we open to write test cases in pytest or only using unittest ? also we will be writing test cases here only right ?

Hey @asingh9530, this is where the ops.take comes from. You would need to make the necessary changes here! And as for the tests, you would need to make changes to this file.

Hope that helps 🤗

@asingh9530
Contributor

@ariG23498 I think that is just a wrapper, since it ends up calling backend.take, and the problem occurs in torch's take. I think this is the correct place to make changes -- correct me if I am wrong 🤔 Also, thanks for pointing out the exact test file.

@ariG23498
Contributor

Yes! You are correct, sorry for pointing to the wrapper and not the actual backend file!

@asingh9530
Contributor

@ariG23498 I think a test case has already been written (lines 191 to 204), so what remains is fixing the bug with backend == 'torch':

```python
def test_take(self):
    x = KerasTensor([None, 3])
    self.assertEqual(knp.take(x, 1).shape, ())
    self.assertEqual(knp.take(x, [1, 2]).shape, (2,))
    self.assertEqual(
        knp.take(x, [[1, 2], [1, 2]], axis=1).shape, (None, 2, 2)
    )

    x = KerasTensor([None, 3, 3])
    self.assertEqual(knp.take(x, 1, axis=1).shape, (None, 3))
    self.assertEqual(knp.take(x, [1, 2]).shape, (2,))
    self.assertEqual(
        knp.take(x, [[1, 2], [1, 2]], axis=1).shape, (None, 2, 2, 3)
    )
```

@ariG23498
Contributor

> You can start by writing a unit test for it -- if I recall correctly the issue has to do with using take() with multi-dimensional IDs. So like, a 2D embed_matrix and 2D or 3D (batched) IDs?

I think you would also need to add this test, as @fchollet has advised. This would help make the take() op more robust across the backends. Once you see it fail for torch (hopefully), you can move on to resolving the issue.

Does that help?

@asingh9530
Contributor

@ariG23498 yes, that is what I am planning. Thanks.

@asingh9530
Contributor

As per my understanding, the invalid behaviour of take in the torch backend comes from the following logic, defined here starting at line 749:

```python
if x.ndim == 2 and (axis is None or axis == 0):
    # This case is equivalent to embedding lookup.
    return torch.nn.functional.embedding(indices, x)
```

The core issue is that this branch treats every such call as a lookup of 1-D embedding vectors in a 2-D embedding matrix for the given indices, which makes it impossible to run, say, the following:

```python
tensor = [[4, 3, 5], [6, 7, 8]]
indices = [0, 2, 5]
axis = None
```

which should output [4, 5, 8] (the behaviour of torch.take(), which indexes into the flattened tensor), but as the logic stands it gets treated as an embedding lookup and fails.
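For comparison, NumPy's take with axis=None indexes into the flattened array, which is exactly the behaviour described above:

```python
import numpy as np

tensor = np.array([[4, 3, 5], [6, 7, 8]])
indices = [0, 2, 5]

# axis=None flattens first: [4, 3, 5, 6, 7, 8][[0, 2, 5]] -> [4, 5, 8]
result = np.take(tensor, indices)
print(result)  # [4 5 8]
```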

Possible solution 1:
Add a lookup flag as a parameter in the function call; but since this would only be applicable to the torch backend, it might not be optimal.

Possible solution 2:
Make lookup a separate operation rather than part of take; but I believe this would require rewriting many other components, so I am not sure how feasible it is.

@fchollet @ariG23498 let me know your thoughts on this 🙂.
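Another direction would be to branch on axis rather than only on x.ndim, so the embedding path is never taken when axis is None. A rough NumPy sketch of these reference semantics (take_reference is a hypothetical name, not the Keras implementation):

```python
import numpy as np

def take_reference(x, indices, axis=None):
    # Hypothetical sketch of NumPy-style take semantics, which the
    # torch backend should match.
    x = np.asarray(x)
    indices = np.asarray(indices)
    if axis is None:
        # axis=None indexes into the flattened tensor -- never an
        # embedding lookup.
        return x.reshape(-1)[indices]
    # With an explicit axis, gather along it; for a 2D x and axis=0
    # this coincides with an embedding lookup.
    return np.take(x, indices, axis=axis)

print(take_reference([[4, 3, 5], [6, 7, 8]], [0, 2, 5]))  # [4 5 8]
```

Under these semantics the failing example above works, while the embedding-lookup case (2D x, explicit axis=0) is unchanged.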

@asingh9530
Contributor

@fchollet Can I proceed to make the change and raise a PR?

@asingh9530
Contributor

@fchollet If we would like the behaviour of ops.take() with the torch backend to be equivalent to numpy.take(), then I think we are still fine for 2-D, 3-D, or any multi-dimensional indices. But if we want it to replicate the behaviour of only torch.take() + torch.index_select(), then an added lookup variable would be required, as mentioned above.

@ariG23498 If there is a specific case you encountered where ops.take() does not behave like numpy.take(), please share it -- I have not been able to reproduce any set of inputs where its behaviour differs from numpy.take().

@fchollet
Member Author

@asingh9530 absolutely, please open a PR! Numpy behavior should indeed be our reference.

@fchollet fchollet transferred this issue from keras-team/keras-core Sep 22, 2023
@sachinprasadhs sachinprasadhs self-assigned this Apr 12, 2024
@sachinprasadhs
Collaborator

I tried the code below with the torch backend; it now gives the same result as TensorFlow and JAX.

```python
import os
os.environ["KERAS_BACKEND"] = "torch"
import numpy as np
import keras

a = np.array([[0.0, 1.0, 2.0],
              [10.0, 11.0, 12.0],
              [20.0, 21.0, 22.0],
              [30.0, 31.0, 32.0]])
print(keras.ops.take(a, [3, 1], axis=0))
```

```
tensor([[30., 31., 32.],
        [10., 11., 12.]], dtype=torch.float64)
```

Only for out-of-range indices do the backends still differ: torch throws an IndexError, JAX handles it with nan values, and TensorFlow returns the values at the last index of the axis.
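For reference, NumPy (the agreed behaviour baseline) also raises on out-of-range indices by default, though its mode parameter can opt into clipping or wrapping instead -- a small sketch:

```python
import numpy as np

a = np.array([[0.0, 1.0, 2.0],
              [10.0, 11.0, 12.0]])

# Default mode='raise': out-of-range indices raise IndexError,
# matching the torch behaviour described above.
try:
    np.take(a, [5], axis=0)
    raised = False
except IndexError:
    raised = True
print("raised:", raised)

# mode='clip' clamps to the last valid index, similar to the
# TensorFlow behaviour described above.
clipped = np.take(a, [5], axis=0, mode="clip")
print(clipped)  # [[10. 11. 12.]]
```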


This issue is stale because it has been open for 14 days with no activity. It will be closed if no further activity occurs. Thank you.

@github-actions github-actions bot added the stale label Apr 27, 2024
@innat-asj

innat-asj commented Apr 27, 2024

@fchollet
FYI, #19238 (comment)
