Unwrap variable values in all stateless calls. #19287
Conversation
Codecov Report

Attention: Patch coverage is

```
@@            Coverage Diff             @@
##           master    #19287       +/-   ##
============================================
- Coverage    80.14%    54.23%    -25.91%
============================================
  Files          341       365        +24
  Lines        36163     39804      +3641
  Branches      7116      7719       +603
============================================
- Hits         28982     21587      -7395
- Misses        5578     16645     +11067
+ Partials      1603      1572        -31
```
Thanks for the PR!
keras/layers/layer.py (outdated)

```python
non_trainable_mapping = zip(
    self.non_trainable_variables, non_trainable_variables
)
all_variables = map(
    lambda v: v.value if isinstance(v, KerasVariable) else v,
```
For greater generality we should do this in `StatelessScope` I think?

What I don't get is that we already do `v = backend.convert_to_tensor(v, dtype=k.dtype)` in `StatelessScope`. This should grab the value, if I'm not mistaken?
> For greater generality we should do this in `StatelessScope` I think?

Yes, I don't like the current duplication. And it's totally trivial to add.

> What I don't get is that we already do `v = backend.convert_to_tensor(v, dtype=k.dtype)` in `StatelessScope`. This should grab the value, if I'm not mistaken?
Oh, now I fully understand why my change to `tensorflow.core.convert_to_tensor` triggered this.

The issue is that in TensorFlow, we have:

```python
class Variable(
    KerasVariable,
    tf.__internal__.types.Tensor,
    tf.__internal__.tracking.Trackable,
):
```

Because of the `tf.__internal__.types.Tensor` base class, `tf.is_tensor` returns `True` and bypasses the conversion. However, we were lucky enough that `tf.cast` doesn't have a shortcut when the dtypes are identical, so the cast would always happen, at which point the variable was turned into an actual `tf.Tensor`.
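To make the bypass concrete, here is a minimal self-contained sketch (no real TensorFlow) of the dispatch problem described above. `FakeTensorType` and `FakeVariable` are hypothetical stand-ins for `tf.__internal__.types.Tensor` and the Keras TF `Variable`; the real APIs are not used.

```python
class FakeTensorType:
    """Hypothetical stand-in for the tf.__internal__.types.Tensor protocol class."""

class FakeVariable(FakeTensorType):
    """Hypothetical stand-in for Keras's TF Variable, which subclasses the Tensor type."""
    def __init__(self, value):
        self.value = value

def is_tensor(x):
    # Mimics tf.is_tensor: a type check against the Tensor protocol class.
    # A FakeVariable passes this check purely because of its base class.
    return isinstance(x, FakeTensorType)

def convert_to_tensor(x):
    # Mimics the shortcut: anything that "is a tensor" is returned
    # unchanged, so the variable is never unwrapped to its value.
    if is_tensor(x):
        return x
    return float(x)  # placeholder for real conversion

v = FakeVariable(3.0)
out = convert_to_tensor(v)
assert out is v  # the variable leaks through instead of its 3.0 value
```

This is why removing the always-executed cast exposed the bug: the cast was the only step that still turned the variable into a concrete tensor.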
But isn't the issue with JAX specifically? I'm confused.
The issue is only with TensorFlow... So it's not a real use case, but the tests do fail.
Ok, perhaps not fully related, but I did run into exactly this issue with JAX a bit earlier: you cannot call `model.stateless_call` with variables, you need to unwrap them. The nature of the issue is an infinite recursion, which seems to happen at the level of JAX.

Can you just add explicit `if isinstance` logic in `StatelessScope`? That should fix it.
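A hypothetical sketch of the infinite recursion mentioned above: if the stateless mapping sends a variable to a variable (rather than to a plain value), resolving `.value` inside an operation like `__add__` never terminates. `Var` and the dict-based scope below are illustrative only, not the Keras internals.

```python
class Var:
    scope = {}  # stand-in for the active StatelessScope mapping

    def __init__(self, value):
        self._value = value

    @property
    def value(self):
        # Look up the stateless value for this variable, if any.
        return Var.scope.get(id(self), self._value)

    def __add__(self, other):
        left = self.value
        if isinstance(left, Var):
            # The mapped "value" is itself a variable: resolve it again.
            return left + other  # recurses forever if the mapping loops
        return left + other

v = Var(1.0)
Var.scope[id(v)] = v  # BUG: mapping points at a variable, not a value
try:
    v + 1.0
except RecursionError:
    print("infinite recursion, as described in the discussion")

# The fix discussed above: unwrap variables when the mapping is built.
Var.scope[id(v)] = v._value
assert v + 1.0 == 2.0
```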
Yes, doing that, now it's a 1-line change basically. Thanks!
When variables are passed to stateless calls (`stateless_call`, `stateless_update`, `stateless_apply`), unwrap them to extract their value so that the mapping from variable to value in `StatelessScope` points to a value. This is to prevent an infinite recursion when performing operations (e.g. `__add__`) in a stateless scope. Also fix issue where casting a `tf.SparseTensor` would lose the shape. The optimization in `ops.cast` is what revealed the stateless calls bug.
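A minimal sketch (not the actual Keras source) of what "unwrap them to extract their value" means when the scope's mapping is built: any value that is itself a variable is replaced by its underlying value. `KerasVariable` here is a hypothetical minimal stand-in, and `build_state_mapping` is an illustrative helper, not a real Keras function.

```python
class KerasVariable:
    """Hypothetical minimal stand-in for a Keras variable."""
    def __init__(self, value):
        self.value = value

def build_state_mapping(variables, values):
    """Map each variable (by id) to a plain value, never to a variable."""
    mapping = {}
    for var, val in zip(variables, values):
        if isinstance(val, KerasVariable):
            val = val.value  # the one-line unwrap
        mapping[id(var)] = val
    return mapping

w = KerasVariable(0.5)
# Passing the variable itself still yields a plain value in the mapping:
m = build_state_mapping([w], [w])
assert m[id(w)] == 0.5
```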
c512562 to a27cb72
LGTM, thank you!