add setter for EinsumDense.kernel #19469

Closed
wants to merge 3 commits

Conversation

haifeng-jin
Contributor

No description provided.

codecov-commenter commented Apr 10, 2024

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 76.14%. Comparing base (8961e3f) to head (879e132).
Report is 5 commits behind head on master.

Additional details and impacted files
@@            Coverage Diff             @@
##           master   #19469      +/-   ##
==========================================
+ Coverage   76.08%   76.14%   +0.06%     
==========================================
  Files         367      367              
  Lines       41055    41051       -4     
  Branches     8014     8010       -4     
==========================================
+ Hits        31235    31259      +24     
+ Misses       8103     8082      -21     
+ Partials     1717     1710       -7     
Flag Coverage Δ
keras 76.00% <100.00%> (+0.06%) ⬆️
keras-jax 60.27% <100.00%> (+0.07%) ⬆️
keras-numpy 54.23% <100.00%> (+0.11%) ⬆️
keras-tensorflow 61.47% <100.00%> (+0.06%) ⬆️
keras-torch 60.36% <100.00%> (+0.06%) ⬆️

Flags with carried forward coverage won't be shown.

☔ View full report in Codecov by Sentry.

Member

@fchollet left a comment

Thanks for the PR!

@@ -197,6 +197,10 @@ def kernel(self):
            )
        return self._kernel

    @kernel.setter
    def kernel(self, value):
        self._kernel.assign(value)
Member

Is this the intended behavior? This is a value assignment, but would users expect to set the actual variable instead?

Also, should we disable this when LoRA is enabled?

Contributor

I think this should be compatible with LoRA:

    @kernel.setter
    def kernel(self, value):
        self._kernel.assign(value)
        if self.lora_enabled:
            self.lora_kernel_a.assign(ops.zeros(self.lora_kernel_a.shape))
            self.lora_kernel_b.assign(ops.zeros(self.lora_kernel_b.shape))

just like load_own_variables does.

Contributor Author

> This is a value assignment, but would users expect to set the actual variable instead?

Yes, you are right. I changed it to set the actual variable.
The use case I saw was that users try to normalize the kernel during training every time the layer is called.
I think this use case is OK, but it would be better to set the variable.

For LoRA, it would error out even before reaching this setter function. See the new test I added for testing the error.
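For context, a minimal sketch of that kind of use case (hypothetical subclass, not part of this PR), assuming the assign-style setter from this revision and eager execution:

    from keras import layers, ops


    class NormalizedEinsumDense(layers.EinsumDense):
        """Hypothetical example: rescale the kernel to unit norm on every call."""

        def call(self, inputs):
            # With an assign-style setter, this writes the rescaled values
            # back into the existing kernel variable.
            norm = ops.sqrt(ops.sum(ops.square(self.kernel))) + 1e-7
            self.kernel = ops.divide(self.kernel, norm)
            return super().call(inputs)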

Member

@fchollet left a comment

It isn't clear to me what the use case is in the first place. Based on the use case, we can determine whether direct assignment or assign() is the right behavior.

  1. What do you want to assign? A tensor? An existing Variable?
  2. What tracking behavior do you want? Should layer.kernel still be a trainable variable?
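For reference, a small sketch of the difference those two questions point at (equation and shapes are arbitrary):

    from keras import layers, ops

    layer = layers.EinsumDense("ab,bc->ac", output_shape=4)
    layer.build((2, 3))

    # Option 1: value assignment. The existing variable keeps its identity,
    # stays tracked, and stays trainable; only its contents change.
    layer.kernel.assign(ops.ones(layer.kernel.shape))

    # Option 2: replacing the attribute. `layer.kernel + 1.0` is a plain
    # tensor, not a Variable, so a direct `layer.kernel = ...` would leave
    # the layer without a trainable kernel unless the setter wraps or
    # rejects such values.
    new_value = layer.kernel + 1.0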

@@ -197,6 +197,10 @@ def kernel(self):
            )
        return self._kernel

    @kernel.setter
    def kernel(self, value):
        self._kernel = value
Member

If we're going to do this, should we typecheck kernel to make sure it's a Variable? Also, we should definitely untrack the previously tracked _kernel variable; otherwise it will still be listed in weights.

        )
        layer.build(input_shape)
        kernel = layer.kernel
        layer.kernel = kernel + 1.0
Member

This will set the kernel to a tensor (it won't be trainable anymore). Is that intended?

        bias_axes = "de"
        input_shape = (2, 1, 2)
        output_shape = (3, 4)
        layer = layers.EinsumDense(
Member

Please add a check on layer.trainable_variables
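For example, a check along these lines (the count assumes the kernel plus the bias built in this test):

    # Hypothetical assertion: after using the setter, the kernel (and bias)
    # should still be tracked as trainable variables.
    self.assertEqual(len(layer.trainable_variables), 2)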

@fchollet
Member

Thinking more about this -- I think we should do it like:

  • Check that it's a KerasVariable
  • Untrack the old value
  • Set the new value directly

We should also extend the setter to Dense and Embedding.
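A rough sketch of that proposal on EinsumDense (and similarly on Dense and Embedding); keras.Variable and the tracker call are assumptions about Keras internals, not confirmed API:

    @kernel.setter
    def kernel(self, value):
        from keras import Variable  # assumed alias for KerasVariable

        if not isinstance(value, Variable):
            raise TypeError(
                "`kernel` must be a `keras.Variable`. "
                f"Received: {value} (of type {type(value)})"
            )
        # Untrack the old variable so it no longer shows up in `weights`;
        # the exact untracking mechanism is an assumption.
        self._tracker.untrack(self._kernel)
        self._kernel = value

As noted further down in the thread, the direct assignment at the end runs into the layer's tracker lock, which is part of why this approach was eventually dropped.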

@haifeng-jin
Contributor Author

Yes, that makes sense.
Basically, we just swap out the old kernel.
For my specific use case, either way works, but to be more general, we should do as you said.

@fchollet
Member

Cool, let's do that!

@haifeng-jin
Contributor Author

I found this affects a lot of things.
It changes the order of the variables, which in turn changes get_weights(), the saving and loading of weights, and the naming of the weights as well.
It is very prone to bugs.

We may just let users extend the EinsumDense class to do that instead of going through the setter.
WDYT? @fchollet
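For example, a user-side subclass along those lines might look like this (hypothetical, reusing the existing kernel property):

    from keras import layers


    class SwappableEinsumDense(layers.EinsumDense):
        """Hypothetical user-side extension instead of a built-in setter."""

        # Reuse the parent getter and add a value-assigning setter
        # (same LoRA caveat as discussed above).
        @layers.EinsumDense.kernel.setter
        def kernel(self, value):
            self._kernel.assign(value)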

@fchollet
Member

The naming/ordering issue is fixable. Another issue is that the layer's tracker would have to be unlocked, and we can't do that in the setter (because setattr gets called first). Let's not do that then!

If assign works for your use case, you can just do layer.kernel.assign().
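For example (layer already built; new_kernel is a hypothetical array matching the kernel shape):

    layer.kernel.assign(new_kernel)  # in-place value update; the variable stays trainable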
