Backprop Test for Freeze FlaxWav2Vec2 Feature Encoder #15938

Merged
merged 3 commits into huggingface:master from flax-wav2vec2 on Mar 7, 2022

Conversation

sanchit-gandhi
Contributor

This PR correctly implements a backpropagation test to verify the functionality of the freeze_feature_encoder argument added to the FlaxWav2Vec2 model in #15873. It tests (a rough sketch of the comparison follows the list):

  1. That the computed loss for the frozen-feature-encoder model and the unfrozen model are equal.
  2. That the gradients of the frozen feature encoder differ from those of the unfrozen feature encoder.
  3. That the gradients of all other (unfrozen) layers remain equal.
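
Roughly, the comparison works like the sketch below. This is illustrative only, not the test's exact code: the tiny config, the dummy inputs, and the compute_loss helper are assumptions made for the example.

import jax
import jax.numpy as jnp
from flax.traverse_util import flatten_dict
from transformers import FlaxWav2Vec2Model, Wav2Vec2Config

# small randomly-initialised model and dummy inputs, purely for illustration
config = Wav2Vec2Config(hidden_size=32, num_hidden_layers=2, num_attention_heads=2, intermediate_size=64)
model = FlaxWav2Vec2Model(config)
input_values = jnp.ones((1, 1024), dtype=jnp.float32)

# illustrative scalar loss over the model outputs
def compute_loss(params, freeze_feature_encoder=False):
    outputs = model(input_values, params=params, freeze_feature_encoder=freeze_feature_encoder)
    return jnp.sum(outputs.last_hidden_state ** 2)

# forward + backward pass with the feature encoder unfrozen and frozen
loss, grads = jax.value_and_grad(compute_loss)(model.params)
loss_frozen, grads_frozen = jax.value_and_grad(compute_loss)(model.params, freeze_feature_encoder=True)

# 1. freezing only affects the backward pass, so the losses should be equal
assert jnp.allclose(loss, loss_frozen)

grads = flatten_dict(grads, sep="/")
grads_frozen = flatten_dict(grads_frozen, sep="/")

for name in grads:
    if "feature_extractor" in name:
        # 2. feature-encoder gradients should differ once frozen
        assert not jnp.allclose(grads[name], grads_frozen[name])
    else:
        # 3. all other gradients should be unaffected by freezing
        assert jnp.allclose(grads[name], grads_frozen[name])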

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint.

Contributor

patrickvonplaten left a comment


Great! Very nice test, and good to see that no modeling code had to be changed.

@sanchit-gandhi
Contributor Author

If @patil-suraj is happy with this test I'll merge!

Contributor

patil-suraj left a comment


Really nice tests, great job! Just left a comment.

Comment on lines 276 to 283
# ensure that the gradients of the frozen layers differ, i.e. that the feature encoder is properly frozen
feature_extractor_grads = tuple(grads[k] for k in grads if "feature_extractor" in k)
feature_extractor_grads_frozen = tuple(grads_frozen[k] for k in grads_frozen if "feature_extractor" in k)

for feature_extractor_grad, feature_extractor_grad_frozen in zip(
    feature_extractor_grads, feature_extractor_grads_frozen
):
    self.assert_difference(feature_extractor_grad, feature_extractor_grad_frozen, 1e-7)
Contributor


Could we also add one more check to verify that the grads of the frozen module are all precisely zero, since that's what jax.lax.stop_gradient is supposed to do?
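
For reference, a toy example (not from this PR) showing that jax.lax.stop_gradient yields gradients that are exactly zero rather than merely small:

import jax
import jax.numpy as jnp

def loss(w):
    # stand-in for the frozen feature encoder: everything behind
    # stop_gradient is cut out of the backward pass
    features = jax.lax.stop_gradient(w * 2.0)
    return jnp.sum(features ** 2)

print(jax.grad(loss)(jnp.ones(3)))  # [0. 0. 0.] -- exactly zero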

Contributor Author


Thank you for the feedback! Sure thing, I'll look into that now!

Contributor Author


The most recent commit (ca918e9) adds an assertion that verifies that the gradients of the frozen feature encoder layers are precisely zero!
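
For illustration, such a check might look roughly like the following, assuming grads_frozen is the flattened gradient dict from the frozen backward pass (the actual assertion lives in ca918e9):

for name, grad in grads_frozen.items():
    if "feature_extractor" in name:
        # gradients of the frozen feature encoder must be exactly zero
        self.assertTrue((grad == 0.0).all(), f"gradient of {name} is not exactly zero")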

sanchit-gandhi merged commit 1a62b25 into huggingface:master on Mar 7, 2022
sanchit-gandhi deleted the flax-wav2vec2 branch on March 8, 2022