Transformer.predict: do not broadcast to listeners #345

Merged
merged 6 commits into explosion:master on Jan 30, 2023

Conversation

danieldk (Contributor)

The output of a transformer is passed through in two different ways:

- Prediction: the data is passed through the `Doc._.trf_data` attribute (see the sketch after this list).
- Training: the data is broadcast directly to the transformer's listeners.
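At prediction time, downstream components read the transformer output from the `Doc._.trf_data` extension rather than from a broadcast batch. A minimal sketch of that path, assuming a transformer-based pipeline such as `en_core_web_trf` is installed (the model name is only an example):

```python
import spacy

# Load a transformer-based pipeline; en_core_web_trf is an example,
# any pipeline containing a Transformer component would do.
nlp = spacy.load("en_core_web_trf")
doc = nlp("The transformer output travels on the Doc at prediction time.")

# Transformer.set_annotations() stores the output on the Doc, so
# listeners can fall back to it when no batch was broadcast.
print(type(doc._.trf_data))
```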

However, the `Transformer.predict` method breaks this strict separation between training and prediction by also broadcasting transformer outputs to its listeners.

This breaks down when we are training a model with an unfrozen transformer that is also listed in `annotating_components`. The transformer will first (as part of its update step) broadcast the tensors and backprop function to its listeners. Then, when acting as an annotating component, it immediately overrides its own output and clears the backprop function. As a result, gradients will not flow into the transformer.

This change removes the broadcast from the `predict` method. If a listener does not receive a batch, it instead attempts to get the transformer output from the `Doc` instances, as sketched below. This also makes it possible to train a pipeline with a frozen transformer.
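A simplified sketch of that fallback. The class and attribute names (`_batch_id`, `receive`, `get_outputs`) mirror the shape of the real `TransformerListener`, but this is an illustrative reconstruction, not the library's actual implementation:

```python
from typing import List
from spacy.tokens import Doc

class TransformerListenerSketch:
    """Illustrative stand-in for the real TransformerListener."""

    def __init__(self):
        self._batch_id = None   # id of the batch broadcast during update()
        self._outputs = None    # tensors received from the transformer
        self._backprop = None   # callback to propagate gradients upstream

    def receive(self, batch_id, outputs, backprop):
        # Called by Transformer.update() only; after this change,
        # Transformer.predict() no longer calls it.
        self._batch_id = batch_id
        self._outputs = outputs
        self._backprop = backprop

    def get_outputs(self, docs: List[Doc], batch_id):
        if self._batch_id == batch_id and self._outputs is not None:
            # Training path: use the broadcast batch so gradients can flow.
            return self._outputs, self._backprop
        # Prediction path (or frozen transformer): fall back to the
        # annotations stored on the Doc objects by Transformer.predict().
        return [doc._.trf_data for doc in docs], None
```

With `predict` no longer broadcasting, the broadcast path is only taken during `update`, so an annotating transformer can no longer clobber its own tensors and backprop callback.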

This ports explosion/spaCy#11385 to spacy-transformers. Alternative to #342.

@danieldk added the `bug` (Something isn't working) and `feat / pipeline` (Feature: Pipeline components) labels on Aug 31, 2022.
@danieldk marked this pull request as ready for review on January 26, 2023.
@svlandeg merged commit e66c73d into explosion:master on Jan 30, 2023.
@adrianeboyd added a commit to adrianeboyd/spacy-transformers that referenced this pull request on Feb 11, 2023.