
Tagger: use unnormalized probabilities for inference #10197

Merged: 8 commits into explosion:master on Mar 15, 2022

Conversation

danieldk (Contributor) commented on Feb 3, 2022

Description

Using an unnormalized softmax avoids the relatively expensive exp function,
which can significantly speed up non-transformer models (e.g. I got a 27%
speedup on a German tagging + parsing pipeline).
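
For intuition (this is not code from the PR): softmax only rescales the raw scores monotonically, so the per-token argmax used at inference is identical with and without normalization; skipping normalization simply saves the exp and the division. A minimal NumPy sketch:

```python
import numpy as np

# Minimal sketch, not the PR's implementation: softmax preserves the ordering
# of the raw scores, so picking the best tag per token gives the same result
# whether or not the scores are normalized.
def softmax(scores):
    shifted = scores - scores.max(axis=-1, keepdims=True)  # numerical stability
    exps = np.exp(shifted)
    return exps / exps.sum(axis=-1, keepdims=True)

raw = np.random.randn(4, 17)  # 4 tokens, 17 hypothetical tag classes
assert (raw.argmax(axis=-1) == softmax(raw).argmax(axis=-1)).all()
```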

Types of change

Performance improvement

Checklist

  • I confirm that I have the right to submit this contribution under the project's MIT license.
  • I ran the tests, and all new and existing tests passed.
  • My changes don't require a change to the documentation, or if they do, I've added all required information.

Draft since this requires explosion/thinc#583 and a new Thinc version.

danieldk added the enhancement (Feature requests and improvements), feat / tagger (Feature: Part-of-speech tagger) and perf / speed (Performance: speed) labels and removed the enhancement label on Feb 3, 2022
adrianeboyd (Contributor) commented on Feb 3, 2022

There are users who use the scores directly and (I assume) would be expecting them to be normalized. I'm not saying we necessarily shouldn't do this, but this proposal doesn't give these users any way to control this behavior, does it?

danieldk (Contributor, Author) commented on Feb 3, 2022

> There are users who use the scores directly and (I assume) would be expecting them to be normalized. I'm not saying we necessarily shouldn't do this, but this proposal doesn't give these users any way to control this behavior, does it?

We could make this configurable. Maybe we should also only provide this functionality in a Tagger.v2, so that if anyone uses the probabilities it doesn't break for them (and they could opt in to normalized probabilities with Tagger.v2)?
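
A hedged sketch of what that opt-in could look like (the spacy.Tagger.v2 name comes from this discussion; the normalize flag name and the surrounding config are assumptions rather than code from this PR, and this would require a spaCy version that actually ships Tagger.v2):

```python
import spacy

# Hypothetical opt-in sketch: a Tagger.v2 architecture exposing a "normalize"
# switch (flag name assumed from the discussion). v2 would default to the
# faster, unnormalized scores; users who rely on probabilities opt back in.
nlp = spacy.blank("de")
nlp.add_pipe(
    "tagger",
    config={
        "model": {
            "@architectures": "spacy.Tagger.v2",
            "normalize": True,  # opt back in to normalized probabilities
            "tok2vec": {
                "@architectures": "spacy.HashEmbedCNN.v2",
                "pretrained_vectors": None,
                "width": 96,
                "depth": 4,
                "embed_size": 2000,
                "window_size": 1,
                "maxout_pieces": 3,
                "subword_features": True,
            },
        }
    },
)
```

Pipelines that stay on Tagger.v1 would keep today's normalized probabilities unchanged.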

adrianeboyd (Contributor) commented:

That sounds like a reasonable proposal.

svlandeg (Member) left a review comment:

Agreed that it's a good idea to make this configurable in a new version of the Tagger.

I find it slightly unintuitive to go from a v1 that always normalizes to a v2 with normalization off by default. It might still catch some users off guard that the behaviour changes between the two versions when they rely on defaults. But then again, it is documented in the docs, and I can see why this would be beneficial in most cases / for most users. So I'm leaning towards keeping it like the PR has it currently :-)

Review thread on website/docs/api/architectures.md (outdated, resolved)
svlandeg added the v3.3 (Related to v3.3) label on Feb 16, 2022
svlandeg changed the base branch from develop to master on Feb 16, 2022
danieldk marked this pull request as ready for review on Mar 14, 2022
adrianeboyd merged commit e5debc6 into explosion:master on Mar 15, 2022
Labels: feat / tagger (Feature: Part-of-speech tagger), perf / speed (Performance: speed), v3.3 (Related to v3.3)