
Adds optional scale and bias to cudnn's layernorm #234

Closed
wants to merge 3 commits into from

Conversation

vedaanta
Collaborator

Before submitting
  • Was this discussed/approved via a GitHub issue? (not needed for typos and docs improvements)
  • Did you read the contributor guidelines, Pull Request section?
  • Did you make sure to update the docs?
  • Did you write any new necessary tests?

What does this PR do?

This PR adds more functionality to cuDNN's layernorm:

  1. Adds optional bias and scale (see the sketch after this list)
  2. Updates requirements to cuDNN 9.1
  3. Removes the older way of caching the full graph-building function
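
For context on item 1, here is a minimal sketch of what "optional scale and bias" means for a layernorm call. This is plain PyTorch for illustration only, not this PR's cuDNN graph code, and the helper name is hypothetical; `torch.nn.functional.layer_norm` already accepts `weight=None` and `bias=None`, which is the behavior the cuDNN executor now has to match.

```python
import torch
import torch.nn.functional as F

# Hypothetical helper, for illustration only (not from this PR):
# a layernorm whose affine scale/bias are optional, mirroring what
# the cuDNN executor must now support.
def layernorm_optional_affine(x, normalized_shape, weight=None, bias=None, eps=1e-5):
    return F.layer_norm(x, normalized_shape, weight=weight, bias=bias, eps=eps)

x = torch.randn(2, 8, 16)
y_plain = layernorm_optional_affine(x, (16,))          # no scale/bias
w, b = torch.ones(16), torch.zeros(16)
y_affine = layernorm_optional_affine(x, (16,), w, b)   # with scale/bias
```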

Future:

  1. Add benchmarks for LN fwd with litgpt configs
  2. Add LN bwd
  3. Add RMSNorm

PR review

Anyone in the community is free to review the PR once the tests have passed.
If we didn't discuss your PR in GitHub issues, there's a high chance it will not be merged.

@vedaanta vedaanta marked this pull request as draft April 19, 2024 00:45
thunder/executors/cudnn_layernormex.py (outdated; resolved)
Comment on lines +181 to +182:

    except Exception as e:
        raise

Collaborator

When would we hit this? I'm not sure if `raise` is appropriate here at a glance.

Collaborator Author

If cuDNN is not able to support the graph, this checker should return False. Other errors thrown will almost certainly be user errors or gaps in cuDNN support. Earlier, cudnnex just returned False for those other errors too, but now I want it to raise them instead.

For example, a couple of thunder test cases currently fail due to a bug in the latest cuDNN. A workaround for now is to explicitly block that failing layernorm config here in cudnnex. If the checker just returned False, it would have been hard to uncover this bug.

So these raises make sure that both user code errors and bugs in cuDNN get propagated to the user.
(@wujingyue suggested this workflow when working on cuDNN's sdpa)
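
A self-contained sketch of the checker pattern described here. All names below are hypothetical stand-ins; the real exception types and graph construction live in thunder/executors/cudnn_layernormex.py and the cuDNN frontend.

```python
class CudnnNotSupportedError(RuntimeError):
    """Hypothetical stand-in for the error cuDNN raises on unsupported graphs."""

def _build_graph(config):
    # Hypothetical stand-in for the cuDNN graph construction in this PR.
    if config == "unsupported":
        raise CudnnNotSupportedError("cuDNN cannot express this layernorm config")
    if config == "buggy":
        raise RuntimeError("internal cuDNN error")  # the kind of bug we want surfaced

def _checker(config) -> bool:
    try:
        _build_graph(config)
    except CudnnNotSupportedError:
        return False  # clean fallback: let another executor claim the op
    # Any other exception propagates, so user errors and cuDNN bugs stay
    # visible instead of being swallowed by a blanket `return False`.
    return True

assert _checker("ok") is True
assert _checker("unsupported") is False
# _checker("buggy") raises RuntimeError, which is the point of the change.
```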

Co-authored-by: Masaki Kozuki <mkozuki@nvidia.com>
@lantiga
Collaborator

lantiga commented Jul 3, 2024

Tagging older draft PRs as "later"; feel free to reopen if this becomes active again.

@lantiga lantiga closed this Jul 3, 2024
@t-vi t-vi deleted the cudnn/norm branch July 16, 2024 12:48