Allocate dQ, dK, and dV as a catted tensor to save a downstream cat in nvFuser. #59

Merged on Mar 27, 2024 (15 commits)

wujingyue (Collaborator) commented on Mar 23, 2024

For NVIDIA/Fuser#1768.

The description on the added compile option explains what this optimization does.

This optimization is disabled by default for now. Once #35 is merged and bookend is disabled by default, I'll try to enable it by default, or even unconditionally.
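
For readers without the compile option's description at hand, the idea in the PR title can be sketched roughly as follows. This is a minimal standard-library illustration, not the PR's code: the helper name `alloc_catted_grads` and the flat element counts are assumptions made for the example, and plain float buffers stand in for GPU tensors. It allocates one buffer for dQ, dK, and dV and hands out three views into it, so the "concatenated" gradient already exists once the views are filled.

```python
from array import array


def alloc_catted_grads(n_q, n_k, n_v):
    """Allocate one float buffer and return it with three writable views.

    Hypothetical helper for illustration; the sizes are element counts.
    """
    total = n_q + n_k + n_v
    buf = array("f", [0.0]) * total   # single allocation holding dQ|dK|dV
    view = memoryview(buf)
    d_q = view[:n_q]                  # each view shares memory with buf
    d_k = view[n_q:n_q + n_k]
    d_v = view[n_q + n_k:]
    return buf, d_q, d_k, d_v


# A backward kernel would write each gradient into its own view.
buf, d_q, d_k, d_v = alloc_catted_grads(2, 2, 2)
d_q[:] = array("f", [1.0, 2.0])
d_k[:] = array("f", [3.0, 4.0])
d_v[:] = array("f", [5.0, 6.0])

# The concatenated result is the original buffer itself; no copy is made.
catted = buf
```

The saving comes from replacing an allocate-and-copy concatenation with views into memory that was laid out contiguously up front. Whether a downstream consumer can actually skip its cat depends on it recognizing that layout.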

Base automatically changed from wjy/exception to main March 23, 2024 12:31
@wujingyue wujingyue changed the base branch from main to wjy/clean March 23, 2024 23:42
Base automatically changed from wjy/clean to main March 24, 2024 13:07
jjsjann123 (Collaborator) left a comment

👏

thunder/tests/test_cudnn_executor.py (review thread resolved)
t-vi (Collaborator) left a comment

Per @jjsjann123's review. Thank you @wujingyue @jjsjann123.

wujingyue (Collaborator, Author) commented

Hey @vedaanta-nvidia, are you interested in reviewing this before I merge?

@wujingyue added the cudnn and enhancement labels on Mar 26, 2024
wujingyue (Collaborator, Author) commented

> Hey @vedaanta-nvidia, are you interested in reviewing this before I merge?

Hey Vedaanta, do you have more comments before I merge?

thunder/executors/cudnnex.py (2 review threads resolved)
@wujingyue wujingyue merged commit 483c352 into main Mar 27, 2024
37 checks passed
@wujingyue wujingyue deleted the wjy/format branch March 27, 2024 20:05