attempt to substantially reduce kernel launch overhead #3638

Merged
merged 8 commits into from
Apr 12, 2024


@apgoucher (Collaborator) commented Apr 12, 2024

This reduces kernel launch latency by 2.2x (from 108us to 49us, measured with @bertmaher's benchmarking script in issue #3619). Thanks also to @liboyue for the analysis and suggestions.

See the discussion in the third-party PR #3503 (comment)
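
For readers without the diff at hand: the branch name (`memoize_signature_bind`) suggests the saving comes from not redoing signature binding on every launch. The following is only an illustrative sketch of that general idea, not the code in this PR. `make_binder` and the `kernel` example are hypothetical names, and the sketch assumes a function with plain positional-or-keyword parameters. It does the `inspect` work once and generates a small function that returns the arguments in declaration order.

```python
import inspect


def make_binder(fn):
    """Build, once, a cheap replacement for inspect.signature(fn).bind(...).

    Hypothetical helper: assumes `fn` only has positional-or-keyword
    parameters (no *args, **kwargs, or keyword-only parameters).
    """
    sig = inspect.signature(fn)
    params = list(sig.parameters.values())

    namespace = {}
    decls = []
    for i, p in enumerate(params):
        if p.default is inspect.Parameter.empty:
            decls.append(p.name)
        else:
            # Store the default object in the generated function's globals
            # so that arbitrary (non-literal) defaults are supported.
            namespace[f"_default_{i}"] = p.default
            decls.append(f"{p.name}=_default_{i}")

    # Trailing commas keep the tuple valid for zero, one, or many parameters.
    result = "(" + "".join(f"{p.name}, " for p in params) + ")"
    source = f"def _bind({', '.join(decls)}):\n    return {result}\n"
    exec(source, namespace)
    return namespace["_bind"]


def kernel(x, y, BLOCK=128):
    """Stand-in for a launched function; the body is irrelevant here."""


# Done once per function; every later call is an ordinary Python call.
bind = make_binder(kernel)
bound = bind(1, y=2)
```

The generated `_bind` lets the interpreter's own argument handling resolve positional arguments, keywords, and defaults, which is typically much cheaper than building a `BoundArguments` object on each call.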

@ThomasRaoux (Collaborator) left a comment

awesome! If you have some data points on the overhead before and after the PR, could you include those in the description? That is useful so that others can reproduce the results, and it is always convenient to keep track of it.

@apgoucher (Collaborator, Author) commented

> awesome! If you have some data points on the overhead before and after the PR, could you include those in the description? That is useful so that others can reproduce the results, and it is always convenient to keep track of it.

Using @bertmaher's benchmarking script I get a 2.2x speedup with these changes (108us --> 49us). I'll add this to the PR description so that it goes in the commit message.
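
For anyone wanting to reproduce a number of this kind without the script from the issue, here is a minimal sketch of how host-side per-call overhead can be timed. It is not @bertmaher's script; `mean_call_latency_us` and `noop` are hypothetical names, and it only measures the time the Python call takes to return, which for an asynchronous GPU launch is the launch overhead rather than the kernel's run time.

```python
import time


def mean_call_latency_us(fn, *args, warmup=100, iters=10_000):
    """Average wall-clock time per call of fn(*args), in microseconds."""
    # Warm-up calls keep one-time costs (compilation, cache fills)
    # out of the measurement.
    for _ in range(warmup):
        fn(*args)

    start = time.perf_counter()
    for _ in range(iters):
        fn(*args)
    elapsed = time.perf_counter() - start

    return elapsed / iters * 1e6


def noop():
    """Stand-in for a kernel launch."""


latency = mean_call_latency_us(noop, iters=1_000)
speedup = 108 / 49  # before / after figures quoted above
```

Averaging over many calls matters here because a single call at this scale is close to the resolution and jitter of the timer.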

@apgoucher apgoucher merged commit e1d0fea into main Apr 12, 2024
5 checks passed
@apgoucher apgoucher deleted the memoize_signature_bind branch April 12, 2024 15:12