[cuda] Reduce kernel profiler memory usage #5623

Merged: 3 commits, Aug 4, 2022

@qiao-bo (Collaborator) commented Aug 4, 2022

Related issue = #5527

As described in the issue, the kernel profiler uses a substantial amount of RAM while logging execution: memory grows by roughly 0.15 GB/s, which is not acceptable. The root cause is that the CUDA events used to record timings are not destroyed until the last update. This PR fixes that by moving the event destruction to immediately after kernel execution. Tested locally: with this change, the example from the issue no longer increases RAM usage.
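To illustrate why the timing of event destruction matters, here is a minimal sketch that contrasts the two lifetimes. It is not the Taichi profiler code and does not call the CUDA API: `EventPool`, `peak_live_events_deferred`, and `peak_live_events_immediate` are hypothetical names, and the pool only counts live handles so that the example runs without a GPU.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <utility>
#include <vector>

// Hypothetical stand-in for a GPU event allocator. It only tracks how many
// event handles are alive; no real CUDA calls are made.
struct EventPool {
  std::size_t live = 0;  // handles currently allocated
  std::size_t peak = 0;  // highest number of simultaneously live handles
  int next_id = 0;

  int create() {
    ++live;
    peak = std::max(peak, live);
    return next_id++;
  }
  void destroy(int /*handle*/) { --live; }
};

// Deferred pattern: the start/stop events of every launch are kept alive
// until a final update pass, so live handles grow with the launch count.
std::size_t peak_live_events_deferred(int launches) {
  EventPool pool;
  std::vector<std::pair<int, int>> pending;
  for (int i = 0; i < launches; ++i) {
    int start = pool.create();
    int stop = pool.create();
    // ... the kernel would run between the two events ...
    pending.emplace_back(start, stop);
  }
  // Final update: timings would be read here, then the events are freed.
  for (const auto &p : pending) {
    pool.destroy(p.first);
    pool.destroy(p.second);
  }
  assert(pool.live == 0);
  return pool.peak;
}

// Immediate pattern: keep only the measured time and free both events
// right after each launch, so live handles stay constant.
std::size_t peak_live_events_immediate(int launches) {
  EventPool pool;
  std::vector<float> timings_ms;
  for (int i = 0; i < launches; ++i) {
    int start = pool.create();
    int stop = pool.create();
    // ... the kernel would run between the two events ...
    // Placeholder value: real code would have to wait for the stop event
    // to complete before querying the elapsed time.
    float elapsed_ms = 0.0f;
    timings_ms.push_back(elapsed_ms);
    pool.destroy(start);
    pool.destroy(stop);
  }
  assert(pool.live == 0);
  return pool.peak;
}
```

In this model, the deferred variant peaks at two live events per launch, while the immediate variant never holds more than two at a time. The trade-off, under the assumptions above, is that each launch must wait for its stop event before the handles can be released.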

netlify bot commented Aug 4, 2022

Deploy Preview for docsite-preview canceled.

🔨 Latest commit: cb95eaf
🔍 Latest deploy log: https://app.netlify.com/sites/docsite-preview/deploys/62eb335691c4990009c40872

@ailzhang (Contributor) left a comment

Thanks a ton for fixing this!

@ailzhang merged commit b3ef6f2 into taichi-dev:master on Aug 4, 2022
@qiao-bo deleted the profiler_memory branch on August 4, 2022 08:09