[cuda] Reduce kernel profiler memory usage #5623

qiao-bo · 2022-08-04T02:47:49Z

Related issue = #5527

As described in the issue, kernel profiler takes substantial ram usage to log execution. ~0.15GB/s increase, which is not acceptable. The core reason is that all the cuda events used to log the time are not destroyed until the last update. This PR fixes this by moving the event destroy immediately after kernel execution. Locally tested the example in the issue no longer increase ram usage.

netlify · 2022-08-04T02:47:58Z

✅ Deploy Preview for docsite-preview canceled.

Name	Link
🔨 Latest commit	`cb95eaf`
🔍 Latest deploy log	https://app.netlify.com/sites/docsite-preview/deploys/62eb335691c4990009c40872

ailzhang

Thanks a ton for fixing this!

qiao-bo added 3 commits August 3, 2022 16:38

Destroy event immediately

57c6a78

Fix event return address

35e2e06

Merge branch 'master' into profiler_memory

cb95eaf

qiao-bo requested review from turbo0628 and ailzhang August 4, 2022 02:47

ailzhang approved these changes Aug 4, 2022

View reviewed changes

ailzhang merged commit b3ef6f2 into taichi-dev:master Aug 4, 2022

qiao-bo deleted the profiler_memory branch August 4, 2022 08:09

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[cuda] Reduce kernel profiler memory usage #5623

[cuda] Reduce kernel profiler memory usage #5623

qiao-bo commented Aug 4, 2022

netlify bot commented Aug 4, 2022 •

edited

Loading

ailzhang left a comment

[cuda] Reduce kernel profiler memory usage #5623

[cuda] Reduce kernel profiler memory usage #5623

Conversation

qiao-bo commented Aug 4, 2022

netlify bot commented Aug 4, 2022 • edited Loading

✅ Deploy Preview for docsite-preview canceled.

ailzhang left a comment

Choose a reason for hiding this comment

netlify bot commented Aug 4, 2022 •

edited

Loading