Fix intermittent CUDA activity flush issue #95

Closed

@gdankel commented Feb 28, 2021

Summary:
We recently observed that no CUDA activities appear in libkineto traces for certain setups.
The issue is that cuptiActivityFlushAll sometimes produces no buffers.
Setting the CUPTI_ACTIVITY_FLAG_FLUSH_FORCED flag appears to give us consistent results.

We're still figuring out which setups are affected.
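
For readers unfamiliar with the flag, here is a minimal, self-contained C++ sketch of the difference between an unforced and a forced flush. It is a toy model, not libkineto or CUPTI code: `ToyBuffer`, `ToyActivityApi`, and `kFlushForced` are hypothetical names, and the numeric value of `kFlushForced` is illustrative. The model assumes the behavior described in the CUPTI documentation, where an unforced flush returns only buffers whose records are all complete and a forced flush returns every buffer. Whether that is what caused the missing activities in this PR has not been established.

```cpp
#include <cstddef>
#include <cstdint>
#include <vector>

// Stand-in for CUPTI_ACTIVITY_FLAG_FLUSH_FORCED. The value is illustrative;
// real code should include <cupti.h> and use the enum from that header.
constexpr uint32_t kFlushForced = 1u;

// Toy activity buffer: a record count plus a marker for unfinished records.
struct ToyBuffer {
  int records;
  bool hasIncompleteRecord;
};

// Toy model of the flush semantics (an assumption based on the CUPTI docs):
// unforced flush delivers only fully completed buffers, forced delivers all.
class ToyActivityApi {
 public:
  void add(ToyBuffer b) { pending_.push_back(b); }

  // Returns the buffers delivered by this flush and drops them from pending.
  std::vector<ToyBuffer> flushAll(uint32_t flag) {
    const bool forced = (flag & kFlushForced) != 0;
    std::vector<ToyBuffer> delivered;
    std::vector<ToyBuffer> kept;
    for (const ToyBuffer& b : pending_) {
      if (forced || !b.hasIncompleteRecord) {
        delivered.push_back(b);
      } else {
        kept.push_back(b);
      }
    }
    pending_ = kept;
    return delivered;
  }

  std::size_t pendingCount() const { return pending_.size(); }

 private:
  std::vector<ToyBuffer> pending_;
};
```

In this model, if every pending buffer still holds an incomplete record, `flushAll(0)` delivers nothing while `flushAll(kFlushForced)` delivers all of them. With the real library the corresponding call would be `cuptiActivityFlushAll(CUPTI_ACTIVITY_FLAG_FLUSH_FORCED)`.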

Differential Revision: D26717258

fbshipit-source-id: ac3f39841de1f64d7800393f2bad5d105897a708
@facebook-github-bot

This pull request was exported from Phabricator. Differential Revision: D26717258

@facebook-github-bot

This pull request has been merged in fa0f2d5.

Kingwl pushed a commit to Kingwl/kineto that referenced this pull request Mar 4, 2021
Summary:
Pull Request resolved: pytorch#95

We recently observed that no CUDA activities appear in libkineto traces for certain setups.
The issue is that cuptiActivityFlushAll sometimes produces no buffers.
Setting the CUPTI_ACTIVITY_FLAG_FLUSH_FORCED flag appears to give us consistent results.

We're still figuring out why it suddenly started happening and why it affects some setups and not others.

Reviewed By: chauhang

Differential Revision: D26717258

fbshipit-source-id: d96df0eccaaa7ef865d0314eca6dc43c4ad1d648
liuziyue pushed a commit to liuziyue/kineto that referenced this pull request Mar 10, 2021
@gdankel linked an issue Mar 15, 2021 that may be closed by this pull request
ilia-cher pushed a commit that referenced this pull request Mar 16, 2021
Successfully merging this pull request may close these issues.

CUDA traces not getting generated in some environments