Rewrite docs so that it is OK to use record_stream before uses #113282

ezyang · 2023-11-08T18:04:47Z

Stack from ghstack (oldest at bottom):

-> Rewrite docs so that it is OK to use record_stream before uses #113282

The previous documentation did not appear to accurately describe
the actual semantics of the CUDA caching allocator.

When you call record_stream, the allocator only records a stream use:

  void recordStream(Block* block, cuda::CUDAStream stream) {
    std::lock_guard<std::recursive_mutex> lock(mutex);
    if (stream.stream() == block->stream) {
      // ignore uses on the allocation stream, since those don't require any
      // special synchronization
      return;
    }
    block->stream_uses.insert(stream);
  }

It is only at deallocation time that we actually install an event on
each recorded stream use; we subsequently query those events to
determine whether the block can be reused.
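
To illustrate why calling record_stream before the use is fine under
these semantics, here is a minimal toy model. It is a sketch only, not
the allocator's actual code: ToyBlock, ToyEvent, StreamId, free_block,
complete_stream and can_reuse are hypothetical names invented for this
example, and GPU work is simulated by flipping a flag.

```cpp
#include <cassert>
#include <set>
#include <vector>

// Hypothetical stand-ins for illustration; these are not PyTorch types.
using StreamId = int;

struct ToyEvent {
  StreamId stream;  // stream the event was recorded on
  bool completed;   // set by complete_stream to simulate the device catching up
};

struct ToyBlock {
  explicit ToyBlock(StreamId s) : alloc_stream(s) {}
  StreamId alloc_stream;           // stream the block was allocated on
  std::set<StreamId> stream_uses;  // filled in by record_stream
  std::vector<ToyEvent> pending;   // events installed at free time
  bool freed = false;
};

// Same shape as the quoted recordStream: it only notes the stream.
// No event is created here.
void record_stream(ToyBlock& block, StreamId stream) {
  if (stream == block.alloc_stream) {
    return;  // uses on the allocation stream need no extra synchronization
  }
  block.stream_uses.insert(stream);
}

// Assumed free-time behavior: install one event per recorded stream.
void free_block(ToyBlock& block) {
  assert(!block.freed);
  block.freed = true;
  for (StreamId s : block.stream_uses) {
    block.pending.push_back(ToyEvent{s, false});
  }
  block.stream_uses.clear();
}

// Simulate the device finishing everything queued on `stream` so far.
void complete_stream(ToyBlock& block, StreamId stream) {
  for (ToyEvent& e : block.pending) {
    if (e.stream == stream) {
      e.completed = true;
    }
  }
}

// A freed block may be reused only once every installed event has completed.
bool can_reuse(const ToyBlock& block) {
  if (!block.freed) {
    return false;
  }
  for (const ToyEvent& e : block.pending) {
    if (!e.completed) {
      return false;
    }
  }
  return true;
}
```

In this model the event is created inside free_block, so it sits behind
any work queued on the recorded stream between the record_stream call
and the deallocation. That is the property that makes recording before
the use safe, assuming the real allocator behaves as described above.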

Signed-off-by: Edward Z. Yang <ezyang@meta.com>


pytorch-bot · 2023-11-08T18:04:50Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/113282

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

✅ You can merge normally! (1 Unrelated Failure)

As of commit 958446d with merge base 9e6e958:

UNSTABLE - The following job failed but was likely due to flakiness present on trunk and has been marked as unstable:

pull / linux-focal-py3_8-clang9-xla / test (xla, 1, 1, linux.12xlarge, unstable) (gh)

This comment was automatically generated by Dr. CI and updates every 15 minutes.


albanD

SGTM

ezyang · 2023-11-08T20:03:29Z

@pytorchbot merge

pytorchmergebot · 2023-11-08T20:05:25Z

Merge failed

Reason: This PR needs a release notes: label
If your changes are user facing and intended to be a part of release notes, please use a label starting with release notes:.

If not, please add the topic: not user facing label.

To add a label, you can comment to pytorchbot, for example
@pytorchbot label "topic: not user facing"

For more information, see
https://github.com/pytorch/pytorch/wiki/PyTorch-AutoLabel-Bot#why-categorize-for-release-notes-and-how-does-it-work.

Details for Dev Infra team

Raised by workflow job

ezyang · 2023-11-08T21:21:46Z

@pytorchbot merge -f "only the doc job matters"

pytorchmergebot · 2023-11-08T21:24:33Z

Merge started

Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes). Please use -f as last resort and instead consider -i/--ignore-current to continue the merge ignoring current failures. This will allow currently pending tests to finish and report signal before the merge.

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status here

…ch#113282)

The previous documentation did not appear to accurately describe the actual semantics in CUDA caching allocator.

When you record stream, we only record a stream use:

```
void recordStream(Block* block, cuda::CUDAStream stream) {
  std::lock_guard<std::recursive_mutex> lock(mutex);
  if (stream.stream() == block->stream) {
    // ignore uses on the allocation stream, since those don't require any
    // special synchronization
    return;
  }
  block->stream_uses.insert(stream);
}
```

It is only at deallocation time when we actually install an event on stream uses that we will subsequently query to determine if the block can be reused or not.

Signed-off-by: Edward Z. Yang <ezyang@meta.com>
Pull Request resolved: pytorch#113282
Approved by: https://github.com/Skylion007, https://github.com/albanD

github-actions bot requested review from SherlockNoMad, albanD, antoniojkim, bdhirsh, miladm, voznesenskym and wconstab November 8, 2023 18:05

ezyang requested review from colesbury and janeyx99 November 8, 2023 18:05

Skylion007 approved these changes Nov 8, 2023


albanD approved these changes Nov 8, 2023


pytorch-bot bot added the ciflow/trunk label Nov 8, 2023

pytorchmergebot added the merging label Nov 8, 2023

pytorchmergebot removed the merging label Nov 8, 2023

ezyang added the release notes: cuda and topic: docs labels Nov 8, 2023

pytorchmergebot added the merging and Merged labels Nov 8, 2023

pytorchmergebot closed this in 77e8e8f Nov 8, 2023

pytorchmergebot removed the merging label Nov 8, 2023

facebook-github-bot deleted the gh/ezyang/2421/head branch November 12, 2023 15:24
