
Conversation

@NicolasHug
Contributor

@NicolasHug NicolasHug commented Oct 29, 2024

This PR is about where and when we call MaybePermuteHWC2CHW(). It's not about tensor allocation (this will come, later).


At a high level, this PR changes all conditional call patterns like:

```cpp
if (cond) {
  output.frames = MaybePermuteHWC2CHW(output.frames);
}
```

to a plain, unconditional

```cpp
output.frames = MaybePermuteHWC2CHW(output.frames);
```

This makes it a lot simpler to reason about our output shape permutation. In main, `cond` is typically input-dependent (but really, caller-dependent), which leads to state that is hard to reason about.

Another benefit of this PR is that now all low-level decoding routines (like convertAVFrameToDecodedOutputOnCPU()) have a simpler interface: they only ever take and return HWC tensors.
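
To illustrate the shape contract, here is a minimal, self-contained sketch of what an "unconditional call, decision inside the helper" permute function can look like. The types are hypothetical stand-ins (the real function operates on `torch::Tensor` and the stream's options), and the member names are invented for this sketch:

```cpp
#include <array>
#include <cassert>
#include <string>

// Hypothetical stand-in for torch::Tensor: we only track dimension sizes.
struct FakeTensor {
  std::array<int, 3> sizes;
};

// Hypothetical stand-in for the per-stream decoder options.
struct StreamOptions {
  std::string dimensionOrder = "NCHW"; // or "NHWC"
};

// Always called unconditionally; the "Maybe" decision lives inside.
// Input is always HWC; output is CHW only when the options ask for it.
FakeTensor maybePermuteHWC2CHW(const StreamOptions& options,
                               FakeTensor hwcFrame) {
  if (options.dimensionOrder == "NHWC") {
    return hwcFrame; // caller wants HWC: nothing to do
  }
  // HWC -> CHW: move the channel dimension to the front.
  return FakeTensor{
      {hwcFrame.sizes[2], hwcFrame.sizes[0], hwcFrame.sizes[1]}};
}
```

With this shape, every call site reads the same, and the low-level routines can safely assume HWC everywhere.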


At a lower level, the following changes were made:

  • MaybePermuteHWC2CHW() is now a method, so we can pass it a streamIndex; this also makes its interface slightly simpler.
  • It's now up to every high-level decoding function to call MaybePermuteHWC2CHW().
  • Some methods, like getFrameAtIndex() and getNextDecodedOutputNoDemux(), were used both as high-level decoding entry points and as low-level subroutines of other entry points. I split those into getFrameAtIndex()/getFrameAtIndexInternal() and getNextFrame()/getNextDecodedOutputNoDemux() to clearly distinguish the public entry point from the underlying private helper. Note that this isn't just a "nice-to-have" or a nit-pick; it's a necessary change for the goal of this PR.
  • getNextFrame() is the new public entry point; getNextDecodedOutputNoDemux() is now private.

A follow-up of this PR will be to unify the tensor allocation. I think it'd make sense for tensors to always be pre-allocated by the high-level decoding entry points. It will allow us to unify the allocation logic in a single place.
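
The public/private split described above can be sketched as follows. This is a simplified, hypothetical model (the function names mirror the PR, but the `Frame` and `Options` types and their members are invented stand-ins for `DecodedOutput` and the stream options):

```cpp
#include <cassert>

// Hypothetical stand-in for DecodedOutput: we only track the layout.
struct Frame {
  bool isHWC = true; // decoding always produces HWC frames
};

// Hypothetical stand-in for the per-stream output-format option.
struct Options {
  bool wantCHW = true;
};

// The "Maybe" decision lives inside the helper, not at the call sites.
Frame maybePermuteHWC2CHW(const Options& options, Frame frame) {
  if (options.wantCHW) {
    frame.isHWC = false; // stand-in for the actual tensor permute
  }
  return frame;
}

// Low-level subroutine: only ever takes and returns HWC frames.
Frame getFrameAtIndexInternal(int /*streamIndex*/, long /*frameIndex*/) {
  return Frame{};
}

// High-level entry point: unconditionally applies the output permutation,
// so the permute logic lives in exactly one layer.
Frame getFrameAtIndex(const Options& options, int streamIndex,
                      long frameIndex) {
  Frame frame = getFrameAtIndexInternal(streamIndex, frameIndex);
  return maybePermuteHWC2CHW(options, frame);
}
```

Under this convention, only entry points ever see a CHW frame; everything below them stays in HWC, which is also what makes a single, up-leveled allocation site possible later.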

@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Meta Open Source bot. label Oct 29, 2024
```python
raise ValueError(f"No GLIBCXX symbols found in {symbol_matches}. Something is wrong.")
raise ValueError(
    f"No GLIBCXX symbols found in {symbol_matches}. Something is wrong."
)
```
Contributor Author

Not sure why, but my linter started wanting to format this. If the linter in our CI job is OK with it, can we let this in? Otherwise I'll have to manually revert it on all my commits.

Contributor

Linters gonna lint.

Comment on lines -932 to -939
```cpp
if (!preAllocatedOutputTensor.has_value()) {
  // We only convert to CHW if a pre-allocated tensor wasn't passed. When a
  // pre-allocated tensor is passed, it's up to the caller (typically a
  // batch API) to do the conversion. This is more efficient as it allows
  // batch NHWC tensors to be permuted only once, instead of permuting HWC
  // tensors N times.
  output.frame = MaybePermuteHWC2CHW(streamInfo.options, output.frame);
}
```
Contributor Author

@NicolasHug NicolasHug Oct 29, 2024

This removal is actually the key change. Everything else is just (sensible) patching until the tests pass.

@scotts
Contributor

scotts commented Oct 29, 2024

Looks good to me. @ahmadsharif1 should also review.

```cpp
  return rawOutput;
}

VideoDecoder::DecodedOutput VideoDecoder::getNextFrame() {
```
Contributor

For consistency, this should have a `NoDemux` suffix.

```cpp
      int streamIndex,
      int64_t frameIndex,
      std::optional<torch::Tensor> preAllocatedOutputTensor = std::nullopt);
  DecodedOutput getNextDecodedOutputNoDemux(
```
Contributor

Should this have an `Internal` suffix?

```cpp
      DecodedOutput& output,
      std::optional<torch::Tensor> preAllocatedOutputTensor = std::nullopt);

  DecodedOutput getFrameAtIndexInternal(
```
Contributor

Should you have a comment somewhere saying that these always return frames in HWC?

You could adopt a convention that `Internal`-suffixed functions always return HWC.

Contributor Author

Yes, definitely. Ultimately I want to label all functions in the decoding stack with their expected input and output shapes. I'll follow up on that; I think it'll be part of a sequence of PRs, after the one about up-leveling the tensor allocation.

@NicolasHug NicolasHug merged commit daf3631 into meta-pytorch:main Oct 29, 2024
40 checks passed
@NicolasHug NicolasHug deleted the unify_shape_handling branch October 29, 2024 16:40