Frame and FrameBatch improvements #283

NicolasHug · 2024-10-23T13:18:52Z

This PR:

Adds a bunch of input checks when creating a Frame or a FrameBatch. We enforce Frame data to be 3D, and FrameBatch data to be >= 4D. We also ensure consistency of leading dimensions between data, pts_seconds and duration_seconds.
Adds indexing support for FrameBatch. This supports PyTorch fancy indexing naturally and intuitively. Note that indexing a 4D FrameBatch returns a Frame.
Adds iteration support for FrameBatch. This removes the "tuple unpacking" behavior that we had for FrameBatch, but luckily this is not something we have been using at all. The unpacking behavior of Frame is preserved.

These changes are mostly necessary in order for us to change the output of the samplers from List[FrameBatch(4D)] to FrameBatch (5D), as done in #284.
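As a rough illustration of the input checks described above, here is a minimal sketch that works on shape tuples rather than real tensors, so it runs without PyTorch. The function names `check_frame_shape` and `check_frame_batch_shapes` are hypothetical and the error messages are invented for the example; the actual checks in `_frame.py` may be written differently.

```python
from typing import Tuple

Shape = Tuple[int, ...]


def check_frame_shape(data: Shape) -> None:
    """Hypothetical check: a single Frame holds exactly one 3D image."""
    if len(data) != 3:
        raise ValueError(f"Frame data must be 3D, got shape {data}")


def check_frame_batch_shapes(
    data: Shape, pts_seconds: Shape, duration_seconds: Shape
) -> None:
    """Hypothetical check: FrameBatch data is >= 4D with consistent leading dims."""
    if len(data) < 4:
        raise ValueError(f"FrameBatch data must be at least 4D, got shape {data}")
    # Assumption: the last 3 dims describe one image, everything before
    # them is a batch dimension shared with the timestamp tensors.
    leading_dims = data[:-3]
    if not (leading_dims == pts_seconds == duration_seconds):
        raise ValueError(
            f"Leading dims {leading_dims} of data must match pts_seconds "
            f"{pts_seconds} and duration_seconds {duration_seconds}"
        )


# A 4D batch of 10 frames, and a 5D batch of 5 clips with 10 frames each.
check_frame_shape((3, 32, 32))
check_frame_batch_shapes((10, 3, 32, 32), (10,), (10,))
check_frame_batch_shapes((5, 10, 3, 32, 32), (5, 10), (5, 10))
```

The 5D case is the one that matters for #284: a single FrameBatch can only replace a list of 4D batches if the timestamps carry the same two leading dimensions as the data.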

ahmadsharif1 · 2024-10-23T14:40:26Z

src/torchcodec/_frame.py

+        if self.data.ndim == 4:
+            return Frame(
+                data=data,
+                pts_seconds=float(pts_seconds.item()),
+                duration_seconds=float(duration_seconds.item()),
+            )
+        else:
+            return FrameBatch(
+                data=data,
+                pts_seconds=pts_seconds,
+                duration_seconds=duration_seconds,
+            )


Tensor has a .item() method for returning the underlying value as a Python scalar.

Should we have something like that here? i.e. always return a FrameBatch but return a Frame if .item() is called?

NicolasHug · 2024-10-23

> always return a FrameBatch but return a Frame if .item() is called?

I feel like this is what we're already doing, but perhaps I'm misunderstanding?

BTW, this quirk is only needed for mypy (sigh). Originally the code was simpler:

    cls = Frame if self.data.ndim == 4 else FrameBatch
    return cls(
        self.data[key],
        self.pts_seconds[key],
        self.duration_seconds[key],
    )

and everything was fine, and the Frame would get proper float values because of what we do in its __post_init__. But mypy was complaining, so I had to go for this in 4661237 (#283).

ahmadsharif1 · 2024-10-24

> I feel like this is what we're already doing, but perhaps I'm misunderstanding?

Don't we return a Frame for the special case of dimensions=4?

What I am saying is that we should return a FrameBatch even in that case (of size 1), so that we are consistent with Tensor.

I'll leave it to you.

NicolasHug · 2024-10-24

I don't have a super strong preference on this. Let me open an issue so we can discuss during one of the meetings.
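To make the behavior under discussion concrete, here is a toy sketch of the 4D case as this PR implements it: an integer index drops the batch dimension and yields a single frame, and iteration yields one frame at a time. `ToyFrame` and `ToyFrameBatch` are hypothetical stand-ins that use nested lists instead of tensors; they are not the torchcodec classes.

```python
from dataclasses import dataclass
from typing import Iterator, List


@dataclass
class ToyFrame:
    data: list  # toy stand-in for a 3D image tensor
    pts_seconds: float
    duration_seconds: float


@dataclass
class ToyFrameBatch:
    data: List[list]  # toy stand-in for a 4D tensor (batch of images)
    pts_seconds: List[float]
    duration_seconds: List[float]

    def __len__(self) -> int:
        return len(self.data)

    def __getitem__(self, key: int) -> ToyFrame:
        # 4D case in this PR: an integer index removes the batch
        # dimension, so the result is a single frame with float fields.
        return ToyFrame(
            data=self.data[key],
            pts_seconds=float(self.pts_seconds[key]),
            duration_seconds=float(self.duration_seconds[key]),
        )

    def __iter__(self) -> Iterator[ToyFrame]:
        # Iteration walks the batch dimension, one frame at a time.
        for i in range(len(self)):
            yield self[i]


batch = ToyFrameBatch(
    data=[["frame0"], ["frame1"]],
    pts_seconds=[0.0, 0.04],
    duration_seconds=[0.04, 0.04],
)
first = batch[0]      # a ToyFrame, not a batch of size 1
frames = list(batch)  # two ToyFrame objects
```

Under the alternative raised in the review, `__getitem__` would return a batch here as well, which mirrors how indexing a Tensor returns a Tensor. That is the direction later taken in #296.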

Frame and FrameBatch improvements

1482529

facebook-github-bot added the CLA Signed label Oct 23, 2024

Fix mypy?

4661237

NicolasHug marked this pull request as ready for review October 23, 2024 13:38

ahmadsharif1 reviewed Oct 23, 2024

NicolasHug mentioned this pull request Oct 23, 2024

Speed-up time-based samplers by 20X and index-based by 1.5X #284

Merged

Added comment

c6b594c

NicolasHug requested review from ahmadsharif1 and scotts October 23, 2024 16:39

Merge branch 'main' of github.com:pytorch/torchcodec into frame_improvements

a53ef1a

scotts approved these changes Oct 24, 2024

NicolasHug merged commit b841eb3 into meta-pytorch:main Oct 24, 2024
24 checks passed

NicolasHug deleted the frame_improvements branch October 24, 2024 13:24

This was referenced Oct 24, 2024

Indexing a 4D FrameBatch: Should this return a Frame or a 3D FrameBatch? #288

Closed

Indexing 4D FrameBatch now returns FrameBatch #296

Merged

4 participants