Fix audio decoding bug with FFmpeg4 by setting channel_layout #865

Dan-Flores · 2025-09-02T14:30:27Z

This PR avoids the error message in #843 when usinf FFmpeg4 by setting the missing channel_layout field avFrame. FFmpeg documentation indicates that this field may be 0 if it is unknown or unspecified, so it is set to the default layout for the number of channels present using av_get_default_channel_layout.

The test was updated to compare the decoded frames to the checked in samples from sine_mono_s16.wav. These checked in samples were generated using torchcodec with FFmpeg6.

NicolasHug · 2025-09-02T15:18:29Z

src/torchcodec/_core/SingleStreamDecoder.cpp

-      srcNumChannels,
-      ". If you are hitting this, it may be because you are using "
-      "a buggy FFmpeg version. FFmpeg4 is known to fail here in some "
-      "valid scenarios. Try to upgrade FFmpeg?");


I think we should still have this check, but perhaps relax it if getNumChannels(srcAVFrame) returns 0 while getNumChannels(streamInfo.codecContext) returns something < 1 (which is what is happening in #843

I added back the check, and excluded the case where getNumChannels(srcAVFrame) returns 0 while getNumChannels(streamInfo.codecContext) returns a valid fallback.

src/torchcodec/_core/FFMPEGCommon.cpp

NicolasHug · 2025-09-02T17:02:06Z

src/torchcodec/_core/FFMPEGCommon.cpp

+  // to allow successful initialization of SwrContext
+  if (numChannels == 0 && avFrame->channels > 0) {
+    avFrame->channel_layout = av_get_default_channel_layout(avFrame->channels);
+    return avFrame->channels;


Let's just do this instead, so we can have a single return statement

Suggested change

return avFrame->channels;

numChannels = avFrame->channels;

NicolasHug · 2025-09-02T17:05:34Z

src/torchcodec/_core/SingleStreamDecoder.cpp

+  int srcNumChannelsFromCodec = getNumChannels(streamInfo.codecContext);
+  // Use number of channels from codec if 0 returned from frame
+  if (srcNumChannels == 0 && srcNumChannelsFromCodec > 0) {
+    srcNumChannels = srcNumChannelsFromCodec;


I wonder if we still need this if branch, now that we are returning the hopefully-correct avFrame->channels instead of the incorrect numChannels in getNumChannels().

Can you double check if that's still needed? Perhaps we can leave the entire TORCH_CHECK() the way it was!

NicolasHug · 2025-09-02T17:08:23Z

test/test_decoders.py

+        test_frames = decoder.get_samples_played_in_range()
+        assert test_frames.data.shape[0] == decoder.metadata.num_channels
+        assert test_frames.sample_rate == decoder.metadata.sample_rate


Let's use the term "samples" instead of frame for the stuff that comes out of the decoder. For the asset, the use of

reference_frames = asset.get_frame_data_by_range

is still correct since this is really a frame-based API.

NicolasHug · 2025-09-02T17:10:10Z

test/test_decoders.py

+            start=0, stop=1, stream_index=0
+        )
+        torch.testing.assert_close(
+            test_frames.data[0], reference_frames, atol=0, rtol=0


I'm surprised by the use of [0] here - can you help me understand why that's needed?

The decoded audio has the shape [1, 64000], where the first dimension is the number of channels (1, in the case of SINE_MONO_S16), and the second dimension is the samples.

The checked in tensor is returned by get_frame_data_by_index in utils.py, which returns only the samples at index=1 for the one stream_index.

So to make the comparison, the test must use only the samples from the decoded audio at [0].

Got it, I think eventually we may want to update get_frame_data_by_index to always return 2D audio samples, but that's not urgent. Using [0] like yo did is fine.

src/torchcodec/_core/FFMPEGCommon.cpp

NicolasHug

NICE.

Thank you @Dan-Flores for the fix, and for finding a cleaner fix than my original hacky workaround!

I'm labeling this as "bug" so that we can list it in the bug-fix section of our release notes. It might be worth updating the PR title to something more "user-facing", like "Fix audio decoding bug with FFmpeg4" or something like that.

NicolasHug · 2025-09-03T09:24:39Z

src/torchcodec/_core/FFMPEGCommon.cpp

+  // Handle FFmpeg 4 bug where channel_layout is 0 or unset
+  // Set it to the default layout for the number of channels
+  // to allow successful initialization of SwrContext


Nit, I think the code has slightly changed since the comment was written, so this edit may be closer to what's happening (feel free to edit!)

Suggested change

// Handle FFmpeg 4 bug where channel_layout is 0 or unset

// Set it to the default layout for the number of channels

// to allow successful initialization of SwrContext

// Handle FFmpeg 4 bug where channel_layout and numChannels are 0 or unset

// Set them based on avFrame->channels which appears to be correct

// to allow successful initialization of SwrContext

set channel_layout in avFrame

366c63f

meta-cla bot added the CLA Signed This label is managed by the Meta Open Source bot. label Sep 2, 2025

Dan-Flores marked this pull request as ready for review September 2, 2025 15:00

NicolasHug reviewed Sep 2, 2025

View reviewed changes

src/torchcodec/_core/FFMPEGCommon.cpp Show resolved Hide resolved

Restore TORCH_CHECK, return non-zero channels

4bed34b

NicolasHug reviewed Sep 2, 2025

View reviewed changes

Daniel Flores added 2 commits September 2, 2025 13:22

restore original torch_check

38caec7

incorporate suggestions

73130b4

NicolasHug approved these changes Sep 3, 2025

View reviewed changes

NicolasHug added the bug Something isn't working label Sep 3, 2025

update comment

a2052d8

Dan-Flores changed the title ~~Set default channel_layout in avFrame~~ Fix audio decoding bug with FFmpeg4 by setting channel_layout Sep 3, 2025

Dan-Flores merged commit 1a88391 into meta-pytorch:main Sep 3, 2025
46 of 50 checks passed

Dan-Flores deleted the fix_ffmpeg4_bug branch September 3, 2025 15:33

Dan-Flores mentioned this pull request Sep 3, 2025

RuntimeError: The frame has 0 channels, expected 1. If you are hitting this, it may be because you are using a buggy FFmpeg version. FFmpeg4 is known to fail here in some valid scenarios. Try to upgrade FFmpeg? #843

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix audio decoding bug with FFmpeg4 by setting channel_layout #865

Fix audio decoding bug with FFmpeg4 by setting channel_layout #865

Uh oh!

Dan-Flores commented Sep 2, 2025

Uh oh!

NicolasHug Sep 2, 2025 •

edited

Loading

Uh oh!

Dan-Flores Sep 2, 2025

Uh oh!

Uh oh!

NicolasHug Sep 2, 2025

Uh oh!

NicolasHug Sep 2, 2025

Uh oh!

NicolasHug Sep 2, 2025

Uh oh!

NicolasHug Sep 2, 2025

Uh oh!

Dan-Flores Sep 2, 2025

Uh oh!

NicolasHug Sep 3, 2025 •

edited

Loading

Uh oh!

Uh oh!

NicolasHug left a comment •

edited

Loading

Uh oh!

NicolasHug Sep 3, 2025

Uh oh!

Uh oh!

Uh oh!

Fix audio decoding bug with FFmpeg4 by setting channel_layout #865

Fix audio decoding bug with FFmpeg4 by setting channel_layout #865

Uh oh!

Conversation

Dan-Flores commented Sep 2, 2025

Uh oh!

NicolasHug Sep 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Dan-Flores Sep 2, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

NicolasHug Sep 2, 2025

Choose a reason for hiding this comment

Uh oh!

NicolasHug Sep 2, 2025

Choose a reason for hiding this comment

Uh oh!

NicolasHug Sep 2, 2025

Choose a reason for hiding this comment

Uh oh!

NicolasHug Sep 2, 2025

Choose a reason for hiding this comment

Uh oh!

Dan-Flores Sep 2, 2025

Choose a reason for hiding this comment

Uh oh!

NicolasHug Sep 3, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

NicolasHug left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

NicolasHug Sep 3, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

NicolasHug Sep 2, 2025 •

edited

Loading

NicolasHug Sep 3, 2025 •

edited

Loading

NicolasHug left a comment •

edited

Loading