Add `sample_format` to audio metadata #557

NicolasHug · 2025-03-13T17:27:44Z

Towards #549

This PR adds the `sample_format` field to `AudioStreamMetadata`. Its value is whatever `av_get_sample_fmt_name()` returns, so it's a bit "ffmpeg-like": `flt`, `fltp`, `s16`, `s32`... Maybe we'll want to expose a more user-friendly string? I'm not sure; either way, we can always re-map in a follow-up.
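
For illustration only, here is a minimal sketch of what such a re-mapping could look like. It is not part of this PR or of TorchCodec's API: `SampleFormatInfo` and `describe_sample_format` are hypothetical names, and the only assumption about FFmpeg is its usual naming scheme, where a trailing `p` marks a planar layout.

```python
from typing import NamedTuple


class SampleFormatInfo(NamedTuple):
    """Hypothetical, friendlier view of an FFmpeg sample format name."""

    kind: str     # "unsigned int", "signed int" or "float"
    bits: int     # bits per sample
    planar: bool  # True if each channel is stored in its own plane


# Packed (interleaved) FFmpeg sample format names and what they mean.
_BASE_FORMATS = {
    "u8": ("unsigned int", 8),
    "s16": ("signed int", 16),
    "s32": ("signed int", 32),
    "s64": ("signed int", 64),
    "flt": ("float", 32),
    "dbl": ("float", 64),
}


def describe_sample_format(name: str) -> SampleFormatInfo:
    """Map a name such as "fltp" or "s16" to a SampleFormatInfo."""
    # Planar variants are the packed name followed by "p" (e.g. "fltp").
    planar = name.endswith("p") and name[:-1] in _BASE_FORMATS
    base = name[:-1] if planar else name
    if base not in _BASE_FORMATS:
        raise ValueError(f"Unknown sample format: {name!r}")
    kind, bits = _BASE_FORMATS[base]
    return SampleFormatInfo(kind=kind, bits=bits, planar=planar)


print(describe_sample_format("fltp"))
print(describe_sample_format("s32"))
```

Keeping the raw FFmpeg string in the metadata and layering a helper like this on top would leave room to change the friendly representation later without touching the stored value.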

NicolasHug · 2025-03-13T17:28:19Z

test/decoders/test_metadata.py

    )
    assert best_audio_stream_metadata.bit_rate == 128837
    assert best_audio_stream_metadata.codec == "aac"
+    assert best_audio_stream_metadata.sample_format == "fltp"


Hard-coded value here and below, this test already has lots of these.

NicolasHug · 2025-03-13T17:59:58Z

test/utils.py

+                # decoded frames.
+                self._reference_frames[stream_index] = torch.load(
+                    frames_data_path, weights_only=True
+                )


I'll add the corresponding reference frames in #556

NicolasHug · 2025-03-13T18:21:42Z

test/resources/sine_mono_s32.wav

Also adding this new asset, not strictly needed for this PR, but still useful to check a format that's not fltp.
It will be needed in #556 anyway. It's from TorchAudio.
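
As a rough sketch of what a non-fltp asset looks like, the snippet below writes a mono sine wave as 32-bit integer PCM using only the standard library. This is not how `sine_mono_s32.wav` was produced (that file comes from TorchAudio); `write_sine_s32_wav` and its parameters are hypothetical, and the expectation that FFmpeg reports such a file as `s32` is an assumption.

```python
import io
import math
import struct
import wave


def write_sine_s32_wav(dst, sample_rate=16000, frequency=440.0, duration_s=0.1):
    """Write a mono, 32-bit signed integer PCM sine wave to `dst`.

    `dst` can be a path or a writable, seekable binary file object.
    Returns the number of samples written.
    """
    num_samples = int(sample_rate * duration_s)
    max_amplitude = 2**31 - 1
    # Half-scale sine wave, quantized to 32-bit signed integers.
    samples = [
        int(0.5 * max_amplitude * math.sin(2 * math.pi * frequency * n / sample_rate))
        for n in range(num_samples)
    ]
    with wave.open(dst, "wb") as wav_file:
        wav_file.setnchannels(1)   # mono
        wav_file.setsampwidth(4)   # 4 bytes per sample = 32 bits
        wav_file.setframerate(sample_rate)
        # WAV stores PCM samples as little-endian integers.
        wav_file.writeframes(struct.pack(f"<{num_samples}i", *samples))
    return num_samples


# Round-trip through an in-memory buffer to check the header.
buffer = io.BytesIO()
written = write_sine_s32_wav(buffer)
buffer.seek(0)
with wave.open(buffer, "rb") as wav_file:
    print(wav_file.getnchannels(), wav_file.getsampwidth(), wav_file.getnframes() == written)
```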

Add sample_format to audio metadata

58e5277

facebook-github-bot added the CLA Signed label Mar 13, 2025

NicolasHug commented Mar 13, 2025

Add non-fltp file

94c11fc

NicolasHug commented Mar 13, 2025

scotts approved these changes Mar 14, 2025

NicolasHug merged commit 8e611bb into meta-pytorch:main Mar 14, 2025
46 checks passed
