Initial version of handling audio object content. #304
Object audio is an emerging immersive format that is especially interesting for content creators. An audio object consists of 1) an audio waveform stem (typically mono) and 2) associated object metadata. Currently the most prominent application is spatial objects, where the metadata describes the spatial position of the object as a function of time, and all objects in the content are rendered simultaneously according to their metadata. Objects are in principle layout-agnostic, so the content can be reproduced on any multi-loudspeaker setup, over headphones, etc. Content creators typically prefer purely spatial objects without renderer-side interactivity, in order to preserve the artistic intent.
This PR represents an initial step toward handling object audio compression with Opus. The most naive solution would be to code each object with a separate mono Opus instance at equal bitrate. However, since the number of objects can be large, this consumes a lot of bitrate. Luckily, Opus already implements multistream coding, as well as a mechanism to adjust individual channel/stream rates based on an analysis of the joint masking among all channels. Neither object metadata handling nor decoder-side rendering is implemented, and it may be reasonable to leave these outside Opus in general. All object PCM waveforms are assumed to be provided as input, e.g. as a single multichannel file.
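For reference, the existing libopus multistream API can already carry N mono object stems as N uncoupled streams. The sketch below is not code from this PR; the object count, bitrate, and identity channel mapping are placeholders, and the per-stream rate split is left to the encoder's internal allocation.

```c
/* Minimal sketch (not from this PR): coding N object stems as N uncoupled
 * mono streams with libopus's existing multistream encoder.  The object
 * count and bitrate below are arbitrary placeholders. */
#include <stdio.h>
#include <opus_multistream.h>

#define NUM_OBJECTS 8   /* hypothetical number of audio objects */

int main(void)
{
    unsigned char mapping[NUM_OBJECTS];
    OpusMSEncoder *enc;
    int err, i;

    /* Input channel i carries the stem of object i and feeds mono stream i. */
    for (i = 0; i < NUM_OBJECTS; i++)
        mapping[i] = (unsigned char)i;

    enc = opus_multistream_encoder_create(
        48000,           /* sample rate */
        NUM_OBJECTS,     /* input channels: one per object */
        NUM_OBJECTS,     /* number of streams */
        0,               /* coupled (stereo) streams: none, objects are mono */
        mapping, OPUS_APPLICATION_AUDIO, &err);
    if (err != OPUS_OK || enc == NULL) {
        fprintf(stderr, "encoder init failed: %s\n", opus_strerror(err));
        return 1;
    }

    /* One total bitrate for the whole multistream; the split across the
     * individual streams is handled by the encoder's internal allocation. */
    opus_multistream_encoder_ctl(enc, OPUS_SET_BITRATE(NUM_OBJECTS * 48000));

    /* ... feed interleaved NUM_OBJECTS-channel PCM to
     * opus_multistream_encode_float() here ... */

    opus_multistream_encoder_destroy(enc);
    return 0;
}
```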
The underlying spatial masking model for bitrate allocation assumes that all objects in the content/multistream are rendered with a typical spatial renderer (such as EAR). Decoder-side interactivity (e.g. changing object levels) is not assumed here. In a typical listening room with reflections (as opposed to a free-field/anechoic room), spatial release from masking is not very prominent, so for typical object content this first approximation simply assumes no spatial release from masking between objects.
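Purely as an illustration (and not code from this PR), the zero-release assumption amounts to counting every other object's masker energy at full strength per band; a model with spatial release would instead scale each contribution by a direction-dependent factor below one. The function and argument names below are hypothetical.

```c
/* Hypothetical illustration of the "no spatial release from masking"
 * assumption: per analysis band, the masking energy seen by object `obj`
 * is the plain power sum of all other objects' band energies.  A model
 * with spatial release would scale each term by a factor < 1 depending
 * on the angular separation between the two objects. */
static float combined_masker_energy(const float *band_energy, int num_objects, int obj)
{
    float sum = 0.f;
    int j;
    for (j = 0; j < num_objects; j++) {
        if (j != obj)
            sum += band_energy[j];  /* full contribution: no spatial release */
    }
    return sum;
}
```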
Despite this simplicity, object_analysis is added as a separate function to make future development easier. Related PRs to other Opus projects: TODO.
Comments and suggestions welcome!