Limiter applied? #20

sclsj · 2022-10-15T01:02:51Z

Still the sample audio in the last issue, but I noticed that the first two channels are heavily limited and sounds much more obvious than the stereo mix provided by the artist. Is this limiter applied by the producer or the decoder?

SakethSathuvalli · 2022-11-08T14:32:04Z

Hi @sclsj,

The decoded output depends on the configurations enabled at the encoder end. The decoder as such does not do any kind of post processing apart from what the specification suggests. So in your case if you are observing that the first two channels are heavily limited then in all probability it is the way the stream is encoded (intended by the creator of the stream).

Request you to close the issue if this answers your doubt!

Thanks!

sclsj · 2022-11-27T19:12:52Z

Thank you!

The reason I'm asking is that it appears that if I select 2ch rather than 7.1ch as output, the first two channels are more limited/compressed, although I could be wrong about that.

I will close this issue when I'm able to extract the individual objects to see if they are compressed as well.

SakethSathuvalli · 2023-01-02T04:50:56Z

Hi,

If the speaker layout of the bitstream file is different from that of the speaker layout requested from command line using -cicp: option, then the decoder converts the output to the requested speaker layout using rendering algorithm suitable for the file.

From the pictures you shared it looks like the result of rendering gives an impression that the signal is compressed. As informed earlier, the decoder does not specifically apply limiter apart from the end of chain processing descried in the specification.

Thanks!

sclsj · 2023-01-02T04:52:30Z

That made sense. However, you did not answer my question. What can I do to not let this happen?

SakethSathuvalli · 2023-01-02T04:58:59Z

Can You let us know the information on the number of channels / objects in this stream and the speaker lay-out of the bit stream?

Also can you please explain this "Stereo Mix provided by the artist" part ? Can you elaborate on what this information is ?

SakethSathuvalli · 2023-01-02T05:39:50Z

Hi @sclsj,

Can You please let us know if you see the "limiting effect" when you he command line option -cicp: is not used ??

sclsj · 2023-01-02T05:56:13Z

And speaking of such, I really want to see support for outputting in (decoding to) object-oriented format such as ADM BWF. It makes more sense to call this as decoding and mixing down those objects for a specific channel layout as rendering.

SakethSathuvalli · 2023-01-02T06:09:34Z

For the "sample audio in the last issue" one- yes. Still quite obvious (especially in the first two channel)

For the other example, not really. (Well, to be honest it's still obvious in channel 3. That one does not change regardless of cicp configuration, suggesting that quite some objects are concentrated in that coordinate/direction/speaker. (Or just that the vocal are quite loud / high gain compared to everything else)

The two pictures here correspond to that of different audio streams or the same stream decoded with different options - can you please elaborate ?

SakethSathuvalli · 2023-01-02T06:11:15Z

And speaking of such, I really want to see support for outputting in (decoding to) object-oriented format such as ADM BWF. It makes more sense to call this as decoding and mixing down those objects for a specific channel layout as rendering.

We dont have currently support for ADM BWF. However, its possible to have individual decoded objects using the -ext_ren: flag.

sclsj · 2023-01-02T06:23:53Z

Two different ones. First one is 群青, second one is Essence. Sorry for not making that clear.

sclsj · 2023-01-02T06:29:11Z

And speaking of such, I really want to see support for outputting in (decoding to) object-oriented format such as ADM BWF. It makes more sense to call this as decoding and mixing down those objects for a specific channel layout as rendering.

We dont have currently support for ADM BWF. However, its possible to have individual decoded objects using the -ext_ren: flag.

I saw this in the GSG docx. I tried -ext_ren:1 flag, and I got _ext_ren_pcm.raw and _ext_ren_oam_md.bs in the executable folder (agrees with command line description but conflicts with the documentation). I find the section referred in the documentation, but still as some doubt:

17.10.6 Audio PCM data
The PCM data of the channels and objects interfaces shall be provided through the decoder PCM buffer, which first contains the regular rendered PCM signals (e.g. 12 signals for a 7.1+4 setup). Subsequently nchan, out additional signals carry the PCM data of the originally transmitted channel representation. These are followed by nobj, out signals carrying the PCM data of the un-rendered output objects. Then additional signals carry the nHOA, out HOA data which number is indicated in the HOA metadata interface via the HOA order (e.g. 16 signals for HOA order 3). The HOA audio data in the HOA output interface is provided in the so-called equivalent spatial domain representation. The conversion from the HOA domain into the equivalent spatial domain representation and vice versa is described in Annex C.5.1.
The decoder shall signal the offset index of the PCM buffer for the first un-rendered output object and the offset index of the PCM buffer for the first HOA audio signal.

Well, that gives us 12 + 12 + 10 = 34 channels. Assuming 16-bit and 48000Hz, that would result in a 747 mb file, but I got a 357 mb file.

When I try to decode it, I also get (mostly) garbage channels (channels with random noise). Not sure what I'm doing wrong here. I'm using: ffmpeg -f s16le -ar 48k -ac 15 -i /Users/jin/Desktop/libmpegh/_ext_ren_pcm.raw /Users/jin/Desktop/libmpegh/_ext_ren_pcm.wav. The 15 channel count comes from a rough estimate based on file size. I also tried other ones, ranging from 2 to 35 channels, but either all the channels are noise or most of the channels are noise.

Is there a flag I can use for the tool to output a wav instead of a raw pcm?

Also, if I read the specification right, according to 17.10.3 objects are still processed (DRC, gain, and peak limiter) before they are exported. Can I disable that?

SakethSathuvalli · 2023-02-03T06:47:33Z

And speaking of such, I really want to see support for outputting in (decoding to) object-oriented format such as ADM BWF. It makes more sense to call this as decoding and mixing down those objects for a specific channel layout as rendering.

We dont have currently support for ADM BWF. However, its possible to have individual decoded objects using the -ext_ren: flag.

I saw this in the GSG docx. I tried -ext_ren:1 flag, and I got _ext_ren_pcm.raw and _ext_ren_oam_md.bs in the executable folder (agrees with command line description but conflicts with the documentation). I find the section referred in the documentation, but still as some doubt:
17.10.6 Audio PCM data
The PCM data of the channels and objects interfaces shall be provided through the decoder PCM buffer, which first contains the regular rendered PCM signals (e.g. 12 signals for a 7.1+4 setup). Subsequently nchan, out additional signals carry the PCM data of the originally transmitted channel representation. These are followed by nobj, out signals carrying the PCM data of the un-rendered output objects. Then additional signals carry the nHOA, out HOA data which number is indicated in the HOA metadata interface via the HOA order (e.g. 16 signals for HOA order 3). The HOA audio data in the HOA output interface is provided in the so-called equivalent spatial domain representation. The conversion from the HOA domain into the equivalent spatial domain representation and vice versa is described in Annex C.5.1.
The decoder shall signal the offset index of the PCM buffer for the first un-rendered output object and the offset index of the PCM buffer for the first HOA audio signal.
Well, that gives us 12 + 12 + 10 = 34 channels. Assuming 16-bit and 48000Hz, that would result in a 747 mb file, but I got a 357 mb file.

When I try to decode it, I also get (mostly) garbage channels (channels with random noise). Not sure what I'm doing wrong here. I'm using: ffmpeg -f s16le -ar 48k -ac 15 -i /Users/jin/Desktop/libmpegh/_ext_ren_pcm.raw /Users/jin/Desktop/libmpegh/_ext_ren_pcm.wav. The 15 channel count comes from a rough estimate based on file size. I also tried other ones, ranging from 2 to 35 channels, but either all the channels are noise or most of the channels are noise.

Is there a flag I can use for the tool to output a wav instead of a raw pcm?

Also, if I read the specification right, according to 17.10.3 objects are still processed (DRC, gain, and peak limiter) before they are exported. Can I disable that?

Hi @sclsj

Can You please refer to our wiki page on external rendering interfaces ?

Thanks!

SakethSathuvalli · 2023-02-08T16:32:51Z

Hi @sclsj,

Can You please close this issue if this is similar to what is been discussed #19

Thanks!

sclsj · 2023-02-09T16:40:25Z

Yes, it’s kind of the same thing. I’m having some other related issues but I need to investigate further before posting them.

sclsj mentioned this issue Dec 23, 2022

What are valid drc effect types? #35

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Limiter applied? #20

Limiter applied? #20

sclsj commented Oct 15, 2022

SakethSathuvalli commented Nov 8, 2022

sclsj commented Nov 27, 2022

This comment was marked as outdated.

SakethSathuvalli commented Jan 2, 2023

sclsj commented Jan 2, 2023

SakethSathuvalli commented Jan 2, 2023

This comment was marked as outdated.

This comment was marked as outdated.

SakethSathuvalli commented Jan 2, 2023

This comment was marked as outdated.

sclsj commented Jan 2, 2023

SakethSathuvalli commented Jan 2, 2023

SakethSathuvalli commented Jan 2, 2023

sclsj commented Jan 2, 2023

sclsj commented Jan 2, 2023 •

edited

Loading

SakethSathuvalli commented Feb 3, 2023

SakethSathuvalli commented Feb 8, 2023

sclsj commented Feb 9, 2023

Limiter applied? #20

Limiter applied? #20

Comments

sclsj commented Oct 15, 2022

SakethSathuvalli commented Nov 8, 2022

sclsj commented Nov 27, 2022

This comment was marked as outdated.

SakethSathuvalli commented Jan 2, 2023

sclsj commented Jan 2, 2023

SakethSathuvalli commented Jan 2, 2023

This comment was marked as outdated.

This comment was marked as outdated.

SakethSathuvalli commented Jan 2, 2023

This comment was marked as outdated.

sclsj commented Jan 2, 2023

SakethSathuvalli commented Jan 2, 2023

SakethSathuvalli commented Jan 2, 2023

sclsj commented Jan 2, 2023

sclsj commented Jan 2, 2023 • edited Loading

SakethSathuvalli commented Feb 3, 2023

SakethSathuvalli commented Feb 8, 2023

sclsj commented Feb 9, 2023

sclsj commented Jan 2, 2023 •

edited

Loading