New lavc api #23

jamrial · 2021-06-06T18:22:54Z

This is untested (I don't have access to a system with alsa, but the change was straightforward enough that i could write it).

It is unlikely that it will fix the silence issue described in #22, but it nonetheless is a change required in order to support future versions of ffmpeg.
The silence is, as far as i could tell, related to the fact that starting with ffmpeg 4.4, the ac3_fixed encoder takes S32 planar as input, versus the S16 planar it used to. This plugin should look at the AVCodec sample_fmts[0] value and feed the encoder the appropriate data.

Since it's used as AVPacket payload, the API requires it to be padded. Signed-off-by: James Almer <jamrial@gmail.com>

a52/pcm_a52.c

Signed-off-by: James Almer <jamrial@gmail.com>

quequotion · 2021-06-08T01:52:17Z

Is this still relying on libavresample, which was deprecated from ffmpeg?

manio · 2021-06-08T03:36:53Z

@quequotion
from the lavrate.txt: "The plugin in rate-lavr subdirectory is an external rate converter using libavresample library", I don't see any references to it in the pcm_a52 though.

manio · 2021-06-08T04:05:45Z

@jamrial
Thank you again for your work on this. Unfortunately I don't see much feedback/active development recently on this plugin code, so I am still trying to fix it myself... :(
I was able to get "some kind of" sound from the plugin as an AC3 stream so I hope I am on good track :)

I am only thinking that even with not exactly matched input type for the encoder some sound has to be produced...
I did my changes on top of yours but my problem is that I don't understand one thing... maybe you can help me with this:

In your new do_encode() before avcodec_send_frame() (jamrial@3aec2b2#diff-96ce76abb22778de0d96ef94d576d3eb582a74c32dfe4f7ad8f560444b04f20eR120)
I need to explicit add this code:

  frame->format = rec->avctx->sample_fmt;
  frame->channel_layout = rec->avctx->channel_layout;
  uint16_t *samples = (uint16_t*)frame->data[0];
  ret = av_frame_get_buffer(frame, 0);
  if (ret < 0) {
        return -EINVAL;
  }
  memcpy(frame->data[0], samples, rec->outbuf_size-8);
  //optional av_frame_free() after an encode call...

otherwise I've got EINVAL from that function (using "original" rec->frame). Do you have a clue why is passing a frame which was previously working in avcodec_encode_audio2 has to be allocated again and memcopied before this new API call?
I don't see av_frame_get_buffer() in the whole file, instead I think it is used av_samples_alloc() in alloc_input_buffer(), is it ok for this new API?

ps. I know that above code is ugly and not complete - it is rather a playground to understand some things and at least something is audible from the speakers...

jamrial · 2021-06-08T13:07:57Z

The reason it fails is probably because the send/receive API attempts to create a new reference to the frame internally, whereas the old API would not and simply pass it as is to the underlying encoder, and creating a new reference requires certain AVFrame fields to be set. In this case, format and channel_layout were missing.

Using av_samples_alloc() is fine even for the new API, it will just result in some extra data copy when creating the new reference.

I set the two relevant fields in alloc_input_buffer() and pushed the changes. See if that helps.

manio · 2021-06-08T14:37:40Z

Thanks, yeah it helps in this matter that I don't need to set those extra data in do_encode() and my minimal patch to get some distorted AC3 sound shrink to:

diff --git a/a52/pcm_a52.c b/a52/pcm_a52.c
index 8f1a654..1403fbd 100644
--- a/a52/pcm_a52.c
+++ b/a52/pcm_a52.c
@@ -118,6 +118,13 @@ static int do_encode(struct a52_ctx *rec)
        AVPacket *pkt = rec->pkt;
        int ret;

+       uint16_t *samples = (uint16_t*)rec->frame->data[0];
+       ret = av_frame_get_buffer(rec->frame, 0);
+       if (ret < 0) {
+               return -EINVAL;
+       }
+       memcpy(rec->frame->data[0], samples, rec->outbuf_size-8);
+
        ret = avcodec_send_frame(rec->avctx, rec->frame);
        if (ret < 0)
                return -EINVAL;

Without this - I've got EINVAL from encoding, so it still needs more work...

jamrial · 2021-06-08T15:00:02Z

Maybe now?

quequotion · 2021-06-08T15:23:16Z

When trying to play audio through an a52 device using pulseaudio, this happens more times per second than I can count:

I: [alsa-sink-] alsa-sink.c: Trying resume...
D: [alsa-sink-] alsa-util.c: Maximum hw buffer size is 177984 ms
D: [alsa-sink-] alsa-util.c: Set buffer size first (to 4608 samples), period size second (to 1536 samples).
D: [alsa-sink-] alsa-sink.c: hwbuf_unused=0
D: [alsa-sink-] alsa-sink.c: setting avail_min=1
I: [alsa-sink-] alsa-sink.c: Disabling rewind_within_thread for device a52:0
I: [alsa-sink-] alsa-sink.c: Resumed successfully...
D: [alsa-sink-] alsa-sink.c: snd_pcm_mmap_commit: Invalid argument
E: [alsa-sink-] alsa-sink.c: snd_pcm_mmap_commit: Invalid argument, trying to restart PCM

speaker-test isn't happy with it either (pulseaudio disabled):

speaker-test -c6 -D a52

speaker-test 1.2.4

Playback device is a52
Stream parameters are 48000Hz, S16_LE, 6 channels
Using 16 octaves of pink noise
Rate set to 48000Hz (requested 48000Hz)
Buffer size range from 3072 to 8543232
Period size range from 1536 to 1536
Using max buffer size 8543232
Periods = 4
was set period_size = 1536
was set buffer_size = 8543232
 0 - Front Left
Write error: -22,Invalid argument
xrun_recovery failed: -22,Invalid argument
Transfer failed: Invalid argument

jamrial · 2021-06-08T15:46:39Z

Unless avcodec_send_frame() or avcodec_receive_packet() are the ones returning an error code, i can't help you. This PR is only trying to port the plugin to the new encode API to be used with ffmpeg versions 4.4 and newer.

Someone familiar with ALSA should adapt this plugin to pass S32 planar audio to the encoder when it expects it, for starters. Otherwise you're not going to get proper AC3 frames out of it.

manio · 2021-06-08T16:26:35Z

@jamrial regarding your last commit with extended_data - no change...
but of course got your points about it... big thanks anyway for this :)

@quequotion
use latest code from this pull request, add my patch and try like this:

wget http://www-mmsp.ece.mcgill.ca/Documents/AudioFormats/WAVE/Samples/Microsoft/6_Channel_ID.wav
aplay -D a52 -f S16_LE -c6 ./6_Channel_ID.wav

jamrial · 2021-06-08T16:47:56Z

@manio What function is failing, avcodec_send_frame or avcodec_receive_packet? And what error code does it return? (Not talking about the hardcoded -EINVAL i made do_encode() propagate)

manio · 2021-06-08T16:58:40Z

@jamrial
error is from avcodec_send_frame() it is returning 0xffffffea

ffmpeg 4.4 is the last version with avcodec_encode_audio2(). Starting from the next release the new decoupled input-output API will be the only one available. BugLink: alsa-project#22 Signed-off-by: James Almer <jamrial@gmail.com>

jamrial · 2021-06-08T17:34:54Z

Can you try again with the latest addition?

manio · 2021-06-08T17:49:38Z

@jamrial
You did it!!! :D
Now avcodec_send_frame and avcodec_receive_packet are not returning error and I can hear AC3 audio without any additional code changes. It is still not correct but I know this is another story (S16 instead of S32). It even plays on all channels on my test wave file - on my code only one channel was playing :)
Big thanks!

It's no longer available starting with LIBAVCODEC_VERSION_MAJOR == 59. Signed-off-by: James Almer <jamrial@gmail.com>

manio · 2021-06-09T03:38:54Z

@perexg
Hi, can you please merge this? Those are necessary API changes to make the a52 plugin compatible with newest ffmpeg 4.4. It is not a complete changes to have a correct sound but a fixes for this should be made on top of this anyway...

quequotion · 2021-06-09T11:34:58Z

Do we have another issue/feature request for the plugin to use S32 instead of S16 yet?

Edit: Not specific to a52, but this looks like it.

We should ~~get that filed and~~ start working on that too.

Thanks for your hard work @jamrial & @manio!

Edit: SwiftKey is the most evil creation in the history of keyboards; link fixed.

perexg · 2021-06-12T17:48:15Z

Could you elaborate the S32 / S16 issue ? The input format is selected here:

alsa-plugins/a52/pcm_a52.c

Line 937 in 2352e4e

rec->format = format;

EDIT: Sorry, bad line. It's output format.

EDIT2: The input format constraint:

alsa-plugins/a52/pcm_a52.c

Line 776 in 2352e4e

unsigned int formats[] = { SND_PCM_FORMAT_S16 };

jamrial · 2021-06-12T18:11:16Z

Could you elaborate the S32 / S16 issue ? The input format is selected here:

alsa-plugins/a52/pcm_a52.c

Line 937 in 2352e4e

rec->format = format;

EDIT: Sorry, bad line. It's output format.

The ac3_fixed encoder in ffmpeg 4.4 and newer takes S32 planar audio as input, but this plugin exclusively handles S16 audio, as seen in the line you linked.
In

alsa-plugins/a52/pcm_a52.c

Line 984 in 2352e4e

rec->av_format = rec->codec->sample_fmts[0];

the plugin sets rec->av_format to what the encoder reports as supported in rec->codec->sample_fmts[0], which used to be AV_SAMPLE_FMT_S16P and became AV_SAMPLE_FMT_S32P in newer ffmpeg releases, but ultimately still feeds the encoder S16 audio.

If you can't get S32 audio from ALSA to pass to the encoder for versions where it expects it, you could use a resampler like libswresample.

perexg · 2021-06-12T18:19:16Z

If you can't get S32 audio from ALSA to pass to the encoder for versions where it expects it, you could use a resampler like libswresample.

Just set the format constrains to S32 according the codec->sample_fmts[0] in

alsa-plugins/a52/pcm_a52.c

Line 776 in 2352e4e

unsigned int formats[] = { SND_PCM_FORMAT_S16 };

. The application will see that the S32 is only accepted and the plug: automatic ALSA plugin can be used for the internal format conversion.

If the av codec accepts multiple formats, pick the more common S16 by default.

jamrial · 2021-06-12T18:23:14Z

I'll leave that to you or someone else familiar with ALSA that can also test such changes. This MR is to ensure the plugin can work with the new encode public API, regardless of what the underlying encoder handles.

jamrial · 2021-06-15T12:23:11Z

If i'm reading this right, it looks like S16 is still being hardcoded in rec->format, in SND_PCM_PLUGIN_DEFINE_FUNC(a52), which is then used in convert_data() and passed to snd_pcm_hw_params_set_format().

perexg · 2021-06-15T12:26:14Z

If i'm reading this right, it looks like S16 is still being hardcoded in rec->format, in SND_PCM_PLUGIN_DEFINE_FUNC(a52), which is then used in convert_data() and passed to snd_pcm_hw_params_set_format().

It's output format (for the packed A52 data), so it should be ok.

jamrial · 2021-06-15T12:52:09Z

Maybe this memset() https://github.com/perexg/alsa-plugins/blob/9868bfa879ffac9c858a5359c6015a37f548f09a/a52/pcm_a52.c#L227

Intel HDMI: PERIOD_SIZE: [32 4272000] PERIOD_BYTES: [128 17088000] PERIODS: [2 32] BUFFER_SIZE: [64 8544000] BUFFER_BYTES: [256 34176000] Selected: PERIOD_SIZE: 768 PERIOD_BYTES: 3072 PERIODS: 32 BUFFER_SIZE: 24576 BUFFER_BYTES: 98304 The a52_hw_params() tries to set the big 4271616 buffer size (frames) for speaker-test which is beyond the maximal slave buffer. BugLink: alsa-project#23 Signed-off-by: Jaroslav Kysela <perex@perex.cz>

The recent a52 encoder in ffmpeg accepts planar S32 (fixed encoder) or planar float (normal encoder) only. Extend the routines to support all combinations of 16-bit, 32-bit and float source samples. BugLink: alsa-project#23 Signed-off-by: Jaroslav Kysela <perex@perex.cz>

perexg · 2021-06-15T13:43:30Z

Maybe this memset() https://github.com/perexg/alsa-plugins/blob/9868bfa879ffac9c858a5359c6015a37f548f09a/a52/pcm_a52.c#L227

Yes, it's bad, but this code is only used for drain (at the end of playback). It's fixed now.

Speaker-test fix: perexg@4f5903b
Latest version for the S32 support: perexg@9bcd762

I found another missing piece in the code (missing rec->filled add for use_planar() code in fill_data()).

manio · 2021-06-15T15:48:22Z

Thank you guys! I can see a big progress :)
I can confirm that indeed the speaker test is now working!
I don't need to kill it and it is iterating over all channels in a loop
but the sound from speaker-test and from aplay is still not 100% clear
let me know if you need a recording for this...

tiwai · 2021-06-15T17:38:37Z

Sorry for being very late to the game, as I've been too busy for other tasks. And thanks for Jaroslav for taking care of this!

I took a quick glance, checked the code, and it looks mostly good, but there are a few small bugs.
I've put my version on top of Jaroslav's latest version in https://github.com/tiwai/alsa-plugins
Please check it out. If this works out, feel free to fold into your commit for the final version.

manio · 2021-06-15T17:48:48Z

@tiwai
just checked your version - I have no sound but silence - both in aplay and speaker-test...

tiwai · 2021-06-15T19:04:12Z

Hm, you can try revert one by one to figure out which one broke; they are only 3 small commits.

manio · 2021-06-16T03:13:28Z

Morning!
sure, @tiwai
first try - reverting from the top to get the sound back:

2bee1d6 (HEAD -> master) Revert "a52: Don't swap bytes for the recent libavcodec for LE"
b5c9f5c Revert "a52: Correct data transfer in planar mode"
0b51a84 Revert "a52: Use av_frame_get_buffer() for buffer allocation"
d4266cd (origin/master, origin/HEAD) a52: Use av_frame_get_buffer() for buffer allocation
[...]

Now I have sound back - now I am trying only to revert this single commit...
Now my tree is:

662906d (HEAD) Revert "a52: Don't swap bytes for the recent libavcodec for LE"
d4266cd (origin/master, origin/HEAD) a52: Use av_frame_get_buffer() for buffer allocation
[...]

Now the best news ever: the sound is crystal clear!!! :D:D

To sum it all: the commit which is wrong is: "a52: Don't swap bytes for the recent libavcodec for LE", after reverting only this one from @tiwai tree I have a correct sound! :)

@quequotion where are you man, can you also confirm if everything is working also on your hardware?

quequotion · 2021-06-16T04:56:16Z

@quequotion where are you man, can you also confirm if everything is working also on your hardware?

On the clock! I'll try some test builds when I get home tonight (~19:00 JST). Most days I'm working 08:30~23:00, including commutes.

Can't believe the progress we've made these last few weeks! After years of struggling to keep surround sound working with hacks, homebrewed packages, and copy pasta configurations, we're a just handful of commits away from ready out-of-the-box functionality.

Looking forward to it!

tiwai · 2021-06-16T06:35:39Z

Good to hear! Now I dropped the wrong patch and re-pushed.
I'm going to send a PR with it.

Since it's used as AVPacket payload, the API requires it to be padded. BugLink: #23 Signed-off-by: James Almer <jamrial@gmail.com> Signed-off-by: Jaroslav Kysela <perex@perex.cz>

BugLink: #23 Signed-off-by: James Almer <jamrial@gmail.com> Signed-off-by: Jaroslav Kysela <perex@perex.cz>

ffmpeg 4.4 is the last version with avcodec_encode_audio2(). Starting from the next release the new decoupled input-output API will be the only one available. BugLink: #23 Signed-off-by: James Almer <jamrial@gmail.com> Signed-off-by: Jaroslav Kysela <perex@perex.cz>

It's no longer available starting with LIBAVCODEC_VERSION_MAJOR == 59. BugLink: #23 Signed-off-by: James Almer <jamrial@gmail.com> Signed-off-by: Jaroslav Kysela <perex@perex.cz>

Intel HDMI: PERIOD_SIZE: [32 4272000] PERIOD_BYTES: [128 17088000] PERIODS: [2 32] BUFFER_SIZE: [64 8544000] BUFFER_BYTES: [256 34176000] Selected: PERIOD_SIZE: 768 PERIOD_BYTES: 3072 PERIODS: 32 BUFFER_SIZE: 24576 BUFFER_BYTES: 98304 The a52_hw_params() tries to set the big 4271616 buffer size (frames) for speaker-test which is beyond the maximal slave buffer. BugLink: #23 Signed-off-by: Jaroslav Kysela <perex@perex.cz>

The recent a52 encoder in ffmpeg accepts planar S32 (fixed encoder) or planar float (normal encoder) only. Extend the routines to support all combinations of 16-bit, 32-bit and float source samples. BugLink: #23 Signed-off-by: Jaroslav Kysela <perex@perex.cz>

perexg · 2021-06-16T06:57:31Z

Fixed via #26 . If you have a problem, please, open a new ticket.

manio · 2021-06-16T08:13:40Z

Thank you all very much! :)

perexg · 2021-06-16T15:52:17Z

If you like, please, retest with the current master branch. I pushed more cleanups there. The functionality should not be changed, but the more testing would be nice.

quequotion · 2021-06-16T16:15:51Z

Just building those updates now, was about to post that I had a few problems. Not sure if they are problems with this code though:

First, I can't get aplay to make a six-channel output from the LynnePublishing test file (static is rendered in mono), although it does with @manio's test file. On top of that, the speaker announcements from manio's file come out of the wrong speakers, but that could be because I haven't told aplay what order the speakers are in.

Then, pulseaudio just barely starts (had to comment out a lot of custom configuration) and then crashes the moment something plays audio through the a52 plugin.

Edit: Crashing persists even with a completely default pulseaudio configuration.

At first I thought this was because it seems to insist on reloading the plugin's sink with a different sample format (it creates the sink with s32le, then reloads with float32le), but even with a client (mpv) configured to use s32le, it still crashes as soon as audio is loaded.

Hopefully the cleanups help! I'll be back in a few minutes.

Edit: Bad news, pulseaudio still crashes any time audio is played through the plugin. Here's a log, note that it is 1874 lines long and mostly junk. You should probably work your way up from the bottom.

perexg · 2021-06-16T16:35:50Z

Don't use .ac3 file for tests. It's already the compressed format (like output from the a52 plugin). You need to convert them back to a PCM format before use (for aplay).

quequotion · 2021-06-16T16:41:46Z

That makes sense.

Still, either file is crashing pulseaudio where both should work there (edit: because the file encoding is irrelevant to pulseaudio, which should be sending the ouput of its client, mpv in this case, to plug:a52).

I'll see if I can find a specific commit that might be the cause (although audio quality was terrible, pulseaudio wasn't crashing yesterday).

Edit: pulseaudio would last output through the a52 plugin without crashing as of c12a6ae

manio · 2021-06-17T04:39:56Z

@perexg
tested current master (with libswresample converstion commit) and I still have good sound with aplay (using plug:a52encode though) because without it, it is telling me about sample format not available, but I assume this is ok in the light of our discussion.
speaker-test is also fine. I am still using pure alsa and did not configure pulseaudio yet - so for now I think that @quequotion is the right tester for this stuff...

@quequotion
If you are telling that c12a6ae was the last good one, so maybe you can also do similar thing as I did - trying to revert single commits one by one to get into the first which is starting the PA crashing issue?

quequotion · 2021-06-17T09:17:32Z

revert single commits one by one to get into the first which is starting the PA crashing issue?

My meaning was that the next commit, f11e7a8, which changes the plugin to use s32le, is where it breaks down.

I did try to revert just that commit to test the later ones, but git wanted to do a manual merge and I decided to go to bed instead. Might have time for it tonight.

This troubles me because it seems like it's woking for ALSA, but pulseaudio has a problem with it.

I don't think pulseaudio has any code specific to this plugin, so I wouldn't expect its developers to take on working around changes to it.

On the other hand, it seems to be working well enough for ALSA-only setups, so it might seem like this is an issues that can be passed to the pulseaudio team to deal with.

I think the problem results from the recent commits and needs to be fixed in alsa-plugins, but maybe I should open a new issue for this.

pcm_a52: add some padding bytes to outbuf

e855eba

Since it's used as AVPacket payload, the API requires it to be padded. Signed-off-by: James Almer <jamrial@gmail.com>

jamrial force-pushed the new_lavc_api branch from d262dc2 to 627e582 Compare June 7, 2021 01:17

manio reviewed Jun 7, 2021

View reviewed changes

a52/pcm_a52.c Outdated Show resolved Hide resolved

pcm_a52: propagate errors from do_encode()

05a5e19

Signed-off-by: James Almer <jamrial@gmail.com>

jamrial force-pushed the new_lavc_api branch from 627e582 to 3aec2b2 Compare June 7, 2021 11:55

jamrial force-pushed the new_lavc_api branch from 3aec2b2 to 8f67485 Compare June 8, 2021 13:05

jamrial force-pushed the new_lavc_api branch from 8f67485 to 7c22dd0 Compare June 8, 2021 14:59

pcm_a52: support the new libavcodec encode API

5db4841

ffmpeg 4.4 is the last version with avcodec_encode_audio2(). Starting from the next release the new decoupled input-output API will be the only one available. BugLink: alsa-project#22 Signed-off-by: James Almer <jamrial@gmail.com>

jamrial force-pushed the new_lavc_api branch from 7c22dd0 to 5db4841 Compare June 8, 2021 17:33

pcm_a52: don't call avcodec_register_all() on newer libavocdec versions

66b1db7

It's no longer available starting with LIBAVCODEC_VERSION_MAJOR == 59. Signed-off-by: James Almer <jamrial@gmail.com>

manio mentioned this pull request Jun 12, 2021

Config parsing of pcm_rate broken since 1.1.7 (at least for a52 pcm plugin) alsa-project/alsa-lib#154

Closed

perexg pushed a commit that referenced this pull request Jun 16, 2021

a52: propagate errors from do_encode()

7d9d227

BugLink: #23 Signed-off-by: James Almer <jamrial@gmail.com> Signed-off-by: Jaroslav Kysela <perex@perex.cz>

perexg closed this Jun 16, 2021

manio mentioned this pull request Jun 16, 2021

a52: not compatible with recent avcodec #22

Closed

quequotion mentioned this pull request Jun 17, 2021

a52 plugin crashing pulseaudio #28

Closed

New lavc api #23

New lavc api #23

Conversation

jamrial commented Jun 6, 2021

quequotion commented Jun 8, 2021 • edited

manio commented Jun 8, 2021

manio commented Jun 8, 2021

jamrial commented Jun 8, 2021

manio commented Jun 8, 2021

jamrial commented Jun 8, 2021

quequotion commented Jun 8, 2021 • edited

jamrial commented Jun 8, 2021

manio commented Jun 8, 2021

jamrial commented Jun 8, 2021

manio commented Jun 8, 2021

jamrial commented Jun 8, 2021

manio commented Jun 8, 2021

manio commented Jun 9, 2021

quequotion commented Jun 9, 2021 • edited

perexg commented Jun 12, 2021 • edited

jamrial commented Jun 12, 2021

perexg commented Jun 12, 2021

jamrial commented Jun 12, 2021

jamrial commented Jun 15, 2021

perexg commented Jun 15, 2021

jamrial commented Jun 15, 2021

perexg commented Jun 15, 2021

manio commented Jun 15, 2021

tiwai commented Jun 15, 2021

manio commented Jun 15, 2021

tiwai commented Jun 15, 2021

manio commented Jun 16, 2021

quequotion commented Jun 16, 2021 • edited

tiwai commented Jun 16, 2021

perexg commented Jun 16, 2021

manio commented Jun 16, 2021

perexg commented Jun 16, 2021

quequotion commented Jun 16, 2021 • edited

perexg commented Jun 16, 2021 • edited

quequotion commented Jun 16, 2021 • edited

manio commented Jun 17, 2021

quequotion commented Jun 17, 2021 • edited

quequotion commented Jun 8, 2021 •

edited

quequotion commented Jun 8, 2021 •

edited

quequotion commented Jun 9, 2021 •

edited

perexg commented Jun 12, 2021 •

edited

quequotion commented Jun 16, 2021 •

edited

quequotion commented Jun 16, 2021 •

edited

perexg commented Jun 16, 2021 •

edited

quequotion commented Jun 16, 2021 •

edited

quequotion commented Jun 17, 2021 •

edited