Fix echo cancellation bug + add command line option to dump AudioInput streams #4167

fedetft · 2020-05-12T19:16:21Z

Dear mumble developers,
I think I found a bug in the echo canceller.

Basically, the way the current mumble code works is to put speaker readback samples in a queue (qlEchoFrames) in the addEcho() callback, and then in the addMic() callback take both the microphone and speaker samples and pass them to the echo canceller.

However,
https://www.speex.org/docs/manual/speex-manual/node7.html#SECTION00740000000000000000
explicitly says that "It is important that, at any time, any echo that is present in the input has already been sent to the echo canceller as echo_frame."
and adding a queue to the speaker samples makes them arrive after the microphone ones.

As a result, the echo canceller is only effective against periodic signals, but not for voice.

To verify that, the first commit of this pull request,
fedetft@b3aa5de
adds a command line option to tap and synchronously dump pcm streams of the raw microphone, speaker readback, and processed microphone.

The result is shown in this image: https://imgur.com/a/FpeB6Mp

The top figure shows the original mumble code with only the profiling patch applied.
As can be seen, the echo canceller receives the speaker readback with a large delay (I measured up to 300 ms), and is thus effective only against periodic sounds, not voice audio.

The bottom figure shows the effect of this pull request: a 20 ms lead is forced by delaying the audio path, so that the echo canceller is reasonably certain to receive the data in the correct order even when callbacks are jittery.
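The idea of forcing a lead can be pictured with a minimal C++ sketch. This is not the Resynchronizer class from this pull request: `ToyResync`, its method names, the queue bound, and the choice of a two-frame lag (20 ms if frames are 10 ms long) are illustrative assumptions.

```cpp
#include <cstddef>
#include <deque>
#include <optional>
#include <utility>
#include <vector>

using Frame = std::vector<short>; // one block of PCM samples (assumed 10 ms)

// Hypothetical sketch: holds microphone frames back by `lagFrames` frames so
// that each speaker frame is handed to the echo canceller before the
// microphone audio that may contain its echo.
class ToyResync {
public:
    explicit ToyResync(std::size_t lagFrames) : lag_(lagFrames) {}

    // Microphone callback: only queue the frame.
    void addMic(Frame mic) {
        micQueue_.push_back(std::move(mic));
        // Bound the queue so a stalled speaker callback cannot grow it forever.
        if (micQueue_.size() > 2 * lag_ + 1) micQueue_.pop_front();
    }

    // Speaker readback callback: pairs the newest speaker frame with the
    // oldest queued mic frame once more than `lagFrames` mic frames are
    // waiting. Until then the speaker frame is dropped (startup transient).
    std::optional<std::pair<Frame, Frame>> addSpeaker(Frame speaker) {
        if (micQueue_.size() <= lag_) return std::nullopt;
        Frame mic = std::move(micQueue_.front());
        micQueue_.pop_front();
        return std::make_pair(std::move(mic), std::move(speaker));
    }

private:
    std::size_t lag_;
    std::deque<Frame> micQueue_;
};
```

With `ToyResync r(2)` and alternating callbacks, the first pair returned is (mic frame 0, speaker frame 2), so the microphone path trails the speaker path by two frames.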

The patch has been tested with the mixed echo cancellation on Linux with PulseAudio.

The patch fixes the issue but some more work needs to be done, in particular either the multichannel echo cancellation is broken and passes garbage data to the echo canceller, or I didn't understand how the PCM streams are passed. In any case, it does not seem to cancel echo, so there appears to be an issue, although I haven't looked it up yet.

Moreover, I could not work out whether the addMic() and addEcho() callbacks can be called concurrently by multiple threads, and thus whether a mutex is required in the Resynchronizer class I added.

fedetft · 2020-05-13T08:22:28Z

Closed and reopened as the CI was failing due to network errors, but it appears every time there's at least one target that has network issues.

Don't know what's wrong with translations.

src/mumble/main.cpp

Krzmbrzl · 2020-05-13T08:26:30Z

Closed and reopened as the CI was failing due to network errors, but it appears every time there's at least one target that has network issues.

The issue is actually a different one and closing and reopening doesn't do anything (I think). Anyways I restarted the CI.
As for the translations: See my review :)

streaps · 2020-05-13T10:56:54Z

The patch fixes the issue but some more work needs to be done, in particular either the multichannel echo cancellation is broken and passes garbage data to the echo canceller, or I didn't understand how the PCM streams are passed. In any case, it does not seem to cancel echo, so there appears to be an issue, although I haven't looked it up yet.

Last time I asked how multichannel echo cancellation works and what it is good for, nobody could explain it. Does anybody care about that (broken) feature, or could it be removed?

The OpusFAQ recommends the Google AEC from WebRTC.
https://wiki.xiph.org/index.php?title=OpusFAQ&mobileaction=toggle_view_desktop#Does_Opus_have_an_echo_canceller_like_Speex_does.3F

fedetft · 2020-05-13T11:13:59Z

My branch is called webrtc because the original intention was to replace mumble's echo cancellation algorithm, which to me at least performed poorly, with webrtc's Google AEC.

However, when trying to understand the code I found this bug, and after fixing it, it looks like libspeexdsp's echo canceller isn't bad at all, it was just not operating due to a precondition violation.

As for the multichannel echo canceller and the possibility of removing it, I see a UX/UI issue and a code issue.

The UX issue is that the difference between mixed and multichannel is not at all clear to the end user. I only understood it after reading the C++ code...
If you want to keep it, it would be good to add some text like
"Mixed echo cancellation mixes all speaker outputs in one mono stream and passes that stream to the echo canceller, while multichannel echo cancellation passes all audio channels to the echo canceller directly.
Multichannel echo cancellation requires more CPU, so you should try mixed first"
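To make the "mixed" case concrete, here is a minimal sketch of a downmix from interleaved multichannel PCM to mono. The function name `downmixToMono`, the interleaved layout, and the use of a plain average are assumptions for illustration; the actual mixing in Mumble may differ.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical helper: averages interleaved multichannel PCM into one mono
// stream, which is the kind of signal a "mixed" echo canceller would see.
std::vector<short> downmixToMono(const std::vector<short> &interleaved, std::size_t channels) {
    if (channels == 0) return {};
    const std::size_t frames = interleaved.size() / channels;
    std::vector<short> mono(frames);
    for (std::size_t i = 0; i < frames; ++i) {
        int sum = 0; // wider type so the sum of several channels cannot overflow
        for (std::size_t c = 0; c < channels; ++c) sum += interleaved[i * channels + c];
        mono[i] = static_cast<short>(sum / static_cast<int>(channels));
    }
    return mono;
}
```

In the multichannel case no such step is applied and every channel is handed to the canceller separately, which is where the extra CPU cost comes from.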

The code issue is that it looks like the multichannel pcm data is somehow corrupted. Since the echo readback is never heard by anyone and only passed to the echo canceller, it was difficult to spot this bug, until I added the --dump-input-streams command line option.

streaps · 2020-05-13T12:04:22Z

thanks for the explanation.

Krzmbrzl

For the actual review, we'll have to wait for @davidebeatrici as I don't really know the audio system...

src/mumble/AudioInput.cpp

fedetft · 2020-05-15T07:23:05Z

Update: yesterday I started having a look at why the multichannel echo canceller receives garbage data. I found a small issue but the signal didn't improve at all.
Should I add the commits related to the multichannel echo canceller to this PR?

Krzmbrzl · 2020-05-15T07:30:01Z

I think it'd be a good idea, yes. This facilitates the work of a potential future contributor that wants to look into this issue :)

fedetft · 2020-05-15T07:50:16Z

I think that to make it easy for future contributors to perform regression tests I would need to write somewhere how to use the --dump-input-streams option to view in Audacity the various signals being passed to AudioInput's dsp algorithms. I don't know if a pull request is the best place for that, though.

Krzmbrzl · 2020-05-15T08:27:10Z

You can create a new markdown document in docs for that purpose ☝️
EDIT: And maybe reference it in a comment somewhere in the related code sections :)

fedetft · 2020-05-15T22:01:44Z

Found it! here's the bug that was corrupting the pcm data for the multichannel echo cancellation!
The addEcho function is called with smaller chunks of data, and accumulates them until it fills an entire 10ms of data, then the buffer is passed to the signal processing chain.

For every chunk, the multichannel code path always started from the beginning of the buffer, thus not accumulating anything. This overwrote data and left the end of the buffer uninitialized too. No wonder I was hearing garbage when playing that data...

Now multichannel echo works.
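The accumulation pattern described above can be sketched as follows. This is an illustrative reconstruction, not the code from the PR: `FrameAccumulator` and its interface are hypothetical names.

```cpp
#include <algorithm>
#include <cstddef>
#include <vector>

// Hypothetical accumulator: collects arbitrarily sized chunks into
// fixed-size frames, remembering how much of the frame is already filled.
class FrameAccumulator {
public:
    explicit FrameAccumulator(std::size_t frameSamples)
        : buffer_(std::max<std::size_t>(frameSamples, 1)), filled_(0) {}

    // Appends `count` samples and returns every complete frame produced.
    std::vector<std::vector<short>> push(const short *data, std::size_t count) {
        std::vector<std::vector<short>> frames;
        while (count > 0) {
            const std::size_t n = std::min(count, buffer_.size() - filled_);
            // Write at the current fill offset. Writing at buffer_.begin()
            // for every chunk overwrites earlier data and never fills the tail.
            std::copy(data, data + n, buffer_.begin() + filled_);
            filled_ += n;
            data += n;
            count -= n;
            if (filled_ == buffer_.size()) {
                frames.push_back(buffer_); // hand a full frame downstream
                filled_ = 0;
            }
        }
        return frames;
    }

private:
    std::vector<short> buffer_;
    std::size_t filled_;
};
```

The only state that matters is the fill offset: forgetting to advance it reproduces the symptom of overwritten samples at the start of the frame and stale data at the end.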

fedetft · 2020-05-16T08:49:41Z

Added documentation in the docs directory. Should help make sure issues in the echo canceller don't get reintroduced.

Krzmbrzl

Could you also document the different functions and member fields in the header files using Doxygen /// comments, please?
I know that this is not done in the existing code but it's an ongoing quest of ours to improve the state of the in-source documentation :)

And if you're done with the PR, please squash the commits. I think here it'd be a good idea to make these 3 commits:

Addition of the new cli options + documentation
Fix of mixed channel echo cancellation
Fix of multichannel echo cancellation

We also have a practice of prefixing our commit messages with the changed path, e.g.

src/mumble/AudioInput: <your message>

You can have a look at other commits in this repo to see what I mean by that as well :)

src/mumble/AudioInput.h

Krzmbrzl · 2020-05-17T15:45:20Z

src/mumble/AudioInput.h

+private:
+	void printQueue(char who);
+
+	//TODO: there was a mutex (qmEcho), but can the callbacks be called concurrently? 


I don't know. @davidebeatrici might know better though :)

As far as I know they are never called concurrently.

fedetft · 2020-05-18T19:12:38Z

Had to make four commits as the documentation covers also part of the other commits.
For me this PR is complete.

Krzmbrzl · 2020-05-18T19:42:50Z

Alright thank you!
I'll try to have a look at this in the coming days :)

Krzmbrzl · 2020-05-19T18:59:47Z

What's the best approach to verifying that the echo canceller works as expected? Can I even do this locally on a single machine? 🤔

fedetft · 2020-05-20T07:58:30Z

Good question: I've pushed another commit: it only touches the docs and adds a step-by-step guide to reproducibly verify the correct operation of the echo canceller. I think you may find it useful also in the future to avoid regressions.

By doing the test before and after the patches that fix the echo, you can appreciate the effect of the changes.

Krzmbrzl · 2020-05-20T11:02:40Z

Hm okay I tried it out using the built-in microphone of my laptop and its speakers but it doesn't seem to do all that much. I clearly see that something was done (especially noise reduction) but I still hear most parts of the YouTube video.
I guess though that this might be because the built-in microphone I was using is really shit. Probably the noise was so intense that the echo canceller was confused by it. I'll try again with my headset.

Could someone else please also try this out and report back here?

EDIT: The results with my normal headset are pretty much the same. Significant parts of the YT video are still audible...

TerryGeng · 2020-05-20T14:55:09Z

Hi! I'm amazed by your work, and it also explained why echo cancellation wasn't working for us. I'm not actually a mumble developer, but I have just read your code and have a small question.

I have seen your finite state design that is used to keep the speaker audio chunk ahead of mic chunk. But then I read these lines:

mumble/src/mumble/AudioInput.cpp

Lines 77 to 81 in 0c07744

    
    if(drop == false)
    {
        result = AudioChunk(micQueue.front(), speaker);
        micQueue.pop_front();
    }

It seems that you have assumed the speaker audio chunk comes later than the mic chunk, so you wait until mic chunks pile up and then return the "now" speaker chunk together with a "past" mic chunk, which looks like just the opposite of requiring that the speaker chunk come earlier than the mic chunk.

I'm not sure what I have misunderstood in your code, and I'd appreciate your reply!

fedetft · 2020-05-25T08:01:33Z

@davidebeatrici
The use of alloca() is to minimize the number of dynamic memory allocations in code that is called 200 times a second. I've tried to allocate on the stack all the buffers that don't have to outlive the stack frame they're in, which is true for anything but the echo cancel queue.
This way you get better performance and you also can't forget a delete and leak memory.
Some of those can be replaced with just an array declaration on the stack, but in some cases the array size is a variable, and I was getting CI errors with MSVC as it doesn't support C99 style variable length arrays.

In the long run, it would be best to just wrap the buffers in an object that manages its lifetime, but such a change would be too invasive and is unrelated to fixing the echo issue. Moreover, if there are rumors about rewriting everything, would a separate PR that refactors the code even be useful or just a waste of time?
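As a rough illustration of the "object that manages its lifetime" idea, here is a minimal sketch of a reusable scratch buffer. It is not part of this PR: `ScratchBuffer` and its methods are hypothetical, and whether it matches the performance of alloca() in the audio callbacks is untested.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical scratch buffer: owns its memory, frees it automatically, and
// only allocates when a larger size than any seen before is requested.
class ScratchBuffer {
public:
    // Returns a pointer to at least `samples` floats, valid until the next call.
    float *acquire(std::size_t samples) {
        if (storage_.size() < samples) storage_.resize(samples);
        return storage_.data();
    }

    // Number of floats currently held.
    std::size_t capacity() const { return storage_.size(); }

private:
    std::vector<float> storage_;
};
```

Kept as a member of the object that runs the callback, such a buffer would allocate once per size increase rather than once per call, and it works with variable sizes on compilers that lack C99-style variable length arrays.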

About the mutex, I can remove it if you're sure it's not needed, but I can't test on any platform other than linux/pulseaudio.

Krzmbrzl · 2020-05-25T11:20:49Z

if there are rumors about rewriting everything, would a separate PR that refactors the code even be useful or just a waste of time?

I think for now you should leave it until we have decided on what to do about the rewrite/refactor

About the mutex, I can remove it if you're sure it's not needed, but I can't test on any platform other than linux/pulseaudio.

Does it hurt to have it there? If not I think I'd play it safe and just leave it as it is 🤔

fedetft · 2020-05-25T17:17:36Z

Leaving a potentially unnecessary mutex is a missed performance optimization.
There was a mutex also before, so this code is no slower than the previous one.

Krzmbrzl · 2020-05-25T17:27:53Z

Yeah but if it turns out that there is some concurrency in there after all (well hidden in layers of cryptic code) we'll have a problem xD

But as I don't know the audio code, I'll leave the decision about that up to you and @davidebeatrici :)

fedetft · 2020-05-25T17:55:56Z

I've removed it locally and will test it this evening on Linux.
If @davidebeatrici is sure to remove it and it doesn't segfault on my machine, I'll push the change as a separate commit (so as to keep the previous version in the history).

Krzmbrzl

One last thing: I think the last commit should be documentation only. The code style fixes should be squashed into the other commits ☝️

After that though I'd say we can merge this :)

fedetft · 2020-05-26T07:36:01Z

I don't know of an automated way in git to do that squash without going line by line through the previous commits to change the parentheses and spaces, which, given the scale of the changes requested, is not a productive use of my time.

Please consider setting up an automated code formatting tool in the future. Personally I'm used to a completely different code style and this part of the review process is becoming a barrier to future contributions.

Krzmbrzl · 2020-05-26T10:26:24Z

You can simply reset all commits and add the respective files and commit them in the chunks you want. That way you don't have to do line by line edits.

fedetft · 2020-05-26T17:29:32Z

Ok,
the final version of main.cpp uses a variable that has been introduced only in the first echo cancellation fix, so it can't stay in its own commit without creating a commit that won't compile, and the two echo cancellation fixes touch the same file, so that means the only option is to make a single commit with all the code, and one with the documentation.
Apparently the issue is just that I value the git history much more than you guys, but if that's how you want your commits, that's fine for me too.

Krzmbrzl · 2020-05-26T18:04:07Z

Thank you very much for your contribution! Much appreciated :)

davidebeatrici added audio client labels May 12, 2020

fedetft closed this May 13, 2020

fedetft reopened this May 13, 2020

Krzmbrzl requested changes May 13, 2020

src/mumble/main.cpp (resolved)

fedetft force-pushed the webrtc branch from 9c5956b to 2d52fa4 Compare May 13, 2020 09:00

Krzmbrzl reviewed May 14, 2020

src/mumble/AudioInput.cpp (outdated, resolved)

toby63 mentioned this pull request May 15, 2020

Echo cancellation - explain difference #4125

Closed

fedetft mentioned this pull request May 16, 2020

Enable echo cancellation by default #4178

Closed

Krzmbrzl requested changes May 17, 2020

Krzmbrzl added the backport-needed label May 17, 2020

fedetft force-pushed the webrtc branch from bba8c9d to 428e555 Compare May 18, 2020 19:10

fedetft mentioned this pull request May 18, 2020

Test and improve RNNoise and add some info in UI & wiki #4181

Closed

fedetft mentioned this pull request May 25, 2020

Mumble rewrite #4195

Closed

davidebeatrici approved these changes May 25, 2020

Krzmbrzl requested changes May 26, 2020

fedetft added 2 commits May 26, 2020 19:22

src/mumble/AudioInput: Fix echo cancellation, added options to dump i…

62eeefc

…nput audio data and echo canceller queue state

src/mumble/AudioInput: Documentation

664437c

fedetft force-pushed the webrtc branch from 8edb3df to 664437c Compare May 26, 2020 17:25

Krzmbrzl approved these changes May 26, 2020

Krzmbrzl merged commit 12ce17e into mumble-voip:master May 26, 2020

Krzmbrzl pushed a commit to Krzmbrzl/mumble that referenced this pull request May 26, 2020

src/mumble/AudioInput: Fix echo cancellation, added options to dump i…

f3017c4

…nput audio data and echo canceller queue state (Backported from mumble-voip#4167 and adapted to work with 1.3.x)

Krzmbrzl mentioned this pull request May 26, 2020

Backport "Fix echo cancellation bug + add command line option to dump AudioInput streams" #4208

Closed

Krzmbrzl pushed a commit to Krzmbrzl/mumble that referenced this pull request May 26, 2020

src/mumble/AudioInput: Fix echo cancellation, added options to dump i…

fc5798c

…nput audio data and echo canceller queue state (Backported from mumble-voip#4167 and adapted to work with 1.3.x)

Krzmbrzl pushed a commit to Krzmbrzl/mumble that referenced this pull request May 27, 2020

src/mumble/AudioInput: Fix echo cancellation, added options to dump i…

1a28e47

…nput audio data and echo canceller queue state (Backported from mumble-voip#4167 and adapted to work with 1.3.x)

Krzmbrzl pushed a commit to Krzmbrzl/mumble that referenced this pull request May 27, 2020

src/mumble/AudioInput: Fix echo cancellation, added options to dump i…

6fe858e

…nput audio data and echo canceller queue state (Backported from mumble-voip#4167 and adapted to work with 1.3.x)

Krzmbrzl pushed a commit to Krzmbrzl/mumble that referenced this pull request May 27, 2020

src/mumble/AudioInput: Fix echo cancellation, added options to dump i…

7f11c9d

…nput audio data and echo canceller queue state (Backported from mumble-voip#4167 and adapted to work with 1.3.x)

Krzmbrzl pushed a commit to Krzmbrzl/mumble that referenced this pull request May 28, 2020

src/mumble/AudioInput: Fix echo cancellation, added options to dump i…

21fac5b

…nput audio data and echo canceller queue state (Backported from mumble-voip#4167 and adapted to work with 1.3.x)

Krzmbrzl pushed a commit to Krzmbrzl/mumble that referenced this pull request May 28, 2020

src/mumble/AudioInput: Fix echo cancellation, added options to dump i…

05a331d

…nput audio data and echo canceller queue state (Backported from mumble-voip#4167 and adapted to work with 1.3.x)

Krzmbrzl pushed a commit to Krzmbrzl/mumble that referenced this pull request May 28, 2020

src/mumble/AudioInput: Fix echo cancellation, added options to dump i…

5581cdd

…nput audio data and echo canceller queue state (Backported from mumble-voip#4167 and adapted to work with 1.3.x)

Krzmbrzl mentioned this pull request Jun 1, 2020

Doesn't work when selecting "use echo cancellation“ on Windows 10 #3923

Closed

Krzmbrzl removed the backport-needed label Jun 2, 2020

TerryGeng mentioned this pull request Jun 18, 2020

RNNoise and EchoCanceler not working together nicely!? #4302

Closed

toby63 mentioned this pull request Jun 18, 2020

[Question] How can one download builds from azure? #4303

Closed
