Support Speaker Diarization #26

steveway · 2021-06-04T13:51:30Z

Hello,
As you can see here I've started integrating this project into Papagayo-NG:
morevnaproject-org/papagayo-ng#49
The first results from my tests seem to be very promising.
Especially the new timestamp feature is helping a lot with that.

Is it possible to add some speaker separation to this?
Papagayo-NG itself allows several speakers for one audio file.
If we could recognize which parts are spoken by which speaker, that would make this a really nice solution for even more animators.
I've taken a look at the topic, and it seems to be quite complex.
If this could be integrated into Allosaurus, that would be awesome of course.
If not, there are other ways to get this into Papagayo-NG: we could do a separate pass over the audio.
I've taken a look and pyAudioAnalysis seems to already do that.
But that would be a big dependency addition.
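
A minimal sketch of the separate-pass idea, under stated assumptions: the timestamped phones arrive as `start duration phone` lines, and some diarization tool has already produced sorted, non-overlapping `(start, end, speaker)` segments. The helper names `parse_phones` and `assign_speakers` are hypothetical and not part of Allosaurus, Papagayo-NG, or pyAudioAnalysis.

```python
from bisect import bisect_right


def parse_phones(text):
    """Parse 'start duration phone' lines into (start, duration, phone) tuples.

    The three-column layout is an assumption about the timestamp output.
    """
    phones = []
    for line in text.strip().splitlines():
        start, duration, phone = line.split()
        phones.append((float(start), float(duration), phone))
    return phones


def assign_speakers(phones, segments):
    """Label each phone with the speaker whose segment contains its midpoint.

    segments: (start, end, speaker) tuples, sorted by start and non-overlapping.
    Phones that fall outside every segment are labeled None.
    """
    starts = [seg[0] for seg in segments]
    labeled = []
    for start, duration, phone in phones:
        mid = start + duration / 2
        # Index of the last segment that starts at or before the midpoint.
        i = bisect_right(starts, mid) - 1
        speaker = None
        if i >= 0 and mid < segments[i][1]:
            speaker = segments[i][2]
        labeled.append((start, duration, phone, speaker))
    return labeled
```

With labels like these, each speaker's phones could be routed to a separate voice in the animation tool. Matching on the midpoint keeps the lookup simple, at the cost of mislabeling phones that straddle a speaker change.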

xinjli · 2021-06-08T13:32:38Z

Hi, thanks for your suggestion!

Unfortunately, speaker diarization is a very different task from the current recognition task.
We have no plans to add a diarization model.

However, there are several repositories that do diarization;
you can have a look at them here: https://github.com/topics/speaker-diarization

I have personally used the following one before. It performs well but requires some additional effort to get working.
https://github.com/google/uis-rnn

steveway · 2021-06-09T12:49:51Z

I see, that makes sense, thank you.
I'll experiment with integrating other tools like that then.
I guess this Issue can then be closed.

steveway closed this as completed Jun 9, 2021
