Any Idea for feature update for targeted speaker transcription? #53

Jeevi10 · 2024-01-25T15:58:55Z

No description provided.

Gldkslfmsd · 2024-01-25T16:04:46Z

hi, what do you mean by "targeted speaker transcription"?
I think that if you use an underlying model that supports it, instead of the default Whisper, then it will work.

Jeevi10 · 2024-01-25T16:12:54Z

Thank you for the prompt response. I mean, suppose multiple speakers are talking in a noisy environment, with people speaking in the background.
I would like to transcribe only the main speaker (the one close to the mic). I tested "mic_test_whisper_simple.py" and it works very well, but it tends to capture all the background noise (still speech, but far from the mic) in between my speech.

Gldkslfmsd · 2024-01-25T16:42:26Z

OK, I understand.

This is not a streaming issue, but one of audio pre-processing or ASR modelling. You need to ask elsewhere for such a model, and then integrate it for streaming the same way as Whisper.
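
As a rough illustration of the pre-processing route, here is a minimal sketch of an energy gate that silences low-level frames (for example, distant background speech) before a chunk is handed to the ASR. It is not part of this repository: the function name `energy_gate`, the 20 ms frame size, and the -35 dBFS threshold are all assumptions for illustration, and a fixed threshold would need tuning per microphone and room. It will not separate overlapping speakers; a dedicated target-speaker or speech-enhancement model is needed for that.

```python
import numpy as np

def energy_gate(audio, sample_rate=16000, frame_ms=20, threshold_db=-35.0):
    """Zero out frames whose RMS level falls below threshold_db (dBFS).

    Hypothetical helper: assumes mono float audio scaled to [-1, 1].
    """
    audio = np.asarray(audio, dtype=np.float32)
    frame_len = int(sample_rate * frame_ms / 1000)
    out = audio.copy()
    for start in range(0, len(audio), frame_len):
        frame = audio[start:start + frame_len]
        # RMS level of this frame, floored to avoid log10(0)
        rms = float(np.sqrt(np.mean(frame ** 2)))
        level_db = 20.0 * np.log10(max(rms, 1e-10))
        if level_db < threshold_db:
            # Treat quiet frames as background and silence them
            out[start:start + frame_len] = 0.0
    return out
```

The gated chunk can then be passed to the ASR in place of the raw microphone chunk. Hard gating can clip quiet word onsets from the main speaker, so a real setup would likely add some hangover or smoothing around kept frames.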

Jeevi10 changed the title ~~Any Idea feature update for Target speaker transcription?~~ Any Idea for feature update for targeted speaker transcription? Jan 25, 2024

Gldkslfmsd closed this as completed Jan 25, 2024
