Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Suppress non-speech-related token outputs #473

Merged

Conversation

shibukazu
Copy link
Contributor

Problem

  • Speech unrelated outputs like "(笑)", "[Bell]" often emerge.

Change

@RndyP
Copy link

RndyP commented Feb 5, 2023

I was suggesting this also. Good idea. However, shouldn't this be controlled by a bool in whisper_full_params ?

@shibukazu
Copy link
Contributor Author

Your point is right, I will add the parameter to be able to control suppression.

@shibukazu
Copy link
Contributor Author

shibukazu commented Feb 5, 2023

I noticed that "non-speech-tokens suppression" makes the decoding speed worse in the case input audio file doesn't have any voice (only silent audio).
I think this is because "non-speech-tokens suppression" makes it difficult to express silent state (e.g. [SILENT], [無音] )
Does anyone have good idea?

I realized that by setting temperature_inc parameter as 0, users can mitigate this kind of decoding speed degradation.
That's why this is not a problem.

@ggerganov ggerganov merged commit cfc06bf into ggerganov:master Feb 8, 2023
ggerganov added a commit that referenced this pull request Feb 15, 2023
This seems to be causing hallucinations in the end of the audio, e.g.:

"Thank you for listening"
"Amen"
..
rock3125 pushed a commit to rock3125/whisper.cpp that referenced this pull request Feb 21, 2023
* add non-speech-token suppression

* add suppress non-speech_tokens param
rock3125 pushed a commit to rock3125/whisper.cpp that referenced this pull request Feb 21, 2023
…#473)

This seems to be causing hallucinations in the end of the audio, e.g.:

"Thank you for listening"
"Amen"
..
anandijain pushed a commit to anandijain/whisper.cpp that referenced this pull request Apr 28, 2023
* add non-speech-token suppression

* add suppress non-speech_tokens param
anandijain pushed a commit to anandijain/whisper.cpp that referenced this pull request Apr 28, 2023
…#473)

This seems to be causing hallucinations in the end of the audio, e.g.:

"Thank you for listening"
"Amen"
..
jacobwu-b pushed a commit to jacobwu-b/Transcriptify-by-whisper.cpp that referenced this pull request Oct 24, 2023
* add non-speech-token suppression

* add suppress non-speech_tokens param
jacobwu-b pushed a commit to jacobwu-b/Transcriptify-by-whisper.cpp that referenced this pull request Oct 24, 2023
…#473)

This seems to be causing hallucinations in the end of the audio, e.g.:

"Thank you for listening"
"Amen"
..
jacobwu-b pushed a commit to jacobwu-b/Transcriptify-by-whisper.cpp that referenced this pull request Oct 24, 2023
* add non-speech-token suppression

* add suppress non-speech_tokens param
jacobwu-b pushed a commit to jacobwu-b/Transcriptify-by-whisper.cpp that referenced this pull request Oct 24, 2023
…#473)

This seems to be causing hallucinations in the end of the audio, e.g.:

"Thank you for listening"
"Amen"
..
landtanin pushed a commit to landtanin/whisper.cpp that referenced this pull request Dec 16, 2023
* add non-speech-token suppression

* add suppress non-speech_tokens param
landtanin pushed a commit to landtanin/whisper.cpp that referenced this pull request Dec 16, 2023
…#473)

This seems to be causing hallucinations in the end of the audio, e.g.:

"Thank you for listening"
"Amen"
..
iThalay pushed a commit to iThalay/whisper.cpp that referenced this pull request Sep 23, 2024
* add non-speech-token suppression

* add suppress non-speech_tokens param
iThalay pushed a commit to iThalay/whisper.cpp that referenced this pull request Sep 23, 2024
…#473)

This seems to be causing hallucinations in the end of the audio, e.g.:

"Thank you for listening"
"Amen"
..
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants