Skip silence around hallucinations #646

trungkienbkhn · 2024-01-16T12:21:35Z

Same as openai/whisper#1838

makaveli10 · 2024-01-19T08:00:53Z

@trungkienbkhn looks like this slows down the inference. Have you noticed the increase in latency?

Purfview · 2024-01-19T17:30:21Z

"looks like this slows down the inference"

Original author mentioned:

"...since this also requires extra processing time, we only do this when a probable hallucination is detected."
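To illustrate what "only when a probable hallucination is detected" could look like, here is a minimal sketch of a per-segment anomaly check, loosely modeled on the heuristic in openai/whisper#1838. The thresholds, the punctuation handling, and the word dictionary layout (`word`, `start`, `end`, `probability`) are assumptions for illustration and may not match what either library actually ships.

```python
import string
from typing import Optional

# Illustrative thresholds (assumptions, not confirmed library values).
LOW_PROBABILITY = 0.15   # words below this probability look suspicious
MIN_DURATION = 0.133     # seconds; shorter words look suspicious
MAX_DURATION = 2.0       # seconds; longer words look suspicious


def word_anomaly_score(word: dict) -> float:
    """Score one word; higher means more likely part of a hallucination."""
    duration = word["end"] - word["start"]
    score = 0.0
    if word.get("probability", 0.0) < LOW_PROBABILITY:
        score += 1.0
    if duration < MIN_DURATION:
        score += (MIN_DURATION - duration) * 15
    if duration > MAX_DURATION:
        score += duration - MAX_DURATION
    return score


def is_segment_anomaly(segment: Optional[dict]) -> bool:
    """Flag a segment whose first few words look anomalous overall."""
    if segment is None or not segment.get("words"):
        return False
    # Ignore tokens that are pure punctuation, then look at up to 8 words.
    words = [w for w in segment["words"] if w["word"].strip(string.punctuation + " ")]
    words = words[:8]
    if not words:
        return False
    score = sum(word_anomaly_score(w) for w in words)
    return score >= 3 or score + 0.01 >= len(words)
```

Under this sketch, the more expensive silence check would run only for segments where `is_segment_anomaly` returns True, which is why the extra cost depends on how noisy the audio is.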

trungkienbkhn · 2024-01-25T07:43:36Z

@makaveli10, hello, sorry for the late reply.
I tested a noisy 192-second mp3 file with the tiny model on a CUDA device.
With hallucination_silence_threshold=2, the average total execution time is 5.38s.
With the original code, without this feature, it is 4.87s.
My code:

from faster_whisper import WhisperModel

model = WhisperModel('tiny', device='cuda')
segments, info = model.transcribe(audio_path, word_timestamps=True, hallucination_silence_threshold=2)

=> Latency increases slightly (about 0.5s, roughly 10% in this test), but I found that the transcription quality also improves. I think this is an acceptable trade-off.
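For anyone who wants to reproduce a comparison like this, here is a minimal timing sketch. The helper name `average_runtime` and the repeat count are assumptions, not part of this PR. One detail matters: faster-whisper returns segments as a lazy generator, so the timed region has to consume every segment, otherwise only the setup cost is measured.

```python
import time
from statistics import mean
from typing import Callable, Iterable


def average_runtime(run: Callable[[], Iterable], repeats: int = 5) -> float:
    """Return the mean wall-clock seconds per call, exhausting each result."""
    timings = []
    for _ in range(repeats):
        start = time.perf_counter()
        for _item in run():  # consume lazily produced results inside the timer
            pass
        timings.append(time.perf_counter() - start)
    return mean(timings)


# Hypothetical usage (requires faster-whisper, a CUDA device, and an audio file):
#
# from faster_whisper import WhisperModel
# model = WhisperModel("tiny", device="cuda")
#
# def with_skip():
#     segments, _info = model.transcribe(
#         audio_path, word_timestamps=True, hallucination_silence_threshold=2
#     )
#     return segments
#
# def without_skip():
#     segments, _info = model.transcribe(audio_path, word_timestamps=True)
#     return segments
#
# print(average_runtime(with_skip), average_runtime(without_skip))
```

Both runs keep word_timestamps=True so that the measured difference comes from the silence-skipping logic rather than from word alignment.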

trungkienbkhn force-pushed the skip-silence-around-hallucinations branch from 9efdffb to 5e94811 Compare January 16, 2024 12:24

Add clip_timestamps and hallucination_silence_threshold options

beeb467

trungkienbkhn force-pushed the skip-silence-around-hallucinations branch from 5e94811 to beeb467 Compare January 25, 2024 07:03

nguyendc-systran merged commit 0920672 into SYSTRAN:master Feb 20, 2024
3 checks passed

trungkienbkhn mentioned this pull request Feb 23, 2024

Update logic to get segment from features before encoding #705

Merged
