Transcription is creating duplicate sentences #716

greerviau · 2024-02-25T20:05:53Z

transcribe is returning text with repeated sentences.

Running with the tiny.en model on CPU with the int8 compute type:

[Segment(id=1, seek=240, start=0.0, end=2.4, text=' with times it. With times it. With times it. With times it. With times it. With 
times it. With times it. With times it. With times it. With times it. With times it. With times it. With times it. With times it. With times 
it. With times it. With times it. With times it. With times it. With times it. With times it. With times it. With times it. With times it. With 
times it. With times it. With times it. With times it. With times it. With times it. With times it. With times it. With times it. With times 
it. With times it. With times it. With times it. With times it. With times it. With times it. With times it. With times it. With times it. With 
times it. With times it. With times it. With times it. With times it. With times it. With times it. With times it. With times it. With times 
it. With times it. With times it. With times it', tokens=[50363, 351, 1661, 340, 13, 2080, 1661, 340, 13, 2080, 1661, 340, 13, 2080, 
1661, 340, 13, 2080, 1661, 340, 13, 2080, 1661, 340, 13, 2080, 1661, 340, 13, 2080, 1661, 340, 13, 2080, 1661, 340, 13, 2080, 
1661, 340, 13, 2080, 1661, 340, 13, 2080, 1661, 340, 13, 2080, 1661, 340, 13, 2080, 1661, 340, 13, 2080, 1661, 340, 13, 2080, 
1661, 340, 13, 2080, 1661, 340, 13, 2080, 1661, 340, 13, 2080, 1661, 340, 13, 2080, 1661, 340, 13, 2080, 1661, 340, 13, 2080, 
1661, 340, 13, 2080, 1661, 340, 13, 2080, 1661, 340, 13, 2080, 1661, 340, 13, 2080, 1661, 340, 13, 2080, 1661, 340, 13, 2080, 
1661, 340, 13, 2080, 1661, 340, 13, 2080, 1661, 340, 13, 2080, 1661, 340, 13, 2080, 1661, 340, 13, 2080, 1661, 340, 13, 2080, 
1661, 340, 13, 2080, 1661, 340, 13, 2080, 1661, 340, 13, 2080, 1661, 340, 13, 2080, 1661, 340, 13, 2080, 1661, 340, 13, 2080, 
1661, 340, 13, 2080, 1661, 340, 13, 2080, 1661, 340, 13, 2080, 1661, 340, 13, 2080, 1661, 340, 13, 2080, 1661, 340, 13, 2080, 
1661, 340, 13, 2080, 1661, 340, 13, 2080, 1661, 340, 13, 2080, 1661, 340, 13, 2080, 1661, 340, 13, 2080, 1661, 340, 13, 2080, 
1661, 340, 13, 2080, 1661, 340, 13, 2080, 1661, 340, 13, 2080, 1661, 340, 13, 2080, 1661, 340], temperature=1.0, 
avg_logprob=-0.12661736170450846, compression_ratio=27.032258064516128, no_speech_prob=0.22830641269683838, 
words=None)]

segments, _ = self.model.transcribe(
    file_path,
    vad_filter=True,
    vad_parameters=dict(min_silence_duration_ms=500),
)
print(segments)

This wasn't happening before upgrading to 1.0.0.
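The segment in this report shows compression_ratio=27.03, which is a handy signal for this kind of looping output. As a rough sketch, the ratio can be recomputed from the text alone with zlib (UTF-8 length divided by compressed length). The helper names are made up for illustration, and the 2.4 cutoff is an assumption based on what I understand to be the default compression_ratio_threshold, so treat it as a starting point rather than a rule.

```python
import zlib


def compression_ratio(text: str) -> float:
    """Raw UTF-8 size divided by zlib-compressed size; looping text scores high."""
    raw = text.encode("utf-8")
    return len(raw) / len(zlib.compress(raw))


def looks_repetitive(text: str, threshold: float = 2.4) -> bool:
    """Flag text whose compression ratio exceeds the (assumed) threshold."""
    return compression_ratio(text) > threshold
```

Very short repeats (a sentence said twice) may not compress enough to cross the threshold, so this mostly catches long loops like the one above.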


Purfview · 2024-02-27T13:27:18Z

Can you share an audio sample to reproduce the issue?

stu247 · 2024-02-27T17:02:02Z

I am also seeing this issue. I used this code:

from faster_whisper import WhisperModel

model = WhisperModel("small.en", compute_type="int8")
segments, info = model.transcribe("turnOnKitchenSink.wav")
for segment in segments:
    print("[%.2fs -> %.2fs] %s" % (segment.start, segment.end, segment.text))

and got this output:

[0.00s -> 29.20s]  Turn on kitchen sink. Turn on kitchen sink.

I get the same output if I don't specify compute_type. The audio sample is: turnOnKitchenSink.zip
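For anyone who wants to check their own transcripts for this symptom, here is a minimal sketch that lists sentences immediately followed by an identical sentence. The function name and the simple punctuation-based sentence split are assumptions for illustration. It only detects the duplication; it is not the fix discussed in this thread.

```python
import re


def consecutive_repeats(text: str) -> list[str]:
    """Return each sentence that is immediately followed by an identical one."""
    # Split after sentence-ending punctuation; compare case-insensitively.
    parts = re.split(r"(?<=[.!?])\s+", text.strip())
    sentences = [p.strip().lower() for p in parts if p.strip()]
    return [a for a, b in zip(sentences, sentences[1:]) if a == b]
```

Calling it on each segment.text from the loop above would flag the "Turn on kitchen sink." segment while leaving a single-sentence segment alone.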

greerviau · 2024-02-27T17:06:11Z

I can't point to a specific audio sample; it was happening with every one I tried. I'm not sure whether it also happens on GPU, since I was using CPU for my testing. I've reverted my code to 0.10.1 until there's a fix.

Sharrnah · 2024-02-27T23:56:09Z

I see the same issue when I set a language (tested with the medium and large-v3 models).

It happens on medium.en just as on the other models if a language is set.

If I set the language to autodetect, it's fine. The older version is fine too.
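To make that comparison easy to repeat, here is a small sketch that runs the same file once with an explicit language and once with language=None (autodetect) and returns both texts. The helper name is hypothetical; it only assumes the model object exposes transcribe(path, language=...) returning (segments, info), as WhisperModel does.

```python
def transcribe_both_ways(model, audio_path, language="en"):
    """Transcribe once with an explicit language and once with autodetect."""
    outputs = {}
    for label, lang in (("explicit", language), ("autodetect", None)):
        segments, _info = model.transcribe(audio_path, language=lang)
        # segments is consumed lazily, so join it inside the loop.
        outputs[label] = "".join(segment.text for segment in segments)
    return outputs


# Example (needs faster-whisper installed; "sample.wav" is a placeholder path):
# from faster_whisper import WhisperModel
# print(transcribe_both_ways(WhisperModel("medium.en"), "sample.wav"))
```

If the two texts differ only by repeated sentences in the "explicit" entry, that matches the behaviour described here.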

Sharrnah · 2024-02-28T00:25:13Z

As an update: it was introduced with this commit:
0920672

When I revert it, it's fine again.

trungkienbkhn · 2024-02-28T01:35:26Z

@Sharrnah, hello. Can you try again with this fix?

Purfview · 2024-02-28T06:51:57Z

> I am also seeing this issue.
>
> [0.00s -> 29.20s]  Turn on kitchen sink. Turn on kitchen sink.
> I get the same output if I don't specify compute_type. The audio sample is: turnOnKitchenSink.zip

Looks good with PR #705:

[00:00.000 --> 00:02.220]  Turn on kitchen sink.

Sharrnah · 2024-02-28T11:44:31Z

> @Sharrnah, hello. Can you try again with this fix?

Thanks. Looks fine with the fix. :)

James-Shared-Studios · 2024-03-01T05:20:34Z

Fully tested the fix, and it works well.

signebedi mentioned this issue Feb 25, 2024

[bug] faster whisper v.1.0.0 causes duplication signebedi/whisper-api#2

Closed

minhthuc2502 mentioned this issue Feb 26, 2024

perf: conv1d quantization OpenNMT/CTranslate2#1601

Merged

Sharrnah added a commit to Sharrnah/faster-whisper that referenced this issue Feb 28, 2024

temporary bugfix for issue SYSTRAN#716

a667e69

vincaslt mentioned this issue Apr 26, 2024

Duplicate words #811

Open

AnshumanParidaIL mentioned this issue May 27, 2024

faster-whisper==1.0.1 transcription creates duplicate sentences. NavodPeiris/speechlib#32

Closed
