SIGFPE on certain audio files #39

tazz4843 · 2022-10-11T04:39:05Z

Hey there! I'm testing out whisper.cpp to see if it would be suitable for production use. However I'm running into a SIGFPE on certain audio files: namely those that do not produce any output from the model. Because of the way my system is set up, I'm unable to provide any test files that can reproduce this bug.

However, I was able to build the library with debug symbols and trigger the exception. It seems to be a divide-by-zero error on line 2349 of whisper.cpp:

whisper.cpp/whisper.cpp

Line 2349 in 8d94358

int progress_cur = (100*seek)/whisper_n_len(ctx);

The GDB output is as follows:

Thread 21 "scripty_stt_ser" received signal SIGFPE, Arithmetic exception.
[Switching to Thread 0x7ffff7085700 (LWP 3869)]
0x0000555555599123 in whisper_full (ctx=0x5555556f6a80, params=..., samples=<optimized out>, n_samples=<optimized out>) at whisper.cpp:2349
2349            int progress_cur = (100*seek)/whisper_n_len(ctx);

Unfortunately, despite compiling with debug symbols (-g flag), bt gave no extra info beyond that:

(gdb) bt
#0  0x0000555555599123 in whisper_full (ctx=0x5555556f6a80, params=..., samples=<optimized out>, n_samples=<optimized out>) at whisper.cpp:2349
#1  0x0000555555593cf6 in whisper_rs::whisper_ctx::WhisperContext::full (self=<optimized out>, params=..., data=...) at src/whisper_ctx.rs:390

Let me know if there's anything else I can do to help!

The text was updated successfully, but these errors were encountered:

ggerganov · 2022-10-11T17:18:13Z

Very likely that the length of the audio is 0.
The function whisper_n_len returns the length of the spectrogram and it will be 0 if the audio is empty.

There should be a check for division by zero.

When using -g you also want to avoid using -O3 in order to not get <optimized out>

tazz4843 · 2022-10-11T17:31:13Z

I'm willing to try out implementing this, how would you want the division by zero check to be handled? ie. should it throw an error or should it just silently skip?

ggerganov · 2022-10-11T17:37:07Z

Right after computing the mel spectrogram, check if the length is less than 100 (i.e. 1 second) and if yes - return 0:

whisper.cpp/whisper.cpp

Lines 2315 to 2320 in 8d94358

    
           // compute log mel spectrogram 
        
           if (whisper_pcm_to_mel(ctx, samples, n_samples, params.n_threads) != 0) { 
        
               fprintf(stderr, "%s: failed to compute log mel spectrogram\n", __func__); 
        
               return -1; 
        
           }

Add short comment with explanation that we do not process audio less than 1 second

fixes ggerganov#39

fixes #39

fixes ggerganov#39

…tate-struct-with-lifetime refactor: delete map for State and expose struct with lifetime

tazz4843 mentioned this issue Oct 11, 2022

Support for realtime audio input #10

Closed

ggerganov added enhancement New feature or request good first issue Good for newcomers labels Oct 11, 2022

tazz4843 added a commit to tazz4843/whisper.cpp that referenced this issue Oct 11, 2022

check if spectogram length is <100 before doing anything else

ef762e1

fixes ggerganov#39

tazz4843 mentioned this issue Oct 11, 2022

Check spectogram length in whisper_full #41

Merged

ggerganov closed this as completed in #41 Oct 12, 2022

ggerganov pushed a commit that referenced this issue Oct 12, 2022

check if spectogram length is <100 before doing anything else

b799226

fixes #39

anandijain pushed a commit to anandijain/whisper.cpp that referenced this issue Apr 28, 2023

check if spectogram length is <100 before doing anything else

86a6d5f

fixes ggerganov#39

warkcod mentioned this issue Jun 8, 2023

OpenCL clCreateCommandQueue error -30 on MacOS 13.4 intel #996

Open

jacobwu-b pushed a commit to jacobwu-b/Transcriptify-by-whisper.cpp that referenced this issue Oct 24, 2023

check if spectogram length is <100 before doing anything else

fdcf9c6

fixes ggerganov#39

sandrohanea mentioned this issue Feb 3, 2024

Real-time identification of microphone has no result. sandrohanea/whisper.net#155

Closed

kultivator-consulting pushed a commit to KultivatorConsulting/whisper.cpp that referenced this issue Feb 12, 2024

Merge pull request ggerganov#39 from yuniruyuni/feat/expose-whisper-s…

efd18b6

…tate-struct-with-lifetime refactor: delete map for State and expose struct with lifetime

zhouwg mentioned this issue Mar 13, 2024

PoC:clean-room implementation of real-time AI subtitle for English online-TV(OTT TV) zhouwg/kantv#64

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

SIGFPE on certain audio files #39

SIGFPE on certain audio files #39

tazz4843 commented Oct 11, 2022

ggerganov commented Oct 11, 2022

tazz4843 commented Oct 11, 2022

ggerganov commented Oct 11, 2022

SIGFPE on certain audio files #39

SIGFPE on certain audio files #39

Comments

tazz4843 commented Oct 11, 2022

ggerganov commented Oct 11, 2022

tazz4843 commented Oct 11, 2022

ggerganov commented Oct 11, 2022