
whisper: optimize fft() function #2242

Merged · 1 commit merged into ggerganov:master on Jun 18, 2024
Conversation

mkycoder (Contributor)
In the log_mel_spectrogram_worker_thread() function, we allocate enough memory for the fft_in and fft_out buffers up front, so that fft() can avoid allocating memory for even, odd, even_fft and odd_fft on every call.
In my testing, this optimization reduces the mel spectrogram time from 125 ms to 100 ms for 5 minutes of audio.

Before:

    std::vector<float> fft_in(frame_size, 0.0);
    std::vector<float> fft_out(2 * frame_size);

After:

    std::vector<float> fft_in(frame_size * 2, 0.0);
    std::vector<float> fft_out(frame_size * 2 * 2 * 2);
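For context, here is a minimal sketch of the idea behind the change: a recursive radix-2 fft() that carves all of its even/odd scratch space out of the caller's two preallocated buffers instead of constructing fresh std::vector objects at every level. This is an illustration, not the exact merged code, and it assumes N is a power of two (the real function also has to handle other frame sizes):

    #include <cmath>
    #include <vector>

    // in : N real samples, followed by at least N - 1 floats of scratch
    // out: 2*N floats of interleaved complex output, followed by scratch
    //      (a buffer of 8*N floats in total covers the whole recursion)
    static void fft(float * in, int N, float * out) {
        if (N == 1) {
            out[0] = in[0];
            out[1] = 0.0f;
            return;
        }

        const int half_N = N/2;

        // even-indexed samples go into the scratch right behind the input
        float * even = in + N;
        for (int i = 0; i < half_N; i++) {
            even[i] = in[2*i];
        }

        // the child transform writes right behind this level's output
        float * even_fft = out + 2*N;
        fft(even, half_N, even_fft);

        // the even branch is finished, so its input scratch can be reused
        float * odd = even;
        for (int i = 0; i < half_N; i++) {
            odd[i] = in[2*i + 1];
        }

        float * odd_fft = even_fft + 2*half_N;
        fft(odd, half_N, odd_fft);

        // butterfly: X[k] = E[k] + w^k O[k], X[k+N/2] = E[k] - w^k O[k]
        const float pi = 3.14159265358979323846f;
        for (int k = 0; k < half_N; k++) {
            const float theta = 2.0f*pi*k/N;
            const float re =  std::cos(theta);
            const float im = -std::sin(theta);

            const float re_odd = odd_fft[2*k + 0]*re - odd_fft[2*k + 1]*im;
            const float im_odd = odd_fft[2*k + 0]*im + odd_fft[2*k + 1]*re;

            out[2*k + 0] = even_fft[2*k + 0] + re_odd;
            out[2*k + 1] = even_fft[2*k + 1] + im_odd;

            out[2*(k + half_N) + 0] = even_fft[2*k + 0] - re_odd;
            out[2*(k + half_N) + 1] = even_fft[2*k + 1] - im_odd;
        }
    }

    // caller, as in the PR: one allocation per worker thread instead of
    // one per recursion level
    // std::vector<float> fft_in (frame_size*2, 0.0f);
    // std::vector<float> fft_out(frame_size*2*2*2);
    // fft(fft_in.data(), frame_size, fft_out.data());

The key trick is that odd reuses even's slot: by the time the odd samples are copied in, the even half's recursion has already finished with that scratch, which is what keeps the input requirement down to 2*N floats.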
ggerganov (Owner)

Is this correct - shouldn't it be just frame_size * 2 * 2?

mkycoder (Contributor, Author) · Jun 18, 2024

Suppose frame_size is 8.
For fft_in, the recursion needs input buffers of size 8, 4, 2, 1, so 8 + 4 + 2 + 1 = 15 floats in total; frame_size * 2 = 16 is enough.
fft_out is a little more complicated: a level of size m needs 2*m floats for out plus m each for even_fft and odd_fft, i.e. 8*2 + 8 + 8 = 32 at the top, then 8 + 4 + 4 = 16, 4 + 2 + 2 = 8, and 2 + 1 + 1 = 4 at the deeper levels, so 32 + 16 + 8 + 4 = 60 in total; frame_size * 2 * 2 * 2 = 64 is enough.
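Both totals are geometric series, which is why the fixed multipliers work for any power-of-two frame_size: the input side sums to 2*N - 1 < 2*N and the output side to 4*(2*N - 1) = 8*N - 4 < 8*N. A small throwaway program (my own check, not code from the PR) can total the per-level sums and compare them against the allocations:

    #include <cstdio>

    // floats needed in fft_in for a size-m transform: the m inputs plus the
    // scratch of one size-m/2 child (even and odd reuse the same slot)
    static int in_need(int m) {
        return m == 1 ? 1 : m + in_need(m/2);
    }

    // floats counted in fft_out, summed per level as in the comment above:
    // 2*m for out plus m each for even_fft and odd_fft
    static int out_need(int m) {
        return m == 1 ? 4 : 4*m + out_need(m/2);
    }

    int main() {
        for (int N = 8; N <= 1024; N *= 2) {
            std::printf("N=%4d  in %5d <= %5d  out %5d <= %5d\n",
                        N, in_need(N), 2*N, out_need(N), 8*N);
        }
        return 0;
    }

For N = 8 this reproduces 15 <= 16 and 60 <= 64, matching the numbers above.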

ggerganov merged commit bf4cb4a into ggerganov:master on Jun 18, 2024. 49 checks passed.
bygreencn added a commit to bygreencn/whisper.cpp that referenced this pull request Jun 27, 2024
* bobqianic/fix-decoding: (436 commits)
  Add files via upload
  Update whisper.cpp
  whisper : optimize fft() function (ggerganov#2242)
  talk-llama : sync llama.cpp
  whisper : use ggml_backend_sched (ggerganov#2239)
  fix : remove extra files
  scripts : sync ggml-blas
  build : update make / cmake
  sync : ggml
  move BLAS to a separate backend (cont) (llama/6210)
  Vulkan Shader Refactor, Memory Debugging Option (llama/7947)
  scripts : stop sync whisper example from ggml
  cmake : fix sycl build (#0)
  ggml : remove OpenCL (#0)
  sycl : sync (#0)
  cuda : enable CUDA graphs (#0)
  talk-llama : sync llama.cpp
  cmake : fix CUDA build (#0)
  sync : ggml
  ggml : fix and optimize ppc64le (ggml/849)
  ...