Need -pthread for make on ubuntu 20.04 #1

ArtyomZemlyak · 2022-09-26T08:31:06Z

Makefile

main: ggml.o main.o
	g++ -pthread -o main ggml.o main.o
	./main -h

ggml.o: ggml.c ggml.h
	gcc -pthread -O3 -mavx -mavx2 -mfma -mf16c -c ggml.c

main.o: main.cpp ggml.h
	g++ -pthread -O3 -std=c++11 -c main.cpp

ggerganov · 2022-09-26T08:59:46Z

Thanks - should be fixed now

* Fix MSVC compile error C3688 Instead of simply using 'add_compile_options(/utf-8)' to address the MSVC compile error C3688, a better approach would be to handle it in a way that prevents passing '/utf-8' to NVCC. * Significantly improve inference quality In the function `log_mel_spectrogram_worker_thread`, there's an array out-of-bounds issue occurring during the calculation of complex number moduli. This issue is causing disruptions in the FFT spectrum, which, in turn, is reducing the quality of inference. * Significantly improve inference quality At last, I've pinpointed the actual source of the problem. Given that the frequency spectrum generated from real input data is symmetrical around the Nyquist frequency, there's a for-loop within the `log_mel_spectrogram_worker_thread` function that attempts to fold the frequency spectrum. Regrettably, a bug within this for-loop is causing a frame shift in the frequency spectrum. The previous attempt to remedy this, which involved using `fft_size + 1` when calculating the modulus, was merely a band-aid solution and did not address the underlying issue. * Addressed a few minor issues Fixed the issue of `fft_out` continuously expanding. Resolved the fallback caused by using 'break' instead of `fft_in[j] = 0`. * Significantly improve inference quality Thanks for your patience everyone. It's finally sorted out. Now, the right side of the FFT spectrum is being flipped over to the left, and the amplitudes at corresponding positions on the left and right are added together (the spectrum on the left needs to be shifted by one position), then the average is calculated. FFT_OUT[0] is no longer discarded, making full use of the limited space to pack in more information. * Add annotation and performance improvement * Calculate FFT only when fft_in are not all zero * Some minor performance improvement * Fixed a bug impacting inference quality * The first version after all the analysis is completed. * Fix some bugs and add debug mode * Fixed several bugs * Temporarily disable speed-up mode and add debug mode. * Add debug mode * Disable speed-up mode and add debug mode * Fix CI error (#1) * Fix error * Fix error * Fixed several bugs including [BLANK_AUDIO] problem * Remove Hard-coded hann window * Some Final Fix (#2) * Fix error * Fix error * Probably the last commit * Probably the last commit * whisper : minor coding style changes * whisper : remove debug from public API --------- Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>

* Fix MSVC compile error C3688 Instead of simply using 'add_compile_options(/utf-8)' to address the MSVC compile error C3688, a better approach would be to handle it in a way that prevents passing '/utf-8' to NVCC. * Significantly improve inference quality In the function `log_mel_spectrogram_worker_thread`, there's an array out-of-bounds issue occurring during the calculation of complex number moduli. This issue is causing disruptions in the FFT spectrum, which, in turn, is reducing the quality of inference. * Significantly improve inference quality At last, I've pinpointed the actual source of the problem. Given that the frequency spectrum generated from real input data is symmetrical around the Nyquist frequency, there's a for-loop within the `log_mel_spectrogram_worker_thread` function that attempts to fold the frequency spectrum. Regrettably, a bug within this for-loop is causing a frame shift in the frequency spectrum. The previous attempt to remedy this, which involved using `fft_size + 1` when calculating the modulus, was merely a band-aid solution and did not address the underlying issue. * Addressed a few minor issues Fixed the issue of `fft_out` continuously expanding. Resolved the fallback caused by using 'break' instead of `fft_in[j] = 0`. * Significantly improve inference quality Thanks for your patience everyone. It's finally sorted out. Now, the right side of the FFT spectrum is being flipped over to the left, and the amplitudes at corresponding positions on the left and right are added together (the spectrum on the left needs to be shifted by one position), then the average is calculated. FFT_OUT[0] is no longer discarded, making full use of the limited space to pack in more information. * Add annotation and performance improvement * Calculate FFT only when fft_in are not all zero * Some minor performance improvement * Fixed a bug impacting inference quality * The first version after all the analysis is completed. * Fix some bugs and add debug mode * Fixed several bugs * Temporarily disable speed-up mode and add debug mode. * Add debug mode * Disable speed-up mode and add debug mode * Fix CI error (ggerganov#1) * Fix error * Fix error * Fixed several bugs including [BLANK_AUDIO] problem * Remove Hard-coded hann window * Some Final Fix (ggerganov#2) * Fix error * Fix error * Probably the last commit * Probably the last commit * whisper : minor coding style changes * whisper : remove debug from public API --------- Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>

wchess : add clear_audio callback

* Fix MSVC compile error C3688 Instead of simply using 'add_compile_options(/utf-8)' to address the MSVC compile error C3688, a better approach would be to handle it in a way that prevents passing '/utf-8' to NVCC. * Significantly improve inference quality In the function `log_mel_spectrogram_worker_thread`, there's an array out-of-bounds issue occurring during the calculation of complex number moduli. This issue is causing disruptions in the FFT spectrum, which, in turn, is reducing the quality of inference. * Significantly improve inference quality At last, I've pinpointed the actual source of the problem. Given that the frequency spectrum generated from real input data is symmetrical around the Nyquist frequency, there's a for-loop within the `log_mel_spectrogram_worker_thread` function that attempts to fold the frequency spectrum. Regrettably, a bug within this for-loop is causing a frame shift in the frequency spectrum. The previous attempt to remedy this, which involved using `fft_size + 1` when calculating the modulus, was merely a band-aid solution and did not address the underlying issue. * Addressed a few minor issues Fixed the issue of `fft_out` continuously expanding. Resolved the fallback caused by using 'break' instead of `fft_in[j] = 0`. * Significantly improve inference quality Thanks for your patience everyone. It's finally sorted out. Now, the right side of the FFT spectrum is being flipped over to the left, and the amplitudes at corresponding positions on the left and right are added together (the spectrum on the left needs to be shifted by one position), then the average is calculated. FFT_OUT[0] is no longer discarded, making full use of the limited space to pack in more information. * Add annotation and performance improvement * Calculate FFT only when fft_in are not all zero * Some minor performance improvement * Fixed a bug impacting inference quality * The first version after all the analysis is completed. * Fix some bugs and add debug mode * Fixed several bugs * Temporarily disable speed-up mode and add debug mode. * Add debug mode * Disable speed-up mode and add debug mode * Fix CI error (ggerganov#1) * Fix error * Fix error * Fixed several bugs including [BLANK_AUDIO] problem * Remove Hard-coded hann window * Some Final Fix (ggerganov#2) * Fix error * Fix error * Probably the last commit * Probably the last commit * whisper : minor coding style changes * whisper : remove debug from public API --------- Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>

revert change

Update ggml-backend.c

ggerganov added a commit that referenced this issue Sep 26, 2022

ref #1 : add -pthread to compilation flags

154fa79

cdosoftei mentioned this issue Sep 28, 2022

Pass -pthread to linker #3

Merged

ggerganov closed this as completed Sep 28, 2022

xyx361100238 mentioned this issue Oct 20, 2022

Not working on MacOS (ARM) #66

Closed

ANMahmood mentioned this issue Mar 27, 2023

GGML_ASSERT: \whisper.cpp\ggml.c:3904: !ggml_is_transposed(a) #661

Closed

cjia4 mentioned this issue Mar 28, 2023

ggml_new_tensor_impl: not enough space in the scratch memory #671

Closed

starsea mentioned this issue Mar 30, 2023

talk-llama segmentation fault on mac os #691

Open

catdumitru mentioned this issue Apr 12, 2023

Problems running the stream example - [Start speaking] frozen #747

Closed

anandijain pushed a commit to anandijain/whisper.cpp that referenced this issue Apr 28, 2023

ref ggerganov#1 : add -pthread to compilation flags

c2d04ee

huapingchen mentioned this issue May 13, 2023

segmentation fault when use Core ML #919

Closed

jacob-salassi mentioned this issue May 15, 2023

Core ML support #566

Merged

10 tasks

This was referenced May 31, 2023

Run talk example failed #782

Closed

Segmentation fault while running talk on mac m1 max #974

Open

warkcod mentioned this issue Jun 8, 2023

OpenCL clCreateCommandQueue error -30 on MacOS 13.4 intel #996

Open

Mingkun-Lu mentioned this issue Jul 4, 2023

How does Android interrupt quickly while running fullTranscribe and free up memory #1079

Open

ToSeven mentioned this issue Oct 14, 2023

whisper_init_state: ggml_metal_init() failed #1367

Closed

StuartIanNaylor mentioned this issue Nov 3, 2023

Benchmark results #89

Open

ggerganov pushed a commit that referenced this issue Nov 30, 2023

Merge pull request #1 from ggerganov/gg/wchess

8dba820

wchess : add clear_audio callback

jettoblack pushed a commit to jettoblack/whisper.cpp that referenced this issue Feb 8, 2024

Merge pull request ggerganov#1 from bobqianic/bobqianic-patch-1

7499e3c

revert change

liam-mceneaney referenced this issue in liam-mceneaney/androidwhisper.cpp Mar 5, 2024

Merge pull request #1 from Digipom/size_t_32bit_fix

d4577da

Update ggml-backend.c

genglinxiao mentioned this issue May 16, 2024

What kind of performance can we expect? #2157

Open

bradmit mentioned this issue May 23, 2024

Crash with multiple whisper states running at the same time CUDA #2177

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Need -pthread for make on ubuntu 20.04 #1

Need -pthread for make on ubuntu 20.04 #1

ArtyomZemlyak commented Sep 26, 2022

ggerganov commented Sep 26, 2022

Need -pthread for make on ubuntu 20.04 #1

Need -pthread for make on ubuntu 20.04 #1

Comments

ArtyomZemlyak commented Sep 26, 2022

ggerganov commented Sep 26, 2022