Multithreaded queries problem #23

konstantin-sancom · 2023-01-25T11:14:59Z

I tried to add multithreading and find out, that there is a problem with multithreading:

in unrestricted mode(new threads created as fast as it is possible) from 464 input files Olaf found only 26 in DB
with delay of 1-0.5 second there was a result as good as in serial mode(no threading at all)
with delay of 0.25 second or less results are degrading

Could you explain, is it 'by design' or is it a BUG also?

konstantin-sancom · 2023-01-25T12:59:50Z

Update.
The problem is here:

              //The fft struct is reused
              PFFFT_Setup *fftSetup = processor->runner->fftSetup;
              float *fft_in= processor->runner->fft_in;
              float *fft_out= processor->runner->fft_out;

In multithreading it is not possible to "reuse" those objects.

JorenSix · 2023-01-25T13:05:23Z

Hi,

I am not sure about the added value of multi-threading. Perhaps your tests might prove otherwise but I think decoding and storage are the bottleneck. From the readme:

Olaf is single threaded. The main reasons are simplicity and limitations of embedded platforms. The single threaded design keeps the code simple. On embedded platforms with single core CPU’s multithreading makes no sense. On traditional computers there might be a performance gain by implementing multi-threading. However, the time spent on decoding audio and storing fingerprints is much larger than analysis/extraction so the gain might be limited. As an work-around multiple processes can be used simultaniously to query the database.

JorenSix · 2023-01-25T13:08:04Z

But please do not let that stop you to experiment and look where data is shared which should better not be shared. Or how to improve Olaf in general. I am grateful for all constructive criticism on the code! Thanks!

konstantin-sancom · 2023-01-25T15:17:08Z

But please do not let that stop you to experiment and look where data is shared which should better not be shared. Or how to improve Olaf in general. I am grateful for all constructive criticism on the code! Thanks!

Ok. I see now - this code is for embeded systems.
On x86 platforms there is a a reason for multithreading and I made it work, BTW.
DB for reading is not a "bottleneck" - it is threadsafe (from the lmdb documentation), they say: there may be a lot of readers as processes or threads without locking eacj other, even there can one writer, not blocking readers.

JorenSix · 2023-01-25T16:22:03Z

Congrats! Very curious to see which changes were needed and how that would look and which effects it would have on (query) performance. Especially vs queries from multiple processes.

JorenSix closed this as completed Jan 30, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Multithreaded queries problem #23

Multithreaded queries problem #23

konstantin-sancom commented Jan 25, 2023

konstantin-sancom commented Jan 25, 2023 •

edited

Loading

JorenSix commented Jan 25, 2023

JorenSix commented Jan 25, 2023

konstantin-sancom commented Jan 25, 2023

JorenSix commented Jan 25, 2023

Multithreaded queries problem #23

Multithreaded queries problem #23

Comments

konstantin-sancom commented Jan 25, 2023

konstantin-sancom commented Jan 25, 2023 • edited Loading

JorenSix commented Jan 25, 2023

JorenSix commented Jan 25, 2023

konstantin-sancom commented Jan 25, 2023

JorenSix commented Jan 25, 2023

konstantin-sancom commented Jan 25, 2023 •

edited

Loading