Support multithreading #496
Conversation
Cool. Can you add an example of a benchmark (you can put it in …
I am sorry if my comment is dumb, and I am not the maintainer of numexpr, so my opinion is not worth much, but I have the impression your benchmark compares apples to oranges: numpy with 2 threads vs numexpr with 32 (?) threads (2 explicit threads x 16 builtin threads). How does the builtin numexpr threading interact with manual threading anyway? Also, I would be interested in a benchmark against "normal"/builtin numexpr threading, which I think is more interesting than one against numpy. Unless there is something I don't understand (very likely), I don't expect much difference.
There are many benchmark cases, both against numpy and with different numbers of threads, under the bench/ folder. The whole point of this PR is to avoid reimplementing numexpr's mechanism on top of numpy when multithreading; numexpr is not thread safe due to a global dict, as stated clearly in the PR description.
I'm not quite sure I understand the comment "how does the builtin numexpr threading interact with manual threading". Well, better that they don't: if they do, it usually implies a race condition. The change here guarantees that they don't. Oversubscription (I have only 16 cores) is a common technique when CPU utilisation is low due to I/O, because in reality the other thread(s) might be loading data rather than using the CPUs much.
As commented in the benchmark file, it uses 2 threads because the presumption is that memory is only big enough for 2 chunks of computation. Then there are two choices: thread over smaller chunks with numpy, or hand threading over to numexpr. To me the latter is clearly the easier option (see the sketch below). With smaller chunks and oversubscribed threads, numexpr is not doing as well as in other conditions (again, those are available under the bench/ folder). However, it's still much better than single-threaded numpy, and MUCH less work.
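A minimal sketch of the pattern being benchmarked, with a placeholder expression and chunk shape (the actual cases live under bench/): two explicit threads each evaluate one chunk with `ne.evaluate`, while numexpr's builtin threads do the per-chunk work.

```python
from concurrent.futures import ThreadPoolExecutor

import numexpr as ne
import numpy as np


def compute_chunk(chunk):
    # With this PR, each thread's evaluation context is isolated,
    # so concurrent ne.evaluate calls are safe.
    return ne.evaluate("2 * chunk**3 + chunk", local_dict={"chunk": chunk})


# Pretend only a couple of chunks fit into memory at once.
data = np.random.rand(4, 1_000_000)

with ThreadPoolExecutor(max_workers=2) as pool:
    results = list(pool.map(compute_chunk, data))
```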
Thanks @emmaai for your example. It was more for me (and others!) to understand the way you wanted to use your feature. Now it is much clearer, and sounds good to me. The only thing that I'd ask is to add a new test exercising this new feature; tests are actually the way to ensure that we are not introducing regressions in the future.
Test added to verify thread safety by always manifesting the race condition.
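For context, here is a hedged sketch of the kind of test that can manifest such a race (the expressions and structure are illustrative, not the actual test added in the PR): two threads interleave `evaluate` and `re_evaluate`, which before this change shared state through the module-level `_numexpr_last`, so a thread could re-run the other thread's expression.

```python
import threading

import numexpr as ne
import numpy as np


def worker(expr, value, failures):
    a = np.full(1000, value, dtype=np.float64)
    for _ in range(100):
        expected = ne.evaluate(expr, local_dict={"a": a})
        # re_evaluate reuses the last compiled expression for this context;
        # if contexts leak between threads, this recomputes the wrong expr.
        got = ne.re_evaluate(local_dict={"a": a})
        if not np.array_equal(expected, got):
            failures.append(expr)
            return


failures = []
threads = [
    threading.Thread(target=worker, args=(expr, value, failures))
    for expr, value in [("a + 1", 1.0), ("a * 3", 2.0)]
]
for t in threads:
    t.start()
for t in threads:
    t.join()
assert not failures, f"race condition manifested for: {failures}"
```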
Thanks @emmaai for the added explanation.
I'm following up on this PR, wondering if there is any concern preventing it from being merged?
I've just activated the tests in CI, and Mac OSX is reporting a failure. Can you address it?
Sorry, I forgot to commit the change in another file when pushing the test. It should pass now.
Thanks @emmaai!
As the title says (ref: #494), the change:
- replaces `_numexpr_last` with a dictionary-like object that is aware of its context

Reasoning:
- users may `re_evaluate` and want that to stay safe as well
- `async` re-/evaluation, which further caters to my specific use case (see the sketch after this list)
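One plausible shape for such a context-aware, dictionary-like object (a sketch assuming per-thread storage via `threading.local`, not necessarily the PR's exact implementation):

```python
import threading


class ContextDict(threading.local):
    """Dict-like per-thread storage: threading.local runs __init__ once per
    thread, so each thread sees its own independent _data mapping."""

    def __init__(self):
        self._data = {}

    def __setitem__(self, key, value):
        self._data[key] = value

    def __getitem__(self, key):
        return self._data[key]

    def get(self, key, default=None):
        return self._data.get(key, default)


# Replaces the former module-level dict, so "last evaluation" state kept for
# re_evaluate can no longer leak between threads.
_numexpr_last = ContextDict()
```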
Benchmark case:
It's based on my reality: most of the time I have a large amount of data of which only chunks fit into memory, and most of my CPUs are idling while I/O (especially input) happens. If the data fits into memory and the CPUs are fully utilised, I guess it doesn't make much difference whether you thread over chunks with `numpy` or with `numexpr`. Certainly I could implement what `numexpr` achieves by further chunking each chunk, but why? Specifically, I use `dask` threading to schedule the tasks; all I have to do is pack `ne.evaluate` nicely into a "task", provided thread safety is taken care of (a sketch of this workflow follows).
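A sketch of that `dask` workflow, with the expression, chunk count, and loading step as stand-ins: each delayed task loads one chunk and hands the computation to `ne.evaluate`, so dask's threaded scheduler and numexpr's builtin threads coexist once `evaluate` is thread safe.

```python
import dask
import numexpr as ne
import numpy as np


@dask.delayed
def load_chunk(i):
    # Stand-in for I/O-bound loading; in reality this would read chunk i
    # from disk, which is when the CPUs would otherwise sit idle.
    return np.random.rand(1_000_000)


@dask.delayed
def transform(chunk):
    # The whole numexpr evaluation is packed into a single dask task.
    return ne.evaluate("sin(chunk) ** 2", local_dict={"chunk": chunk})


tasks = [transform(load_chunk(i)) for i in range(8)]
# Two scheduler threads, mirroring the "only 2 chunks fit in memory" setup.
results = dask.compute(*tasks, scheduler="threads", num_workers=2)
```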