DO NOT MERGE. Wip/get thread id schedules #6674
Conversation
…re calling set_parallel_chunksize.
…size, 2) set chunksize back to the default of 0 and then after the gufunc returns, restore the chunksize back to the previously saved value. This way, the current thread gets its default chunksize behavior inside the parallel region but goes back to its previous value when the region is over.
Co-authored-by: stuartarchibald <stuartarchibald@users.noreply.github.com>
* Add more detail on how the actual chunksize can differ from the specification.
* Move code examples in the docs to tests/doc_examples/test_parallel_chunksize.py.
* Export (g,s)et_parallel_chunksize from numba.np.ufunc.
* Fix the parallel_chunksize with-context docstring.
* Change set_parallel_chunksize to return the previous chunk size, and use that return value to remove the need for get_parallel_chunksize in some places.
* Raise an exception if a negative value is passed to set_parallel_chunksize.
This:
* Makes `_get_thread_id()` return enumerated thread ids from 0..mask.
* Makes it possible to obtain the parfors schedule from within a parallel region.
* Implements broadcasting of schedule info across all threads for convenience.
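A pure-Python model of the enumerated-id behavior (dense ids from 0 upward rather than raw OS thread ids) might look like the following. This is a sketch of the semantics only, not the PR's C/LLVM implementation; the name `get_thread_id` is borrowed for illustration.

```python
import threading

# Sketch: map OS thread idents to dense ids 0..N-1, modeling the
# enumerated-id semantics of the PR's _get_thread_id().
_ids = {}                       # OS ident -> enumerated id
_ids_lock = threading.Lock()


def get_thread_id():
    """Return a dense id in 0..N-1 for the calling thread,
    assigned in first-come order."""
    ident = threading.get_ident()
    with _ids_lock:
        if ident not in _ids:
            _ids[ident] = len(_ids)
        return _ids[ident]


results = []
barrier = threading.Barrier(4)


def work():
    barrier.wait()              # keep all four threads alive at once so
    results.append(get_thread_id())  # their OS idents are distinct


threads = [threading.Thread(target=work) for _ in range(4)]
for t in threads:
    t.start()
for t in threads:
    t.join()
# results holds the dense ids 0..3, one per worker thread.
```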
@stuartarchibald My first reaction is that perhaps this is overly complicated. Here is what I was thinking.

When we lower a parfor, we first calculate the schedule size and then alloca that much space to store the schedule. We then call do_schedule_(un)signed to compute the schedule and store it in that alloca'd space. Both the size and the actual schedule are then on the stack and stick around until the end of the function. We won't try to access them directly by their LLVM names; we just know that the storage will stick around.

Then, in gufunc_scheduler, when we compute a schedule, we save in that file the computed size and a pointer to the last computed schedule. On top of saving those two things, you add a function that returns the saved size and schedule pointer. You then have the issue of interpreting that raw array, but I assume you could add a wrapper to convert from C to Python format there. Thus, you would execute the parfor and afterwards you could inspect the schedule.

My approach would also support inspecting the schedule inside the loop, but that doesn't seem as clean to me. Does your approach require the inspection to be done inside the loop? If so, would that preclude testing vector-style parfors?
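As a rough model of the scheme sketched above, the scheduler can stash the size and contents of the last schedule it computed and expose a getter. This is a hypothetical Python analogue of what gufunc_scheduler's do_schedule_(un)signed computes (a static partition of the iteration space); the names and the exact partitioning are illustrative, not the PR's code.

```python
# Sketch of "save size + pointer to the last schedule, plus a getter".
_last_schedule = None   # (num_threads, [(start, end), ...]) of the last parfor


def do_scheduling(num_threads, total_iters):
    """Statically partition [0, total_iters) across num_threads and record
    the result, modeling the save-on-compute idea described above."""
    global _last_schedule
    base, extra = divmod(total_iters, num_threads)
    sched, start = [], 0
    for t in range(num_threads):
        count = base + (1 if t < extra else 0)
        sched.append((start, start + count - 1))  # inclusive per-thread range
        start += count
    _last_schedule = (num_threads, sched)
    return sched


def get_last_schedule():
    """Getter analogous to 'return the saved size and schedule pointer'."""
    return _last_schedule


do_scheduling(4, 10)
# After the "parfor", the schedule can be inspected:
# get_last_schedule() -> (4, [(0, 2), (3, 5), (6, 7), (8, 9)])
```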
One caveat with my approach: if one function generated a schedule, that function returned, another function started, and you then asked for the schedule before running a parfor in that function, you could get a crash, because the schedule alloca would no longer be on the stack. Does your approach have this issue too?
Closing. #7625 implements this further.