bwl1289/feat/merge-google-pthreadpool #1

BwL1289 · 2025-04-26T00:27:31Z

No description provided.

…tegies to `pthreadpool`.

The `pthreadpool_parallelize_Xd_tile_Yd_dynamic` strategy differs from the existing non-dynamic `pthreadpool_parallelize_Xd_tile_Yd` in that the `count_*` argument of `function` corresponding to the innermost dimension can be any integer multiple of the corresponding `tile_*` argument, or the remainder of that range.

E.g. `pthreadpool_parallelize_3d_tile_2d_dynamic(&threadpool, function, context, range_i, range_j, range_k, tile_j, tile_k)` results in calls to

```
function(context, offset_i, offset_j, offset_k, count_j, count_k)
```

where `offset_j` and `offset_k` are integer multiples of `tile_j` and `tile_k`, respectively, and `count_k` is an integer multiple of `tile_k` or `range_k - offset_k`. If `range_k <= tile_k`, then `count_j` may be an integer multiple of `tile_j` or `range_j - offset_j`.

The number of elements processed in each call is chosen as `max((count - offset) / (chunk_factor * num_threads), 1)` where `count` is the total number of elements and `offset` is the current number of elements already processed. This produces large element chunks initially, when there are lots of elements to process, and successively smaller chunks as the end of the computation nears.

The `chunk_factor` used above is the ratio of the fastest to the slowest core in the parallel computation plus one. On heterogeneous systems, this is set to `5`, otherwise to `2`.

PiperOrigin-RevId: 715757992
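To make the chunk-size rule above concrete, here is a minimal sketch that evaluates `max((count - offset) / (chunk_factor * num_threads), 1)` and walks a range with it. The helper names `next_chunk_size` and `count_chunks` are hypothetical and are not part of pthreadpool; the sketch only illustrates the arithmetic, not how the library distributes work across threads.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical helper (not a pthreadpool function): the number of elements
 * to hand out next, i.e. max((count - offset) / (chunk_factor * num_threads), 1). */
static size_t next_chunk_size(size_t count, size_t offset,
                              size_t chunk_factor, size_t num_threads) {
  assert(offset < count);                 /* there must be work left */
  assert(chunk_factor * num_threads > 0); /* avoid division by zero */
  const size_t remaining = count - offset;
  const size_t chunk = remaining / (chunk_factor * num_threads);
  return chunk > 0 ? chunk : 1;
}

/* Hypothetical helper: hands out chunks until the range is exhausted and
 * returns how many chunks were needed. */
static size_t count_chunks(size_t count, size_t chunk_factor,
                           size_t num_threads) {
  size_t offset = 0;
  size_t chunks = 0;
  while (offset < count) {
    /* The chunk never exceeds the remaining elements, because the divisor
     * is at least 1 and the fallback of 1 is only used when work remains. */
    offset += next_chunk_size(count, offset, chunk_factor, num_threads);
    chunks++;
  }
  return chunks;
}
```

With `count = 1000`, `num_threads = 4` and `chunk_factor = 2`, the first chunk is 125 elements and the next is 109; near the end of the range the chunk size bottoms out at 1. With `chunk_factor = 5` the first chunk shrinks to 50, so slower cores are less likely to be left holding one large chunk.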

…gies to match the tile traversal order of the other strategies. Also use the same logic for picking tiles within the range (owner picks from `range_start`, stealing threads from `range_end`) to reduce the number of atomic operations per tile. PiperOrigin-RevId: 723497649
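The owner/stealer split described above can be sketched with C11 atomics as follows. This is an illustration under stated assumptions, not the pthreadpool implementation: the struct `work_range` and the functions `work_range_init`, `try_claim`, `owner_pick` and `thief_pick` are hypothetical names, and the sketch uses the default sequentially consistent atomics rather than whatever memory ordering the library actually chooses.

```c
#include <assert.h>
#include <stdatomic.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical per-thread work range [range_start, range_end). */
struct work_range {
  size_t range_start;         /* advanced only by the owning thread */
  atomic_size_t range_end;    /* decremented by stealing threads */
  atomic_size_t range_length; /* items not yet claimed by anyone */
};

static void work_range_init(struct work_range* r, size_t start, size_t end) {
  assert(start <= end);
  r->range_start = start;
  atomic_init(&r->range_end, end);
  atomic_init(&r->range_length, end - start);
}

/* Claims one item by decrementing range_length unless it is already zero. */
static bool try_claim(struct work_range* r) {
  size_t length = atomic_load(&r->range_length);
  while (length != 0) {
    /* On failure, length is reloaded with the current value. */
    if (atomic_compare_exchange_weak(&r->range_length, &length, length - 1)) {
      return true;
    }
  }
  return false;
}

/* Owner takes the next item from the front; range_start needs no atomic
 * because only the owner ever touches it. */
static bool owner_pick(struct work_range* r, size_t* index) {
  if (!try_claim(r)) return false;
  *index = r->range_start++;
  return true;
}

/* A stealing thread takes the next item from the back. */
static bool thief_pick(struct work_range* r, size_t* index) {
  if (!try_claim(r)) return false;
  *index = atomic_fetch_sub(&r->range_end, 1) - 1;
  return true;
}
```

Because every pick first claims a slot in `range_length`, the total number of picks never exceeds the range size, so the ascending owner indices and the descending stolen indices cannot overlap. In this sketch the owner path costs a single atomic read-modify-write per item.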

Apple toolchains have been moved to apple_support from Bazel 7. See bazelbuild/bazel#16619. Those values won't match anymore. Also all those config_settings should move to platforms: https://bazel.build/extending/platforms PiperOrigin-RevId: 732118106

xnnpack-bot and others added 25 commits December 16, 2024 17:11

No public description

d5b1640

PiperOrigin-RevId: 706596019

Fix formatting style and add per-file licensing headers.

e4dda15

PiperOrigin-RevId: 706995443

Add basic CI workflows.

4e80ca2

PiperOrigin-RevId: 707095720

Fix cpuinfo include path.

39df650

PiperOrigin-RevId: 707224063

Use the c11 built-in atomic functions directly.

ba98306

PiperOrigin-RevId: 707559093

Use the c11 built-in atomic functions directly.

93fcce9

PiperOrigin-RevId: 707609059

Use the c11 built-in atomic functions directly.

c02f903

2nd try, fixed subtle difference in `pthreadpool_decrement_fetch_acquire_release_size_t` this time around. PiperOrigin-RevId: 708224758

Don't yield-loop before waiting on a condition.

847fb99

Unless we're on Android, where sleeping/waking is slow. PiperOrigin-RevId: 708363228

Wrap #define _GNU_SOURCE in an #ifndef in case it has already bee…

c90640f

…n defined. PiperOrigin-RevId: 713777960

Restore the LICENSE file.

b4fb4eb

PiperOrigin-RevId: 714131650

Clean up the unused num_tiles in the params for the new dynamic s…

e146941

…trategies. PiperOrigin-RevId: 716200764

Use c11's aligned_alloc instead of posix_memalign.

3489b62

PiperOrigin-RevId: 723539628

Use c11's aligned_alloc instead of posix_memalign.

2f0931e

PiperOrigin-RevId: 723596220

Don't just assume that we have aligned_alloc or posix_memalign, b…

d647393

…ut first check. PiperOrigin-RevId: 724250471

Fix bug in thread_parallelize_2d_tile_2d_dynamic when `tile_range_j…

f94ab76

… == 1`. PiperOrigin-RevId: 724342903

Use posix_memalign on Hexagon

5703882

Also update yanked version of rules_cc: https://github.com/google/pthreadpool/actions/runs/13320936827/job/37205321364 PiperOrigin-RevId: 726720282

Add 4d_tile_2d_dynamic to pthreadpool.

bd09d5c

PiperOrigin-RevId: 736869281

Add missing shims for pthreadpool_parallelize_4d_tile_2d_dynamic an…

4e1831c

…d `pthreadpool_parallelize_4d_tile_2d_dynamic_with_uarch`. PiperOrigin-RevId: 736903003

Remove unused variables.

b924477

PiperOrigin-RevId: 738347700

Make pthreadpool symbols weak to allow customizing the behavior by re…

706a8ea

…placing the implementation of pthreadpool functions. PiperOrigin-RevId: 740067423
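As a rough illustration of the weak-symbol mechanism this commit relies on, the sketch below marks a default implementation as weak using the GCC/Clang `__attribute__((weak))` extension. The function name `demo_get_threads_count` is hypothetical and is not a pthreadpool symbol; how pthreadpool itself declares its weak symbols is not shown on this page.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical default implementation. Because it is weak, a program that
 * links in another (strong) definition of demo_get_threads_count from a
 * different object file will use that definition instead, without a
 * duplicate-symbol error. Requires GCC or Clang. */
__attribute__((weak)) size_t demo_get_threads_count(void) {
  return 1;
}
```

If no other definition is linked in, calls resolve to the weak default above and return 1. A replacement would simply define `size_t demo_get_threads_count(void)` without the attribute in its own source file.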

Mark pthreadpool_get_threads_count and pthreadpool_destroy as wea…

da30a55

…k symbols PiperOrigin-RevId: 746173951

Add strong aliases to public pthreadpool functions.

290ee6f

PiperOrigin-RevId: 750312221

BwL1289 merged commit be3c5e9 into eugo-inc:master Apr 26, 2025

7 participants
