gh-112529: Make the GC scheduling thread-safe #114880

colesbury · 2024-02-01T21:21:35Z

The GC keeps track of the number of allocations (less deallocations) since the last GC. This change buffers the allocation count in thread-local state and uses atomic operations to modify the per-interpreter count. The thread-local buffering avoids contention on shared state.

A consequence is that the GC scheduling is not as precise, so "test_sneaky_frame_object" is skipped because it requires that the GC be run exactly after allocating a frame object.
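For readers who want a picture of the buffering described above, here is a minimal standalone sketch. It is not the code in this PR: the type names, `record_alloc`, `record_dealloc`, and the value of `SKETCH_LOCAL_THRESHOLD` are all hypothetical stand-ins, and the sketch only assumes that each thread owns one `local_gc_state` while all threads share one `shared_gc_state`.

```c
#include <assert.h>
#include <stdatomic.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical flush size; an assumption for this sketch only. */
#define SKETCH_LOCAL_THRESHOLD 512

/* Stand-in for the per-interpreter state shared by all threads. */
typedef struct {
    atomic_int count;   /* allocations minus deallocations since the last GC */
    int threshold;      /* a collection is due once count exceeds this */
} shared_gc_state;

/* Stand-in for the per-thread buffer; only its owning thread touches it. */
typedef struct {
    int alloc_count;
} local_gc_state;

/* Record one allocation. Returns true if the merged count exceeds the
 * shared threshold, i.e. the caller should schedule a collection. */
static bool
record_alloc(shared_gc_state *shared, local_gc_state *local)
{
    assert(shared != NULL && local != NULL);
    local->alloc_count++;
    if (local->alloc_count < SKETCH_LOCAL_THRESHOLD) {
        return false;   /* fast path: no shared memory is touched */
    }
    /* Slow path: publish the whole buffered delta with one atomic add. */
    int delta = local->alloc_count;
    int merged = atomic_fetch_add(&shared->count, delta) + delta;
    local->alloc_count = 0;
    return merged > shared->threshold;
}

/* Record one deallocation; the buffered delta may go negative. */
static void
record_dealloc(shared_gc_state *shared, local_gc_state *local)
{
    assert(shared != NULL && local != NULL);
    local->alloc_count--;
    if (local->alloc_count > -SKETCH_LOCAL_THRESHOLD) {
        return;
    }
    atomic_fetch_add(&shared->count, local->alloc_count);
    local->alloc_count = 0;
}
```

In this sketch, with `threshold` set to 700, a single thread does not report that a collection is due until its second flush, when the shared count reaches 1024. That gap is the loss of precision mentioned above.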

Issue: Make the garbage collector thread-safe in --disable-gil builds #112529

ericsnowcurrently

The GC is one of the runtime components with which I am least familiar, so I mostly have questions for you. 😄

Otherwise, the PR mostly makes sense.

Objects/typeobject.c

Python/gc_free_threading.c

ericsnowcurrently · 2024-02-02T22:55:00Z

The change here seems okay to me, but I'd feel better if one of the GC experts reviewed this before it's merged.

CC @markshannon @pablogsal @nascheme @DinoV @nanjekyejoannah

nascheme · 2024-02-04T00:50:20Z

I've not looked at the code but the idea of the change sounds fine to me. I suspect there are some users who require that the GC threshold is precise, like the test_sneaky_frame_object case. However, I don't think that's a reasonable thing and I think it's okay to break them. We are pretty likely to adjust how the thresholds work anyhow. Using atomic operations to count allocations/deallocations will be too expensive.

DinoV · 2024-02-14T22:12:43Z

Python/gc_free_threading.c

+    // We buffer the allocation count to avoid the overhead of atomic
+    // operations for every allocation.
+    gc->alloc_count++;
+    if (gc->alloc_count >= LOCAL_ALLOC_COUNT_THRESHOLD) {


I wonder if this could be tied to the configurable GC threshold so that the tests could continue to pass, but maybe it doesn't matter enough and the extra read isn't worth it.

colesbury · 2024-02-14

Yeah, I considered making it a configurable runtime threshold, but decided it wasn't worth it, at least for now.

I think there's a decent chance we change how we count allocations in the future. In the nogil forks, for example, I accounted for allocations in mi_page_to_full and _mi_page_unfull, which provides some natural batching and avoids the thread-local counter update that's done on every allocation here, but wouldn't allow for a configurable threshold. I haven't attempted that yet because I'd like some performance measurements to justify it first.
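To illustrate the kind of batching that page-level accounting gives, here is a rough standalone sketch. It is an assumption about the general shape of the idea, not the nogil fork's code and not mimalloc's API: `sketch_page`, `sketch_page_alloc`, `sketch_page_free`, and `sketch_gc_count` are hypothetical names, and the sketch simply adjusts a shared count by a whole page's capacity when a page becomes full or stops being full.

```c
#include <assert.h>
#include <stdatomic.h>
#include <stddef.h>

/* Hypothetical stand-in for an allocator page of fixed-size blocks. */
typedef struct {
    size_t capacity;   /* number of blocks the page can hold */
    size_t used;       /* number of blocks currently allocated */
} sketch_page;

/* Hypothetical shared count, adjusted only at page transitions. */
static atomic_long sketch_gc_count;

/* Allocate one block. The shared count changes only when the page fills,
 * so the batch size is the page capacity rather than a tunable value. */
static void
sketch_page_alloc(sketch_page *page)
{
    assert(page->used < page->capacity);
    page->used++;
    if (page->used == page->capacity) {
        atomic_fetch_add(&sketch_gc_count, (long)page->capacity);
    }
}

/* Free one block. Leaving the full state undoes the page's contribution. */
static void
sketch_page_free(sketch_page *page)
{
    assert(page->used > 0);
    if (page->used == page->capacity) {
        atomic_fetch_sub(&sketch_gc_count, (long)page->capacity);
    }
    page->used--;
}
```

Because the batch size here is fixed by the page layout, there is nothing to tune at runtime, which matches the trade-off described in the comment above.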

DinoV

LGTM!

bedevere-bot · 2024-02-14T23:02:14Z

🤖 New build scheduled with the buildbot fleet by @colesbury for commit f99d14e 🤖

If you want to schedule another build, you need to add the 🔨 test-with-buildbots label again.

colesbury requested review from DinoV, nascheme and pablogsal February 1, 2024 21:21

colesbury requested review from ericsnowcurrently and markshannon as code owners February 1, 2024 21:21

bedevere-app bot added the awaiting review label Feb 1, 2024

bedevere-app bot mentioned this pull request Feb 1, 2024

Make the garbage collector thread-safe in --disable-gil builds #112529

Closed

3 tasks

colesbury added the skip news label Feb 1, 2024

ericsnowcurrently reviewed Feb 2, 2024

Fix warning

e832dd9

colesbury added 2 commits February 6, 2024 15:38

Merge branch 'main' into pythongh-112529-gc-schedule

483b37e

Skip test_gc.test_get_count() in builds

456b778

colesbury mentioned this pull request Feb 9, 2024

gh-112175: Add eval_breaker to PyThreadState #115194

Merged

Merge branch 'main' into pythongh-112529-gc-schedule

f99d14e

DinoV reviewed Feb 14, 2024

DinoV approved these changes Feb 14, 2024

bedevere-app bot added awaiting merge and removed awaiting review labels Feb 14, 2024

colesbury added the 🔨 test-with-buildbots label Feb 14, 2024

bedevere-bot removed the 🔨 test-with-buildbots label Feb 14, 2024

colesbury merged commit b24c916 into python:main Feb 16, 2024
119 checks passed

colesbury deleted the gh-112529-gc-schedule branch February 16, 2024 16:22

bedevere-app bot removed the awaiting merge label Feb 16, 2024
