GH-118095: Unify the behavior of tier 2 `FOR_ITER` branch micro-ops #118420

markshannon · 2024-04-30T09:17:41Z

Simplifies and unifies the behavior of

* _GUARD_NOT_EXHAUSTED_RANGE
* _GUARD_NOT_EXHAUSTED_LIST
* _GUARD_NOT_EXHAUSTED_TUPLE
* _FOR_ITER_TIER_TWO

so that all of them leave just the iterator on the stack and exit to the POP_TOP immediately after the associated END_FOR.

This fixes a bug in the tier 2 handling of _FOR_ITER_TIER_TWO where errors were treated as occurring at the jump target, not at the original instruction.
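To make the unified exit behavior concrete, here is a minimal toy model, not CPython code. `ToyListIter`, `ToyStack`, `toy_guard_not_exhausted_list`, `toy_pop_top` and `toy_run_loop` are hypothetical names invented for this sketch. It only illustrates the stack discipline described above: when the guard fails, just the iterator is left on the stack, and the POP_TOP at the exit target discards it.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical stand-in for a list iterator; not a CPython type. */
typedef struct {
    const int *items;
    size_t len;
    size_t index;
} ToyListIter;

/* Toy value stack that holds only iterator pointers. */
typedef struct {
    ToyListIter *slots[8];
    int depth;
} ToyStack;

/* Models a guard micro-op: true means "stay in the trace", false means
 * "take the side exit". The stack is never modified, so on exit the
 * iterator is the only thing the loop leaves on the stack. */
static bool
toy_guard_not_exhausted_list(const ToyStack *stack)
{
    assert(stack->depth > 0);
    const ToyListIter *it = stack->slots[stack->depth - 1];
    return it->index < it->len;
}

/* Models the POP_TOP at the exit target, which discards the iterator. */
static void
toy_pop_top(ToyStack *stack)
{
    assert(stack->depth > 0);
    stack->depth--;
}

/* Runs a summing loop and reports the stack depth at the side exit and
 * after the POP_TOP that follows it. */
static int
toy_run_loop(ToyListIter *it, int *depth_at_exit, int *final_depth)
{
    ToyStack stack = { .depth = 0 };
    stack.slots[stack.depth++] = it;        /* result of GET_ITER */
    int total = 0;
    while (toy_guard_not_exhausted_list(&stack)) {
        total += it->items[it->index++];    /* loop body */
    }
    *depth_at_exit = stack.depth;           /* iterator still on the stack */
    toy_pop_top(&stack);
    *final_depth = stack.depth;
    return total;
}
```

Calling `toy_run_loop` on an iterator over `{1, 2, 3}` returns 6, with a depth of 1 at the exit and 0 after the pop. Because every guard exits to the same kind of target with the same stack shape, the exit handling does not need a per-variant case.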

Issue: Increase the number of micro-ops that we can handle in tier 2 #118095


markshannon · 2024-05-01T12:45:05Z

The stats are a bit confusing.

The number of traces executed goes up, but the number of uops executed goes down, as we would expect.
However, there is a large increase in the number of tier 1 FOR_ITER_TUPLE and FOR_ITER_LIST instructions executed.

Looking at the number of instructions executed for the various tier 1 and tier 2 FOR_ITER variants, what's happening becomes clearer:

Specialized

FOR_ITER_TUPLE +118M
FOR_ITER_LIST +236M
FOR_ITER_RANGE +22M

_ITER_CHECK_TUPLE +165M
_ITER_CHECK_LIST +87M
_ITER_CHECK_RANGE +65M

Unspecialized

FOR_ITER -16M
_FOR_ITER_TIER_TWO -1081M

Specialization is improving: we are executing more of the specialized tier 1 and tier 2 variants and far fewer of the unspecialized FOR_ITER and _FOR_ITER_TIER_TWO.
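As a quick sanity check on the figures quoted above, the deltas can be tallied in a few lines. This is only arithmetic on the numbers in this comment; the helper names are hypothetical, and the tier 1 and tier 2 counts are not strictly comparable one-to-one, so the net figure is a rough indication rather than a measurement.

```c
#include <assert.h>
#include <stddef.h>

/* Deltas in millions of executed instructions, copied from the comment. */
static const int toy_specialized_deltas[] = {118, 236, 22, 165, 87, 65};
static const int toy_unspecialized_deltas[] = {-16, -1081};

/* Sums `count` integers starting at `values`. */
static int
toy_sum(const int *values, size_t count)
{
    assert(values != NULL);
    int total = 0;
    for (size_t i = 0; i < count; i++) {
        total += values[i];
    }
    return total;
}

/* Total change across the specialized tier 1 and tier 2 variants. */
static int
toy_specialized_total(void)
{
    return toy_sum(toy_specialized_deltas,
                   sizeof(toy_specialized_deltas) / sizeof(int));
}

/* Total change across FOR_ITER and _FOR_ITER_TIER_TWO. */
static int
toy_unspecialized_total(void)
{
    return toy_sum(toy_unspecialized_deltas,
                   sizeof(toy_unspecialized_deltas) / sizeof(int));
}
```

The specialized variants add up to +693M and the unspecialized ones to -1097M, so the drop in unspecialized executions outweighs the rise in specialized ones by roughly 404M.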

brandtbucher · 2024-05-01T16:34:23Z

The test hangs for tier two seem to be in test_capi.test_misc.TestPendingCalls. I have a hunch the culprit is GH-117442... so that should probably be fixed first?

gvanrossum

Seem to be some unrelated cleanups -- maybe minimize those or extract them to another PR? Otherwise LGTM.

gvanrossum · 2024-05-01T21:09:46Z

Python/optimizer.c

@@ -23,6 +23,18 @@

 #define MAX_EXECUTORS_SIZE 256

+#ifdef Py_DEBUG
+static int base_opcode(PyCodeObject *code, int offset)


Suggested change

-static int base_opcode(PyCodeObject *code, int offset)
+static int
+base_opcode(PyCodeObject *code, int offset)

gvanrossum · 2024-05-01T21:51:58Z

Python/optimizer_symbols.c

+    if (_Py_uop_sym_is_not_null(sym)) {
+        sym_set_bottom(sym);
+        return false;
+    }
    sym_set_flag(sym, IS_NULL);
-    return !_Py_uop_sym_is_bottom(sym);
+    return true;


Does this refactoring matter? If so, why not do the same for set_non_null below?

markshannon · 2024-05-02

It is not a refactoring. Calling _Py_uop_sym_set_null on a non-NULL symbol would fail an assertion in _Py_uop_sym_is_bottom.

And yes, it should be applied to set_non_null as well.
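The point of the reply can be sketched with a toy symbol type. Everything here is hypothetical and written for illustration: `ToySym`, the flag values, and in particular the invariant asserted in `toy_sym_is_bottom` (a contradictory symbol carries no type information) are assumptions, not CPython's definitions. The sketch shows why checking for a contradiction before setting the flag matters, and what the mirrored fix for the non-null setter might look like.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical flag bits; not CPython's values. */
enum { TOY_IS_NULL = 1, TOY_NOT_NULL = 2 };

typedef struct {
    int flags;
    const char *type_name;   /* NULL when no type is known */
} ToySym;

static bool
toy_sym_is_null(const ToySym *sym)
{
    return (sym->flags & (TOY_IS_NULL | TOY_NOT_NULL)) == TOY_IS_NULL;
}

static bool
toy_sym_is_not_null(const ToySym *sym)
{
    return (sym->flags & (TOY_IS_NULL | TOY_NOT_NULL)) == TOY_NOT_NULL;
}

/* Marks the symbol as contradictory and drops its type information. */
static void
toy_sym_set_bottom(ToySym *sym)
{
    sym->flags = TOY_IS_NULL | TOY_NOT_NULL;
    sym->type_name = NULL;
}

static bool
toy_sym_is_bottom(const ToySym *sym)
{
    if ((sym->flags & TOY_IS_NULL) && (sym->flags & TOY_NOT_NULL)) {
        /* Assumed invariant: bottom symbols have no type. Setting
         * TOY_IS_NULL directly on a typed non-null symbol would trip this. */
        assert(sym->type_name == NULL);
        return true;
    }
    return false;
}

/* Check for the contradiction first, then set the flag. */
static bool
toy_sym_set_null(ToySym *sym)
{
    if (toy_sym_is_not_null(sym)) {
        toy_sym_set_bottom(sym);
        return false;
    }
    sym->flags |= TOY_IS_NULL;
    return true;
}

/* The same pattern mirrored for the non-null setter. */
static bool
toy_sym_set_non_null(ToySym *sym)
{
    if (toy_sym_is_null(sym)) {
        toy_sym_set_bottom(sym);
        return false;
    }
    sym->flags |= TOY_NOT_NULL;
    return true;
}
```

In this model, calling `toy_sym_set_null` on a symbol that is known to be a non-null `int` returns false and leaves a well-formed bottom symbol, so a later `toy_sym_is_bottom` call passes its assertion.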

gvanrossum · 2024-05-01T22:15:27Z

Python/optimizer.c

+#ifdef Py_DEBUG
+                                uint32_t next_inst = target + 1 + INLINE_CACHE_ENTRIES_FOR_ITER + (oparg > 255);
+                                uint32_t jump_target = next_inst + oparg;
+                                assert(base_opcode(code, jump_target) == END_FOR ||
+                                       base_opcode(code, jump_target) == INSTRUMENTED_END_FOR);
+                                assert(base_opcode(code, jump_target+1) == POP_TOP);
+#endif


Probably best to also enclose the case in { ... } to limit the scope of the two variables declared in debug mode.
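A small self-contained sketch of both points: the target arithmetic from the hunk above and the braces around the case body. The opcode values, `TOY_CACHE_ENTRIES_FOR_ITER` and `toy_for_iter_exit_target` are hypothetical and chosen for illustration; the single inline cache entry is an assumption of this sketch, not a statement about CPython.

```c
#include <assert.h>
#include <stdint.h>

/* Toy opcodes for this sketch only; the values do not match CPython's. */
enum { TOY_FOR_ITER = 1, TOY_CACHE, TOY_NOP, TOY_END_FOR, TOY_POP_TOP };

/* Assumed for illustration: one inline cache entry after FOR_ITER. */
#define TOY_CACHE_ENTRIES_FOR_ITER 1

/* Returns the index of the POP_TOP that follows the END_FOR matching a
 * FOR_ITER, or 0 for any other opcode. `target` is the index where the
 * instruction starts; as in the quoted hunk, an oparg above 255 adds one
 * code unit for an EXTENDED_ARG prefix. */
static uint32_t
toy_for_iter_exit_target(const uint8_t *code, int opcode,
                         uint32_t target, uint32_t oparg)
{
    uint32_t result = 0;
    switch (opcode) {
        case TOY_FOR_ITER: {
            /* The braces keep these locals scoped to this case only. */
            uint32_t next_inst = target + 1 + TOY_CACHE_ENTRIES_FOR_ITER
                                 + (oparg > 255);
            uint32_t jump_target = next_inst + oparg;
            assert(code[jump_target] == TOY_END_FOR);
            assert(code[jump_target + 1] == TOY_POP_TOP);
            result = jump_target + 1;
            break;
        }
        default:
            break;
    }
    return result;
}
```

For the toy sequence `FOR_ITER, CACHE, NOP, NOP, END_FOR, POP_TOP` with `target` 0 and `oparg` 2, the next instruction is at index 2, the jump target (END_FOR) at index 4, and the exit target (POP_TOP) at index 5.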

…ps (pythonGH-118420) * Target _FOR_ITER_TIER_TWO at POP_TOP following the matching END_FOR * Modify _GUARD_NOT_EXHAUSTED_RANGE, _GUARD_NOT_EXHAUSTED_LIST and _GUARD_NOT_EXHAUSTED_TUPLE so that they also target the POP_TOP following the matching END_FOR

markshannon changed the title ~~118095: Unify the behavior of tier 2 FOR_ITER branch micro-ops~~ GH-118095: Unify the behavior of tier 2 FOR_ITER branch micro-ops Apr 30, 2024

bedevere-app bot mentioned this pull request Apr 30, 2024

Increase the number of micro-ops that we can handle in tier 2 #118095

Open

markshannon added the skip news label Apr 30, 2024

markshannon mentioned this pull request May 1, 2024

GH-118095: Make invalidating and clearing executors memory safe #118459

Merged

markshannon added 4 commits May 1, 2024 11:48

Target _FOR_ITER_TIER_TWO at POP_TOP following END_FOR

f8bd566

Move handling of _FOR_ITER_TIER_TWO exits from trace creation to the …

3bac858

…prepare for execution step.

Extend treatment of _FOR_ITER_TIER_TWO to all FOR_ITER tier 2 tests (…

b83d053

…and fix off by one error)

Fix a minor bug in optimizer symbols

bb7efd4

markshannon force-pushed the unify-tier-2-for-iter branch from 25a889a to bb7efd4 Compare May 1, 2024 11:01

markshannon added 2 commits May 1, 2024 14:53

Fix stats for non-tier-2 build

983b7c8

Fix tier 2 build

df2792f

gvanrossum reviewed May 1, 2024


markshannon added 3 commits May 2, 2024 11:31

Merge branch 'main' into unify-tier-2-for-iter

d2af12f

Address review comments

98b517d

Merge branch 'main' into unify-tier-2-for-iter

aa35032

gvanrossum approved these changes May 2, 2024


bedevere-app bot added the awaiting merge label May 2, 2024

markshannon marked this pull request as ready for review May 2, 2024 15:14

bedevere-app bot added awaiting core review and removed awaiting merge labels May 2, 2024

markshannon merged commit 72867c9 into python:main May 2, 2024
54 checks passed

bedevere-app bot removed the awaiting core review label May 2, 2024

markshannon deleted the unify-tier-2-for-iter branch May 2, 2024 15:18
