Specialized ops by youknowone · Pull Request #7301 · RustPython/RustPython

youknowone · 2026-03-01T15:56:36Z

Summary by CodeRabbit

Refactor
- Added internal version tracking for dictionaries, functions, and types to improve mutation detection and caching.
- Introduced fast-path iteration for lists, tuples, and ranges to speed up common loops.
- Added internal accessors and tightened attribute-handling visibility to strengthen runtime state management.
Bug Fixes
- Added runtime assertions to validate function-call preconditions in hot paths.

coderabbitai · 2026-03-01T15:56:47Z

Note

Reviews paused

It looks like this branch is under active development. To avoid overwhelming you with review comments due to an influx of new commits, CodeRabbit has automatically paused this review. You can configure this behavior by changing the reviews.auto_review.auto_pause_after_reviewed_commits setting.

Use the following commands to manage reviews:

@coderabbitai resume to resume automatic reviews.
@coderabbitai review to trigger a single review.

Use the checkboxes below for quick actions:

▶️ Resume reviews
🔍 Trigger review

📝 Walkthrough

Walkthrough

Adds mutation-version tracking to Dict and exposes it on PyDict; introduces crate-visible fast-path fast_next accessors for list, tuple, and range iterators; adds function and type version allocation with exhaustion handling and get_version_for_current_state(); and makes small crate-visible API tweaks (slot_setattro, PyProperty::get_fget).

Changes

Cohort / File(s)	Summary
Dictionary internals & PyDict `crates/vm/src/dict_inner.rs`, `crates/vm/src/builtins/dict.rs`	Added `version: AtomicU64` to `Dict<T>`, `bump_version()` helper, incremented version on mutation paths, initialized `version` in Clone/Default, and exposed `pub(crate) fn version(&self) -> u64` on `PyDict`. Many mutation sites modified.
Iterator fast-paths `crates/vm/src/builtins/list.rs`, `crates/vm/src/builtins/tuple.rs`, `crates/vm/src/builtins/range.rs`	Added `pub(crate) fn fast_next(...) -> Option<...>` for `PyListIterator` and `PyTupleIterator`; replaced previous `next_fast` with `fast_next` for `PyRangeIterator` and made `next()` delegate to it. Public iterator behavior unchanged; new crate-visible fast accessors added.
Function versioning & asserts `crates/vm/src/builtins/function.rs`	Added `fn next_func_version() -> u32`, switched function initialization to use it, and added `get_version_for_current_state()` on `PyFunction`/`Py<PyFunction>` (returns 0 when exhausted). Added runtime-only `debug_assert!` checks in `invoke_exact_args`.
Type version allocation `crates/vm/src/builtins/type.rs`	Introduced module-scoped `NEXT_TYPE_VERSION: AtomicU32` and clarified `PyType::assign_version_tag()` semantics to return 0 when the counter is exhausted; signature unchanged.
Crate-visible API tweaks `crates/vm/src/builtins/object.rs`, `crates/vm/src/builtins/property.rs`	Changed `slot_setattro` visibility to `pub(crate)` and added `pub(crate) fn get_fget(&self) -> Option<PyObjectRef>` on `PyProperty` (crate-private accessor mirroring existing getter).

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~45 minutes

Poem

🐰 I count the bumps where versions climb,

My whiskers twitch at every fast_next rhyme,
Keys hop up when mutations show,
Functions wear tags, quiet and low,
A rabbit cheers this tidy time.

🚥 Pre-merge checks | ✅ 2 | ❌ 1

❌ Failed checks (1 inconclusive)

Check name	Status	Explanation	Resolution
Title check	❓ Inconclusive	The title 'Specialized ops' is vague and generic, using a non-descriptive term that doesn't convey meaningful information about the changeset's actual purpose or scope.	Consider a more specific title that indicates the main changes, such as 'Add mutation tracking and fast-path accessors for dict/list/tuple iterators' or 'Implement version tracking for PyDict and function state management'.

✅ Passed checks (2 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Docstring Coverage	✅ Passed	Docstring coverage is 94.74% which is sufficient. The required threshold is 80.00%.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing Touches

🧪 Generate unit tests (beta)

Create PR with unit tests
Post copyable unit tests in a comment

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

github-actions · 2026-03-02T04:22:27Z

Code has been automatically formatted

The code in this PR has been formatted using:

cargo fmt --all
Please pull the latest changes before pushing again:

git pull origin specialization

coderabbitai

Actionable comments posted: 2

🧹 Nitpick comments (1)

crates/vm/src/builtins/range.rs (1)

663-674: Avoid duplicating step logic between next_fast and fast_next.

Both methods currently implement the same increment/check path. Please keep one source of truth to prevent divergence.

♻️ Suggested simplification

 impl PyRangeIterator {
     /// Fast path for FOR_ITER specialization. Returns the next isize value
     /// without allocating PyInt or PyIterReturn.
     pub(crate) fn fast_next(&self) -> Option<isize> {
-        let index = self.index.fetch_add(1);
-        if index < self.length {
-            Some(self.start + (index as isize) * self.step)
-        } else {
-            None
-        }
+        self.next_fast()
     }
 }

As per coding guidelines: "When branches differ only in a value but share common logic, extract the differing value first, then call the common logic once to avoid duplicate code."

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@crates/vm/src/builtins/range.rs` around lines 663 - 674, The iteration logic
in PyRangeIterator is duplicated between fast_next and fast_next (the fast path)
— extract the shared increment/check logic into a single helper (e.g., a private
method on PyRangeIterator like next_index_or_none or advance_and_get_index) and
have both next_fast and fast_next call that helper to compute the next
index/value; ensure the helper uses self.index.fetch_add(1), compares to
self.length, and returns either the computed isize value (start + index * step)
or None so both next_fast and fast_next reuse the same implementation and avoid
divergence.

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Inline comments:
In `@crates/vm/src/builtins/function.rs`:
- Around line 611-621: The get_version_for_current_state implementation allows
FUNC_VERSION_COUNTER to wrap and reuse version tags; update it to atomically
advance FUNC_VERSION_COUNTER without ever returning a recycled nonzero version
by using an atomic compare-and-swap/loop (or Atomics::fetch_update) that: loads
the current counter (FUNC_VERSION_COUNTER), if it's 0 or u32::MAX treat as
exhausted and return 0, otherwise compute next = current.wrapping_add(1) and
attempt compare_exchange to set it to next; once you successfully set the global
counter, store that nonzero new version into self.func_version and return it;
reference get_version_for_current_state, FUNC_VERSION_COUNTER, and
self.func_version for locating the change.

In `@crates/vm/src/dict_inner.rs`:
- Around line 263-270: The version() accessor and bump_version() updater must
use proper Acquire/Release ordering and bump_version must be called while
holding the dict write lock: change version() to load with Acquire and
bump_version() to fetch_add with Release (function names: version and
bump_version in dict_inner.rs), and move every call to bump_version so it
executes inside the same write-lock critical section where the dictionary
mutation occurs (the callers in this repo include the mutation sites referenced
in frame.rs that read the version lock-free for LOAD_GLOBAL caching); ensure the
write lock remains held across the mutation and the bump_version call so the
Release store synchronizes with readers that use Acquire.

---

Nitpick comments:
In `@crates/vm/src/builtins/range.rs`:
- Around line 663-674: The iteration logic in PyRangeIterator is duplicated
between fast_next and fast_next (the fast path) — extract the shared
increment/check logic into a single helper (e.g., a private method on
PyRangeIterator like next_index_or_none or advance_and_get_index) and have both
next_fast and fast_next call that helper to compute the next index/value; ensure
the helper uses self.index.fetch_add(1), compares to self.length, and returns
either the computed isize value (start + index * step) or None so both next_fast
and fast_next reuse the same implementation and avoid divergence.

ℹ️ Review info

Configuration used: Path: .coderabbit.yml

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between c315033 and b870090.

📒 Files selected for processing (10)

crates/vm/src/builtins/dict.rs
crates/vm/src/builtins/function.rs
crates/vm/src/builtins/list.rs
crates/vm/src/builtins/object.rs
crates/vm/src/builtins/property.rs
crates/vm/src/builtins/range.rs
crates/vm/src/builtins/tuple.rs
crates/vm/src/builtins/type.rs
crates/vm/src/dict_inner.rs
crates/vm/src/frame.rs

crates/vm/src/builtins/function.rs

coderabbitai · 2026-03-02T08:01:20Z

crates/vm/src/dict_inner.rs

+    pub fn version(&self) -> u64 {
+        self.version.load(Relaxed)
+    }
+
+    /// Bump the version counter after any mutation.
+    fn bump_version(&self) {
+        self.version.fetch_add(1, Relaxed);
+    }


⚠️ Potential issue | 🟠 Major

🧩 Analysis chain

🏁 Script executed:

# 1) Show where version bumps happen relative to lock scopes. rg -n -C4 'bump_version\(' crates/vm/src/dict_inner.rs

Repository: RustPython/RustPython

Length of output: 2663

🏁 Script executed:

# 2) Show consumers of dict version values (to validate whether reads are lock-free). rg -n -C4 '\.version\(\)' crates/vm/src

Repository: RustPython/RustPython

Length of output: 3008

🏁 Script executed:

# 3) Get the full context around lines 263-270 and surrounding code head -n 300 crates/vm/src/dict_inner.rs | tail -n +250

Repository: RustPython/RustPython

Length of output: 1941

Use Acquire/Release ordering for version field and perform version bumps under the write lock.

The version() and bump_version() methods use Relaxed ordering. Currently, bump_version() is called after the write lock is released at lines 319, 383, 441, 465, 499, 536, 738, and 756. Meanwhile, version() is read lock-free in frame.rs (lines 4644, 4680, 7076) to validate LOAD_GLOBAL specialization caches. With Relaxed ordering, a reader may see stale version values after mutations complete, causing cache validation to incorrectly accept invalidated caches.

🔧 Suggested direction

-use core::sync::atomic::{AtomicU64, Ordering::Relaxed}; +use core::sync::atomic::{AtomicU64, Ordering::{Acquire, Release}}; ... pub fn version(&self) -> u64 { - self.version.load(Relaxed) + self.version.load(Acquire) } ... fn bump_version(&self) { - self.version.fetch_add(1, Relaxed); + self.version.fetch_add(1, Release); }

Also move each bump_version() call into the same write-lock critical section as the corresponding mutation to ensure version updates are ordered with respect to dict mutations.

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In `@crates/vm/src/dict_inner.rs` around lines 263 - 270, The version() accessor and bump_version() updater must use proper Acquire/Release ordering and bump_version must be called while holding the dict write lock: change version() to load with Acquire and bump_version() to fetch_add with Release (function names: version and bump_version in dict_inner.rs), and move every call to bump_version so it executes inside the same write-lock critical section where the dictionary mutation occurs (the callers in this repo include the mutation sites referenced in frame.rs that read the version lock-free for LOAD_GLOBAL caching); ensure the write lock remains held across the mutation and the bump_version call so the Release store synchronizes with readers that use Acquire.

github-actions · 2026-03-02T12:18:47Z

📦 Library Dependencies

The following Lib/ modules were modified. Here are their dependencies:

[x] lib: cpython/Lib/doctest.py
[ ] test: cpython/Lib/test/test_doctest (TODO: 6)

dependencies:

doctest

dependent tests: (33 tests)

doctest: test_builtin test_cmd test_code test_collections test_ctypes test_decimal test_deque test_descrtut test_difflib test_doctest test_doctest2 test_enum test_extcall test_generators test_getopt test_heapq test_http_cookies test_itertools test_listcomps test_math test_metaclass test_pep646_syntax test_pickle test_pickletools test_setcomps test_statistics test_syntax test_threading_local test_typing test_unpack test_unpack_ex test_weakref test_zipimport

Legend:

[+] path exists in CPython
[x] up-to-date, [ ] outdated

coderabbitai

♻️ Duplicate comments (1)

crates/vm/src/dict_inner.rs (1)

262-270: ⚠️ Potential issue | 🟠 Major

Use Acquire/Release ordering for dict version invalidation.

Line 264 and Line 269 currently use Relaxed. For a version counter read lock-free by specialization paths, this can admit stale version observations on weak memory models.

🔧 Suggested fix

-use core::sync::atomic::{AtomicU64, Ordering::Relaxed};
+use core::sync::atomic::{
+    AtomicU64,
+    Ordering::{Acquire, Release},
+};
...
 pub fn version(&self) -> u64 {
-    self.version.load(Relaxed)
+    self.version.load(Acquire)
 }
...
 fn bump_version(&self) {
-    self.version.fetch_add(1, Relaxed);
+    self.version.fetch_add(1, Release);
 }

Run this read-only verification to confirm ordering and usage sites:

#!/bin/bash
set -euo pipefail

# Inspect ordering in dict version API.
rg -n -C3 'AtomicU64|Ordering::|fn version\(|fn bump_version\(|fetch_add|load\(' crates/vm/src/dict_inner.rs

# Find version readers across VM (including specialization paths).
rg -n -C3 '\.version\(\)' crates/vm/src

# Review all mutation sites that bump the version.
rg -n -C2 'bump_version\(' crates/vm/src/dict_inner.rs

As per coding guidelines "Follow Rust best practices for error handling and memory management".

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@crates/vm/src/dict_inner.rs` around lines 262 - 270, The version counter
currently uses Relaxed ordering which can allow stale reads; change the atomic
load in fn version(&self) to use Acquire ordering and change the fetch_add in fn
bump_version(&self) to use Release ordering on self.version so writers publish
updates and readers observe them (i.e., replace Relaxed with Acquire in
version() and with Release in bump_version()); after changing, run the suggested
searches for .version() and bump_version() to ensure all readers/writers rely on
the Acquire/Release semantics and update any comments indicating the stronger
memory ordering.

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Duplicate comments:
In `@crates/vm/src/dict_inner.rs`:
- Around line 262-270: The version counter currently uses Relaxed ordering which
can allow stale reads; change the atomic load in fn version(&self) to use
Acquire ordering and change the fetch_add in fn bump_version(&self) to use
Release ordering on self.version so writers publish updates and readers observe
them (i.e., replace Relaxed with Acquire in version() and with Release in
bump_version()); after changing, run the suggested searches for .version() and
bump_version() to ensure all readers/writers rely on the Acquire/Release
semantics and update any comments indicating the stronger memory ordering.

ℹ️ Review info

Configuration used: Path: .coderabbit.yml

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between b870090 and 87bd158.

⛔ Files ignored due to path filters (1)

Lib/test/test_doctest/test_doctest.py is excluded by !Lib/**

📒 Files selected for processing (10)

crates/vm/src/builtins/dict.rs
crates/vm/src/builtins/function.rs
crates/vm/src/builtins/list.rs
crates/vm/src/builtins/object.rs
crates/vm/src/builtins/property.rs
crates/vm/src/builtins/range.rs
crates/vm/src/builtins/tuple.rs
crates/vm/src/builtins/type.rs
crates/vm/src/dict_inner.rs
crates/vm/src/frame.rs

🚧 Files skipped from review as they are similar to previous changes (5)

crates/vm/src/builtins/range.rs
crates/vm/src/builtins/dict.rs
crates/vm/src/builtins/list.rs
crates/vm/src/builtins/type.rs
crates/vm/src/builtins/tuple.rs

coderabbitai

Actionable comments posted: 1

♻️ Duplicate comments (1)

crates/vm/src/dict_inner.rs (1)

22-22: ⚠️ Potential issue | 🟠 Major

Use Acquire/Release for version atomics (previously reported, still unresolved).

Line 264 and Line 269 still use Relaxed. For a mutation-version invalidation signal, this can leave dict writes unsynchronized with lock-free version readers and allow stale cache validation.

Proposed fix

-use core::sync::atomic::{AtomicU64, Ordering::Relaxed};
+use core::sync::atomic::{
+    AtomicU64,
+    Ordering::{Acquire, Release},
+};
...
     pub fn version(&self) -> u64 {
-        self.version.load(Relaxed)
+        self.version.load(Acquire)
     }
...
     fn bump_version(&self) {
-        self.version.fetch_add(1, Relaxed);
+        self.version.fetch_add(1, Release);
     }

#!/bin/bash
# Verify atomic ordering at the definition site and inspect version() consumers.
rg -n -C3 'AtomicU64|Ordering::|version\.load|fetch_add\(' crates/vm/src/dict_inner.rs
rg -n -C3 '\.version\(\)' crates/vm/src

Also applies to: 262-270

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@crates/vm/src/dict_inner.rs` at line 22, The atomic version uses
Ordering::Relaxed which can break synchronization between writers and lock-free
readers; change the ordering so readers use Acquire on version loads (e.g. any
call sites of version().load(...) or version.read paths) and writers use Release
on updates (e.g. the AtomicU64.fetch_add(...) or any version.store(...) in
DictInner), or use Acquire on loads and Release on the mutation operation in the
methods named around version()/fetch_add in dict inner; update the AtomicU64
operations referenced (version(), fetch_add, version.load) to use
Ordering::Acquire for reads and Ordering::Release for writes to ensure proper
acquire–release synchronization.

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Inline comments:
In `@crates/vm/src/builtins/range.rs`:
- Around line 653-657: fast_next currently uses self.index.fetch_add(1) which
can wrap at usize::MAX and later casts that usize to isize (index as isize),
causing wrong values for huge ranges; fix by replacing the fetch_add with an
atomic fetch_update (or a compare-and-swap loop) that first reads current =
self.index.load(...), returns None if current >= self.length, otherwise attempts
to increment to current + 1 and, on success, uses that non-wrapping current as
the element index; then convert the chosen current index to isize using
isize::try_from(current).ok_or_else(|| ...) and compute the element with checked
arithmetic (use current_isize.checked_mul(self.step) and then
checked_add(self.start)) and return None on overflow — reference the fast_next
function and the fields self.index, self.length, self.start, and self.step when
making these changes.

---

Duplicate comments:
In `@crates/vm/src/dict_inner.rs`:
- Line 22: The atomic version uses Ordering::Relaxed which can break
synchronization between writers and lock-free readers; change the ordering so
readers use Acquire on version loads (e.g. any call sites of version().load(...)
or version.read paths) and writers use Release on updates (e.g. the
AtomicU64.fetch_add(...) or any version.store(...) in DictInner), or use Acquire
on loads and Release on the mutation operation in the methods named around
version()/fetch_add in dict inner; update the AtomicU64 operations referenced
(version(), fetch_add, version.load) to use Ordering::Acquire for reads and
Ordering::Release for writes to ensure proper acquire–release synchronization.

ℹ️ Review info

Configuration used: Path: .coderabbit.yml

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 87bd158 and 5b6f414.

⛔ Files ignored due to path filters (1)

Lib/test/test_doctest/test_doctest.py is excluded by !Lib/**

📒 Files selected for processing (10)

crates/vm/src/builtins/dict.rs
crates/vm/src/builtins/function.rs
crates/vm/src/builtins/list.rs
crates/vm/src/builtins/object.rs
crates/vm/src/builtins/property.rs
crates/vm/src/builtins/range.rs
crates/vm/src/builtins/tuple.rs
crates/vm/src/builtins/type.rs
crates/vm/src/dict_inner.rs
crates/vm/src/frame.rs

🚧 Files skipped from review as they are similar to previous changes (6)

crates/vm/src/builtins/property.rs
crates/vm/src/builtins/tuple.rs
crates/vm/src/builtins/dict.rs
crates/vm/src/builtins/object.rs
crates/vm/src/builtins/function.rs
crates/vm/src/builtins/list.rs

crates/vm/src/builtins/range.rs

- dict version: Relaxed → Acquire/Release ordering - range iterator: deduplicate fast_next/next_fast

- Add debug_assert preconditions in invoke_exact_args - Add get_version_for_current_state() for lazy version reassignment after func_version invalidation - Document NEXT_TYPE_VERSION overflow policy

- COMPARE_OP: CompareOpInt, CompareOpFloat, CompareOpStr - TO_BOOL: ToBoolBool, ToBoolInt, ToBoolNone, ToBoolList, ToBoolStr - FOR_ITER: ForIterRange, ForIterList, ForIterTuple with fast_next() - LOAD_GLOBAL: LoadGlobalModule, LoadGlobalBuiltin with dict version guard - Add version counter to Dict for mutation tracking

…ation - BinaryOpSubscrListInt, BinaryOpSubscrTupleInt, BinaryOpSubscrDict - ContainsOpDict, ContainsOpSet - UnpackSequenceTwoTuple, UnpackSequenceTuple, UnpackSequenceList - StoreAttrInstanceValue with type_version guard - Deoptimize bytecode for marshal serialization (original_bytes) - Separate co_code (deoptimized) from _co_code_adaptive (quickened)

…Isinstance, CallType1 specialization

…al, ForIterGen, CallListAppend specialization

- LoadAttrNondescriptorNoDict: plain class attr on objects without dict - LoadAttrNondescriptorWithValues: plain class attr with dict fallback - LoadAttrClass: handler for type attribute access (not yet routed) - CallMethodDescriptorNoargs: method descriptor with 0 args - CallMethodDescriptorO: method descriptor with 1 arg - CallMethodDescriptorFast: method descriptor with multiple args - Use HAS_DICT flag instead of obj.dict().is_some() for method/nondescriptor routing

- CallBuiltinFast: native function calls with arbitrary positional args - CallNonPyGeneral: fallback for unmatched callables (custom __call__, etc.) - All builtin function calls now specialize (CallBuiltinFast as default) - specialize_call now always produces a specialized instruction

- SendGen: direct coro.send() for generator/coroutine receivers - Add adaptive counter to Send instruction - specialize_send checks builtin_coro for PyGenerator/PyCoroutine

- LoadAttrSlot: direct obj.get_slot(offset) bypassing descriptor protocol - StoreAttrSlot: direct obj.set_slot(offset, value) bypassing descriptor protocol - Detect PyMemberDescriptor with MemberGetter::Offset in specialize_load_attr/store_attr - Cache slot offset in cache_base+3

…ltinFastWithKeywords, CallMethodDescriptorFastWithKeywords specialization

Fix LoadSuperAttrMethod to push unbound descriptor + self instead of bound method + self which caused double self binding. Fix LoadSuperAttrAttr obj_arg condition for classmethod detection.

Remove unnecessary CPython references, FIXME→TODO, redundant Note: prefix, and "Same as" cross-references.

coderabbitai

🧹 Nitpick comments (1)

crates/vm/src/builtins/function.rs (1)

614-628: Minor race condition may waste version numbers.

Two threads calling get_version_for_current_state() simultaneously when func_version == 0 could each allocate a fresh version, with one being discarded. This is benign for correctness (the function still gets a valid version) but wastes version space.

If version exhaustion is a practical concern, consider using compare_exchange to atomically set only if still 0:

♻️ Optional improvement using CAS

     pub fn get_version_for_current_state(&self) -> u32 {
         let v = self.func_version.load(Relaxed);
         if v != 0 {
             return v;
         }
         let new_v = next_func_version();
         if new_v == 0 {
             return 0;
         }
-        self.func_version.store(new_v, Relaxed);
-        new_v
+        match self.func_version.compare_exchange(0, new_v, Relaxed, Relaxed) {
+            Ok(_) => new_v,
+            Err(existing) => existing, // Another thread won; return their version
+        }
     }

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@crates/vm/src/builtins/function.rs` around lines 614 - 628,
get_version_for_current_state currently races when func_version == 0: two
threads can both call next_func_version and one result is discarded, wasting
version numbers; change the allocation path to perform an atomic
compare-and-swap (compare_exchange) on self.func_version so you only install
new_v if the current value is still 0 (if compare_exchange fails, discard new_v
and read the installed value instead); keep the same zero handling (return 0 if
next_func_version() returns 0) and use the same memory ordering (Relaxed) for
the CAS/store to preserve the original semantics while preventing wasted
allocations.

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Nitpick comments:
In `@crates/vm/src/builtins/function.rs`:
- Around line 614-628: get_version_for_current_state currently races when
func_version == 0: two threads can both call next_func_version and one result is
discarded, wasting version numbers; change the allocation path to perform an
atomic compare-and-swap (compare_exchange) on self.func_version so you only
install new_v if the current value is still 0 (if compare_exchange fails,
discard new_v and read the installed value instead); keep the same zero handling
(return 0 if next_func_version() returns 0) and use the same memory ordering
(Relaxed) for the CAS/store to preserve the original semantics while preventing
wasted allocations.

ℹ️ Review info

Configuration used: Path: .coderabbit.yml

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between c9c95c1 and 51accdb.

📒 Files selected for processing (4)

crates/vm/src/builtins/function.rs
crates/vm/src/builtins/range.rs
crates/vm/src/dict_inner.rs
crates/vm/src/frame.rs

🚧 Files skipped from review as they are similar to previous changes (1)

crates/vm/src/builtins/range.rs

youknowone

This PR still has many missing features and caveats. They will be addressed in follow-up PR

- dict version: Relaxed → Acquire/Release ordering - range iterator: deduplicate fast_next/next_fast

youknowone force-pushed the specialization branch 3 times, most recently from 8fc678d to 57cd6b2 Compare March 2, 2026 04:16

youknowone force-pushed the specialization branch from 843794e to 5ac7f95 Compare March 2, 2026 07:43

youknowone marked this pull request as ready for review March 2, 2026 07:53

coderabbitai bot reviewed Mar 2, 2026

View reviewed changes

youknowone marked this pull request as draft March 2, 2026 08:01

youknowone force-pushed the specialization branch 4 times, most recently from 9df2a06 to 0f8da0c Compare March 2, 2026 12:10

youknowone marked this pull request as ready for review March 3, 2026 00:10

coderabbitai bot reviewed Mar 3, 2026

View reviewed changes

youknowone force-pushed the specialization branch from 87bd158 to 5b6f414 Compare March 3, 2026 00:30

coderabbitai bot reviewed Mar 3, 2026

View reviewed changes

crates/vm/src/builtins/range.rs Show resolved Hide resolved

youknowone added a commit to youknowone/RustPython that referenced this pull request Mar 3, 2026

Apply code review feedback from PR RustPython#7301

7b2edb4

- dict version: Relaxed → Acquire/Release ordering - range iterator: deduplicate fast_next/next_fast

youknowone added 11 commits March 3, 2026 18:03

Add debug_assert to invoke_exact_args, lazy func_version reassignment

687e99f

- Add debug_assert preconditions in invoke_exact_args - Add get_version_for_current_state() for lazy version reassignment after func_version invalidation - Document NEXT_TYPE_VERSION overflow policy

working

81d307b

Add STORE_SUBSCR, BinaryOpAddUnicode, ToBoolAlwaysTrue, CallLen, Call…

1c07777

…Isinstance, CallType1 specialization

Add BinaryOpSubscrStrInt, CallStr1, CallTuple1 specialization

240f3ac

Add BinaryOpInplaceAddUnicode specialization

cadb9be

Add LoadAttrModule, CallBuiltinO, CallPyGeneral, CallBoundMethodGener…

fd098fe

…al, ForIterGen, CallListAppend specialization

Add SendGen specialization for generator/coroutine send

d950035

- SendGen: direct coro.send() for generator/coroutine receivers - Add adaptive counter to Send instruction - specialize_send checks builtin_coro for PyGenerator/PyCoroutine

youknowone added 6 commits March 3, 2026 18:03

Add LoadSuperAttrAttr, LoadSuperAttrMethod, CallBuiltinClass, CallBui…

a7c179c

…ltinFastWithKeywords, CallMethodDescriptorFastWithKeywords specialization

Add LoadAttrProperty specialization for property descriptor access

e1289f1

Add LoadAttrClass specialization for class attribute access

2350bc1

Add BinaryOpSubscrListSlice specialization

ba9d528

Add CallKwPy, CallKwBoundMethod, CallKwNonPy specialization

3c88368

Fix LoadSuperAttrMethod to push unbound descriptor + self instead of bound method + self which caused double self binding. Fix LoadSuperAttrAttr obj_arg condition for classmethod detection.

youknowone force-pushed the specialization branch from 5b6f414 to c9c95c1 Compare March 3, 2026 09:04

youknowone added 3 commits March 3, 2026 18:27

Clean up comments in specialization code

ab6bbb6

Remove unnecessary CPython references, FIXME→TODO, redundant Note: prefix, and "Same as" cross-references.

fix check_signals

48fd5c7

fix import

51accdb

youknowone force-pushed the specialization branch from c9c95c1 to 51accdb Compare March 3, 2026 09:27

coderabbitai bot reviewed Mar 3, 2026

View reviewed changes

youknowone commented Mar 3, 2026

View reviewed changes

youknowone merged commit 25619aa into RustPython:main Mar 3, 2026
13 checks passed

youknowone deleted the specialization branch March 3, 2026 10:55

youknowone added a commit to youknowone/RustPython that referenced this pull request Mar 3, 2026

Apply code review feedback from PR RustPython#7301

00584c6

- dict version: Relaxed → Acquire/Release ordering - range iterator: deduplicate fast_next/next_fast

youknowone added a commit to youknowone/RustPython that referenced this pull request Mar 3, 2026

Apply code review feedback from PR RustPython#7301

d6e13eb

- dict version: Relaxed → Acquire/Release ordering - range iterator: deduplicate fast_next/next_fast

youknowone added a commit to youknowone/RustPython that referenced this pull request Mar 3, 2026

Apply code review feedback from PR RustPython#7301

cc74e8b

- dict version: Relaxed → Acquire/Release ordering - range iterator: deduplicate fast_next/next_fast

coderabbitai bot mentioned this pull request Mar 3, 2026

Specialized ops #7322

Merged

Conversation

youknowone commented Mar 1, 2026 • edited by coderabbitai bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary by CodeRabbit

Uh oh!

coderabbitai bot commented Mar 1, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Reviews paused

Walkthrough

Changes

Estimated code review effort

Poem

❌ Failed checks (1 inconclusive)

Uh oh!

github-actions bot commented Mar 2, 2026

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

coderabbitai bot Mar 2, 2026

Choose a reason for hiding this comment

Uh oh!

github-actions bot commented Mar 2, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

📦 Library Dependencies

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

youknowone left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

youknowone commented Mar 1, 2026 •

edited by coderabbitai bot

Loading

coderabbitai bot commented Mar 1, 2026 •

edited

Loading

github-actions bot commented Mar 2, 2026 •

edited

Loading