test(amber-python): add unit tests for evaluate-expression and retry-current-tuple handlers by Yicong-Huang · Pull Request #4520 · apache/texera

Yicong-Huang · 2026-04-26T02:00:16Z

What changes were proposed in this PR?

Adds unit tests for the two debugger-adjacent worker RPC handlers that had no Python coverage:

test_evaluate_expression_handler.py (4 cases) — covers EvaluateExpressionHandler.evaluate_python_expression, which backs the frontend's "watch variable" feature. Verifies the evaluator's return is passed through unchanged, the runtime context exposes the executor as self / current tuple as tuple_ / current port id as input_, the context is read fresh on each call (not snapshot at handler construction), and the handler tolerates None tuple/port (worker before any input has arrived).
test_replay_current_tuple_handler.py (6 cases) — covers RetryCurrentTupleHandler.retry_current_tuple (used by debugger "step over an exception" flows). Verifies it chains the current tuple onto the front of the input iterator, resumes USER_PAUSE + EXCEPTION_PAUSE in order, does not resume DEBUG_PAUSE (so an active debugging session is not silently dropped), no-ops when the worker is COMPLETED, and still chains correctly when the remaining iterator is empty.

No production code is touched. Async handlers are driven via asyncio.run to avoid pulling in pytest-asyncio, matching the pattern from #4510 / #4512.

Any related issues, documentation, discussions?

Closes #4516. Same gap pattern as #4509.

How was this PR tested?

$ python -m pytest core/architecture/handlers/control/test_evaluate_expression_handler.py \
                   core/architecture/handlers/control/test_replay_current_tuple_handler.py -v
======================== 10 passed in 1.14s ========================

ruff format --check . and ruff check . clean locally.

Was this PR authored or co-authored using generative AI tooling?

Generated-by: Claude Code (Opus 4.7)

…current-tuple handlers Two debugger-adjacent worker RPC handlers had no Python coverage: - EvaluateExpressionHandler (frontend "watch variable" feature): 4 cases — return value pass-through, runtime context exposes self/tuple_/input_, context is read fresh on each call (not snapshot at construction), tolerates None tuple/port before any input has arrived. - RetryCurrentTupleHandler (debugger "step over an exception"): 6 cases — chains the current tuple onto the front of the input iterator, resumes USER + EXCEPTION pause types in order, leaves DEBUG_PAUSE alone (so an active session is not silently dropped), no-ops when state is COMPLETED, handles an exhausted remaining iterator. No production code is touched. Async handlers are driven via asyncio.run to avoid pulling in pytest-asyncio. Closes apache#4516

…ndlers Self-review surfaced two gaps: - EvaluateExpressionHandler: when ExpressionEvaluator.evaluate raises (bad syntax, attribute error, etc.), the handler must propagate the exception so the RPC layer can surface it to the frontend, not swallow it. Added test_evaluator_exception_propagates. - RetryCurrentTupleHandler: previously only RUNNING and COMPLETED states were tested. The completion guard is `if not confirm_state( COMPLETED)`, so every other state (PAUSED, READY, UNINITIALIZED) must take the chain+resume path. PAUSED is the most likely real-world entry point — added test_paused_state_still_chains_and_ resumes. 12 cases total now (up from 10). No production code is touched. Refs apache#4516

…gn_ecm (#4526) ### What changes were proposed in this PR? Fixes the recurring CI flake in `core/runnables/test_main_loop.py::TestMainLoop::test_main_loop_thread_can_align_ecm` and tightens the assertion to verify the actual production priority guarantee. **Why it flaked**: the test put a data tuple and an alignment-completing ECM into `input_queue` back-to-back, then assumed `output_queue.get()` would yield the `DataElement` first followed by the NoOperation reply. `output_queue` is a `LinkedBlockingMultiQueue` whose sub-queues are keyed by channel — control channels at priority 1, data channels at priority 2 (`internal_queue.py:80`). MainLoop produces in this order: data → NoOperation reply, but whether the test pops them in that order depends on whether MainLoop has finished both productions by the time the test calls `.get()`: - Fast machine: only data is in the queue → data first → ✅ - Slow CI: both items queued → priority returns the control reply first → ❌ The production semantics are intentional and correct: - Control to coordinator outranks data on egress (cross-channel priority — priority 1 vs 2 sub-queues). - Within a channel sub-queue, FIFO is preserved (so an ECM forwarded on a data channel stays behind the data tuples on that same channel). **Fix**: the test now expresses the priority semantics directly: 1. Wait until both expected channels have their item in `output_queue`'s sub-queues — the data channel to the downstream worker, and the control channel back to "sender". Sub-queues are added lazily on first put, so the wait safely treats a missing key as size zero. 2. With both items queued, assert the priority pop order: `output_queue.get()` first returns the `DCMElement` (NoOperation reply, control sub-queue priority 1), then the `DataElement` (data sub-queue priority 2). 3. Bounded with a 5s timeout so a regression that drops one of the channels fails with a descriptive message instead of hanging. If a future change flips priorities or routes the NoOperation reply to a different channel, the first `get()` now fails fast with "expected control reply first". No production code change. ### Any related issues, documentation, discussions? Closes #4524. Has been hitting unrelated PRs (#4512, #4520). ### How was this PR tested? Ran 30 consecutive iterations locally — 30 PASS, 0 FAIL. `ruff format --check .` and `ruff check .` clean. ### Was this PR authored or co-authored using generative AI tooling? Generated-by: Claude Code (Opus 4.7)

zuozhiw

lgtm

github-actions Bot added engine python labels Apr 26, 2026

Yicong-Huang added 2 commits April 25, 2026 21:30

Merge branch 'main' into test/debugger-adjacent-handlers

42aa3ea

Yicong-Huang self-assigned this Apr 26, 2026

This was referenced Apr 26, 2026

Flaky test_main_loop_thread_can_align_ecm: drain order depends on inter-channel timing #4524

Closed

test(amber-python): per-channel sync in test_main_loop_thread_can_align_ecm #4526

Merged

Merge branch 'main' into test/debugger-adjacent-handlers

c6c629b

Yicong-Huang requested a review from zuozhiw April 26, 2026 07:01

Yicong-Huang enabled auto-merge (squash) April 26, 2026 07:21

Merge branch 'main' into test/debugger-adjacent-handlers

a1b94cb

zuozhiw approved these changes Apr 27, 2026

View reviewed changes

Merge branch 'main' into test/debugger-adjacent-handlers

fe076ab

Yicong-Huang merged commit 871017e into apache:main Apr 27, 2026
11 checks passed

Yicong-Huang mentioned this pull request Apr 28, 2026

ReconfigurationSpec: source-propagation case hangs after UDF processes EndChannel #4545

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

test(amber-python): add unit tests for evaluate-expression and retry-current-tuple handlers#4520

test(amber-python): add unit tests for evaluate-expression and retry-current-tuple handlers#4520
Yicong-Huang merged 6 commits into
apache:mainfrom
Yicong-Huang:test/debugger-adjacent-handlers

Yicong-Huang commented Apr 26, 2026

Uh oh!

zuozhiw left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

Yicong-Huang commented Apr 26, 2026

What changes were proposed in this PR?

Any related issues, documentation, discussions?

How was this PR tested?

Was this PR authored or co-authored using generative AI tooling?

Uh oh!

zuozhiw left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants