feat: resilient error reporting across component/host boundary by gamescoot · Pull Request #30 · KyleKincer/testing

gamescoot · 2026-04-17T20:35:31Z

Summary

Runtime errors no longer crash the test runner. Errors raised by ON ERR CALL during a test are captured, attached to the offending test as a structured runtimeErrors[] entry plus a synthetic failed assertion with isRuntimeError: true, and the remaining tests still run.
Reports are always produced. _generateReport() is invoked from run() (was buried inside _runSuitesSequentially) so the report still renders after global error capture, even when execution is bumpy.
Host/component error boundary solved. A 4D component's ON ERR CALL cannot catch errors raised in host scope. The component now reads $hostStorage.testErrors so a host that installs its own TestErrorHandler / TestGlobalErrorHandler and shares its Storage can route host-side errors into the per-test and global pipelines (runtimeErrors[] per test, globalErrors[] / <system-err> for the rest).
JUnit XML correctness. Precise failures / errors / skipped counts on <testsuites> and <testsuite>; valid ISO 8601 timestamp (was producing …T00:00:00ZT16:47:10); <skipped/> child on every skipped <testcase> so consumers (Jenkins, JUnitXML, GitLab) classify them correctly. External runtime errors live in a <system-err> block instead of inflating errors and driving failures negative.
New CLI flags. outputPath= writes JSON or JUnit reports to disk (sidesteps any debug noise on stdout); callchain=true adds callChain to terse-JSON failures without going full verbose. Both flow through every Makefile target.
Per-test enrichment. Every assertion record now carries line and functionName (qualified Class.method) when the call chain has the test frame; runtime errors are mirrored into runtimeErrors[] with functionName and formula; captured errors include a callChainJSON snapshot taken at the moment the error fired.
Docs. README, docs/guide.md, AGENTS, CLAUDE, and AI_AGENT_TESTING_GUIDE updated for the new flags, refreshed JSON / JUnit samples, the captured-error schema, and the host-integration wiring.

Changes

File	What changed
`.gitignore`	Ignore local dev artifacts (`Components/`, `testing/Logs/`, `testing/test-results/`, `testing/.4DD`, `testing/.Match`, `Makefile.local`, `.DS_Store`)
`Makefile`	Forward `outputPath` / `verbose` / `callchain` to `--user-param`. Reroute every secondary target (`test-junit`, `test-ci`, `test-parallel*`, etc.) through `$(MAKE) test …` so all flags compose consistently.
`testing/Project/Sources/Classes/Testing.4dm`	New `testFunctionName` / `testClassName` properties (initialized in the constructor, reset in `resetForNewTest`) so reports can attribute assertions and errors to a qualified test name.
`testing/Project/Sources/Classes/Assert.4dm`	`_recordAssertion` now records `line` and `functionName` when the call chain contains the test frame; new `_findTestLine($t; $callChain)` helper walks the chain looking for that frame.
`testing/Project/Sources/Classes/_TestFunction.4dm`	After running each test, drains errors from both component `Storage.testErrors` and host `$hostStorage.testErrors` (matched by `processNumber`); each error gets `functionName` / `formula` and a synthetic failed assertion via new `_addRuntimeErrorAsAssertion`; `failureCallChain` populated from `error.callChainJSON`; new `_getHostStorage` helper; `_errorBelongsToProcess` no longer falls back to `True` for unowned errors. Transaction cleanup simplified.
`testing/Project/Sources/Classes/TestRunner.4dm`	`_generateReport()` moved into `run()` so it runs after global error capture; new `callChainOutput` property gates terse-JSON `callChain` (`verbose=true` OR `callchain=true`); preps and drains `hostStorage.testErrors` (init + global drain); JSON terse mode emits per-test `runtimeErrors[]` and honors `outputPath=` via new `_writeJSONToFile`; JUnit fix: accurate `failures` / `errors` / `skipped` counts on `<testsuites>` and `<testsuite>`, `<skipped/>` child for skipped cases, valid ISO 8601 timestamp; shared `_resolveOutputFile` handles relative + absolute (POSIX / Windows) paths; human-mode call chains now gated on `verbose=true`.
`testing/Project/Sources/Methods/TestErrorHandler.4dm`	Captures `Last errors[0].message` for the human-readable `message` field; captures `Get call chain` immediately and stores it as `callChainJSON` on the error record; chain filter only drops the handler frame itself (preserves test-method frames in the `testing` database).
`testing/Project/Sources/Methods/TestGlobalErrorHandler.4dm`	Same enhancements as `TestErrorHandler`.
`testing/Resources/en.lproj/syntaxEN.json`	Auto-regenerated for the new public surface (properties, methods, parameters).
`README.md`, `AGENTS.md`, `CLAUDE.md`, `AI_AGENT_TESTING_GUIDE.md`, `docs/guide.md`	Document the new flags, refreshed JSON / JUnit samples, the captured-error schema, and the host-integration wiring example.

Test plan

Prerequisites

Checkout the symphony branch HRS/SWE-44545-error-handling-test
Copy the built Testing.4dbase component into your symphony Components/ folder

Steps

Baseline — full suite passes across all three output formats
```
make test                                                          # human, exit 0
make test format=json outputPath=test-results/report.json          # JSON to file
make test-ci                                                        # JUnit XML -> test-results/junit.xml
```
All three exit 0. Confirm:
- test-results/report.json is a single valid JSON document containing tests, passed, failed, skipped, plus globalErrorCount and globalErrors — these fields are the host-error pathway showing it's wired (typically 0 / [] when symphony is healthy).
- test-results/junit.xml opens with <testsuites tests="N" failures="0" errors="0" skipped="M" time="…" timestamp="YYYY-MM-DDTHH:MM:SS">. The timestamp is a valid ISO 8601 (no Z, no doubled T…). Each skipped <testcase> contains a <skipped/> child. A <system-err> block is present (empty when no host errors fired, populated when they did).
Per-test runtime error — captured, reported, non-fatal
In UpdateSOLocationTest.test_empty_updates_collection, add ALERT($result.length) as the first line of the body (forces [8] Alphanumeric argument expected), then:
```
make test format=json test=UpdateSOLocationTest outputPath=/tmp/r.json
```
In /tmp/r.json:
- "tests": 13, "passed": 12, "failed": 1, "status": "fail" — the runner did not crash, the report was still written, and the other 12 tests still ran.
- The failed test's assertions[] contains a synthetic entry with isRuntimeError: true, actual: "[8] An Alphanumeric argument was expected.", line: 3, functionName: "UpdateSOLocationTest.test_empty_updates_collection".
- The failed test's runtimeErrors[] contains code: 8, message: "An Alphanumeric argument was expected.", formula: "ALERT($result.length)".
callchain=true enriches terse JSON without going full verbose
With the broken test still in place:
```
make test format=json test=UpdateSOLocationTest callchain=true outputPath=/tmp/r-cc.json
```
In /tmp/r-cc.json, the failures[] entry for test_empty_updates_collection now carries a callChain collection containing the test-method frame (and surrounding stack). Re-running without callchain=true confirms it disappears — terse stays terse by default.
Host-side capture proves the component/host boundary (optional but the headline feature)
Move the ALERT out of the test method into a tiny host helper:
```
// _HostErrorHelper.4dm  (host project)
var $obj : Object
ALERT($obj.length)
```
Call _HostErrorHelper() from the test in place of the inline ALERT. Re-run the step 2 command. The runtime error still surfaces on the test with the same runtimeErrors[] shape — but this time the error fired in host scope, where the component's own ON ERR CALL cannot reach. It only reaches the report because the host's TestErrorHandler pushed it onto Storage.testErrors, which the component drained via $hostStorage.testErrors.
Cleanup — Revert the ALERT change(s) and delete _HostErrorHelper if you added it. make test returns all green.

Made with Cursor

coderabbitai · 2026-04-17T20:35:46Z

📝 Walkthrough

Walkthrough

Adds per-assertion source location and function metadata, extends runtime/global error capture (including call-chain and message), records test function/class names in context, refactors report generation (file resolution, JSON/JUnit output control), and exposes a terse JSON call-chain flag; plus Makefile, docs, and .gitignore updates.

Changes

Testing runtime, error capture, and reporting

Layer / File(s)	Summary
Data Shape `testing/Project/Sources/Classes/Testing.4dm`	Adds public `testFunctionName : Text` and `testClassName : Text`, initialized and reset per test.
Error Record Shape `testing/Project/Sources/Methods/TestErrorHandler.4dm`, `testing/Project/Sources/Methods/TestGlobalErrorHandler.4dm`	Stored error objects gain `"message"`; global handler also serializes a filtered `callChainJSON`.
Test Execution & Process Error Collection `testing/Project/Sources/Classes/_TestFunction.4dm`	`run()` records `t.testFunctionName`/`t.testClassName`; collects process-scoped errors (including from `hostStorage.testErrors`), qualifies errors with `functionName`/`formula`, records them as assertions via new `_addRuntimeErrorAsAssertion()`, sets `t.failed` if runtime errors exist, and unconditionally cancels started transactions. Adds `_getHostStorage()` and tightens `_errorBelongsToProcess()`.
Assertion Recording `testing/Project/Sources/Classes/Assert.4dm`	`_recordAssertion` computes source `line` via new `_findTestLine()` (searching call chain) and derives `functionName` from `t.testClassName/t.testFunctionName`; conditionally attaches them to assertion records.
Error Handling Storage Prep / Drain `testing/Project/Sources/Classes/TestRunner.4dm`, `testing/Project/Sources/Classes/_TestFunction.4dm`	Prepare/drain logic extended to also initialize/clear and drain `hostStorage.testErrors` alongside `Storage:C1525.testErrors`.
Report Generation & Output `testing/Project/Sources/Classes/TestRunner.4dm`	Adds `callChainOutput : Boolean`; moves final report generation into `run()` with debug logs; refactors JSON/JUnit file writing via `_resolveOutputFile()` and `_writeJSONToFile()`; terse JSON now includes per-test `runtimeErrors` and gates call-chain inclusion behind `callChainOutput`; JUnit counts exclude external/global runtime errors from `@errors` and include skipped tests as `<skipped/>`.
CLI / Make Integration `Makefile`	`USER_PARAMS` now optionally includes `outputPath`, `verbose`, and `callchain`; several test targets delegate to `$(MAKE) test` so they inherit `USER_PARAMS`.
Docs & Ignored Files `testing/Resources/en.lproj/syntaxEN.json`, `.gitignore`	Docs updated to document new properties (`testClassName`, `testFunctionName`, `callChainOutput`) and new `areDeepEqual(..., maxDepth)` overload; `.gitignore` extended for local/IDE/test artifacts.

Sequence Diagram(s)

sequenceDiagram
    participant Runner as TestRunner
    participant TF as _TestFunction
    participant Testing as Testing (context)
    participant Host as hostStorage
    participant Storage as Storage:C1525
    participant Assert as Assert._recordAssertion
    Runner->>TF: start run() (sets t.testFunctionName/testClassName)
    TF->>Host: _collectProcessErrors() (drain hostStorage.testErrors)
    TF->>Storage: _collectProcessErrors() (drain Storage:C1525.testErrors)
    TF->>TF: enrich errors (functionName, formula) and callChainJSON
    TF->>Assert: _addRuntimeErrorAsAssertion() (append assertions with line/function)
    TF->>Runner: return test result
    Runner->>Runner: _captureGlobalErrors() (read host/storage testErrors)
    Runner->>Runner: _generateReport() (resolve files, include runtimeErrors, gate call chains)
    Runner->>FileSystem: _resolveOutputFile() / write outputs

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~45 minutes

Possibly related PRs

KyleKincer/testing#15: Modifies Assert._recordAssertion to capture per-assertion location/call-chain metadata (direct overlap).
KyleKincer/testing#18: Touches global error-handling and testErrors collection/draining logic (overlapping error storage changes).
KyleKincer/testing#5: Related TestRunner and JUnit/Makefile reporting changes (overlapping output & CI target behavior).

Suggested labels

codex

Poem

🐰 Hopped through stacks and traced each line,

function names stitched to every sign.
Errors gathered, call chains shown,
Reports now write to files well-known —
A rabbit’s cheer for tests refined.

🚥 Pre-merge checks | ✅ 4 | ❌ 1

❌ Failed checks (1 warning)

Check name	Status	Explanation	Resolution
Docstring Coverage	⚠️ Warning	Docstring coverage is 0.00% which is insufficient. The required threshold is 80.00%.	Write docstrings for the functions missing them to satisfy the coverage threshold.

✅ Passed checks (4 passed)

Check name	Status	Explanation
Title check	✅ Passed	The title accurately and clearly summarizes the main objective: making error reporting resilient across component/host boundaries, which is the central theme of the changes across all modified files.
Linked Issues check	✅ Passed	Check skipped because no linked issues were found for this pull request.
Out of Scope Changes check	✅ Passed	Check skipped because no linked issues were found for this pull request.
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing Touches

🧪 Generate unit tests (beta)

Create PR with unit tests

Tip

💬 Introducing Slack Agent: The best way for teams to turn conversations into code.

Slack Agent is built on CodeRabbit's deep understanding of your code, so your team can collaborate across the entire SDLC without losing context.

Generate code and open pull requests
Plan features and break down work
Investigate incidents and troubleshoot customer tickets together
Automate recurring tasks and respond to alerts with triggers
Summarize progress and report instantly

Built for teams:

Shared memory across your entire org—no repeating context
Per-thread sandboxes to safely plan and execute work
Governance built-in—scoped access, auditability, and budget controls

One agent for your entire SDLC. Right inside Slack.

👉 Get started

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

coderabbitai

Actionable comments posted: 2

🧹 Nitpick comments (3)

testing/Project/Sources/Classes/Testing.4dm (1)
65-71: Consider resetting testFunctionName / testClassName here too.

For consistency, resetForNewTest() should clear the new context fields alongside the other per-test state. Today it works because _TestFunction.run() re-assigns them right after calling this, but if resetForNewTest is ever invoked in another path (or by a caller that forgets to reassign), stale names will leak into assertions recorded via _recordAssertion / _findTestLine.
♻️ Proposed tweak
 Function resetForNewTest()
 	This:C1470.failed:=False:C215
 	This:C1470.done:=False:C215
         This:C1470.logMessages:=[]
         This:C1470.assertions:=[]
         This:C1470.stats.resetStatistics()
         This:C1470.failureCallChain:=Null
+        This:C1470.testFunctionName:=""
+        This:C1470.testClassName:=""
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@testing/Project/Sources/Classes/Testing.4dm` around lines 65 - 71,
resetForNewTest currently clears per-test state but does not clear the context
fields testFunctionName and testClassName, which can leak into
_recordAssertion/_findTestLine if resetForNewTest is called outside
_TestFunction.run; update Function resetForNewTest() to explicitly reset
This.testFunctionName and This.testClassName (e.g., set to empty string or Null
consistent with your codebase) alongside This.failed, This.done,
This.logMessages, This.assertions, This.stats.resetStatistics(), and
This.failureCallChain so the test context is always cleared.
testing/Project/Sources/Classes/TestRunner.4dm (1)
566-571: Nit: indentation inconsistency.

The new If ($failedTest.callChain#Null:C1517) block is indented one level shallower than the surrounding If ($failedTest.runtimeErrors...) block, while its body $terseFailure.callChain:=... is indented two levels deeper than its If. The End if at line 570 and subsequent $failedTests.push at line 571 are also off relative to each other. Functionally fine (4D doesn't care), but it obscures the structure on review.
♻️ Align indentation
-			// Include call chain for failed tests
-			If ($failedTest.callChain#Null:C1517)
-					$terseFailure.callChain:=$failedTest.callChain
-				End if 
-				$failedTests.push($terseFailure)
+				// Include call chain for failed tests
+				If ($failedTest.callChain#Null:C1517)
+					$terseFailure.callChain:=$failedTest.callChain
+				End if 
+				$failedTests.push($terseFailure)
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@testing/Project/Sources/Classes/TestRunner.4dm` around lines 566 - 571,
Reformat the indentation for the callChain conditional to match the surrounding
runtimeErrors block: align the "If ($failedTest.callChain#Null:C1517)" with the
enclosing "If ($failedTest.runtimeErrors...)" line, indent the body
"$terseFailure.callChain:=$failedTest.callChain" one level further than that If,
place the corresponding "End if" directly under the If (same indent), and ensure
"$failedTests.push($terseFailure)" aligns with the enclosing block's statements;
adjust spacing around these symbols so the If/End if pair and the push call
visually reflect the control structure.
testing/Project/Sources/Classes/Assert.4dm (1)
275-293: Current implementation is correct; defensive matching is optional.

Web documentation for 4D's Get call chain confirms that class function frames report name as the bare function name (e.g., "testExample"), not qualified with class name. The current code matches this correctly: $frame.name against $t.testFunctionName (bare). Tests confirm the feature works—line numbers are being captured and included in assertion output.

However, the qualified name variable ($functionName) is built in _recordAssertion but only used in the output object; it's never used for matching in _findTestLine. The proposed defensive matching (checking both bare and qualified names) is reasonable for robustness but unnecessary based on current evidence. If you want to add it, the suggested diff is sound and would do no harm.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@testing/Project/Sources/Classes/Assert.4dm` around lines 275 - 293, The
current _findTestLine function only matches bare names ($frame.name against
$t.testFunctionName) which is correct but to be defensive update _findTestLine
to also accept a qualified name: construct the qualified function name exactly
as done in _recordAssertion (the same format used to populate $functionName)
and, inside the For each ($frame; $chain) loop, check both
($frame.name=$t.testFunctionName) OR ($frame.name=qualifiedName) before
returning $frame.line; ensure you reference $t.testFunctionName and the same
qualified-name construction used in _recordAssertion so matching covers both
bare and class-qualified frames.

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Inline comments:
In `@testing/Project/Sources/Classes/_TestFunction.4dm`:
- Around line 474-483: The function _errorBelongsToProcess currently returns
True when an error object lacks a processNumber, which can misattribute errors;
update _errorBelongsToProcess so it only returns True when $error.processNumber
is present and equals $processNumber, and return False when $error.processNumber
is Null (i.e., change the default return to False and ensure the conditional
using $error.processNumber#Null checks equality before returning True), keeping
the existing Null check for $error itself intact.
- Around line 68-86: The runtime error collection currently pushes every entry
from This:C1470._collectProcessErrors($processNumber) into
This:C1470.runtimeErrors and calls This:C1470._addRuntimeErrorAsAssertion for
each, causing cascaded errors to inflate assertions; fix by de-duplicating
processErrors before pushing: iterate the array returned by
This:C1470._collectProcessErrors($processNumber), compute a unique key for each
error (e.g., error.code + "|" + error.method + "|" + error.line or similar
fields present), keep only the first error per key, and then push those uniques
into This:C1470.runtimeErrors and call This:C1470._addRuntimeErrorAsAssertion
once per unique; ensure This:C1470.t.failed is set true only once for the test;
alternatively, if you prefer immediate bail-out, short-circuit the handler by
checking Storage.testErrors for this process and set a bail flag the test body
inspects to stop execution — implement one of these two approaches in the block
that processes $processErrors (referencing This:C1470._collectProcessErrors,
This:C1470.runtimeErrors, This:C1470._addRuntimeErrorAsAssertion, and
This:C1470.t.failed).

---

Nitpick comments:
In `@testing/Project/Sources/Classes/Assert.4dm`:
- Around line 275-293: The current _findTestLine function only matches bare
names ($frame.name against $t.testFunctionName) which is correct but to be
defensive update _findTestLine to also accept a qualified name: construct the
qualified function name exactly as done in _recordAssertion (the same format
used to populate $functionName) and, inside the For each ($frame; $chain) loop,
check both ($frame.name=$t.testFunctionName) OR ($frame.name=qualifiedName)
before returning $frame.line; ensure you reference $t.testFunctionName and the
same qualified-name construction used in _recordAssertion so matching covers
both bare and class-qualified frames.

In `@testing/Project/Sources/Classes/Testing.4dm`:
- Around line 65-71: resetForNewTest currently clears per-test state but does
not clear the context fields testFunctionName and testClassName, which can leak
into _recordAssertion/_findTestLine if resetForNewTest is called outside
_TestFunction.run; update Function resetForNewTest() to explicitly reset
This.testFunctionName and This.testClassName (e.g., set to empty string or Null
consistent with your codebase) alongside This.failed, This.done,
This.logMessages, This.assertions, This.stats.resetStatistics(), and
This.failureCallChain so the test context is always cleared.

In `@testing/Project/Sources/Classes/TestRunner.4dm`:
- Around line 566-571: Reformat the indentation for the callChain conditional to
match the surrounding runtimeErrors block: align the "If
($failedTest.callChain#Null:C1517)" with the enclosing "If
($failedTest.runtimeErrors...)" line, indent the body
"$terseFailure.callChain:=$failedTest.callChain" one level further than that If,
place the corresponding "End if" directly under the If (same indent), and ensure
"$failedTests.push($terseFailure)" aligns with the enclosing block's statements;
adjust spacing around these symbols so the If/End if pair and the push call
visually reflect the control structure.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: dbc3b76f-d26e-4f55-9de3-f293e1845018

📥 Commits

Reviewing files that changed from the base of the PR and between b4433d9 and 17b1a3b.

📒 Files selected for processing (6)

testing/Project/Sources/Classes/Assert.4dm
testing/Project/Sources/Classes/TestRunner.4dm
testing/Project/Sources/Classes/Testing.4dm
testing/Project/Sources/Classes/_TestFunction.4dm
testing/Project/Sources/Methods/TestErrorHandler.4dm
testing/Project/Sources/Methods/TestGlobalErrorHandler.4dm

coderabbitai · 2026-04-17T20:42:20Z

+	This:C1470.function.apply(This:C1470.classInstance; [This:C1470.t])
+
+	// Collect errors captured by ON ERR CALL handler in Storage.testErrors
+	var $processErrors : Collection
+	$processErrors:=This:C1470._collectProcessErrors($processNumber)
+
+	If ($processErrors.length>0)
+		var $qualifiedName : Text
+		$qualifiedName:=This:C1470.class.name+"."+This:C1470.functionName
+
+		var $error : Object
+		For each ($error; $processErrors)
+			$error.functionName:=$qualifiedName
+			$error.formula:=$error.method || ""
+			This:C1470.runtimeErrors.push($error)
+			This:C1470._addRuntimeErrorAsAssertion($error)
+		End for each
+		This:C1470.t.failed:=True:C214
+	End if


⚠️ Potential issue | 🟡 Minor

Cascading errors after the first runtime error will each become a failed assertion.

Because ON ERR CALL is non-fatal, execution continues past the failing line. If the first runtime error (e.g., ALERT($result.length)) leaves a variable in an unexpected state, subsequent lines can produce additional runtime errors that are all captured in processErrors, and each gets pushed both into runtimeErrors and as a separate isRuntimeError:true assertion (lines 82-83). Reports may show N failed assertions for what is really one root cause, inflating assertionCount and noise in the JSON output.

Consider either:

stopping execution of the test body on first error (e.g., check Storage.testErrors for this process inside the handler and set a flag the test can bail on), or

de-duplicating processErrors by (code, method, line) before turning each into an assertion.

Not blocking — the current behavior matches the PR goal of "continue executing remaining tests" at the runner level — but worth documenting the per-test cascade semantics.

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In `@testing/Project/Sources/Classes/_TestFunction.4dm` around lines 68 - 86, The runtime error collection currently pushes every entry from This:C1470._collectProcessErrors($processNumber) into This:C1470.runtimeErrors and calls This:C1470._addRuntimeErrorAsAssertion for each, causing cascaded errors to inflate assertions; fix by de-duplicating processErrors before pushing: iterate the array returned by This:C1470._collectProcessErrors($processNumber), compute a unique key for each error (e.g., error.code + "|" + error.method + "|" + error.line or similar fields present), keep only the first error per key, and then push those uniques into This:C1470.runtimeErrors and call This:C1470._addRuntimeErrorAsAssertion once per unique; ensure This:C1470.t.failed is set true only once for the test; alternatively, if you prefer immediate bail-out, short-circuit the handler by checking Storage.testErrors for this process and set a bail flag the test body inspects to stop execution — implement one of these two approaches in the block that processes $processErrors (referencing This:C1470._collectProcessErrors, This:C1470.runtimeErrors, This:C1470._addRuntimeErrorAsAssertion, and This:C1470.t.failed).

I believe this is good behavior as we want it. We definitely do not want to stop execution (that's the problem we're trying to solve), and if there are multiple errors on a single line, we'd want all of them.

@FlutesForHire, understood — that's a reasonable design choice. Capturing every runtime error as its own assertion gives full fidelity into what went wrong per test, and stopping execution would defeat the purpose of the non-fatal error handling this PR introduces. I'll keep this in mind for future reviews.

✏️ Learnings added

Learnt from: FlutesForHire Repo: KyleKincer/testing PR: 30 File: testing/Project/Sources/Classes/_TestFunction.4dm:68-86 Timestamp: 2026-04-23T17:20:42.844Z Learning: In `testing/Project/Sources/Classes/_TestFunction.4dm`, capturing all runtime errors (including cascading ones) as individual failed assertions is intentional. Each error from `_collectProcessErrors` is pushed into `runtimeErrors` and recorded as a separate `isRuntimeError:true` assertion. Stopping execution on the first error is explicitly not desired — the goal is to continue executing remaining tests while reporting every error. Multiple errors on a single line should all be reported.

Note: Learnings are effective only in the context of similar code segments. To apply general coding guidelines, please configure review instructions. You can manage existing learnings in the app.

🧠 Learnings used

Learnt from: CR Repo: KyleKincer/testing PR: 0 File: AGENTS.md:0-0 Timestamp: 2025-12-30T19:27:58.388Z Learning: Applies to **/*Test.4dm : Use the Testing context parameter ($t : cs:Testing) to access transaction control methods (startTransaction, validateTransaction, cancelTransaction, inTransaction) when manual transaction control is needed

Learnt from: CR Repo: KyleKincer/testing PR: 0 File: AGENTS.md:0-0 Timestamp: 2025-12-30T19:27:58.388Z Learning: Applies to **/*Test.4dm : Implement test assertion patterns using the Assert library methods for validating test conditions in 4D test functions

Learnt from: CR Repo: KyleKincer/testing PR: 0 File: CLAUDE.md:0-0 Timestamp: 2025-12-30T19:27:45.415Z Learning: Applies to **/*Test.4dm : Use manual transaction control methods (`startTransaction()`, `validateTransaction()`, `cancelTransaction()`, `withTransaction()`, `withTransactionValidate()`) when `// `#transaction`: false` is set

Learnt from: CR Repo: KyleKincer/testing PR: 0 File: AGENTS.md:0-0 Timestamp: 2025-12-30T19:27:58.388Z Learning: Applies to **/*Test.4dm : Leverage automatic transaction management for database isolation by allowing tests to run in automatic transactions that are rolled back after completion

Learnt from: CR Repo: KyleKincer/testing PR: 0 File: CLAUDE.md:0-0 Timestamp: 2025-12-30T19:27:45.415Z Learning: Applies to **/*Test.4dm : Disable automatic transaction management per-test using `// `#transaction`: false` annotation

Learnt from: CR Repo: KyleKincer/testing PR: 0 File: AGENTS.md:0-0 Timestamp: 2025-12-30T19:27:58.388Z Learning: Applies to **/*Test.4dm : Use `// `#transaction`: false` annotation to disable automatic transaction management for tests requiring manual transaction control

Learnt from: CR Repo: KyleKincer/testing PR: 0 File: CLAUDE.md:0-0 Timestamp: 2025-12-30T19:27:45.415Z Learning: Applies to **/*.4dm : Pass host project's Storage object to Testing framework when running tests from a host project using `Testing_RunTestsWithCs(cs; $hostStorage; $userParams)` method

FlutesForHire · 2026-04-23T17:44:12Z

-				// Include call chain in verbose JSON output
-				If (This:C1470.verboseOutput) && ($failedTest.callChain#Null:C1517)
+			// Include call chain for failed tests
+			If ($failedTest.callChain#Null:C1517)


Do we really want to remove the verboseOutput guard clause here?

Also, nitpick: The If on line 568 is at the wrong indention level.

coderabbitai

Actionable comments posted: 4

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Inline comments:
In `@Makefile`:
- Line 38: The test-junit and test-parallel targets still invoke tool4d directly
and thus ignore the new USER_PARAMS variable; update the recipes for test-junit
and test-parallel so they invoke the test target via $(MAKE) and forward
USER_PARAMS (e.g. change their recipe from calling tool4d ... to something like:
$(MAKE) test $(USER_PARAMS)) so that USER_PARAMS is honored consistently by
test, test-junit and test-parallel; reference the USER_PARAMS variable and the
test, test-junit, and test-parallel targets when making the change.

In `@scripts/test-sandbox.sh`:
- Around line 63-71: The current rewrite injects an unescaped absolute path into
USER_PARAM (via RAW_PATH -> ABS_PATH), which breaks TestRunner._parseParamString
because it splits on spaces; instead preserve round-trippable tokens by escaping
or encoding spaces before substitution: when building ABS_PATH, transform spaces
into an escaped form (e.g., replace " " with "\ " or URL-encode as "%20") and
then perform
USER_PARAM="${USER_PARAM/outputPath=$RAW_PATH/outputPath=$ESCAPED_ABS_PATH}";
alternatively, don't replace the token with an absolute path and leave
outputPath=$RAW_PATH unchanged so the parser receives the original token. Ensure
you update the substitution using the ESCAPED_ABS_PATH variable and keep
references to USER_PARAM, RAW_PATH and ABS_PATH consistent.

In `@testing/Project/Sources/Classes/TestRunner.4dm`:
- Around line 31-37: The unconditional LOG EVENT calls that write "[DEBUG] ..."
to standard output (see the LOG EVENT(... Into system standard outputs ...)
lines around calls to This:C1470._runInternal(), _captureGlobalErrors(), and
_generateReport()) corrupt JSON when format=json and no outputPath is set;
modify the run path so those debug LOG EVENTs are either skipped when the runner
is configured for JSON stdout (e.g., check the runner's format/outputPath flags)
or redirected to a different sink (stderr or a logger that doesn't use standard
outputs) before emitting the JSON body to stdout.

In `@testing/Project/Sources/Methods/TestErrorHandler.4dm`:
- Around line 38-50: The current filter in TestErrorHandler removes any call
chain entry from database "testing", which also drops user test frames; update
the filter logic in the loop over $rawChain (variables $rawChain,
$filteredChain, $entry) to stop excluding by ($entry.database#"testing") and
instead only skip explicit framework/internal frames — e.g., check and skip when
$entry.name matches known internal handler names like "TestErrorHandler" or when
an entry flag/field indicates an internal/framework module (e.g.,
$entry.isInternal or $entry.module == "framework"), then push all other entries
and continue to produce $callChainJSON via JSON Stringify:C1217($filteredChain).

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: 6e06173a-ccfb-43c8-9be4-3e9c0c85c6a4

📥 Commits

Reviewing files that changed from the base of the PR and between 17b1a3b and e815782.

📒 Files selected for processing (9)

.gitignore
Makefile
scripts/test-sandbox.sh
testing/Project/Sources/Classes/TestRunner.4dm
testing/Project/Sources/Classes/Testing.4dm
testing/Project/Sources/Classes/_TestFunction.4dm
testing/Project/Sources/Methods/TestErrorHandler.4dm
testing/Project/Sources/Methods/TestGlobalErrorHandler.4dm
testing/Resources/en.lproj/syntaxEN.json

✅ Files skipped from review due to trivial changes (1)

.gitignore

🚧 Files skipped from review as they are similar to previous changes (1)

testing/Project/Sources/Classes/Testing.4dm

coderabbitai · 2026-05-04T16:36:25Z

+	LOG EVENT:C667(Into system standard outputs:K38:9; "[DEBUG] run: before _runInternal\r\n"; Information message:K38:1)
 	This:C1470._runInternal()
+	LOG EVENT:C667(Into system standard outputs:K38:9; "[DEBUG] run: after _runInternal\r\n"; Information message:K38:1)
 	This:C1470._captureGlobalErrors()
+	LOG EVENT:C667(Into system standard outputs:K38:9; "[DEBUG] run: after _captureGlobalErrors\r\n"; Information message:K38:1)
+	This:C1470._generateReport()
+	LOG EVENT:C667(Into system standard outputs:K38:9; "[DEBUG] run: after _generateReport\r\n"; Information message:K38:1)


⚠️ Potential issue | 🟠 Major | ⚡ Quick win

Unconditional debug logging corrupts stdout JSON output.

These LOG EVENT(... Into system standard outputs ...) calls write to the same stream as the JSON body. With format=json and no outputPath, the new [DEBUG] ... lines are emitted before/around the JSON string, so consumers no longer get valid JSON. That breaks the PR’s “always produce JSON output” guarantee. Gate them off for JSON stdout, or send them to a different sink.

Also applies to: 420-433, 497-619

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In `@testing/Project/Sources/Classes/TestRunner.4dm` around lines 31 - 37, The unconditional LOG EVENT calls that write "[DEBUG] ..." to standard output (see the LOG EVENT(... Into system standard outputs ...) lines around calls to This:C1470._runInternal(), _captureGlobalErrors(), and _generateReport()) corrupt JSON when format=json and no outputPath is set; modify the run path so those debug LOG EVENTs are either skipped when the runner is configured for JSON stdout (e.g., check the runner's format/outputPath flags) or redirected to a different sink (stderr or a logger that doesn't use standard outputs) before emitting the JSON body to stdout.

…ains - Route test-junit, test-ci, junit tag variants, and parallel targets through `make test` so outputPath/verbose/callchain USER_PARAMS apply (CodeRabbit). - Remove debug LOG EVENT lines that broke JSON-on-stdout consumers. - Error handlers: stop filtering every testing-database stack frame; only drop the handler method so real test frames remain (CodeRabbit / host chains). - Terse JSON failures: callChain only when callchain=true; verbose no longer implies callChain (explicit flag; addresses FlutesForHire). - Drop scripts/test-sandbox.sh from this branch — keep PR scoped; parallel sandbox lives on wip/parked-local-work-20260504. Co-authored-by: Cursor <cursoragent@cursor.com>

When you build the testing project as a component or run tests locally, tool4d drops files we don't want in the repo. Add the following entries to `.gitignore` so they stop showing up in `git status`: - Components/ (the compiled .4dbase output) - testing/Logs/ (4D runtime logs) - testing/test-results/ (JUnit XML and other test output) - testing/*.4DD, *.Match (datafiles created on first run) - .DS_Store (macOS Finder metadata)

…SWE-44545) Why --- When a test running through the testing component hits a runtime error (e.g. dereferencing a null object), the component's own `ON ERR CALL` handler can only see code that runs inside the testing database. Calls into the *host* project — where the test method actually lives — are invisible. That makes runtime failures hard to diagnose from the test report: you see a test failed with "An Alphanumeric argument was expected", but not where in your code it happened. What ---- Add an opt-in path that lets the host project install its own error handler so the full call stack (host + component code) ends up attached to the failing test. 1. New host-side error handlers `Methods/TestErrorHandler.4dm` and `Methods/TestGlobalErrorHandler.4dm` are intended to be installed by the host project via `ON ERR CALL`. When a runtime error fires, they: - walk `Last errors` and record one call-chain entry per call, - drop only the handler's own entry (so real test methods that happen to live in the testing database stay in the chain), - push the resulting error (with its call chain) onto `Storage.testErrors`. The host wires them up before invoking the runner, so the runner doesn't need to know anything about them — it just looks for pre-captured errors in shared Storage. 2. Test runner picks them up `TestRunner` / `_TestFunction` now drain `Storage.testErrors` (and the host's `Storage.testErrors` when running as a component) at the end of each test. For each captured error they: - record it as a `runtimeErrors` entry on the test result, - synthesize a failed assertion ("Runtime error: …") so the test is reported as failed in every output format, - surface the captured call chain on the test result. 3. New user-params for choosing what JSON includes callchain=true include `callChain` on terse JSON failures (verbose JSON has always included it; this makes it available without going full verbose) outputPath=<f> write JSON results to <f> instead of stdout. Stdout intentionally still contains [DEBUG] framing for human-debugged runs, so use `outputPath=` when you need clean parseable JSON. 4. Makefile cleanup The Makefile now threads `format`, `tags`, `test`, `requireTags`, `parallel`, `maxWorkers`, `outputPath`, `verbose`, and `callchain` through `--user-param`, and routes every convenience target (`test-junit`, `test-ci`, `test-parallel*`, …) through the main `test` target. So whichever entry point you use, the same parameter handling applies. 5. `Resources/en.lproj/syntaxEN.json` regenerated to match the new public symbols. Co-authored-by: Cursor <cursoragent@cursor.com>

@errors

The JUnit XML report had three problems that made CI consumers (Jenkins, GitHub/GitLab test reports, etc.) display nonsense. 1. <testsuites> failures and errors were wrong, sometimes negative ---------------------------------------------------------------- Real example output for a run with 24 actual failures: <testsuites tests="1185" failures="-880" errors="904" …> `_buildJUnitXML` was adding `globalErrorCount` (the count of external, non-test runtime errors that fire outside any test) into the @errors attribute, then computing @failures = results.failed − @errors. When external errors outnumbered failed tests, @failures went negative. External runtime errors are already reported in `<system-err>` and should not contribute to @errors at all. Stop mixing them in: @errors now counts only failed *tests* that had a runtime error, and @failures = results.failed − @errors is correct again. 2. @timestamp was malformed ------------------------ Real example: timestamp="2026-05-04T04:00:00ZT14:29:40". `String(Current date; ISO date GMT)` already returns a full "YYYY-MM-DDT00:00:00Z" string, but the code was appending "T" + Current time on top of it, producing the doubled "ZT…". Fix: take just the YYYY-MM-DD prefix from that call and append the local time, producing plain ISO 8601 "YYYY-MM-DDTHH:MM:SS". The trailing Z is dropped because `Current date` / `Current time` are local values, not UTC. 3. Skipped tests were invisible to JUnit consumers ------------------------------------------------ Skipped testcases were emitted as bare self-closing `<testcase … />` elements, indistinguishable from passing tests. Fix: emit a `<skipped/>` child on those `<testcase>`s and add a matching @Skipped attribute to `<testsuite>` and `<testsuites>` so counts roll up correctly. Verified on the testing project's own suite (208 / 207 / 0 / 1) and Symphony's full suite (1182 / 1143 / 22 / 17): both now report the expected totals, plain ISO timestamps, and skipped <testcase>s with <skipped/> children. Existing call-stack output inside <failure> / <error> CDATA is unchanged.

Update README, AGENTS, CLAUDE, AI_AGENT_TESTING_GUIDE, and docs/guide for the PR's surface changes: - Document the new --user-param flags: outputPath (JSON + JUnit file output) and callchain (terse-JSON callChain on failed tests). - Refresh JSON output samples to the actual current shape (tests/passed/failed/ skipped/duration/rate/status, globalErrors/globalErrorCount, per-test runtimeErrors[], failures[].callChain when verbose or callchain=true). - Update JUnit XML samples to show skipped="N" attributes, the <skipped/> child element, and the <system-err> block; explain the corrected counting rules (failures = assertion failures, errors = test runtime errors, externals isolated to <system-err>). - Add a Runtime Error Capture section explaining per-test vs global capture, the captured record schema, and where each field shows up in each output format. - Add a Host Project Integration section with a worked example of the host's RunTests entry point, TestErrorHandler / TestGlobalErrorHandler methods, the shared Storage.testErrors collection, and the preemptive-capable handler requirement. Co-authored-by: Cursor <cursoragent@cursor.com>

coderabbitai

Actionable comments posted: 2

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@testing/Project/Sources/Classes/_TestFunction.4dm`:
- Around line 393-407: _addRuntimeErrorAsAssertion currently always injects
"line"; $error.line into the assertion object causing a null "line" key; change
it to mirror _recordAssertion in Assert.4dm by only adding the "line" property
when $error.line is present and > 0 (i.e., conditionally include the "line"
entry instead of unconditionally setting "line"; $error.line), so runtime-error
assertions omit the key when no valid line number exists.

In `@testing/Project/Sources/Classes/Assert.4dm`:
- Around line 275-293: _findTestLine currently compares $frame.name only to
$t.testFunctionName, which misses class-qualified names like
"ClassName.FunctionName"; update the match in Function _findTestLine to also
check the class-qualified form by constructing the qualified name using
$t.testClassName (when not Null/empty) and compare $frame.name to either
$t.testFunctionName or ($t.testClassName + "." + $t.testFunctionName), ensuring
you guard for Null/empty $t.testClassName so you don't build an invalid string.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: fb40df65-d24a-4713-b1d2-aef3f8b6f4d1

📥 Commits

Reviewing files that changed from the base of the PR and between 35ad35f and be2c7cc.

📒 Files selected for processing (9)

.gitignore
Makefile
testing/Project/Sources/Classes/Assert.4dm
testing/Project/Sources/Classes/TestRunner.4dm
testing/Project/Sources/Classes/Testing.4dm
testing/Project/Sources/Classes/_TestFunction.4dm
testing/Project/Sources/Methods/TestErrorHandler.4dm
testing/Project/Sources/Methods/TestGlobalErrorHandler.4dm
testing/Resources/en.lproj/syntaxEN.json

✅ Files skipped from review due to trivial changes (3)

.gitignore
testing/Project/Sources/Classes/Testing.4dm
Makefile

🚧 Files skipped from review as they are similar to previous changes (3)

testing/Project/Sources/Methods/TestErrorHandler.4dm
testing/Project/Sources/Methods/TestGlobalErrorHandler.4dm
testing/Project/Sources/Classes/TestRunner.4dm

FlutesForHire · 2026-05-07T18:22:12Z

Everything looks good to me with these changes. @KyleKincer I'm assuming you'd prefer to be the one that does the merge, since you're the owner of this repo?

KyleKincer · 2026-05-07T20:00:21Z

Everything looks good to me with these changes. @KyleKincer I'm assuming you'd prefer to be the one that does the merge, since you're the owner of this repo?

@FlutesForHire Looks good to me as well! I don't mind y'all taking ownership and merging changes into this-- you have more stake in it than I do at this point. Don't let me slow you down! More than happy to look at anything you would like me to, but merge away.

coderabbitai Bot reviewed Apr 17, 2026

View reviewed changes

FlutesForHire reviewed Apr 23, 2026

View reviewed changes

coderabbitai Bot reviewed May 4, 2026

View reviewed changes

gamescoot force-pushed the cursor/error-reporting-to-host branch 2 times, most recently from ec3d099 to b13b8d2 Compare May 4, 2026 19:06

gamescoot force-pushed the cursor/error-reporting-to-host branch from b13b8d2 to e114b5d Compare May 4, 2026 19:11

Scott Harris and others added 2 commits May 4, 2026 17:46

gamescoot force-pushed the cursor/error-reporting-to-host branch from e114b5d to be2c7cc Compare May 4, 2026 21:47

coderabbitai Bot reviewed May 4, 2026

View reviewed changes

Comment thread testing/Project/Sources/Classes/_TestFunction.4dm

Comment thread testing/Project/Sources/Classes/Assert.4dm

gamescoot merged commit 39a9746 into KyleKincer:main May 7, 2026
1 check passed

coderabbitai Bot mentioned this pull request May 11, 2026

feat: streamlined output, failure detection, and call chain support for external errors #32

Open

Conversation

gamescoot commented Apr 17, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Changes

Test plan

Prerequisites

Steps

Uh oh!

coderabbitai Bot commented Apr 17, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

Changes

Sequence Diagram(s)

Estimated code review effort

Possibly related PRs

Suggested labels

Poem

❌ Failed checks (1 warning)

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot Apr 17, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

FlutesForHire Apr 23, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot Apr 23, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

FlutesForHire Apr 23, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

coderabbitai Bot May 4, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

FlutesForHire commented May 7, 2026

Uh oh!

KyleKincer commented May 7, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

gamescoot commented Apr 17, 2026 •

edited

Loading

coderabbitai Bot commented Apr 17, 2026 •

edited

Loading

coderabbitai Bot Apr 17, 2026 •

edited

Loading

KyleKincer commented May 7, 2026 •

edited

Loading