fix(ai): normalize boolean scores in onlineEval scoresSummary by lukasmalkmus · Pull Request #263 · axiomhq/ai

lukasmalkmus · 2026-02-24T14:16:13Z

Overview

onlineEval() was writing raw boolean scores (true/false) into the parent eval span's eval.case.scores attribute, while child scorer spans correctly normalized them to 1/0 with eval.score.is_boolean metadata via normalizeBooleanScore()
Apply the same normalizeBooleanScore() call when building scoresSummary so both parent and child spans produce consistent numeric scores

Note

Low Risk
Small telemetry-only change that affects how scores are serialized into span attributes; low risk aside from potential downstream expectations of boolean values.

Overview
Ensures onlineEval() writes consistent numeric scores into the parent eval span’s eval.case.scores summary by normalizing boolean score values (true/false → 1/0) and propagating the corresponding eval.score.is_boolean metadata.

This updates onlineEval.ts to call normalizeBooleanScore() while building scoresSummary, and only emits normalized metadata when non-empty.

^{Written by Cursor Bugbot for commit bfa6ce7. This will update automatically on new commits. Configure here.}

When a scorer returned `{ score: true }` or `{ score: false }`, the parent eval span's `eval.case.scores` attribute contained raw booleans instead of normalized numeric values with `eval.score.is_boolean` metadata. This was inconsistent with individual scorer child spans which already called `normalizeBooleanScore()` via `executor.ts`. Apply the same normalization when building `scoresSummary` so both the parent eval span and child scorer spans produce consistent numeric scores.

cursor

Cursor Bugbot has reviewed your changes and found 1 potential issue.

^{Bugbot Autofix is OFF. To automatically fix reported issues with Cloud Agents, enable Autofix in the Cursor dashboard.}

packages/ai/src/online-evals/onlineEval.ts

pkg-pr-new · 2026-02-24T14:17:32Z

Open in StackBlitz

npm i https://pkg.pr.new/axiomhq/ai/axiom@263

commit: bfa6ce7

🤖 I have created a release *beep* *boop* --- ## [0.46.1](axiom-v0.46.0...axiom-v0.46.1) (2026-02-25) ### Bug Fixes * **ai:** move online eval scorer counters to eval.* namespace ([#264](#264)) ([bef94db](bef94db)) * **ai:** normalize boolean scores in onlineEval scoresSummary ([#263](#263)) ([ff75842](ff75842)) --- This PR was generated with [Release Please](https://github.com/googleapis/release-please). See [documentation](https://github.com/googleapis/release-please#release-please).  --- > [!NOTE] > **Low Risk** > Release metadata/changelog-only changes with no functional code modifications in this PR. > > **Overview** > Publishes `packages/ai` version `0.46.1` by updating the release manifest, `package.json` version, and `CHANGELOG.md`. > > The changelog for `0.46.1` notes two bug fixes: moving online eval scorer counters into the `eval.*` namespace and normalizing boolean scores in `onlineEval` `scoresSummary`. > > <sup>Written by [Cursor Bugbot](https://cursor.com/dashboard?tab=bugbot) for commit 90b0bd1. This will update automatically on new commits. Configure [here](https://cursor.com/dashboard?tab=bugbot).</sup>

lukasmalkmus self-assigned this Feb 24, 2026

lukasmalkmus requested review from c-ehrlich and thesollyz February 24, 2026 14:16

cursor bot reviewed Feb 24, 2026

View reviewed changes

packages/ai/src/online-evals/onlineEval.ts Show resolved Hide resolved

lukasmalkmus enabled auto-merge (squash) February 24, 2026 14:18

thesollyz approved these changes Feb 25, 2026

View reviewed changes

lukasmalkmus merged commit ff75842 into main Feb 25, 2026
11 checks passed

lukasmalkmus deleted the lukasmalkmus/yoxwysopstxv branch February 25, 2026 13:32

axiom-automation mentioned this pull request Feb 25, 2026

chore(main): release axiom 0.46.1 #267

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(ai): normalize boolean scores in onlineEval scoresSummary#263

fix(ai): normalize boolean scores in onlineEval scoresSummary#263
lukasmalkmus merged 1 commit intomainfrom
lukasmalkmus/yoxwysopstxv

lukasmalkmus commented Feb 24, 2026 •

edited by cursor bot

Loading

Uh oh!

cursor bot left a comment

Uh oh!

Uh oh!

pkg-pr-new bot commented Feb 24, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

lukasmalkmus commented Feb 24, 2026 • edited by cursor bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Overview

Uh oh!

cursor bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

pkg-pr-new bot commented Feb 24, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

lukasmalkmus commented Feb 24, 2026 •

edited by cursor bot

Loading