Skip to content

fix: loss diagnostic logging + training step test#227

Merged
abrichr merged 1 commit into
mainfrom
fix/loss-diagnostics
Mar 29, 2026
Merged

fix: loss diagnostic logging + training step test#227
abrichr merged 1 commit into
mainfrom
fix/loss-diagnostics

Conversation

@abrichr
Copy link
Copy Markdown
Member

@abrichr abrichr commented Mar 29, 2026

Scientific notation for loss, grad_norm, advantages. 13 tests.

…ges)

loss=0.0000 was misleading: %.4f truncation + symmetric advantages
canceling. Now logs loss in scientific notation, absolute loss per
rollout, gradient norm, and per-rollout advantages.

13 vision loss tests (was 12). New test verifies loss_abs > 0 and
advantages are symmetric with reward variance.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
@abrichr abrichr merged commit 74fd646 into main Mar 29, 2026
abrichr added a commit that referenced this pull request Mar 29, 2026
PR #227 accidentally committed local experiment data via git add -A:
- flywheel_results/ (224 screenshots + JSON)
- .claude/worktrees/ (31 agent gitlinks)
- annotated_demos/ (16 files)
- eval_results/ (11 screenshots)
- grpo_output/ (1 file)
- demos/*/synthetic_correction/ (placeholder PNGs)
- .beads/ (SQLite database)

All removed from tracking. .gitignore updated to prevent reoccurrence.
No sensitive data was exposed (confirmed via tidy scan).

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
abrichr added a commit that referenced this pull request Mar 29, 2026
These were committed before PR #227 and missed in the first cleanup.
waa_recordings/ contains WAA experiment screenshots (PNGs).
.beads/ contains a SQLite database for local tooling.

Both are already in .gitignore from the prior cleanup commit.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant