[codex] Restore PR 607 KL loss removal by FurtherAI · Pull Request #639 · OpenPipe/ART

FurtherAI · 2026-04-02T08:34:00Z

Summary

Removes the Schulman KL term that PR 619 accidentally reintroduced into art.loss.loss_fn, restoring the behavior from PR 607.

Root Cause

The LoRA correctness work in PR 619 refactored Loss and brought back the direct KL-divergence term and Loss.kl field that PR 607 had intentionally removed.

Changes

remove the reintroduced KL-divergence accumulation from src/art/loss.py
remove the now-invalid loss.kl scaling path from the Megatron oracle worker helper
add a focused regression assertion so Loss does not grow the kl field back silently

Validation

uv sync --all-extras
uv run pytest src/art/test/test_kl_advantage.py -q
uv run python -m py_compile src/art/loss.py tests/integration/megatron_oracle_worker.py src/art/test/test_kl_advantage.py

fix: restore PR 607 KL loss removal

348410b

FurtherAI force-pushed the austin/fix_pr_607_regression_from_pr_619 branch from f9d79b2 to 348410b Compare April 2, 2026 08:40

FurtherAI marked this pull request as ready for review April 2, 2026 17:22

FurtherAI merged commit 75a81e9 into main Apr 2, 2026
5 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[codex] Restore PR 607 KL loss removal#639

[codex] Restore PR 607 KL loss removal#639
FurtherAI merged 1 commit intomainfrom
austin/fix_pr_607_regression_from_pr_619

FurtherAI commented Apr 2, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

FurtherAI commented Apr 2, 2026

Summary

Root Cause

Changes

Validation

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant