Skip to content

Add option to recompute old policy logprobs on the trainer.

dc2fed0
Select commit
Loading
Failed to load commit list.
Merged

Add option to recompute old policy logprobs on the trainer. #1487

Add option to recompute old policy logprobs on the trainer.
dc2fed0
Select commit
Loading
Failed to load commit list.
Google CLA / cla/google succeeded May 12, 2026 in 14s

✅ All contributors are covered under a CLA with Google

See https://cla.developers.google.com/ for more info about Google's Contributor License Agreement (CLA).

ℹ️ Googlers: Go here to view more details and manage scans for this pull request.

Details

The following contributors were found for this pull request:

dc2fed0 PR Opener: @copybara-service[bot]
dc2fed0 Author: @hgao327 <ha****ao​@google.com>

(Only the first commit for a unique contributor is listed.)