This repository was archived by the owner on Nov 19, 2025. It is now read-only.


@terrykong
Collaborator

What does this PR do?

Removes the warning log for the old DPO dataset format. The warning cluttered users' terminals and made it hard to follow training progress.


Changelog

  • Please update the CHANGELOG.md under next version with high level changes in this PR.

Usage

  • You can potentially add a usage example below
# Add a code snippet demonstrating how to use this 

Before your PR is "Ready for review"

Pre checks:

Checklist when contributing a new algorithm

  • Does the trainer resume and restore all model states?
  • Does the trainer support all parallelism techniques (PP, TP, DP)?
  • Does the trainer support max_steps=-1 and validation?
  • Does the trainer only call APIs defined in alignable_interface.py?
  • Does the trainer have proper logging?

Additional Information

  • Related to # (issue)

Signed-off-by: Terry Kong <terryk@nvidia.com>
@terrykong terrykong requested a review from ashors1 February 19, 2025 07:05
@terrykong terrykong added the Run CICD Set + un-set to retrigger (add after r*.*.* labels) label Feb 19, 2025
@terrykong terrykong enabled auto-merge (squash) February 19, 2025 16:56
@terrykong terrykong merged commit 203b669 into main Feb 19, 2025
20 checks passed
@terrykong terrykong deleted the tk/rm-dpo-spammy-warning branch February 19, 2025 17:24


3 participants