Conversation

@ucalyptus2

What does this PR do?

This PR introduces a new InterleaveTrainer class that alternates between different training strategies within a single training loop. This allows more flexible training patterns in which different optimization objectives are interleaved during model training.

Key additions:

  • Add InterleaveTrainer class and configuration

    • Implements a trainer that can alternate between different training strategies
    • Provides configurable scheduling of training phases
    • Supports seamless integration with existing TRL trainers
  • Add unit tests for interleaved training

    • Comprehensive test coverage for trainer functionality
    • Tests for configuration validation
    • Integration tests with different training scenarios
  • Update __init__.py files to expose the new trainer

    • Make InterleaveTrainer accessible through the main TRL package
    • Maintain consistent import patterns with other trainers
  • Implement trainer configuration with InterleaveConfig (see the usage sketch after this list)

    • Flexible configuration options for defining training schedules
    • Support for customizing phase transitions
    • Type-safe configuration validation
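
As a rough illustration, usage might look something like the sketch below. Note that the argument names (schedule, steps_per_phase, phases) and the dataset/model identifiers are assumptions for illustration, not the exact API introduced by this PR:

```python
# Hypothetical usage sketch: InterleaveTrainer and InterleaveConfig come from this PR,
# but every argument name below (schedule, steps_per_phase, phases) is an assumption.
from datasets import load_dataset
from trl import DPOTrainer, InterleaveConfig, InterleaveTrainer, SFTTrainer

# Example datasets/model; any SFT-style and preference-style datasets would do here.
sft_dataset = load_dataset("trl-lib/Capybara", split="train")
dpo_dataset = load_dataset("trl-lib/ultrafeedback_binarized", split="train")

config = InterleaveConfig(
    output_dir="interleaved-run",
    schedule="round_robin",   # assumed option: alternate phases in a fixed order
    steps_per_phase=100,      # assumed option: optimizer steps before switching phase
)

trainer = InterleaveTrainer(
    model="Qwen/Qwen2.5-0.5B-Instruct",
    args=config,
    phases={
        "sft": {"trainer_cls": SFTTrainer, "train_dataset": sft_dataset},
        "dpo": {"trainer_cls": DPOTrainer, "train_dataset": dpo_dataset},
    },
)
trainer.train()
```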

Technical Details

The InterleaveTrainer allows users to define multiple training phases that can be alternated during the training process. This is particularly useful for scenarios where you want to:

  • Alternate between different learning objectives
  • Switch between different datasets during training
  • Implement curriculum learning strategies
  • Balance multiple training goals in a controlled manner (a minimal scheduling sketch follows this list)
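
To make the interleaving idea concrete, here is a minimal, framework-free sketch of a round-robin phase schedule. The Phase dataclass and interleave_train helper are purely illustrative and are not part of this PR:

```python
# Minimal, framework-free sketch of interleaved training phases (illustrative only;
# it does not reflect the actual InterleaveTrainer implementation).
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Phase:
    name: str                       # e.g. "sft", "dpo", or a curriculum stage
    step_fn: Callable[[int], None]  # performs one optimization step for this phase
    steps_per_phase: int            # steps to run before switching to the next phase


def interleave_train(phases: List[Phase], total_steps: int) -> None:
    """Round-robin over phases, running each for steps_per_phase steps at a time."""
    step = 0
    while step < total_steps:
        for phase in phases:
            for _ in range(phase.steps_per_phase):
                if step >= total_steps:
                    return
                phase.step_fn(step)  # in practice: forward pass, loss, optimizer step
                step += 1


if __name__ == "__main__":
    phases = [
        Phase("sft", lambda s: print(f"step {s}: SFT objective"), steps_per_phase=2),
        Phase("dpo", lambda s: print(f"step {s}: DPO objective"), steps_per_phase=1),
    ]
    interleave_train(phases, total_steps=9)
```

The same loop structure generalizes to curriculum learning by ordering phases from easy to hard instead of cycling through them.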

Before submitting

  • Did you read the contributor guideline?
  • Did you write any new necessary tests?
  • Did you make sure to update the documentation with your changes?

Who can review?

Anyone familiar with TRL's trainer implementations and interested in advanced training strategies. @huggingface/trl-core-team would be great reviewers for this feature.

@qgallouedec
Member

Hi, thanks for your contribution! I’m unsure whether the potential benefit outweighs the added complexity. For now, I’m putting this on hold to gauge community interest in supporting this feature.

@ucalyptus2 closed this Apr 21, 2025