Adding a train helper function and updating defaults of MinibatchAlgorithm and LLMJudge #33

chinganc · 2025-08-22T22:35:06Z

Adding a high level train helper function.

This PR also makes the default of

MinibatchAlgorithm: ensure_improvement: bool = True. This has been found to ensure stability. This setting is used in some other recent algos like in GEPA and Adaflow.
LLMJudge: use_formatted_response: bool = False. The current default formatting is not robust enough. I suggest turning it off by default and just returns the llm's response.

…be true.

allenanie · 2025-08-28T16:26:34Z

examples/train_model.py

+
+
+@trace.model
+class Learner:


Like this name! Learner and Guide

allenanie

LGTM

allenanie · 2025-08-28T16:29:47Z

There are some improvements to this PR feature/train, but it can be a separate PR:

LLM Judge will not silently fail (if it outputs score=0 and rejects a parameter update, due to lack of context)

chinganc added 6 commits August 19, 2025 23:54

Fix missing oprov2 problem

e9bfb07

Add an assertion to make optimizer receives non-empty parameters.

e258e87

Add a prototype

3d444cb

Update train.py

50c7adc

Make train runnable and add an example code

3c79cca

Fix a bug in the example code. Set minibatch's ensure improvement to …

6def8fa

…be true.

chinganc changed the title ~~Feature/train~~ Adding a train helper function and updating defaults of MinibatchAlgorithm and LLMJudge Aug 22, 2025

chinganc assigned allenanie and adith387 Aug 22, 2025

Make train support single-node optimization

432459a

allenanie reviewed Aug 28, 2025

View reviewed changes

examples/train_model.py

@trace.model

class Learner:

Copy link

Member

allenanie Aug 28, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Like this name! Learner and Guide

allenanie approved these changes Aug 28, 2025

View reviewed changes

chinganc merged commit c347274 into experimental Aug 28, 2025
1 check passed

chinganc deleted the feature/train branch August 28, 2025 18:04

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Adding a train helper function and updating defaults of MinibatchAlgorithm and LLMJudge #33

Adding a train helper function and updating defaults of MinibatchAlgorithm and LLMJudge #33

Uh oh!

chinganc commented Aug 22, 2025

Uh oh!

allenanie Aug 28, 2025

Uh oh!

allenanie left a comment

Uh oh!

allenanie commented Aug 28, 2025 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Adding a train helper function and updating defaults of MinibatchAlgorithm and LLMJudge #33

Adding a train helper function and updating defaults of MinibatchAlgorithm and LLMJudge #33

Uh oh!

Conversation

chinganc commented Aug 22, 2025

Uh oh!

allenanie Aug 28, 2025

Choose a reason for hiding this comment

Uh oh!

allenanie left a comment

Choose a reason for hiding this comment

Uh oh!

allenanie commented Aug 28, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

allenanie commented Aug 28, 2025 •

edited

Loading