Skip to content

Conversation

@chinganc
Copy link
Member

Adding a high level train helper function.

This PR also makes the default of

  • MinibatchAlgorithm: ensure_improvement: bool = True. This has been found to ensure stability. This setting is used in some other recent algos like in GEPA and Adaflow.
  • LLMJudge: use_formatted_response: bool = False. The current default formatting is not robust enough. I suggest turning it off by default and just returns the llm's response.

@chinganc chinganc changed the title Feature/train Adding a train helper function and updating defaults of MinibatchAlgorithm and LLMJudge Aug 22, 2025


@trace.model
class Learner:
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Like this name! Learner and Guide

Copy link
Member

@allenanie allenanie left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@allenanie
Copy link
Member

allenanie commented Aug 28, 2025

There are some improvements to this PR feature/train, but it can be a separate PR:

LLM Judge will not silently fail (if it outputs score=0 and rejects a parameter update, due to lack of context)

@chinganc chinganc merged commit c347274 into experimental Aug 28, 2025
1 check passed
@chinganc chinganc deleted the feature/train branch August 28, 2025 18:04
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants