Open
Labels
question (Further information is requested)
Description
Say I have an evaluator that returns a score for some generated code. If the code does not compile (e.g. because of a syntax error), I simply return a score of 0.
Say I also have a config with early stopping patience set to a smallish number, like 3.
What I am observing right now is that if my evaluator fails to run the iterated code 3 times in a row, the run treats the unchanged score as success and moves on, even though all 3 attempts actually failed.
Is there a way to distinguish between 'this was successful and that is why the score did not change (and you should stop running)' and 'this was a failure and that is why the score did not change (and you should keep running)'?
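To make the distinction concrete, here is a minimal sketch of a patience counter that only counts successful evaluations that did not improve the score, and treats failures separately. This is not the project's API: `EvalResult`, its `failed` flag, `PatienceTracker`, and `max_failures` are all hypothetical names used for illustration, and it assumes the evaluator can report a failure separately from the score.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class EvalResult:
    """Hypothetical evaluator output: a score plus an explicit failure flag."""
    score: float
    failed: bool = False  # True when the code could not be compiled or run


class PatienceTracker:
    """Hypothetical early-stopping helper that does not count failures as 'no improvement'."""

    def __init__(self, patience: int, max_failures: Optional[int] = None):
        self.patience = patience
        self.max_failures = max_failures  # optional cap on consecutive failures
        self.best_score = float("-inf")
        self.stale = 0      # consecutive successful runs without improvement
        self.failures = 0   # consecutive failed runs

    def update(self, result: EvalResult) -> bool:
        """Record one evaluation and return True if the run should stop."""
        if result.failed:
            # A failure says nothing about convergence, so patience is untouched.
            self.failures += 1
            return self.max_failures is not None and self.failures >= self.max_failures

        self.failures = 0
        if result.score > self.best_score:
            self.best_score = result.score
            self.stale = 0
        else:
            self.stale += 1
        return self.stale >= self.patience
```

With `patience=3`, three failed evaluations in a row leave the patience counter at zero, so the run keeps going; only three successful evaluations without improvement trigger a stop.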