v0.3.1

@matee8 released this 01 Dec 08:14
57d4015

Added

  • Added "Single Evaluation" mode to the benchmarking pipeline.
  • Added model_kwargs to RLConfig.
  • Updated benchmark operator resolution to support classes from backtracking_llm.rl.operators.
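To illustrate the operator resolution change, here is a minimal sketch of name-based class lookup across several modules. It is not the project's actual implementation: the helper `resolve_class` and its signature are hypothetical, and standard-library modules stand in for the project's own so that the example runs anywhere.

```python
import importlib
from typing import Sequence


def resolve_class(name: str, module_names: Sequence[str]) -> type:
    """Return the first class called `name` found in `module_names`.

    Hypothetical helper, not part of backtracking_llm. Modules are
    searched in order, so earlier entries take precedence.
    """
    for module_name in module_names:
        module = importlib.import_module(module_name)
        candidate = getattr(module, name, None)
        # Accept only real classes, not functions or constants
        # that happen to share the requested name.
        if isinstance(candidate, type):
            return candidate
    raise LookupError(f"no class named {name!r} in {list(module_names)}")


# Stand-in usage: `math` has no `Counter`, so the lookup falls
# through to `collections`.
counter_cls = resolve_class("Counter", ["math", "collections"])
```

Under this scheme, supporting a new package amounts to adding its module path (for example `backtracking_llm.rl.operators`) to the list of searched modules.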

Fixed

  • Fixed RlPolicyOperator initialization to correctly accept file paths as strings (previously required pathlib.Path objects).
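A common way to accept both strings and `pathlib.Path` objects is to normalize the argument once at the boundary. The sketch below shows that pattern in isolation. The helper name `normalize_policy_path` is hypothetical, and the actual `RlPolicyOperator` constructor may handle this differently.

```python
import os
from pathlib import Path
from typing import Union

# Either a plain string or any object implementing os.PathLike.
PathLike = Union[str, "os.PathLike[str]"]


def normalize_policy_path(path: PathLike) -> Path:
    """Coerce a string or path-like value to a pathlib.Path.

    Hypothetical helper, not part of backtracking_llm. os.fspath
    raises TypeError for values that are not path-like (e.g. int).
    """
    return Path(os.fspath(path))


# Both spellings produce the same Path object.
from_str = normalize_policy_path("models/policy.zip")
from_path = normalize_policy_path(Path("models") / "policy.zip")
```

With normalization done up front, the rest of the code can rely on `Path` methods regardless of how the caller supplied the location.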