Added
- Added "Single Evaluation" mode to the benchmarking pipeline.
- Added
model_kwargstoRLConfig. - Updated benchmark operator resolution to support classes from
backtracking_llm.rl.operators.
Fixed
- Fixed
RlPolicyOperatorinitialization to correctly accept file paths as strings (previously requiredpathlib.Pathobjects).