You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Fix reward scaler when run on varied episode lengths (#455)
When calling `fit` with a reward scaler on a dataset with varied episode lengths,
the following error would be thrown in the `fit_with_trajectory_slicer` method:
```
ValueError: setting an array element with a sequence. The requested array has an
inhomogeneous shape after 1 dimensions.
```
This commit fixes the issue by flattening the rewards before calculating the mean and std.
0 commit comments