You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi, @kaymen99 , thanks for your code. Is there anything wrong with the her_augmentation function under HER.py file where the re-computed reward is always zero?
And why should we take the future observation as the augmented observation, shouldn't we keep the observation in the current timestep, i.e., obs, _, _ = obs_array[index].values() as the augmented observation?
The text was updated successfully, but these errors were encountered:
Hi, @kaymen99 , thanks for your code. Is there anything wrong with the
her_augmentation
function underHER.py
file where the re-computed reward is always zero?And why should we take the future observation as the augmented observation, shouldn't we keep the observation in the current timestep, i.e.,
obs, _, _ = obs_array[index].values()
as the augmented observation?The text was updated successfully, but these errors were encountered: