Commit ab60ce1

Want element multiplication, not dot product
1 parent 29e63f0 commit ab60ce1

1 file changed, +1 -1 lines changed

rl_algo_impls/shared/callbacks/microrts_reward_decay_callback.py

Lines changed: 1 addition & 1 deletion
@@ -28,7 +28,7 @@ def on_step(self, timesteps_elapsed: int = 1) -> bool:
 
         progress = self.timesteps_elapsed / self.total_train_timesteps
         # Decay all rewards except WinLoss
-        reward_weights = self.base_reward_weights @ np.array(
+        reward_weights = self.base_reward_weights * np.array(
             [1] + [1 - progress] * (len(self.base_reward_weights) - 1)
         )
         self.microrts_env.reward_weight = reward_weights
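
For reviewers, a minimal standalone sketch of why the operator matters, assuming `base_reward_weights` is a 1-D NumPy array whose first entry is the WinLoss weight. The weight values and the helper name `decayed_reward_weights` are hypothetical and do not come from the repository. For two 1-D arrays of equal length, `@` collapses them to a single scalar (the dot product), while `*` keeps one weight per reward component.

```python
import numpy as np


def decayed_reward_weights(base_reward_weights, progress):
    """Hypothetical helper: keep the first weight (WinLoss) fixed and
    scale every other weight by (1 - progress)."""
    base = np.asarray(base_reward_weights, dtype=float)
    decay = np.array([1.0] + [1.0 - progress] * (len(base) - 1))
    return base * decay  # element-wise: one weight per reward component


# Hypothetical weights; the first entry stands in for WinLoss.
base = np.array([10.0, 1.0, 1.0, 0.2])
progress = 0.5
decay = np.array([1] + [1 - progress] * (len(base) - 1))

dot = base @ decay          # old behaviour: a single scalar
elementwise = base * decay  # new behaviour: array with the same shape as base

print(np.ndim(dot), elementwise.shape)
print(decayed_reward_weights(base, progress))
```

With `@`, the environment would have received one number instead of a per-component weight vector, which is the behaviour this commit replaces.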
