Investigate nn.utils.clip_grad_norm_
for DQN, DDPG, and TD3
#148
Closed
3 tasks
nn.utils.clip_grad_norm_
for DQN, DDPG, and TD3
#148
Problem Description
Compared to the original implementations, our DQN, DDPG, and TD3 implementations additionally do global gradient clipping, a code-level optimization done in PPO. It is unclear if global gradient clipping offers real performance benefits, so we should look into it and remove it if necessary.
dqn_atari.py
ddpg_continuous_action.py
td3_continuous_action.py
The text was updated successfully, but these errors were encountered: