actor_log_prob.transpose in MPO causes error #10

zhaoyi11 · 2021-11-15T17:40:01Z

Hi,

Thanks so much for the clean implementation. In MPO, calling actor_log_prob.transpose((0, 1)) raises an error.

Line 236 in a2c6067

actor_log_prob = actor_log_prob.transpose((0, 1))

TypeError: transpose permutation isn't a permutation of operand dimensions, got permutation (0, 1) for operand shape (256, 20, 1).

I checked the shapes of actor_log_prob and weights, and they are the same. After removing this line, I get performance on the cartpole_swingup env similar to what is shown in README.md.
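For anyone hitting the same error, here is a minimal sketch of why the call fails. It is not the repository's code: the arrays are hypothetical stand-ins with the shape from the traceback, `try_transpose` is a helper made up for illustration, and the elementwise product at the end is only an assumption about how the two arrays are combined. A transpose permutation has to list every axis of the array exactly once, so (0, 1) is rejected for a 3-D operand.

```python
import jax.numpy as jnp

# Hypothetical stand-ins using the shape reported in the traceback.
actor_log_prob = jnp.zeros((256, 20, 1))
weights = jnp.ones((256, 20, 1))


def try_transpose(x, axes):
    """Return the transposed shape, or None if the permutation is rejected."""
    try:
        return x.transpose(axes).shape
    except (TypeError, ValueError):
        # The traceback in this issue shows JAX raising TypeError here.
        return None


# Only two axes are named for a 3-D array, so this is not a valid permutation.
bad = try_transpose(actor_log_prob, (0, 1))

# A full permutation of all three axes is accepted.
ok = try_transpose(actor_log_prob, (1, 0, 2))

# Assumption: the arrays are combined elementwise. With identical shapes,
# that needs no transpose at all.
weighted = weights * actor_log_prob
```

Since the two arrays already have the same shape, dropping the transpose leaves any elementwise combination of them well defined.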

Best,
Yi

henry-prior · 2021-11-15T18:21:06Z

Looks like this line was unnecessary; I removed it in #11.

henry-prior mentioned this issue Nov 15, 2021

[Bug] Fix MPO transpose bug and pass seed to agent classes #11

Merged

henry-prior closed this as completed Nov 15, 2021
