Feudal policy on PongDeterministic-v4 #8

ThanosRoidis · 2018-09-06T12:08:13Z

I have been trying to get the 'feudal' policy to work on the 'PongDeterministic-v4' environment, but I have had no luck. The 'lstm' policy seems to work for me, but if I change it to 'feudal', the episode rewards do not increase even after 8 hours of training with 1 worker; they stay stuck at -20 on both the 'master' branch and the 'dilated_fix' branch.

I saw the other issues mentioning that it doesn't achieve the benchmarks from the paper, but is it supposed to work on Pong at least? Or am I doing something wrong?
