Hello, I have a question about BEAR_IS in your algos.py file.
As is well known, DDPG is effectively one-step Q-learning for continuous control, and BEAR shares the same architecture. Given that, importance sampling seems unnecessary in BEAR: in a one-step backup, the mismatch between the current policy and the behavior policy does not bias the Q-value estimate.
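To make the point concrete, here is a minimal sketch of the two targets for a single logged transition. All names and numbers are illustrative assumptions, not taken from algos.py: `pi_a` and `mu_a` stand for the current-policy and behavior-policy densities at the logged action, and the IS-weighted variant simply multiplies the one-step target by their ratio.

```python
# Toy one-step TD target vs. an importance-sampled variant (illustrative only).
r = 1.0        # reward for the logged transition (s, a, r, s')
gamma = 0.99   # discount factor
q_next = 2.0   # critic estimate Q(s', a') with a' drawn from the current policy

# Plain DDPG/BEAR-style one-step target: the bootstrap action a' already
# comes from the current policy, so no off-policy correction is needed.
td_target = r + gamma * q_next

# IS-weighted variant: reweight by pi(a|s) / mu(a|s) at the *logged* action a.
# For a one-step backup the uncorrected expectation is already unbiased,
# so the ratio mainly adds variance. Densities below are assumed values.
pi_a = 0.3     # current-policy density at the logged action (assumption)
mu_a = 0.5     # behavior-policy density at the logged action (assumption)
td_target_is = (pi_a / mu_a) * (r + gamma * q_next)

print(td_target, td_target_is)
```

This is only a sketch of the question being asked, not the actual update rule in algos.py.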
Can you explain why you wrote an importance-sampling version of BEAR in your project?