Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

可能的错误:6.2.1小节--自举导致偏差的传播 #32

Open
2017040264 opened this issue Nov 13, 2021 · 2 comments
Open

可能的错误:6.2.1小节--自举导致偏差的传播 #32

2017040264 opened this issue Nov 13, 2021 · 2 comments

Comments

@2017040264
Copy link

image

更新参数推导里面漏了TD误差

@wangshusen
Copy link
Owner

非常感谢你的提醒。其实没有错。对 L 关于 w 求导,结果等于 TD 误差乘以 Q 关于 w 的梯度。

@2017040264
Copy link
Author

哦哦,对对,懂了懂了。感谢王教授。

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants