We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
在dqn算例中第140行为什么要用 reward = -1 if done else 0.1重新覆写reward为1或者0.1呢?而不是用gym环境给出的reward。https://zhuanlan.zhihu.com/p/21477488 这篇文章中结构差不多,但没有覆写,而是一个新的变量reward_agent = -1 if done else 0.1,其他dqn变种的算例中也都同样如此。
reward = -1 if done else 0.1
reward_agent = -1 if done else 0.1
The text was updated successfully, but these errors were encountered:
你好,这里可以覆写,也可以不覆写。 如果你想自己设计下这个环境的奖励函数,重新设计奖励,看看有没有效果提升,那么就可以覆写。 如果仅仅是学习,跑一下即可,那么不用覆写。
Sorry, something went wrong.
No branches or pull requests
在dqn算例中第140行为什么要用
reward = -1 if done else 0.1
重新覆写reward为1或者0.1呢?而不是用gym环境给出的reward。https://zhuanlan.zhihu.com/p/21477488 这篇文章中结构差不多,但没有覆写,而是一个新的变量reward_agent = -1 if done else 0.1
,其他dqn变种的算例中也都同样如此。The text was updated successfully, but these errors were encountered: