Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

NIPS2018: AI for Prosthetics Challenge的part3中的ensemble_num的含义? #57

Closed
janyChan opened this issue Mar 8, 2019 · 4 comments · Fixed by #58
Closed

NIPS2018: AI for Prosthetics Challenge的part3中的ensemble_num的含义? #57

janyChan opened this issue Mar 8, 2019 · 4 comments · Fixed by #58

Comments

@janyChan
Copy link

janyChan commented Mar 8, 2019

请问,Winning Solution for NIPS2018: AI for Prosthetics Challenge的Part3: Training in random velocity environment for round2 evaluation中ensemble_num指的是什么?,是类似A3C的思想吗?还是同时并行训练ensemble_num个模型,test的时候再分别检测各个模型的效果,从中选出一个好的?

@janyChan janyChan changed the title part3中的ensemble_num? NIPS2018: AI for Prosthetics Challenge的part3中的ensemble_num的含义? Mar 9, 2019
@zenghsh3
Copy link
Contributor

  1. ensemble_num是指同时训练K个模型(K个Actor和K个Critic),不过K个模型会共享底部参数(Multi-head方式);
  2. 预测的时候K个Critic会对K个Actor的输出分别打分,每个Critic会分别归一化它们自己的打分,然后每个Actor的输出最终得分为归一化后的Critic打分之和,我们会选择最终得分最高的Actor输出。这块逻辑可以参考源码:
    def define_ensemble_predict(self, obs):

@janyChan
Copy link
Author

janyChan commented Mar 11, 2019

def define_ensemble_predict(self, obs);函数我看到在build_program中调用过,但build_program没看到在哪使用?

@janyChan
Copy link
Author

build_program是在PARL/parl/framework/agent_base.py中,模型运行时就直接调用的吗?

@TomorrowIsAnOtherDay
Copy link
Collaborator

TomorrowIsAnOtherDay commented Mar 11, 2019

self.build_program()

是的,agent 构建的时候自动调用这个函数。

看来这个函数的调用方式不大容易定位,我们会update文档来说明这个问题

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging a pull request may close this issue.

3 participants