Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Ugly implementation of evaluation control #17

Closed
KornbergFresnel opened this issue Jul 29, 2021 · 1 comment
Closed

Ugly implementation of evaluation control #17

KornbergFresnel opened this issue Jul 29, 2021 · 1 comment
Labels
enhancement New feature or request

Comments

@KornbergFresnel
Copy link
Member

KornbergFresnel commented Jul 29, 2021

The changes from PR #12 added a new feature to do policy evaluation, while it was ignored. Polishment is required

settings

"test_num_episodes": 0,

parameter

test: bool = False,

some related logics

if self._offline_dataset is None and not self._test:

def get_test(self):

group="testing" if self._test else "rollout",

@KornbergFresnel KornbergFresnel added the enhancement New feature or request label Jul 29, 2021
@zbzhu99
Copy link
Contributor

zbzhu99 commented Jul 29, 2021

Actually, I have some personal thoughts about the evaluation worker, which should work quite differently from normal rollout workers.

First, the evaluation worker should not sample data in asynchronous manners, which will cause a waste of computing resources. Instead, it is supposed to wait for the training manager to send signals along with policy parameters to be evaluated. Maybe it has to maintain a local parameter buffer to handle the case when new signals coming in during evaluation time.

Second, the information received from the training manager should contain the corresponding (training) epoch number, so the evaluation worker can log the evaluation metrics with the training epoch rather than the evaluator's local sample epoch.

Thank you!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

2 participants