Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

clarification of environments used in A2C algorithm #94

Closed
TomorrowIsAnOtherDay opened this issue Jul 23, 2019 · 6 comments
Closed

clarification of environments used in A2C algorithm #94

TomorrowIsAnOtherDay opened this issue Jul 23, 2019 · 6 comments
Assignees

Comments

@TomorrowIsAnOtherDay
Copy link
Collaborator

Currently we are using envname&NoFrameSkip-v4 for training.

'env_name': 'PongNoFrameskip-v4',

We should add more details in READEME for users to make sure that they use the actual environments.

@TomorrowIsAnOtherDay TomorrowIsAnOtherDay self-assigned this Jul 23, 2019
@TomorrowIsAnOtherDay
Copy link
Collaborator Author

note:

from paddle.fluid.param_attr import ParamAttr

this line should be removed.

@TomorrowIsAnOtherDay
Copy link
Collaborator Author

note2:

Currently the command tensorboard cannot be found after installing PARL in Mac, and we should add fix this issue in our installation script.

@TomorrowIsAnOtherDay
Copy link
Collaborator Author

note3:
refine the API get_weights of parl.Model so it return a dict like parl.Algorithm and parl.Agent.

@TomorrowIsAnOtherDay
Copy link
Collaborator Author

note4:
use a CPU version of Paddle for checking the compatibility on CPU

@zenghsh3
Copy link
Contributor

note5:
when creating too many actors: "zmq.error.ZMQError: Too many open files"

@TomorrowIsAnOtherDay
Copy link
Collaborator Author

All the environments use gym API now. I

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants