Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

What does the actions listed in sample configs means? #125

Closed
agupta83 opened this issue May 3, 2019 · 3 comments
Closed

What does the actions listed in sample configs means? #125

agupta83 opened this issue May 3, 2019 · 3 comments

Comments

@agupta83
Copy link

agupta83 commented May 3, 2019

Hi,

In parameter file dqn_example.json#L17-L20 to train DQN model, the actions are

"actions": [
    "4",
    "5"
  ],

What does it mean given cartpole-v0 has only two actions {0, 1}?

Thanks

@agupta83
Copy link
Author

agupta83 commented May 9, 2019

@MisterTea Any info or suggestion will be very helpful. I tried to print details during evaluation and I still haven't figure it out

Float State Features = [{'0': 0.16316721, '1': 1.7206059, '2': -0.22225317, '3': -2.8040316}]
Prediction: [{'4': -1.6168417, '5': 0.6317559}]
Max Q values = 5
Action Index = 1

My guess so far is that actions values from dqn_example.json is mapped as
{'4': 0, '5': 1}

Thanks

@MisterTea
Copy link
Contributor

MisterTea commented May 11, 2019

Hey! The actions are integers starting where the state integers leave off. In this case, the state features are [0...3]

The action feature IDs and state feature IDs can't overlap, which is why the actions are not 0 and 1

@agupta83
Copy link
Author

Thank you so much @MisterTea.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants