What does the actions listed in sample configs means? #125

agupta83 · 2019-05-03T17:31:56Z

Hi,

In parameter file dqn_example.json#L17-L20 to train DQN model, the actions are

"actions": [
    "4",
    "5"
  ],

What does it mean given cartpole-v0 has only two actions {0, 1}?

Thanks

The text was updated successfully, but these errors were encountered:

agupta83 · 2019-05-09T19:39:19Z

@MisterTea Any info or suggestion will be very helpful. I tried to print details during evaluation and I still haven't figure it out

Float State Features = [{'0': 0.16316721, '1': 1.7206059, '2': -0.22225317, '3': -2.8040316}]
Prediction: [{'4': -1.6168417, '5': 0.6317559}]
Max Q values = 5
Action Index = 1

My guess so far is that actions values from dqn_example.json is mapped as
{'4': 0, '5': 1}

Thanks

MisterTea · 2019-05-11T00:09:57Z

Hey! The actions are integers starting where the state integers leave off. In this case, the state features are [0...3]

The action feature IDs and state feature IDs can't overlap, which is why the actions are not 0 and 1

agupta83 · 2019-05-11T15:16:45Z

Thank you so much @MisterTea.

agupta83 closed this as completed May 11, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

What does the actions listed in sample configs means? #125

What does the actions listed in sample configs means? #125

agupta83 commented May 3, 2019

agupta83 commented May 9, 2019 •

edited

Loading

MisterTea commented May 11, 2019 •

edited

Loading

agupta83 commented May 11, 2019

What does the actions listed in sample configs means? #125

What does the actions listed in sample configs means? #125

Comments

agupta83 commented May 3, 2019

agupta83 commented May 9, 2019 • edited Loading

MisterTea commented May 11, 2019 • edited Loading

agupta83 commented May 11, 2019

agupta83 commented May 9, 2019 •

edited

Loading

MisterTea commented May 11, 2019 •

edited

Loading