Is this a bug in runner.py? #14

Ericonaldo · 2018-08-14T08:13:56Z

Thank you for the great codes. When I tried new maps, I found some problems in runner.py. When there are more than one env, one env have done before others, then it is going to restart the game. At the end, all envs are done, the calculated rewards contain many episodes, which is a much bigger number. If you understand what I am talking about, please tell me is there any problem?

Ericonaldo · 2018-08-14T08:36:15Z

These leaves another problem. Followed by the codes, the model should be trained every n_steps, say 12, inside each training, may contains two episode, then some value calculated by the bellman equation will be wrong.

inoryy · 2018-08-14T10:22:01Z

No bugs, episode end is accounted for inside the agent during returns calculation

Ericonaldo · 2018-08-15T04:17:49Z

Oh, thanks a lot, but I still think that there is error in reward calculation...

inoryy · 2018-08-15T04:20:03Z

If you're looking at console logs then they're calculated here. Notice that rewards are averaged and only displayed after all envs report back done flag.

Ericonaldo · 2018-08-16T01:24:38Z

Yeah, I have seen those codes.:) I mean, when calculating the average score, it may include more than one episode in one env because it has to wait for others to report done. So the result is unlikely to represent the average reward of one episode, which I though you'd like to record.

inoryy · 2018-08-16T04:22:55Z

Okay, I can see it now. Can confirm it's a bug, good catch! This most likely doesn't affect non-adversarial minigames, but definitely might explain high variance for others like DefeatZerglingsAndBanelings.

As I mentioned in #7 I'm currently re-writing the project essentially from scratch, so I don' think I'll have time to fix it in legacy codebase, but I'll be sure to keep this bug in mind. I plan to publish the rewrite by the end of August.

Ericonaldo · 2018-08-16T06:45:53Z

Great! Thanks a lot. Hope for your next great release!

inoryy · 2018-08-16T08:35:20Z

Let's keep the ticket open until next release so others are informed as well.

inoryy · 2018-11-25T18:29:37Z

Fixed!

inoryy closed this as completed Aug 14, 2018

inoryy reopened this Aug 15, 2018

inoryy added bug on hold labels Aug 16, 2018

Ericonaldo closed this as completed Aug 16, 2018

inoryy reopened this Aug 16, 2018

inoryy closed this as completed Nov 25, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Is this a bug in runner.py? #14

Is this a bug in runner.py? #14

Ericonaldo commented Aug 14, 2018

Ericonaldo commented Aug 14, 2018

inoryy commented Aug 14, 2018 •

edited

Loading

Ericonaldo commented Aug 15, 2018

inoryy commented Aug 15, 2018

Ericonaldo commented Aug 16, 2018

inoryy commented Aug 16, 2018 •

edited

Loading

Ericonaldo commented Aug 16, 2018

inoryy commented Aug 16, 2018

inoryy commented Nov 25, 2018

Is this a bug in runner.py? #14

Is this a bug in runner.py? #14

Comments

Ericonaldo commented Aug 14, 2018

Ericonaldo commented Aug 14, 2018

inoryy commented Aug 14, 2018 • edited Loading

Ericonaldo commented Aug 15, 2018

inoryy commented Aug 15, 2018

Ericonaldo commented Aug 16, 2018

inoryy commented Aug 16, 2018 • edited Loading

Ericonaldo commented Aug 16, 2018

inoryy commented Aug 16, 2018

inoryy commented Nov 25, 2018

inoryy commented Aug 14, 2018 •

edited

Loading

inoryy commented Aug 16, 2018 •

edited

Loading