Agent reset() not called before starting new episodes in SingleThreadedWorker #51

dwang9302 · 2019-02-22T00:31:21Z

Coming from tensorforce (where agent.reset() is called in episode loop), and by reading the doc on agents/agent.py, it seems agent.reset() is supposed to be called before starting a new episode. However currently it does not seem to be called in SingleThreadedWorker nor RayWorker before new episodes, although preprocessors stack seems to have been reset explicitly.

It would be nice if you could clarify the purpose of agent.reset() and when it is supposed to be called. Would appreciate some examples..

def reset(self):
        """
        Must be implemented to define some reset behavior (before starting a new episode).
        This could include resetting the preprocessor and other Components.
        """
        pass  # optional

Refs:
https://github.com/rlgraph/rlgraph/blob/master/rlgraph/agents/agent.py
https://github.com/rlgraph/rlgraph/blob/master/rlgraph/execution/single_threaded_worker.py

The text was updated successfully, but these errors were encountered:

michaelschaarschmidt · 2019-02-22T09:43:03Z

Hey, thanks for raising this! Will take care of shortly

michaelschaarschmidt · 2019-02-22T09:49:04Z

Irrespective of the issue just a short piece of info on why the preprocessor code is even there:

The reason the preprocessors are done separately in Python is because we noticed an issue around image resizing in TensorFlow (https://hackernoon.com/how-tensorflows-tf-image-resize-stole-60-days-of-my-life-aba5eb093f35), so to do any benchmarks around images, we realised we would need to have the option to do the preprocessing with CV2 out-of-graph.

michaelschaarschmidt · 2019-02-22T16:01:34Z

Cont.:

The heart of the issue is then that agent.reset() is meant to reset the preprocessor, but because of this separation as a consequence of this critical TF bug, it is not being called currently.

In any case, the name reset() is maybe misleading because users could reasonably expect that reset() fully resets the internal state (i.e. reinitalises all variables in particular).

We should hence maybe have a reset() method to fully reset agent state, and an episode_reset() which resets preprocessor state.

dwang9302 · 2019-02-27T21:04:08Z

Since agent.reset() is not being called because of an external bug, I assume the worker(s) logic would remain the same until the TF bug is fixed? The reason I asked is that I put some logic into my modified agent and an end-of-episode agent.reset() call becomes critical in training.
Thanks for the reply and clarification!

michaelschaarschmidt · 2019-02-28T08:34:06Z

Yes since preprocessing is being handled via the python/numpy preprocessor stack implementations by default now, the tf preprocessor does not need resetting (even thought it would not hurt). So you could add a reset call if your agent needed some specific extra resetting.

michaelschaarschmidt assigned sven1977 and michaelschaarschmidt Feb 22, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Agent reset() not called before starting new episodes in SingleThreadedWorker #51

Agent reset() not called before starting new episodes in SingleThreadedWorker #51

dwang9302 commented Feb 22, 2019

michaelschaarschmidt commented Feb 22, 2019

michaelschaarschmidt commented Feb 22, 2019

michaelschaarschmidt commented Feb 22, 2019

dwang9302 commented Feb 27, 2019 •

edited

Loading

michaelschaarschmidt commented Feb 28, 2019

Agent reset() not called before starting new episodes in SingleThreadedWorker #51

Agent reset() not called before starting new episodes in SingleThreadedWorker #51

Comments

dwang9302 commented Feb 22, 2019

michaelschaarschmidt commented Feb 22, 2019

michaelschaarschmidt commented Feb 22, 2019

michaelschaarschmidt commented Feb 22, 2019

dwang9302 commented Feb 27, 2019 • edited Loading

michaelschaarschmidt commented Feb 28, 2019

dwang9302 commented Feb 27, 2019 •

edited

Loading