Refactoring master before async merge #23

lake4790k · 2016-05-27T15:24:13Z

Before the async methods from async branch are merged master could be refactored to allow for reusing most of the code between async and ER methods. Also I believe this refactoring would improve the structure of the existing ER solution in itself. Currently single files (classes) contain functionality for multiple aspects, would be better to create objects dealing with one aspect at a time. I already started this approach in async if followed in master as well merge will be smooth and easy.

I suggest the following:

main would only contain option parsing. Currently it is also responsible for

training the ER Agent
validating the Agent
evaluating or demoing the Agent with one episode

These I would separate in classes

ERTrainer training loop
ValidationAgent validation loop, also the validation related functionality would be moved here from the ER Agent, ie. stats, report, saliency. This agent would also work with the async agents in its own thread in async mode later.
EvaluationAgent showcasing, movie making

So main would parse the options and then either create a

a single threaded ERTrainer + ValidationAgent
an EvaluationAgent
later the multithreaded async training / evaluation

I would not share the ER Agent code with async agents (1/n Q and A3C) for now. In the async agents I took the approach of having separate (subclassed) implementations of different methods to make it easier to look at one algorithm at a time. The ER Agent contains many methods, but all based on ER logic. I think this separation is fine. Much of the simplicity of async code comes from not having to deal with sampled batches.

Splitting up main is not strictly necessary the way I'm suggesting, we only gain the reuse of validation logic with it. Other parts can be still reused from async (eg. Model, CircularQueue, BinaryHeap) without doing this, but I do believe doing it would improve the ER code as well.

I would submit multiple PRs for the suggested steps, so would be easier to look at what's going on. @Kaixhin let me know if you like this or have other idea!

The text was updated successfully, but these errors were encountered:

Kaixhin · 2016-05-27T16:26:15Z

Sounds like a sensible plan to me - thanks for working this out. I agree on the PR approach - if you can make a sequence of PRs to master, each adding a bit more modularity, then it'll be easier to check/look back on the commit history.

I'll leave this up to you, but I think having a partially abstract Agent as a base for all of these would be nice for readability.

lake4790k added the enhancement label May 27, 2016

Kaixhin mentioned this issue May 28, 2016

separate qt display and video #25

Merged

lake4790k mentioned this issue May 30, 2016

Evaluation mode for async and towards unified validation #30

Merged

Kaixhin closed this as completed in #30 May 30, 2016

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Refactoring master before async merge #23

Refactoring master before async merge #23

lake4790k commented May 27, 2016

Kaixhin commented May 27, 2016

Refactoring master before async merge #23

Refactoring master before async merge #23

Comments

lake4790k commented May 27, 2016

Kaixhin commented May 27, 2016