Problem

In the rollout storage, we currently flatten tensors along some dimensions (combining the rollout-index dimension and the time dimension into one). This is awkward and means that every actor-critic model needs to remember this arbitrary ordering.
Solution
Let's stop flattening and fix existing models to expect these unflattened tensors (fixing the RNNStateEncoder should go a long way towards this).
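The shape bookkeeping above can be sketched as follows. This is an illustrative example only, not the real RolloutStorage layout; the dimension names (T = time steps, N = rollout index, F = feature size) are assumptions for the sketch.

```python
import numpy as np

# Assumed dimensions for illustration: T = time steps, N = rollout index,
# F = feature size.
T, N, F = 4, 3, 8
observations = np.random.randn(T, N, F)

# Current behavior: the time and rollout-index dims are merged before the
# tensor reaches the model, so the model must remember the (T, N) ordering.
flattened = observations.reshape(T * N, F)

# Proposed behavior: keep the tensor unflattened; the model (e.g. a fixed
# RNNStateEncoder) works directly with the [T, N, F] layout.
assert flattened.shape == (T * N, F)
assert observations.shape == (T, N, F)
# Unflattening is a pure reshape: no information was lost, only structure.
assert np.array_equal(flattened.reshape(T, N, F), observations)
```

The point of the change is that the `(T, N)` structure becomes explicit in the tensor's shape rather than an ordering convention every model must re-derive.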
Extend the Task abstraction (and also VectorSampledTasks) to:

- Accept multiple actions at once, e.g. the signature of step should change from action: int to action: Union[int, Sequence[int]].
- Return a sequence of rewards and observations (one for each agent). I don't know whether we'd rather return Sequence[RLStepResult] or instead update RLStepResult to hold sequences of values where appropriate. I have a slight preference for the second variant (it would make it easier, in the future, to return values common to all agents, e.g. a joint reward) but could be convinced otherwise.

Update the RolloutStorage class so that one dimension is dedicated to the different agents.

Make whatever other changes are needed in the light_engine to handle the above.

I'm sure I'm missing some additional places that will need to be updated.
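The second variant (a single RLStepResult whose fields hold per-agent sequences) could be sketched as below. This is a hypothetical sketch: the names mirror the issue (RLStepResult, step), but the exact fields, signature, and toy dynamics are assumptions, not the real definitions.

```python
from dataclasses import dataclass, field
from typing import Any, Dict, List, Sequence, Union


@dataclass
class RLStepResult:
    # In multi-agent tasks these hold one entry per agent; a future
    # extension could add fields common to all agents (e.g. a joint reward).
    observation: Union[Any, Sequence[Any]]
    reward: Union[float, Sequence[float]]
    done: bool
    info: Dict[str, Any] = field(default_factory=dict)


class MultiAgentTask:
    """Toy stand-in for the extended Task abstraction (illustration only)."""

    def __init__(self, num_agents: int):
        self.num_agents = num_agents

    def step(self, action: Union[int, Sequence[int]]) -> RLStepResult:
        # Accept a single action (single-agent case) or one action per agent.
        actions: List[int] = [action] if isinstance(action, int) else list(action)
        assert len(actions) == self.num_agents
        # Toy dynamics: each agent's reward is just its own action value.
        return RLStepResult(
            observation=[{"agent_id": i} for i in range(self.num_agents)],
            reward=[float(a) for a in actions],
            done=False,
        )


task = MultiAgentTask(num_agents=2)
result = task.step([1, 3])
assert result.reward == [1.0, 3.0]
assert len(result.observation) == 2
```

A single result object keeps the return type uniform across single- and multi-agent tasks, whereas Sequence[RLStepResult] would leave nowhere natural to put values shared by all agents.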
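For the storage change, dedicating a dimension to agents might look like the following. The shapes are assumptions for illustration (steps x samplers x agents x feature), not the actual RolloutStorage layout.

```python
import numpy as np

# Assumed dimensions: T = time steps, N = task samplers, A = agents,
# F = feature size.
T, N, A, F = 4, 3, 2, 8
rewards = np.zeros((T, N, A))          # one reward per step, sampler, agent
observations = np.zeros((T, N, A, F))  # likewise for observations

# Writing one step's multi-agent results stays a simple indexed assignment:
step_rewards = np.array([[1.0, 3.0]] * N)  # shape (N, A)
rewards[0] = step_rewards
assert rewards[0, 0, 1] == 3.0
assert observations.shape == (T, N, A, F)
```

With an explicit agent axis, single-agent tasks are just the A = 1 case, and no model needs to know how agents were folded into another dimension.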