
Conversation

@asolano commented Jan 17, 2018

Add preliminary support for multiple observations in the PPO module. The current implementation has a problem handling multiple observations in the trainer class.

This issue is resolved by creating a separate set of convolution layers for each observation and then merging all the streams at the first fully connected layer. The history buffer implementation is also updated to create keys for the observations dynamically.
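For context, here is a minimal sketch of that layout, written against the TF 1.x graph-mode API that ML-Agents used at the time. The function name, variable names, and layer sizes are illustrative only, not the PR's actual code or the ML-Agents API:

```python
import tensorflow as tf  # TF 1.x graph-mode API

def encode_cameras(visual_in, h_size):
    """Illustrative only: one conv stack per camera, merged at the first FC layer.

    `visual_in` is a list of [batch, height, width, channels] placeholders,
    one entry per camera observation.
    """
    streams = []
    for i, obs in enumerate(visual_in):
        with tf.variable_scope("visual_encoder_%d" % i):
            conv1 = tf.layers.conv2d(obs, 16, [8, 8], strides=[4, 4], activation=tf.nn.elu)
            conv2 = tf.layers.conv2d(conv1, 32, [4, 4], strides=[2, 2], activation=tf.nn.elu)
            flat = tf.layers.flatten(conv2)
            streams.append(tf.layers.dense(flat, h_size, activation=tf.nn.tanh))
    # Merge all camera streams along the feature axis so the shared fully
    # connected layers see a single hidden vector per batch element.
    return tf.concat(streams, axis=1)

# The dynamically keyed history buffer mentioned above could index entries in a
# similar per-camera way (key names here are hypothetical):
# history = {"observations%d" % i: [] for i in range(num_cameras)}
```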

@awjuliani (Contributor) commented:

Hi @asolano,

This is awesome! Thanks for taking the time to write these changes to fully enable multiple observations in the PPO code. Would it be possible to make a PR into our development-0.3 branch? This way we can roll it up into the other features we are adding and release it with the next version.

@awjuliani (Contributor) commented Jan 18, 2018

Hi @asolano,

The internal team had a discussion around the inclusion of this, and we've decided to merge it into master. There are incompatible changes we will be making in the next release (specifically reworking how our experience buffer works), but we will re-implement the relevant parts of code here to ensure this continues working going forward. The benefit of allowing people to use this now outweighs the small extra effort on our part.

Very excited to see what kinds of multi-camera agent scenarios you and others come up with in the future!

@asolano (Author) commented Jan 18, 2018

Hi @awjuliani ,

That's amazing, thank you very much! We have just started using it ourselves too, looking forward to what the community does with it 👍

Review thread on this hunk from the PPO model changes:

```python
height_size, width_size = brain.camera_resolutions[i]['height'], brain.camera_resolutions[i]['width']
bw = brain.camera_resolutions[i]['blackAndWhite']
encoders.append(self.create_visual_encoder(height_size, width_size, bw, h_size, 2, tf.nn.tanh, num_layers))
hidden_visual = [tf.concat(encoders, axis=1)]
```
Contributor commented:

I am not sure this will work in the case of continuous state: `encoders` is a list of lists of tensors, effectively [num_streams, num_observations, h_size], so this tf.concat will not work under these conditions.

@asolano (Author) replied:

Good catch! We were focusing on discrete control, so we missed it 😅

The continuous control case can be fixed by changing the axis argument from 1 to 2 and not wrapping the result in a list. Please refer to this commit. We added a couple of cameras to the 3DBall environment and it seems to be working.
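To make the shape issue concrete, here is a small hypothetical example (not the actual commit): with two policy/value streams and two cameras, `encoders` holds one [batch, h_size] tensor per stream per camera, and concatenating within each stream along the feature axis yields one merged hidden vector per stream:

```python
import tensorflow as tf  # TF 1.x

batch, h_size = 32, 64
num_streams, num_cameras = 2, 2

# Stand-in encoder outputs: encoders[stream][camera] has shape [batch, h_size].
encoders = [[tf.zeros([batch, h_size]) for _ in range(num_cameras)]
            for _ in range(num_streams)]

# Passing the nested `encoders` list to a single tf.concat does not produce the
# intended per-stream hidden vectors. Concatenating within each stream along
# the feature axis gives one [batch, num_cameras * h_size] tensor per stream.
hidden_visual = [tf.concat(stream, axis=1) for stream in encoders]
print([h.get_shape().as_list() for h in hidden_visual])  # [[32, 128], [32, 128]]
```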

@awjuliani (Contributor) commented:

Thanks @asolano!

@awjuliani merged commit a1d35bf into Unity-Technologies:master Jan 19, 2018
@asolano deleted the dev-multiple-observations branch January 19, 2018 01:32
@github-actions bot locked as resolved and limited conversation to collaborators May 20, 2021