[question] Why are RL CNNs so shallow? #367

AlanKuurstra · 2019-06-11T12:58:11Z

It seems that RL CNNs are much more shallow than the ones used on ImageNet? Am I right about this? And why would that be the case?

araffin · 2019-06-11T14:39:12Z

Hello,

> are much more shallow than the ones used on ImageNet? Am I right about this? And why would that be the case?

That's a good question, and you are right in most cases.
I think the simple answer is that shallow networks are already expressive enough to solve the tasks.
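
To make "shallow" concrete, here is a rough sketch that computes the layer sizes of the three-layer "Nature CNN" from the DQN paper (Mnih et al., 2015), which is commonly used as the default image feature extractor in RL libraries. The 84x84x4 input, the 512-unit fully connected layer, and the helper names `conv_out` and `describe_nature_cnn` are assumptions made for illustration, not code taken from this repository.

```python
# Sketch: layer sizes of the three-layer "Nature CNN" (Mnih et al., 2015).
# Assumed input: 84x84 frames with 4 stacked channels, no padding.

def conv_out(size, kernel, stride):
    """Spatial output size of an unpadded ('valid') convolution."""
    return (size - kernel) // stride + 1

# (out_channels, kernel, stride) for each conv layer
NATURE_CNN = [(32, 8, 4), (64, 4, 2), (64, 3, 1)]

def describe_nature_cnn(size=84, in_channels=4, hidden=512):
    """Return (final spatial size, flattened features, parameter count)."""
    total_params = 0
    channels = in_channels
    for out_channels, kernel, stride in NATURE_CNN:
        size = conv_out(size, kernel, stride)
        # weights + biases of this conv layer
        total_params += kernel * kernel * channels * out_channels + out_channels
        channels = out_channels
    flat = size * size * channels
    # fully connected layer on top of the flattened feature map
    total_params += flat * hidden + hidden
    return size, flat, total_params

size, flat, total_params = describe_nature_cnn()
print(size, flat, total_params)  # 7 3136 1684128
```

Under these assumptions the whole extractor has about 1.7M parameters, most of them in the fully connected layer, compared with roughly 25M for ResNet-50.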

To my knowledge, the most complex (and successful) CNN policy architecture is the one from IMPALA, which uses some residual connections.
The way RL works also makes it tricky to use batch norm, which is usually what allows deeper networks to be trained.
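
For a sense of scale, the sketch below counts the convolutional layers in the deeper IMPALA network (Espeholt et al., 2018) as I understand its layout: three stages, each made of one 3x3 convolution, a stride-2 max-pool, and two residual blocks with two 3x3 convolutions each. The 64x64 input size and the function name `impala_summary` are assumptions for illustration only.

```python
import math

# Assumed layout of the deeper IMPALA network: three stages with these
# channel widths, each stage = conv -> max-pool (stride 2) -> 2 residual blocks.
STAGE_CHANNELS = [16, 32, 32]
RES_BLOCKS_PER_STAGE = 2
CONVS_PER_RES_BLOCK = 2

def impala_summary(size=64):
    """Return (number of conv layers, final spatial size, flattened features)."""
    conv_layers = 0
    channels = 0
    for channels in STAGE_CHANNELS:
        conv_layers += 1                    # the stage's first 3x3 conv
        size = math.ceil(size / 2)          # 'same'-padded max-pool, stride 2
        conv_layers += RES_BLOCKS_PER_STAGE * CONVS_PER_RES_BLOCK
    return conv_layers, size, size * size * channels

conv_layers, final_size, flat_features = impala_summary()
print(conv_layers, final_size, flat_features)  # 15 8 2048
```

Even this "deep" RL network has only 15 convolutional layers, far fewer than the 50 to 150 layers of the ResNets typically trained on ImageNet.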

Also, many RL problems do not use images as input (e.g. MuJoCo/PyBullet envs, where the input is the joint angles); in that case there is no need for a more complex architecture.
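
As a rough illustration of how small a policy network for vector observations can be, the sketch below counts the parameters of a plain MLP. The 17-dimensional observation, the 6-dimensional action output, and the two 64-unit hidden layers are illustrative assumptions, and `mlp_param_count` is a hypothetical helper.

```python
def mlp_param_count(obs_dim, hidden_sizes, out_dim):
    """Count weights and biases of a fully connected network."""
    sizes = [obs_dim, *hidden_sizes, out_dim]
    return sum(n_in * n_out + n_out for n_in, n_out in zip(sizes, sizes[1:]))

# Assumed sizes: 17 joint-state inputs, two hidden layers of 64 units, 6 outputs.
params = mlp_param_count(obs_dim=17, hidden_sizes=[64, 64], out_dim=6)
print(params)  # 5702
```

A few thousand parameters are enough here because the observation is already a compact description of the state, so there is nothing for a convolutional stack to extract.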

Finally, you can always try a deeper network, but in my experience this rarely results in better performance.

SmileLab-technion · 2019-09-16T12:51:28Z

Hello,

As I see it:
In image recognition, the algorithm needs to recognize the image label. This is done by projecting the image into a latent space where the pictures are separable. In RL, the image just represents the state, which is why only a few features of the picture are needed. Your confusion comes from your view of how humans make choices, which is not the same as how RL works (see this video).

araffin added the question label Jun 11, 2019

araffin mentioned this issue Jun 14, 2019

InvalidArgumentError: You must feed a value for placeholder tensor 'model/batch_normalization_1/keras_learning_phase' with dtype bool error in Custom Policy #373

Closed

This was referenced Mar 29, 2020

[Question] Why only shallow net arch is used in RL? openai/spinningup#225

Open

GPU vs CPU Performance araffin/rl-baselines-zoo#72

Closed
