shape error when learning DQN agent #88

tdr1991 · 2018-11-21T09:38:37Z

Hi everyone,

I am trying to run the DQN agent on a custom environment, I get this error:

File "/Users/tdr/miniconda3/envs/py3/lib/python3.6/site-packages/stable_baselines/deepq/dqn.py", line 227, in learn

tensorflow.python.framework.errors_impl.InvalidArgumentError: Tensor must be 4-D with last dim 1, 3, or 4, not [32,5,14]

but learning A2C agent and PPO2 agent, it will work.

araffin · 2018-11-21T09:40:08Z

Hello,
Please use the issue template. Otherwise, it will be hard for me to help you.

EDIT: It seems that the problem may come from your custom env and the way you defined action_space and observation_space

tdr1991 · 2018-11-21T09:51:42Z

@araffin Hi
I am trying to run the DQN agent on a custom environment, I get this error:

File "/Users/tdr/miniconda3/envs/py3/lib/python3.6/site-packages/stable_baselines/deepq/dqn.py", line 227, in learn

tensorflow.python.framework.errors_impl.InvalidArgumentError: Tensor must be 4-D with last dim 1, 3, or 4, not [32,5,14]

Some training codes:
from stable_baselines.common.vec_env import DummyVecEnv
from stable_baselines.deepq.policies import MlpPolicy
env = DummyVecEnv([lambda: ScavengerDayEnv(datasource="/reinforcement_learning/day_train/")])

alg = DQN
model = alg(MlpPolicy, env, verbose=1)

Some custom environment definition:
self.action_space = spaces.Discrete( 21 )
self.observation_space= spaces.Box(self.src.min_values, self.src.max_values, np.shape(self.src.time_serie[0]))

araffin · 2018-11-21T10:06:23Z

what is the shape of self.src.time_serie[0] ?

Which stable-baselines version are you using? TF version ?

Can you provide minimal code to reproduce the error ?

tdr1991 · 2018-11-21T10:17:59Z

@araffin

the shape of self.src.time_serie[0] is [5, 14].
stable-baselines version : 2.2.1.
TF version : 1.10.0.

araffin · 2018-11-21T10:20:40Z

Is there something that prevent you from flattening the ndarray to a 1D array?
If you do that, it will work. (so the shape will be (70,) instead of (5, 14))

tdr1991 · 2018-11-21T10:30:19Z

but when I use openAI baselines, it will work.

tdr1991 · 2018-11-22T08:46:10Z

I have located error：

File "/Users/tdr/miniconda3/envs/py3/lib/python3.6/site-packages/stable_baselines/deepq/build_graph.py", line 436, in build_train
tf.summary.image('observation', obs_phs[0])

the shape of obs_phs[0] is [32,5,14], but image function need 4-D with last dim 1, 3, or 4.

when I commet this code, it will work.

araffin · 2018-11-22T08:53:24Z

I see, but in your code, you have passed a tensorboard log dir, right?
I will add a check to avoid that error.

tdr1991 · 2018-11-22T09:07:21Z

I don't know whether I have passed a tensorboard log dir, I can save and load model.

araffin · 2018-11-22T09:36:37Z

@tdr1991 I just pushed the "patch-dqn" branch, can you confirm it solves your problem?

tdr1991 · 2018-11-22T09:47:56Z

@araffin It will work, thank you.

tdr1991 · 2018-11-22T10:32:58Z

When I learning ACER agent, there is similar shape error:

File "/Users/tdr/miniconda3/envs/py3/lib/python3.6/site-packages/stable_baselines/acer/acer_simple.py", line 569, in init
obs_height, obs_width, obs_num_channels = env.observation_space.shape
ValueError: not enough values to unpack (expected 3, got 2)

env.observation_space.shape=(5,14), so I set observation_space.shape=(5,14,1)

obs_shape = np.shape(self.src.time_serie[0])
self.observation_space= spaces.Box(self.src.min_values, self.src.max_values, (obs_shape[0], obs_shape[1],1))
print(self.observation_space.shape) #(5,14,1)

but it is still error.

araffin · 2018-11-22T12:06:16Z

I think this comes from the ACER buffer, which is a different issue... If you want to use ACER, you need to flatten your observation space to a 1D array. (Otherwise, that would mean refactoring the all ACER buffer, a thing that I don't have time for).

tdr1991 · 2018-11-22T13:57:09Z

If I flatten my observation space to a 1D array, will it affect other agents? PS: Until now, these agents(A2C, ACKTR, PPO1, PPO2 , TRPO, DQN) can work.

araffin · 2018-11-22T14:06:10Z

It should work because it was made with feature vectors in mind.

tdr1991 · 2018-11-22T14:38:47Z

Ok, thank you, I'll try tomorrow.

nachovoss · 2019-06-30T00:38:52Z

hey guys i'm getting this error running ppo2
any help would be appreciated

n_images, height, width, n_channels = img_nhwc.shape
ValueError: not enough values to unpack (expected 4, got 1)

araffin added the custom gym env Issue related to Custom Gym Env label Nov 21, 2018

araffin mentioned this issue Nov 22, 2018

Patch Tensorboard Logging #92

Merged

araffin added the bug Something isn't working label Nov 22, 2018

araffin closed this as completed in #92 Nov 25, 2018

araffin mentioned this issue Mar 27, 2019

ValueError: could not broadcast input array from shape (2) into shape (7,3,5) #242

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

shape error when learning DQN agent #88

shape error when learning DQN agent #88

tdr1991 commented Nov 21, 2018

araffin commented Nov 21, 2018 •

edited

tdr1991 commented Nov 21, 2018

araffin commented Nov 21, 2018 •

edited

tdr1991 commented Nov 21, 2018

araffin commented Nov 21, 2018

tdr1991 commented Nov 21, 2018

tdr1991 commented Nov 22, 2018

araffin commented Nov 22, 2018

tdr1991 commented Nov 22, 2018

araffin commented Nov 22, 2018

tdr1991 commented Nov 22, 2018

tdr1991 commented Nov 22, 2018

araffin commented Nov 22, 2018

tdr1991 commented Nov 22, 2018

araffin commented Nov 22, 2018

tdr1991 commented Nov 22, 2018

nachovoss commented Jun 30, 2019

shape error when learning DQN agent #88

shape error when learning DQN agent #88

Comments

tdr1991 commented Nov 21, 2018

araffin commented Nov 21, 2018 • edited

tdr1991 commented Nov 21, 2018

araffin commented Nov 21, 2018 • edited

tdr1991 commented Nov 21, 2018

araffin commented Nov 21, 2018

tdr1991 commented Nov 21, 2018

tdr1991 commented Nov 22, 2018

araffin commented Nov 22, 2018

tdr1991 commented Nov 22, 2018

araffin commented Nov 22, 2018

tdr1991 commented Nov 22, 2018

tdr1991 commented Nov 22, 2018

araffin commented Nov 22, 2018

tdr1991 commented Nov 22, 2018

araffin commented Nov 22, 2018

tdr1991 commented Nov 22, 2018

nachovoss commented Jun 30, 2019

araffin commented Nov 21, 2018 •

edited

araffin commented Nov 21, 2018 •

edited