[QUESTION] multidimensional states and actions #391
Comments
@bzeni1 Hi, could you share a minimal example that reproduces your issue? It sounds like your code is simply incorrect. By the way, when you instantiate algorithms, you need to do it as follows:
|
@takuseno Hi, please find my code below. What could be the problem? Thank you in advance for your help.
|
Thanks for sharing your code. It looks like
|
Thanks for your advice. After removing next_states, I am encountering a new issue:
Although I already defined the segments with the 'done' flags, I still don't know how to determine the episodes. What do you think? |
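For readers following along, here is a minimal sketch of how episode boundaries can be derived from per-step 'done' flags. The helper name `split_episodes` is hypothetical and this is not d3rlpy's implementation; it only illustrates the idea that each flag equal to 1 closes the current episode. Treating leftover steps after the last flag as one unfinished episode is an assumption made for the example.

```python
def split_episodes(terminals):
    """Return (start, end) index pairs, end exclusive, one pair per episode.

    Hypothetical helper for illustration only, not part of d3rlpy.
    A truthy flag at step i means step i is the last step of its episode.
    """
    episodes = []
    start = 0
    for i, done in enumerate(terminals):
        if done:
            episodes.append((start, i + 1))
            start = i + 1
    # Assumption: steps after the last flag form one unfinished episode.
    if start < len(terminals):
        episodes.append((start, len(terminals)))
    return episodes


# Five steps with 'done' set on the third and fifth step.
boundaries = split_episodes([0, 0, 1, 0, 1])
print(boundaries)
```

If a flag array contains no 1 at all, this sketch returns a single episode covering every step, which is a quick way to check whether the flags in a dataset mark any boundaries.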
My guess is that |
Hi, I think I am running into a similar issue. I have two datasets. For both of them, all the dimensions are the same, but with one dataset the fit function for IQL fails, although I am getting a different error. I can see that both datasets have some terminals = 1. |
@rohanblueboybaijal Sorry for the late response. Could you share a minimal example that reproduces your error? |
Let me close this issue since the initial question should be resolved. Feel free to open a new issue to follow up. |
When I try to create an MDPDataset in d3rlpy with data shaped, for example, (100, 5) for observations, (100, 5) for actions, (100,) for rewards, (100, 5) for next observations, and (100,) for terminals, I get the error "ValueError: operands could not be broadcast together with shapes (500,) (100,)." All of these arrays are valid and have consistent lengths, so the error appears to come from inside d3rlpy during dataset creation. It looks as if multidimensional data is being interpreted incorrectly, which could be a bug in how the library handles input shapes for MDPDataset.
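The shapes in that message can be reproduced with NumPy alone. The sketch below assumes, purely for illustration, that a (100, 5) array ends up in a position where a (100,) flag array is expected and that both are flattened before being combined elementwise. The helper `combine_flags` is hypothetical and is not taken from d3rlpy's source.

```python
import numpy as np

n_steps, n_features = 100, 5
next_observations = np.zeros((n_steps, n_features), dtype=np.float32)
terminals = np.zeros(n_steps, dtype=np.float32)
terminals[-1] = 1.0  # mark the final step as the end of an episode


def combine_flags(a, b):
    """Hypothetical helper: flatten two flag arrays and OR them elementwise."""
    return np.logical_or(np.reshape(a, -1), np.reshape(b, -1))


try:
    # A (100, 5) array flattens to (500,), which cannot pair with (100,).
    combine_flags(next_observations, terminals)
    error_message = None
except ValueError as exc:
    error_message = str(exc)

print(error_message)
```

Under this assumption, the 500 in the message is simply 100 × 5, so the error would point at an argument landing in the wrong slot rather than at the multidimensional observations themselves.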
When I select only one feature, I am able to create the MDPDataset with shapes like (100, 1). However, I encounter another error later in the code when I try to use, for example, the DDPG model.
The error message states that the DDPG model requires 'config' and 'device' arguments, but based on the documentation, DDPG() does not have these arguments. When I try to use the arguments mentioned in the documentation, I get an 'unexpected keyword argument' error.
Do you think this could be a problem with the library? I have already tried several Python environments and got the same errors.
Library used is d3rlpy-2.4.0.