-
Notifications
You must be signed in to change notification settings - Fork 31
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
environement.step() reset action #39
Comments
What is the problem here? The reset of the Action into do-nothing ? Maybe the simulator changed the action because:
As for the difference with np array, this is because you cannot change a np array by reference in python, but an object passed as reference can be modified within a function. Do you mean in both case the action object of the agent should be replaced in-place or not replaced at all ? |
as far as I know the action applied is legal : for (GreedySearch, and an expert agent) I think the object shouldn't be modified. |
Okay I'll look into it in a couple of hours and make it such that Action
type objects are not modified with environment.step
…On Mon, May 6, 2019 at 4:10 PM Luca Veyrin-Forrer ***@***.***> wrote:
as far as I know the action applied is legal : for (GreedySearch, and an
expert agent)
happens at each timestep (even the first one (no cooldown))
I think the object shouldn't be modified.
—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub
<#39 (comment)>,
or mute the thread
<https://github.com/notifications/unsubscribe-auth/ADMF4JP2TGM6H2IZWYM55ZDPUA34BANCNFSM4HLAGSJA>
.
|
This should be fixed in 536100b, can you confirm ? |
from runner.log : lines 81-84
action = self.agent.act(observation)
observation, reward_aslist, done, info = self.environment.step(action, do_sum=False)
when action is object Action (and not array)
after environement.step(..)
the action is reset to the Do_Nothing action
Note that the action is applied.
but it is a bit inconsistent
and when,t action is an np.array this doesn't happen.
The text was updated successfully, but these errors were encountered: