Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Simulation results in GA3C-CADRL #21

Open
Yuanzizizi opened this issue Oct 9, 2022 · 2 comments
Open

Simulation results in GA3C-CADRL #21

Yuanzizizi opened this issue Oct 9, 2022 · 2 comments

Comments

@Yuanzizizi
Copy link

Hi,

I find the results in your paper (GA3C-CADRL) are pretty good. And it surprises me that the results in Fig.6 (b)(c) is so perfect.
image

It seems that these agents have to move very carefully since they are very close to each other.
image

I try to replicate this result using the checkpoint file you provided, but only got some results like this ....
image

I wonder if it's because I need to retrain for this scene, or if it's because the car size or whatever it is isn't set up properly.

I really need your help. Thanks ~

@mfe7
Copy link
Collaborator

mfe7 commented Oct 9, 2022

There should be code in the gym repo to replicate these figures. In particular the small test suite script should contain the right scenarios, but you might need to set the number of agents since there are some different test scenarios for each number of agents. One thing that I notice in your plots is that the agents are not taking the same actions, which suggests they have different states/observations. In our evaluations in that paper, the agents start with an orientation aligned toward the goal. In the circle case, this means all agents should have the same observations at each time step.

@Yuanzizizi
Copy link
Author

We generate these 10 agents evenly across the circle, and they were all oriented toward their own goal. The initial positions were calculated, so the positions could be slightly different in their value, which will lead to slightly different decision-making outcomes.

The difference in decision making will further widen the difference in states, which ultimately leads to what shown in my plots.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants