Replies: 2 comments
-
I'm tagging Matteo, who is our point of contact (PoC) for MARL things and the owner of the mappo_ippo script!
-
It could be that the reward for not colliding is taking over and preventing navigation success. An increasing reward is generally a good sign, provided the reward function makes sense. But I am not able to make diagnostic comments about your custom environment, sorry.
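If you want to check whether the collision term is dominating, here is a minimal sketch of logging each reward term separately. The component names ("goal", "collision") and where you hook this into your env are assumptions on my part, not part of the script:

```python
# Minimal sketch: accumulate each reward term per episode so their
# relative magnitudes can be compared after training.
from collections import defaultdict
import csv

class RewardBreakdown:
    """Tracks per-component reward totals for one episode."""

    def __init__(self):
        self.totals = defaultdict(float)

    def add(self, **components):
        # e.g. breakdown.add(goal=goal_term, collision=collision_term)
        # called wherever your env computes its reward terms
        for name, value in components.items():
            self.totals[name] += float(value)

    def dump(self, path="reward_breakdown.csv"):
        # One row per episode; if |collision| >> |goal|, the avoidance
        # term is likely drowning out the navigation objective.
        with open(path, "a", newline="") as f:
            csv.writer(f).writerow(
                [self.totals[k] for k in sorted(self.totals)]
            )
        self.totals.clear()
```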
-
I've designed a custom navigation env that has obstacles in it, and the agents don't hit each other.
When I run my mappo_ippo.py I get strange outputs. Could my model be overfitted?
My custom_env is:
https://drive.google.com/file/d/1yw1rOpJcmoU99zcz-2wEGqV_ZnfOT1qF/view?usp=sharing
My config of mappo_ippo is (also summarized as a dict below):
max_steps: 200
n_iters: 625
n_agents and n_targets: 3
backend: csv
entropy_eps: 0.0001
The remaining configs are the same.
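For reference, here are the same overrides written out as a plain Python dict. The key names mirror my list above; whether they map one-to-one onto the script's own config structure is my assumption:

```python
# My changed settings, collected in one place for reference.
config_overrides = {
    "max_steps": 200,      # episode horizon
    "n_iters": 625,        # number of training iterations
    "n_agents": 3,         # one target per agent
    "n_targets": 3,
    "backend": "csv",      # logging backend
    "entropy_eps": 1e-4,   # entropy coefficient
}
```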
When I look at my CSV and my videos, the results are surprising:
In the videos, for the first 20 epochs the agents reach the goals very easily, but after that they stop.
In my CSV, the train_mean_reward keeps increasing non-stop, but my critic loss is also increasing.
Does this mean my model is overfitted?
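In case it helps, this is roughly how I plot the two curves from the CSV logs. The column names and file path below are from my setup and may differ in yours:

```python
# Sketch for eyeballing mean reward and critic loss side by side.
import pandas as pd
import matplotlib.pyplot as plt

df = pd.read_csv("logs/train.csv")  # path to the csv backend output (mine)

fig, ax1 = plt.subplots()
ax1.plot(df.index, df["train_mean_reward"], color="tab:blue")
ax1.set_xlabel("iteration")
ax1.set_ylabel("train_mean_reward", color="tab:blue")

# Critic loss on a second y-axis: with PPO, a rising critic loss alongside
# a rising reward can simply reflect growing return magnitudes.
ax2 = ax1.twinx()
ax2.plot(df.index, df["critic_loss"], color="tab:red")
ax2.set_ylabel("critic_loss", color="tab:red")

fig.tight_layout()
plt.savefig("reward_vs_critic_loss.png")
```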
@matteobettini