Performance on D4RL AntMaze tasks? #1

yuxudong20 · 2022-09-05T02:37:24Z

Hello, I have a question about the performance of SAC-N or EDAC on AntMaze tasks. Have you ever tested it?

In my experiments based on this official implementation, I found that average returns in evaluation are always 0, which is worse than behavior cloning. Then I try to run with a modified reward (r=4*(r-0.5)) and max Q backup. However, they didn't help.

I'll appreciate it a lot if you give me some related advices. Thanks a lot.

yuxudong20 · 2022-09-05T08:40:51Z

@drillermoon @dssrgu

dssrgu · 2022-09-05T12:16:45Z

Hi,

As we have not done any experiments on Antmaze, it is hard to tell what would specifically be the problem.
However, I recommend checking if quantitative statistics such as Q-values have reasonable values while tuning the hyperparameters.

Thank you.

gaisibo · 2022-11-17T01:43:02Z

Hello,
Do you have got reasonable results in AntMaze tasks? I have the same question.
Thanks a lot.
@yuxudong20

symoon11 closed this as completed Jan 19, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Performance on D4RL AntMaze tasks? #1

Performance on D4RL AntMaze tasks? #1

yuxudong20 commented Sep 5, 2022

yuxudong20 commented Sep 5, 2022

dssrgu commented Sep 5, 2022

gaisibo commented Nov 17, 2022

Performance on D4RL AntMaze tasks? #1

Performance on D4RL AntMaze tasks? #1

Comments

yuxudong20 commented Sep 5, 2022

yuxudong20 commented Sep 5, 2022

dssrgu commented Sep 5, 2022

gaisibo commented Nov 17, 2022