
Why does training end at epoch 50? #4

Closed
yuxudong20 opened this issue Sep 12, 2022 · 4 comments

Comments

@yuxudong20

Hello, I have tried to reproduce ATAC's results from the paper. However, when I run the official code, the experiment automatically ends at epoch 50. I cannot find where the problem is. Could you give me some help?
For example, I have run 'python scripts/main.py -e hopper-medium-expert-v2 --gpu_id 0 --seed 15'. Are there any other hyperparameters that need to be given?
@chinganc

@chinganc
Collaborator

Hi @yuxudong20, there was a PyTorch bug introduced in our refactoring that caused errors when using a GPU. It's fixed now. Could you please give it a try? Thanks.

@yuxudong20
Author

Hi @chinganc, thanks a lot. It seems that the current code can run for 3000 epochs. However, the learning curve suddenly drops, as in the following figure. Is this correct?
[figure: learning curve with a sudden drop]

@yuxudong20
Author

Zooming in shows that the drop happens at epoch 50.
[figure: zoomed-in learning curve around epoch 50]

@chinganc
Collaborator

This is normal: by default, the first 50 epochs are a warmstart phase that trains the policy with behavior cloning (BC) only, and ATAC starts after that. The drop you see there is the first epoch of ATAC.
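
For illustration, the schedule described above can be pictured as a minimal sketch like the following (the names `train`, `bc_update`, and `atac_update` are hypothetical and not the repo's actual API; only the 50-epoch BC warmstart followed by ATAC updates reflects the maintainer's explanation):

```python
# Minimal sketch of the default training schedule (hypothetical names, not
# the actual ATAC code): the first n_warmstart_epochs epochs train the
# policy with behavior cloning only; ATAC updates start afterward.

def train(agent, dataset, n_epochs=3000, n_warmstart_epochs=50):
    for epoch in range(n_epochs):
        for batch in dataset:
            if epoch < n_warmstart_epochs:
                # Warmstart phase: plain behavior cloning of the policy
                # on the offline data.
                agent.bc_update(batch)
            else:
                # After the warmstart, the objective switches to the ATAC
                # actor-critic update, so a drop in the learning curve at
                # this boundary (epoch 50 here) is expected.
                agent.atac_update(batch)
```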
