AlexPiche/fr-tmlr


Bridging the Gap Between Target Networks and Functional Regularization

Convergence disks

https://github.com/AlexPiche/cleanrl-tmlr/blob/master/Convergence_disks.ipynb

4 Rooms

cd toy_task
git clone https://github.com/zafarali/emdp
python3 main_toy.py --batch_size 32 --discount $discount --epsilon $epsilon --target_update_freq $freq --use_target_net 0 --seed $seed --reg_weight $reg_weight --size 11
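
The command above expects the shell variables `$discount`, `$epsilon`, `$freq`, `$seed` and `$reg_weight` to be set before it runs. As a rough sketch of how a small sweep over them could be assembled, the snippet below builds one `main_toy.py` command line per combination and prints it as a dry run. Every value in `SWEEP` is a hypothetical placeholder, not a setting taken from the paper, and the helper `build_commands` is an illustrative name that does not exist in this repository.

```python
import itertools
import shlex

# Hypothetical sweep values: the README leaves these variables unset,
# so they are placeholders and NOT the settings used in the paper.
SWEEP = {
    "discount": [0.99],
    "epsilon": [0.1],
    "target_update_freq": [100],
    "seed": [0, 1],
    "reg_weight": [0.0, 1.0],
}


def build_commands(sweep):
    """Return one main_toy.py argv list per combination of sweep values."""
    keys = list(sweep)
    commands = []
    for values in itertools.product(*(sweep[k] for k in keys)):
        # Fixed flags copied from the README command.
        argv = ["python3", "main_toy.py", "--batch_size", "32",
                "--use_target_net", "0", "--size", "11"]
        # Swept flags, one value each for this combination.
        for key, value in zip(keys, values):
            argv += [f"--{key}", str(value)]
        commands.append(argv)
    return commands


commands = build_commands(SWEEP)
for argv in commands:
    # Dry run: print the command instead of executing it.
    print(shlex.join(argv))
```

To launch the runs rather than print them, each `argv` could be passed to `subprocess.run(argv, check=True)` from inside `toy_task`, after `emdp` has been cloned there.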

Atari Results

The Atari results in the paper can be reproduced by running the following command:

python cleanrl/fr_dqn_atari_jax.py
