Skip to content

Aladoro/Stabilizing-Off-Policy-RL

Repository files navigation

A-LIX on DMC

PyTorch implementation of Adaptive Local Signal Mixing from Stabilizing Off-Policy Deep Reinforcement Learning from Pixels. This repository can be used to reproduce DeepMind Control experiments.

For further details see our ICML 2022 paper:

Instructions

Install MuJoCo

Install dependencies with conda:

conda env create -f conda_env.yml
conda activate drqv2

Train an agent/collect results

Use Hydra configuration files (provided in the cfgs folder), specifying algo and env, representing the algorithm and environment configurations.

E.g.:

python train.py algo=ALIX task=quadruped_walk

You can monitor with tensorboard by running:

tensorboard --logdir exp_local

Extend/contact

The main classes/functions used for A-LIX are located in the analysis_* files.

For any queries/questions, feel free to raise an issue and/or get in contact with Edoardo Cetin or Philip J. Ball.

To cite our work, use:

@inproceedings{cetin2022stabilizing,
  title={Stabilizing Off-Policy Deep Reinforcement Learning from Pixels},
  author={Cetin, Edoardo and Ball, Philip J and Roberts, Stephen and Celiktutan, Oya},
  booktitle={International Conference on Machine Learning},
  pages={2784--2810},
  year={2022},
  organization={PMLR}
}

Acknowledgements

We would like to thank Denis Yarats for open-sourcing the DrQv2 codebase. Our implementation builds on top of their repository.

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Languages