SRLF helps to set up and train agent using one of these five algorithms:
- Evolution strategies (paper)
- Deep Deterministic Policy Gradient (paper)
- TRPO (paper)
- Rainbow (paper)
- A3C (paper)
I didn't make very user-friendly interface, but if you know how the algorigthm you want to use works, you can do it more or less easily.
To use, follow these steps:
- Choose algorithm you want to use
- Be sure you know how it works
- Study this implementation and how hyperparameters are used
- Use prepare_config.py as an example to make your config. Also check helpers/utils.py to know how to set some of non-obvious hyperparameters (like trainer)
- Launch
python3 prepare_config.py
to make config-file - Launch
python3 run_experiment.py
to run experiment (you should also check how run_experiment.py works) - Enjoy!
I also hope that framework can be helpful for people who want to study how to implement some of 5 algorithms that can be found here.