We aim to achieve the state-of-the art results on [Montezuma's Revenge](https://github.com/RL-ninja/beating-montezuma/wiki/Montezuma's-Revenge
See Todos & Ideas
- game score:
- number of rooms/level explored:
- 20+ rooms
- beat level 1
See Resources
If you use Anaconda, you can try conda env create -f environment.yml
.
Requirements
- Python 3.4+
- TensorFlow 1.0+ (choose a GPU version, if you have GPU)
- Arcade-Learning-Environment
- cython (pip3 package)
- scikit-image (pip3 package)
- python3-tk
- opencv (opencv-python)
To train an agent to play, for example, pong run
python3 train.py -g <game-name> -df logs/<game-name>/ -algo paac_cts
- Open a new terminal
- Attach to the running docker container with
docker exec -it CONTAINER_NAME bash
- Run
tensorboard --logdir=<absolute-path>/paac/logs/tf
. - In your browser navigate to localhost:6006/
If running locally, skip step 2.
To test the performance of a trained agent run python3 test.py -f logs/ -tc 5
Output:
Performed 5 tests for seaquest.
Mean: 1704.00
Min: 1680.00
Max: 1720.00
Std: 14.97
python3 test.py -f logs/<game-name>/ -gn breakout
This may take a few minutes.