ai

Hand-crafted AI algorithms made with tender loving care (and numpy)

This repo contains:

An implementation of Natural Evolution Strategies (the OpenAI variant where sigma is fixed, for simplicity).
An implementation of Covariance-Matrix Adaptation (CMA-ES), along with an adapter for pycma.
A few pretrained networks in ./nets.
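
For readers unfamiliar with the fixed-sigma NES variant mentioned above, here is a minimal numpy sketch of a single update step. It is not the code in this repo: the names nes_step and sphere, the reward standardization, and the values for pop_size, sigma and lr are all assumptions picked for illustration on a toy objective.

```python
import numpy as np

def nes_step(theta, fitness_fn, rng, pop_size=50, sigma=0.1, lr=0.02):
    """One update of a fixed-sigma NES sketch (maximizes fitness_fn)."""
    # Sample Gaussian perturbations, one row per population member.
    noise = rng.standard_normal((pop_size, theta.size))
    # Evaluate every perturbed parameter vector.
    rewards = np.array([fitness_fn(theta + sigma * eps) for eps in noise])
    # Standardize rewards so the step does not depend on the reward scale
    # (an assumption of this sketch, not necessarily what the repo does).
    advantages = (rewards - rewards.mean()) / (rewards.std() + 1e-8)
    # Monte Carlo estimate of the gradient of expected fitness w.r.t. theta.
    grad = noise.T @ advantages / (pop_size * sigma)
    return theta + lr * grad

def sphere(x):
    # Hypothetical toy objective with its maximum (0) at x = 3.
    return -np.sum((x - 3.0) ** 2)

rng = np.random.default_rng(0)
theta = np.zeros(5)
start = sphere(theta)
for _ in range(300):
    theta = nes_step(theta, sphere, rng)
final = sphere(theta)
```

Because sigma never changes, the search keeps perturbing with the same spread even close to the optimum, so the final parameters hover around the maximum rather than settling exactly on it.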

Lunar Lander

After far too much training with a low sigma, the NES agent was able to mostly solve Lunar Lander.

It sometimes fails, though it usually comes close.

To test it yourself, make sure nets/LunarLanderContinuous-v2-16.pkl exists, then run

python main.py --env LunarLanderContinuous-v2 --eval

An agent was also trained using Covariance-Matrix Adaptation (the --cma option). After ~220 generations it looks like this

The resulting agent is more robust, and successfully deactivates the boosters after landing. I think this is because CMA-ES can fine-tune better by adapting sigma. I ought to try sigma adaptation for my NES agent too.

See the CMA-ES agent with

python main.py --env LunarLanderContinuous-v2 --eval --save LunarLanderContinuous-v2-16-CMA.pkl
