Skip to content

Latest commit

 

History

History
37 lines (22 loc) · 1.54 KB

README.md

File metadata and controls

37 lines (22 loc) · 1.54 KB

ai

Hand crafted AI algorithms made with tender loving care (and numpy)

This repo contains

Lunar Lander

After way too much training NES with a low sigma it was able to mostly solve Lunar Lander

winning lunar lander

Sometimes it fails, though it usually comes close

failing lunar lander

To test it yourself make sure nets/LunarLanderContinuous-v2-16.pkl exists then run

python main.py --env LunarLanderContinuous-v2 --eval

An agent was also trained using Covariance-Matrix Adaptation (the --cma option). After ~220 generations it looks like this

cma lunar lander

The resulting agent is more robust, and successfully deactivates the boosters after landing. I think this is because CMA-ES can fine-tune better by adapting sigma, I ought to try sigma-adaptation for my NES agent too.

See the CMA-ES agent with

python main.py --env LunarLanderContinuous-v2 --eval --save LunarLanderContinuous-v2-16-CMA.pkl