julianw/learn_ml

Implemented Algorithms

  • Table-lookup Q-values on FrozenLakeNotSlippery (notebook)
  • Neural network as Q-function approximator on CartPole (notebook)
  • Cross-Entropy Method on Pendulum (notebook)
  • REINFORCE (Williams, 1992) policy gradient on CartPole (notebook)
  • A2C on CartPole (notebook)
  • DDPG on Pendulum (notebook)
  • PPO on Pendulum (notebook)
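
As a rough illustration of the first item, here is a minimal sketch of a table-lookup Q-value update on a deterministic (non-slippery) frozen lake. It is not the notebook's code. It assumes the standard 4x4 map, re-implements the environment in plain Python instead of using Gym, and applies the Q-learning update to every (state, action) pair in sweeps rather than to sampled episodes, so the result is deterministic. All names (`GRID`, `step`, `train`, `greedy_path`) are hypothetical.

```python
# Assumed layout: S = start, F = frozen, H = hole, G = goal.
GRID = ["SFFF", "FHFH", "FFFH", "HFFG"]
N = 4
# (row delta, col delta) for left, down, right, up.
ACTIONS = [(0, -1), (1, 0), (0, 1), (-1, 0)]


def tile(state):
    row, col = divmod(state, N)
    return GRID[row][col]


def step(state, action):
    """Deterministic transition: moving off the grid leaves the agent in place."""
    row, col = divmod(state, N)
    d_row, d_col = ACTIONS[action]
    row = min(max(row + d_row, 0), N - 1)
    col = min(max(col + d_col, 0), N - 1)
    next_state = row * N + col
    reward = 1.0 if tile(next_state) == "G" else 0.0
    done = tile(next_state) in "HG"
    return next_state, reward, done


def train(sweeps=20, alpha=1.0, gamma=0.9):
    """Fill the Q-table by sweeping over all non-terminal (state, action) pairs."""
    q = [[0.0] * len(ACTIONS) for _ in range(N * N)]
    for _ in range(sweeps):
        for state in range(N * N):
            if tile(state) in "HG":
                continue  # terminal states keep Q = 0
            for action in range(len(ACTIONS)):
                next_state, reward, done = step(state, action)
                target = reward if done else reward + gamma * max(q[next_state])
                # Q-learning update; alpha = 1.0 is fine for a deterministic world.
                q[state][action] += alpha * (target - q[state][action])
    return q


def greedy_path(q, start=0, max_steps=50):
    """Follow the highest-valued action from the table until the episode ends."""
    path = [start]
    state = start
    for _ in range(max_steps):
        action = max(range(len(ACTIONS)), key=lambda a: q[state][a])
        state, _, done = step(state, action)
        path.append(state)
        if done:
            break
    return path


if __name__ == "__main__":
    q_table = train()
    print(greedy_path(q_table))
```

Sweeping over every pair sidesteps exploration entirely. A version closer to the usual setup would sample transitions with an epsilon-greedy policy and use a smaller `alpha`.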

About

learn machine learning
