Skip to content

Latest commit

 

History

History
34 lines (21 loc) · 1.24 KB

first_agent.rst

File metadata and controls

34 lines (21 loc) · 1.24 KB

Steps: [ install <install> | jax <prereq_jax> | haiku <prereq_haiku> | q-learning | dqn <second_agent> | ppo <third_agent> | next_steps <next_steps> ]

Q-Learning on FrozenLake

In this first reinforcement learning example we'll solve a simple grid world environment.

Non-Slippery FrozenLake solved

Our agent starts at the top left cell, labeled S. The goal of our agent is to find its way to the bottom right cell, labeled G. The cells labeled H are holes, which the agent must learn to avoid.

In this example, we'll implement a simple value-based agent, which we update using the q-learning </examples/stubs/qlearning> algorithm.

To run this, either hit the Google Colab button or download and run the script on your local machine.


qlearning.py </examples/frozen_lake/qlearning.py>

Open in Google Colab

/examples/frozen_lake/qlearning.py