Applying sevel reinforcement learning algorithms (Q-Learning (with NN as Q-values approximator), modified reward Q-Learning, Double Deep Q-Learning, Dueling Deep Q-Learning) to OpenAI gym's Acrobot environment and compare their performance.
-
Notifications
You must be signed in to change notification settings - Fork 0
Applying sevel reinforcement learning algorithms (Q-Learning (with NN as Q-values approximator), modified reward Q-Learning, Double Deep Q-Learning, Dueling Deep Q-Learning) to OpenAI gym's Acrobot environment and compare their performance.
PongC/AcrobotRL
Folders and files
| Name | Name | Last commit message | Last commit date | |
|---|---|---|---|---|
Repository files navigation
About
Applying sevel reinforcement learning algorithms (Q-Learning (with NN as Q-values approximator), modified reward Q-Learning, Double Deep Q-Learning, Dueling Deep Q-Learning) to OpenAI gym's Acrobot environment and compare their performance.
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published