QWOP is one of the hardest games in the world. You can try it at http://www.foddy.net/Athletics.html. Our objectives are to:
- train a model to complete the 100 m in a reasonable time
- improve the ragdoll's speed to a competitive level
- stabilize the agent's running pattern
- set the best record we can
- Run:
pip install -r requirements.txt
- Make sure you have chromedriver installed. If not, download it from https://chromedriver.chromium.org/downloads and store it in the ~/qwop_RL directory.
- Open a terminal to host the game:
python host_game.py
- Open another terminal to train the agent. You can train in either a CPU or GPU environment.
Our approach uses DQN and DDQN to train the agent to complete the 100 m race. The following diagram illustrates the process of updating the Q-network to approximate the value function, for both DQN with a target network and DDQN. The two share the same update procedure; they differ only in how the TD target is calculated.
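The difference in TD targets can be sketched as follows. This is a minimal NumPy illustration with toy values, not the repository's actual training code; the array shapes (a batch of 4 transitions, 3 actions) are assumptions for the example:

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy stand-ins for the two networks' Q-value outputs on a batch of
# next-states s' (4 transitions, 3 actions). In the real project these
# come from the online and target networks.
q_online = rng.random((4, 3))   # Q(s', a; theta)  from the online network
q_target = rng.random((4, 3))   # Q(s', a; theta-) from the target network

rewards = np.array([1.0, 0.5, 0.0, 2.0])
dones = np.array([0, 0, 1, 0])  # 1 if s' is terminal
gamma = 0.99

# DQN with target network: the target network both selects and
# evaluates the best next action (max over its own Q-values).
td_target_dqn = rewards + gamma * (1 - dones) * q_target.max(axis=1)

# DDQN: the online network selects the action, the target network
# evaluates it, which reduces the overestimation bias of the max.
best_actions = q_online.argmax(axis=1)
td_target_ddqn = rewards + gamma * (1 - dones) * q_target[np.arange(4), best_actions]
```

Because DDQN evaluates an action chosen by a different network, its target is never larger than the DQN target on the same batch.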
Before training the agent, you can configure the training parameters as well as the action/state/reward design. Run the training with one of the following:
- Deep Q Network with Target Network:
python dqn_main.py --train
or
python dqn_main.py --retrain
- Double Deep Q Network:
python ddqn_train.py
To test the trained agent:
- Deep Q Network with Target Network:
python dqn_main.py --test
- Double Deep Q Network:
python ddqn_test.py
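The `--train`/`--retrain`/`--test` flags above suggest a simple mutually exclusive CLI dispatch. Below is a minimal sketch of how such an entry point could look; the `train`, `retrain`, and `test` functions are hypothetical placeholders, not the repository's actual API:

```python
import argparse

def train():
    print("training from scratch")

def retrain():
    print("resuming from a saved checkpoint")

def test():
    print("evaluating the trained agent")

def main(argv=None):
    # Exactly one mode must be chosen, mirroring dqn_main.py's flags.
    parser = argparse.ArgumentParser(description="QWOP DQN runner (sketch)")
    mode = parser.add_mutually_exclusive_group(required=True)
    mode.add_argument("--train", action="store_true")
    mode.add_argument("--retrain", action="store_true")
    mode.add_argument("--test", action="store_true")
    args = parser.parse_args(argv)
    if args.train:
        train()
    elif args.retrain:
        retrain()
    else:
        test()

if __name__ == "__main__":
    main()
```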
- First Running Form Trained (5~10 min):
- Best Running Form Trained (3 min 15 sec): https://youtu.be/tmRIeSdA7N8
- Thanks to the author of QWOP: http://www.foddy.net/Athletics.html
- Thanks to @Wesleyliao for the game engine and help: https://github.com/Wesleyliao/QWOP-RL