cs181-practical-4

WARMUP: To run policy iteration for the warm-up, type the following inside the "Warmup" directory:

python main.py

This prints the optimal policies and the values corresponding to those policies, and shows a plot of value versus state. To change gamma, edit main.py.
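For readers unfamiliar with policy iteration, here is a minimal sketch on a made-up two-state problem. It is not the code in main.py: the function name `policy_iteration`, the arrays `P` and `R`, and the toy transition and reward values are all assumptions chosen for illustration, and gamma is assumed to play its usual role as the discount factor.

```python
import numpy as np

def policy_iteration(P, R, gamma=0.9, max_iters=1000):
    """Hypothetical helper, not taken from main.py.

    P: array of shape (A, S, S), P[a, s, t] = probability of s -> t under action a.
    R: array of shape (S, A), expected immediate reward for action a in state s.
    """
    n_actions, n_states, _ = P.shape
    states = np.arange(n_states)
    policy = np.zeros(n_states, dtype=int)
    for _ in range(max_iters):
        # Policy evaluation: solve (I - gamma * P_pi) V = R_pi exactly.
        P_pi = P[policy, states]          # shape (S, S)
        R_pi = R[states, policy]          # shape (S,)
        V = np.linalg.solve(np.eye(n_states) - gamma * P_pi, R_pi)
        # Policy improvement: act greedily with respect to V.
        Q = R + gamma * np.einsum("ast,t->sa", P, V)   # shape (S, A)
        new_policy = Q.argmax(axis=1)
        if np.array_equal(new_policy, policy):
            break                         # policy is stable, so stop
        policy = new_policy
    return policy, V

# Toy problem (assumed): action 0 stays put, action 1 switches state.
# Staying in state 1 pays 1; everything else pays 0.
P = np.array([[[1.0, 0.0], [0.0, 1.0]],
              [[0.0, 1.0], [1.0, 0.0]]])
R = np.array([[0.0, 0.0],
              [1.0, 0.0]])

policy, V = policy_iteration(P, R, gamma=0.9)
print("policy:", policy)
print("values:", V)
```

In this toy problem, lowering gamma shrinks the values, because future rewards count for less.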

SWINGYMONKEY: To run one of the reinforcement learning techniques, type one of the following inside the "Monkey" directory:

python model_free.py

(for the model-free, or "Q-learning", technique)

OR

python model_based.py

(for the model-based learning technique)

OR

python td_value.py

(for the temporal-difference learning technique)

This runs the SwingyMonkey simulation with the chosen reinforcement learning technique. The terminal window prints the current iteration, current score, highest score, and average score.
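As a rough illustration of the model-free option above, the sketch below shows a generic tabular Q-learning update with epsilon-greedy action selection. It is not the code in model_free.py: the class name `QLearner`, its parameters, the two-action assumption, and the string state labels are all hypothetical, and a real learner for the game would need its own way of turning the game state into a table key.

```python
import random
from collections import defaultdict

class QLearner:
    """Hypothetical tabular Q-learner; names and defaults are assumptions."""

    def __init__(self, n_actions=2, alpha=0.1, gamma=0.9, epsilon=0.1, seed=0):
        self.n_actions = n_actions
        self.alpha = alpha        # learning rate
        self.gamma = gamma        # discount factor
        self.epsilon = epsilon    # exploration probability
        self.rng = random.Random(seed)
        # Q-table: unseen states start with all-zero action values.
        self.q = defaultdict(lambda: [0.0] * n_actions)

    def act(self, state):
        # Explore with probability epsilon, otherwise pick the greedy action.
        if self.rng.random() < self.epsilon:
            return self.rng.randrange(self.n_actions)
        values = self.q[state]
        return max(range(self.n_actions), key=values.__getitem__)

    def update(self, state, action, reward, next_state, done=False):
        # Q(s, a) += alpha * (r + gamma * max_a' Q(s', a') - Q(s, a))
        future = 0.0 if done else self.gamma * max(self.q[next_state])
        target = reward + future
        self.q[state][action] += self.alpha * (target - self.q[state][action])

# Usage with made-up state labels.
learner = QLearner(alpha=0.5, gamma=0.9, epsilon=0.0)
learner.update("s0", 1, 1.0, "s1")
print(learner.q["s0"], learner.act("s0"))
```

States are used as dictionary keys here, so any hashable, discretized description of the game state would work.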
