Erica Dominic, Erica Detemmerman, and Isaac Haberman
The game of googol is the quintessential optimal stopping problem: the player's only task is to determine the best time to quit the game. We trained four agents to play the game of googol using common reinforcement learning algorithms: first-visit Monte Carlo, SARSA, Q-learning, and deep Q-learning. All agents learn to play the game of googol with success rates between 19% and 37%, with the deep Q-learning agent proving the most successful. Furthermore, we demonstrate that all agents are able to transfer the knowledge accumulated while playing one version of the game to another version of the game. We also conducted a small number of trials in which human participants were asked to play the game of googol. Collectively, human participants attained a success rate of 35%, comparable to the deep Q-learning agent; however, as trends in stopping choices show, human players' strategies do not correspond to those observed in the reinforcement learning agents.
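For readers unfamiliar with the game, the sketch below simulates one common formulation: n hidden numbers are revealed one at a time, and the player wins only by stopping on the overall maximum. The classic baseline strategy (observe roughly n/e values, then stop at the first value exceeding the maximum seen so far) attains a success rate near 37%, the same ballpark as the agents reported above. This is an illustrative sketch, not the environment or agents used in the report; in particular, drawing values uniformly at random is an assumption, since the original game allows arbitrary numbers.

```python
import random

def play_googol(n=100, cutoff=None, rng=random):
    """Play one game: win only by stopping on the maximum of n hidden values.

    Values are drawn uniformly at random (an assumption; the original game
    allows an adversary to write down arbitrary numbers).
    """
    values = [rng.random() for _ in range(n)]
    best = max(values)
    if cutoff is None:
        cutoff = round(n / 2.718281828)  # classic 1/e observation phase
    # Observe the first `cutoff` values without stopping.
    threshold = max(values[:cutoff]) if cutoff > 0 else float("-inf")
    # Stop at the first later value that beats everything observed so far.
    for v in values[cutoff:]:
        if v > threshold:
            return v == best
    # Forced to accept the last value if nothing beat the threshold.
    return values[-1] == best

def success_rate(trials=10000, n=100):
    """Estimate the win probability of the 1/e-cutoff strategy."""
    wins = sum(play_googol(n) for _ in range(trials))
    return wins / trials
```

Running `success_rate()` should return a value near 0.37, consistent with the well-known 1/e bound for this class of stopping problems.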
See Report.pdf for more information.