BestResponsePOMDP.jl

Converts extensive form games into POMDPs by keeping one players' strategy constant.

Currently, only the generative model is known, so POMCP is the only online solver able to efficiently compute an approximates best response.

Usage

using CounterfactualRegret
using CounterfactualRegret.Games
using BestResponsePOMDP

game = Kuhn()
sol = CFRSolver(game)
e_sol = POMCPExploitabilitySolver(sol, POMCPSolver(max_depth=10, max_time=0.1, tree_queries=10_000))
cb = POMCPExploitabilityCallback(e_sol, 1)
train!(sol, 1000, cb=cb; show_progress=true)

using Plots
plot(cb.hist)

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
.github/workflows		.github/workflows
img		img
src		src
test		test
.gitignore		.gitignore
Project.toml		Project.toml
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

.github/workflows

.github/workflows

img

img

src

src

test

test

.gitignore

.gitignore

Project.toml

Project.toml

README.md

README.md

Repository files navigation

BestResponsePOMDP.jl

Usage

About

Releases

Packages

Languages

WhiffleFish/BestResponsePOMDP.jl

Folders and files

Latest commit

History

Repository files navigation

BestResponsePOMDP.jl

Usage

About

Resources

Stars

Watchers

Forks

Languages