Remove necessity to keep explicit default policy for MCCFR solvers #154

inejc · 2020-02-27T16:48:44Z

These changes are based on discussions in #149. The idea is to remove the need to keep an explicit default policy within MCCFR solvers and thus make them feasible to run on large games where memory constraints are tighter.

The proposed changes allow for usage of:

TabularPolicy which is the default behavior and the current state
UniformPolicy for cases where the inferred average CFR policy will only ever be queried with a State instance
nullptr which will enable to also query the inferred policy with an info state string but move the handling of info state lookup fails to the external caller (empty policy would be returned in that case)

googlebot · 2020-02-27T16:48:53Z

Thanks for your pull request. It looks like this may be your first contribution to a Google open source project (if not, look below for help). Before we can look at your pull request, you'll need to sign a Contributor License Agreement (CLA).

📝 Please visit https://cla.developers.google.com/ to sign.

Once you've signed (or fixed any issues), please reply here with @googlebot I signed it! and we'll verify it.

What to do if you already signed the CLA

Individual signers

It's possible we don't have your GitHub username or you're using a different email address on your commit. Check your existing CLA data and verify that your email is set on your git commits.

Corporate signers

Your company has a Point of Contact who decides which employees are authorized to participate. Ask your POC to be added to the group of authorized contributors. If you don't know who your Point of Contact is, direct the Google project maintainer to go/cla#troubleshoot (Public version).
The email used to register you as an authorized contributor must be the email used for the Git commit. Check your existing CLA data and verify that your email is set on your git commits.
The email used to register you as an authorized contributor must also be attached to your GitHub account.

ℹ️ Googlers: Go here for more info.

inejc · 2020-02-27T17:05:36Z

@googlebot I signed it!

googlebot · 2020-02-27T17:05:42Z

CLAs look good, thanks!

ℹ️ Googlers: Go here for more info.

lanctot · 2020-02-27T17:39:20Z

This is great, thanks!

I'm making a few (mostly) cosmetic changes. Turns out we don't have to store a tabular policy after all (we have a UniformPolicy object and the best response code does state-based queries already.

Should get merged in Monday's update. (Please don't close the PR.)

lanctot · 2020-02-27T17:43:40Z

Nevermind, I spoke too soon. They do indeed need to be tabular policies due to assumptions in the best response code (which we should probably tweak eventually...)

inejc · 2020-02-27T18:10:51Z

Nevermind, I spoke too soon. They do indeed need to be tabular policies due to assumptions in the best response code (which we should probably tweak eventually...)

I see... I suppose this isn't a blocker for the proposed changes?

One minor thing: we should probably rename uniform_policy_ in solvers to default_policy_ since this could, in theory, be something different to uniform now. Please LMK whether I can add commits to this PR or you want to potentially change that later?

lanctot · 2020-02-27T18:32:27Z

I see... I suppose this isn't a blocker for the proposed changes?

Correct! It's already under internal review, which should be easy to get done today, so will almost surely get merged in Monday's update.

One minor thing: we should probably rename uniform_policy_ in solvers to default_policy_ since this could, in theory, be something different to uniform now. Please LMK whether I can add commits to this PR or you want to potentially change that later?

Yep, that was one of the changes I made :) (renaming to default_policy_) Please don't add commits at this point because I've already imported the PR and the changes would clash with mine.

inejc added 3 commits February 27, 2020 16:40

add vscode IDE to gitignore

73e1f44

generalize default policy within CFRAveragePolicy and CFRCurrentPolicy

88d7470

add ability to pass custom uniform default policy to MCCFR solvers

c9d3fde

googlebot added the cla: no label Feb 27, 2020

inejc mentioned this pull request Feb 27, 2020

Persisting checkpoints of CFR solvers #149

Closed

googlebot added cla: yes and removed cla: no labels Feb 27, 2020

lanctot self-assigned this Feb 27, 2020

OpenSpiel merged commit 36ba1b1 into google-deepmind:master Mar 2, 2020

inejc deleted the inejc/fix-explicit-cfr-policy branch March 2, 2020 15:46

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Remove necessity to keep explicit default policy for MCCFR solvers #154

Remove necessity to keep explicit default policy for MCCFR solvers #154

inejc commented Feb 27, 2020

googlebot commented Feb 27, 2020

inejc commented Feb 27, 2020

googlebot commented Feb 27, 2020

lanctot commented Feb 27, 2020

lanctot commented Feb 27, 2020

inejc commented Feb 27, 2020

lanctot commented Feb 27, 2020

Remove necessity to keep explicit default policy for MCCFR solvers #154

Remove necessity to keep explicit default policy for MCCFR solvers #154

Conversation

inejc commented Feb 27, 2020

googlebot commented Feb 27, 2020

What to do if you already signed the CLA

Individual signers

Corporate signers

inejc commented Feb 27, 2020

googlebot commented Feb 27, 2020

lanctot commented Feb 27, 2020

lanctot commented Feb 27, 2020

inejc commented Feb 27, 2020

lanctot commented Feb 27, 2020