How to make a copy for simulation #196

LucasColas · 2023-05-31T06:30:49Z

Hello,
I would like to know how I can make a copy of a game (with the same players, the same pots, etc.).
I need it for Monte Carlo Tree Search where the algorithm needs to do simulation.
I can't use copy.deepcopy because the game object holds generators, which cannot be deep-copied.
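For anyone hitting the same error, here is a minimal reproduction that does not depend on the library. The `Table` class and its `_turn_order` attribute are hypothetical stand-ins for a game object that stores a live generator; they are not part of texasholdem.

```python
import copy


class Table:
    """Hypothetical stand-in for a game object that stores a live generator."""

    def __init__(self, num_players):
        self.num_players = num_players
        # A generator object wraps a suspended stack frame, which cannot be pickled.
        self._turn_order = (i for i in range(num_players))


def try_deepcopy(obj):
    """Return (copy, None) on success or (None, error message) on failure."""
    try:
        return copy.deepcopy(obj), None
    except TypeError as err:
        return None, str(err)


clone, error = try_deepcopy(Table(6))
print(clone, error)
```

deepcopy walks the instance `__dict__`, reaches the generator, and raises `TypeError`, so any copy has to rebuild the iterators rather than clone them.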

SirRender00 · 2023-06-01T03:39:19Z

Good point. I do not believe there is a simple way to do this right now. There are also some open questions about copying the Deck in this use case: most likely we'll want to reshuffle the Deck in each copy so that the Monte Carlo simulation does not learn the hidden state of the Deck.
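As a rough illustration of the reshuffling idea, the sketch below gives each simulated copy its own random ordering of the undealt cards. It uses plain strings for cards and a hypothetical helper name (`reshuffled_deck`); it is not the library's Deck API.

```python
import random


def reshuffled_deck(remaining_cards, seed=None):
    """Return a new list with the undealt cards in a fresh random order.

    The input list is left untouched, so the real game keeps its true order.
    """
    rng = random.Random(seed)
    shuffled = list(remaining_cards)
    rng.shuffle(shuffled)
    return shuffled


real_deck = ["Ah", "Kd", "7c", "2s", "Td", "9h"]
sim_deck = reshuffled_deck(real_deck, seed=1)
```

Each simulation sees the same set of unseen cards but cannot rely on their real order.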

LucasColas · 2023-06-01T06:01:15Z

Ok,
I tried to copy every attribute, but there are still some issues, most of them related to the generators.

LucasColas · 2023-06-15T16:08:09Z

Hello,

I tried another way to make a copy.

I copy the cards of each board.
However, I have an issue when a hand is evaluated: prime_product_from_rankbits returns a number that does not exist in the lookup table.

LucasColas · 2023-06-17T17:40:31Z

I tried to do a copy using hand_history.
Here is my code (adapted from the _import_history(history: History) method):

# Import paths below are assumed from the texasholdem package layout.
from texasholdem.game.game import TexasHoldEm
from texasholdem.gui.text_gui import TextGUI
from texasholdem.card.deck import Deck
from texasholdem.agents.basic import random_agent

# `history` is the History object of the hand to replay
num_players = len(history.prehand.player_chips)
game = TexasHoldEm(
    buyin=1,
    big_blind=history.prehand.big_blind,
    small_blind=history.prehand.small_blind,
    max_players=num_players,
)

gui = TextGUI(game=game)

# button placed right before 0
game.btn_loc = num_players - 1

# read chips
for i in game.player_iter(0):
    game.players[i].chips = history.prehand.player_chips[i]

# stack deck
deck = Deck()
if history.settle:
    deck.cards = list(history.settle.new_cards)

# player actions in a stack
player_actions = []
for bet_round in (history.river, history.turn, history.flop, history.preflop):
    if bet_round:
        deck.cards = bet_round.new_cards + deck.cards
        for action in reversed(bet_round.actions):
            player_actions.insert(
                0, (action.player_id, action.action_type, action.total)
            )

# start hand (deck will deal)
game.start_hand()

# give players old cards
for i in game.player_iter():
    game.hands[i] = history.prehand.player_cards[i]

# swap decks
game._deck = deck

while game.is_hand_running():
    gui.display_state()
    gui.wait_until_prompted()
    try:
        player_id, action_type, total = player_actions.pop(0)
        game.take_action(action_type=action_type, total=total)
    except Exception:
        # history exhausted (or recorded action rejected): fall back to a random move
        action, total = random_agent(game)
        game.take_action(action_type=action, total=total)

    gui.display_action()

gui.display_win()

It works pretty well, except that I can't get the same sb_loc, bb_loc, and btn_loc. I can overwrite them with the values from my hand, but the next current player is still different.

LucasColas · 2023-06-17T17:44:20Z

I think this issue is related to the _prehand method of TexasHoldEm, because of this:

self.btn_loc = active_players[0]
self.sb_loc = active_players[1]

# heads up edge case => sb = btn
if len(active_players) == 2:
    self.sb_loc = self.btn_loc

self.bb_loc = next(self.in_pot_iter(self.sb_loc + 1))

And this :

self._player_post(self.sb_loc, self.small_blind)
self._player_post(self.bb_loc, self.big_blind)
self.last_raise = 0

# action to left of BB
self.current_player = next(self.in_pot_iter(loc=self.bb_loc + 1))
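To make the effect of those lines easier to follow, here is a simplified, library-free model of the position logic. It assumes that `active_players` is built by walking the seats clockwise starting just after the old button and that every active player is in the pot at this point; both are assumptions about the surrounding code, and the helper names are hypothetical.

```python
def next_seat(seats, start, num_seats):
    """First seat in `seats` at or after `start`, wrapping around the table."""
    for offset in range(num_seats):
        candidate = (start + offset) % num_seats
        if candidate in seats:
            return candidate
    raise ValueError("no active seats")


def prehand_positions(old_btn, active_seats, num_seats):
    """Model of the button, blinds, and first player to act for a new hand."""
    # Active seats in clockwise order, starting just after the old button.
    order = [(old_btn + 1 + k) % num_seats for k in range(num_seats)]
    active = [seat for seat in order if seat in active_seats]
    if len(active) < 2:
        raise ValueError("need at least two active players")

    btn = active[0]
    sb = active[1]
    if len(active) == 2:
        # heads-up edge case: the button posts the small blind
        sb = btn
    bb = next_seat(active_seats, sb + 1, num_seats)
    # action starts to the left of the big blind
    current = next_seat(active_seats, bb + 1, num_seats)
    return {"btn": btn, "sb": sb, "bb": bb, "current": current}


six_max = prehand_positions(old_btn=5, active_seats={0, 1, 2, 3, 4, 5}, num_seats=6)
heads_up = prehand_positions(old_btn=1, active_seats={0, 1}, num_seats=2)
```

In this model the positions are derived from the old button alone, so overwriting btn_loc, sb_loc, and bb_loc after the hand has started does not change who acts first unless current_player is recomputed as well.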

SirRender00 · 2023-06-18T07:24:51Z

I think a full __copy__ dunder method will be very similar to the _import_history method, so this is definitely a good start.

As for the discrepancies with the button/sb/bb locations, this is probably because the hand history attached to the game object does not have canonical player IDs (e.g. when we export, we make the button player id 0). So this line game.btn_loc = num_players - 1 may not be right.
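A sketch of what such a remapping could look like, assuming canonical IDs are simply seat numbers rotated so that the button becomes 0. The two helper functions are hypothetical and only illustrate the idea; the library's export code may differ.

```python
def to_canonical(seat, btn_loc, num_players):
    """Rotate seat numbers so that the button seat maps to canonical ID 0."""
    return (seat - btn_loc) % num_players


def from_canonical(canonical_id, btn_loc, num_players):
    """Inverse of to_canonical: recover the real seat from a canonical ID."""
    return (canonical_id + btn_loc) % num_players


# With the button on seat 3 of a six-player table:
canonical_ids = [to_canonical(seat, 3, 6) for seat in range(6)]
```

Under that assumption, player IDs read from an exported history would need to go through `from_canonical` with the real button location before they match the seats of the running game.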

I am planning to take a look at this when I have some time in the coming weeks. Thanks for digging into it, and feel free to continue experimenting.

LucasColas · 2023-06-18T09:29:21Z

Here's another thing I tried:

def generate_game(history, blinds, gui=False):
    num_players = len(history.prehand.player_chips)
    game = TexasHoldEm(
        buyin=1,
        big_blind=history.prehand.big_blind,
        small_blind=history.prehand.small_blind,
        max_players=num_players,
    )

    gui = TextGUI(game=game)

    # button placed right before 0
    game.btn_loc = num_players - 1

    # stack deck
    deck = Deck()
    if history.settle:
        deck.cards = list(history.settle.new_cards)

    # player actions in a stack
    player_actions = []
    for bet_round in (history.river, history.turn, history.flop, history.preflop):
        if bet_round:
            deck.cards = bet_round.new_cards + deck.cards
            for action in reversed(bet_round.actions):
                player_actions.insert(
                    0, (action.player_id, action.action_type, action.total)
                )

    # start hand (deck will deal)
    game.start_hand()

    # give players old cards
    for i in game.player_iter():
        game.hands[i] = history.prehand.player_cards[i]

    game.pots = [Pot()]

    # read chips
    for i in game.player_iter(0):
        game.players[i].chips = history.prehand.player_chips[i]
        game.players[i].state = PlayerState.IN
        game.players[i].last_pot = 0

    game.btn_loc = history.prehand.btn_loc
    game.sb_loc = blinds[0]
    game.bb_loc = blinds[1]
    game._player_post(game.sb_loc, history.prehand.small_blind)
    game._player_post(game.bb_loc, history.prehand.big_blind)
    game.current_player = next(game.in_pot_iter(loc=game.bb_loc + 1))
    print("current_player : ", game.current_player)
    print("pot iter : ", next(game.in_pot_iter(loc=game.bb_loc + 1)))

    # swap decks
    game._deck = deck

    while game.is_hand_running():
        print("current_player : ", game.current_player)
        gui.display_state()
        gui.wait_until_prompted()
        try:
            print("game current player : ", game.current_player)
            player_id, action_type, total = player_actions.pop(0)
            game.current_player = player_id
            game.take_action(action_type=action_type, total=total)
            # print("current_player : ", game.current_player)
            print("player iter", next(game.player_iter(game.current_player)))
        except Exception as e:
            print(e)
            print("random action")
            action, total = random_agent(game)
            game.take_action(action_type=action, total=total)

        gui.display_action()

    gui.display_win()

That's pretty similar to the previous code.
It seems that next(game.player_iter(game.current_player)) gives the next current player, but that player is different from the one recorded in the history.

SirRender00 · 2023-06-18T18:27:29Z

^ This pull request should do it. I'll make a prerelease soon and you can try it out and provide feedback if that does the trick.

LucasColas · 2023-06-18T19:24:26Z

Thank you for your effort.

SirRender00 · 2023-06-18T20:49:38Z

@LucasColas Okay, 0.10-alpha.0 is the prerelease for this if you want to upgrade to it and try it out.

LucasColas · 2023-06-19T19:30:55Z

The players' cards are not the same in the copy?

LucasColas · 2023-06-19T19:38:17Z

I think we should add the possibility to copy the cards of one or several players.

SirRender00 · 2023-06-19T20:20:09Z

Yes, seems like the hands are not being copied properly. Gonna take a look at this

SirRender00 · 2023-06-19T20:43:48Z

@LucasColas Okay, I just released 0.10-alpha.1, which should copy the hands correctly.

LucasColas · 2023-06-20T04:26:44Z

OK, thank you. I'll try.

LucasColas · 2023-06-20T04:30:55Z

It seems to be working.

LucasColas · 2023-06-20T04:57:53Z

Do you think it's possible to copy the cards of selected players only?

SirRender00 · 2023-06-20T06:20:16Z

What are you trying to do exactly? The intent might be out of the realm of the TexasHoldEm object. This is possible by messing with the hands attribute and the _deck attribute.

LucasColas · 2023-06-20T06:39:19Z

Let's assume I want to keep the cards of player 3, but the other players should not keep theirs.
So my copy should return the same game, except that players 1, 2, 4, ..., n would have different cards. In this example, the board cards and player 3's cards stay the same.

I think it's better to do this because for Monte Carlo Tree Search you're not supposed to know the cards of the opponents (except if they reveal the cards).
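A minimal sketch of that idea, independent of the library: the opponents' hole cards go back into the pool of unseen cards, the pool is shuffled, and the opponents are dealt new cards. Cards are plain strings here, and `redeal_hidden_hands` is a hypothetical helper, not part of texasholdem; with the real object the same steps would presumably be applied to the hands and _deck attributes mentioned above.

```python
import random


def redeal_hidden_hands(hands, deck, keep, seed=None):
    """Return (new_hands, new_deck) where only players in `keep` retain their cards.

    hands: dict mapping player_id to a list of hole cards
    deck:  list of undealt cards (board cards are not in it and stay unchanged)
    keep:  set of player ids whose cards are known to the observer
    """
    rng = random.Random(seed)
    # Cards the observer cannot see: opponents' hole cards plus the undealt deck.
    unseen = list(deck)
    for player_id, cards in hands.items():
        if player_id not in keep:
            unseen.extend(cards)
    rng.shuffle(unseen)

    new_hands = {}
    for player_id, cards in hands.items():
        if player_id in keep:
            new_hands[player_id] = list(cards)
        else:
            # Deal the same number of cards from the shuffled unseen pool.
            new_hands[player_id] = [unseen.pop() for _ in cards]
    return new_hands, unseen


hands = {1: ["Ah", "Kh"], 2: ["7c", "7d"], 3: ["Qs", "Js"]}
deck = ["2c", "3d", "9h", "Tc", "4s", "8d"]
new_hands, new_deck = redeal_hidden_hands(hands, deck, keep={3}, seed=0)
```

Player 3 keeps the original hand, while players 1 and 2 receive cards drawn from everything player 3 cannot see, so no card is duplicated or lost.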

LucasColas · 2023-06-20T15:21:33Z

Here's a possible draft : #203

SirRender00 added the enhancement New feature or request label Jun 1, 2023

SirRender00 mentioned this issue Jun 18, 2023

feat: Ability to create a copy of the TexasHoldEm object #199

Merged

SirRender00 added this to the v1.0.0 milestone Jun 18, 2023

SirRender00 self-assigned this Jun 18, 2023

SirRender00 mentioned this issue Jun 19, 2023

0.10 #201

Merged
