Python implementation of Multiple-Observer Information Set Monte Carlo Tree Search (MOISMCTS) for the card game Kariba, as described in this blog.
First, install the required dependencies (with either conda or pip):
conda install numpy
conda install jupyter
conda install tqdm
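Or, equivalently, with pip:
pip install numpy jupyter tqdm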
Clone this repo
git clone https://github.com/KnurpsBram/AI_plays_kariba.git
cd AI_plays_kariba
To play a game of Kariba against an AI using MOISMCTS, open interactive_game.ipynb. From the command line:
cd src
jupyter notebook
MOISMCTS keeps a separate tree per player. A node in this tree is not a state but an 'information set': the set of all states the game could be in given the information that player has. A player cannot observe the opponent's hand, but it can reason about which cards are either in the deck or in the opponent's hand. These cards make up the 'jungle', the union of the deck and the opponent's hand(s). The tree distinguishes between post-action nodes and neutral nodes. We only keep track of the number of simulated wins from post-action nodes onwards, because only for those nodes do we need to compute the upper confidence bound (UCB), which requires the win count.
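As a rough illustration of how a post-action child can be picked from a neutral node, here is a minimal UCB1 sketch using the w (wins) and n (visits) counters that show up in the tree printouts below; this is not the repo's exact code, and the children attribute name is an assumption:

import math

def ucb1(child, parent_visits, c=math.sqrt(2)):
    # child.w: simulated wins recorded at this post-action node
    # child.n: number of simulations that passed through this node
    if child.n == 0:
        return float("inf")          # always try unexplored actions first
    exploit = child.w / child.n      # empirical win rate
    explore = c * math.sqrt(math.log(parent_visits) / child.n)
    return exploit + explore

# during selection, follow the post-action child with the highest UCB1 score:
# best_child = max(node.children, key=lambda child: ucb1(child, node.n))  # 'children' is an assumed attribute name

The sessions below show the same machinery through the repo's actual API.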
>>> from kariba_moismcts import Kariba, moismcts, Simulators, Node
... kariba = Kariba()
... event = kariba.random_card_draw() # actions and card draws are both 'events', represented as dictionaries
... kariba.apply_event(event)
... root_state = kariba
... best_action = moismcts(root_state, n=100)
... print(kariba)
... print(event)
... print(best_action)
100%|██████████| 100/100 [00:00<00:00, 130.70it/s]
-------------------------
turn: player0
deck:
[8 7 8 8 8 7 7 6]
field:
[0 0 0 0 0 0 0 0]
hands:
player0 [0 1 0 0 0 1 1 2]
player1 [0 0 0 0 0 0 0 0]
-------------------------
{'kind': 'deck_draw', 'who': 'player0', 'cards': array([0, 1, 0, 0, 0, 1, 1, 2])}
{'kind': 'action', 'who': 'player0', 'cards': array([0, 0, 0, 0, 0, 1, 0, 0])}
# what happens inside the moismcts function
>>> simulators = Simulators(root_state)
... simulators.apply_event(simulators.select_action())
... simulators.next_turn()
... simulators.apply_event(simulators.random_card_draw()) # deal cards to player1, whose turn it now is
... print("Complete information of the state:")
... print(simulators.game)
... print("Partial information available to player0:")
... print(Node(simulators.game, "player0"))
... print("Partial information available to player1:")
... print(Node(simulators.game, "player1"))
Complete information of the state:
-------------------------
turn: player1
deck:
[7 7 8 8 6 7 7 4]
field:
[0 0 0 0 0 0 1 0]
hands:
player0 [0 1 0 0 0 1 0 2]
player1 [1 0 0 0 2 0 0 2]
-------------------------
Partial information available to player0:
+------------------------
neutral_node
self: player0
turn: player1
n: 0
jungle:
[8 7 8 8 8 7 7 6]
field:
[0 0 0 0 0 0 1 0]
hand:
[0 1 0 0 0 1 0 2]
Partial information available to player1:
+------------------------
neutral_node
self: player1
turn: player1
n: 0
jungle:
[7 8 8 8 6 8 7 6]
field:
[0 0 0 0 0 0 1 0]
hand:
[1 0 0 0 2 0 0 2]
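Note how each player's jungle is simply the deck plus the hand(s) that player cannot see. As a minimal sketch (not code from the repo), player0's jungle above can be reconstructed from the complete state like so:

import numpy as np

deck         = np.array([7, 7, 8, 8, 6, 7, 7, 4])  # the full-information deck printed above
player1_hand = np.array([1, 0, 0, 0, 2, 0, 0, 2])  # hidden from player0
jungle = deck + player1_hand  # from player0's point of view these cards are indistinguishable
print(jungle)                 # [8 7 8 8 8 7 7 6], matching player0's node above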
>>> # run n simulations
... n = 6
... simulators = Simulators(root_state)
... for _ in range(n):
...     while not simulators.game.is_final:
...         simulators.apply_event(simulators.random_card_draw()) # deal cards to the player whose turn it is (on the very first turn this does nothing)
...         simulators.apply_event(simulators.select_action())    # the player whose turn it is selects an action; applying it updates the game and both players' trees
...         simulators.next_turn()
...     winner = simulators.game.leading_player
...     simulators.backpropagate(winner)
...     simulators.reset_game()
... print("Tree of player0 after "+str(n)+" simulations:")
... print(simulators.tree_dict["player0"])
... print("Tree of player1 after "+str(n)+" simulations:")
... print(simulators.tree_dict["player1"])
Tree of player0 after 6 simulations:
+------------------------
neutral_node
self: player0
turn: player1
n: 6
jungle:
[8 7 8 8 8 7 7 6]
field:
[0 0 0 0 0 0 1 0]
hand:
[0 1 0 0 0 1 0 2]
+------------------------
neutral_node
self: player0
turn: player1
n: 2
jungle:
[8 7 8 8 7 7 7 6]
field:
[0 0 0 0 1 0 1 0]
hand:
[0 1 0 0 0 1 0 2]
+------------------------
neutral_node
self: player0
turn: player0
n: 1
jungle:
[7 7 8 8 7 7 7 6]
field:
[0 0 0 0 1 0 1 0]
hand:
[1 1 0 0 0 1 0 2]
+------------------------
neutral_node
self: player0
turn: player1
n: 1
jungle:
[7 7 8 8 8 7 7 6]
field:
[1 0 0 0 0 0 1 0]
hand:
[0 1 0 0 0 1 0 2]
+------------------------
neutral_node
self: player0
turn: player1
n: 1
jungle:
[8 7 8 8 8 7 7 4]
field:
[0 0 0 0 0 0 1 2]
hand:
[0 1 0 0 0 1 0 2]
+------------------------
neutral_node
self: player0
turn: player1
n: 1
jungle:
[8 7 8 8 6 7 7 6]
field:
[0 0 0 0 2 0 1 0]
hand:
[0 1 0 0 0 1 0 2]
+------------------------
neutral_node
self: player0
turn: player1
n: 1
jungle:
[8 7 8 8 8 7 7 5]
field:
[0 0 0 0 0 0 1 1]
hand:
[0 1 0 0 0 1 0 2]
Tree of player1 after 6 simulations:
+------------------------
neutral_node
self: player1
turn: player1
n: 6
jungle:
[7 8 8 8 6 8 7 6]
field:
[0 0 0 0 0 0 1 0]
hand:
[1 0 0 0 2 0 0 2]
+------------------------
post_action_node
self: player1
turn: player1
n: 2
w: 2
jungle:
[7 8 8 8 6 8 7 6]
field:
[0 0 0 0 1 0 1 0]
hand:
[1 0 0 0 1 0 0 2]
+------------------------
neutral_node
self: player1
turn: player0
n: 1
jungle:
[7 8 8 8 6 8 7 4]
field:
[0 0 0 0 1 0 1 2]
hand:
[1 0 0 0 1 0 0 2]
+------------------------
post_action_node
self: player1
turn: player1
n: 1
w: 0
jungle:
[7 8 8 8 6 8 7 6]
field:
[1 0 0 0 0 0 1 0]
hand:
[0 0 0 0 2 0 0 2]
+------------------------
post_action_node
self: player1
turn: player1
n: 1
w: 0
jungle:
[7 8 8 8 6 8 7 6]
field:
[0 0 0 0 0 0 1 2]
hand:
[1 0 0 0 2 0 0 0]
+------------------------
post_action_node
self: player1
turn: player1
n: 1
w: 1
jungle:
[7 8 8 8 6 8 7 6]
field:
[0 0 0 0 2 0 1 0]
hand:
[1 0 0 0 0 0 0 2]
+------------------------
post_action_node
self: player1
turn: player1
n: 1
w: 0
jungle:
[7 8 8 8 6 8 7 6]
field:
[0 0 0 0 0 0 1 1]
hand:
[1 0 0 0 2 0 0 1]
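After the simulation budget is spent, moismcts still has to turn the acting player's tree into a single recommended action. A common convention, shown here only as a hedged sketch (the children and action attribute names are assumptions, and the repo may use a different rule), is to return the root action whose post-action node was visited most often:

def best_root_action(root_node):
    # the most-visited post-action child is a robust proxy for the best action,
    # since exploration noise averages out over many simulations
    best_child = max(root_node.children, key=lambda child: child.n)  # 'children' and 'n' assumed
    return best_child.action                                         # 'action' assumed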