In the Coordination game <motivating-example-coordination-game>
in how many situations do neither player have an incentive to independently change their strategy?
Neither player having a reason to change their strategy implies that both strategies are Best responses<definition-of-best-response>
to each other.
To identify such pairs of strategies, we will use the best_response_condition
by considering all possible non zero valued elements σr and σc.
Recall that for the Coordination game the matrices A and B are given by:
If we consider strategies that only play a single action there are two options for each strategy:
σr ∈ {(1, 0), (0, 1)}
and:
σc ∈ {(1, 0), (0, 1)}
We will inspect all four combinations:
- σr = (1, 0) and σc = (1, 0) which corresponds to both players playing their first action which gives: ur(σr, σc) = 3 and uc(σr, σc) = 2. If the row player where to modify their strategy (while the column player stayed unchanged) to play the second action their utility would decrease. Likewise, if the column player were to modify their strategy their utility would also decrease.
- σr = (1, 0) and σc = (0, 1) which corresponds to the row player playing their first action and the column player playing their second action which gives: ur(σr, σc) = 1 and uc(σr, σc) = 1. In this case, if either player were to move their utility would increase.
- σr = (0, 1) and σc = (1, 0) which corresponds to the row player playing their second action and the column player playing their first action which gives: ur(σr, σc) = 0 and uc(σr, σc) = 0. In this case, if either player were to move their utility would increase.
- σr = (0, 1) and σc = (0, 1) which corresponds to both players playing their second action which gives: ur(σr, σc) = 2 and uc(σr, σc) = 3. If the row player where to modify their strategy (while the column player stayed unchanged) to play the second action their utility would decrease. Likewise, if the column player were to modify their strategy their utility would also decrease.
If we now consider strategies that play both actions there is a single general form:
σr = (x, 1 − x) for 0 < x < 1
σc = (y, 1 − y) for 0 < y < 1
We can apply the best_response_condition
here.
If σr is a best response to σc then:
(AσcT)i = maxk ∈ {1, 2}(AσcT)k for all i ∈ {1, 2}
which gives:
which in turn corresponds to:
Thus σr = (x, 1 − x) with 0 < x < 1 is a best response to σc if and only if σc = (1/4, 3/4).
We will now apply the best_response_condition
again but to the column player:
If σc is a best response to σr then:
(σrB)j = maxk ∈ {1, 2}(σrB)k for all j ∈ {1, 2}
which gives:
which in turn corresponds to:
Thus σc = (y, 1 − y) with 0 < y < 1 is a best response to σr if and only if σr = (3/4, 1/4).
There are 3 pairs of strategies that are best responses to each other:
- σr = (1, 0) and σc = (1, 0).
- σr = (0, 1) and σc = (0, 1).
- σr = (3/4, 1/4) and σc = (1/4, 3/4).
The approach used in motivating-example-coordination-game-nash-equilibria
is in fact an application of a formalised algorithm called support enumeration.
The algorithm is as follows:
For a non Degenerate <degenerate-games-discussion>
2 player game (A, B) ∈ ℝm × n2 the following algorithm returns all pairs of best responses:
- For all 1 ≤ k1 ≤ m and 1 ≤ k2 ≤ n;
- For all pairs of
support <definition-of-support-of-a-strategy>
(I, J) with |I| = k1 and |J| = k2. -
Solve the following equations (this ensures we have best responses):
∑i ∈ IσriBij = v for all j ∈ J
∑j ∈ JAijσcj = u for all i ∈ I - Solve
-
$\sum_{i=1}^{m}{\sigma_{r}}_i=1$ and σri ≥ 0 for all i -
$\sum_{j=1}^{n}{\sigma_{c}}_i=1$ and σcj ≥ 0 for all j
-
- Check the best response condition.
Repeat steps 3,4 and 5 for all potential support pairs.
Question
Use support enumeration to find all Nash equilibria for the game given by $A=\begin{pmatrix} 1 & 1 & -1 \\ 2 & -1 & 0 \end{pmatrix}$ and $B=\begin{pmatrix} 1/2 & -1 & -1/2 \\-1 & 3 & 2 \end{pmatrix}$.
Answer
- It is immediate to note that there are no pairs of pure best responses.
- All possible support pairs are:
- I = {1, 2} and J = {1, 2}
- I = {1, 2} and J = {1, 3}
- I = {1, 2} and J = {2, 3}
- Let us solve the corresponding linear equations:
-
I = {1, 2} and J = {1, 2}:
1/2σr1 − σr2 = − σr1 + 3σr2
σr1 = 8/3σr2
σc1 + σc2 = 2σc1 − σc2
σc1 = 2σc2 -
I = {1, 2} and J = {1, 3}:
1/2σr1 − σr2 = − 1/2σr1 + 2σr2
σr1 = 3σr2
σc1 − σc3 = 2σc1 + 0σc3
σc1 = − σc3 -
I = {1, 2} and J = {2, 3}:
− σr1 + 3σr2 = − 1/2σr1 + 2σr2
σr1 = 2σr2
σc2 − σc3 = − σc2 + 0σc3
2σc2 = σc3
-
- We check which supports give valid strategies:
-
I = {1, 2} and J = {1, 2}:
σr = (8/11, 3/11)
σc = (2/3, 1/3, 0) -
I = {1, 2} and J = {1, 3}:
σr = (3/4, 1/4)
σc = (k, 0, − k)which is not a valid strategy.
-
I = {1, 2} and J = {2, 3}:
σr = (2/3, 1/3)
σc = (0, 1/3, 2/3)
-
-
Let us verify the best response condition:
-
I = {1, 2} and J = {1, 2}:
σc = (2/3, 1/3, 0)
$$\begin{aligned} A\sigma_c^T= \begin{pmatrix} 1\\ 1 \end{pmatrix} \end{aligned}$$Thus σr is a best response to σc
σr = (8/11, 3/11)
σrB = (1/11, 1/11, 2/11)Thus σc is not a best response to σr (because there is a better response outside of the support of σc).
-
I = {1, 2} and J = {2, 3}:
σc = (0, 1/3, 2/3)
$$\begin{aligned} A\sigma_c^T= \begin{pmatrix} -1/3\\ -1/3 \end{pmatrix} \end{aligned}$$Thus σr is a best response to σc
σr = (2/3, 1/3)
σrB = (0, 1/3, 1/3)Thus σc is a best response to σr.
Thus the (unique) Nash equilibrium for this game is:
((2/3, 1/3), (0, 1/3, 2/3)) -
See how-to-use-support-enumeration
for guidance of how to use Nashpy to use support enumeration.