
FEAT: Add tools for repeated games #347

Closed

QBatista wants to merge 11 commits into QuantEcon:master from QBatista:repeated-games

Conversation

@QBatista (Member)

Adds tools for repeated games, including the outer hyperplane approximation described by Judd, Yeltekin, and Conklin (2002).

The implementation is currently much slower than the one in Julia.

# Setup

import numpy as np
from quantecon.game_theory import Player, NormalFormGame
# On this PR's branch, RepeatedGame and outerapproximation are exported
# from quantecon.game_theory (see the __init__ hunk reviewed below)
from quantecon.game_theory import RepeatedGame, outerapproximation

pd_payoff = np.array([[9.0, 1.0], [10.0, 3.0]])

A = Player(pd_payoff)
B = Player(pd_payoff)

nfg = NormalFormGame((A, B))
rpd = RepeatedGame(nfg, 0.75)
  • In Julia: 1.532440 seconds (1.17 M allocations: 87.199 MiB, 0.92% gc time)
  • In Python: 1 loop, best of 3: 1min 8s per loop
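
The timed call is not shown above; presumably it is the outer approximation itself, e.g. in IPython (using rpd from the setup and outerapproximation from this PR's repeated_game.py):

%timeit outerapproximation(rpd)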

@coveralls

Coverage increased (+0.09%) to 93.75% when pulling fe1c996 on QBatista:repeated-games into f48e7d7 on QuantEcon:master.

@oyamad (Member) left a comment:
I haven't followed the details. Only small comments at the moment.

General comments:

  • Try %prun to detect bottlenecks (see the example after this list).
  • Try method='interior-point' as a linprog option (added in scipy/scipy#7123), which is available with the latest dev version of scipy.
  • Consider providing a linprog_method option to outerapproximation, to be passed to linprog.
  • Modification of the docstring in ce_util.py should belong to a separate PR.
  • Did you compare gridmake with cartesian?
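
A minimal profiling example for the first point (a sketch, assuming rpd and outerapproximation from the setup in the PR description):

# In IPython, profile one run and sort by cumulative time to spot bottlenecks
%prun -s cumulative outerapproximation(rpd)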

best_dev_payoff_i, best_dev_payoff_1, best_dev_payoff_2, initialize_hpl,
worst_value_i, worst_value_1, worst_value_2, worst_values, RepeatedGame,
outerapproximation
)

Member:
I don't think all the routines should be imported. Just import important routines (perhaps RepeatedGame and outerapproximation?).
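
For example, the import could be reduced to something like this (the relative path is an assumption; it is not shown in this hunk):

from .repeated_game import RepeatedGame, outerapproximation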

@@ -0,0 +1,384 @@
"""
Filename: repeated_game.py
Author: Quentin Batista

Member:
The author (Chase Coleman) of the original code also should be here.


class RepeatedGame:
"""
Class representing an N-player repeated form game.

Member:
Remove "form".


# Create the unit circle, points, and hyperplane levels
C, H, Z = initialize_sg_hpl(rpd, nH)
Cnew = copy.copy(C)

Member:
Wouldn't C.copy() work?

warn("Maximum Iteration Reached")

# Update hyperplane levels
C = copy.copy(Cnew)

Member:
Wouldn't C[:] = Cnew work?


# Set iterative parameters and iterate until converged
itr, dist = 0, 10.0
while (itr < maxiter) & (dist > tol):

Member:
Use and instead of &.

tol_int = int(round(abs(np.log10(tol))) - 1)

# Find vertices that are unique within tolerance level
vertices = np.vstack({tuple(row) for row in np.round(vertices, tol_int)})

Member:
Are these two blocks really necessary?

@QBatista (Member Author) Oct 1, 2017:
They seem to be -- here is the output I get without them:

array([[ 10.00000001,   3.97266052],
       [ 10.00000001,   2.99999998],
       [  2.99999998,  10.00000001],
       [  3.97266052,  10.00000001],
       [  9.00000001,   8.99999999],
       [  8.99999999,   9.00000001],
       [  2.99999998,   3.00000001],
       [  2.99999998,   3.        ],
       [  2.99999999,   2.99999999],
       [  2.99999998,   3.        ],
       [  3.00000001,   2.99999998],
       [  3.        ,   2.99999998],
       [  2.99999999,   2.99999999],
       [  3.        ,   2.99999998],
       [  9.00000001,   9.        ],
       [  9.00000001,   9.        ],
       [  9.        ,   9.00000001],
       [  9.        ,   9.00000001]])

Member:
I wonder why we have these duplications. We should look into the algorithm.

Member:
In numpy version 1.13.1 they have updated the np.unique function to accept an axis argument -- Once this is released, we could do something like:

_, inds = np.unique(np.round(vertices, tol_int), axis=0, return_index=True)
vertices = vertices[inds, :]

or just

vertices = np.unique(np.round(vertices, tol_int), axis=0)

depending on whether we want the returned values to be rounded or not.



class RGUtil:
def frange(start, stop, step=1.):

Member:
What's wrong with using np.linspace?

x = x0 + i * step
yield x

def unitcircle(npts):

Member:
I would put this in repeated_game.py, as it's very specific to the code there.
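
For reference, a np.linspace-based version of unitcircle (a sketch assuming it returns npts equally spaced points on the unit circle, as the calling code suggests; not necessarily the PR's exact implementation):

import numpy as np

def unitcircle(npts):
    # npts equally spaced angles in [0, 2*pi), one (cos, sin) pair per row
    angles = np.linspace(0, 2 * np.pi, npts, endpoint=False)
    return np.column_stack((np.cos(angles), np.sin(angles)))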

pure_nash_exists = pure_nash_brute(sg)

if not pure_nash_exists:
raise ValueError('No pure action Nash equilibrium exists in stage game')

Member:
No need to compute all the pure Nash equilibria.

Try:

try:
    next(pure_nash_brute_gen(sg))
except StopIteration:
    raise ValueError('No pure action Nash equilibrium exists in stage game')

@oyamad added the in-work label on Sep 30, 2017
@QBatista force-pushed the repeated-games branch 3 times, most recently from c7f46e4 to 90646c2 on October 1, 2017
@coveralls

Coverage increased (+0.003%) to 94.378% when pulling a60656a on QBatista:repeated-games into 21c1a94 on QuantEcon:master.

@coveralls

Coverage increased (+0.02%) to 94.396% when pulling 4afe764 on QBatista:repeated-games into 21c1a94 on QuantEcon:master.

@QBatista (Member Author) commented Oct 1, 2017

Here is the comparison between cartesian and gridmake -- there does not seem to be much difference.

  • With cartesian: 1min 22s ± 3.32 s per loop (mean ± std. dev. of 7 runs, 1 loop each)
  • With gridmake : 1min 18s ± 1.7 s per loop (mean ± std. dev. of 7 runs, 1 loop each)

@oyamad (Member) commented Oct 1, 2017

It is interesting that cartesian is slower than gridmake (if the difference is in fact significant).
There has been a discussion that gridmake would be removed in favor of cartesian (sometime in the future); see #35.

@albop (Contributor) commented Oct 1, 2017

@oyamad and @QBatista: I refreshed my memory by looking at the implementations of cartesian and gridmake.
cartesian uses a function repeat_1d, which was at the time faster than numpy's tile and repeat used in gridmake. Maybe this is not true anymore, which would explain why both methods have similar timings.
Actually, cartesian still does many memory-allocating operations, so it is probably not near optimal performance (it would be easy to write a 2d benchmark). It doesn't do recursive calls like gridmake though, so I would expect it to be faster for higher-dimensional products.
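
For illustration, the kind of Numba kernel being compared might look like this in the 2d case (a sketch, not the gist's exact code):

import numpy as np
from numba import njit

@njit
def product_2d(x, y):
    # Cartesian product of two 1d arrays written as explicit loops, so Numba
    # compiles it without intermediate allocations
    n, m = x.shape[0], y.shape[0]
    out = np.empty((n * m, 2))
    k = 0
    for i in range(n):
        for j in range(m):
            out[k, 0] = x[i]
            out[k, 1] = y[j]
            k += 1
    return out

The output is in C-order (last column varying fastest), matching what cartesian produces.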

@albop (Contributor) commented Oct 1, 2017

Here is a small gist to test the claims above: https://gist.github.com/albop/a4e6af9311fe9a9a392462ed757018bb
I compared cartesian, gridmake and a custom implementation in numba (2d and 3d only). I also added one version which assumes the intervals are regularly spaced (so it doesn't read memory).
The results below seem consistent with what I said before: cartesian is faster than gridmake (but the result is not the same, as it is C-order instead of F-order), and the numba implementation is 4 times faster. Apparently computing the coordinates (instead of reading them) doesn't improve anything.

# the product grids of K elements along each of the d dimensions is performed N times
test_1(N=100, d=2, K=1000) # cartesian : 0.499 seconds
test_2(N=100, d=2, K=1000) # gridmake : 0.609 seconds
test_3(N=100, d=2, K=1000) # numba : 0.114 seconds
test_4(N=100, d=2, K=1000) # numba regular : 0.141 seconds
test_1(N=100, d=3, K=100) # cartesian : 0.828 seconds
test_2(N=100, d=3, K=100) # gridmake : 1.012 seconds
test_3(N=100, d=3, K=100) # numba : 0.166 seconds
test_4(N=100, d=3, K=100) # numba regular : 0.164 seconds

@cc7768 (Member) commented Oct 4, 2017

I haven't had time to properly review this, but it looks like @oyamad has done a pretty thorough review. The code looks well organized and nicely written in the 10 minutes I spent reading through it.

One "Python" vs "Julia" comment that I have is in Julia you write functions that take a type as an argument, but in Python you typically attach these functions to the class itself as methods -- This means it might make sense to have some of these functions (in particular, any of the functions that take the RepeatedGame class as the first argument) become methods of the RepeatedGame class. This might mean that it makes sense to have nH be an argument to RepeatedGame. One argument against this is that it would make it so that the interface in Julia and Python be slightly different, so I'm not 100% convinced but wanted to bring it up. A possible way around this is just to have outerapproximation be the only additional method RepeatedGame and that would take the nH as an argument. Anyways, just thinking out loud. Am keen to hear what others think.

I'm not surprised that this code is a bit slower (this is precisely the type of example where one would expect Julia to perform better). It would be nice to investigate whether this could be sped up a little, but I don't think that is a first order priority for now.

@oyamad (Member) commented Oct 6, 2017

@cc7768 I agree that outerapproximation would be better as a method of RepeatedGame (but with nH as an argument of the method).

For speedups, we might implement an LP solver (a simple simplex method) in Numba as a medium-term project.

@@ -353,7 +353,7 @@ def outerapproximation(rpd, nH=32, tol=1e-8, maxiter=500, check_pure_nash=True,
b[nH+1] = (1-delta)*flow_u_2(rpd, a1, a2) - \
(1-delta)*best_dev_payoff_2(rpd, a1) - delta*_w2

-lpout = linprog(c, A_ub=A, b_ub=b, bounds=(lb, ub))
+lpout = linprog(c, A_ub=A, b_ub=b, bounds=(lb, ub), method='interior-point')

Member:
Should be something like:

def outerapproximation(..., linprog_method='simplex'):
    ...
    lpout = linprog(c, A_ub=A, b_ub=b, bounds=(lb, ub), method=linprog_method)

(scipy version 1.0.0 has not been released.)

@coveralls

Changes Unknown when pulling 48021ff on QBatista:repeated-games into QuantEcon:master.

@QBatista (Member Author)

@oyamad Here is a visualization of how the algorithm works on a few experiments: https://nbviewer.jupyter.org/github/QBatista/Notebooks/blob/master/JYC_Algo_Visualization.ipynb

An important observation is that the quality of the approximation is not weakly increasing in the number of gradients. For this game, choosing 32 gradients seems to give a better approximation than choosing 127 gradients, which I suspect is due to the symmetry of the payoffs. Additionally, the approximation appears to be very sensitive to the geometry of the initial guess.
Absent a very good initial guess, the number of gradients needs to be much higher than the number of vertices to get a good approximation. I suspect this is because of a combination of the algorithm's monotonicity and outer-approximation properties: once convex combinations of the approximation vertices are close to the true vertices, those vertices cannot move much, even if they are still far from the boundary of the solution set. A high number of gradients increases the flexibility of the shape of the approximation and therefore mitigates this issue.
I therefore think it would be better to intelligently pop similar vertices as we iterate: periodically check for vertices that are within a certain tolerance of one another and remove them, which would let us set a large default number of gradients.
There are different options for doing this. A naive approach would be to replace two similar vertices with a convex combination of them; the problem is that the outer-approximation property would not necessarily be preserved. A better approach would be to replace the approximation with a new polytope that ignores the edge connecting two similar vertices. This means losing the monotonicity property of the algorithm, which is not a real issue because only a finite number of vertices can be popped.
Another point to consider is how to pop chains of similar vertices. One way would be to pop the whole group in one go, but this might be too aggressive if the true set has several close vertices. Instead, we could increase the frequency of the checks depending on how many vertices were popped at the last check, on the basis that the probability of finding further similar vertices is likely increasing in that number. Alternatively, we could adjust the frequency based on the length and number of chains of similar vertices.
Based on the behavior of the algorithm with 128 gradients, it seems we could eliminate between 1/3 and 1/2 of the computations by avoiding unnecessary calls to the linear programming solver with this kind of feature, while also improving the quality of the approximation.
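
As a proof of concept, the periodic check could be something like the following hypothetical helper (not part of this PR), reusing the rounding and np.unique idea discussed in the review above:

import numpy as np

def pop_similar_vertices(vertices, tol=1e-8):
    # Hypothetical helper: drop vertices that coincide with an earlier vertex
    # up to `tol`, keeping the first occurrence and the original ordering.
    # Requires numpy >= 1.13 for the `axis` argument of np.unique.
    tol_int = int(round(abs(np.log10(tol))) - 1)
    _, inds = np.unique(np.round(vertices, tol_int), axis=0, return_index=True)
    return vertices[np.sort(inds), :]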

@cc7768 (Member) commented Oct 20, 2017

These visualizations are very cool. Nice work @QBatista. I think @thomassargent30 would be quite interested in seeing these visualizations.

Is there a reason you think that the value set corresponding to 32 subgradients looks much better than the one with 127? I kind of see what you mean since there are a few extra points between (3, 3) and the other two vertices, but they don't seem far off that line.

I agree the solution will have some dependence on the geometry of the initial guess (I think the difference between the set with 127 points and with 128 points actually illustrates this nicely -- the 127-point version doesn't necessarily have (0, 1), (1, 0), (0, -1), and (-1, 0) to work with, which seem to be important components of this value set, so it has to place more points along what should be the vertical line). This algorithm finds the smallest convex set (for a given set of subgradients!) that contains the fixed point of the B operator. A natural way to investigate this further would be to work on writing up the inner approximation, which is also described by JYC. The fixed point of the B operator should lie between the inner and outer approximations -- my suspicion is that there are some very cool graphs you could draw that show the dependence on the initial geometry and how the inner and outer approximations differ for different games/geometries.

I'm hesitant to be too aggressive with popping vertices. It does seem that not all vertices end up mattering very much, but I suspect it is hard to determine algorithmically which are the ones that we should keep. For example, in this game (-1, 0) and (0, -1) seem to be important vertices. I would be interested in seeing some of the tests you described above though as a proof-of-concept.

@oyamad (Member) commented Oct 22, 2017

@QBatista Animations look very nice.

Maybe we need "a better understanding of the manner in which extreme points of the equilibrium payoff set are generated" (Abreu and Sannikov, 2014). We should study Abreu and Sannikov (as we talked).

@mmcky (Contributor) commented Oct 23, 2017

@oyamad Is your last comment suggesting further development of this PR or a new project?

@oyamad (Member) commented Oct 24, 2017

@mmcky I suggested a new project. (I am afraid the JYC algorithm is too inefficient for pure Python/NumPy.)

@mmcky (Contributor) commented Feb 26, 2018

Hi @oyamad and @QBatista. Shall I close this PR? There is a lot of interesting work in this branch, so I don't want to remove it without that work being documented. From reading this thread, we won't merge it at this time.

@mmcky closed this on Apr 4, 2018
@oyamad mentioned this pull request on Sep 19, 2018