Allow matrix structure in covariance matrices to be exploited #1643

kshedden · 2014-04-28T07:46:09Z

This PR adds a new method, covariance_matrix_solve, to the CovStruct class. It solves a system of linear equations whose coefficient matrix is the covariance matrix represented by the CovStruct instance.

Currently, we construct this covariance matrix explicitly, and solve these systems with a general-purpose solver. However, for many of the dependence structures, the linear algebra can be optimized to exploit the structure of the matrix.

We provide a default implementation of covariance_matrix_solve that uses a general-purpose solver.

We override this with optimized methods for the independence, autoregressive, and exchangeable cases.

The speed improvement would be most noticeable when the cluster sizes are large.
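As a rough illustration of the idea (not the code in this PR), the sketch below compares a general-purpose solve against a structured solve for the exchangeable case. The function names `general_solve` and `exchangeable_solve` and their signatures are hypothetical; the structured version assumes the covariance is diag(sdev) R diag(sdev) with R = (1 - a) I + a 11', and inverts R with the Sherman-Morrison formula so no k-by-k matrix is ever formed.

```python
import numpy as np


def general_solve(corr, sdev, rhs):
    """Hypothetical baseline: build the covariance explicitly, solve densely."""
    cov = corr * np.outer(sdev, sdev)
    return [np.linalg.solve(cov, x) for x in rhs]


def exchangeable_solve(dep_param, sdev, rhs):
    """Hypothetical structured solve for an exchangeable correlation.

    Uses R^{-1} z = (z - a * sum(z) / (1 + a * (k - 1))) / (1 - a),
    which costs O(k) per right-hand-side column instead of O(k^3).
    """
    k = len(sdev)
    out = []
    for x in rhs:
        # Broadcast the standard deviations over columns for 2-D inputs.
        scale = sdev if x.ndim == 1 else sdev[:, None]
        z = x / scale
        z = (z - dep_param * z.sum(axis=0) / (1.0 + dep_param * (k - 1)))
        z /= (1.0 - dep_param)
        out.append(z / scale)
    return out


# Small check that both routes agree on random inputs.
rng = np.random.default_rng(0)
k, a = 6, 0.3
sdev = rng.uniform(0.5, 2.0, size=k)
corr = (1.0 - a) * np.eye(k) + a * np.ones((k, k))
rhs = (rng.normal(size=(k, 3)), rng.normal(size=k))

fast = exchangeable_solve(a, sdev, rhs)
slow = general_solve(corr, sdev, rhs)
agree = all(np.allclose(f, s) for f, s in zip(fast, slow))
```

The tuple `rhs` mirrors the `(dmat, resid)` pair passed in the diff discussed below: one 2-D and one 1-D right-hand side solved against the same matrix.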

Tests for the independence and exchangeable cases match results obtained from R (as did the earlier code).

We don't have a comparator for the autoregressive case, so I added a regression test. It agrees with the results from the previous (non-optimized) implementation.

coveralls · 2014-04-28T07:59:00Z

Coverage remained the same when pulling 45728d3 on kshedden:gee-linalg-refactor2 into b52bc09 on statsmodels:master.

josef-pkt · 2014-04-28T12:57:08Z

statsmodels/genmod/generalized_estimating_equations.py

-            vinv_d = spl.cho_solve(vco, dmat)
+            rslt = self.cov_struct.covariance_matrix_solve(expval, i,
+                                                sdev, (dmat, resid))
+            vinv_d, vinv_resid = tuple(rslt)


rslt might be None?

coveralls · 2014-04-28T15:23:35Z

Coverage remained the same when pulling 078eaae on kshedden:gee-linalg-refactor2 into b52bc09 on statsmodels:master.

josef-pkt · 2014-04-28T15:56:56Z

statsmodels/genmod/dependence_structures/covstruct.py

+            y /= sdev[:, None]
+
+            if flatten:
+                y = y[:,0]


np.squeeze would be safer, I think: IIUC, it will cause a shape error later if y.shape[1] > 1.
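A small NumPy illustration of the difference, with made-up arrays: `y[:, 0]` always returns a 1-D result and silently discards any extra columns, while `np.squeeze` only removes axes of length one, so a wider array stays 2-D and the mismatch shows up downstream. Passing `axis=1` makes the check explicit.

```python
import numpy as np

col = np.arange(3.0).reshape(3, 1)   # a single column
wide = np.arange(6.0).reshape(3, 2)  # two columns

first_of_col = col[:, 0]             # shape (3,)
first_of_wide = wide[:, 0]           # shape (3,): second column dropped silently

squeezed_col = np.squeeze(col)       # shape (3,)
squeezed_wide = np.squeeze(wide)     # shape (3, 2): left unchanged

strict_col = np.squeeze(col, axis=1) # shape (3,)

# With an explicit axis, squeezing a length-2 axis fails immediately.
try:
    np.squeeze(wide, axis=1)
    raised = False
except ValueError:
    raised = True
```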

josef-pkt · 2014-04-28T16:05:58Z

M-dependent is the only case I have really worked my way through, so I might postpone understanding this and just do a superficial review before merge.
Frank's m-dependent PR #1495 will need an update after this is merged.

coveralls · 2014-04-28T17:14:16Z

Coverage remained the same when pulling 4bf5117 on kshedden:gee-linalg-refactor2 into b52bc09 on statsmodels:master.

coveralls · 2014-04-29T05:57:40Z

Coverage remained the same when pulling 69e1166 on kshedden:gee-linalg-refactor2 into b52bc09 on statsmodels:master.

coveralls · 2014-04-29T06:05:55Z

Coverage remained the same when pulling 69e1166 on kshedden:gee-linalg-refactor2 into b52bc09 on statsmodels:master.

kshedden · 2014-04-29T06:11:38Z

Thanks for the comments. I think I've addressed everything.

The m-dependent structure needs a bit of cleanup. I will work with Frank on that.

josef-pkt · 2014-04-29T13:27:33Z

I'm not a big fan of rhs, but I don't see an alternative. Now with the docstring it's easy to understand.
We don't have good terminology for this kind of linear equation. Linear restrictions use r_matrix and q_matrix or something like this, R b = q.
rhs is a bit confusing, because before the docstring I thought of y = x b and x is the "rhs", not y.

This looks like a clean branch according to the network tree, and can be merged with the green button.

josef-pkt · 2014-04-29T13:34:29Z

statsmodels/genmod/dependence_structures/covstruct.py

+        for x in rhs:
+            if x.ndim == 1:
+                x1 = x / stdev
+                y = x1 / (1 - self.dep_params)


side comment, doesn't need to be changed

in general, I still prefer float literals for whole-number constants, to signal that we are working with floats, not integers.
(1. - self.dep_params)

(I still have a left-over habit to hunt for bugs caused by integer division.)

josef-pkt · 2014-04-29T13:55:36Z

one more: can you inherit the docstring for the methods in the subclasses? for update, covariance_matrix_solve, ...
e.g. generalized linear model has this
remove_data.__doc__ = base.LikelihoodModelResults.remove_data.__doc__

I mentioned this in Frank's PR
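A minimal sketch of that docstring-inheritance pattern applied to a covariance structure. The classes here are simplified stand-ins, not the statsmodels classes, and the method signature is an assumption based on the call shown in the diff above.

```python
import numpy as np


class CovStruct:
    """Simplified stand-in for the base class."""

    def covariance_matrix_solve(self, expval, index, stdev, rhs):
        """Solve V y = x for each x in rhs, where V is the covariance matrix."""
        raise NotImplementedError


class Independence(CovStruct):
    """Simplified stand-in: V is diagonal with entries stdev**2."""

    def covariance_matrix_solve(self, expval, index, stdev, rhs):
        var = stdev ** 2
        out = []
        for x in rhs:
            # Divide rows by the variances; broadcast over columns if 2-D.
            out.append(x / var if x.ndim == 1 else x / var[:, None])
        return out

    # Reuse the base-class docstring instead of repeating it.
    covariance_matrix_solve.__doc__ = CovStruct.covariance_matrix_solve.__doc__


stdev = np.array([1.0, 2.0, 4.0])
resid = np.array([1.0, 4.0, 16.0])
(solved,) = Independence().covariance_matrix_solve(None, 0, stdev, (resid,))
```

The assignment runs inside the class body, where the freshly defined function is still a plain local name, so no decorator or metaclass is needed.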

josef-pkt · 2014-04-29T13:56:39Z

I think this is about ready to merge.
Kerby, whenever you think it's ready, I will hit the merge button.

kshedden · 2014-04-30T09:40:48Z

Ready to merge now. Thanks for your help.

REF: Allow matrix structure in covariance matrices to be exploited

josef-pkt · 2014-05-20T15:22:12Z

merged

Refactor to allow matrix structure in covariance matrices to be exploited

45728d3

josef-pkt reviewed Apr 28, 2014

Improve handling of linear algebra issues; add options to fine tune iterations

078eaae

josef-pkt reviewed Apr 28, 2014

Fixed typo NOne -> None

4bf5117

kshedden added 2 commits April 29, 2014 01:43

Minor renaming and docstring edits

a0fb449

Minor typographical changes

69e1166

josef-pkt added PR labels Apr 29, 2014

josef-pkt reviewed Apr 29, 2014

Minor docstring and naming changes, ready to merge

8e19eb9

josef-pkt added a commit that referenced this pull request May 20, 2014

Merge pull request #1643 from kshedden/gee-linalg-refactor2

fb2a7de

REF: Allow matrix structure in covariance matrices to be exploited

josef-pkt merged commit fb2a7de into statsmodels:master May 20, 2014

kshedden deleted the gee-linalg-refactor2 branch June 9, 2014 01:55

PierreBdR pushed a commit to PierreBdR/statsmodels that referenced this pull request Sep 2, 2014

Merge pull request statsmodels#1643 from kshedden/gee-linalg-refactor2

533f50e

REF: Allow matrix structure in covariance matrices to be exploited
