Mediation analysis #2352

kshedden · 2015-04-04T03:47:36Z

This is an implementation of mediation analysis following the approach of Imai and collaborators.

coveralls · 2015-04-04T19:15:09Z

Coverage increased (+0.05%) to 83.95% when pulling 00a8a43 on kshedden:mediation into a1f53b0 on statsmodels:master.

coveralls · 2015-04-04T19:32:45Z

Coverage increased (+0.05%) to 83.95% when pulling 00a8a43 on kshedden:mediation into a1f53b0 on statsmodels:master.

coveralls · 2015-04-04T23:13:37Z

Coverage increased (+0.04%) to 83.95% when pulling 836396e on kshedden:mediation into a1f53b0 on statsmodels:master.

kshedden · 2015-04-05T03:10:46Z

A notebook illustrating the mediation procedure:

http://nbviewer.ipython.org/urls/umich.box.com/shared/static/jpmd9y99259u6dv0rj6p46993981m7zm.ipynb

josef-pkt · 2015-04-18T16:06:12Z

just some general comments:

I looked at the reference briefly after you submitted the PR. I understand the basic idea but haven't figured out the assumptions yet. Overall I'm far behind in my readings about the differences of "causality" between statistics and econometrics, mainly Imbens and Rubin articles (and a new book).

Essentially, the direction that we have been and are going is that additional functionality in statistics are your area in statsmodels and I try to keep up enough to help with the integration into statsmodels.
(It took me more than half a year to understand the basic idea of GEE and it's relationship to GMM, and several years to understand most of GLM/LEF/QMLE)

One question here is where to put it. Should we create a new subdirectory causality or treatment or with a similar category? Or can we add it to stats or another existing subpackage.

Similar question for propensity score matching, regression discontinuity design, and whatever else will come into statsmodels in future.
I have no idea whether we will get SEM and graphical models anytime soon, but I doubt it.

kshedden · 2015-04-19T16:25:11Z

Thanks for the comments. I don't have a strong opinion about where to put it, your suggestions are all fine with me.

I've been using this in a project and it holds up pretty well. I used to just use the simple method of multiplying coefficients but was never sure if that was meaningful outside of linear models. This approach seems more correct for generic use.

coveralls · 2015-05-01T04:14:30Z

Coverage increased (+0.04%) to 83.95% when pulling 511f602 on kshedden:mediation into a1f53b0 on statsmodels:master.

sjgiorgi · 2015-07-24T15:59:14Z

Line 351 of mediation.py: index = ["ACME (control)", "ACME (treated)", "ADE (control)", "ADE (treated)",
Line 356 of mediation.py: for i, vec in enumerate([self.ACME_ctrl, self.ACME_tx, self.ADE_tx, self.ADE_ctrl,

Seems like self.ADE_tx and self.ADE_ctrl should be switched on line 356?

kshedden · 2015-07-24T17:51:43Z

@sjgiorgi yes, you appear to be correct. I have fixed it. Thanks. Any other comments are very welcome.

josef-pkt · 2015-09-29T18:33:43Z

statsmodels/stats/mediation.py

+        self.ACME_ctrl = indirect_effects_avg[0]
+        self.ACME_tx = indirect_effects_avg[1]
+        self.ADE_ctrl = direct_effects_avg[0]
+        self.ADE_tx = direct_effects_avg[1]


since these are user facing attributes, longer more descriptive names would be usefull

josef-pkt · 2015-09-29T18:50:25Z

Looks good based on skimming the code

incomplete docstrings, needs eventually some standardized naming with treatment effects, and can go into new folder treatment (if we stick with that name)
related PR #2455 which doesn't allow different types of outcome or treatment models yet, and might be able to follow the same or similar pattern as here.

I did a bit of background reading and opened #2627 for a possible second approach.

josef-pkt · 2015-11-04T02:27:51Z

I'm merging this, naming convention might and location will still change before 0.8 release

ENH: Mediation analysis

josef-pkt · 2015-11-04T02:29:08Z

Thanks @kshedden

As I mentioned, I started to read to catch up with the topic but I'm still far behind.

kshedden force-pushed the mediation branch from bc6ef48 to cdcf3d4 Compare April 4, 2015 18:34

josef-pkt added type-enh comp-treatment labels Jun 23, 2015

josef-pkt added this to the 0.8 milestone Jun 23, 2015

josef-pkt mentioned this pull request Sep 28, 2015

ENH: mediation analysis - triply robust estimators, inverse probability weights #2627

Open

josef-pkt reviewed Sep 29, 2015
View reviewed changes

kshedden added 12 commits October 18, 2015 23:27

Initial commit

8426b66

get_distribution methods

d15abda

data file for tests

bdafeeb

moved from sandbox to stats directory

2c06215

Fixed import path

d73f5e7

Removed unwanted file from sandbox

e0e501e

Remove unwanted files from sandbox

e078b5b

Added gamma to fit_distribution

d87d88a

Add p-values to summary table

c4160cc

Fix py2 division error

d0a3b9d

Fix error in results (swapped rows)

6e2bcd5

Updates following code review

f0c1516

kshedden force-pushed the mediation branch from 175ca82 to f0c1516 Compare October 19, 2015 03:28

josef-pkt added a commit that referenced this pull request Nov 4, 2015

Merge pull request #2352 from kshedden/mediation

a561db1

ENH: Mediation analysis

josef-pkt merged commit a561db1 into statsmodels:master Nov 4, 2015

josef-pkt mentioned this pull request Feb 9, 2016

release 0.8 #2176

Closed

kshedden deleted the mediation branch July 1, 2016 03:59

josef-pkt mentioned this pull request Feb 5, 2017

ENH: predict_cdf predict_prob GLM gof #3415

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Mediation analysis #2352

Mediation analysis #2352

kshedden commented Apr 4, 2015

coveralls commented Apr 4, 2015

coveralls commented Apr 4, 2015

coveralls commented Apr 4, 2015

kshedden commented Apr 5, 2015

josef-pkt commented Apr 18, 2015

kshedden commented Apr 19, 2015

coveralls commented May 1, 2015

sjgiorgi commented Jul 24, 2015

kshedden commented Jul 24, 2015

josef-pkt Sep 29, 2015

josef-pkt commented Sep 29, 2015

josef-pkt commented Nov 4, 2015

josef-pkt commented Nov 4, 2015

Mediation analysis #2352

Mediation analysis #2352

Conversation

kshedden commented Apr 4, 2015

coveralls commented Apr 4, 2015

coveralls commented Apr 4, 2015

coveralls commented Apr 4, 2015

kshedden commented Apr 5, 2015

josef-pkt commented Apr 18, 2015

kshedden commented Apr 19, 2015

coveralls commented May 1, 2015

sjgiorgi commented Jul 24, 2015

kshedden commented Jul 24, 2015

josef-pkt Sep 29, 2015

Choose a reason for hiding this comment

josef-pkt commented Sep 29, 2015

josef-pkt commented Nov 4, 2015

josef-pkt commented Nov 4, 2015