Gformula sequential #32

pzivich · 2018-11-15T12:23:26Z

In reference to #30

Dividing TimeVaryGFormula into two different estimation methods. Monte Carlo (currently implemented) and Sequential Regression (new method). Monte Carlo works better for survival data while Sequential Regression works best for longitudinal data

Sequential regression uses the following process

at Q_t fit a regression model to those who survived till T=t.
predict Y_t based on that model for the intervention of interest
for those who followed the treatment plan AND had the outcome they have a 1 carried forward
all else who did NOT have the outcome, are considered censored (np.nan)
above process is repeated. For those WITH predicted outcomes, their predicted outcome is used in the model fitting. Those who were observed at Q_{t-1} but censored at Q_t have their observed outcome used

…ary. Still needs custom treatment support and testing

…as a reference. Had some weird values show-up on the sample data when converted to time chunks

pzivich · 2018-11-16T20:43:18Z

Sometimes risks go down over time when using these longitudinal methods (personal communication regarding AIPW). Even in the LTMLE paper, they have some risks go down over time. Still weird and feels unnatural to me

My best bet is to simulate some reasonable data and compare to R's ltmle estimated via gcomp=TRUE. If I can obtain consistent results, I will be more confident in my implementation

Might know the issue. In current implement; if have outcome at that time point then always gets reset to 1. However, it should only be set to 1 for FUTURE outcomes.

pzivich · 2018-11-16T21:51:21Z

Found the issue. It was a tricky little piece. In case I need to remember back, The outcomes for individuals ONLY is set to 1 iff they followed the treatment regime of interest, had the outcome, and had that outcome before the current iteration.

This is now caught by adding an additional condition asserting that the current outcome is NaN. This occurs in Step 2.3 of the estimation procedure

pzivich · 2018-11-20T12:18:57Z

Next step is to simulate data. It looks like it will be the easiest way. Some publicly available longitudinal data requires registering, so I don't think I can include the data with zEpid...

R's ltmle has some recipes for simulated data that would be a good starting point

pzivich added 3 commits November 6, 2018 10:04

Sequential regression estimation for the g-formula. Start of code

ad56fc6

Added Sequential Regression Estimator for g-formula. This is prelimin…

645cb6a

…ary. Still needs custom treatment support and testing

Cleaned out bugs from adapting code. Will compare to R's ltmle gcomp …

da70560

…as a reference. Had some weird values show-up on the sample data when converted to time chunks

pzivich mentioned this pull request Nov 15, 2018

Add LTMLE #19

Open

Something wrong somewhere. Risks sometimes go down over time

cd9a8db

pzivich added the enhancement label Nov 16, 2018

Needs further testing, but behaves as I would expect now

79fdb4c

pzivich changed the base branch from master to v0.4.0 November 20, 2018 18:30

pzivich added 2 commits November 28, 2018 16:03

Removed natural course reference in SeqReg

694ad87

Updated g-formula documentation

1866c0b

pzivich merged commit 1866c0b into v0.4.0 Dec 9, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Gformula sequential #32

Gformula sequential #32

pzivich commented Nov 15, 2018

pzivich commented Nov 16, 2018 •

edited

Loading

pzivich commented Nov 16, 2018

pzivich commented Nov 20, 2018

Gformula sequential #32

Gformula sequential #32

Conversation

pzivich commented Nov 15, 2018

pzivich commented Nov 16, 2018 • edited Loading

pzivich commented Nov 16, 2018

pzivich commented Nov 20, 2018

pzivich commented Nov 16, 2018 •

edited

Loading