Draft (new feature): Model to estimate when an intervention had effect #480
Conversation
Codecov Report

Attention: Patch coverage is

Additional details and impacted files:

```diff
@@            Coverage Diff             @@
##             main     #480      +/-   ##
==========================================
+ Coverage   94.40%   94.42%   +0.01%
==========================================
  Files          29       29
  Lines        2075     2241     +166
==========================================
+ Hits         1959     2116     +157
- Misses        116      125       +9
```

View full report in Codecov by Sentry.
…over intervention's distributions (force-pushed from b7b91de to ee701f2)
Note: In my last PR, I added functionality allowing users to specify priors for the effect of the intervention. This provides more flexibility for Bayesian modeling and allows users to incorporate domain knowledge directly into the inference process.
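For context, here is a hypothetical usage sketch. The keyword names are taken from later comments in this thread, but exactly how the priors are supplied is an assumption rather than confirmed API:

```python
# Hypothetical sketch: the treatment_type_effect dict presumably lets users attach
# prior parameters for each effect component; an empty list falls back to defaults.
from causalpy.pymc_models import InterventionTimeEstimator

model = InterventionTimeEstimator(
    time_variable_name="t",
    treatment_type_effect={"level": []},  # custom prior parameters could presumably go in this list
    sample_kwargs={"random_seed": 42},
)
```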
Hi @JeanVanDyk. I'm very excited about the new addition, thanks for putting it together. Bear with me if I'm sometimes slow to reply - busy dad life!
At this point, could I request that you put together an ipynb file that showcases the functionality? I'm assuming this PR will evolve and go through a couple of iterations, so this notebook will end up being a new docs page to help users understand the new functionality. If you edit the `./docs/source/notebooks/index.md` file, you can add the notebook name under the interrupted time series section. That way the notebook will render if you build the docs locally, and it should also render in the remote docs preview build.
Did you still want feedback on the GitHub Discussion at this point?
Simplified

@JeanVanDyk I just did a quick update. Hoping to start taking a decent look at this now. Will try to provide some comments even though I know this is not 100% finished.
- Can you run `make uml`? This will generate updated UML plots (e.g. `classes.png`).
- Looking at that generated class diagram (and not yet having looked at the `InterventionTimeEstimator` code), the hope would be to avoid overriding the `calculate_impact`, `predict`, `score` and `fit` methods. These are hopefully generic enough to be dealt with in the `PyMCModel` base class. Like I say, I've not yet taken a look at the code itself, but this would be the hope.
- I cannot currently run the new notebook. When the first model fit is attempted I get the following traceback (a possible workaround is sketched after this list):
```
TypingError                               Traceback (most recent call last)
Cell In[4], line 10
      2 from causalpy.pymc_models import InterventionTimeEstimator as ITE
      4 model = ITE(
      5     time_variable_name="t",
      6     treatment_type_effect={"level": []},
      7     sample_kwargs={"sample_seed": seed},
      8 )
---> 10 result = ITS(
     11     data=df,
     12     treatment_time=None,
     13     formula="y ~ 1 + t",
     14     model=model,
     15 )

File ~/git/CausalPy/causalpy/experiments/interrupted_time_series.py:284, in InterruptedTimeSeries.__init__(self, data, treatment_time, formula, model, **kwargs)
    281     raise ValueError("Model type not recognized")
    283 # score the goodness of fit to the pre-intervention data
--> 284 self.score = self.model.score(X=self.pre_X, y=self.pre_y)
    286 # Postprocessing with handler
    287 self.datapre, self.datapost, self.pre_y, self.pre_X, self.treatment_time = (
    288     self.handler.data_postprocessing(
    289         self.model, data, idata, treatment_time, self.pre_y, self.pre_X
    290     )
    291 )

File ~/git/CausalPy/causalpy/pymc_models.py:773, in InterventionTimeEstimator.score(self, X, y)
    771 mu_ts = az.extract(mu_ts, group="posterior_predictive", var_names="mu_ts").T
    772 # Note: First argument must be a 1D array
--> 773 return r2_score(y.data, mu_ts)

File ~/mambaforge/envs/CausalPy/lib/python3.13/site-packages/arviz/stats/stats.py:1167, in r2_score(y_true, y_pred)
   1134 def r2_score(y_true, y_pred):
   1135     """R² for Bayesian regression models. Only valid for linear models.
   1136
   1137     Parameters
   (...) 1165
   1166     """
-> 1167 r_squared = r2_samples(y_true=y_true, y_pred=y_pred)
   1168 return pd.Series([np.mean(r_squared), np.std(r_squared)], index=["r2", "r2_std"])

File ~/mambaforge/envs/CausalPy/lib/python3.13/site-packages/arviz/stats/stats.py:1127, in r2_samples(y_true, y_pred)
   1125     var_e = _numba_var(svar, np.var, (y_true - y_pred))
   1126 else:
-> 1127     var_y_est = _numba_var(svar, np.var, y_pred, axis=1)
   1128     var_e = _numba_var(svar, np.var, (y_true - y_pred), axis=1)
   1129 r_squared = var_y_est / (var_y_est + var_e)

File ~/mambaforge/envs/CausalPy/lib/python3.13/site-packages/arviz/utils.py:372, in _numba_var(numba_function, standard_numpy_func, data, axis, ddof)
    350 """Replace the numpy methods used to calculate variance.
    351
    352 Parameters
    (...) 369
    370 """
    371 if Numba.numba_flag:
--> 372     return numba_function(data, axis=axis, ddof=ddof)
    373 else:
    374     return standard_numpy_func(data, axis=axis, ddof=ddof)

File ~/mambaforge/envs/CausalPy/lib/python3.13/site-packages/arviz/stats/stats_utils.py:535, in stats_variance_2d(data, ddof, axis)
    533     var = np.zeros(a_a)
    534     for i in range(a_a):
--> 535         var[i] = stats_variance_1d(data[i], ddof=ddof)
    536 else:
    537     var = np.zeros(b_b)

File ~/mambaforge/envs/CausalPy/lib/python3.13/site-packages/arviz/utils.py:205, in maybe_numba_fn.__call__(self, *args, **kwargs)
    203 """Call the jitted function or normal, depending on flag."""
    204 if Numba.numba_flag:
--> 205     return self.numba_fn(*args, **kwargs)
    206 else:
    207     return self.function(*args, **kwargs)

File ~/mambaforge/envs/CausalPy/lib/python3.13/site-packages/numba/core/dispatcher.py:424, in _DispatcherBase._compile_for_args(self, *args, **kws)
    420 msg = (f"{str(e).rstrip()} \n\nThis error may have been caused "
    421        f"by the following argument(s):\n{args_str}\n")
    422 e.patch_message(msg)
--> 424 error_rewrite(e, 'typing')
    425 except errors.UnsupportedError as e:
    426     # Something unsupported is present in the user code, add help info
    427     error_rewrite(e, 'unsupported_error')

File ~/mambaforge/envs/CausalPy/lib/python3.13/site-packages/numba/core/dispatcher.py:365, in _DispatcherBase._compile_for_args.<locals>.error_rewrite(e, issue_type)
    363     raise e
    364 else:
--> 365     raise e.with_traceback(None)

TypingError: Failed in nopython mode pipeline (step: nopython frontend)
non-precise type pyobject
During: typing of argument at /Users/benjamv/mambaforge/envs/CausalPy/lib/python3.13/site-packages/arviz/stats/stats_utils.py (517)

File "../../../../../mambaforge/envs/CausalPy/lib/python3.13/site-packages/arviz/stats/stats_utils.py", line 517:
def copy(self, deep=True):  # pylint:disable=overridden-final-method
    @conditional_jit(nopython=True)
    ^

During: Pass nopython_type_inference

This error may have been caused by the following argument(s):
- argument 0: Cannot determine Numba type of <class 'xarray.core.dataarray.DataArray'>
```
- I think there should be no changes to `its_skl.ipynb`, right? Can we revert those changes back to the state on main, so there's no change in that file in this PR.
- Same for `its_covid.ipynb`?
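As referenced in the third bullet above, one possible workaround for the `score` failure would be to hand plain numpy arrays to `az.r2_score` rather than `xarray.DataArray` objects, which arviz's Numba-jitted variance helpers cannot type. This is only a sketch; `bayesian_r2` is a hypothetical helper, not CausalPy code:

```python
# Hypothetical workaround sketch: convert xarray objects to plain numpy arrays
# before calling az.r2_score, so the Numba-jitted path receives types it can compile.
import arviz as az
import numpy as np


def bayesian_r2(y_obs, mu_samples):
    """Compute the Bayesian R² on numpy arrays rather than xarray.DataArray."""
    y_true = np.asarray(y_obs)        # shape (n_obs,)
    y_pred = np.asarray(mu_samples)   # shape (n_samples, n_obs), posterior predictive draws
    return az.r2_score(y_true, y_pred)
```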
```diff
@@ -40,6 +40,7 @@ did_pymc_banks.ipynb
 its_skl.ipynb
 its_pymc.ipynb
 its_covid.ipynb
+its_no_treatment_time.ipynb
```
This is good. But in the notebook, can you ensure you only have ONE top-level markdown header? Otherwise the extra headers get added as additional entries on the How To index page.
Okay, sure. I'll fix that!
As for the two notebooks you mentioned, I don't recall making any changes to them. I only ran them to check whether the modifications contained any small mistakes. I'll try reverting those changes.
Here is the graph @drbenvincent: [generated UML class diagram image]
Hi @drbenvincent, I've made some changes and will summarize them here, addressing the points you raised:
Handlers

- Can we rename these to make them more clear? E.g. `UnknownTreatmentTimeHandler` and `KnownTreatmentTimeHandler`.
- We discussed a range of options to do this, and I still think the handler pattern is quite a good approach. If we did conditional logic in the main `InterruptedTimeSeries` class then it would be disconnected and unclear. Though I think we should formalise the interface with an abstract base class, along the lines of the sketch below. Any major objections?
```python
from abc import ABC, abstractmethod


class TreatmentTimeHandler(ABC):
    @abstractmethod
    def data_preprocessing(self, data, treatment_time, model): ...

    @abstractmethod
    def data_postprocessing(self, *args, **kwargs): ...  # exact signature to be pinned down

    @abstractmethod
    def plot_intervention_line(self, ax, *args, **kwargs): ...

    @abstractmethod
    def plot_impact_cumulative(self, ax, *args, **kwargs): ...

    def plot_treated_counterfactual(self, ax, *args, **kwargs):
        """Optional: override if needed."""
        pass
```
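Continuing that sketch, here is a rough illustration of how the proposed rename would slot into the interface. The method bodies are assumptions for illustration only; the real splitting and plotting logic lives in the existing handler code:

```python
# Rough illustration only: a concrete handler for the case where the treatment
# time is known, implementing the abstract interface sketched above.
class KnownTreatmentTimeHandler(TreatmentTimeHandler):
    def data_preprocessing(self, data, treatment_time, model):
        # split the data into pre- and post-intervention periods
        pre = data[data.index < treatment_time]
        post = data[data.index >= treatment_time]
        return pre, post

    def data_postprocessing(self, *args, **kwargs):
        # nothing extra to infer when the treatment time is supplied by the user
        return args

    def plot_intervention_line(self, ax, treatment_time, **kwargs):
        ax.axvline(x=treatment_time, linestyle="--", color="k")

    def plot_impact_cumulative(self, ax, *args, **kwargs):
        # placeholder: cumulative impact plotting as done in the current experiment code
        pass
```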
`treatment_type_effect` input

Minor gripes about this. The data structure is potentially a bit complex and allows the user to add multiple keys (i.e. effects). What do you think about changing the API to something like this:
```python
model = ITE(
    time_variable_name="t",
    treatment_effect_type="level",   # Required: one of "level", "trend", "impulse"
    treatment_effect_params=None,    # Optional: custom parameters
    sample_kwargs={"sample_seed": seed},
)
```
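One side benefit of the flat string parameter is that validation becomes trivial. A hypothetical sketch, using the names from the proposal above (nothing here is existing CausalPy code):

```python
# Hypothetical validation helper for the proposed flat-string API.
VALID_EFFECT_TYPES = {"level", "trend", "impulse"}


def _validate_effect_type(treatment_effect_type: str) -> str:
    if treatment_effect_type not in VALID_EFFECT_TYPES:
        raise ValueError(
            f"treatment_effect_type must be one of {sorted(VALID_EFFECT_TYPES)}, "
            f"got {treatment_effect_type!r}"
        )
    return treatment_effect_type
```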
Notebook

- Rather than using the abbreviation `ITE`, can you just import it as `InterventionTimeEstimator` (see the one-line example below)? We want to maximise clarity for the reader and not require that they remember acronyms :)
- Sorry to drag it out, but I will have to leave a proper review of the notebook for another day. But we are definitely getting there. Maybe a couple more small iterations.
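For example, the notebook import would simply read:

```python
# Import the full class name rather than aliasing it to ITE
from causalpy.pymc_models import InterventionTimeEstimator
```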
Architecture / API thoughts

At the moment there is a change from the normal API, in that for the first time we would be directly telling the model about the data when we create it, by giving it `time_variable_name="t"`, like this:
```python
model = ITE(
    time_variable_name="t",
    treatment_type_effect={"impulse": []},
    sample_kwargs={"random_seed": seed, "target_accept": 0.95},
)
```
All the other models just consume `sample_kwargs` and don't override the `PyMCModel.__init__` method:

CausalPy/causalpy/pymc_models.py (lines 71 to 78 in 8b26d42):

```python
def __init__(self, sample_kwargs: Optional[Dict[str, Any]] = None):
    """
    :param sample_kwargs: A dictionary of kwargs that get unpacked and passed to the
        :func:`pymc.sample` function. Defaults to an empty dictionary.
    """
    super().__init__()
    self.idata = None
    self.sample_kwargs = sample_kwargs if sample_kwargs is not None else {}
```
Your approach is to call `PyMCModel.__init__` via `super().__init__(sample_kwargs)` and then add the other arguments to the model's `self`.

I don't yet know if this is deeply problematic, but it is a change in terms of information flow, in that it's the first time we are giving the models information about the data while they are created. My temptation would be to get rid of this and stick with the current pattern.

An important point to think about here is that the `build_model` method is not actually called until we call `PyMCModel.fit`, so perhaps we can instead provide the `time_variable_name` and `treatment_type_effect` related stuff to the experiment and pass it to the model via `build_model`.
I'm tempted to try to revert to the current architecture. Something along the lines of:

```python
class InterruptedTimeSeries:
    def fit(self):
        time_var = self._extract_time_variable_from_formula(self.formula)
        if isinstance(self.model, ITE):
            idata = self.model.fit(X, y, coords, time_variable=time_var)
        else:
            idata = self.model.fit(X, y, coords)
```
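To make that information flow concrete, here is a minimal, self-contained toy (assumed names throughout, not CausalPy's actual classes): the sampler kwargs stay in `__init__`, while data-dependent details such as the time variable only reach the model at fit time, through `build_model`. The same path could carry the `treatment_type_effect` settings:

```python
# Toy illustration of the suggested pattern (not CausalPy code): __init__ only
# stores sampler kwargs; build_model receives data-related arguments at fit time.
import numpy as np
import pymc as pm


class ToySwitchpointModel:
    def __init__(self, sample_kwargs=None):
        self.sample_kwargs = sample_kwargs or {}
        self.idata = None
        self.model = None

    def build_model(self, t, y, **kwargs):
        # the time variable arrives here, not in __init__; extra effect-related
        # settings could be accepted via **kwargs
        with pm.Model() as self.model:
            switchpoint = pm.DiscreteUniform("switchpoint", lower=int(t.min()), upper=int(t.max()))
            level = pm.Normal("level", mu=0.0, sigma=1.0)
            mu = pm.math.switch(switchpoint <= t, level, 0.0)
            pm.Normal("y_obs", mu=mu, sigma=0.2, observed=y)

    def fit(self, t, y, **build_kwargs):
        self.build_model(t, y, **build_kwargs)
        with self.model:
            self.idata = pm.sample(**self.sample_kwargs)
        return self.idata


# Example usage on synthetic data with a level shift at t = 30
t = np.arange(50)
y = np.where(t >= 30, 1.0, 0.0) + np.random.default_rng(0).normal(0, 0.2, size=50)
toy = ToySwitchpointModel(sample_kwargs={"draws": 500, "tune": 500, "random_seed": 0})
toy.fit(t, y)
```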
New Feature: `InterventionTimeEstimator` for Unknown Treatment Timing

This PR introduces a new model, `InterventionTimeEstimator`, designed to estimate when an intervention has an effect in a time series, especially in cases where the exact treatment time is unknown.

Use Case
This addition gives users a flexible, Bayesian approach to model treatment timing uncertainty directly within the CausalPy framework.
Notes / Open Questions

Where should this model fit into the CausalPy workflow?
I'm unsure whether `InterventionTimeEstimator` should be integrated within the `InterruptedTimeSeries` (ITS) feature, or used as a standalone tool. This affects how a user-defined model could be supported. Depending on the intended usage, I can propose a solution to allow users to inject their own custom models.

Custom model usage: base vs. intervention
Should users be able to:
Covariates
I considered adding time-varying covariates to improve the fit. Would that be useful or out of scope?
Multivariate Time Series
It's relatively easy to extend the model for multivariate input. Let me know if this is something you'd like to see.
Model Summary

Inputs:

- `t`: 1D array of time points
- `y`: 1D array of observed values
- `span`: restricts the window for switchpoint detection
- `coords`: can include `seasons` for modeling periodic effects
- `effect`: list of components, e.g. `"trend"`, `"level"`, `"impulse"`
- `grain_season`: number of time steps per season

Model Components:

- intercept + trend + seasonal
- `effect` components
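Reading the component list above, the mean function presumably has roughly the following shape. This is a loose reconstruction for orientation only, not the exact parameterisation in the code; here $\tau$ is the estimated intervention time, $G$ is `grain_season`, and only the components listed in `effect` are switched on:

$$
\mu_t = \underbrace{\alpha + \beta t + s_{t \bmod G}}_{\text{intercept + trend + seasonal}}
      + \mathbb{1}[t \ge \tau]\left(\delta_{\text{level}} + \delta_{\text{trend}} (t - \tau)\right)
      + \delta_{\text{impulse}}\,\mathbb{1}[t = \tau]
$$

In particular, the exact shape of the impulse term and the way `span` constrains $\tau$ should be taken from the implementation rather than from this sketch.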
Feel free to share any feedback or suggestions! I'm happy to refine the model or explore extensions based on your input.
📚 Documentation preview 📚: https://causalpy--480.org.readthedocs.build/en/480/