# Chapter 4: Effect Modification

## 4.1 Heterogeneity of Treatment Effects

We say that $V$ is a modifier of the effect of $A$ on $Y$ when the average causal effect of $A$ on $Y$ varies across levels of $V$. Because the effect can be measured on different scales (risk difference, risk ratio), whether $V$ counts as an effect modifier depends on which measure is used.

When there is effect modification for the risk *difference*, we say that there is effect modification on the *additive* scale. For the risk *ratio*, it is effect modification on the *multiplicative* scale.

Additive Effect Modification:
$$E[Y^{a=1}-Y^{a=0}|V=1]\neq E[Y^{a=1}-Y^{a=0}|V=0]$$
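
For comparison, the corresponding condition on the multiplicative scale can be written in the same notation. This is a sketch that assumes the risk ratio is defined in both levels of $V$, i.e. $E[Y^{a=0}|V=v]>0$.

Multiplicative Effect Modification:
$$\frac{E[Y^{a=1}|V=1]}{E[Y^{a=0}|V=1]}\neq \frac{E[Y^{a=1}|V=0]}{E[Y^{a=0}|V=0]}$$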

Interestingly, the text notes that we do not consider effect modification on the odds ratio scale because the odds ratio is rarely, if ever, the parameter of interest for causal inference.

If the effect modification "flips" the result (i.e., the average causal effects go in opposite directions across levels of $V$), then we say there is *qualitative effect modification*, as opposed to effects that differ only in magnitude (e.g., 1.5 vs 1.7).
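
To make the scale dependence concrete, here is a minimal Python sketch. The counterfactual risks are made-up numbers, and the function names (`describe_modification` and friends) are hypothetical helpers, not anything from the text. It checks whether the risk difference and risk ratio vary across levels of $V$, and whether the risk differences have opposite signs.

```python
import math

def risk_difference(risk_treated, risk_untreated):
    return risk_treated - risk_untreated

def risk_ratio(risk_treated, risk_untreated):
    return risk_treated / risk_untreated

def describe_modification(risks_by_level, tol=1e-9):
    """risks_by_level maps each level of V to the pair
    (Pr[Y^{a=1}=1 | V], Pr[Y^{a=0}=1 | V]). Values are hypothetical."""
    rds = [risk_difference(r1, r0) for r1, r0 in risks_by_level.values()]
    rrs = [risk_ratio(r1, r0) for r1, r0 in risks_by_level.values()]
    return {
        # Additive: the risk difference is not the same in every level of V.
        "additive": any(not math.isclose(rd, rds[0], abs_tol=tol) for rd in rds),
        # Multiplicative: the risk ratio is not the same in every level of V.
        "multiplicative": any(not math.isclose(rr, rrs[0], abs_tol=tol) for rr in rrs),
        # Qualitative: effects point in opposite directions.
        "qualitative": min(rds) < -tol and max(rds) > tol,
    }

# Risk ratio is 2 in both levels, but the risk difference is 0.1 vs 0.2.
same_ratio = describe_modification({0: (0.2, 0.1), 1: (0.4, 0.2)})
# Treatment raises risk when V=0 and lowers it when V=1.
opposite_sign = describe_modification({0: (0.3, 0.2), 1: (0.1, 0.2)})

print(same_ratio)
print(opposite_sign)
```

In the first example $V$ is a modifier on the additive scale but not on the multiplicative scale, which is why the choice of effect measure has to be stated.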

## 4.2 Stratification to Identify Effect Modification

A stratified analysis is the natural way to identify effect modification. This is done by computing the causal effect of $A$ on $Y$ in each level (stratum) of the variable $V$.
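
A stratified analysis of this kind can be sketched in a few lines of Python. The sketch assumes a marginally randomized experiment, so that the associational risk difference within each stratum of $V$ equals the causal one. The records and the helper names (`expand`, `stratum_risk_differences`) are hypothetical.

```python
from collections import defaultdict

def expand(v, a, n, events):
    """Turn a count summary into individual (v, a, y) records."""
    return [(v, a, 1)] * events + [(v, a, 0)] * (n - events)

def stratum_risk_differences(records):
    """Risk difference Pr[Y=1|A=1,V=v] - Pr[Y=1|A=0,V=v] for each level v.
    Assumes every stratum contains both treated and untreated individuals."""
    tallies = defaultdict(lambda: [0, 0])  # (v, a) -> [n, events]
    for v, a, y in records:
        tallies[(v, a)][0] += 1
        tallies[(v, a)][1] += y
    effects = {}
    for v in sorted({v for v, _ in tallies}):
        n1, e1 = tallies[(v, 1)]
        n0, e0 = tallies[(v, 0)]
        effects[v] = e1 / n1 - e0 / n0
    return effects

# Hypothetical data: 10 treated and 10 untreated individuals per stratum.
records = (expand(0, 1, 10, 4) + expand(0, 0, 10, 2)
           + expand(1, 1, 10, 3) + expand(1, 0, 10, 6))
effects_by_v = stratum_risk_differences(records)
print(effects_by_v)
```

With these numbers the risk difference is about 0.2 when $V=0$ and about -0.3 when $V=1$, so $V$ would be a (qualitative) additive effect modifier in this made-up population.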

## 4.3 Why care about effect modification

If a drug affects women negatively and men positively, this is a pretty important thing to know when deciding whether to make the drug mainstream.

Therefore, there is generally no such thing as "*the* causal effect of $A$ on $Y$", but instead, "the average causal effect of $A$ on $Y$ in a population with a particular mix of causal effect modifiers".
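
The dependence on the mix of modifiers is easy to see on the additive scale, where the population average effect is the weighted average of the stratum-specific effects, with weights $\Pr[V=v]$. The following Python sketch uses hypothetical stratum effects and population shares. `population_average_effect` is an illustrative name, not something from the text.

```python
def population_average_effect(effect_by_level, share_by_level):
    """Average risk difference in a population, given the risk difference in
    each level of V and the share of the population in that level."""
    if abs(sum(share_by_level.values()) - 1.0) > 1e-9:
        raise ValueError("shares must sum to 1")
    return sum(effect_by_level[v] * share_by_level[v] for v in effect_by_level)

# Hypothetical stratum-specific risk differences.
effect_by_level = {0: 0.2, 1: -0.3}

# Same stratum effects, three populations with different mixes of V.
mostly_v0 = population_average_effect(effect_by_level, {0: 0.8, 1: 0.2})
balanced = population_average_effect(effect_by_level, {0: 0.5, 1: 0.5})
cancelling = population_average_effect(effect_by_level, {0: 0.6, 1: 0.4})

print(mostly_v0, balanced, cancelling)
```

The same stratum-specific effects give a positive, a negative, and an approximately null average effect depending only on the distribution of $V$.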

The extrapolation of causal effects computed in one population to a second population is referred to as *transportability* of causal inferences across populations. Some refer to the lack of transportability as lack of *external validity*.

Unfortunately, it is not easy to make sure that there is transportability of causal effects. Thus, it is often understood as an unverifiable assumption that relies heavily on subject-matter knowledge.

For instance, most experts would agree that health effects of increasing a household's annual income by $100 in Niger cannot be transported to the Netherlands, but most would agree that the health effects of use of cholesterol-lowering drugs in Europeans can be transported to Canadians.

In addition, in some situations, transportability is not a concern. For example, Smith and Pell (2003) could not identify any meaningful effect modifiers for the question "Does having a parachute increase the survival rate of jumping from an airplane?", since all humans, regardless of gender, race, etc., would die if they didn't have a parachute.

## 4.4 Stratification as a Form of Adjustment

Stratification can be used as a way to adjust for certain covariates. For example, if computing the causal effect separately for men and women, you do not have to worry about confounding from gender.
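
As a rough illustration, the Python sketch below compares a crude risk difference with the risk differences computed within levels of a single binary covariate $L$. The counts are invented so that $L$ is associated with both treatment and outcome, and the sketch assumes conditional exchangeability given $L$, which the data alone cannot confirm. The function names are hypothetical.

```python
# Hypothetical counts: (l, a) -> (number of individuals, number with Y=1).
counts = {
    (0, 1): (20, 2), (0, 0): (80, 8),
    (1, 1): (80, 40), (1, 0): (20, 10),
}

def crude_risk_difference(counts):
    """Risk difference ignoring L entirely."""
    risks = {}
    for a in (0, 1):
        n = sum(c[0] for (l, arm), c in counts.items() if arm == a)
        events = sum(c[1] for (l, arm), c in counts.items() if arm == a)
        risks[a] = events / n
    return risks[1] - risks[0]

def risk_difference_within_levels(counts):
    """Risk difference computed separately in each level of L."""
    out = {}
    for l in sorted({l for l, _ in counts}):
        n1, e1 = counts[(l, 1)]
        n0, e0 = counts[(l, 0)]
        out[l] = e1 / n1 - e0 / n0
    return out

crude = crude_risk_difference(counts)
within = risk_difference_within_levels(counts)
print(crude, within)
```

Here the crude risk difference is about 0.24 even though the risk difference is 0 within each level of $L$. The crude association in these invented counts comes entirely from the imbalance in $L$ between the treated and the untreated.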

## 4.5 Matching as Another Form of Adjustment

The goal of matching is to construct a subset of the population in which the variables $L$ have the same distribution in both the treated and the untreated.

Because the matched population is a subset of the original study population, the distribution of causal effect modifiers in the matched study population will generally differ from that in the original.
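
The sketch below illustrates both points with one simple variant of matching: exact 1:1 matching on a single binary $L$, pairing each treated individual with one untreated individual who has the same value of $L$. This is an assumption made for illustration (real analyses often select matches at random and match on several variables), and the population is hypothetical.

```python
from collections import defaultdict

def match_one_to_one(population):
    """population: list of (person_id, a, l) tuples.
    Pairs each treated person with one unused untreated person with the same l.
    Returns the matched population (treated plus their matches)."""
    untreated_pool = defaultdict(list)
    for person in population:
        if person[1] == 0:
            untreated_pool[person[2]].append(person)
    matched = []
    for person in population:
        if person[1] == 1 and untreated_pool[person[2]]:
            matched.append(person)
            matched.append(untreated_pool[person[2]].pop(0))
    return matched

def share_l1(people, a=None):
    """Proportion with L=1, optionally restricted to treatment group a."""
    group = [p for p in people if a is None or p[1] == a]
    return sum(p[2] for p in group) / len(group)

# Hypothetical population: 4 treated (3 with L=1), 8 untreated (3 with L=1).
treated = [(i, 1, 1) for i in range(3)] + [(3, 1, 0)]
untreated = [(4 + i, 0, 1) for i in range(3)] + [(7 + i, 0, 0) for i in range(5)]
population = treated + untreated

matched = match_one_to_one(population)
print(share_l1(population), share_l1(matched, a=1), share_l1(matched, a=0))
```

In the matched population, 75% of both the treated and the untreated have $L=1$, compared with 50% in the original population. If $L$ is also an effect modifier, the average effect in the matched population would refer to a different mix of modifiers than the original.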

## 4.6 Effect Modification and Adjustment Methods

Standardization, IP weighting, stratification/restriction, and matching are different approaches to estimate average causal effects, but they estimate different types of causal effects. These 4 approaches can be divided into two groups:
- Standardization and IP Weighting can be used to compute either the marginal or conditional effects
- Stratification/restriction and Matching can only be used to compute conditional effects in certain subsets of the population. (For matching, this is often a *consequence*, not the intention)
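
The contrast between the two groups can be sketched in Python for a single binary $L$. The counts are hypothetical, the sketch assumes conditional exchangeability given $L$ and positivity, and the function names are illustrative. Standardization and IP weighting both return the marginal risk difference, while stratification returns one conditional risk difference per level of $L$.

```python
# Hypothetical counts: (l, a) -> (number of individuals, number with Y=1).
counts = {
    (0, 1): (20, 4), (0, 0): (80, 8),
    (1, 1): (80, 40), (1, 0): (20, 14),
}
levels = sorted({l for l, _ in counts})
total = sum(n for n, _ in counts.values())

def level_size(l):
    return counts[(l, 0)][0] + counts[(l, 1)][0]

def standardized_risk(a):
    """Sum over l of Pr[Y=1 | A=a, L=l] * Pr[L=l]."""
    return sum((counts[(l, a)][1] / counts[(l, a)][0]) * (level_size(l) / total)
               for l in levels)

def ip_weighted_risk(a):
    """Risk among those with A=a, each weighted by 1 / Pr[A=a | L=l]."""
    weighted_events = 0.0
    weighted_n = 0.0
    for l in levels:
        n, events = counts[(l, a)]
        weight = level_size(l) / n  # inverse of Pr[A=a | L=l]
        weighted_events += weight * events
        weighted_n += weight * n
    return weighted_events / weighted_n

marginal_std = standardized_risk(1) - standardized_risk(0)
marginal_ipw = ip_weighted_risk(1) - ip_weighted_risk(0)
conditional = {l: counts[(l, 1)][1] / counts[(l, 1)][0]
                  - counts[(l, 0)][1] / counts[(l, 0)][0] for l in levels}

print(marginal_std, marginal_ipw, conditional)
```

With these numbers the two marginal estimates agree (about -0.05), while the conditional risk differences are about 0.1 for $L=0$ and -0.2 for $L=1$.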

All 4 approaches require exchangeability and positivity but the subsets of the population in which these conditions need to hold depend on the causal effect of interest.

In essence, the previous chapter argued that a well-defined causal effect is a prerequisite for meaningful causal inference. This chapter argues that a well-characterized target population is another prerequisite. While these two prerequisites are usually met in experiments (assuming a valid design), in observational studies they cannot be taken for granted.