# Chapter 6: Graphical Representation of Causal Effects

Causal inference generally requires expert knowledge and untestable assumptions about the causal network linking treatment, outcome, and other variables.

In complex situations, it becomes crucial to be explicit about what we know and what we assume about the variables relevant to our particular causal inference problem.

DAGs are a tool to represent our qualitative expert knowledge and *a priori* assumptions about the causal structure of interest.

By summarizing knowledge and assumptions in an intuitive way, graphs help clarify conceptual problems and enhance communication among investigators.

> "*draw your assumptions before your conclusions*"

## 6.1 Causal Diagrams

We define a directed acyclic graph (DAG) $G$ to be a graph whose nodes (vertices) are random variables $V=(V_1,...,V_M)$ with directed edges (arrows) and no directed cycles.

We use $PA_m$ to denote the parents of $V_m$, i.e., the set of nodes from which there is a direct arrow *into* $V_m$. The variable $V_m$ is a descendant of $V_j$ (and $V_j$ is an ancestor of $V_m$) if there is a sequence of nodes connected by edges between $V_j$ and $V_m$ such that, following the direction indicated by the arrows, one can reach $V_m$ by starting at $V_j$.

We adopt the ordering convention that if $m>j$, $V_m$ is not an ancestor of $V_j$.
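
The definitions above can be sketched in a few lines of Python. This is a minimal illustration, not part of the original chapter: the example graph ($L\rightarrow A$, $L\rightarrow Y$, $A\rightarrow Y$), the `children` dictionary, and all function names are assumptions chosen for this sketch. A topological order plays the role of the ordering convention, since no later node is an ancestor of an earlier one.

```python
from collections import deque

# Hypothetical example graph: L -> A, L -> Y, A -> Y, stored as child lists.
children = {"L": ["A", "Y"], "A": ["Y"], "Y": []}


def parents(children, node):
    """Nodes with a direct arrow into `node` (the set PA_m in the text)."""
    return {v for v, kids in children.items() if node in kids}


def descendants(children, node):
    """Nodes reachable from `node` by following the arrows forward."""
    seen, stack = set(), list(children[node])
    while stack:
        v = stack.pop()
        if v not in seen:
            seen.add(v)
            stack.extend(children[v])
    return seen


def ancestors(children, node):
    """Nodes that have `node` among their descendants."""
    return {v for v in children if node in descendants(children, v)}


def topological_order(children):
    """Kahn's algorithm; raises ValueError if there is a directed cycle."""
    indegree = {v: 0 for v in children}
    for kids in children.values():
        for k in kids:
            indegree[k] += 1
    queue = deque(sorted(v for v, d in indegree.items() if d == 0))
    order = []
    while queue:
        v = queue.popleft()
        order.append(v)
        for k in children[v]:
            indegree[k] -= 1
            if indegree[k] == 0:
                queue.append(k)
    if len(order) != len(children):
        raise ValueError("graph has a directed cycle, so it is not a DAG")
    return order
```

For the example graph, `topological_order(children)` returns `["L", "A", "Y"]`, so indexing the variables in that order satisfies the convention that $V_m$ is not an ancestor of $V_j$ when $m>j$.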

We define the distribution of $V$ to be Markov with respect to a DAG $G$ (equivalently, the distribution factors according to a DAG $G$) if, for each $j$, $V_j$ is independent of its non-descendants conditional on its parents. This latter statement is mathematically equivalent to the statement that the density $f(V)$ of the variables $V$ in DAG $G$ satisfies the Markov factorization
$$f(v)=\prod_{j=1}^{M}f(v_j|pa_j)$$

For example, in the chain
$$A_1\rightarrow A_2\rightarrow A_3$$
the Markov property implies that $A_3$ and $A_1$ are independent conditional on $A_2$.
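
As a short worked step (added here for illustration, using the factorization above), the chain has $PA_1=\emptyset$, $PA_2=\{A_1\}$, and $PA_3=\{A_2\}$, so the joint density factors as
$$f(a_1,a_2,a_3)=f(a_1)f(a_2|a_1)f(a_3|a_2)$$
Dividing both sides by $f(a_1,a_2)=f(a_1)f(a_2|a_1)$, wherever that density is positive, gives
$$f(a_3|a_1,a_2)=f(a_3|a_2)$$
which is exactly the statement that $A_3$ is independent of $A_1$ given $A_2$.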

A causal DAG is a DAG in which
1. the lack of an arrow from node $V_j$ to $V_m$ can be interpreted as the absence of a direct causal effect of $V_j$ on $V_m$ relative to the other variables in the graph
2. all common causes, even if unmeasured, of any pair of variables on the graph are themselves on the graph
3. any variable is a cause of its descendants

Causal DAGs are of no practical use unless we make an assumption linking the causal structure represented by the DAG to the data obtained in a study. This assumption, referred to as the causal Markov assumption, states that, conditional on its direct causes, a variable $V_j$ is independent of any variable for which it is not a cause. That is, conditional on its parents, $V_j$ is independent of its non-descendants; hence, under this assumption, the distribution of the variables on a causal DAG $G$ is Markov with respect to $G$.

## 6.2 Causal Diagrams and Marginal Independence

## 6.3 Causal Diagrams and Conditional Independence

## 6.4 Positivity and Consistency in Causal Diagrams

**D-separation**: We define a path to be either blocked or open according to the following graphical rules.
1. If there are no variables being conditioned on, a path is blocked if and only if two arrowheads on the path collide at some variable on the path. For instance, $L\rightarrow A\rightarrow Y$ is open, whereas $A\rightarrow Y\leftarrow L$ is blocked because two arrowheads on the path collide at $Y$. We call $Y$ a collider on the path $A\rightarrow Y\leftarrow L$.
2. Any path that contains a non-collider that has been conditioned on is blocked.
3. A collider that has been conditioned on does not block a path.
4. A collider that has a descendant that has been conditioned on does not block a path.

The above 4 rules can be summarized as:
A path is blocked if and only if it contains a non-collider that has been conditioned on, or it contains a collider that has not been conditioned on and has no descendants that have been conditioned on.

Two variables are $d$-separated if all paths between them are blocked.
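
The blocking rules translate fairly directly into code. The sketch below is one possible implementation, not taken from the source: it enumerates every simple path between two nodes (ignoring arrow direction) and applies the summary rule to each. The function names and the two example graphs are hypothetical, and the path enumeration is only practical for small diagrams.

```python
def descendants(children, node):
    """Nodes reachable from `node` by following the arrows forward."""
    seen, stack = set(), list(children[node])
    while stack:
        v = stack.pop()
        if v not in seen:
            seen.add(v)
            stack.extend(children[v])
    return seen


def all_paths(children, start, end):
    """All simple paths between start and end, ignoring arrow direction."""
    neighbors = {v: set(kids) for v, kids in children.items()}
    for v, kids in children.items():
        for k in kids:
            neighbors[k].add(v)
    paths, stack = [], [[start]]
    while stack:
        path = stack.pop()
        if path[-1] == end:
            paths.append(path)
            continue
        for n in neighbors[path[-1]]:
            if n not in path:
                stack.append(path + [n])
    return paths


def is_blocked(children, path, conditioned):
    """Apply the summary rule to every interior node of one path."""
    for prev, node, nxt in zip(path, path[1:], path[2:]):
        # A collider has arrowheads arriving from both neighbors on the path.
        collider = node in children[prev] and node in children[nxt]
        if collider:
            opened = node in conditioned or bool(
                descendants(children, node) & conditioned
            )
            if not opened:
                return True
        elif node in conditioned:
            return True  # a conditioned non-collider blocks the path
    return False


def d_separated(children, x, y, conditioned=()):
    """True if every path between x and y is blocked given `conditioned`."""
    conditioned = set(conditioned)
    return all(
        is_blocked(children, p, conditioned) for p in all_paths(children, x, y)
    )


# Hypothetical examples: a chain L -> A -> Y, and a collider A -> Y <- L
# in which Y has a descendant C.
chain = {"L": ["A"], "A": ["Y"], "Y": []}
collider = {"A": ["Y"], "L": ["Y"], "Y": ["C"], "C": []}
```

With these graphs, `d_separated(chain, "L", "Y")` is `False` and `d_separated(chain, "L", "Y", {"A"})` is `True`, while `d_separated(collider, "A", "L")` is `True` and becomes `False` after conditioning on either `Y` or its descendant `C`.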

The relationship between statistical independence and the purely graphical concept of $d$-separation relies on the causal Markov assumption: in a causal DAG, any variable is independent of its non-descendants conditional on its parents.

Pearl (1988) proved the following fundamental theorem: The causal Markov assumption implies that, given any 3 disjoint sets $A,B,C$ of variables, if $A$ is $d$-separated from $B$ conditional on $C$, then $A$ is statistically independent of $B$ given $C$.

Because causal diagrams encode our qualitative expert knowledge about the causal structure, they can be used as a visual aid to help conceptualize causal problems and guide data analyses. In fact, the formulas we described in Chapter 2 to quantify treatment effects (standardization and IP weighting) can also be derived using causal graph theory, as part of what is sometimes referred to as the *do-calculus*.

Regardless of the notation used (potential outcomes or graphs), exchangeability, positivity, and consistency are conditions required for causal inference via standardization or IP weighting.

## 6.5 A Structural Classification of Bias

We can describe how lack of exchangeability can result from two different causal structures:
1. Common causes: When the treatment and outcome share a common cause, the association measure generally differs from the effect measure. Many epidemiologists use the term *confounding* to refer to this bias.
2. Conditioning on common effects: This structure is the source of bias that many epidemiologists refer to as *selection bias under the null*.
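
Both structures can be illustrated with exact arithmetic on small made-up distributions. The sketch below is an assumption-laden toy example, not data from the source: all probabilities are invented, there is no effect of $A$ on $Y$ in either structure, and the helper names are hypothetical. Outcomes are stored as tuples, `(l, a, y)` for the common-cause structure and `(a, y, c)` for the common-effect structure.

```python
from fractions import Fraction
from itertools import product

half, quarter = Fraction(1, 2), Fraction(1, 4)


def bern(p, value):
    """Probability that a Bernoulli(p) variable takes `value` (0 or 1)."""
    return p if value == 1 else 1 - p


def cond_prob(joint, event, given):
    """P(event | given), where both are predicates over outcome tuples."""
    num = sum(p for o, p in joint.items() if given(o) and event(o))
    den = sum(p for o, p in joint.items() if given(o))
    return num / den


# Structure 1 (common cause): L -> A and L -> Y, no arrow from A to Y.
confounded = {}
for l, a, y in product((0, 1), repeat=3):
    p_high = 3 * quarter if l == 1 else quarter  # made-up value used for A and Y
    confounded[(l, a, y)] = bern(half, l) * bern(p_high, a) * bern(p_high, y)

# Associational risk difference P(Y=1 | A=1) - P(Y=1 | A=0).
assoc_rd = cond_prob(
    confounded, lambda o: o[2] == 1, lambda o: o[1] == 1
) - cond_prob(confounded, lambda o: o[2] == 1, lambda o: o[1] == 0)


def standardized_risk(a):
    """Sum over l of P(Y=1 | A=a, L=l) * P(L=l)."""
    return sum(
        cond_prob(
            confounded,
            lambda o: o[2] == 1,
            lambda o, l=l: o[0] == l and o[1] == a,
        )
        * bern(half, l)
        for l in (0, 1)
    )


standardized_rd = standardized_risk(1) - standardized_risk(0)

# Structure 2 (common effect): A -> C <- Y, with A and Y independent.
selected = {}
for a, y in product((0, 1), repeat=2):
    c = 1 if (a == 1 or y == 1) else 0  # C is determined by A and Y
    selected[(a, y, c)] = half * half

marginal_rd = cond_prob(
    selected, lambda o: o[1] == 1, lambda o: o[0] == 1
) - cond_prob(selected, lambda o: o[1] == 1, lambda o: o[0] == 0)

# The same contrast restricted to C = 1, i.e. conditioning on the collider.
selected_rd = cond_prob(
    selected, lambda o: o[1] == 1, lambda o: o[0] == 1 and o[2] == 1
) - cond_prob(selected, lambda o: o[1] == 1, lambda o: o[0] == 0 and o[2] == 1)
```

In this toy setup the associational risk difference under the common cause is 1/4 even though the standardized difference is 0, and restricting to $C=1$ turns a marginal difference of 0 into -1/2. Both non-null associations arise with no causal effect of $A$ on $Y$.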

The next 3 chapters will be about the 3 types of systematic bias:
- confounding
- selection
- measurement



## 6.6 The Structure of Effect Modification

Causal diagrams are less helpful to illustrate the concept of effect modification.

In addition, causal diagrams are in principle agnostic about the presence of interaction between two treatments $A$ and $E$.