# Describing the statistical tests

This notebook draws tree figures that illustrate the null and alternate hypotheses of each statistical test.

We first add the paths where the figures will be written.

In [5]:
from pathlib import Path

OUTPUT_ROOT = Path("~/repos/MutationDiseqMS").expanduser()
FIG_DIR = OUTPUT_ROOT / "figs_supp"
FIG_DIR.mkdir(parents=True, exist_ok=True)

## Test-of-equilibrium (TOE) on an edge

The edge of interest is termed the foreground edge (red); all other edges (black) are background edges.

Substitutions on the background edges are modelled by discrete-time Markov nucleotide processes (the BH model, as per Barry and Hartigan) in both the null and alternate hypotheses.

The foreground edge is modelled by a General Stationary Nucleotide (GSN) Markov model in the null hypothesis and a General Nucleotide (GN) Markov model in the alternate hypothesis.
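As a rough illustration of what "stationary" means for the null hypothesis, the sketch below checks whether a nucleotide distribution is left unchanged by one application of a transition matrix, that is, whether πP = π. It is a minimal example and not part of the analysis: the helper names (`apply_transition`, `is_stationary`) and the matrix values are made up for illustration, and it assumes the foreground process can be summarised by a single 4×4 transition probability matrix.

```python
def apply_transition(pi, P):
    """Return the distribution after one step: (pi P)_j = sum_i pi_i * P[i][j]."""
    n = len(pi)
    return [sum(pi[i] * P[i][j] for i in range(n)) for j in range(n)]


def is_stationary(pi, P, tol=1e-9):
    """True if pi is (numerically) unchanged by P, i.e. pi P == pi."""
    after = apply_transition(pi, P)
    return all(abs(a - b) <= tol for a, b in zip(pi, after))


# Hypothetical transition matrix (rows sum to 1); values are illustrative only.
P = [
    [0.7, 0.1, 0.1, 0.1],
    [0.1, 0.7, 0.1, 0.1],
    [0.1, 0.1, 0.7, 0.1],
    [0.1, 0.1, 0.1, 0.7],
]

uniform = [0.25, 0.25, 0.25, 0.25]  # unchanged by this symmetric P
skewed = [0.4, 0.3, 0.2, 0.1]       # changed by P, so not at equilibrium

print(is_stationary(uniform, P))
print(is_stationary(skewed, P))
```

In these terms, the null constrains the foreground process so that the starting nucleotide distribution is its equilibrium, while the alternate leaves the process free of that constraint.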

In [6]:
from cogent3 import make_tree


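def get_base_dnd():
    """Return a dendrogram of the three-taxon tree with shared figure styling."""
    tree = make_tree("(C,(A,B))")
    dnd = tree.get_figure()
    # line, font and marker sizes used by every figure in this notebook
    dnd.line_width = 2.5
    dnd.tip_font.size = 24
    dnd.marker = 1
    dnd.layout.update(
        legend_font=dict(size=20),
        width=600,
        height=450,
        margin=dict(l=60, r=10, t=25, b=60),
    )
    return dnd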

In [7]:
dnd = get_base_dnd()
dnd.style_edges(
    ["B", "C", "edge.0"], line=dict(color="black"), legendgroup="background"
)
dnd.style_edges("A", line=dict(color="red"), legendgroup="foreground")
dnd.show()
dnd.write(FIG_DIR / "stat-test-toe.pdf")

## Temporal equivalence of process (TEOP) test

In this case, we test whether the substitution process is equivalent between two edges on a tree (here A, red, and B, blue). Substitutions on all edges are modelled by GN Markov nucleotide processes. In the null, the foreground edges have the same GN process, which differs from that on the background edge. In the alternate, all edges have a distinct GN process.


In [8]:
dnd = get_base_dnd()
dnd.style_edges(["C", "edge.0"], line=dict(color="black"), legendgroup="background")
dnd.style_edges("A", line=dict(color="red"), legendgroup="A")
dnd.style_edges("B", line=dict(color="blue"), legendgroup="B")
dnd.show()
dnd.write(FIG_DIR / "stat-test-teop.pdf")

## Adjacent equivalence of process (AEOP) test

In this case, we test whether the substitution process is equivalent between two adjacent loci that have exactly the same tree topology. The background edges are modelled using the BH model, with a single foreground edge modelled using a GN process. In the null, the two alignments share every aspect of the substitution process (which corresponds to TOE). In the alternate, the two alignments have the same model form but do not share any parameters.
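The parameter bookkeeping implied by these two hypotheses can be sketched as follows. This is a hedged illustration only: it assumes the hypotheses are compared with a likelihood ratio, that the two loci are treated as independent so their log-likelihoods add under the alternate, and that "every aspect" includes all free parameters of the model. The function name `aeop_likelihood_ratio`, its arguments, and the numbers are hypothetical and are not results from this work.

```python
def aeop_likelihood_ratio(lnL_null, lnL_alt_locus1, lnL_alt_locus2, nfp_per_locus):
    """Return (LR, df) for a shared-parameter null versus a per-locus alternate.

    lnL_null        log-likelihood of both alignments under one shared parameter set
    lnL_alt_locus*  log-likelihood of each alignment fitted with its own parameters
    nfp_per_locus   number of free parameters in the model for a single locus
    """
    # Assumption: loci are independent, so the alternate log-likelihood is a sum.
    lnL_alt = lnL_alt_locus1 + lnL_alt_locus2
    lr = 2.0 * (lnL_alt - lnL_null)
    # The alternate has two full parameter sets, the null has one.
    df = 2 * nfp_per_locus - nfp_per_locus
    return lr, df


# Illustrative values only.
lr, df = aeop_likelihood_ratio(-2050.0, -1000.0, -1040.0, nfp_per_locus=20)
print(lr, df)
```

The degrees of freedom equal the number of free parameters for one locus, because the alternate duplicates the whole parameter set rather than adding any new kind of parameter.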
