# Stochastic Models in Neurocognition

## Class 4

<hr>

**Preliminary Notes**:

- etienne.tanre@inria.fr (numerical homework to send), Building Cauchy C0011, +33 (0)6 66 97 44 34
- February, final exam
- **tutorials**:
    - purely numeric: simulation of a process seen in class
    - To send to teacher + Josue: josue.tchouanti-fotso@unice.fr
        1. ***code***
        2. ***pdf file*** including for each question/model to simulate: goal, method chosen to achieve the goal, results, comments

<hr>

# 1 - Markov Chains

## Introduction

### Definition of Stochastic Process

> We consider a probability space $(\Omega, \mathcal{F}, \mathbb{P})$. 
>
> $(X_t)_{t\in I}$ is a stochastic process if:
\begin{align}
X_t:\Omega\times I&\rightarrow E\\
(\omega, t)&\mapsto X_t(\omega)
\end{align}
>
> We consider:
> - $I=\mathbb{N}$ the **discrete time** SP ($X_t$ can be said to be a time series)
> - $I=\mathbb{R}_+$ the **continuous time** SP
> - $E=\{0,1\}$.

<u>Examples for $E$:</u> any finite set, $E=\mathbb{R}$ or $E=\mathbb{R}^d$

#### Vocabulary

1. If $\omega$ is fixed, the map $t\mapsto X_t(\omega)$ is called a **path** or **trajectory**. 
2. If $t$ is fixed (a single time, not a list of times), $\omega\mapsto X_t(\omega)$ is a **random variable**, and we are usually interested in its **distribution** (or law).

### Definition of Markov Chains

Markov Chains are a particular case of a Stochastic Process:

> $I=\mathbb{N}$; we are only interested in discrete time SP.
> 
> A SP is a MC if and only if: $$\forall n\in I, \forall p\ge1,\mathcal{L}(X_{n+p}|X_n,...,X_1,X_0)=\mathcal{L}(X_{n+p}|X_n)$$
>
> $n$ represents the current time/present, $n+p$ represents the future.

### Why are we interested in MC?

The main reason is that **one can forget the past**. That is, to make a forecast or an estimate, we only need a snapshot of the current state as input.

<u>Example 1:</u>

\begin{align}
I&=\mathbb{N};\,\,E=\{0,1\}\\
1&\rightarrow\text{the neuron is excited};\,\,0\rightarrow\text{the neuron is not}\\
\forall n\in\mathbb{N},\,\,X_n&\in\{0,1\}
\end{align}

As such, the law at time $n+1$ given the state at time $n$ is completely described by the probability to be in state $1$ and the probability to be in state $0$.

\begin{align}
\mathbb{P}(X_{n+1}=1|X_n=0)&=\mathbb{P}_{0\rightarrow1}\\
\mathbb{P}(X_{n+1}=0|X_n=0)&=\mathbb{P}_{0\rightarrow0}=1-\mathbb{P}_{0\rightarrow1}\\
\mathbb{P}(X_{n+1}=1|X_n=1)&=\mathbb{P}_{1\rightarrow1}=1-\mathbb{P}_{1\rightarrow0}\\
\mathbb{P}(X_{n+1}=0|X_n=1)&=\mathbb{P}_{1\rightarrow0}\\
\text{with}&,\\
\mathbb{P}_{0\rightarrow0} + \mathbb{P}_{0\rightarrow1} &= 1\\
\mathbb{P}_{1\rightarrow0} + \mathbb{P}_{1\rightarrow1} &= 1
\end{align}

We can represent this as a matrix:

$$\begin{pmatrix} \mathbb{P}_{0\rightarrow0} & \mathbb{P}_{0\rightarrow1} \\ \mathbb{P}_{1\rightarrow0} & \mathbb{P}_{1\rightarrow1} \end{pmatrix}$$
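
As a small illustration of how this matrix is used, the sketch below builds it for two hypothetical parameter values (`p01` and `p10` are chosen arbitrarily, they are not from the course) and pushes the law of $X_0$ forward by one and two steps. The helper name `step_law` is an assumption made for this example.

```python
# Hypothetical parameter values, chosen only for illustration.
p01 = 0.3  # P(X_{n+1} = 1 | X_n = 0)
p10 = 0.6  # P(X_{n+1} = 0 | X_n = 1)

# Row i holds the law of X_{n+1} given X_n = i, so each row sums to 1.
P = [[1 - p01, p01],
     [p10, 1 - p10]]


def step_law(law, P):
    """Return the law of X_{n+1} from the law of X_n (row vector times matrix)."""
    return [sum(law[i] * P[i][j] for i in range(len(P)))
            for j in range(len(P[0]))]


law0 = [1.0, 0.0]         # X_0 = 0 almost surely
law1 = step_law(law0, P)  # law of X_1
law2 = step_law(law1, P)  # law of X_2
print(law1, law2)
```

Starting from $X_0=0$, the law of $X_1$ is simply the first row of the matrix.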

<u>Example 2:</u>

\begin{align}
X_{n+1}=1&\text{ with probability }q_{i,j\rightarrow1}\\
X_{n+1}=0&\text{ with probability }q_{i,j\rightarrow0} = 1-q_{i,j\rightarrow1}\\
\text{if }X_n=i&;\,\,X_{n-1}=j
\end{align}

If $q_{i,j\rightarrow1}\neq q_{i,1-j\rightarrow1}$, $X_n$ is not a MC.

<span style="color:red">For instance, if $q_{1,1\rightarrow1}\neq q_{1,0\rightarrow1}$, then $\mathbb{P}(X_{n+1}=1|X_n=1, X_{n-1}=1)=q_{1,1\rightarrow1}$ is not equal to $\mathbb{P}(X_{n+1}=1|X_n=1, X_{n-1}=0)=q_{1,0\rightarrow1}$, so the law of $X_{n+1}$ depends on more than $X_n$.</span>

In this case, we introduce a *new* SP. We consider:

\begin{align}
\hat{X}_n &= (X_n, X_{n-1})\\
\hat{E}&=\{(0,0), (0,1), (1,0), (1,1)\}
\end{align}

**Property**: $\hat{X}_n$ is a MC if

<span style="color:red">ADD PROB with MATRIX VALUES HERE</span>

<u>In general:</u>

> If one wants to consider a SP s.t. the law of the position at time $n+1$ depends on a finite past, say $X_n, X_{n-1},...,X_{n-d}$, we introduce: $\tilde{X}_n=(X_n, X_{n-1},...,X_{n-d})$ with $\tilde{E}=\{0,1\}^{d+1}$, and $\tilde{X}_n$ is a MC.
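
The construction can be sketched in code for the case $d=1$ of Example 2. The values in `q` are hypothetical, and the function names `augmented_transition` and `simulate` are assumptions made for this illustration. From the pair $(i,j)=(X_n,X_{n-1})$, the augmented chain can only move to $(1,i)$ or $(0,i)$.

```python
import random

# Hypothetical values of q[(i, j)] = P(X_{n+1} = 1 | X_n = i, X_{n-1} = j).
q = {(0, 0): 0.1, (0, 1): 0.4, (1, 0): 0.7, (1, 1): 0.9}


def augmented_transition(q):
    """Transition probabilities of the pair chain (X_n, X_{n-1}) on {0,1}^2."""
    T = {}
    for (i, j), q1 in q.items():
        # From (i, j) the pair chain moves to (1, i) or (0, i) only.
        T[(i, j)] = {(1, i): q1, (0, i): 1.0 - q1}
    return T


def simulate(q, x0, x1, n_steps, rng):
    """Simulate X_0, ..., X_{n_steps} by running the pair chain."""
    path = [x0, x1]
    state = (x1, x0)  # (present, previous)
    for _ in range(n_steps - 1):
        new = 1 if rng.random() < q[state] else 0
        state = (new, state[0])
        path.append(new)
    return path


T = augmented_transition(q)
path = simulate(q, 0, 0, 10, random.Random(0))
print(path)
```

Each row of `T` has exactly two non-zero entries, which reflects that the second coordinate of the new pair is forced to equal the first coordinate of the old one.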

## How to simulate a MC (<span style="color:red">HOMEWORK</span>)

We introduce 2 sequences of Bernoulli random variables with parameters $\mathbb{P}_{0\rightarrow1}$ and $\mathbb{P}_{1\rightarrow0}$:

\begin{align}
(Y_i)_{i\in \mathbb{N}}&,\,\,\mathbb{P}(Y_i=1)=\mathbb{P}_{0\rightarrow1}\\
(Z_i)_{i\in \mathbb{N}}&,\,\,\mathbb{P}(Z_i=1)=\mathbb{P}_{1\rightarrow0}\\
Z_i, Y_i&\in\{0, 1\}\\
X_{n+1} &= X_n(1-Z_{n+1}) + (1-X_n)Y_{n+1}
\end{align}

We assume that the two sequences are independent of each other and that $Y_1, ..., Y_n$ are IID, as are $Z_1, ..., Z_n$ (the two sequences do not share the same law in general).

A chain with the same law can be driven by a single sequence of uniforms:

\begin{align}
X_{n+1} &= X_n\mathbb{1}_{[\mathbb{P}_{1\rightarrow0}, 1]}(U_{n+1}) + (1-X_n)\mathbb{1}_{[0, \mathbb{P}_{0\rightarrow1}]}(U_{n+1})
\end{align}

where $(U_n)_{n\ge1}$ is a sequence of IID random variables with the uniform distribution on $[0,1]$.

#### Homework remarks

- Choose an **initial condition**, i.e. the value of $X_0$
- Choose the **parameters** $\mathbb{P}_{0\rightarrow1}\in[0,1]$ and $\mathbb{P}_{1\rightarrow0}\in[0,1]$
- Simulate many MC!
- Possible different implementations: simulate a Bernoulli variable either with a built-in Bernoulli function or from a uniform draw on $[0,1]$, i.e. $U\sim\mathcal{U}[0,1]$, $Y=\mathbb{1}_{[0,p]}(U)\sim\mathcal{B}(p)$
    1. simulate the full sequence $X_n$
    2. simulate the sequence with the uniform
    3. simulate using a pruning algorithm
    4. TIME THEM
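
A minimal sketch of the first two implementations, using only Python's standard `random` module. The function names and the parameter values are assumptions made for this example, not a reference solution. Since `rng.random()` returns a value in $[0,1)$, the closed intervals of the indicator functions are replaced by the tests `u < p01` and `u >= p10`, which differ only on an event of probability zero.

```python
import random


def simulate_two_bernoulli(x0, p01, p10, n, rng):
    """X_{k+1} = X_k (1 - Z_{k+1}) + (1 - X_k) Y_{k+1}, with Y ~ B(p01), Z ~ B(p10)."""
    x = [x0]
    for _ in range(n):
        y = 1 if rng.random() < p01 else 0  # success means a jump 0 -> 1
        z = 1 if rng.random() < p10 else 0  # success means a jump 1 -> 0
        x.append(x[-1] * (1 - z) + (1 - x[-1]) * y)
    return x


def simulate_one_uniform(x0, p01, p10, n, rng):
    """Same chain, driven by a single uniform draw per step."""
    x = [x0]
    for _ in range(n):
        u = rng.random()
        stay_at_1 = 1 if u >= p10 else 0  # indicator of [p10, 1]
        jump_to_1 = 1 if u < p01 else 0   # indicator of [0, p01]
        x.append(x[-1] * stay_at_1 + (1 - x[-1]) * jump_to_1)
    return x


# Hypothetical parameters, chosen only for illustration.
rng = random.Random(0)
path_a = simulate_two_bernoulli(0, 0.3, 0.6, 20, rng)
path_b = simulate_one_uniform(0, 0.3, 0.6, 20, rng)
print(path_a)
print(path_b)
```

To compare the running times, each call can be wrapped between two readings of `time.perf_counter()` on a long path.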

# 2 - Continuous time Markov Processes

Continuous time means that $I=\mathbb{R}_+$.

## Definition

$$\forall t\ge 0, s>0, \mathcal{L}(X_{t+s}|(X_u)_{0\le u \le t}) = \mathcal{L}(X_{t+s}|X_t)$$

<u>Example 1 (simple):</u>

$E=\{0,1\}$, the MP is completely described by two rates $\alpha$ (the rate of jump from state $0$ to state $1$) and $\beta$ (the rate of jump from state $1$ to state $0$). 

**definition of rate**: 

> For physicists, the rate is: $\mathbb{P}(X_{t+\delta}=1|X_{t}=0)\approx\alpha\delta$ for small $\delta$.
>
> More precisely: $\underset{\delta\rightarrow0}{\lim}\,\, \frac{1}{\delta}\mathbb{P}(X_{t+\delta}=1|X_{t}=0)=\alpha$

<span style="color:red">ADD GRAPH</span>

## Link with MC

<span style="color:red">ADD GRAPH</span>

We introduce a "small" parameter $\delta$ (the time resolution of the measurement, which bounds the highest rate of the underlying process that can be resolved) and the discrete time process $\tilde{X}_n = X_{n\delta}$.

\begin{align}
\mathbb{P}(\tilde{X}_{n+1}=1|\tilde{X}_n=0)\approx \alpha\delta\\
\mathbb{P}(\tilde{X}_{n+1}=0|\tilde{X}_n=1)\approx \beta\delta\\
\end{align}

We can approximate the continuous time MP with a MC ($Y_n$) with parameters $\mathbb{P}_{0\rightarrow1}=\alpha\delta$ and $\mathbb{P}_{1\rightarrow0}=\beta\delta$.

If we set $Z_t = Y_{\lfloor t/\delta\rfloor}$, $Z_t$ has a law close to the law of $X_t$: $\mathcal{L}(Z_t)\approx\mathcal{L}(X_t)$.

## MC/MP relation

These 2 processes $Y_n$ and $X_t$ take only two values: 0 and 1. As such, they are **fully characterized by the times at which they jump**. 

<span style="color:red">ADD GRAPH</span>

The knowledge of $(\tau_1, ...,\tau_k,...)$ is equivalent to the knowledge of the complete trajectory.

Consider the sequence (with $\alpha\delta<1$):
\begin{align}
\tilde{\tau}_1&=\delta\times\text{(step of the first jump of the Markov chain $(Y_n)$), the first jumping time}\\
\tilde{\tau}_2&=\delta\times\text{(step of the second jump), the second...}\\
t&=p\delta;\,\,Y_0=0\\
\mathbb{P}(\tilde{\tau}_1> t)&=(1-\alpha\delta)^p = \left(1-\frac{\alpha t}{p}\right)^p\underset{p\rightarrow\infty}{\longrightarrow} \exp(-\alpha t)
\end{align}

<span style="color:red">ADD GRAPH</span>

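\begin{align}
t&=p\delta\text{ fixed ($p$ going to infinity is equivalent to letting $\delta$ go to 0)}\\
\left(1-\frac{\alpha t}{p}\right)^p&=\exp\left(p\log\left(1-\frac{\alpha t}{p}\right)\right) = \exp\left(p\left(-\frac{\alpha t}{p}+O\left(\frac{1}{p^2}\right)\right)\right)\underset{p\rightarrow\infty}{\longrightarrow}\exp(-\alpha t)
\end{align}

<span style="color:red">ADD TWO MISSING FORMULAS</span>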

> **We recognize that $\tau_1\sim\mathcal{E}(\lambda)$, the exponential law with parameter $\lambda=\alpha$.**
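
The convergence can be checked numerically. The sketch below evaluates $(1-\alpha\delta)^p$ with $\delta=t/p$ for increasing $p$ and prints it next to $\exp(-\alpha t)$. The values of `alpha` and `t` are hypothetical and the function name `survival_chain` is an assumption made for this example.

```python
import math

# Hypothetical rate and time, chosen only for illustration.
alpha, t = 2.0, 0.5


def survival_chain(alpha, t, p):
    """Probability that the chain makes no jump during its first p steps, with delta = t / p."""
    delta = t / p
    return (1.0 - alpha * delta) ** p


for p in (10, 100, 1000, 10000):
    print(p, survival_chain(alpha, t, p), math.exp(-alpha * t))
```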

<hr>

#### Reminder on Exponential laws

\begin{align}
\mathcal{E}(\lambda)\text{ has density } f(\theta)&=0\text{ if $\theta<0$}\\
&=\lambda \exp(-\lambda\theta)\text{ if $\theta\ge0$}
\end{align}

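The cumulative distribution function of $Z\sim\mathcal{E}(\lambda)$ is:

\begin{align}
F(\theta)=\mathbb{P}(Z\le\theta)&=0\text{ if $\theta<0$}\\
&=1-\exp(-\lambda\theta)\text{ if $\theta\ge0$}
\end{align}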

For $Z\sim\mathcal{E}(\lambda)$:

\begin{align}
\mathbb{E}[Z] &= \int^{+\infty}_0\lambda\theta \exp(-\lambda\theta)d\theta=\frac{1}{\lambda}\\
\mathrm{Var}(Z) &= \frac{1}{\lambda^2}
\end{align}

The exponential distribution has no memory, which means that for all $s,t\ge0$:

\begin{align}
\mathbb{P}(Z\ge t+s|Z\ge t) = \mathbb{P}(Z\ge s)
\end{align}
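
As a short check, the property follows from the survival function $\mathbb{P}(Z\ge t)=\exp(-\lambda t)$, which is obtained by integrating the density above:

\begin{align}
\mathbb{P}(Z\ge t+s|Z\ge t) = \frac{\mathbb{P}(Z\ge t+s)}{\mathbb{P}(Z\ge t)} = \frac{\exp(-\lambda(t+s))}{\exp(-\lambda t)} = \exp(-\lambda s) = \mathbb{P}(Z\ge s)
\end{align}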

<hr>

Thanks to this simple computation, one can directly **simulate the jumping times of the continuous time MP**. Starting from $X_0=0$, the waiting times alternate between the two rates:

\begin{align}
\tau_1&\overset{\mathcal{L}}{=}\mathcal{E}(\alpha)\\
\tau_2-\tau_1&\overset{\mathcal{L}}{=}\mathcal{E}(\beta)\\
\tau_3-\tau_2&\overset{\mathcal{L}}{=}\mathcal{E}(\alpha)\\
\end{align}
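
A minimal sketch of this direct simulation, assuming $X_0=0$ and strictly positive rates. It uses `random.expovariate`, whose argument is the rate $\lambda$ (so the mean is $1/\lambda$). The function names `jump_times` and `state_at` and the parameter values are assumptions made for this example.

```python
import random


def jump_times(alpha, beta, t_max, rng):
    """Jump times of the two-state process on [0, t_max], starting from X_0 = 0."""
    times = []
    t, state = 0.0, 0
    while True:
        # Waiting time is E(alpha) in state 0 and E(beta) in state 1.
        rate = alpha if state == 0 else beta
        t += rng.expovariate(rate)
        if t > t_max:
            return times
        times.append(t)
        state = 1 - state


def state_at(times, t):
    """Value of X_t: starting from 0, the state flips at every jump time <= t."""
    return sum(1 for tau in times if tau <= t) % 2


# Hypothetical rates and horizon, chosen only for illustration.
rng = random.Random(0)
taus = jump_times(2.0, 3.0, 5.0, rng)
print(taus)
print([state_at(taus, s) for s in (0.0, 1.0, 2.0, 5.0)])
```

The list of jump times is enough to recover the whole trajectory, which is what `state_at` does.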

## How to simulate a MP (<span style="color:red">HOMEWORK</span>)

Simulate $(Y_n)$ with parameters $\alpha\delta$ and $\beta\delta$, starting from $Y_0=0$. Evaluate the distribution of $\tau_1^\delta=N_1\delta$ where $N_1$ is the first $n$ such that $Y_n = 1$ (apply the previous algorithm, modifying only the parameters).

Plot the empirical distribution of $\tau_1^\delta$ for small $\delta$ and compare it with the exponential distribution $\mathcal{E}(\alpha)$. 
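
One possible starting point, as a sketch rather than a reference solution. It samples $\tau_1^\delta$ and compares the empirical survival function with $\exp(-\alpha t)$ at a few times, without plotting. Only $\alpha$ appears because, starting from $Y_0=0$, the chain stays in state $0$ until its first jump. The function names and the values of `alpha` and `delta` are assumptions made for this example.

```python
import math
import random


def first_jump_time(alpha, delta, rng):
    """Sample tau_1^delta = N_1 * delta, where N_1 is the first n >= 1 with Y_n = 1 (Y_0 = 0)."""
    n = 1
    # At each step the chain jumps from 0 to 1 with probability alpha * delta.
    while rng.random() >= alpha * delta:
        n += 1
    return n * delta


def empirical_survival(samples, t):
    """Fraction of the samples that are strictly larger than t."""
    return sum(1 for s in samples if s > t) / len(samples)


# Hypothetical parameters, chosen only for illustration.
rng = random.Random(0)
alpha, delta = 2.0, 0.01
samples = [first_jump_time(alpha, delta, rng) for _ in range(5000)]
for t in (0.1, 0.5, 1.0):
    print(t, empirical_survival(samples, t), math.exp(-alpha * t))
```

Decreasing `delta` should bring the two columns closer, at the cost of more steps per sample.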

# 3 - Next Week: Generalization

Generalization to $E=\{1,2,...,d\}$ where a graph of jumps is built.