# What is a point process?

A temporal point process is a probability distribution over variable-length event sequences in some time interval $[0 , T]$. A realization of a marked temporal point process can be represented as an event sequence $X = \{(t_1 , m_1), \ldots , (t_N , m_N)\}$, where $N$, the number of events, is itself a random variable.

Here, $0 < t_1 < \cdots < t_N \leq T$ are the times of occurrence of events and $m_i \in \mathcal{M}$ are the marks.

_Examples of point processes:_

- item 1
- item 2

We denote by $\mathcal{H}_t$ the history up to, but not including, time $t$:

$\mathcal{H}_t = \left\{ \left( t_i , m_i \right) : t_i < t \right\}$
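The definition of the history can be sketched in a few lines of Python. This is only an illustration: the `Event` alias, the `history` helper, and the use of strings as marks are assumptions made for the example, not part of the formalism above.

```python
from typing import List, Tuple

# An event is a (time, mark) pair; string marks are an assumption for this sketch.
Event = Tuple[float, str]

def history(events: List[Event], t: float) -> List[Event]:
    """Return H_t: all events whose time is strictly less than t."""
    return [(t_i, m_i) for (t_i, m_i) in events if t_i < t]

events = [(0.5, "a"), (1.2, "b"), (3.0, "a")]
# The event occurring exactly at t = 1.2 is excluded from the history at 1.2.
print(history(events, 1.2))
```

The strict inequality matters: the intensity at the time of an event is conditioned on everything that happened before that event, not on the event itself.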

where $t_i$ is the time of the $i$th event and $m_i$ is the corresponding mark. If we denote by $N_{\delta} \left( t \right)$ the number of events occurring in $[ t , t + \delta )$, the ground intensity function is defined by:

$\lambda_g \left( t \mid \mathcal{H}_t \right) = \lim_{\delta \rightarrow 0} \frac{1}{\delta} \Pr \left\{ N_{\delta} \left( t \right) > 0 \mid \mathcal{H}_t \right\}$
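To see the limit at work, consider the simplest possible case, chosen here purely as an illustration: a homogeneous Poisson process with constant rate $\lambda$, for which $\Pr \{ N_{\delta}(t) > 0 \} = 1 - \exp(-\lambda \delta)$ regardless of the history. The sketch below (the function name `intensity_estimate` is hypothetical) evaluates the ratio in the definition for shrinking values of $\delta$; it should approach $\lambda$ from below.

```python
import math

def intensity_estimate(lam: float, delta: float) -> float:
    """(1/delta) * Pr{N_delta(t) > 0} for a homogeneous Poisson process of rate lam."""
    # Pr{N_delta > 0} = 1 - exp(-lam * delta); expm1 keeps precision for small delta.
    return -math.expm1(-lam * delta) / delta

lam = 2.0
for delta in (1.0, 0.1, 0.01, 0.001):
    print(delta, intensity_estimate(lam, delta))
```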

Our objective is to use a dataset of events to model the ground intensity function. With a fitted model, we can then generate event sequences from the same probability distribution, or predict the time (and the mark) of the next event.
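As a rough sketch of how event times can be generated once an intensity is available, the code below uses thinning (rejection sampling against an upper bound on the intensity). Both the sampling method and the intensity are assumptions made for illustration: the text does not prescribe either, and the exponential-kernel self-exciting intensity $\mu + \alpha \sum_{t_i < t} \exp(-\beta (t - t_i))$, together with the names `hawkes_intensity` and `sample_hawkes`, is hypothetical. Marks are ignored.

```python
import math
import random
from typing import List

def hawkes_intensity(t: float, times: List[float],
                     mu: float, alpha: float, beta: float) -> float:
    """Illustrative intensity: baseline mu plus decaying kicks from past events."""
    return mu + alpha * sum(math.exp(-beta * (t - s)) for s in times if s < t)

def sample_hawkes(mu: float, alpha: float, beta: float,
                  T: float, rng: random.Random) -> List[float]:
    """Sample event times on (0, T] by thinning."""
    times: List[float] = []
    t = 0.0
    while True:
        # Between events this intensity only decays, so its value just after
        # the current time t is an upper bound until the next accepted event.
        lam_bar = mu + alpha * sum(math.exp(-beta * (t - s)) for s in times)
        t += rng.expovariate(lam_bar)  # candidate time from the bounding process
        if t > T:
            return times
        # Accept the candidate with probability intensity(t) / lam_bar.
        if rng.random() * lam_bar <= hawkes_intensity(t, times, mu, alpha, beta):
            times.append(t)

rng = random.Random(0)
print(sample_hawkes(mu=1.0, alpha=0.5, beta=1.0, T=10.0, rng=rng))
```

The bound used here relies on the intensity being non-increasing between events, which holds for this particular kernel; other intensities would need a different bound.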

To compute the likelihood, we denote by $\tau$ the time of the last event before time $t$, and by $\emptyset_{(\tau , t)}$ the null outcome, that is, the absence of any event between time $\tau$ and time $t$. Let $T$ now denote the random time of the next event after $\tau$ (not to be confused with the end of the observation interval). Its conditional distribution function, that is, the probability that $T$ is less than or equal to $t$ given the history up to time $\tau$ ($\mathcal{H}_{\tau}$) and the null outcome $\emptyset_{(\tau , t)}$, is denoted:

$H(t \mid \mathcal{H}_{\tau} \cap \emptyset_{(\tau , t)}) = \Pr \{T \leq t \mid \mathcal{H}_{\tau} \cap \emptyset_{(\tau , t)}\}$

The corresponding density function is denoted $h(t \mid \mathcal{H}_{\tau} \cap \emptyset_{(\tau , t)})$. Then we have:

$\lambda_g(t \mid \mathcal{H}_{\tau} \cap \emptyset_{(\tau , t)}) = \frac{h(t \mid \mathcal{H}_{\tau} \cap \emptyset_{(\tau , t)})}{1 - H(t \mid \mathcal{H}_{\tau} \cap \emptyset_{(\tau , t)})}$

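Since $h$ is the derivative of $H$ with respect to $t$, this is a differential equation in $H$. Solving it with the initial condition $H = 0$ at $t = \tau$ gives: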

$H(t \mid \mathcal{H}_{\tau} \cap \emptyset_{(\tau , t)}) = 1 - \exp \{- \int_{\tau}^t \lambda_g(u \mid \mathcal{H}_{\tau} \cap \emptyset_{(\tau , u)}) \, du\}$

and:

$h(t \mid \mathcal{H}_{\tau} \cap \emptyset_{(\tau , t)}) = \lambda_g(t \mid \mathcal{H}_{\tau} \cap \emptyset_{(\tau , t)}) \exp \{- \int_{\tau}^t \lambda_g(u \mid \mathcal{H}_{\tau} \cap \emptyset_{(\tau , u)}) \, du \}$
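As a quick sanity check of these two formulas, consider the simplest case of a constant ground intensity, $\lambda_g(t \mid \cdot) = \lambda$ (a homogeneous Poisson process; this special case is only an illustration and is not part of the derivation above). The integral becomes $\lambda (t - \tau)$, so:

$H(t \mid \mathcal{H}_{\tau} \cap \emptyset_{(\tau , t)}) = 1 - \exp \{- \lambda (t - \tau)\}, \qquad h(t \mid \mathcal{H}_{\tau} \cap \emptyset_{(\tau , t)}) = \lambda \exp \{- \lambda (t - \tau)\}$

That is, the waiting time until the next event is exponentially distributed with rate $\lambda$, and the ratio $h / (1 - H)$ recovers the constant $\lambda$.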

We fit the model by maximizing the likelihood of the observed events, expressed in terms of the ground intensity function. If $T_1$ and $T_2$ are the start and end times of the catalog of events, and we only explicitly include in the likelihood the $n$ events whose times $t_i$ satisfy $T_1 \leq t_i \leq T_2$, the log-likelihood is equal to:

\begin{equation}
\begin{aligned}
\log L &= \log h \left( t_1 \mid \mathcal{H}_{T_1} \cap \emptyset_{\left( T_1 , t_1 \right)} \right) + \sum_{i = 2}^n \log h \left( t_i \mid \mathcal{H}_{t_{i - 1}} \cap \emptyset_{\left( t_{i - 1} , t_i \right)} \right) \\
&+ \log \left( 1 - H \left( T_2 \mid \mathcal{H}_{t_n} \cap \emptyset_{\left( t_n , T_2 \right)} \right) \right) \\
&= \sum_{i = 1}^n \log \lambda_g \left( t_i \mid \mathcal{H}_{t_i} \right) - \int_{T_1}^{t_1} \lambda_g \left( u \mid \mathcal{H}_{T_1} \cap \emptyset_{\left( T_1 , u \right)} \right) du \\
&- \sum_{i = 2}^n \int_{t_{i - 1}}^{t_i} \lambda_g \left( u \mid \mathcal{H}_{t_{i - 1}} \cap \emptyset_{\left( t_{i - 1} , u \right)} \right) du - \int_{t_n}^{T_2} \lambda_g \left( u \mid \mathcal{H}_{t_n} \cap \emptyset_{\left( t_n , u \right)} \right) du \\
&= \sum_{i: T_1 \leq t_i \leq T_2} \log \lambda_g \left( t_i \mid \mathcal{H}_{t_i} \right) - \int_{T_1}^{T_2} \lambda_g \left( t \mid \mathcal{H}_t \right) dt
\end{aligned}
\end{equation}
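The final expression, a sum of log-intensities at the event times minus the integral of the intensity over $[T_1 , T_2]$, translates fairly directly into code. The sketch below is a minimal illustration under stated assumptions: the function name `log_likelihood`, the callable signature `intensity(t, past_times)`, and the midpoint-rule approximation of the integral are all choices made for this example, and marks are ignored.

```python
import math
from typing import Callable, List, Sequence

def log_likelihood(times: Sequence[float],
                   intensity: Callable[[float, List[float]], float],
                   T1: float, T2: float, n_grid: int = 10_000) -> float:
    """Sum of log-intensities at observed events minus the integrated intensity."""
    # Only events inside [T1, T2] contribute log terms; earlier events still
    # enter through the history passed to the intensity.
    observed = [t for t in times if T1 <= t <= T2]
    log_term = sum(
        math.log(intensity(t, [s for s in times if s < t])) for t in observed
    )
    # Midpoint-rule approximation of the integral of the intensity on [T1, T2].
    step = (T2 - T1) / n_grid
    integral = 0.0
    for k in range(n_grid):
        u = T1 + (k + 0.5) * step
        integral += intensity(u, [s for s in times if s < u]) * step
    return log_term - integral

# Check against the closed form for a constant intensity lam:
# log L = n * log(lam) - lam * (T2 - T1).
lam = 2.0
times = [0.5, 1.2, 3.0]
print(log_likelihood(times, lambda t, past: lam, T1=0.0, T2=4.0))
print(len(times) * math.log(lam) - lam * 4.0)
```

For intensities that jump at event times, the midpoint rule is only approximate; a closed-form integral, when one exists for the chosen intensity, would be both faster and more accurate.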