# Sequential estimation




## Robbins-Monro Algorithm
The *<a href="https://en.wikipedia.org/wiki/Stochastic_approximation">Robbins-Monro</a>* algorithm is a method for **iteratively** estimating the root of a function that can only be observed through noisy measurements (random variables).  
Assume that we have a function $M(\theta)$, and a constant $\alpha$, such that the equation $M(\theta)=\alpha$ has a unique root at $\theta^*$. It is assumed that while we cannot directly observe the function $M(\theta)$, we can instead obtain measurements of the random variable $N(\theta)$ where $\mathbb{E}[N(\theta)]=M(\theta)$. The structure of the algorithm is to then generate iterates of the form:
$$\theta_{n+1}=\theta_n - a_n(N(\theta_n)-\alpha)$$
Here, $a_1,a_2,\cdots$ is a sequence of positive step sizes. Robbins and Monro proved that $\theta_n$ converges in $L^2$ to $\theta^*$, and the convergence in fact also holds with probability one, provided that  
- $N(\theta)$ is uniformly bounded.
- $M(\theta)$ is nondecreasing. 
- $M'(\theta^*)$ exists and is positive.
- $a_n$ satisfies the following requirements:
$$\begin{align*}
\lim_{n\to\infty}a_n &= 0\\
\sum_{n=1}^{\infty}a_n&=\infty\\
\sum_{n=1}^{\infty}a_n^2&<\infty
\end{align*}$$
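
The iteration above can be sketched in a few lines of Python. This is only a minimal illustration under stated assumptions: the function `robbins_monro`, the toy measurement `noisy_measure` (a linear $M(\theta)=2\theta+1$ observed with bounded uniform noise), and the step-size choice $a_n=1/n$ are all hypothetical and not part of the original notes. The toy function is unbounded, so it does not strictly satisfy the boundedness condition, but it shows the mechanics of the update.

```python
import random

def robbins_monro(noisy_measure, alpha, theta0, n_steps, step_scale=1.0):
    """Iterate theta_{n+1} = theta_n - a_n * (N(theta_n) - alpha).

    Assumed step sizes: a_n = step_scale / n, which satisfy
    a_n -> 0, sum a_n = infinity and sum a_n^2 < infinity.
    """
    theta = theta0
    for n in range(1, n_steps + 1):
        a_n = step_scale / n
        theta = theta - a_n * (noisy_measure(theta) - alpha)
    return theta

rng = random.Random(0)  # fixed seed so the run is reproducible

def noisy_measure(theta):
    # Hypothetical example: M(theta) = 2*theta + 1, seen through uniform noise
    return 2.0 * theta + 1.0 + rng.uniform(-1.0, 1.0)

# Solve M(theta) = 5, whose exact root is theta* = 2
theta_hat = robbins_monro(noisy_measure, alpha=5.0, theta0=0.0, n_steps=20000)
print(theta_hat)
```

The estimate should land close to the root $\theta^*=2$ even though $M(\theta)$ is never evaluated directly, only its noisy measurements.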

## Application of Robbins-Monro Algorithm

To maximize the log-likelihood function, we differentiate it with respect to $\theta$ and set the derivative to zero:
$$\frac{\partial}{\partial \theta}\left .\left\{\frac{1}{N}\sum_{n=1}^N\ln p(\mathbf{x}_n|\theta) \right\}\right|_{\theta_{ML}}=0$$
Exchanging the derivative and the summation, and taking the limit $N\to\infty$, we have
$$\lim_{N\to\infty}\frac{1}{N}\sum_{n=1}^N\frac{\partial}{\partial\theta}\ln p(\mathbf{x}_n|\theta)=\mathbb{E}_x\left[\frac{\partial}{\partial\theta}\ln p(x|\theta)\right]$$
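Maximizing the likelihood therefore amounts to finding the value of $\theta$ at which the expectation of $\frac{\partial}{\partial\theta}\ln p(x|\theta)$ is zero, which is a root-finding problem. Comparing this with the form of the Robbins-Monro algorithm, we have
- $\alpha=0$, the target value in $M(\theta)=\alpha$.
- $N(\theta)=-\frac{\partial}{\partial\theta}\ln p(x|\theta)$, the noisy measurement whose expectation is $M(\theta)$. The minus sign makes $M(\theta)$ nondecreasing near the maximum and turns the subtraction in the Robbins-Monro update into an addition.
- $\theta^*=\mu_{ML}$, the root we want to find. Here $\theta$ is the mean $\mu$ of a Gaussian with known variance $\sigma^2$.
- $a_N$, the step sizes.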

Then the sequential form of $\mu_{ML}$ is given by
$$\begin{align*}
\mu_{ML}^{(N)}&=\mu_{ML}^{(N-1)}+a_{N-1}\left(\frac{\partial}{\partial\theta}\ln p(x_N|\mu_{ML}^{(N-1)})-0\right)\\
&=\mu_{ML}^{(N-1)}+a_{N-1}\left(\frac{1}{\sigma^2}(x_N-\mu_{ML}^{(N-1)})\right)\\
&=\mu_{ML}^{(N-1)}+\frac{1}{N}(x_N-\mu_{ML}^{(N-1)})\qquad \text{with } a_N=\frac{\sigma^2}{N+1}
\end{align*}$$
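
As a quick sanity check, the final update $\mu^{(N)}=\mu^{(N-1)}+\frac{1}{N}(x_N-\mu^{(N-1)})$ can be run over a data stream and compared with the ordinary batch mean. The sketch below is illustrative only: the function name `sequential_mean`, the synthetic Gaussian data, and the starting value $0$ are assumptions made for the example. The starting value does not matter, because the first step with $N=1$ replaces it by $x_1$.

```python
import random

def sequential_mean(xs):
    """Apply mu_N = mu_{N-1} + (x_N - mu_{N-1}) / N to each point in turn."""
    mu = 0.0  # arbitrary start; the N = 1 step overwrites it with x_1
    for n, x in enumerate(xs, start=1):
        mu = mu + (x - mu) / n
    return mu

rng = random.Random(1)  # fixed seed so the run is reproducible
# Hypothetical data: 1000 draws from a Gaussian with mean 3 and std 2
data = [rng.gauss(3.0, 2.0) for _ in range(1000)]

mu_seq = sequential_mean(data)
mu_batch = sum(data) / len(data)
print(mu_seq, mu_batch)
```

The two values should agree up to floating-point rounding, which reflects the fact that with $a_N=\sigma^2/(N+1)$ the Robbins-Monro iterate coincides with the sample mean after every observation.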