## [Small-Sample Inference and Bootstrap](https://ocw.mit.edu/courses/sloan-school-of-management/15-450-analytics-of-finance-fall-2010/lecture-notes/MIT15_450F10_lec09.pdf )

In small samples, asymptotic inference may be unreliable:
- Estimators may be consistent but biased.
- Standard errors may be imprecise, leading to incorrect confidence intervals and incorrect test sizes.

We can use simulation methods to deal with some of these issues:
- Bootstrap can be used instead of asymptotic inference to deal with analytically challenging problems.
- Bootstrap can be used to adjust for bias. 
- Monte Carlo simulation can be used to gain insight into the properties of statistical procedures. 

Example: estimate the AR(1) model by OLS (GMM):
$$x_t = a_0 + \rho_1 x_{t-1} + e_t$$
We know that this estimator is consistent:

$$\operatorname*{plim}_{T\to\infty} \hat{\rho}_1 = \rho_1 $$

We want to know if this estimator is biased, i.e., we want to estimate
$$ E(\hat{\rho}_1) - {\rho}_1 $$

Perform a Monte Carlo study to gain insight into the phenomenon: independently simulate $N$ random series of length $T$, compute the estimate $\hat{\rho}_1(i)$ on each series $i$, and estimate the bias as
$$ \hat{E}(\hat{\rho}_1) - \rho_1 = \frac{1}{N} \sum_{i=1}^{N} \hat{\rho}_1(i) - \rho_1 $$

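The standard error of our simulation-based estimate is $\hat{\sigma}/\sqrt{N}$, where $\hat{\sigma}$ is the standard deviation of the estimates across simulations:
$$\hat{\sigma}=\sqrt{\frac{1}{N} \sum_{i=1}^{N} \left(\hat{\rho}_1(i) - \hat{E}(\hat{\rho}_1)\right)^2 }$$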

There is an analytical formula for the average bias due to Kendall:
$$ E(\hat{\rho}_1) - {\rho}_1 \approx -\frac{1+3{\rho}_1}{T}$$
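The Monte Carlo study described above can be sketched in a few lines. This is a minimal illustration, not the course's own code: the parameter values ($\rho_1 = 0.9$, $T = 50$, $N = 2000$), the Gaussian shocks, the burn-in length, and the helper names `simulate_ar1`, `ols_ar1`, and `monte_carlo_bias` are all assumptions chosen for the example. It uses only the Python standard library.

```python
import math
import random

def simulate_ar1(a0, rho, T, rng, burn_in=100):
    """Simulate T observations of x_t = a0 + rho * x_{t-1} + e_t, e_t ~ N(0, 1)."""
    x = a0 / (1.0 - rho)  # start at the unconditional mean (requires |rho| < 1)
    path = []
    for t in range(burn_in + T):
        x = a0 + rho * x + rng.gauss(0.0, 1.0)
        if t >= burn_in:
            path.append(x)
    return path

def ols_ar1(x):
    """OLS of x_t on a constant and x_{t-1}; returns (a0_hat, rho_hat)."""
    y, z = x[1:], x[:-1]
    n = len(y)
    y_bar, z_bar = sum(y) / n, sum(z) / n
    sxy = sum((zi - z_bar) * (yi - y_bar) for zi, yi in zip(z, y))
    sxx = sum((zi - z_bar) ** 2 for zi in z)
    rho_hat = sxy / sxx
    return y_bar - rho_hat * z_bar, rho_hat

def monte_carlo_bias(rho, T, N, seed=0):
    """Return (estimated bias, standard error of that bias estimate)."""
    rng = random.Random(seed)
    estimates = [ols_ar1(simulate_ar1(0.0, rho, T, rng))[1] for _ in range(N)]
    mean_est = sum(estimates) / N
    sigma = math.sqrt(sum((r - mean_est) ** 2 for r in estimates) / N)
    return mean_est - rho, sigma / math.sqrt(N)

rho, T = 0.9, 50
bias, se = monte_carlo_bias(rho=rho, T=T, N=2000)
kendall = -(1 + 3 * rho) / T  # Kendall's first-order approximation
print(f"simulated bias = {bias:.4f} (s.e. {se:.4f}), Kendall = {kendall:.4f}")
```

Increasing `N` shrinks the simulation standard error, while increasing `T` shrinks the bias itself; the two should not be confused when reading the output.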

Consider a predictive regression (e.g., forecasting stock returns using
dividend yield): 
$$ \begin{aligned}
r_{t+1} &= \alpha + \beta x_t + u_{t+1} \\
x_{t+1} &= \theta + \rho x_t + \varepsilon_{t+1}
\end{aligned} $$


Stambaugh bias:
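$$ E(\hat{\beta} - \beta) = \frac{\operatorname{Cov}(u_t, \varepsilon_t)}{\operatorname{Var}(\varepsilon_t)} \, E(\hat{\rho} - \rho) \approx -\frac{1+3\rho}{T} \, \frac{\operatorname{Cov}(u_t, \varepsilon_t)}{\operatorname{Var}(\varepsilon_t)} $$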
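A small simulation can make the Stambaugh bias concrete. The sketch below is illustrative only: it assumes $\beta = 0$, $\alpha = \theta = 0$, Gaussian shocks, and $u_t = \gamma \varepsilon_t + \eta_t$ with $\gamma = -0.9$, so that $\operatorname{Cov}(u_t, \varepsilon_t)/\operatorname{Var}(\varepsilon_t) = \gamma$. The helper names `ols_slope` and `simulate_pair` and all parameter values are hypothetical choices for this example.

```python
import random

def ols_slope(y, z):
    """Slope from an OLS regression of y on a constant and z."""
    n = len(y)
    y_bar, z_bar = sum(y) / n, sum(z) / n
    sxy = sum((zi - z_bar) * (yi - y_bar) for zi, yi in zip(z, y))
    sxx = sum((zi - z_bar) ** 2 for zi in z)
    return sxy / sxx

def simulate_pair(beta, rho, gamma, T, rng, burn_in=100):
    """Simulate one sample of T regression pairs; return (beta_hat, rho_hat)."""
    x = 0.0
    x_lag, x_next, r_next = [], [], []
    for t in range(burn_in + T):
        eps = rng.gauss(0.0, 1.0)
        u = gamma * eps + rng.gauss(0.0, 0.5)  # Cov(u, eps) / Var(eps) = gamma
        x_new = rho * x + eps                  # theta = 0
        r_new = beta * x + u                   # alpha = 0
        if t >= burn_in:
            x_lag.append(x)
            x_next.append(x_new)
            r_next.append(r_new)
        x = x_new
    return ols_slope(r_next, x_lag), ols_slope(x_next, x_lag)

beta, rho, gamma, T, N = 0.0, 0.9, -0.9, 60, 2000
rng = random.Random(0)
results = [simulate_pair(beta, rho, gamma, T, rng) for _ in range(N)]
beta_bias = sum(b for b, _ in results) / N - beta
rho_bias = sum(r for _, r in results) / N - rho
approx = -gamma * (1 + 3 * rho) / T  # Stambaugh bias with Kendall's approximation

print(f"bias of beta_hat        = {beta_bias:.4f}")
print(f"gamma * bias of rho_hat = {gamma * rho_bias:.4f}")
print(f"analytical approximation = {approx:.4f}")
```

Because $\gamma$ is negative here and $\hat{\rho}$ is biased downward, the predictive coefficient is biased upward even though the true $\beta$ is zero.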



#### Bootstrap: General Principle


- The bootstrap is effectively a Monte Carlo study that uses the empirical distribution as if it were the true distribution.
- It is used to evaluate distributional properties of complicated estimators, to perform bias adjustment, and to improve the precision of asymptotic approximations in small samples (confidence intervals, test rejection regions, etc.).
- For the t-statistic, the bootstrapped distribution is more accurate than the large-sample normal approximation.
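As an illustration of the last point, here is a rough sketch of a bootstrap-t confidence interval for the mean of an IID sample, where the bootstrapped distribution of the t-statistic replaces the normal quantiles. The skewed exponential sample, the sample size, the number of resamples `R`, and the function name `bootstrap_t_interval` are assumptions made for this example.

```python
import math
import random
import statistics

def bootstrap_t_interval(sample, R=2000, alpha=0.05, seed=0):
    """Bootstrap-t confidence interval for the mean of an IID sample."""
    rng = random.Random(seed)
    n = len(sample)
    mean = statistics.fmean(sample)
    se = statistics.stdev(sample) / math.sqrt(n)
    t_stats = []
    for _ in range(R):
        # Resample with replacement from the empirical distribution.
        resample = rng.choices(sample, k=n)
        se_star = statistics.stdev(resample) / math.sqrt(n)
        # The t-statistic is centered at the sample mean, which plays
        # the role of the "true" mean in the bootstrap world.
        t_stats.append((statistics.fmean(resample) - mean) / se_star)
    t_stats.sort()
    q_lo = t_stats[int((alpha / 2) * R)]          # approximate lower quantile
    q_hi = t_stats[int((1 - alpha / 2) * R) - 1]  # approximate upper quantile
    # Invert the t-statistic: the upper quantile sets the lower bound.
    return mean - q_hi * se, mean - q_lo * se

data_rng = random.Random(1)
sample = [data_rng.expovariate(1.0) for _ in range(30)]  # skewed, small sample
lower, upper = bootstrap_t_interval(sample)
print(f"sample mean = {statistics.fmean(sample):.3f}")
print(f"95% bootstrap-t interval = ({lower:.3f}, {upper:.3f})")
```

For skewed data the resulting interval is typically asymmetric around the sample mean, which the symmetric normal-approximation interval cannot capture.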

Bootstrap is not a silver bullet:
- Does not work well if rare events are missing from the empirical sample.
- Does not account for more subtle biases, such as survivorship or sample selection.
- Does not cure model misspecification. 

#### Parametric Bootstrap

- Parametric bootstrap can handle non-IID samples.

Example: a sample from an AR(1) process: 
$$x_t = a_0 + a_1 x_{t-1} + e_t$$

We want a confidence interval for $a_1$.

- Estimate the parameters $\hat{a}_0$, $\hat{a}_1$ and the residuals $\hat{e}_1, \dots, \hat{e}_T$.

- Generate $R$ bootstrap samples of $x_t$. For each sample, generate a long series according to the AR(1) dynamics with $\hat{a}_0$ and $\hat{a}_1$, drawing shocks with replacement from the residuals $\hat{e}_1, \dots, \hat{e}_T$, then retain only the last $T$ observations (drop the burn-in sample).

- Compute the confidence interval as we would with the basic nonparametric bootstrap, using the $R$ samples.
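The three steps above can be sketched as follows. This is a minimal illustration under stated assumptions: the demo data are simulated with $a_0 = 0.5$, $a_1 = 0.5$, $T = 80$; "basic" is taken to mean the interval $[2\hat{a}_1 - q_{1-\alpha/2},\ 2\hat{a}_1 - q_{\alpha/2}]$ built from bootstrap quantiles $q$; and the names `ols_ar1` and `residual_bootstrap_ci` are hypothetical.

```python
import random

def ols_ar1(x):
    """OLS of x_t on a constant and x_{t-1}; returns (a0_hat, a1_hat, residuals)."""
    y, z = x[1:], x[:-1]
    n = len(y)
    y_bar, z_bar = sum(y) / n, sum(z) / n
    sxy = sum((zi - z_bar) * (yi - y_bar) for zi, yi in zip(z, y))
    sxx = sum((zi - z_bar) ** 2 for zi in z)
    a1 = sxy / sxx
    a0 = y_bar - a1 * z_bar
    resid = [yi - a0 - a1 * zi for zi, yi in zip(z, y)]
    return a0, a1, resid

def residual_bootstrap_ci(x, R=1000, alpha=0.05, burn_in=100, seed=0):
    """Bootstrap CI for a1 using the fitted AR(1) dynamics and resampled residuals."""
    rng = random.Random(seed)
    T = len(x)
    a0_hat, a1_hat, resid = ols_ar1(x)           # step 1: estimate
    draws = []
    for _ in range(R):                            # step 2: R bootstrap samples
        value, path = x[0], []                    # start value is washed out by burn-in
        for t in range(burn_in + T):
            value = a0_hat + a1_hat * value + rng.choice(resid)
            if t >= burn_in:                      # keep only the last T observations
                path.append(value)
        draws.append(ols_ar1(path)[1])            # re-estimate a1 on the bootstrap sample
    draws.sort()
    q_lo = draws[int((alpha / 2) * R)]
    q_hi = draws[int((1 - alpha / 2) * R) - 1]
    # Step 3: basic bootstrap interval (assumed definition, see lead-in).
    return a1_hat, (2 * a1_hat - q_hi, 2 * a1_hat - q_lo)

# Demo data: a simulated AR(1) series (illustrative parameter values).
data_rng = random.Random(1)
x, value = [], 1.0
for _ in range(80):
    value = 0.5 + 0.5 * value + data_rng.gauss(0.0, 1.0)
    x.append(value)

a1_hat, (ci_lower, ci_upper) = residual_bootstrap_ci(x)
print(f"a1_hat = {a1_hat:.3f}, 95% CI = ({ci_lower:.3f}, {ci_upper:.3f})")
```

Drawing shocks from the residuals rather than from a fitted normal distribution keeps the shape of the empirical shock distribution while still imposing the AR(1) dependence structure.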

#### Bootstrap Bias Adjustment

$$ E[\hat{\theta} - \theta_0] \approx E_R[\hat{\theta}^* - \hat{\theta}] $$

where $E_R$ denotes the average across the R bootstrapped samples.

__Intuition:__ treat the empirical distribution as exact, compute the
average bias across bootstrapped samples.

__Caution:__ by estimating the bias, we may be adding sampling error.
Correct for the bias only if it is large compared to the standard error of $\hat{\theta}$.
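A sketch of the bias adjustment for the AR(1) slope, combining the formula above with a residual bootstrap. The bias-corrected estimate is $\hat{\theta} - E_R[\hat{\theta}^* - \hat{\theta}] = 2\hat{\theta} - E_R[\hat{\theta}^*]$. The demo data ($a_1 = 0.8$, $T = 60$), the number of resamples, and the names `ols_ar1` and `bootstrap_bias` are assumptions for illustration only.

```python
import math
import random

def ols_ar1(x):
    """OLS of x_t on a constant and x_{t-1}; returns (a0_hat, a1_hat, residuals)."""
    y, z = x[1:], x[:-1]
    n = len(y)
    y_bar, z_bar = sum(y) / n, sum(z) / n
    sxy = sum((zi - z_bar) * (yi - y_bar) for zi, yi in zip(z, y))
    sxx = sum((zi - z_bar) ** 2 for zi in z)
    a1 = sxy / sxx
    a0 = y_bar - a1 * z_bar
    return a0, a1, [yi - a0 - a1 * zi for zi, yi in zip(z, y)]

def bootstrap_bias(x, R=1000, burn_in=100, seed=0):
    """Return (a1_hat, bootstrap bias estimate, bootstrap std. error of a1_hat)."""
    rng = random.Random(seed)
    T = len(x)
    a0_hat, a1_hat, resid = ols_ar1(x)
    draws = []
    for _ in range(R):
        value, path = x[0], []
        for t in range(burn_in + T):
            value = a0_hat + a1_hat * value + rng.choice(resid)
            if t >= burn_in:
                path.append(value)
        draws.append(ols_ar1(path)[1])
    mean_star = sum(draws) / R
    boot_se = math.sqrt(sum((d - mean_star) ** 2 for d in draws) / R)
    return a1_hat, mean_star - a1_hat, boot_se   # bias estimate = E_R[theta* - theta_hat]

# Demo data: a simulated AR(1) series (illustrative parameter values).
data_rng = random.Random(42)
x, value = [], 0.0
for _ in range(60):
    value = 0.8 * value + data_rng.gauss(0.0, 1.0)
    x.append(value)

a1_hat, bias_est, boot_se = bootstrap_bias(x)
corrected = a1_hat - bias_est  # equivalently 2 * a1_hat - mean of bootstrap estimates
print(f"a1_hat = {a1_hat:.3f}, estimated bias = {bias_est:.3f}, s.e. = {boot_se:.3f}")
print(f"bias-corrected estimate = {corrected:.3f}")
```

Printing the bootstrap standard error next to the bias estimate makes the caution above operational: the correction is worth applying only when the estimated bias is not small relative to that standard error.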