# __Chapter 2: Statistical Properties of Financial Data__

## Section 2.1: A First Look at the Data

### 2.1.1 Prices

### 2.1.2 Returns

### 2.1.3 Dividends

### 2.1.4 Bond Yields

### 2.1.5 Financial Distributions

### 2.1.6 Transactions 

## Section 2.2: Summary Statistics

<br>

So far we have relied upon graphical tools such as time-series plots and histograms to explore financial data. Below we will review and employ descriptive statistical tools.

<br>

### 2.2.1 Univariate

##### __Sample Mean__

<br>

A simple measure of the expected return is given by the sample mean:

$$
\bar{r} = \frac{1}{T} \sum_{t=1}^{T} r_{t}
$$

<br>

See Figure 2.3. These are monthly S&P 500 data, $r_{t}$, plotted for the period January 1950 - September 2016. The sample mean in monthly terms is $\bar{r} = 0.006$. In annual terms this becomes $\bar{r} \times 12 = 0.006 \times 12 = 0.072$. This means that the average return over the period 1950 - 2016 is $7.2\%$ per annum.

The sample mean is interpreted as the level around which $r_{t}$ fluctuates and represents a summary measure of the location of the data. 
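
As a minimal sketch of the calculation above, the following computes a sample mean and scales it from monthly to annual terms. The function names `sample_mean` and `annualize_monthly_mean` are hypothetical, and the return series is made up for illustration (it is not the S&P 500 data); it is chosen only so that its mean equals the monthly figure $0.006$ used in the annualization.

```python
import numpy as np

def sample_mean(returns):
    """Sample mean: (1/T) * sum of r_t over t = 1..T."""
    r = np.asarray(returns, dtype=float)
    return r.sum() / r.size

def annualize_monthly_mean(monthly_mean, periods_per_year=12):
    """Scale a monthly mean return to annual terms by multiplying by 12."""
    return monthly_mean * periods_per_year

# Hypothetical monthly returns (illustrative only, not the S&P 500 series)
r = np.array([0.012, -0.008, 0.021, 0.004, -0.015, 0.022])

r_bar = sample_mean(r)                         # monthly sample mean
r_bar_annual = annualize_monthly_mean(r_bar)   # the same mean in annual terms

print(f"monthly mean: {r_bar:.4f}, annualized: {r_bar_annual:.4f}")
```

Multiplying by 12 is appropriate here because log returns add across periods; simple returns would compound instead.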

<br>

##### __Sample Variance and Standard Deviation__

<br>

A measure of the deviation of the actual return on an asset around its sample mean is given by the sample variance: 

<br>

$$
s^{2} = \frac{1}{T}\sum_{t=1}^{T} (r_{t} - \bar{r})^{2}
$$

<br>

__NB:__ this form is a biased estimator of the population variance. An unbiased estimator replaces the $T$ term in the denominator with $T - 1$, which is known as a degrees-of-freedom (small-sample) correction. If $T$ is sufficiently large, this makes little difference in practice.
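
The effect of the denominator can be illustrated with a short sketch. The helper `sample_variance` and its `ddof` argument are illustrative choices (the argument mirrors NumPy's `ddof` convention), and the data are hypothetical.

```python
import numpy as np

def sample_variance(returns, ddof=0):
    """Sample variance with denominator T - ddof.

    ddof=0 gives the 1/T form shown above (biased);
    ddof=1 gives the 1/(T - 1) degrees-of-freedom correction (unbiased).
    """
    r = np.asarray(returns, dtype=float)
    deviations = r - r.mean()
    return np.sum(deviations ** 2) / (r.size - ddof)

# Hypothetical monthly returns (illustrative only)
r = np.array([0.012, -0.008, 0.021, 0.004, -0.015, 0.022])

s2_biased = sample_variance(r, ddof=0)
s2_unbiased = sample_variance(r, ddof=1)

# The two estimators differ by the factor T / (T - 1), which tends to 1 as T grows
ratio = s2_unbiased / s2_biased
```

With only $T = 6$ observations the ratio is $6/5$; for a monthly sample spanning several decades it is very close to 1.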

<br>

In the case of the S&P 500 data, this sample variance is $s^{2} = 0.0348^{2} = 0.0012$. 

<br>

Usually in empirical financial applications the sample standard deviation is used as a measure of riskiness of an investment and is typically called the ___volatility___ of the asset. 

The sample standard deviation is the square root of the sample variance: 

<br>

$$
s = \sqrt{\frac{1}{T}\sum_{t=1}^{T} (r_{t} - \bar{r})^{2}}
$$

<br>

For the S&P 500 data, this quantity is $s = 0.0348$. The sample standard deviation (or volatility) is convenient because it is expressed in the same units as the asset's returns and is therefore directly interpretable. (Variance is measured in squared returns, which are harder to reason about.)
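
A minimal sketch of the volatility calculation follows, using the $1/T$ form of the variance. The function name `sample_volatility` and the data are hypothetical; the last lines only check the arithmetic linking the quoted standard deviation and variance.

```python
import numpy as np

def sample_volatility(returns):
    """Sample standard deviation: square root of the 1/T sample variance."""
    r = np.asarray(returns, dtype=float)
    variance = np.mean((r - r.mean()) ** 2)  # 1/T form of the sample variance
    return np.sqrt(variance)

# Hypothetical monthly returns (illustrative only)
r = np.array([0.012, -0.008, 0.021, 0.004, -0.015, 0.022])
s = sample_volatility(r)  # same units as the returns themselves

# Arithmetic check on the quoted figures: squaring s = 0.0348
# gives a variance of roughly 0.0012
s_quoted = 0.0348
s2_quoted = s_quoted ** 2
```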

<br>

##### __Sample Skewness__

<br>

A measure of skewness in the data sample is:

<br>

$$
SK = \frac{1}{T}\sum_{t=1}^{T}\left(\frac{r_{t} - \bar{r}}{s}\right)^{3}
$$

<br>

The sign of $SK$ reflects where the extreme returns in the sample mainly lie:

1. If the extreme returns are mainly positive, then the distribution of $r_{t}$ is positively skewed

2. If the extreme returns are mainly negative, then the distribution of $r_{t}$ is negatively skewed

3. If the sample skewness is zero, then the distribution is said to be symmetric

<br>

The S&P 500 data have a sample skewness of $SK = -1.005$, where the negative sign of the statistic indicates negative skewness. This supports the view that the returns distribution has a heavier left tail than a Normal distribution (which is symmetric).
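
The sign behaviour of the skewness statistic can be sketched as follows. The function `sample_skewness` is a hypothetical helper that uses the $1/T$ standard deviation, and the three small samples are invented for illustration.

```python
import numpy as np

def sample_skewness(returns):
    """Sample skewness: mean of cubed standardized returns (1/T form)."""
    r = np.asarray(returns, dtype=float)
    s = np.sqrt(np.mean((r - r.mean()) ** 2))  # 1/T standard deviation
    z = (r - r.mean()) / s                     # standardized returns
    return np.mean(z ** 3)

# Hypothetical samples (illustrative only)
symmetric = np.array([-0.02, -0.01, 0.0, 0.01, 0.02])
left_tailed = np.array([0.01, 0.02, 0.01, 0.015, -0.10])  # one large loss
right_tailed = -left_tailed                               # mirror image

sk_symmetric = sample_skewness(symmetric)  # approximately zero
sk_left = sample_skewness(left_tailed)     # negative
sk_right = sample_skewness(right_tailed)   # positive, same magnitude
```

A single large loss is enough to make the statistic negative, because cubing the standardized returns weights extreme observations heavily while preserving their sign.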

<br>

##### __Sample Kurtosis__

<br>

If there are extreme returns relative to a benchmark distribution (typically the Normal or Gaussian), the distribution of $r_{t}$ is said to exhibit excess kurtosis. 

<br>

The sample kurtosis is given by:

<br>

$$
KT = \frac{1}{T} \sum_{t=1}^{T} \left( \frac{r_{t} - \bar{r}}{s} \right)^{4}
$$

<br>

The sample kurtosis of the S&P 500 data is $KT = 6.938$. The value of this statistic for normally distributed data is $3$, so the excess kurtosis is $KT - 3 = 3.938$. Because the sample kurtosis is much greater than $3$, the log returns from the S&P 500 data exhibit more extreme returns (or have heavier tails) than would be predicted by a normal distribution for log returns.
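
Conventions differ on whether $3$ is subtracted from the fourth standardized moment, so the sketch below exposes both through a hypothetical `excess` flag. The function name, the simulated Normal sample, and the artificial heavy-tailed sample are all assumptions made for illustration.

```python
import numpy as np

def sample_kurtosis(returns, excess=False):
    """Mean of standardized returns raised to the fourth power (1/T form).

    With excess=True, 3 is subtracted so that the Normal benchmark is 0
    rather than 3.
    """
    r = np.asarray(returns, dtype=float)
    s = np.sqrt(np.mean((r - r.mean()) ** 2))  # 1/T standard deviation
    z = (r - r.mean()) / s
    kt = np.mean(z ** 4)
    return kt - 3.0 if excess else kt

# Simulated Normal draws: the raw statistic should be close to 3
rng = np.random.default_rng(0)
normal_sample = rng.standard_normal(200_000)
kt_normal = sample_kurtosis(normal_sample)

# Artificial heavy-tailed sample: mostly zeros with two extreme observations
heavy_sample = np.concatenate([np.zeros(98), [10.0, -10.0]])
kt_heavy = sample_kurtosis(heavy_sample)
kt_heavy_excess = sample_kurtosis(heavy_sample, excess=True)
```

Whichever convention is used, the comparison is the same: a raw value above $3$, or equivalently a positive excess value, indicates heavier tails than the Normal benchmark.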

<br>


### 2.2.2 Bivariate

##### __Covariance__

##### __Correlation__

## Section 2.3: Percentiles and Value-at-Risk

1. __Historical Simulation__



2. __The Variance Method__



3. __Monte Carlo Simulation__

## Section 2.4: The Efficient Market Hypothesis

### 2.4.1 Return Predictability

### 2.4.2 The Variance Ratio

## Section 2.5: Exercises