# L4a: Equity Exchanges, Return, and Stylized Facts
Today, we will introduce our next asset class: __equity securities__. Equity securities (stocks and exchange-traded funds, or ETFs) are traded on exchanges (secondary markets) that bring together several players and components. We'll also introduce the log return and explore its stylized facts (statistical properties).

> __Learning Objectives__ 
>
> By the end of this lecture, you will be able to define and demonstrate mastery of the following key concepts:
>
> * __Equity securities__ represent ownership in a corporation, entitling shareholders to a portion of the company's profits and assets, with the potential for capital gains and voting rights, while exposing them to the risks associated with the company's performance.
> * The __log return__ of a stock price is defined as the natural logarithm of the ratio of the stock's price at the end of a period to its price at the beginning of that period, providing a measure of continuously compounded returns that can be easily aggregated over time.
> * __Stylized facts__ refer to empirical statistical properties of asset returns across various financial markets, such as heavy tails, volatility clustering, and the lack of autocorrelation. These serve as essential guidelines for developing and validating financial models.

Equity securities are what everyone thinks of when they think of the stock market. This is a cool topic area, so let's get started!
___

## Examples
Today, we will be using the following example(s) to illustrate key concepts:

* > [▶ Singular Value Decomposition (SVD) of the Daily Growth Rate of the S&P500](CHEME-5660-L4a-SVD-SP500-Example-Fall-2025.ipynb). In this example, we decompose the daily growth rate values for components of the S&P500 using SVD. Singular value decomposition helps us break down complex data into simpler, interpretable components. What patterns will we find in the daily growth rates of S&P500 stocks?

* > [▶ Computing the Stylized Facts of Common Stocks and ETFs](./CHEME-5660-L4a-StylizedFacts-Example-Fall-2025.ipynb). Compute stylized facts for stocks and ETFs, including S&P 500 ETF (SPY) and NASDAQ 100 ETF (QQQ), using 10 years of data to analyze log returns and their statistical properties.
___

<div>
    <center>
        <img src="figs/Fig-RiskReward-Asset-Schematic.svg" width="800"/>
    </center>
</div>

## Exchanges, Stocks and ETFs
Before diving into the analysis of returns and risk, let's establish a foundation by defining the key players and instruments in equity markets.

### What is an Exchange?
An __exchange__ is an organized marketplace where securities, commodities, derivatives, and other financial instruments are traded between buyers and sellers. In the context of equity securities, exchanges serve as the infrastructure that facilitates the buying and selling of stocks and ETFs.

> __Key Functions of an Exchange:__
> * __Price Discovery and Liquidity__: Exchanges aggregate buy and sell orders to determine fair market prices through supply and demand while bringing together many buyers and sellers to ensure efficient trading. In the U.S., this process is enhanced by the __National Best Bid and Offer (NBBO)__ system, which consolidates the best available bid and ask prices across all exchanges and market centers for each security.
> * __Market Regulation__: Exchanges enforce trading rules, listing standards, and disclosure requirements to maintain fair and orderly markets.
> * __Settlement and Clearing__: Exchanges, working with clearinghouses, facilitate the transfer of securities and funds between parties to complete transactions.

The United States has __16 registered national securities exchanges__ as of 2025, with the three largest being the [New York Stock Exchange (NYSE)](https://www.nyse.com/), [NASDAQ](https://www.nasdaq.com/), and [Cboe BZX Exchange](https://www.cboe.com/). 

When you trade, your order is not confined to a single venue. Brokers execute it against the best prices quoted across all of these exchanges, which are summarized by the __National Best Bid and Offer (NBBO)__ system.

> __NBBO and Market Structure:__
> The NBBO represents the highest bid price and lowest ask price available across all U.S. exchanges and Alternative Trading Systems (ATS). For example, if Stock `XYZ` has a bid of 50.00 on NYSE and 50.01 on NASDAQ, while the ask is 50.05 on NYSE and 50.04 on NASDAQ, the NBBO would be 50.01 × 50.04. Brokers must either match or improve upon these prices when executing customer orders. This system promotes competition between trading venues and helps ensure fair pricing for all market participants.
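
As a rough illustration, the NBBO calculation in the example above can be sketched in a few lines of Python. The venue names and quotes are the hypothetical values from the example, and the `nbbo` helper is an illustrative name, not part of any market-data API.

```python
# Hypothetical quotes for stock XYZ: venue -> (bid, ask), taken from the example above
quotes = {
    "NYSE": (50.00, 50.05),
    "NASDAQ": (50.01, 50.04),
}

def nbbo(quotes):
    """Return (best_bid, best_ask): the highest bid and the lowest ask across venues."""
    best_bid = max(bid for bid, _ in quotes.values())
    best_ask = min(ask for _, ask in quotes.values())
    return best_bid, best_ask

best_bid, best_ask = nbbo(quotes)
spread = best_ask - best_bid  # width of the national best bid-ask spread
```

Here the best bid comes from NASDAQ and the best ask also comes from NASDAQ, but in general the two sides of the NBBO can come from different venues.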

Other notable exchanges include IEX, NYSE Arca, NASDAQ BX, and various Cboe-operated venues, all competing to provide the best execution for different types of securities and trading strategies.

### What is a Stock?
A __stock__ (also called a __share__ or __equity security__) represents fractional ownership in a corporation. When you purchase stock, you become a shareholder and own a piece of that company.

> __Key Characteristics of Stocks:__
> * __Ownership Rights__: Stockholders typically have voting rights on major corporate decisions, such as electing the board of directors.
> * __Dividend Potential__: Some stocks pay dividends, which are periodic distributions of company profits to shareholders.
> * __Capital Appreciation__: Stock prices can increase over time, providing capital gains when sold at a higher price than purchased.
> * __Limited Liability__: Shareholders' potential losses are limited to their investment amount; they are not personally liable for company debts.
> * __Residual Claims__: In the event of liquidation, stockholders have claims on company assets only after all creditors and bondholders are paid.

__Types of Common Stocks:__
Stocks are categorized as __growth__ (fast-growing earnings), __value__ (trading below intrinsic value), __dividend__ (regular profit distributions), or __blue-chip__ (large, established companies).

### What is an ETF?
An __Exchange-Traded Fund (ETF)__ is an investment fund that trades on stock exchanges like individual stocks but holds a diversified portfolio of underlying assets such as stocks, bonds, commodities, or other securities.

> __Key Characteristics of ETFs:__
> * __Diversification__: ETFs typically hold dozens or hundreds of different securities, reducing concentration risk.
> * __Transparency__: ETF holdings are disclosed daily, allowing investors to know exactly what assets they own.
> * __Liquidity__: ETFs can be bought and sold throughout trading hours at market prices, unlike mutual funds which price once daily.
> * __Lower Costs__: ETFs generally have lower expense ratios compared to actively managed mutual funds.
> * __Tax Efficiency__: ETFs typically generate fewer taxable events than mutual funds due to their structure.

__Popular ETF Categories:__ The most common types include index ETFs that track market indices like the S&P 500 ([SPY](https://finance.yahoo.com/quote/SPY)) or NASDAQ 100 ([QQQ](https://finance.yahoo.com/quote/QQQ)), sector ETFs focused on specific industries such as technology or healthcare, and international ETFs providing exposure to foreign markets. Additionally, investors can access bond ETFs for fixed-income exposure and commodity ETFs that track prices of gold, oil, or agricultural products.

However, the key for us today is that ETFs trade like stocks on exchanges, making them a convenient way to gain diversified exposure to various asset classes. 
___

## Reward
At the core of every investment is the trade-off between risk and reward. 

> __No Free Lunch:__ The higher the risk, the higher the potential reward, but also the higher the potential loss. __There is no free lunch!__ But how do we measure this risk and reward in practice?

We measure investment __reward__ using a concept called __return__. We use a variation of the log return called the __continuously compounded growth rate__ (CCGR).

> __Continuously Compounded Growth Rate (CCGR)__
>
> Let's assume the share price of firm $i$ is governed by an expression of the form:
>$$
\begin{align*}
S^{(i)}_{j} &= S^{(i)}_{j-1}\;\exp\left(\mu^{(i)}_{j,j-1}\Delta{t}_{j}\right)
\end{align*}
$$
> where $S^{(i)}_{j-1}$ denotes the share price of firm $i$ at time index $j-1$, $S^{(i)}_{j}$ denotes the share price of firm $i$ at time index $j$, and $\Delta{t}_{j} = t_{j} - t_{j-1}$ denotes the time step length (units: years) between time index $j-1$ and $j$. The value $\mu^{(i)}_{j,j-1}$ (units: inverse years) is the continuously compounded growth rate (CCGR) of firm $i$ between time index $j-1$ and $j$. The product $\mu^{(i)}_{j,j-1}\;\Delta{t}_{j}$ is the __log return__ of the stock price over the time period $\Delta{t}_{j}$.


> __Why use continuous growth rate?__ Unlike a raw log return, the CCGR is annualized by default since our convention is to measure time in years. Continuous compounding also makes aggregation simple: log returns over consecutive periods add, so a multi-period log return is the sum of the single-period log returns, which is essential for long-term investment analysis.

We can estimate the growth rate $\mu$ (and a closely related quantity called the excess growth rate, or $\bar{\mu}$) from the stock price at time $t-1$ and $t$:
$$
\begin{align*}
S_{t} &= S_{t-1} \cdot e^{\mu\Delta{t}}\quad\Longrightarrow\text{solve for}\;\mu\\
\mu &= \underbrace{\left(\frac{1}{\Delta{t}}\right)\ln\left(\frac{S_{t}}{S_{t-1}}\right)}_{\text{growth rate}}\quad\blacksquare \\
\end{align*}
$$
The sign of the growth rate $\mu$ indicates whether the stock price increased or decreased over the time period $\Delta{t}$:
> __Sign of the growth rate__
> * If $\mu > 0$, the stock price _increased_, i.e., $S_{t} > S_{t-1}$ over the time period $\Delta{t}$.
> * If $\mu < 0$, the stock price _decreased_, i.e., $S_{t} < S_{t-1}$ over the time period $\Delta{t}$.
> * If $\mu = 0$, the stock price _did not change_, i.e., $S_{t} = S_{t-1}$ over the time period $\Delta{t}$.
>
> Because $\Delta{t}>0$, the sign of $\mu$ is the same as the sign of the return.
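
A minimal Python sketch of the growth rate formula follows. The prices are hypothetical, and the choice of $\Delta{t} = 1/252$ (one trading day, assuming 252 trading days per year) is an assumption made for illustration. The sketch also checks that log returns over consecutive periods add up to the log return over the combined period.

```python
import math

def growth_rate(s_prev, s_curr, dt):
    """Continuously compounded growth rate (units: 1/years) between two prices."""
    return math.log(s_curr / s_prev) / dt

dt = 1.0 / 252.0               # assumption: 252 trading days per year
prices = [100.0, 102.0, 99.0]  # hypothetical daily closing prices (USD/share)

mu_1 = growth_rate(prices[0], prices[1], dt)  # price rose, so mu_1 > 0
mu_2 = growth_rate(prices[1], prices[2], dt)  # price fell, so mu_2 < 0

# The log return over each period is mu * dt; log returns add across periods
two_day_log_return = mu_1 * dt + mu_2 * dt
direct_log_return = math.log(prices[2] / prices[0])
```

Because the growth rates are annualized, even a modest daily move produces a large value of $\mu$ in magnitude; multiplying by $\Delta{t}$ recovers the daily log return.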

Often, we want the growth rate _relative to a benchmark_ (e.g., a risk-free asset). The __excess growth rate__ $\bar{\mu}$ is:
$$
\begin{align*}
\mu - \bar{r} &= \left(\frac{1}{\Delta{t}}\right)\ln\left(\frac{S_{t}}{S_{t-1}}\right) - \bar{r}\quad\Longrightarrow\;\bar{\mu} = \mu - \bar{r} \\
\bar{\mu} &= \underbrace{\left(\frac{1}{\Delta{t}}\right)\ln\left(\frac{S_{t}}{S_{t-1}}\right) - \bar{r}}_{\text{excess growth rate}}\quad\blacksquare \\
\end{align*}
$$
where $\bar{\mu}$ is the continuously compounded excess growth rate (CCEGR) and $\bar{r}$ is the continuously compounded risk-free rate (units: inverse years). Before examining estimation methods, consider this example:

> __Example__
>
> Suppose we invested 100 USD in a stock with share price $S_0$ USD/share and held it for one year (i.e., $\Delta{t} = 1$). Alternatively, we could have invested the same amount (i.e., 100 USD) in a (hypothetical) continuously compounding zero-coupon risk-free asset, e.g., a STRIPS bond with a discount rate of $\bar{r}$. At the end of the year, the share price of the stock is $S_1$ USD/share.
>
> * If $\bar{\mu} > 0$, then the capital gain from the stock price (i.e., the increase of the stock price) is greater than what would have been returned if we had invested in a zero-coupon risk-free asset held for the same time period.
> * If $\bar{\mu} < 0$, then the capital change from the stock price is less than what would have been gained if we had invested in a zero-coupon risk-free asset held for the same time period.
> * If $\bar{\mu} = 0$, then the capital change from the stock price is equal to what would have been gained if we had invested in a zero-coupon risk-free asset held for the same time period.
>
> Thus, the continuously compounded excess growth rate $\bar{\mu}$ is a measure of the stock price's performance relative to a _benchmark_ risk-free investment over the same time period.
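
To make the comparison concrete, here is a small numerical sketch in Python. The share prices and the 5% risk-free rate are hypothetical values chosen only for illustration.

```python
import math

S0, S1 = 100.0, 108.0  # hypothetical share prices at the start and end of the year
r_bar = 0.05           # hypothetical risk-free rate (units: 1/years)
dt = 1.0               # holding period (units: years)

mu = math.log(S1 / S0) / dt  # growth rate of the stock
mu_bar = mu - r_bar          # excess growth rate relative to the risk-free asset

# Terminal value of 100 USD placed in each alternative
wealth_stock = 100.0 * (S1 / S0)
wealth_riskfree = 100.0 * math.exp(r_bar * dt)
```

With these numbers the excess growth rate is positive, and the stock position ends the year worth more than the risk-free position, in line with the first case in the example.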

### Algorithm
Let's examine some pseudocode to estimate the continuously compounded growth rate $\mu$ and the excess growth rate $\bar{\mu}$ from a time series of stock prices $\left\{S_1,S_2,\ldots,S_T\right\}$. If we have $T$ days (minutes, hours, etc.) of stock prices, then we can estimate $T-1$ continuously compounded growth rate $\mu$ values.

__Initialization:__ Given a price dataset $\mathcal{D}_{i} = \left\{S^{(i)}_1,S^{(i)}_2,\ldots,S^{(i)}_T\right\}$ for firm $i$, the time step $\Delta{t}$ (units: years) of the data, and the risk-free growth rate $\bar{r}$ (units: inverse years).

For each time step $t = 2,3,\ldots,T$ __do__:
1. __Lookup__ the share price at time $t$ from dataset $\mathcal{D}_{i}$: $S^{(i)}_t\gets\mathcal{D}_{i}[t]$.
2. __Lookup__ the share price at time $t - 1$ for firm $i$ from dataset $\mathcal{D}_{i}$: $S^{(i)}_{t-1}\gets\mathcal{D}_{i}[t - 1]$.
3. __Compute__ the continuously compounded excess growth rate $\bar{\mu}$:
   $$\bar{\mu}_t^{(i)} \gets \left(\frac{1}{\Delta{t}}\right)\ln\left(\frac{S^{(i)}_t}{S^{(i)}_{t-1}}\right) - \bar{r}$$

   > __Alternative:__
   > If you want to compute the continuously compounded growth rate $\mu$ instead of the excess growth rate $\bar{\mu}$, then set $\bar{r} = 0$.
4. __Store__ the continuously compounded excess growth rate $\bar{\mu}_t^{(i)}$ in a list or array for firm $i$.

The (excess) growth rate samples $\left\{\bar{\mu}_2^{(i)},\bar{\mu}_3^{(i)},\ldots,\bar{\mu}_T^{(i)}\right\}$ can then be used to analyze the patterns and statistical properties of the stock returns.
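
The loop above can be sketched in Python as follows. This is a minimal illustration rather than the course implementation: the function name `excess_growth_rates`, the sample prices, and the 252-trading-day convention are all assumptions.

```python
import math

def excess_growth_rates(prices, dt, r_bar=0.0):
    """Return [mu_bar_2, ..., mu_bar_T] for a price series [S_1, ..., S_T].

    Setting r_bar = 0 gives the plain growth rate mu instead of the excess rate.
    """
    if len(prices) < 2:
        raise ValueError("need at least two prices to compute a growth rate")
    rates = []
    for t in range(1, len(prices)):  # zero-based index; t = 1 is time index 2
        s_prev = prices[t - 1]       # lookup the price at the previous time index
        s_curr = prices[t]           # lookup the price at the current time index
        rates.append(math.log(s_curr / s_prev) / dt - r_bar)  # compute and store
    return rates

dt = 1.0 / 252.0                                # assumption: daily data, 252 trading days per year
prices = [100.0, 101.5, 100.8, 102.3, 101.9]    # hypothetical daily closes
mu = excess_growth_rates(prices, dt)            # growth rates (r_bar = 0)
mu_bar = excess_growth_rates(prices, dt, 0.05)  # excess growth rates
```

A series of $T = 5$ prices yields $T - 1 = 4$ growth rate samples, and each excess rate is the corresponding growth rate shifted down by $\bar{r}$.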

> __Example__
>
> [▶ Singular Value Decomposition (SVD) of the Daily Growth Rate of the S&P500](CHEME-5660-L4a-SVD-SP500-Example-Fall-2025.ipynb). In this example, we decompose the daily growth rate values for components of the S&P500 using SVD. Singular value decomposition helps us break down complex data into simpler, interpretable components. What patterns will we find in the daily growth rates of S&P500 stocks?
___

## Risk
The risk of an investment is the uncertainty associated with the potential for loss or gain. 
> __Risk:__ We quantify risk using a variety of statistical measures, but the most common is the __standard deviation__ of the continuously compounded growth rate $\mu$, which we denote as $\sigma_{\mu}$ (units: inverse years). The standard deviation is a measure of the dispersion or variability of the returns around their mean.

Suppose we have $T$ price observations. From these observations, we can construct a time series of length $T-1$ of continuously compounded growth rates $\left\{\mu_2,\mu_3,\ldots,\mu_{T}\right\}$ for a stock. The sample standard deviation of the continuously compounded growth rate (our risk measure) is given by:
$$
\boxed{
\begin{align*}
\sigma_{\mu} &= \underbrace{\sqrt{\frac{1}{T - 2}\sum_{t=2}^{T}\left(\mu_t - \mu^{\prime}\right)^2}}_{\text{volatility of the growth rate}} \\
\end{align*}}
$$
where $\mu^{\prime} = \frac{1}{T-1}\sum_{t=2}^{T}\mu_t$ is the sample mean of the $T-1$ growth observations. The denominator $T-2$ represents the degrees of freedom for the sample standard deviation: $(T-1) - 1 = T-2$. We'll typically compute the standard deviation using [the `std(...)` method exported by the `Statistics.jl` package](https://docs.julialang.org/en/v1/stdlib/Statistics/#Statistics.std).
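
As a quick check of the formula, the Python sketch below computes the sample standard deviation by hand and compares it with `statistics.stdev` from the Python standard library, which also divides by the number of samples minus one. The growth rate values are hypothetical.

```python
import math
import statistics

mu = [0.12, -0.05, 0.30, -0.22, 0.08]  # hypothetical growth rates: T - 1 = 5 samples
n = len(mu)                            # n = T - 1
mu_mean = sum(mu) / n                  # sample mean of the growth rates

# Divide by n - 1 = (T - 1) - 1 = T - 2, matching the boxed formula
sigma_manual = math.sqrt(sum((m - mu_mean) ** 2 for m in mu) / (n - 1))
sigma_library = statistics.stdev(mu)   # sample standard deviation (n - 1 denominator)
```

The two values agree to floating-point precision, which confirms that the $T-2$ denominator is the usual sample (degrees-of-freedom corrected) convention applied to $T-1$ observations.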

### Other Risk Measures
While standard deviation (volatility) is the most commonly used risk measure, it's important to note that risk can be quantified in many ways, each capturing different aspects of uncertainty:

* __Value at Risk (VaR)__: The maximum potential loss over a specific time period at a given confidence level (e.g., 5% chance of losing more than X USD in one day).
* __Conditional Value at Risk (CVaR)__: Also known as Expected Shortfall, it measures the average loss beyond the VaR threshold, providing information about tail risk.
* __Maximum Drawdown__: The largest peak-to-trough decline in portfolio value, measuring the worst-case scenario an investor would have experienced.
* __Beta__: A measure of systematic risk that quantifies how much an asset's returns move relative to the overall market. We'll for sure see $\beta$ in __many__ future lectures and exercises.
* __Downside Deviation__: Similar to standard deviation but only considers negative returns, focusing on the risk of losses rather than overall volatility.
* __Tracking Error__: The standard deviation of the difference between an investment's returns and its benchmark, measuring relative risk.

The choice of risk measure depends on the specific application, investor preferences, and the aspects of risk that are most relevant to the investment decision. In this course, we'll primarily focus on standard deviation due to its mathematical tractability, but we'll see a few of these alternative risk measures in future lectures.
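
To give a feel for two of these alternatives, here is a hedged Python sketch of maximum drawdown and a simple historical (empirical) VaR and CVaR. Quantile conventions for VaR differ between sources; the tail-count rule used here is one simple choice and an assumption of this sketch, as are the function names and the sample data.

```python
def max_drawdown(values):
    """Largest peak-to-trough decline, as a fraction of the running peak."""
    peak = values[0]
    worst = 0.0
    for v in values:
        peak = max(peak, v)                       # running maximum so far
        worst = max(worst, (peak - v) / peak)     # drawdown from that peak
    return worst

def historical_var_cvar(returns, level=0.95):
    """Empirical VaR and CVaR, reported as positive losses (one simple convention)."""
    losses = sorted(-r for r in returns)              # losses in ascending order
    k = max(1, round((1 - level) * len(losses)))      # number of tail observations
    tail = losses[-k:]                                # the k largest losses
    return tail[0], sum(tail) / len(tail)             # VaR threshold, mean tail loss

values = [100.0, 120.0, 90.0, 110.0, 80.0, 130.0]     # hypothetical portfolio values
returns = [0.01, -0.02, 0.03, -0.05, 0.00, 0.02, -0.01, 0.04, -0.03, 0.01]  # hypothetical

mdd = max_drawdown(values)
var_80, cvar_80 = historical_var_cvar(returns, level=0.80)
```

In this toy data the worst drawdown runs from the peak of 120 down to 80, and the CVaR is at least as large as the VaR because it averages the losses at and beyond the VaR threshold.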

___

## Stylized Facts of Equity Returns
Analysts and investors gain insights into market behavior, risk, and investment opportunities by examining returns and their properties (stylized facts). While many stylized facts have been documented, let’s consider the following:

* __Heavy (also called fat) tailed distribution__: Stock returns often exhibit a distribution with fatter tails than would be expected under a normal distribution. This means that extreme price movements are more likely than would be predicted by a normal distribution.
* __Absence of Autocorrelation__: Autocorrelation measures how strongly returns correlate with their own past values. Significant autocorrelation would suggest predictability in returns, which traders could exploit. Empirically, returns show little linear autocorrelation, which is consistent with prices following a random walk (with occasional jumps) whose increments are uncorrelated.
* __Volatility clustering__: Stock returns tend to be more volatile during specific periods and less volatile during others. This phenomenon is known as volatility clustering: large price movements are more likely to be followed by other large moves, and small moves by other small moves.

These are not the only stylized facts of equity returns, but they are among the most widely recognized and studied. For more details, see the following references:
* Cont, R. (2001). Empirical properties of asset returns: stylized facts and statistical issues. *Quantitative Finance*, 1(2), 223-236. [Link](https://doi.org/10.1080/713665670)
* Mandelbrot, B. (1963). The Variation of Certain Speculative Prices. The Journal of Business, 36(4), 394–419. [Link](http://www.jstor.org/stable/2350970)

Let's look at each of these stylized facts in more detail.

### Heavy Tailed Return Distributions
Heavy-tailed distributions have a probability of extreme deviations that decays more slowly than exponentially, so large shocks are rare but much more likely than under a normal distribution.

> __Black swan events__: Heavy tails in stock-return distributions make extreme jumps far more frequent than normal distributions predict. These _black swans_, such as the 1987 crash, the 2008 crisis, the COVID-19 pandemic, or the April 2025 tariff sell-offs, arise from systemic failures, sudden news, earnings surprises, or geopolitical shocks.

A **heavy-tailed distribution** has tails that are not exponentially bounded. A common model for a random variable $X$ is a power-law (regularly varying) tail, whose probabilities decay polynomially rather than exponentially:
$$
\mathbb{P}(|X| > x) \sim L(x)\,x^{-\alpha}, \quad \text{as } x \to \infty,
$$
where $\alpha > 0$ is the **tail index** (controlling decay rate) and $L(x)$ is a **slowly varying function** at infinity: $\lim_{x \to \infty} \frac{L(tx)}{L(x)} = 1$ for all $t > 0$.

> __Tail index__: 
> The tail index $\alpha$ is a key parameter in characterizing the heaviness of the tail. It determines how quickly the tail probabilities decay. 
> A smaller $\alpha$ indicates a heavier tail, while a larger $\alpha$ indicates a lighter tail. Here's a guide:
> * If $\alpha < 2$, the variance is infinite and the distribution is considered **heavy-tailed** in the strict sense.
> * If $\alpha < 1$, even the mean is infinite and the distribution is **very heavy-tailed**; the Cauchy distribution sits at the boundary with $\alpha = 1$.
> * If $\alpha > 4$, the kurtosis is finite and the distribution is often treated as **effectively light-tailed**.
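
To see where these thresholds come from, here is a short sketch that assumes a pure power-law tail, i.e., $L(x)\approx c$ for some constant $c>0$ beyond a cutoff $x_0>0$ (a simplifying assumption, not a general proof). For $p>0$, the tail contribution to the $p$-th absolute moment $\mathbb{E}\left[|X|^{p}\right]$ is:
$$
\begin{align*}
\int_{x_0}^{\infty} p\,x^{p-1}\,\mathbb{P}(|X| > x)\,dx &\approx c\,p\int_{x_0}^{\infty} x^{p-1-\alpha}\,dx
\end{align*}
$$
which is finite only when $p < \alpha$. Under this assumption, a finite mean ($p=1$) requires $\alpha > 1$, a finite variance ($p=2$) requires $\alpha > 2$, and a finite kurtosis ($p=4$) requires $\alpha > 4$.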

#### Hill's estimator of the tail index
Hill's estimator is a method for estimating the tail index $\alpha$ of a heavy-tailed distribution. 
Let $X_1, X_2, \dots, X_n$ be i.i.d. realizations of a heavy-tailed random variable $X$, and let $X_{(1)} \ge X_{(2)} \ge \cdots \ge X_{(n)}$ denote the **order statistics** (sorted in decreasing order). Fix $k < n$, the number of **upper order statistics** to use (typically with $k \ll n$), then the **Hill estimator** of the **tail index $\alpha$** is:
$$
\boxed{
\hat{\alpha}_k = \left[ \frac{1}{k} \sum_{i=1}^{k} \ln\left(\frac{X_{(i)}}{X_{(k+1)}}\right) \right]^{-1}.
}
$$

The choice of $k$ is crucial, as it balances bias and variance in the estimate; we'll typically look at a range of $k$ values and plot the estimates to find a stable region.
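
A minimal Python sketch of the Hill estimator follows. It assumes the observations are positive (for returns, one would typically apply it to absolute values or to losses, which is an assumption of this sketch), and it tests the estimator on synthetic Pareto samples with a known tail index. The function name, seed, sample size, and choice of $k$ are illustrative.

```python
import math
import random

def hill_estimator(samples, k):
    """Hill estimate of the tail index from the k largest (positive) observations."""
    x = sorted(samples, reverse=True)  # X_(1) >= X_(2) >= ... >= X_(n)
    if not 0 < k < len(x):
        raise ValueError("k must satisfy 0 < k < n")
    threshold = x[k]                   # X_(k+1) in the one-based notation above
    if threshold <= 0:
        raise ValueError("the k+1 largest observations must be positive")
    mean_log_excess = sum(math.log(x[i] / threshold) for i in range(k)) / k
    return 1.0 / mean_log_excess

random.seed(5660)                      # fixed seed so the sketch is reproducible
alpha_true = 3.0                       # tail index of the synthetic data
data = [random.paretovariate(alpha_true) for _ in range(20000)]
alpha_hat = hill_estimator(data, k=2000)
```

For exact Pareto data the estimate should land near the true tail index; for real returns the estimate depends on $k$, which is why scanning a range of $k$ values for a stable region is the usual practice.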
___

### Autocorrelation
Autocorrelation is a key concept in time series analysis, particularly in the context of financial returns. Suppose we have the growth rate series $\left\{\mu_2^{(i)},\mu_3^{(i)},\ldots,\mu_T^{(i)}\right\}$ for firm $i$. The (empirical) autocorrelation sequence (where we neglect the firm superscript for simplicity) is given by:
$$
\hat{R}(k) = \frac{1}{T-1-k}\sum_{t=2}^{T-k}\left(\mu_t - \mu^{\prime}\right)\left(\mu_{t+k} - \mu^{\prime}\right)\quad\;k=0,1,\ldots,T-2
$$
where $k$ is the lag (units: time steps) and $\mu^{\prime}$ is the mean of the continuously compounded growth rates over the $T - 1$ samples. The autocorrelation function (ACF), which is the normalized autocorrelation sequence, is given by:
$$
\boxed{
\rho(k) = \frac{\hat{R}(k)}{\hat{R}(0)}\quad\;k=0,1,\ldots,T-2
}
$$
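The population ACF has some nice properties: $|\rho(k)| \leq 1$ for all $k$, and $\rho(0) = 1$ by definition. The sample estimate above can stray slightly outside $[-1, 1]$ at large lags, where the $1/(T-1-k)$ normalization averages only a few terms. Additionally, for a stationary process, we expect the ACF to decay to zero as $k$ increases, indicating that past returns have less influence on future returns over time.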
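
The autocorrelation formulas above translate almost directly into code. The Python sketch below follows the $1/(T-1-k)$ normalization used in this section; the function names and the toy series are illustrative assumptions.

```python
def autocovariance(mu, k):
    """Empirical R(k) for a growth rate series, normalized by the number of terms."""
    n = len(mu)                    # n = T - 1 growth rate samples
    mean = sum(mu) / n             # sample mean over all n samples
    terms = [(mu[t] - mean) * (mu[t + k] - mean) for t in range(n - k)]
    return sum(terms) / (n - k)    # the 1/(T-1-k) normalization

def acf(mu, max_lag):
    """Normalized autocorrelation rho(k) = R(k)/R(0) for k = 0, ..., max_lag."""
    r0 = autocovariance(mu, 0)
    return [autocovariance(mu, k) / r0 for k in range(max_lag + 1)]

# Hypothetical, perfectly alternating series: strongly anti-correlated at lag 1
mu = [1.0, -1.0, 1.0, -1.0, 1.0, -1.0]
rho = acf(mu, max_lag=2)
```

For this alternating series the lag-one value is at its negative extreme and the lag-two value is at its positive extreme, whereas for a series consistent with a random walk the values at lags $k > 0$ would hover near zero.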

> __Random Walk:__ In the famous book [A Random Walk Down Wall Street](https://en.wikipedia.org/wiki/A_Random_Walk_Down_Wall_Street), Burton Malkiel argued that stock prices follow a random walk, meaning that past price movements do not predict future price movements. In this case, the autocorrelation function (ACF) of stock returns would be close to zero for all lags $k > 0$.

If the random walk hypothesis holds, then the autocorrelation function (ACF) of the continuously compounded growth rates $\left\{\mu_2^{(i)},\mu_3^{(i)},\ldots,\mu_T^{(i)}\right\}$ should be close to zero for all lags $k > 0$ and all firms $i$. In other words, there should be no significant correlation between past and future returns.

___

### Volatility Clustering
Volatility clustering refers to the tendency for large (in absolute value) returns to be followed by large returns, and small returns by small ones, creating persistent periods of high and low volatility.

> __Why volatility clustering matters:__ Dynamic risk must be modeled to avoid misestimated risk measures (like Value at Risk), derivative prices, and portfolio allocations. Incorporating clustering improves risk forecasts, option valuations, hedging strategies, and capital allocation decisions.


Let firm $i$ have growth rate series: $\bigl\{\mu_{2}^{(i)},\,\mu_{3}^{(i)},\,\dots,\,\mu_{T}^{(i)}\bigr\},$
and let
$$
\overline{(\mu^2)_{i}} 
\;=\;\frac{1}{T-1}\sum_{t=2}^T\bigl(\mu_t^{(i)}\bigr)^2
$$
be the sample mean of the squared growth rates. Define the normalized empirical autocorrelation of squared returns at lag $k$ as

$$
\boxed{
\hat{\rho}_i(k)
=\frac{\displaystyle
   \overbrace{\sum_{t=2}^{\,T-k}
     \Bigl[\bigl(\mu_t^{(i)}\bigr)^2-\overline{(\mu^2)_{i}}\Bigr]
     \,\Bigl[\bigl(\mu_{t+k}^{(i)}\bigr)^2-\overline{(\mu^2)_{i}}\Bigr]
   }^{\hat R_i(k)\;\text{with squared growth rates}}
}{\displaystyle
   \underbrace{\sum_{t=2}^{\,T}
     \Bigl[\bigl(\mu_t^{(i)}\bigr)^2-\overline{(\mu^2)_{i}}\Bigr]^{2}
   }_{\hat R_i(0)\;\text{with squared growth rates}}
}\,.}
$$

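Volatility clustering is observed when $\hat{\rho}_i(k)>0$ for many $k$ and typically $\hat{\rho}_i(k)\searrow0$ slowly as $k\to\infty$. Note that this ratio of sums omits the $1/(T-1-k)$ and $1/(T-1)$ prefactors used in the definition of $\hat{R}(k)$ in the autocorrelation section, so the two conventions differ by a factor of $(T-1-k)/(T-1)$, which is close to one when $k \ll T$.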
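
The boxed expression can be sketched in Python as a ratio of sums over the squared growth rates. The function name and the toy series (a calm regime followed by a turbulent one) are hypothetical and chosen only to make the clustering effect visible.

```python
def squared_acf(mu, k):
    """Lag-k autocorrelation of squared growth rates (ratio-of-sums form)."""
    sq = [m * m for m in mu]              # squared growth rates
    mean_sq = sum(sq) / len(sq)           # sample mean of the squared growth rates
    dev = [s - mean_sq for s in sq]       # deviations from that mean
    numerator = sum(dev[t] * dev[t + k] for t in range(len(dev) - k))
    denominator = sum(d * d for d in dev)
    return numerator / denominator

# Hypothetical series: four small moves followed by four large moves
mu = [0.1, -0.1, 0.1, -0.1, 1.0, -1.0, 1.0, -1.0]
rho_sq_1 = squared_acf(mu, 1)
```

Although the signs of the growth rates alternate, their magnitudes are persistent, so the lag-one autocorrelation of the squared series is clearly positive (0.625 for this toy series).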

### Example

> [▶ Computing the Stylized Facts of Common Stocks and ETFs](./CHEME-5660-L4a-StylizedFacts-Example-Fall-2025.ipynb). Let's compute the stylized facts for stocks and ETFs, including S&P 500 ETF (SPY) and NASDAQ 100 ETF (QQQ), using 10 years of daily open, high, low, close (OHLC), volume, and volume weighted average price (VWAP) data to analyze log returns and their statistical properties.

___

## Disclaimer and Risks
__This content is offered solely for training and informational purposes__. No offer or solicitation to buy or sell securities or derivative products or any investment or trading advice or strategy is made, given, or endorsed by the teaching team. 

__Trading involves risk__. Carefully review your financial situation before investing in securities, futures contracts, options, or commodity interests. Past performance, whether actual or indicated by historical tests of strategies, is no guarantee of future performance or success. Trading is generally inappropriate for someone with limited resources, investment or trading experience, or a low-risk tolerance. Risk only capital that is not required for living expenses.

__You are fully responsible for any investment or trading decisions you make__. Such decisions should be based solely on evaluating your financial circumstances, investment or trading objectives, risk tolerance, and liquidity needs.

___