# Forecasting Returns 


## Risk Premia Across Assets

The **risk premium** of an asset, $i$, is defined as the **expected excess return**:  
$$\mathbb{E} [\tilde{r}^i]$$

### Key Points:
- **Linear Factor Models (LFMs):**  
  LFMs describe how risk premia vary across assets.
- Variation in risk premia is often attributed to differences in risks.
- The **Factor Pricing Model (FPM)** also gives a forecast of returns, but it does so through asset pricing theory rather than through a direct statistical forecast.

---

## Risk Premia Over Time

How do risk premia evolve over time?  

$$\mathbb{E}_t [\tilde{r}^i_{t+1}]$$

### Two Possibilities:
1. **Time-Invariant Risk Premium:**  
   The expected excess return remains constant:  
   $$\mathbb{E}_t [\tilde{r}^i_{t+1}] = \mathbb{E} [\tilde{r}^i]$$


   _Example: CAPM_
    - The **CAPM** states that risk premia across assets $i$ are proportional (by beta) to the market risk premium.  
    - Beta and risk premia are often estimated as **stationary time series averages** and do not condition on time.
    - If the linear factor pricing model (LFPM) imposes forecast consistency across thousands of assets, we can think of it as disciplining the forecast from one asset to another to rule out arbitrage.
$$\mathbb{E} [\tilde{r}^i] = (\beta^{i,m}) \lambda_m$$


2. **Time-Varying Risk Premium:**  
   Risk premium depends on time-varying factors ("signals"), $x_t$:  
   $$\mathbb{E}_t [\tilde{r}^i_{t+1}] = f(\underbrace{x_t}_{\text{signals}})$$

   Signals might forecast returns at different horizons: $r_{t+1}$, $r_{t+12}$, ...

If you want to forecast individual securities, make sure the forecasts are consistent with the factor model across the universe of securities.
Alternatively, you might forecast a dozen or so securities that you think span the dynamics of the market premia, and then use the factor pricing model to forecast the returns of the other securities.
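
The idea of using a factor model to discipline forecasts across securities can be sketched with the CAPM case: estimate each asset's beta by a time-series regression, then scale the market risk premium by that beta. This is a minimal sketch on simulated data; the function name `capm_risk_premia`, the parameter values, and the use of the sample mean as $\lambda_m$ are assumptions for illustration.

```python
import numpy as np

def capm_risk_premia(asset_excess, market_excess):
    """Estimate CAPM betas and the implied risk premia beta * lambda_m.

    asset_excess:  (T, N) array of excess returns for N assets.
    market_excess: (T,) array of market excess returns.
    """
    asset_excess = np.asarray(asset_excess, dtype=float)
    market_excess = np.asarray(market_excess, dtype=float)
    # Time-series regression of each asset on the market (with intercept).
    X = np.column_stack([np.ones_like(market_excess), market_excess])
    coef, *_ = np.linalg.lstsq(X, asset_excess, rcond=None)
    betas = coef[1]                  # one slope per asset
    lambda_m = market_excess.mean()  # sample market risk premium (assumption)
    return betas, betas * lambda_m

# Simulated data, purely illustrative (not real returns).
rng = np.random.default_rng(0)
T = 600
market = 0.005 + 0.04 * rng.standard_normal(T)
true_betas = np.array([0.5, 1.0, 1.5])
assets = market[:, None] * true_betas + 0.02 * rng.standard_normal((T, 3))

betas, premia = capm_risk_premia(assets, market)
print("estimated betas:", betas)
print("implied risk premia:", premia)
```

Only one quantity, the market premium, is forecast here; every asset's premium then follows from its beta, which is what keeps the forecasts consistent across assets.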

---

## Modeling Time-Varying Risk Premia

To capture time variation, we specify a functional form for $f(x)$:  
$$\mathbb{E}_t [\tilde{r}^i_{t+1}] = f(x_t)$$

*Steps for Modeling:*
1. Identify the relevant factors, $x_t$.
2. Use a **linear function** for simplicity:
   - Linear regression provides the best linear approximation to $f(x)$ (in the least-squares sense).

---

## Measuring Conditional Expectations via Regression

### Linear Regression Model:  

Why use linear regression?
- Statistical power,
- Interpretability,
- Simplicity.

$$y = \alpha + \beta x + \epsilon$$

- A regression can be read as an estimate of a conditional expectation. The fitted line $\alpha + \beta x$ is the best linear predictor of $y$ given $x$, and it equals the conditional expectation when that expectation is linear in $x$:  
  $$\mathbb{E}[y \mid x] = \alpha + \beta x$$

- If $\beta \neq 0$, the conditional expectation varies with $x$.


### Forecasting Returns

A **forecasting regression** for returns:  
$$\tilde{r}_{t+1} = \alpha + \beta x_t + \epsilon_{t+1}$$

#### Key Implications:
- If $\beta \neq 0$, expected returns vary over time as $x_t$ changes:  
  $$\mathbb{E} [\tilde{r}_{t+1} \mid x_t] = \alpha + \beta x_t$$

- Similar regressions are used in LFMs to explore risk premia variations across assets.
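
A minimal sketch of the forecasting regression above, in Python with NumPy, on simulated data. The AR(1) signal, the parameter values, and the function name `forecasting_regression` are assumptions for illustration. The key detail is the timing: the return at $t+1$ is regressed on the signal observed at $t$.

```python
import numpy as np

def forecasting_regression(returns, signal):
    """OLS of r_{t+1} on x_t; returns (alpha, beta, r_squared)."""
    r = np.asarray(returns, dtype=float)
    x = np.asarray(signal, dtype=float)
    y = r[1:]        # r_{t+1}
    x_lag = x[:-1]   # x_t, known one period earlier
    X = np.column_stack([np.ones_like(x_lag), x_lag])
    (alpha, beta), *_ = np.linalg.lstsq(X, y, rcond=None)
    resid = y - (alpha + beta * x_lag)
    r_squared = 1.0 - resid.var() / y.var()
    return alpha, beta, r_squared

# Simulated signal and returns, purely illustrative.
rng = np.random.default_rng(1)
T = 2000
x = np.zeros(T)
for t in range(1, T):
    x[t] = 0.9 * x[t - 1] + 0.01 * rng.standard_normal()
r = np.empty(T)
r[0] = 0.0
r[1:] = 0.002 + 0.5 * x[:-1] + 0.04 * rng.standard_normal(T - 1)

alpha, beta, r2 = forecasting_regression(r, x)
forecast = alpha + beta * x[-1]  # conditional forecast of next period's return
print(f"beta = {beta:.3f}, R^2 = {r2:.3f}, forecast = {forecast:.4f}")
```

Even with a true slope of 0.5 in this simulation, the $R^2$ is small because the noise in returns dominates the variation in the conditional mean.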

**Reasons for forecasting / market timing:**
- Economic cycles.
- Valuation metrics.


### Forecasting Returns and Market Efficiency  

- Whether we can forecast returns with some signals has nothing to do with market efficiency.
- Forecasting returns based on business cycle signals can indicate when expected returns are higher or lower.

1. **Higher Returns = Higher Risk**:  
   - Forecasts tied to the business cycle may predict higher returns, but this is often due to taking on higher risks, such as tail risks.  
   - This is consistent with market efficiency: higher risk justifies higher expected returns.  

2. **Forecasting Returns ≠ Market Inefficiency**:  
   - The ability to forecast returns does not inherently imply market inefficiency.  
   - True inefficiency would involve forecasting returns **without** additional risk.  

3. **Efficient Market Interpretation**:  
   - An efficient market means you cannot forecast returns **without compensating for risk**.  
   - The classic misconception equates market efficiency with the inability to forecast returns altogether, which is incorrect.  

In essence, return predictability that reflects compensation for risk is consistent with efficient markets. What matters for efficiency is whether following the forecasting signal causes you to take on more or less risk; inefficiency would require forecastable excess returns without added risk exposure.

---

## Classic View of Risk Premia

### Assumptions:
- **Risk premia are constant over time.**
- Log prices follow a **random walk** (with drift), so log returns are a constant plus unpredictable noise:  
  $$\log P_{t+1} - \log P_t = \text{constant} + \epsilon_{t+1}$$

### Implications:
- Forecasting regressions yield $\beta = 0$.
- Price changes are unpredictable:  
  $$\mathbb{E}_t \left[\frac{P_{t+1}}{P_t}\right] = \text{constant}.$$

### Testing the Classic View

To test the classic view, regress tomorrow’s return on today’s return:  
$$\tilde{r}^m_{t+1} = a + \beta \tilde{r}^m_t + \epsilon_{t+1}$$
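
A minimal sketch of this auto-regression in Python. The data are simulated i.i.d. returns, so the classic view holds by construction; the function name `return_autoregression` and the parameter values are assumptions for illustration. With real data, the excess market return series would replace `iid_returns`.

```python
import numpy as np

def return_autoregression(excess_returns):
    """Regress r_{t+1} on r_t; returns (a, beta, r_squared)."""
    r = np.asarray(excess_returns, dtype=float)
    y, x = r[1:], r[:-1]
    beta = np.cov(x, y, ddof=1)[0, 1] / np.var(x, ddof=1)
    a = y.mean() - beta * x.mean()
    r_squared = np.corrcoef(x, y)[0, 1] ** 2
    return a, beta, r_squared

# Simulated i.i.d. monthly excess returns, purely illustrative.
rng = np.random.default_rng(2)
iid_returns = 0.005 + 0.045 * rng.standard_normal(1200)

a, beta, r2 = return_autoregression(iid_returns)
print(f"beta = {beta:.3f}, R^2 = {r2:.4f}")
```

Under the classic view, the estimated slope and $R^2$ should both be close to zero, with the intercept close to the average return.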

### Conclusions from the Return Auto-Regressions:

**The excess market return has a regression coefficient near zero, which fits the classic view of risk premia.**
- With a well-measured, publicly traded return, you will typically find $\beta \approx 0$ and a very small $R^2$: returns are not easy to forecast.
  - It is easier to forecast interest rates, macroeconomic variables, etc. than to forecast returns.
  - If $R^2$ is high, or $\beta$ is significantly different from zero, there is probably a problem with the model or with how the return is measured.
- High returns do not indicate particularly high or low returns going forward.
- The annual data estimates suggest stock returns, particularly excess returns, are i.i.d.
- The monthly data shows some autocorrelation, but not much explanatory power.
- Furthermore, trading costs would seem to make this small predictability a novelty of no economic importance.
- **Implication: High returns today do not predict future returns.**


---

## Other Types of Signals

Notwithstanding the **classic** view, asset managers use many signals to forecast returns with linear regression.

- **Macroeconomic signals**
- **Asset return signals**
- **Short-term signals** (forecast horizons)
- **Long-term signals** (forecast horizons)
- **Dividend-yield signals**

#### Examples:
- Cyclically adjusted price-earnings ratios.
- **Macroeconomic indicators** like investment or consumption.
- **Inflation** and **interest rates** (e.g., the "Fed Model").

---

## Dividend-Yield Forecasting

The **dividend-yield**, $\text{DP}_t$, refers to the **dividend-price ratio** and is defined as:  
$$\text{DP}_t = \frac{D_t}{\underbrace{P_t}_{\text{price ex-div}}}$$

- Related measures include the earnings-price and book-price ratios; like dividend-price, they compare a cash-flow or accounting value to price.
- For an individual stock, dividends are not paid continuously, but for the market index, there is a steady stream for analysis.


### Returns and Dividend-Yield

Stock returns are defined as:  
$$R_{t+1} \equiv \frac{P_{t+1} + D_{t+1}}{P_t}$$
$$R_{t+1} = \underbrace{\left( \frac{D_t}{P_t} \right)}_{\text{Dividend-yield}} \cdot \underbrace{\left( \frac{D_{t+1}}{D_t} \right)}_{\text{Dividend growth}} + \underbrace{\left( \frac{P_{t+1}}{P_t} \right)}_{\text{Price growth}}$$
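
The second line follows from splitting the payoff into its dividend and price parts, then multiplying and dividing the dividend term by $D_t$:

$$\frac{P_{t+1} + D_{t+1}}{P_t} = \frac{D_{t+1}}{P_t} + \frac{P_{t+1}}{P_t} = \frac{D_t}{P_t} \cdot \frac{D_{t+1}}{D_t} + \frac{P_{t+1}}{P_t}$$

As a check with made-up numbers, take $P_t = 100$, $D_t = 2$, $D_{t+1} = 2.1$, $P_{t+1} = 105$. Then $(105 + 2.1)/100 = 1.071$, and the decomposition gives $0.02 \cdot 1.05 + 1.05 = 1.071$.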

The identity holds for any horizon, $t+k$, and in expectation:
$$\mathbb{E}_t [R_{t,t+k}] = \text{DP}_t \cdot \mathbb{E}_t \left[ \frac{D_{t+k}}{D_t} \right] + \mathbb{E}_t \left[ \frac{P_{t+k}}{P_t} \right]$$

---

### Classic View of Dividend-Yield

- **Expected returns are constant:**  
  $$\mathbb{E}_t \left[ R_{t,t+k} \right] = \theta_r.$$

- **Price appreciation is a random walk:**  
  $$\mathbb{E}_t \left[ \frac{P_{t+k}}{P_t} \right] = \theta_p.$$

From this, the expected return can be expressed as:  
$$\theta_r = \text{DP}_t \cdot \mathbb{E}_t \left[ \frac{D_{t+k}}{D_t} \right] + \theta_p.$$

#### Key Insight:  
Under the classic view:  
- **An increase in the dividend-yield is offset by a decrease in expected dividend growth.**
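
Rearranging the expression for $\theta_r$ makes the offset explicit. Since $\theta_r$ and $\theta_p$ are constants under the classic view:

$$\mathbb{E}_t \left[ \frac{D_{t+k}}{D_t} \right] = \frac{\theta_r - \theta_p}{\text{DP}_t}$$

With positive dividends, $\theta_r - \theta_p > 0$, so expected dividend growth must move inversely with $\text{DP}_t$: a high dividend-yield today has to signal low dividend growth ahead, not high returns.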

#### Testing the Classic View - Results:  
_Is dividend-yield a good predictor of future returns?_  

**At a one-month horizon:**
- Slope coefficient is insignificant—both statistically and economically.  
- Agrees with the implications of the auto-regression.  
- Provides more evidence in support of the classic view.

**At longer horizons:**
- Coefficient is economically significant.
- At one year, a one-point increase in dividend-price forecasts a four-point increase in returns.

---

### Modern View of Dividend-Yield

**Empirical Observations:**
- Higher dividend-yield predicts higher expected returns:  
  $$\mathbb{E}_t [R_{t,t+k}] = \text{DP}_t \cdot \mathbb{E}_t \left[ \frac{D_{t+k}}{D_t} \right] + \mathbb{E}_t \left[ \frac{P_{t+k}}{P_t} \right]$$

- Expected returns increase one-for-one with the dividend-yield.
- This is not offset by dividend growth or price appreciation.
- Instead, estimates show prices move the wrong way, further increasing expected returns.

![Dividend Forecast](dividend_forecast.png)


#### Long-Horizon Predictability

**Empirical Observations:**
- Predictability by DP is evident in **long horizons** due to DP’s persistence.
- Regression:  
  $$\text{DP}_{t+1} = a + b \cdot \text{DP}_t + \epsilon_{t+1}$$

A forecasting signal with a lot of serial correlation (a highly autoregressive signal) tends to do better at longer horizons, because its effect on expected returns accumulates over many periods.
  - But be careful: the forecast error also grows with the horizon.
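
A short sketch of why persistence matters at long horizons. It assumes log returns (so multi-period returns add up), a one-period forecasting regression with slope `beta`, and an AR(1) signal with coefficient `b`; the function name and the numbers are hypothetical. Under these assumptions $\mathbb{E}_t[x_{t+j}]$ moves with $b^j x_t$, so the slope of the cumulative $k$-period return on $x_t$ is $\beta (1 + b + \dots + b^{k-1})$.

```python
def long_horizon_slope(beta, b, k):
    """Slope of the k-period cumulative (log) return on x_t.

    Assumes r_{t+1} = alpha + beta * x_t + eps_{t+1} and an AR(1) signal
    x_{t+1} = a + b * x_t + u_{t+1}.
    """
    return beta * sum(b ** j for j in range(k))

# Illustrative values only: a persistent signal versus a short-lived one.
for k in (1, 12, 60):
    persistent = long_horizon_slope(beta=0.1, b=0.95, k=k)
    short_lived = long_horizon_slope(beta=0.1, b=0.20, k=k)
    print(f"k={k:2d}  persistent={persistent:.3f}  short-lived={short_lived:.3f}")
```

With a persistent signal the implied slope keeps growing with the horizon, while with a short-lived signal it levels off after a few periods.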

**Statistical Concerns:**
- A highly autoregressive signal makes it hard to get precise estimates of the forecasting regression.


---

## Statistical Concerns in Forecasting

### Challenges:
1. High persistence in variables like DP introduces **bias** in regression.
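2. Forecast errors accumulate over longer horizons, and overlapping long-horizon observations make the residuals serially correlated, which inflates naive $t$-statistics.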

### Research Focus:
- Improving models to address bias.
- Validating the predictive power of DP and related variables.

--- 


## Industry Approach to Forecasting

- In reality, absolute excess returns are a _highly complex and potentially nonlinear combination of random variables_  
  - News events, human behavior, movements in related markets, etc.

- We acknowledge we cannot possibly measure (let alone understand, or predict) all of the forces driving returns.

- Therefore, we try to ascertain the things that drive the variation in returns **the most**  
  - This is not groundbreaking – this is simply feature selection in any model.

- This is often a two-step process:
  1. **Decompose returns**  
  2. **Forecast the components**

- **How does decomposition benefit forecasts?**  
  - Returns have lots of noise that will make direct return forecasting attempts difficult.  
  - We can do a good job of decomposition.


### Return Decomposition

- Reduces the dimensionality of our problem (going from many things that impact returns to just a few).

- Allows us to express returns in terms of things we understand (e.g., market beta).

- Allows us to express returns in forms we understand (e.g., linear).

- Understanding return components is highly beneficial for both risk management and forecasting.

- "Factors" are the underlying components of the decompositions.

- **How do we decompose returns?**

#### **1. Statistical Decomposition**

- Using data, how can I express returns as some basic (typically linear) function of other things?
  - The more variance explained, the better.
  - The more interpretable, the better. (ideally they have economic significance)
  - The more predictable, the better.

- **Examples:**
  - Linear factor decomposition (LFD).
  - Machine learning, e.g. **PCA**: take a group of stocks, run PCA to extract a few risk factors, and drop the components that do not matter. Then run a principal component regression of one stock on the retained factors to get a sense of the risks it is exposed to.

- The underlying components (factors) may or may not have economic significance.

$$\tilde{r}_t = \alpha + \beta' \cdot x_t + \epsilon_t$$
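
The PCA example can be sketched as follows, using NumPy's SVD on a simulated panel of returns. The function names `pca_factors` and `factor_regression`, the two-factor structure, and all parameter values are assumptions for illustration.

```python
import numpy as np

def pca_factors(returns, n_factors):
    """First n_factors principal-component factor series, shape (T, n_factors)."""
    R = np.asarray(returns, dtype=float)
    demeaned = R - R.mean(axis=0)
    # SVD of the demeaned panel; rows of Vt are the component loadings.
    _, _, Vt = np.linalg.svd(demeaned, full_matrices=False)
    return demeaned @ Vt[:n_factors].T

def factor_regression(stock_returns, factors):
    """OLS of one stock on the factors; returns (alpha, betas, r_squared)."""
    y = np.asarray(stock_returns, dtype=float)
    X = np.column_stack([np.ones(len(y)), factors])
    coef, *_ = np.linalg.lstsq(X, y, rcond=None)
    resid = y - X @ coef
    r_squared = 1.0 - resid.var() / y.var()
    return coef[0], coef[1:], r_squared

# Simulated panel: 20 stocks driven by 2 hidden factors plus noise (illustrative).
rng = np.random.default_rng(3)
T, N = 500, 20
hidden = 0.04 * rng.standard_normal((T, 2))
loadings = rng.uniform(0.5, 1.5, size=(2, N))
panel = hidden @ loadings + 0.01 * rng.standard_normal((T, N))

factors = pca_factors(panel, n_factors=2)
alpha, betas, r2 = factor_regression(panel[:, 0], factors)
print("betas on the two components:", betas, " R^2:", round(r2, 3))
```

The extracted components are uncorrelated by construction and explain much of the variance here, but nothing guarantees they have an economic interpretation, which is the trade-off noted above.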



#### **2. Direct Decomposition**

- Directly expressing the return as a function of underlying quantities (using an identity for the return formula) is also beneficial:
  - This is the approach taken by GMO: decompose returns into components that they understand and believe they can forecast (dividend yield + multiple expansion + change in profit margin + growth in sales per share).

- Directly understanding the big picture is hard, but maybe we can understand the components, and how the components compose the whole.
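
A minimal sketch of a direct decomposition along these lines. It relies on the per-share accounting identity price = (P/E) × (profit margin) × (sales per share), so price growth is the product of the three growth factors. The function name, the input values, and the convention that `dividend_yield` means $D_{t+1}/P_t$ are assumptions for illustration; this is not a description of any firm's actual methodology.

```python
def decompose_return(dividend_yield, pe_growth, margin_growth, sales_growth):
    """Total return from the identity P = (P/E) * (E/S) * S, per share.

    Growth inputs are simple one-period rates (0.05 means +5%).
    dividend_yield is taken to be D_{t+1} / P_t (an assumption of this sketch).
    """
    price_growth = (1 + pe_growth) * (1 + margin_growth) * (1 + sales_growth) - 1
    total_return = dividend_yield + price_growth
    # First-order approximation: add the components and ignore cross terms.
    approx_return = dividend_yield + pe_growth + margin_growth + sales_growth
    return total_return, approx_return

# Hypothetical component forecasts.
exact, approx = decompose_return(
    dividend_yield=0.02, pe_growth=-0.03, margin_growth=0.01, sales_growth=0.05
)
print(f"exact = {exact:.6f}, additive approximation = {approx:.6f}")
```

The additive form is the one quoted in the text; the gap between the two numbers is the cross-term error, which stays small when the component growth rates are small.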

- **Taylor expansion**
  - Greeks:

![Direct Decomposition](/images/direct_decomposition.png)

