Eugene Fama, Kenneth French's long-time co-author, shared the 2013 Nobel Memorial Prize in Economic Sciences, in part for his research on the Efficient Market Hypothesis. A consequence of the theory is that "it is impossible to 'beat the market' consistently on a risk-adjusted basis since market prices should only react to new information."

There is an ongoing debate between traditional financial economics, which uses risk theories to explain asset pricing, and the newer field of behavioral finance, which uses human behavior to provide the explanations. Are premiums risk-based or behavioral?

# Workflow

Stock selection is driven by a systematic multi-factor approach that focuses on value, quality, price momentum, operational momentum, and trend risk premia. Long and short positions will be selected from the top 10% and bottom 10%, respectively, of stocks generated by this model.


Building a rigorous workflow will make your strategies more robust and less prone to overfitting.

1. **Universe Selection:** define the universe of tradeable components.
* The universe should be broad but have some degree of self-similarity to enable extraction of relative value. Many papers eliminate companies in the *utilities* and *finance* sectors, as they follow different accounting rules. Financials also have a very different capital structure, so some value and quality metrics are not comparable across sectors; for example, 'quality' for financial companies tends to be measured differently because of their larger balance sheets.
* It should also eliminate hard-to-trade or prohibited instruments. Many papers eliminate companies below a certain market capitalization, as they are less liquid.

2. **Stock Selection** based on a certain set of criteria. Many paradigms have emerged in academia and in professional practice; the most relevant are:

|Investing Paradigm|Main Idea|Metrics|Seminal Works|
|:------------------:|:--- |:------------------:|:----:|
|Value investing|Buying companies with low multiples relative to fundamental metrics i.e. what is cheap| Valuation Models<br /> Price-to-Earnings<br />Price-to-Sales<br />Price-to-Book<br />Debt-to-Equity<br /> Price-to-Free Cash Flow<br />EV/EBITDA|Piotroski F-Score (Joseph Piotroski, in *Value Investing: The Use of Historical Financial Statement Information to Separate Winners from Losers*)<br />Enterprising vs Defensive Investor Criteria (Benjamin Graham, in *The Intelligent Investor*)<br />Magic Formula Investing (Joel Greenblatt, in *The Little Book that Beats the Market*)|
|Growth investing|Buying companies with high prices relative to fundamental metrics i.e. what is expensive|Net Profit Margin<br />Sales Growth<br />Earnings Growth<br />Free Cash Flow Growth|x|
|Momentum investing|x|Relative Returns<br />Historical Alpha|x|
|Income investing|x|Dividend Yield<br />Payout Ratio<br />Dividend Growth<br />|x|
|Quality investing|x|Operating Margin<br />Return on Equity<br />Return on Invested Capital<br />|x|
|Factor investing|x|Macroeconomic Factors<br />Statistical Factors<br />Style Factors|x|

Value vs Growth investing: There is no debate about whether value stocks have outperformed growth stocks, on average. Historically, cheaper stocks have earned higher returns than expensive stocks.
**Talk about higher potential for growth for value, but recent value trap**: https://papers.ssrn.com/sol3/papers.cfm?abstract_id=2494412

After computing the metrics for each stock in your universe, you rank them:
* Sequential ordering: sort by one metric, then by another, and so on. A close relative is Joel Greenblatt's *Magic Formula Investing*, which ranks all stocks by earnings yield and by return on capital, then adds the two ranks.
* Aggregation: create a composite score.
  * Piotroski style: binary conditions. Set hard criteria (e.g. ROA this year greater than last year, P/E in the lowest 10% of the industry) and add 1 point for each criterion a company meets. The sum of the points is the composite score for that company.
  * Lo/Patel style: standardize each factor using its mean and standard deviation. Replace values greater than 10 or less than -10 with 10 and -10, respectively, to limit the effect of outliers. Sum the standardized values for each equity and divide by the number of factors to get a score between -10 and 10.
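The two aggregation styles above can be sketched as follows. This is a minimal illustration, not a reference implementation: the function names `binary_score` and `zscore_composite`, the toy data, and the assumption that a higher value is better for every factor are all hypothetical choices made for the example.

```python
import numpy as np

def binary_score(criteria):
    """Piotroski style: one point per satisfied criterion.

    criteria: boolean array of shape (n_stocks, n_criteria).
    """
    return np.asarray(criteria, dtype=bool).sum(axis=1)

def zscore_composite(factors, clip=10.0):
    """Standardize each factor column, clip outliers, average across factors.

    factors: float array of shape (n_stocks, n_factors); assumes higher is better.
    """
    factors = np.asarray(factors, dtype=float)
    z = (factors - factors.mean(axis=0)) / factors.std(axis=0)
    z = np.clip(z, -clip, clip)          # limit the effect of outliers
    return z.mean(axis=1)                # sum divided by the number of factors

# Hypothetical data: 4 stocks, 3 pass/fail criteria.
criteria = [[True, True, False],
            [True, False, False],
            [True, True, True],
            [False, False, False]]
points = binary_score(criteria)
print(points)

# Hypothetical data: 4 stocks, 2 factors.
factors = [[0.08, 0.15],
           [0.05, 0.10],
           [0.12, 0.30],
           [0.02, 0.05]]
composite = zscore_composite(factors)
ranking = np.argsort(-composite)         # index of the best stock first
print(ranking)
```

If some factors are "lower is better" (e.g. P/E), one option is to flip their sign before standardizing so that all columns point the same way.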

4. **Money Management**

* *Number of securities to invest in*:
  * Too few or too many securities: the choice is a tradeoff between trading cost and the diversification achieved.
  * Middle ground --> just enough stocks to reduce portfolio risk up to the point of diminishing marginal returns, i.e. 10-12 according to Ray Dalio.

* *Types of positions:*
  * Long-only:
  * Long-short:
    * 1X0/X0: 
    * Market neutral: Pairs trading

The top stocks would be held long, the bottom stocks short.[3]

* *Position sizing:* Weight allocation schemes
  * Even allocation (i.e. 1/# of securities to trade)
  * Value (capitalization) weighted (i.e. market cap of security / total market cap of securities)
  * Optimized for risk-adjusted returns (e.g. Sharpe, Sortino etc.)
  * For long and short: 130-30 (up to 150-50)

NB: Ideally set a hard limit on each position, as low position concentration is desirable (e.g. < 5% of capital invested in each stock).
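As a rough sketch of the allocation schemes above, the snippet below computes equal and capitalization weights and then enforces a per-position limit. The helper names and the pro-rata redistribution of the excess are assumptions made for illustration; other ways of enforcing a cap (e.g. inside an optimizer) are equally valid. It assumes a long-only portfolio with strictly positive weights.

```python
import numpy as np

def equal_weights(n):
    """Even allocation: 1 / number of securities."""
    return np.full(n, 1.0 / n)

def cap_weights(market_caps):
    """Capitalization weighting: market cap / total market cap."""
    caps = np.asarray(market_caps, dtype=float)
    return caps / caps.sum()

def limit_concentration(weights, max_weight):
    """Cap each weight at max_weight and redistribute the excess pro rata."""
    w = np.asarray(weights, dtype=float).copy()
    if max_weight * len(w) < 1.0 - 1e-12:
        raise ValueError("max_weight is too small to stay fully invested")
    capped = np.zeros(len(w), dtype=bool)
    while True:
        over = (w > max_weight + 1e-12) & ~capped
        if not over.any():
            return w
        capped |= over
        w[capped] = max_weight
        free = ~capped
        remaining = 1.0 - max_weight * capped.sum()
        w[free] = w[free] / w[free].sum() * remaining

# Hypothetical market caps for 5 stocks; a 30% cap is used only because the
# example is tiny (a 5% limit needs at least 20 positions to stay fully invested).
raw = cap_weights([500, 300, 100, 50, 50])
limited = limit_concentration(raw, max_weight=0.30)
print(np.round(raw, 3), np.round(limited, 3))
```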

* *Portfolio Turnover:* this is driven by the rebalancing frequency.
  * Too frequent (e.g. twice a day to once a month) --> higher trading costs (fees and taxes). Trading liquid stocks (see 1. Universe Selection) mitigates this effect.
  * Less frequent (e.g. once a year) --> less data to test

* *Timing:* take care of earnings season, seasonality factors (e.g. 'Sell in May'), half-days...

5. **Execution:** implement a trading process to transition the current portfolio (if any) to the target portfolio.

Then, we analyze the backtest, specifically its performance and risk exposure. Some guidelines:
* Beta: between -0.3 and 0.3, ideally 0
* Volatility: 
* Sharpe ratio: greater than 1.0
* Turnover: between twice a day and once a month (if longer than that, we might not have enough data)
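The beta, volatility and Sharpe guidelines above can be checked with a few lines of NumPy. This is a minimal sketch on synthetic daily returns; the function names, the 252-day annualization and the random data are assumptions for illustration, not part of any particular backtesting library.

```python
import numpy as np

def beta(strategy_returns, market_returns):
    """Slope of strategy returns on market returns (covariance / variance)."""
    s = np.asarray(strategy_returns, dtype=float)
    m = np.asarray(market_returns, dtype=float)
    return np.cov(s, m, ddof=1)[0, 1] / np.var(m, ddof=1)

def annualized_volatility(returns, periods_per_year=252):
    """Sample standard deviation of returns, scaled to one year."""
    return np.std(returns, ddof=1) * np.sqrt(periods_per_year)

def sharpe_ratio(returns, risk_free_rate=0.0, periods_per_year=252):
    """Annualized Sharpe ratio; risk_free_rate is an annual rate."""
    excess = np.asarray(returns, dtype=float) - risk_free_rate / periods_per_year
    return excess.mean() / excess.std(ddof=1) * np.sqrt(periods_per_year)

# Synthetic daily returns, for illustration only.
rng = np.random.default_rng(0)
market = rng.normal(0.0004, 0.01, size=1000)
strategy = 0.2 * market + rng.normal(0.0005, 0.004, size=1000)

b = beta(strategy, market)
verdict = "within guideline" if abs(b) <= 0.3 else "too much market exposure"
print(f"beta={b:.2f} ({verdict})")
print(f"volatility={annualized_volatility(strategy):.2%}")
print(f"sharpe={sharpe_ratio(strategy):.2f}")
```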

# Value Investing

As of 2020, the value factor has struggled for the past few years.

It's a constant process of trial and error. What worked once eventually loses its edge (or premium) as it gets arbitraged away, e.g. the Fama-French size and value factors.



Growth investors share a different view of the market compared to value investors.

# Factor Investing

What drives stock returns? 

A factor can be thought of as any characteristic relating a group of securities that is important in explaining their return and risk. A large body of academic research highlights that long term equity portfolio performance can be explained by factors. 

Certain factors have historically earned a long-term risk premium and represent exposure to systematic sources of risk. Factor investing is the investment process that aims to harvest these risk premia through exposure to factors.






## Standing on the Shoulders of Giants

Evaluating risk is not only about evaluating the amount of potential loss. It allows us to set reasonable expectations for returns and make well-informed decisions about potential investments. Quantifying the sources of risk associated with a portfolio can reveal to what extent the portfolio is actually accomplishing a stated investment goal. If an investment strategy is described as targeting market and sector neutrality, for example, the underlying portfolio should not be achieving significant portions of its returns from a persistent long exposure to the technology sector. While this strategy may show profit over a given timeframe, understanding that those profits are earned on the basis of unintended bets on a single sector may lead the investor to make a different decision about whether and how much capital to allocate. Quantifying risk exposures allows investors and managers to create risk management strategies and refine their portfolio.

Developing a risk model allows for a clear distinction between common risk and specific risk. 

**Common risk** is defined here as risk attributable to common factors which drive returns within the equity stock market. These factors can be composed of either fundamental or statistical information about the underlying investment assets that make up the market. 
  * Fundamental factors are often observable fundamental ratios reported by companies that issue stock, such as the ratio of book value to share price, or earnings per share. These factors are typically derived from financial and macroeconomic sources of data. 
  * Statistical factors use mathematical models to explain the correlations between asset returns time-series without consideration of company-specific fundamental data (Axioma, Inc. 2011).

Some commonly cited risk factors are: the influence of an overall market index, as in the Capital Asset Pricing Model (CAPM) (Sharpe 1964); risk attributable to investing within individual sectors, which give an idea of the space a company works within, as in the BARRA risk model (BARRA, Inc. 1998); and style factors, which mimic investment styles such as investing in “small cap” companies or “high growth” companies, as in the Fama-French 3-factor model.

**Specific risk** is defined here as risk that is unexplainable by the common risk factors included in a risk model. Typically, this is represented as a residual component left over after accounting for common risk (Axioma, Inc. 2011). When we consider risk management in the context of quantitative trading, our understanding of risk is used in large part to clarify our definition of "alpha". This residual after accounting for the common factor risk of a portfolio can be thought of as a proxy for or estimate of the alpha of the portfolio.

### Capital Asset Pricing Model

Your asset or portfolio is exposed to the overall market, which inherently involves a risk, the **systematic risk**. The more it is exposed to it, the more it depends on its fluctuations, so the riskier it is, and the more you should be compensated for taking that additional risk. 

The Capital Asset Pricing Model (CAPM in short) attempts to explain the expected return of a security as a function of one risk factor, the market risk premium. It is the influence of the market on the security. It is computed as the excess market returns, in other words the additional return an investor expects from holding a risky market portfolio instead of risk-free assets.

Investors expect to be compensated for risk and the time value of money. 
* The risk-free rate in the CAPM formula accounts for the time value of money. 
* The other components of the CAPM formula account for the investor taking on additional risk.


$$ R_i - R_f = \alpha^J + \beta_i * (R_m - R_f)_t + \epsilon_t $$

where 
* $R_i$ is the expected return of the security
* $R_f$ is the risk-free rate of return, or that of a hypothetical investment with no risk of financial loss (e.g. the monthly Treasury Bill (t-bill) rate)
* $R_m$ is the expected market return
* $R_m - R_f$ is the market risk premium (excess market returns)
* $R_i - R_f$ is the monthly return to the asset of concern in excess of the monthly t-bill rate.
* $\beta_i$ is the Beta of the investment, the measure of systematic risk. It represents the influence of the market on the excess return of the investment, i.e. the volatility of the investment as compared to the overall market. 

Once the market risk premium and risk free rate are defined, the $\beta$ coefficient can be determined by linear regression. The intercept in this model is referred to as the "Jensen's alpha".
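As an illustration of that regression, the sketch below estimates $\beta$ and Jensen's alpha with an ordinary least-squares line fit. The function name `capm_regression`, the constant risk-free rate and the synthetic monthly data (generated with a known alpha of 0.001 and beta of 1.3) are assumptions made for the example.

```python
import numpy as np

def capm_regression(asset_returns, market_returns, risk_free):
    """Return (alpha, beta) from regressing asset excess returns on market excess returns."""
    y = np.asarray(asset_returns, dtype=float) - risk_free    # R_i - R_f
    x = np.asarray(market_returns, dtype=float) - risk_free   # R_m - R_f
    slope, intercept = np.polyfit(x, y, deg=1)                # highest degree first
    return intercept, slope

# Synthetic monthly data with a known alpha and beta, for illustration only.
rng = np.random.default_rng(42)
rf = 0.002
market = rf + rng.normal(0.006, 0.04, size=240)
asset = rf + 0.001 + 1.3 * (market - rf) + rng.normal(0.0, 0.02, size=240)

alpha_hat, beta_hat = capm_regression(asset, market, rf)
print(f"Jensen's alpha={alpha_hat:.4f}, beta={beta_hat:.2f}")
```

With 20 years of monthly data the beta estimate is reasonably tight, while the alpha estimate remains noisy relative to its size, which is typical of this kind of regression.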

Intuitively, the more a security is exposed to systematic risk, i.e. the overall market (captured by the Beta), the more return one should expect from that security. Another way to think about it is that, since the risk-free rate can be obtained with no risk, any other investment having some risk will have to have a higher rate of return in order to induce any investors to hold it.

Your return should therefore be proportional to your exposure to that risk factor, which is captured by the Beta.


The CAPM is a simple model and is the one most commonly used in the finance industry. It is used in the calculation of the Weighted Average Cost of Capital and the cost of equity.

But this model rests on a few questionable assumptions, such as 'the riskier the investment, the higher the return', which does not necessarily hold in all scenarios, and that historical data accurately predicts the future performance of the asset.

Furthermore, it uses only one variable, the returns of the market as a whole, to describe the returns of a portfolio or stock. What if many factors, and not just one, determine the rate of return? Patterns in average returns that are not explained by the CAPM are called anomalies.

### Arbitrage Pricing Theory

The arbitrage pricing theory (APT) was developed by the economist Stephen Ross in 1976 as an alternative to the capital asset pricing model (CAPM). Unlike the CAPM, which assumes markets are perfectly efficient, APT assumes markets sometimes misprice securities before the market eventually corrects and securities move back to fair value. Using APT, arbitrageurs hope to take advantage of any deviations from fair market value. However, this is not a risk-free operation in the classic sense of arbitrage, because investors are assuming that the model is correct and making directional trades rather than locking in risk-free profits.

It allows us to measure the influence of more than one factor when considering the forces that drive returns. APT is therefore a multi-factor asset pricing model based on the idea that an asset's returns can be predicted using the linear relationship between the asset’s expected return and a number of macroeconomic variables that capture systematic risk. It is a useful tool for analyzing portfolios from a value investing perspective, in order to identify securities that may be temporarily mispriced.

APT expresses the returns of individual assets using a multiple linear regression, a linear factor model, like so:

$$ R_i = {\alpha}_i + {\beta}_{i,0}F_0 +  {\beta}_{i,1}F_1 + ... + {\beta}_{i,m}F_m + {\epsilon}_i$$
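A linear factor model of this form can be fitted by least squares. The sketch below is one way to do it with NumPy; the function name `factor_regression`, the three unnamed factors and the synthetic data are hypothetical and chosen only to show the mechanics.

```python
import numpy as np

def factor_regression(asset_returns, factor_returns):
    """Fit R_i = alpha_i + sum_k beta_{i,k} F_k + eps_i by ordinary least squares.

    asset_returns: shape (T,); factor_returns: shape (T, m).
    Returns (alpha, betas, residuals).
    """
    y = np.asarray(asset_returns, dtype=float)
    factors = np.asarray(factor_returns, dtype=float)
    X = np.column_stack([np.ones(len(y)), factors])   # prepend the intercept column
    coef, *_ = np.linalg.lstsq(X, y, rcond=None)
    residuals = y - X @ coef
    return coef[0], coef[1:], residuals

# Synthetic data: 120 periods, 3 factors with known loadings (illustration only).
rng = np.random.default_rng(1)
F = rng.normal(0.0, 0.02, size=(120, 3))
true_betas = np.array([0.9, -0.4, 0.25])
R = 0.002 + F @ true_betas + rng.normal(0.0, 0.01, size=120)

alpha_i, betas_i, eps_i = factor_regression(R, F)
print(np.round(betas_i, 2))
```

The residuals `eps_i` are the part of the return that the chosen factors do not explain, which is the "specific risk" component described earlier.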

### Barra Risk Factor Analysis

Another category of risk factors are those attributable to investing within individual sectors, which give an idea of the space a company works within. 

The Barra Risk Factor Analysis is a multi-factor model, created by Barra Inc., used to measure the overall risk associated with a security relative to the market. Barra Risk Factor Analysis incorporates over 40 data metrics, including earnings growth, share turnover and senior debt rating. The model then measures risk factors associated with three main components: industry risk, the risk from exposure to different investment themes and company-specific risk.

An element that investors and portfolio managers scrutinize when evaluating the markets or portfolios is investment risk. Identifying and measuring investment risk is one of the most important steps taken when deciding what assets to invest in. This is because the level of risk taken determines the level of return that an asset or portfolio of assets will have at the end of a trading cycle. Consequently, one of the most widely accepted financial principles is the tradeoff between risk and return.

One method that a portfolio manager might use to measure investment risk is evaluating the impact of a series of broad factors on the performance of various assets or securities. Using a factor model, the return-generating process for a security is driven by the presence of the various common fundamental factors and the asset's unique sensitivities to each factor. Since a few important factors can explain the risk and return expected on investment to a large degree, factor models can be used to evaluate how much of a portfolio's return is attributable to each common factor exposure. Factor models can be broken down into single-factor and multiple-factor models. One multi-factor model that can be used to measure portfolio risk is the Barra Risk Factor Analysis model.

The Barra Risk Factor Analysis was pioneered by Barr Rosenberg, founder of Barra Inc., and is discussed at length in Grinold and Kahn (2000), Connor et al. (2010) and Cariño et al. (2010). It incorporates a number of factors in its model that can be used to predict and control risk. The multi-factor risk model uses a number of key fundamental factors that represent the features of an investment. Some of these factors include yield, earnings growth, volatility, liquidity, momentum, size, price-earnings ratio, leverage, and growth; these factors are used to describe the risk or returns of a portfolio or asset by moving from quantitative, but unspecified, factors to readily identifiable fundamental characteristics.

The Barra Risk Factor Analysis model measures a security's relative risk with a single value-at-risk (VaR) number. This number represents a percentile rank between 0 and 100, with 0 being the least volatile and 100 being the most volatile, relative to the U.S. market. For instance, a security with a value-at-risk number of 80 is calculated to have a greater level of price volatility than 80% of securities in the market and its specific sector. So, if Amazon is assigned a VaR of 80, it means that its stock is more price volatile than 80% of the stock market or the sector in which the company operates.





### Fama-French Three-Factor Model

A final category of risk factors are style factors, which mimic investment styles such as investing in “small cap” companies or “high growth” companies, as in the Fama-French 3-factor model.

In the early 1990s, Fama and French observed that two classes of stocks have tended to do better than the market as a whole: (i) small caps and (ii) stocks with a high book-to-market ratio (B/M; customarily called value stocks, in contrast to growth stocks). They thus identified three risk factors to describe stock returns.

|Factor|Idea|Symbol|Calculated|
|------|-|-|-|
|Market Risk Premium|additional return an investor expects from holding a risky market portfolio instead of risk-free assets|$R_m - R_f$|It is calculated as the monthly return of the CRSP value-weighted index less the risk free rate|
|Size Premium|historical tendency for the stocks of firms with smaller market capitalizations to outperform the stocks of firms with larger market capitalizations|$SMB$ (Small Minus Big, in terms of Market Cap)||
|Value Premium|outperformance of high book / market versus low book / market companies|$HML$ (High Minus Low, in terms of Book-to-Market)||

To compute those values, the stock universe considered is all NYSE, AMEX, and NASDAQ stocks for which market equity (ME) is available for December of year t-1 and June of year t, and book equity (BE) is available for year t-1.

* SMB is a zero-investment portfolio that is long on small capitalization (cap) stocks and short on big cap stocks. 
* HML is a zero-investment portfolio that is long on high book-to-market (B/M) stocks and short on low B/M stocks. Portfolios are formed on B/M at the end of each June using NYSE breakpoints. The BE used in June of year t is the book equity for the last fiscal year end in t-1. ME is price times shares outstanding at the end of December of t-1.
  * Portfolio groupings: BE < 0; bottom 30%, middle 40%, top 30%; quintiles; deciles.
  * Firms with negative book equity are in only the BE < 0 portfolio.

The premiums are computed on a monthly basis in their methodology.
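To make the construction of SMB and HML more concrete, here is a simplified single-period sketch based on a 2x3 sort (two size groups by three B/M groups) with value-weighted buckets. It departs from the published methodology in ways that should be treated as assumptions: breakpoints come from all stocks in the sample rather than from NYSE stocks only, firms with negative book equity are assumed to have been removed beforehand, and the function name `smb_hml` and the toy data are hypothetical.

```python
import numpy as np

def smb_hml(market_equity, book_to_market, returns):
    """One-period SMB and HML from a 2x3 sort on size and book-to-market."""
    me = np.asarray(market_equity, dtype=float)
    bm = np.asarray(book_to_market, dtype=float)
    r = np.asarray(returns, dtype=float)

    small = me <= np.median(me)                      # size split at the median
    lo_cut, hi_cut = np.percentile(bm, [30, 70])     # 30% / 40% / 30% B/M split
    value = bm >= hi_cut                             # high B/M
    growth = bm <= lo_cut                            # low B/M
    neutral = ~value & ~growth

    def vw(mask):
        # Value-weighted return of one bucket (raises if the bucket is empty).
        return np.average(r[mask], weights=me[mask])

    sv, sn, sg = vw(small & value), vw(small & neutral), vw(small & growth)
    bv, bn, bg = vw(~small & value), vw(~small & neutral), vw(~small & growth)

    smb = (sv + sn + sg) / 3 - (bv + bn + bg) / 3    # small minus big
    hml = (sv + bv) / 2 - (sg + bg) / 2              # high minus low
    return smb, hml

# Toy cross-section of 12 stocks (6 small, 6 big), two per bucket.
me = [1, 2, 3, 4, 5, 6, 10, 20, 30, 40, 50, 60]
bm = [0.2, 0.3, 0.6, 0.7, 1.2, 1.4, 0.1, 0.25, 0.5, 0.8, 1.1, 1.5]
ret = [0.02, 0.02, 0.03, 0.03, 0.04, 0.04, 0.01, 0.01, 0.02, 0.02, 0.03, 0.03]

smb, hml = smb_hml(me, bm, ret)
print(round(smb, 4), round(hml, 4))
```

Repeating this calculation each month, with the sort refreshed each June, would give the monthly SMB and HML series used as regressors.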


They then added those two additional factors to CAPM to reflect a portfolio's exposure to these two classes:

$$ R_i - R_f = \alpha^{FF} + \beta_{mkt} * (R_m - R_f)_t + \beta_{SMB} * SMB_t + \beta_{HML} * HML_t + \epsilon_t$$ 

The market beta, $\beta_{mkt}$, is analogous to the classical CAPM $\beta$ but not equal to it, since there are now two additional factors to do some of the work.

Once SMB and HML are defined, the corresponding beta coefficients are determined by linear regressions and can take negative values as well as positive values. The intercept in this model, i.e. $\alpha^{FF}$, is referred to as the "three-factor alpha".

The more exposure our portfolio has to small caps and high book-to-market stocks, the higher those coefficients (factor loadings) will be, and so the higher the expected return.

They find that, except for the continuation of short-term returns, the anomalies largely disappear in a three-factor model. Their results are consistent with rational ICAPM or APT asset pricing, but they also consider irrational pricing and data problems as possible explanations.

However, the size and book/market ratio themselves are not in the model. For this reason, there is academic debate about the meaning of the last two factors.


### Carhart Four Factor Model

In 1997, Mark Carhart extended the Fama-French three-factor model to include a momentum factor, UMD (short for Up Minus Down; also called MOM, short for monthly momentum), from Jegadeesh and Titman's paper.

https://alphaarchitect.com/2016/10/14/how-to-measure-momentum/

French Fama: https://mba.tuck.dartmouth.edu/pages/faculty/ken.french/Data_Library/det_mom_factor_daily.html

Carhart: https://breakingdownfinance.com/finance-topics/equity-valuation/carhart-4-factor-model/#:~:text=Carhart%204%20factor%20model%20equation&text=where%20Mkt%20is%20the%20return,easily%20be%20estimated%20using%20OLS.

Momentum in a stock is described as the tendency for the stock price to continue rising if it is going up and to continue declining if it is going down.

The MOM factor can be calculated by subtracting the equal-weighted average return of the lowest-performing firms from the equal-weighted average return of the highest-performing firms, lagged one month (Carhart, 1997). A stock is showing momentum if its prior 12-month average return is positive. As in the three-factor model, the momentum factor is defined by a self-financing portfolio: long positive momentum, short negative momentum. Momentum strategies remain popular in financial markets; for example, financial analysts incorporate the 52-week price high/low in their Buy/Sell recommendations.

$$ R_i - R_f = \alpha^C + \beta_{mkt} * (R_m - R_f)_t + \beta_{SMB} * SMB_t + \beta_{HML} * HML_t + \beta_{UMD} * UMD_t + \epsilon_t$$ 

Here, UMD is the fourth risk factor, representing the monthly premium on winners minus losers. UMD is a zero-cost portfolio that is long the previous 12-month winners and short the previous 12-month losers.
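A winners-minus-losers return for a single month might be computed as in the sketch below. The 30% cutoff for winners and losers, the default 12-month lookback with the most recent month skipped, and the function name `umd_return` are assumptions for illustration; published momentum factors differ in their exact breakpoints and weighting.

```python
import numpy as np

def umd_return(monthly_returns, t, lookback=12, skip=1, quantile=0.3):
    """Equal-weighted winners-minus-losers return for month t.

    monthly_returns: array of shape (n_months, n_stocks) of simple returns.
    Formation window: rows t-lookback .. t-skip-1, so the most recent
    `skip` month(s) before t are left out (the one-month lag).
    """
    r = np.asarray(monthly_returns, dtype=float)
    if t < lookback:
        raise ValueError("not enough history before month t")
    window = r[t - lookback : t - skip]
    past = np.prod(1.0 + window, axis=0) - 1.0        # cumulative past return
    n = max(1, int(round(quantile * r.shape[1])))
    order = np.argsort(past)                          # ascending: losers first
    losers, winners = order[:n], order[-n:]
    return r[t, winners].mean() - r[t, losers].mean()

# Toy data: stock j earns j% in every formation month, 10 stocks, 13 months.
n_stocks = 10
formation = np.tile(0.01 * np.arange(n_stocks), (12, 1))
holding = 0.001 * np.arange(n_stocks)                 # returns in month t = 12
monthly = np.vstack([formation, holding])

umd = umd_return(monthly, t=12)
print(round(umd, 4))
```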

The intercept in this model is referred to as the "four-factor alpha".

### Fama-French Five-Factor Model

In 2015, Fama and French extended the model, adding a further two factors: profitability and investment. Defined analogously to the HML factor, the profitability factor (RMW) is the difference between the returns of firms with robust (high) and weak (low) operating profitability; and the investment factor (CMA) is the difference between the returns of firms that invest conservatively and firms that invest aggressively. In the US (1963-2013), adding these two factors makes the HML factor redundant since the time series of HML returns is completely explained by the other four factors (most notably CMA, which has a -0.7 correlation with HML).

Whilst the model still fails the Gibbons, Ross & Shanken (1989) test, which tests whether the factors fully explain the expected returns of various portfolios, the test suggests that the five-factor model improves the explanatory power of the returns of stocks relative to the three-factor model. The failure to fully explain all portfolios tested is driven by the particularly poor performance (i.e. large negative five-factor alpha) of portfolios made up of small firms that invest a lot despite low profitability (i.e. portfolios whose returns covary positively with SMB and negatively with RMW and CMA). If the model fully explains stock returns, the estimated alpha should be statistically indistinguishable from zero.
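Checking whether an estimated alpha is distinguishable from zero can be sketched with a plain OLS t-statistic, as below. Note that this is a per-portfolio test under classical OLS assumptions, not the joint Gibbons, Ross & Shanken test mentioned above; the function name `alpha_t_stat` and the synthetic data are hypothetical.

```python
import numpy as np

def alpha_t_stat(excess_returns, factor_returns):
    """Return (alpha, t-statistic of alpha) from an OLS factor regression."""
    y = np.asarray(excess_returns, dtype=float)
    X = np.column_stack([np.ones(len(y)), np.asarray(factor_returns, dtype=float)])
    coef, *_ = np.linalg.lstsq(X, y, rcond=None)
    resid = y - X @ coef
    dof = len(y) - X.shape[1]                     # observations minus parameters
    sigma2 = resid @ resid / dof                  # residual variance
    cov = sigma2 * np.linalg.inv(X.T @ X)         # classical OLS covariance
    return coef[0], coef[0] / np.sqrt(cov[0, 0])

# Synthetic monthly data: five factors, one portfolio with zero true alpha
# and one with a large true alpha (illustration only).
rng = np.random.default_rng(3)
factors = rng.normal(0.0, 0.03, size=(120, 5))
loadings = np.array([1.0, 0.3, 0.2, -0.1, 0.1])
noise = rng.normal(0.0, 0.01, size=120)
explained = factors @ loadings + noise
unexplained = 0.05 + factors @ loadings + noise

a0, t0 = alpha_t_stat(explained, factors)
a1, t1 = alpha_t_stat(unexplained, factors)
print(f"zero-alpha portfolio: t={t0:.2f}; large-alpha portfolio: t={t1:.2f}")
```

A common rule of thumb treats |t| below roughly 2 as statistically indistinguishable from zero at the 5% level.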



## Risk-Premia Factors

In this section, we will analyze and improve on the factors that were described in those seminal works.

### Value Factor

Value: composite of trailing cash-flow yield, earnings yield and country relative sales to price ratio



### Size Factor

Size: full market capitalization


### Momentum Factor

Momentum: residual Sharpe ratio




### Quality Factor

Quality: composite of profitability (return on assets), efficiency (change in asset turnover), earnings quality (accruals) & leverage

### Volatility Factor

Volatility: standard deviation of 5 years of weekly (Wednesday to Wednesday) local total returns


### Liquidity Factor

Liquidity: Amihud ratio – median ratio of absolute daily return to daily traded value over the previous year
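The Amihud ratio as defined above is straightforward to compute. The sketch below is a minimal version; the function name `amihud_ratio`, the toy numbers and the choice to skip days with no traded value are assumptions for illustration.

```python
import numpy as np

def amihud_ratio(daily_returns, daily_traded_value):
    """Median of |daily return| / daily traded value over the supplied window.

    A higher value means a larger price move per unit of value traded,
    i.e. a less liquid stock. Days with zero traded value are skipped.
    """
    r = np.asarray(daily_returns, dtype=float)
    v = np.asarray(daily_traded_value, dtype=float)
    valid = v > 0
    return np.median(np.abs(r[valid]) / v[valid])

# Toy example: three trading days (a real window would be about one year).
returns = [0.01, -0.02, 0.03]
traded_value = [1e6, 2e6, 1e6]      # value traded per day, in currency units
illiq = amihud_ratio(returns, traded_value)
print(illiq)
```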


### Dividend Yield Factor

## Combining Alphas



https://www.quantopian.com/posts/alphalens-a-new-tool-for-analyzing-alpha-factors#:~:text=Alphalens%20is%20a%20Python%20package,of%20information%20and%20future%20returns.

Alpha factors express a predictive relationship between some given set of information and future returns. By applying this relationship to multiple stocks we can hope to generate an alpha signal and trade off of it.

  a. **Single Alpha Factor Modeling:** define and evaluate individual expressions which rank the cross section of equities in your universe. The following information can tell you if the alpha factor you found is predictive; whether you have found an "edge." These statistics cover:
    * Returns Analysis
    * Information Coefficient Analysis
    * Turnover Analysis
    * Sector Analysis

  b. **Alpha Combination:** combine many single alphas into a final alpha which has stronger prediction power than the best single alpha. This is often due to the noise in each alpha being canceled out by noise in other alphas, allowing signal to come through.
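The two steps above can be illustrated with a small simulation: compute an Information Coefficient (here taken as the rank correlation between factor values and forward returns) for several noisy alphas, then for their combination. Everything in this sketch is an assumption made for illustration, including the function names, the equal-weight combination of rank-normalized alphas and the synthetic data; it does not reproduce Alphalens.

```python
import numpy as np

def ranks(x):
    """Ordinal ranks 0..n-1 (ties broken by position, a simplification)."""
    return np.argsort(np.argsort(x))

def information_coefficient(factor_values, forward_returns):
    """Spearman-style rank correlation between a factor and forward returns."""
    return np.corrcoef(ranks(factor_values), ranks(forward_returns))[0, 1]

def combine_alphas(alphas):
    """Equal-weight average of rank-normalized alphas (rows = alphas, cols = stocks)."""
    a = np.asarray(alphas, dtype=float)
    r = np.array([ranks(row) for row in a], dtype=float)
    z = (r - r.mean(axis=1, keepdims=True)) / r.std(axis=1, keepdims=True)
    return z.mean(axis=0)

# Synthetic cross-section: one hidden signal, five noisy views of it.
rng = np.random.default_rng(7)
n_stocks = 2000
signal = rng.normal(size=n_stocks)
forward_returns = signal + rng.normal(size=n_stocks)
alphas = signal + rng.normal(scale=2.0, size=(5, n_stocks))

single_ics = [information_coefficient(a, forward_returns) for a in alphas]
combined_ic = information_coefficient(combine_alphas(alphas), forward_returns)
print(np.round(single_ics, 2), round(combined_ic, 2))
```

Because the noise terms are independent across the five alphas, averaging them cancels part of the noise, so the combined alpha tends to show a higher IC than any single one.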

Beta is a measure of the returns (of the asset or portfolio of assets) that are attributable to the market. The farther it is from zero, the more sensitive the asset is to the returns of the market (> 0 moves with the market, < 0 moves against it).
Alpha is a measure of the returns that are NOT attributable to the market. It's your edge.

When we combine criteria as in the paradigms above, we do not know how much each criterion (or signal), taken on its own, contributes to the alpha. In particular:
* Just because a backtest is profitable does not mean the signal you are trading on has any alpha; you might have just gotten lucky.
* Conversely, an unprofitable backtest does not mean your signal has no alpha; the strategy built around it may have been bad.

Other parts of the strategy, e.g. weight allocation or rebalancing, may have helped make it good or bad, so it is better to isolate the alpha factors and test them separately. After that, we look for factors that complement each other, i.e. one that underperformed while another outperformed during a certain period, and vice versa.

To determine whether something is a factor, you should be able to rank stocks based on it, allocate each rank group to a different portfolio, and observe monotonically decreasing returns from the top group to the bottom group.
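This monotonicity check can be sketched as follows: sort stocks by factor value, split them into equally sized groups, and compare the mean forward return of each group. The function names, the choice of five groups and the toy data are assumptions for illustration.

```python
import numpy as np

def quantile_mean_returns(factor_values, forward_returns, n_quantiles=5):
    """Mean forward return per factor-ranked group, best-ranked group first."""
    f = np.asarray(factor_values, dtype=float)
    r = np.asarray(forward_returns, dtype=float)
    order = np.argsort(-f)                        # highest factor value first
    buckets = np.array_split(order, n_quantiles)  # near-equal group sizes
    return np.array([r[b].mean() for b in buckets])

def is_monotonically_decreasing(values):
    """True if every group earns strictly less than the group ranked above it."""
    return bool(np.all(np.diff(values) < 0))

# Toy data: 10 stocks whose forward return rises with the factor value.
factor = np.arange(10)
fwd = 0.01 * factor
group_returns = quantile_mean_returns(factor, fwd)
print(np.round(group_returns, 3), is_monotonically_decreasing(group_returns))
```

In practice the group returns are averaged over many rebalancing dates before checking the ordering, since a single period is very noisy.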