In [30]:
import numpy as np
import pandas as pd

In [31]:
asset = pd.read_csv("./data/HistoricalData_1641411397677.csv", index_col="Date")
asset.info()

<class 'pandas.core.frame.DataFrame'>
Index: 21 entries, 01/04/2022 to 12/06/2021
Data columns (total 5 columns):
 #   Column      Non-Null Count  Dtype 
---  ------      --------------  ----- 
 0   Close/Last  21 non-null     object
 1   Volume      21 non-null     int64 
 2   Open        21 non-null     object
 3   High        21 non-null     object
 4   Low         21 non-null     object
dtypes: int64(1), object(4)
memory usage: 1008.0+ bytes


In [32]:
asset.head()

Date,Close/Last,Volume,Open,High,Low
01/04/2022,$179.7,99310440,$182.63,$182.94,$179.12
01/03/2022,$182.01,104701200,$177.83,$182.88,$177.71
12/31/2021,$177.57,64062260,$178.085,$179.23,$177.26
12/30/2021,$178.2,59773010,$179.47,$180.57,$178.09
12/29/2021,$179.38,62348930,$179.33,$180.63,$178.14


We will consider just one asset to be the basket of goods and all other assets to be individual stocks.

In [33]:
# What is the Sharpe Ratio for a given portfolio?
# What does the covariance matrix for all assets in our universe look like?
# What is the correlation of all assets?
# We will use Close/Last as the ASSET attribute

In [60]:
# how do you update the covariance matrix daily?
# how can you cluster assets on attributes?
# EG: P/E ratio, Volume, Price, what other attributes can we use?

In [78]:
# Get only one Attribute (Price) and INDEX by a date
# Indexing by date allows us to use a date that is in minutes, seconds, days, months, etc...
filt = ["Close/Last"]

s1 = pd.read_csv("./data/HistoricalData_1641411397677.csv", index_col="Date", 
                 usecols=["Date", "Close/Last"]).rename(columns={"Close/Last":"s1"})
s2 = pd.read_csv("./data/HistoricalData_1641411477736.csv", index_col="Date", 
                 usecols=["Date", "Close/Last"]).rename(columns={"Close/Last":"s2"})
s3 = pd.read_csv("./data/HistoricalData_1641411479430.csv", index_col="Date", 
                 usecols=["Date", "Close/Last"]).rename(columns={"Close/Last":"s3"})
s4 = pd.read_csv("./data/HistoricalData_1641411480873.csv", index_col="Date", 
                 usecols=["Date", "Close/Last"]).rename(columns={"Close/Last":"s4"})
sp = pd.read_csv("./data/HistoricalData_1641411482228.csv", index_col="Date", 
                 usecols=["Date", "Close/Last"]).rename(columns={"Close/Last":"sp"})

# Create a Data Frame for each date the price of each stock
stocks = pd.concat([s1, s2, s3, s4, sp], axis=1)

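# Strip the leading "$" and convert strings to floats
stocks = stocks.applymap(lambda x: float(x[1:]))

# Convert prices to gross returns: pct_change() gives (cur - prev) / prev,
# and adding 1 turns that into cur / prev, so every value is positive.
# NOTE: the rows are in reverse chronological order (newest first), so "prev"
# is the row above, i.e. the following trading day.
stocks = stocks.pct_change() + 1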

stocks.dropna(inplace=True)

stocks.head()

Date,s1,s2,s3,s4,sp
01/03/2022,1.012855,1.021359,1.017446,1.031184,0.994552
12/31/2021,0.975606,1.002485,1.00469,1.003325,0.982063
12/30/2021,1.003548,0.993759,1.00892,1.003945,0.999234
12/29/2021,1.006622,1.001204,1.007751,1.005344,1.01899
12/28/2021,0.999498,0.99304,0.997953,0.993277,0.992589


In [107]:
# Covariance takes two LISTS of values (the daily values of two stocks) and AGGREGATES them into a single number

# For each date, subtract each column's mean from its current value, multiply the two deviations,
# then sum over all dates and divide by (n - 1)
s1_mean = stocks["s1"].mean()
s1_sigma = stocks["s1"].std()

s2_mean = stocks["s2"].mean()
s2_sigma = stocks["s2"].std()

cov_s1_s2 = 0
s1_var = 0

for date, row in stocks.iterrows():
    s1_var += (row['s1'] - s1_mean)**2
    cov_s1_s2 += (row['s1'] - s1_mean) * (row['s2'] - s2_mean)

cov_s1_s2 /= (stocks.shape[0] - 1)
print(cov_s1_s2)


s1_var /= (stocks.shape[0] - 1)   
print(s1_var)

0.00017188145892634048
0.00037412185579229146


In [109]:
# The simple way to get the entire covariance matrix; the loop above shows how to build one entry by hand
stocks.cov()

,s1,s2,s3,s4,sp
s1,0.000374,0.000172,0.000285,0.000155,0.000351
s2,0.000172,0.000249,0.000171,0.000115,0.000152
s3,0.000285,0.000171,0.000313,0.0002,0.000281
s4,0.000155,0.000115,0.0002,0.000252,0.000143
sp,0.000351,0.000152,0.000281,0.000143,0.000479
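
The hand-built loop generalises to the full matrix with one matrix product. A minimal sketch, assuming a small made-up returns array (the numbers below are illustrative, not the notebook's data):

```python
import numpy as np

# Made-up gross returns: 6 dates (rows) x 3 assets (columns).
returns = np.array([
    [1.012, 1.021, 0.994],
    [0.975, 1.002, 0.982],
    [1.003, 0.993, 0.999],
    [1.006, 1.001, 1.018],
    [0.999, 0.993, 0.992],
    [1.010, 1.004, 1.007],
])

# Subtract each column's mean, then sum the products of deviations over dates.
centered = returns - returns.mean(axis=0)
cov_manual = centered.T @ centered / (returns.shape[0] - 1)

# np.cov treats rows as variables by default, so pass rowvar=False.
cov_numpy = np.cov(returns, rowvar=False)
print(np.allclose(cov_manual, cov_numpy))
```

The result is indexed asset by asset: the date dimension is summed away.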


### Proposition 1

The index for the covariance matrix is asset by asset, NOT date. The covariance takes a list of dates as input and aggregates the data of two variables into a single number (similar to aggregates like the mean and variance). 

The index for a portfolio is also by ASSET (though the portfolio is a vector and not a matrix). 

A characteristic portfolio has unit exposure to some asset attribute and, among all portfolios with that exposure, minimum risk. The characteristic portfolio is simply a weighted combination of assets.

To find the characteristic portfolio of some attribute, we solve for the weights of each asset:  

$$ h_{a} = \frac{V^{-1}a}{a^{T}V^{-1}a} $$
$$ \sigma^{2}_{h_{a}} = h^{T}_{a}Vh_{a} = \frac{1}{a^{T}V^{-1}a} $$

Where $a$ is the attribute that the portfolio has unit exposure to and $V$ is the covariance matrix for all tradeable assets. Specifically, $a$ is a vector holding the attribute of each stock. So for a given period of time, we aggregate the time period into a single statistic that captures the attribute. This means that to index attribute vector $a$, we use the asset ID, which is also how we index the characteristic portfolio (which gives the weight of that asset) and the covariance matrix (which gives the covariance between two assets).

_are the values $a$ standardized?_  
???

The attribute vector $a$ also represents the Beta of all assets with respect to characteristic portfolio $h_a$

$$ a = \frac{Vh_a}{\sigma^{2}_{h_a}} $$
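
The three identities so far can be checked numerically. A minimal sketch, assuming a made-up covariance matrix `V` and attribute vector `a` (both are illustrative values, not taken from the data above):

```python
import numpy as np

# Made-up covariance matrix for three assets (symmetric and strictly
# diagonally dominant, hence positive definite).
V = np.array([[0.04, 0.01, 0.00],
              [0.01, 0.09, 0.02],
              [0.00, 0.02, 0.16]])
a = np.array([1.2, 0.8, 1.5])        # made-up attribute values, one per asset

v_inv_a = np.linalg.solve(V, a)      # V^{-1} a, without forming the inverse
h_a = v_inv_a / (a @ v_inv_a)        # characteristic portfolio of a
var_h_a = h_a @ V @ h_a              # portfolio variance h^T V h
beta = V @ h_a / var_h_a             # beta of every asset with respect to h_a

print("exposure to a:", h_a @ a)
print("variance     :", var_h_a, "vs", 1.0 / (a @ v_inv_a))
print("beta vs a    :", beta, a)
```

Using `np.linalg.solve` rather than an explicit inverse is a design choice for numerical stability; the formulas are unchanged.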

_how do we aggregate an attribute over a period of time?_  
???

_when we say "exposure" do we mean an aggregate number to some attribute?_  
By definition, the exposure of a portfolio $h_p$ to attribute $a$ is:
$$ a_p = \sum_{n}a_nh_{p,n} $$
Where $n$ iterates over each asset in the tradeable universe. The goal is not to drive $a_p$ to zero: the characteristic portfolio $h_a$ holds this exposure fixed at one ("unit exposure") and minimizes risk subject to that constraint.

Let's say we have two different characteristic portfolios, each with unit exposure to a different asset attribute (say, P/E and Volume). Let $a_d$ be the exposure of portfolio $h_d$ to attribute $a$, and $d_a$ be the exposure of portfolio $h_a$ to attribute $d$. The exposure is the summation defined above. The covariance between $h_a$ and $h_d$ is:

$$ \sigma_{a,d} = a_d\sigma^2_a = d_a\sigma^2_d$$

In English, this tells us that the covariance of two characteristic portfolios equals the variance of either one multiplied by the other portfolio's exposure to that one's attribute.

If $\kappa$ is a positive scalar, then the characteristic portfolio of $\kappa a$ is $h_a/\kappa$ because characteristic portfolios have unit exposure to the attribute (because we normalize the attribute???).

If attribute $a$ is a weighted combination of two different attributes, $d$ and $f$, then the characteristic portfolio $h_a$ is a weighted combination of the two portfolios $h_d$ and $h_f$. That is, if $a = \kappa_dd + \kappa_ff$ (note, that the sum of $\kappa$ need not add to one) then:

$$ h_a = \left(\frac{\kappa_d\sigma^2_a}{\sigma^2_d}\right)h_d + \left(\frac{\kappa_f\sigma^2_a}{\sigma^2_f}\right)h_f $$
Where
$$ \frac{1}{\sigma^2_a}=\left(\frac{\kappa_da_d}{\sigma^2_d}\right) + \left(\frac{\kappa_fa_f}{\sigma^2_f}\right) $$

_why do we set a characteristic portfolio's exposure to $a$ equal to 1? That is, why can we say $h^T_aa = 1$ for the optimization problem?_  
We set it to one because we DEFINE $h_a$ to have minimum risk AND unit exposure to $a$. This means we want to minimize $\sigma^2_{h_a}$ while holding $h^T_aa=1$.  

The equation for risk (variability) of the portfolio is given as:  

$$ \sigma^{2}_{h_{a}} = h^{T}_{a}Vh_{a} = \frac{1}{a^{T}V^{-1}a} $$  

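So we have an optimization problem with a quadratic objective and a single linear equality constraint (a quadratic program rather than a linear program):
$$
\begin{aligned}
\min_{h_a} \quad & h^{T}_{a}Vh_{a} \\
\text{s.t.} \quad & h^T_aa = 1
\end{aligned}
$$

Because the only constraint is an equality, we may solve it with a Lagrangian, which proves the first point: the characteristic portfolio is $h_a = \frac{V^{-1}a}{a^{T}V^{-1}a}$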
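
The Lagrangian step can be sketched as follows. This is a worked outline rather than the book's own derivation; it assumes $V$ is symmetric and invertible, and the multiplier $\lambda$ is notation introduced here.

$$
\begin{align*}
	L(h,\lambda)
	&= h^TVh - \lambda\left(h^Ta - 1\right)
	\\ \frac{\partial L}{\partial h}
	&= 2Vh - \lambda a = 0
	\quad\Rightarrow\quad h = \frac{\lambda}{2}V^{-1}a
	\\ 1
	&= a^Th = \frac{\lambda}{2}a^TV^{-1}a
	\quad\Rightarrow\quad \frac{\lambda}{2} = \frac{1}{a^TV^{-1}a}
	\\ h_a
	&= \frac{V^{-1}a}{a^TV^{-1}a}
\end{align*}
$$

Because $h^TVh$ is convex when $V$ is a positive definite covariance matrix, this stationary point is the minimum.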

By definition, the equation for $\beta$ is: 
$$ \beta = \frac{Vh_p}{\sigma^2_p} $$  

We know that beta is a vector by dimensional analysis. This gives us a vector of betas, indexed by asset, where each entry is the beta of that asset with respect to portfolio $h_p$. The portfolio may be anything: our own basket of assets, some index fund, or any other set of weights. The equation for beta always remains the same. This makes sense because earlier we said that for an asset:
$$ \beta_a = \frac{\sigma_{a,m}}{\sigma^2_m} $$  

However, we are now creating a vector of betas rather than an individual value. Moreover, the covariance between asset $a$ and the market $m$ is now the corresponding entry of the vector $Vh_p$, which holds the covariance of every asset with the portfolio.

_what is the DEFINITION of portfolio variance?_
The equation for portfolio variance in terms of its assets is:
$$ \sigma^2_p = \sum^n_i\sum^n_j w_iw_j\sigma_{i,j}$$

For the special case of $i=j$, the term resolves to $w^2_i\sigma^2_i$. Given this equation for portfolio variance, if we have a covariance matrix $V$ and portfolio $h_p$, then we may write the variance (risk) of $h_p$ as:
$$ \sigma^2_p = h^T_pVh_p $$

We are multiplying by $h^T_p$ and then $h_p$ because we need to square the weights, and we can do that conveniently by multiplying by the same vector twice!
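
To see that the double sum and the matrix form agree, here is a small sketch. The covariance matrix and weights are made-up numbers chosen only for illustration:

```python
import numpy as np

# Made-up covariance matrix and portfolio weights for three assets.
V = np.array([[0.04, 0.01, 0.00],
              [0.01, 0.09, 0.02],
              [0.00, 0.02, 0.16]])
h_p = np.array([0.5, 0.3, 0.2])

n = len(h_p)
# Definition: sum over every pair (i, j) of w_i * w_j * sigma_ij.
var_double_sum = sum(h_p[i] * h_p[j] * V[i, j]
                     for i in range(n) for j in range(n))
# Matrix form: h^T V h.
var_quadratic = h_p @ V @ h_p

print(var_double_sum, var_quadratic)
```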

Above is literally the definition of the equation for variance. In our example, we may simplify to other terms because we know what the characteristic portfolio is equal to.  
Given:
$$ \sigma^2_{h_a}=h^T_aVh_a $$
$$ h_a=\frac{V^{-1}a}{a^{T}V^{-1}a} $$

The proof of the other risk equality

$$
\begin{align*}
	\sigma^2_{h_a}
	&= h^T_aVh_a
	\\ &= \left(\frac{V^{-1}a}{a^{T}V^{-1}a}\right)^TV\left(\frac{V^{-1}a}{a^{T}V^{-1}a}\right)
	\\ &= \left(\frac{V^{-1}Va}{a^{T}V^{-1}a}\right)^T\left(\frac{V^{-1}a}{a^{T}V^{-1}a}\right) \tag{V transpose = V}
	\\ &= \left(\frac{a}{a^{T}V^{-1}a}\right)^T\left(\frac{V^{-1}a}{a^{T}V^{-1}a}\right)
	\\ &= \frac{a^TV^{-1}a}{\left(a^{T}V^{-1}a\right)^2}
	\\ &= \frac{1}{a^{T}V^{-1}a}
\end{align*}
$$



**I do have qualms with the Beta definition, because one was for the market, and this is for a portfolio. But you may express the market as a linear combination of each individual asset in the universe of tradeable assets.**

Assuming the beta definition is correct (I still need to understand intuitively why), we have a very simple proof using the expressions for $h_a$ and $\sigma^2_a$ in terms of $a^TV^{-1}a$:
$$
\begin{align*}
	h_a
	&= \frac{V^{-1}a}{a^{T}V^{-1}a}
	\\ &= \sigma^2_aV^{-1}a
	\\ \frac{h_a}{\sigma^2_a}
	&= V^{-1}a
	\\ a
	&= \frac{Vh_a}{\sigma^2_a}
\end{align*}
$$

Moving on to another interesting thing! Say we have two portfolios $h_a$ and $h_d$. What is the covariance between the two? The equation for covariance between portfolios is basically the same as a single portfolio's variance:
$$ \sigma^2_p = h^T_pVh_p $$
$$ \sigma_{a,d} = h^T_aVh_d $$

Given this is how we compute covariance across different portfolios, we can now plug in to prove that the covariance between two characteristic portfolios is tied to each portfolio's exposure to the attribute that the other portfolio has unit exposure to (with minimal risk). Recall that $a_d=a^Th_d$ is the exposure of $h_d$ to attribute $a$, and $d_a=d^Th_a$ is the exposure of $h_a$ to attribute $d$.
$$
\begin{align*}
	\sigma_{h_a,h_d}
	&= h^T_aVh_d
	\\ &= \left(h^T_aV\right)h_d
	\\ &= \left(\sigma^2_{h_a}a^T\right)h_d
	\\ &= \sigma^2_{h_a}a_d
\end{align*}
$$
$$
\begin{align*}
	\sigma_{h_d,h_a}
	&= h^T_dVh_a
	\\ &= \left(h^T_dV\right)h_a
	\\ &= \left(\sigma^2_{h_d}d^T\right)h_a
	\\ &= \sigma^2_{h_d}d_a
\end{align*}
$$

These proofs all follow from proving that the beta is equal to the attribute vector $a$ and rearranging to say that:
$$a=\frac{Vh_a}{\sigma^2_{h_a}}$$
$$\sigma^2_{h_a}a=Vh_a$$
$$\sigma^2_{h_a}a^T=h^T_aV$$

We also use the fact that portfolio $h_a$ has unit exposure to attribute $a$, so $1=h^T_aa$, and the definition of the exposure of $h_d$ to attribute $a$, which is $a_d=h^T_da$ (likewise $d_a=h^T_ad$).
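
A numeric check of $\sigma_{a,d} = a_d\sigma^2_a = d_a\sigma^2_d$, using the convention that $a_d$ is the exposure of $h_d$ to attribute $a$. This is only a sketch: `V`, `a`, `d`, and the helper `characteristic_portfolio` are made up for illustration.

```python
import numpy as np

# Made-up covariance matrix and two made-up attribute vectors.
V = np.array([[0.04, 0.01, 0.00],
              [0.01, 0.09, 0.02],
              [0.00, 0.02, 0.16]])
a = np.array([1.2, 0.8, 1.5])
d = np.array([0.5, 2.0, 1.0])

def characteristic_portfolio(V, attribute):
    """Return V^{-1} x / (x^T V^{-1} x) for attribute vector x."""
    x = np.linalg.solve(V, attribute)
    return x / (attribute @ x)

h_a = characteristic_portfolio(V, a)
h_d = characteristic_portfolio(V, d)
var_a = h_a @ V @ h_a
var_d = h_d @ V @ h_d

cov_ad = h_a @ V @ h_d   # covariance between the two portfolios
a_d = a @ h_d            # exposure of h_d to attribute a
d_a = d @ h_a            # exposure of h_a to attribute d

print(cov_ad, a_d * var_a, d_a * var_d)
```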

Given $1=h^T_aa$, if we multiply $a$ by $\kappa$ then we must divide $h_a$ by $\kappa$ for this equation to remain correct.
$$1=\frac{h^T_a}{\kappa}\kappa a$$

_what is the variance for portfolio $h_a$ now?_

Say $a$ is now a weighted combination of two different attributes. We still want $h_a$ to have unit exposure to $a$ with minimum risk. By definition, we have $a=\kappa_dd+\kappa_ff$. What is the equation for the characteristic portfolio $h_a$ now? We know for a portfolio to have unit exposure and minimal risk to attribute $a$ then:
$$ h_a=\frac{V^{-1}a}{a^{T}V^{-1}a} $$

They recommend substituting the result into equation 3: $a=\frac{Vh_a}{\sigma^2_a}$
$$
\begin{align*}
	h_a
	&= \sigma^2_aV^{-1}a
	\\ &= \sigma^2_aV^{-1}\left(\kappa_dd+\kappa_ff\right)
	\\ &= \kappa_d\sigma^2_aV^{-1}d+\kappa_f\sigma^2_aV^{-1}f
	\\ &= \kappa_d\sigma^2_a\left(\frac{h_d}{\sigma^{2}_{h_d}}\right)+\kappa_f\sigma^2_a\left(\frac{h_f}{\sigma^{2}_{h_f}}\right)
	\\ &= \left(\frac{\kappa_d\sigma^2_a}{\sigma^{2}_{h_d}}\right)h_d+\left(\frac{\kappa_f\sigma^2_a}{\sigma^{2}_{h_f}}\right)h_f
\end{align*}
$$

Now to solve for the risk, $\sigma^2_{h_a}$

Let $a_d$ be the exposure of portfolio $h_d$ to attribute $a$, so $a_d=a^Th_d$ by definition. Also remember that characteristic portfolio $h_a$ has unit exposure to attribute $a$, so $h^T_aa=1=a^Th_a$. Multiplying both sides on the left by $a^T$:

$$
\begin{align*}
	h_a
	&= \left(\frac{\kappa_d\sigma^2_a}{\sigma^2_d}\right)h_d + \left(\frac{\kappa_f\sigma^2_a}{\sigma^2_f}\right)h_f
	\\ a^Th_a
	&= \left(\frac{\kappa_d\sigma^2_a}{\sigma^2_d}\right)a^Th_d + \left(\frac{\kappa_f\sigma^2_a}{\sigma^2_f}\right)a^Th_f
	\\ 1
	&= \left(\frac{\kappa_d\sigma^2_a}{\sigma^2_d}\right)a^Th_d + \left(\frac{\kappa_f\sigma^2_a}{\sigma^2_f}\right)a^Th_f
	\\ \frac{1}{\sigma^2_{a}}
	&= \left(\frac{\kappa_d}{\sigma^2_d}\right)a^Th_d + \left(\frac{\kappa_f}{\sigma^2_f}\right)a^Th_f
	\\ &= \left(\frac{\kappa_d}{\sigma^2_d}\right)a_d + \left(\frac{\kappa_f}{\sigma^2_f}\right)a_f
	\\ &= \left(\frac{\kappa_da_d}{\sigma^2_d}\right) + \left(\frac{\kappa_fa_f}{\sigma^2_f}\right)
\end{align*}
$$
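
Both results for a combined attribute can be verified numerically. A minimal sketch, assuming made-up values for `V`, `d`, `f`, and the two weights `kappa_d` and `kappa_f` (none of these come from the data above):

```python
import numpy as np

V = np.array([[0.04, 0.01, 0.00],
              [0.01, 0.09, 0.02],
              [0.00, 0.02, 0.16]])      # made-up covariance matrix
d = np.array([0.5, 2.0, 1.0])           # made-up attribute d
f = np.array([0.05, 0.07, 0.10])        # made-up attribute f
kappa_d, kappa_f = 2.0, -0.5            # weights need not sum to one
a = kappa_d * d + kappa_f * f           # combined attribute

def characteristic_portfolio(V, attribute):
    x = np.linalg.solve(V, attribute)
    return x / (attribute @ x)

h_a, h_d, h_f = (characteristic_portfolio(V, x) for x in (a, d, f))
var_a, var_d, var_f = (h @ V @ h for h in (h_a, h_d, h_f))

# h_a as a weighted combination of h_d and h_f.
h_combo = (kappa_d * var_a / var_d) * h_d + (kappa_f * var_a / var_f) * h_f

# 1 / sigma_a^2 from the exposures a_d = a^T h_d and a_f = a^T h_f.
inv_var_a = kappa_d * (a @ h_d) / var_d + kappa_f * (a @ h_f) / var_f

print(np.allclose(h_a, h_combo), 1.0 / var_a, inv_var_a)
```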

All proofs done! I just have that pesky beta intuition that I want to think about!

### Proposition 2

In this part we consider very specific attributes. Consider the attribute $e$ that has a value of 1 for every asset, so $e=(1,1,\ldots,1)^\mathsf{T}$. The portfolio that minimizes risk with unit exposure to $e$ still satisfies $h^\mathsf{T}_Ce=1$ (both $h_C$ and $e$ are vectors with one entry per asset, where n is the number of assets). Written out, $e_C=1=\sum^n_i h_C(i)$.

Specifically, we call $h_C$ the portfolio that minimizes risk and has unit exposure to $e$, so $h^\mathsf{T}_Ce=1$. Then from what we proved before we know the following:
$$h_C=\frac{V^{-1}e}{e^\mathsf{T}V^{-1}e}$$
$$\sigma^2_C=h^\mathsf{T}_CVh_C=\frac{1}{e^\mathsf{T}V^{-1}e}$$
$$e=\frac{Vh_C}{\sigma^2_C}$$

Portfolio C is the _minimum risk fully invested_ portfolio. Because $h_C$ has unit exposure to attribute $e$, which is just a vector of ones, the weights in $h_C$ must sum to one. And if the weights of C sum to one, then it is "fully invested".

Portfolio C => Fully Invested Portfolio
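
As an illustration, Portfolio C can be computed directly and compared against other fully invested portfolios. This is a sketch under assumptions: `V` is a made-up covariance matrix, and the comparison set is 1000 random long-only portfolios drawn with a fixed seed.

```python
import numpy as np

V = np.array([[0.04, 0.01, 0.00],
              [0.01, 0.09, 0.02],
              [0.00, 0.02, 0.16]])      # made-up covariance matrix
e = np.ones(3)                          # attribute of all ones

x = np.linalg.solve(V, e)
h_C = x / (e @ x)                       # minimum risk fully invested portfolio
var_C = h_C @ V @ h_C

# Random long-only weights that sum to one, for comparison.
rng = np.random.default_rng(0)
random_weights = rng.dirichlet(np.ones(3), size=1000)
# Row-wise quadratic form w_i^T V w_i for every random portfolio.
random_vars = np.einsum("ij,jk,ik->i", random_weights, V, random_weights)

print("weights sum to:", h_C.sum())
print("var_C:", var_C, "smallest random variance:", random_vars.min())
```

None of the random portfolios should have lower variance than `h_C`, which is what "minimum risk" means here.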

Suppose the attribute is now $\beta$ rather than $e$. Then we have another special portfolio definition. Beta is a _relative measure_: it is the ratio of the covariance between a stock and the market to the variance of the market. The market, also called a _benchmark_, is, like a portfolio, just a vector of asset weights. Say we have some benchmark, $h_B$. By definition, we know the beta of all assets to this benchmark is:
$$\beta=\frac{Vh_B}{\sigma^2_B}$$

We want to prove that the weights of the $h_\beta$ vector are the same weights as the portfolio vector $h_B$. That is, $h_\beta=h_B$, which makes intuitive sense because if we want our portfolio to have unit exposure to another portfolio and minimize risk, then the best solution is to just construct the exact same portfolio! Let's prove it, starting from the definition of beta above.
$$
\begin{align*}
    1
    &= h^\mathsf{T}_\beta\beta
    \\ &= \frac{h^\mathsf{T}_\beta Vh_B}{\sigma^2_B}
    \\ &= \frac{\sigma_{\beta,B}}{\sigma^2_B}
    \\ \sigma^2_B 
    &= \sigma_{\beta,B}
\end{align*}
$$

Maybe I should have gone about this another way:
$$
\begin{align*}
    h_\beta
    &= \frac{V^{-1}\beta}{\beta^\mathsf{T}V^{-1}\beta}
    \\ &= \frac{V^{-1}\frac{Vh_B}{\sigma^2_B}}{\left(\frac{Vh_B}{\sigma^2_B}\right)^\mathsf{T}V^{-1}\frac{Vh_B}{\sigma^2_B}}
    \\ &= \frac{\frac{h_B}{\sigma^2_B}}{\frac{h^\mathsf{T}_BVh_B}{\sigma^4_B}}
    \\ &= \frac{\sigma^2_Bh_B}{h^\mathsf{T}_BVh_B}
    \\ &= \frac{(h_B^\mathsf{T}Vh_B)h_B}{h^\mathsf{T}_BVh_B}
    \\ &= h_B
\end{align*}
$$


It then follows that:
$$ \sigma^2_\beta=\sigma^2_B=h^\mathsf{T}_BVh_B = \frac{1}{\beta^\mathsf{T}V^{-1}\beta}$$
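
The result $h_\beta=h_B$ is easy to confirm numerically. A small sketch with a made-up covariance matrix and a made-up benchmark:

```python
import numpy as np

V = np.array([[0.04, 0.01, 0.00],
              [0.01, 0.09, 0.02],
              [0.00, 0.02, 0.16]])      # made-up covariance matrix
h_B = np.array([0.5, 0.3, 0.2])         # made-up benchmark weights

var_B = h_B @ V @ h_B
beta = V @ h_B / var_B                  # beta of every asset to the benchmark

# Characteristic portfolio of the attribute beta.
x = np.linalg.solve(V, beta)
h_beta = x / (beta @ x)
var_beta = h_beta @ V @ h_beta

print(h_beta, h_B)
print(var_beta, var_B, 1.0 / (beta @ x))
```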

_what if stocks have the same beta to the benchmark? you should then be able to create portfolio weights that are different but have the same beta and minimize the risk (if both asset risks are the same). This is like multicollinearity where you have infinite possible solutions if two variables are interchangeable._

"The benchmark is the minimum-risk portfolio with $\beta=1$ This makes sense intuitively. All $\beta=1$ portfolios have the same systematic risk. Since the benchmark portfolio has zero residual risk, it has the minimum total risk of all $\beta=1$ portfolios."

We can now relate portfolios, $h_C$ and $h_B$ by using what we proved before. That is:
$$ \sigma_{C,B}=e_B\sigma^2_C=\beta_C\sigma^2_B $$

In CAPM, the equation for expected returns is $r_a=r_f+\beta\left(r_m-r_f\right)$, so the expected excess return is $\beta\left(r_m-r_f\right)$. However, we have not made any assumptions about CAPM just yet. We can define excess returns simply as the difference between the asset's return and the risk-free rate. We do not come up with an equation for excess returns; we simply define that there is a value that represents expected excess returns, which we call $f$.

So each asset has its own attribute of expected excess return. Thus, we can create a characteristic portfolio for excess return, which we will call $h_q$:
$$ h_q = \frac{V^{-1}f}{f^\mathsf{T}V^{-1}f}$$

The Sharpe ratio definition is the expected excess returns divided by the risk of the portfolio:
$$ SR_P = \frac{f_P}{\sigma_P}$$

The equation for $f_P$ is the weight of each asset multiplied by the expected excess return of that asset, summed over assets. So, $f_P=h^\mathsf{T}_Pf$ where $f$ is the asset attribute for expected excess returns. And we know the variance of a portfolio is $\sigma^2_P=h^\mathsf{T}_PVh_P$. One could then rewrite the Sharpe Ratio of a given portfolio as:
$$SR_P=\frac{h^\mathsf{T}_Pf}{\left(h^\mathsf{T}_PVh_P\right)^{\frac{1}{2}}}$$

We want to prove the following statements given everything we have just said.
1. $SR_q = \max\{SR_P|P\} = \left(f^\mathsf{T}V^{-1}f\right)^{\frac{1}{2}}$
2. $f_q = 1$
3. $\sigma^2_q = \frac{1}{f^\mathsf{T}V^{-1}f}$
4. $f=\frac{Vh_q}{\sigma^2_q}=\left(\frac{Vh_q}{\sigma_q}\right)SR_q$ because $f_q=1$
5. $SR_P=\rho_{P,q}SR_q$ where $\rho_{P,q}$ is the correlation coefficient. P need not maximize the Sharpe Ratio
6. $e_q=\frac{f_C\sigma^2_q}{\sigma^2_C}$ is the fraction of $q$ invested in risky assets

For any portfolio $P$, we have $SR_P=\frac{f_P}{\sigma_P}$. For any positive constant $\kappa$, the scaled portfolio $\kappa h_P$ has the same Sharpe Ratio $\frac{f_P}{\sigma_P}$ (a negative $\kappa$ would flip its sign). We can prove this by substituting $\kappa h_P$ for $h_P$ in the Sharpe Ratio equation written in terms of the portfolio P:
$$
\begin{align*}
    SR_{\kappa P}
    &= \frac{\kappa h^\mathsf{T}_Pf}{\left(\kappa h^\mathsf{T}_PV\kappa h_P\right)^{\frac{1}{2}}}
    \\ &= \frac{\kappa h^\mathsf{T}_Pf}{\kappa\left(h^\mathsf{T}_PVh_P\right)^{\frac{1}{2}}}
    \\ &= \frac{h^\mathsf{T}_Pf}{\left(h^\mathsf{T}_PVh_P\right)^{\frac{1}{2}}}
    \\ &= SR_P
\end{align*}
$$
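
A quick numeric illustration of the scaling argument. The inputs and the helper `sharpe_ratio` are made up for this sketch; note that the invariance needs a positive scale factor, since a negative one flips the sign.

```python
import numpy as np

V = np.array([[0.04, 0.01, 0.00],
              [0.01, 0.09, 0.02],
              [0.00, 0.02, 0.16]])      # made-up covariance matrix
f = np.array([0.05, 0.07, 0.10])        # made-up expected excess returns
h_P = np.array([0.5, 0.3, 0.2])         # made-up portfolio

def sharpe_ratio(h, f, V):
    """Expected excess return of h divided by its standard deviation."""
    return (h @ f) / np.sqrt(h @ V @ h)

sr = sharpe_ratio(h_P, f, V)
sr_scaled = sharpe_ratio(3.0 * h_P, f, V)     # positive kappa
sr_flipped = sharpe_ratio(-3.0 * h_P, f, V)   # negative kappa

print(sr, sr_scaled, sr_flipped)
```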

What this means is that we have some portfolio $h_P$ whose exposure to expected excess returns $f$ is $f_P=h^\mathsf{T}_Pf$. We have just proved that multiplying $h_P$ by any positive constant does not change its Sharpe Ratio. As long as $f_P>0$, there exists some $\kappa>0$ such that $\kappa h^\mathsf{T}_Pf=1$. Importantly, that scaled portfolio has the _same_ Sharpe Ratio as our current portfolio P. So let's use that portfolio and just call it $h_P$, because it has the same Sharpe Ratio. Basically, we are replacing $h_P$ with $\kappa h_P$ such that $h^\mathsf{T}_Pf=1$.

Now we want to maximize the Sharpe Ratio. We know that we can rescale ANY portfolio P (with positive expected excess return) to have unit exposure to the expected excess return attribute by the simple proof above. Intuitively, we take ALL portfolios (each can have a different Sharpe Ratio) and normalize them so that they ALL have unit exposure to $f$, because we can! Now every portfolio has the same unit exposure to $f$, so the best portfolio, the one with the highest Sharpe Ratio, will be the one with the smallest risk. We are normalizing every portfolio to the same numerator and then finding the one with the smallest variance, which has the biggest Sharpe Ratio because the numerator is fixed for all portfolios.

To find $\max\{SR_P|P\}$ we use the same optimization as in the beginning: minimize risk subject to unit exposure (which we did with a Lagrangian). This gives us the equation for the portfolio P with the maximum Sharpe Ratio as:
$$h_P=\frac{V^{-1}f}{f^\mathsf{T}V^{-1}f}=h_q$$

To find the equation for Sharpe Ratio of $h_q$ we need to just plug-in. We know by definition $f_q$ is the exposure of portfolio $q$ to attribute $f$, that is $f_q=h^\mathsf{T}_qf$. However, we constructed $h_q$ by definition to have unit exposure to $f$, so we know $h^\mathsf{T}_qf=1=f_q$. 

Using this information, we can solve for the Sharpe Ratio of portfolio $h_q$. The equation for Sharpe Ratio is:
$$
\begin{align*}
	SR_q
	&= \frac{h^\mathsf{T}_qf}{\left(h^\mathsf{T}_qVh_q\right)^{\frac{1}{2}}}
    \\ &= \frac{1}{\left(h^\mathsf{T}_qVh_q\right)^{\frac{1}{2}}}
    \\ &= \left(\frac{1}{f^\mathsf{T}V^{-1}f}\right)^{-\frac{1}{2}}
    \\ &= \left(f^\mathsf{T}V^{-1}f\right)^{\frac{1}{2}}
\end{align*}
$$
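
To make the maximum concrete, the sketch below compares $SR_q$ with the Sharpe Ratios of random portfolios. `V` and `f` are made-up inputs, and the 1000 random portfolios are drawn with a fixed seed purely for illustration.

```python
import numpy as np

V = np.array([[0.04, 0.01, 0.00],
              [0.01, 0.09, 0.02],
              [0.00, 0.02, 0.16]])      # made-up covariance matrix
f = np.array([0.05, 0.07, 0.10])        # made-up expected excess returns

x = np.linalg.solve(V, f)
h_q = x / (f @ x)                       # characteristic portfolio of f
sr_q = (h_q @ f) / np.sqrt(h_q @ V @ h_q)
sr_bound = np.sqrt(f @ x)               # (f^T V^{-1} f)^(1/2)

# Sharpe Ratios of random portfolios (long and short positions allowed).
rng = np.random.default_rng(0)
H = rng.normal(size=(1000, 3))
random_srs = (H @ f) / np.sqrt(np.einsum("ij,jk,ik->i", H, V, H))

print(sr_q, sr_bound, random_srs.max())
```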

Items 3 and 4 are simple enough to get by plugging in what we have already proved. Part 5 forces us to rearrange. I will just prove 4 and 5 together, and 3 is by previous proof:
$$
\begin{align*}
	f
    &= \frac{Vh_q}{\sigma^2_q}
    \\ &= \left(\frac{Vh_q}{\sigma_q}\right)\left(\frac{1}{\sigma_q}\right)
    \\ &= \left(\frac{Vh_q}{\sigma_q}\right)\left(\frac{f_q}{\sigma_q}\right)
    \\ &= \left(\frac{Vh_q}{\sigma_q}\right)SR_q
    \\ \left(\frac{h_P^\mathsf{T}}{\sigma_P}\right)f
    &= \left(\frac{h^\mathsf{T}_P}{\sigma_P}\right)\left(\frac{Vh_q}{\sigma_q}\right)SR_q
    \\ \frac{h_P^\mathsf{T}f}{\sigma_P}
    &= \left(\frac{h^\mathsf{T}_PVh_q}{\sigma_P\sigma_q}\right)SR_q
    \\ \frac{f_P}{\sigma_P}
    &= \left(\frac{\sigma_{P,q}}{\sigma_P\sigma_q}\right)SR_q
    \\ SR_P
    &= \rho_{P,q}SR_q
\end{align*}
$$
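
The relationship $SR_P=\rho_{P,q}SR_q$ holds for any portfolio, which a short numeric check makes tangible. All inputs here are made-up values:

```python
import numpy as np

V = np.array([[0.04, 0.01, 0.00],
              [0.01, 0.09, 0.02],
              [0.00, 0.02, 0.16]])      # made-up covariance matrix
f = np.array([0.05, 0.07, 0.10])        # made-up expected excess returns
h_P = np.array([0.2, 0.5, 0.3])         # arbitrary made-up portfolio

x = np.linalg.solve(V, f)
h_q = x / (f @ x)

sigma_P = np.sqrt(h_P @ V @ h_P)
sigma_q = np.sqrt(h_q @ V @ h_q)
rho_Pq = (h_P @ V @ h_q) / (sigma_P * sigma_q)   # correlation of P and q

sr_P = (h_P @ f) / sigma_P
sr_q = (h_q @ f) / sigma_q

print(sr_P, rho_Pq * sr_q)
```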

Something to note: while the concept of $e$ being a vector of only ones seemed trivial, it represents more information than just that. "Every portfolio's exposure to $e$ measures the extent of its investment. If $e_p=1$, then the portfolio is fully invested." Basically, we are asking whether the weights in our portfolio sum to 1. If they do, then we are investing everything; if not, then we are not investing everything. This seems tricky, because we may allocate a portion of our book to the risk-free asset and then split the remainder across risky assets as if it were 100%, rather than consider the risky investments as part of the whole. I assume there will be more information on this later.

The reason portfolio $h_C$ is important is that it has invested 100% of its capital in risky investments (assets). When we look at the covariance between portfolios $h_C$ and $h_q$ we can use what we have already proved to state:
$$\sigma_{q,C}=e_q\sigma^2_C=f_C\sigma^2_q$$

Now the ALPHA portfolio! One thing I haven't done well is denote scalar and vectors well, turns out using \textbf{f} is a good way to do that. Similarly, I started using \mathsf{T} to make the transpose notation look better.

Let us define alpha as $\alpha = \textbf{f}-\beta f_B$. This is how the book gives it, though alpha should probably be bold because it is a vector. Here $f_B$ is the _benchmark's_ exposure to expected excess returns. Because alpha is composed of two attributes, expected excess return and beta, we can finally use what we proved at the end of part one to write $h_\alpha$ in terms of $h_B$ and $h_q$.

We call $h_A$ the characteristic portfolio of $\alpha$. The covariance between the benchmark and $h_A$ is:
$$ \sigma_{B, A}=\alpha_B\sigma^2_A=\beta_A\sigma^2_B$$
Since $\alpha_B=0$, this also means $\beta_A=0$. The proof of why $\alpha_B=0=\beta_A$ is as follows:
$$
\begin{align*}
	\alpha_B
	&=h^\mathsf{T}_B\alpha
	\\ &= h^\mathsf{T}_B(f-\beta f_B)
	\\ &= h^\mathsf{T}_Bf - h^\mathsf{T}_B\beta f_B
	\\ &= h^\mathsf{T}_Bf - (h^\mathsf{T}_B\beta) f_B
	\\ &= h^\mathsf{T}_Bf - (1) f_B
	\\ &= f_B - f_B
	\\ &= 0
\end{align*}
$$
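
The same conclusion can be checked numerically. In this sketch the covariance matrix, expected excess returns, and benchmark weights are all made up; `h_A` is built with the characteristic portfolio formula from Proposition 1.

```python
import numpy as np

V = np.array([[0.04, 0.01, 0.00],
              [0.01, 0.09, 0.02],
              [0.00, 0.02, 0.16]])      # made-up covariance matrix
f = np.array([0.05, 0.07, 0.10])        # made-up expected excess returns
h_B = np.array([0.5, 0.3, 0.2])         # made-up benchmark weights

beta = V @ h_B / (h_B @ V @ h_B)        # asset betas to the benchmark
f_B = h_B @ f                           # benchmark's expected excess return
alpha = f - beta * f_B                  # alpha attribute

alpha_B = h_B @ alpha                   # benchmark's exposure to alpha

x = np.linalg.solve(V, alpha)
h_A = x / (alpha @ x)                   # characteristic portfolio of alpha
beta_A = h_A @ beta                     # beta of the alpha portfolio

print(alpha_B, beta_A, h_A @ alpha)
```

Up to floating point error, both `alpha_B` and `beta_A` come out as zero while `h_A` keeps unit exposure to alpha.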

_is this a generic way to construct ANY attribute such that it has 0 exposure to some other attribute?_  
The answer is yes! Create an attribute $z$, which is a combination of two other attributes $x$ and $y$. The portfolio $h_x$ is the characteristic portfolio of $x$ and $h_y$ is the characteristic portfolio of $y$. If we say $z=x-yx_y$, where $x_y=h^\mathsf{T}_yx$ is the exposure of portfolio $h_y$ to attribute $x$, then $z_y=h^\mathsf{T}_yz=x_y-\left(h^\mathsf{T}_yy\right)x_y=0$, so the covariance between portfolios $h_z$ and $h_y$ is zero by construction. In other words, portfolio $h_z$ will have __zero__ exposure to attribute $y$, just as $h_A$ has zero beta.
$$\sigma_{z,y}=z_y\sigma^2_z=y_z\sigma^2_y=0 \quad\Rightarrow\quad z_y=y_z=0$$

"In many cases we will find it convenient to assume there is a fully invested portfolio that explains the expected excess returns. That will be the case if the expected excess return on portfolio C is positive."

### Proposition 3

Given $h_Q$ is the characteristic portfolio of $e_qf$ and $f_C\gt0$, we want to prove the following statements:
1. $e_q \gt 0$
2. $SR_P=\rho_{P,Q}SR_Q$, $\frac{f_C}{\sigma^2_C}=\frac{f_Q}{\sigma^2_Q}$, and $f=f_Q\left( \frac{Vh_Q}{\sigma^2_Q} \right) = f_Q\beta_Q $
3. $\beta_Q=\frac{f_B\sigma^2_Q}{f_Q\sigma^2_B}$
4. $\beta_Q=\frac{\beta_Cf_B}{f_C}$

__Part 1:__

We have proved before that $\sigma_{a,b}=a_b\sigma^2_a=b_a\sigma^2_b$ for any two attributes $a$ and $b$. We know that $\sigma^2>0$, and we are given that $f_C>0$, so we can see that $e_q$ indeed is greater than zero because:
$$e_q\sigma^2_C=f_C\sigma^2_q$$
Both $f_C>0$ and $\sigma^2_q>0$, therefore $e_q>0$

__Part 2:__

Let us first get a better grasp of what exactly $h_Q$ is. It is the characteristic portfolio of attribute $Q=e_qf$. Therefore $h_Q^\mathsf{T}Q=1=h_Q^\mathsf{T}(e_qf)$. Using bold to indicate vectors and non-bold to indicate scalars would have been useful because $e_q$ is equal to $h^\mathsf{T}_qe$ which is just a scalar and $f$ is a vector.

With this in mind, we proved some useful things about multiplying an attribute by an arbitrary constant. Mainly, if we multiply the attribute vector by some constant kappa, then we must divide the characteristic portfolio by that same constant to maintain unit exposure to the scaled attribute vector.
$$1=\frac{h^T_a}{\kappa}\kappa a$$

In this case, $e_q$ is just a constant. Therefore, if we multiply an attribute by the constant, then we must divide the characteristic portfolio to maintain unit exposure. Therefore:
$$h_Q=\frac{h_q}{e_q}$$

We also proved that multiplying a portfolio by a positive constant does not change its Sharpe Ratio, and Part 1 showed $e_q>0$. Therefore, we know:
$$SR_Q=SR_q$$

We had previously proved that $SR_P=\rho_{P,q}SR_q$, so we can substitute and we get:
$$SR_P=\rho_{P,q}SR_Q$$

Notice we have $\rho_{P,q}$ and not $\rho_{P,Q}$. I also need to prove that multiplying by a constant does not change the correlation coefficient between two variables.
$$\rho_{a,b}=\frac{\sigma_{a,b}}{\left( \sigma^2_a\sigma^2_b \right)^{\frac{1}{2}}}=\frac{\sigma_{ca,b}}{\left( \sigma^2_{ca}\sigma^2_b \right)^{\frac{1}{2}}}$$

Where $c$ is some positive constant (so that $\left(c^2\right)^{\frac{1}{2}}=c$). If we can prove this, then we can say that $\rho_{P,Q}=\rho_{P,q}$ because $h_Q=\kappa h_q$. It just so happens that $\kappa=\frac{1}{e_q}>0$ for this specific portfolio.

We know $Var(aX)=a^2Var(X)$ and $Cov(aX,Y)=aCov(X,Y)$. So if we just plug-in we have the following.
$$
\begin{align*}
    \rho_{cA,B}
    &= \frac{Cov(cA,B)}{\left( Var(cA)Var(B) \right)^\frac{1}{2}}
    \\ &= \frac{cCov(A,B)}{\left( c^2Var(A)Var(B) \right)^\frac{1}{2}}
    \\ &= \frac{c\sigma_{A,B}}{\left( c^2\sigma^2_A\sigma^2_B \right)^\frac{1}{2}}
    \\ &= \frac{c\sigma_{A,B}}{c\sigma_A\sigma_B}
    \\ &= \frac{\sigma_{A,B}}{\sigma_A\sigma_B}
    \\ &= \rho_{A,B}
\end{align*}
$$

Therefore,
$SR_P=\rho_{P,q}SR_q=\rho_{P,Q}SR_Q$

To relate attributes and characteristic portfolios of $e_qf$ and $e$ we use the following:
$$\sigma_{h_C,h_Q}=e_qf_C\sigma^2_Q=e_Q\sigma^2_C$$

It is really interesting that $e_Q=1$. That is, characteristic portfolio $h_Q$ has unit exposure to $e$, but that does not mean it has minimal variance among such portfolios. Unit exposure to $e$ is not sufficient for a portfolio to be the characteristic portfolio of $e$. It seems like we have discovered another useful relationship when constructing portfolios: how to construct a portfolio with unit exposure to one attribute and unit exposure AND minimal variance to another attribute. 
$$
\begin{align*}
	e_Q
	&= h^\mathsf{T}_Qe
	\\ &= \frac{h^\mathsf{T}_qe}{e_q}
	\\ &= \frac{e_q}{e_q}
	\\ &= 1
\end{align*}
$$

We also need $f_Q$, the exposure of $h_Q$ to $f$, to finish the proof. It is $\frac{1}{e_q}$ rather than one:
$$
\begin{align*}
	f_Q
	&= h^\mathsf{T}_Qf
	\\ &= \frac{h^\mathsf{T}_qf}{e_q}
	\\ &= \frac{1}{e_q}
\end{align*}
$$

For the full proof:
$$
\begin{align*}
	e_qf_C\sigma^2_Q
	&= e_Q\sigma^2_C
	\\ &= \sigma^2_C
	\\ \frac{f_C}{\sigma^2_C}
	&= \frac{1}{e_q\sigma^2_Q}
	\\ &= \frac{f_Q}{\sigma^2_Q}
\end{align*}
$$

We want to isolate $f$. We know that $Q=\frac{Vh_Q}{\sigma^2_Q}$. We also know that attribute $Q=e_qf$, that we constructed portfolio $h_Q$ such that $h^\mathsf{T}_Q(e_qf)=1$, and that $h_Q=\frac{h_q}{e_q}$.

$$
\begin{align*}
	1
	&= h^\mathsf{T}_Q(e_qf)
	\\ &= e_qf_Q
	\\ f
	&= e_qf_Qf
	\\ &= f_Q(e_qf)
	\\ &= f_Q(Q)
	\\ &= f_Q\left(\frac{Vh_Q}{\sigma^2_Q}\right)
\end{align*}
$$

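A question that lingers: _why does $\left(\frac{Vh_Q}{\sigma^2_Q}\right)=\beta_Q$?_ Or does _beta with respect to Q mean something different?_ I think that it means something different from market beta: by the general definition $\beta=\frac{Vh_p}{\sigma^2_p}$ used earlier, it is the vector of asset betas measured against portfolio $Q$. It is NOT related to characteristic portfolio B, which represents the benchmark portfolio weights. (In Parts 3 and 4, the scalar $\beta_Q$ instead denotes the beta of portfolio $Q$ with respect to the benchmark.)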

__Part 3:__

$$
\begin{align*}
	f
    &= f_Q\left(\frac{Vh_Q}{\sigma^2_Q}\right)
    \\ h^\mathsf{T}_Bf
    &= \frac{h^\mathsf{T}_BVh_Qf_Q}{\sigma^2_Q}
    \\ f_B
    &= \left( \frac{h^\mathsf{T}_BVh_Q}{\sigma^2_B} \right)\left( \frac{f_Q\sigma^2_B}{\sigma^2_Q} \right)
    \\ &= \beta_Q\left( \frac{f_Q\sigma^2_B}{\sigma^2_Q} \right)
    \\ \beta_Q
    &= \frac{f_B\sigma^2_Q}{f_Q\sigma^2_B}
\end{align*}
$$

This is a general case for any characteristic portfolio. One can solve for the exposure to market beta of the portfolio using this proof.

__Part 4:__



$$\frac{f_C}{\sigma^2_C}=\frac{f_Q}{\sigma^2_Q}$$
$$\sigma^2_Q=\frac{f_Q\sigma^2_C}{f_C}$$
$$\beta_Q=\frac{f_B\sigma^2_Q}{f_Q\sigma^2_B}$$

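We also need to prove $\beta_C=\frac{\sigma^2_C}{\sigma^2_B}$ when $e_B=1$ (that is, when the benchmark is fully invested), using $\sigma_{C,B}=e_B\sigma^2_C=\beta_C\sigma^2_B$: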
$$
\begin{align*}
	e_B\sigma^2_C
	& =\beta_C\sigma^2_B
	\\ \sigma^2_C
	& =\beta_C\sigma^2_B
	\\ \beta_C
	&= \frac{\sigma^2_C}{\sigma^2_B}
\end{align*}
$$

Full Proof:
$$
\begin{align*}
	\beta_Q
	&= \frac{f_B\sigma^2_Q}{f_Q\sigma^2_B}
	\\ &= \frac{f_B}{f_Q\sigma^2_B}\frac{f_Q\sigma^2_C}{f_C}
	\\ &= \frac{f_B}{\sigma^2_B}\frac{\sigma^2_C}{f_C}
	\\ &= \frac{\beta_Cf_B}{f_C}
\end{align*}
$$
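
The four parts of Proposition 3 can be checked together on a toy example. Everything below is a sketch under assumptions: `V`, `f`, and the fully invested benchmark `h_B` are made-up values chosen so that $f_C>0$, and `characteristic_portfolio` and `variance` are illustrative helpers.

```python
import numpy as np

V = np.array([[0.04, 0.01, 0.00],
              [0.01, 0.09, 0.02],
              [0.00, 0.02, 0.16]])      # made-up covariance matrix
f = np.array([0.05, 0.07, 0.10])        # made-up expected excess returns
e = np.ones(3)
h_B = np.array([0.5, 0.3, 0.2])         # made-up benchmark, e_B = 1

def characteristic_portfolio(V, attribute):
    x = np.linalg.solve(V, attribute)
    return x / (attribute @ x)

def variance(h):
    return h @ V @ h

h_C = characteristic_portfolio(V, e)
h_q = characteristic_portfolio(V, f)
e_q = h_q @ e
h_Q = h_q / e_q                         # characteristic portfolio of e_q * f

f_B, f_C, f_Q = h_B @ f, h_C @ f, h_Q @ f
beta_C = (h_B @ V @ h_C) / variance(h_B)   # beta of C to the benchmark
beta_Q = (h_B @ V @ h_Q) / variance(h_B)   # beta of Q to the benchmark

print("e_q:", e_q, "e_Q:", h_Q @ e)
print(f_C / variance(h_C), f_Q / variance(h_Q))
print(beta_Q, f_B * variance(h_Q) / (f_Q * variance(h_B)), beta_C * f_B / f_C)
```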

__Portfolio C__ represents the _minimum risk fully invested portfolio_. Does this mean that if one asset has a clearly lower risk than all other assets (and NOTHING matters about expected excess return, beta, or any other attribute), Portfolio C would just have a 1 for that asset and a 0 for all other assets? Is it possible for $Var(aX+bY)<\min\{Var(X),Var(Y)\}$ where $a+b=1$?
$$Var(aX+bY)=a^2Var(X)+b^2Var(Y)+2ab\,Cov(X,Y)$$
In this scenario, if $Cov(X,Y)<0$ (with positive weights $a$ and $b$) the cross term is negative, so we can in fact have less risk. This means that you won't just select the asset with the least variance: combining assets can give less variance than the minimum-variance asset alone, because the negative covariance offsets the positive variance.
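
A two-asset sketch of this effect, scanning weights $a$ from 0 to 1 with $b=1-a$. The variances and covariances are made-up numbers; the zero-covariance case is included only to show that, in this example, mixing also helps without negative covariance.

```python
import numpy as np

def two_asset_variance(a, var_x, var_y, cov_xy):
    """Var(aX + bY) with b = 1 - a."""
    b = 1.0 - a
    return a**2 * var_x + b**2 * var_y + 2 * a * b * cov_xy

var_x, var_y = 0.04, 0.09               # made-up asset variances
weights = np.linspace(0.0, 1.0, 101)    # candidate weights on X

best = {}
for cov_xy in (-0.01, 0.0):             # made-up covariances
    variances = two_asset_variance(weights, var_x, var_y, cov_xy)
    i = variances.argmin()
    best[cov_xy] = (weights[i], variances[i])
    print(f"cov={cov_xy}: best weight on X={weights[i]:.2f}, "
          f"variance={variances[i]:.5f}")
```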

Once more, Portfolio C literally just finds the portfolio that has minimum variance (and the vector of ones just means that it is fully invested).

__Solving for the Variance of a Portfolio that lives on the Efficient Frontier:__

Minimizing $\frac{h^\mathsf{T}Vh}{2}$  
S.T. $e^\mathsf{T}h=1$, $f^\mathsf{T}h=f_P$

We get the following equation for any portfolio that lives on the efficient frontier:
$$h_P=\left( \frac{f_Q-f_P}{f_Q-f_C} \right)h_C + \left( \frac{f_P-f_C}{f_Q-f_C} \right)h_Q$$
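
The frontier formula can be cross-checked against a direct solve of the constrained minimization. This is a sketch under assumptions: `V`, `f`, and the target `f_P` are made up, and the direct solve uses the standard KKT linear system for an equality-constrained quadratic problem.

```python
import numpy as np

V = np.array([[0.04, 0.01, 0.00],
              [0.01, 0.09, 0.02],
              [0.00, 0.02, 0.16]])      # made-up covariance matrix
f = np.array([0.05, 0.07, 0.10])        # made-up expected excess returns
e = np.ones(3)
f_P = 0.08                              # made-up target excess return

def characteristic_portfolio(V, attribute):
    x = np.linalg.solve(V, attribute)
    return x / (attribute @ x)

h_C = characteristic_portfolio(V, e)
h_q = characteristic_portfolio(V, f)
h_Q = h_q / (h_q @ e)
f_C, f_Q = h_C @ f, h_Q @ f

# Frontier portfolio as a combination of C and Q.
h_P = ((f_Q - f_P) / (f_Q - f_C)) * h_C + ((f_P - f_C) / (f_Q - f_C)) * h_Q

# Direct solve: stationarity V h + l1 e + l2 f = 0 plus the two constraints.
n = len(f)
K = np.zeros((n + 2, n + 2))
K[:n, :n] = V
K[:n, n], K[n, :n] = e, e
K[:n, n + 1], K[n + 1, :n] = f, f
rhs = np.zeros(n + 2)
rhs[n], rhs[n + 1] = 1.0, f_P
h_kkt = np.linalg.solve(K, rhs)[:n]

print(h_P, h_kkt)
print(h_P @ e, h_P @ f)
```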


