# Overview of Advanced Methods of Reinforcement Learning in Finance

#### Notes by Carlos Santillán

#### May 2020

## Prerequisites

- Python & Jupyter Notebooks (libraries such as Numpy, Pandas and TensorFlow)

- Linear Algebra

- Probability

- Chapters 2 and 3 of Goodfellow et al., "Deep Learning"

- Basic Calculus

- Conceptual understanding of core ML Methods

## How to approach as a Physicist

- When you come across a new ML model, start with the abstract.

- If interested, skim through the main equations and understand what they mean

- Assume equations are right

- Play with the model hands-on

- Check data for sanity checks

- Notice behaviors 


**All financial concepts will be covered in the course**

## Main References

- C. Bishop, "Pattern Recognition and Machine Learning" (2006)

- K.P. Murphy, "Machine Learning: A Probabilistic Perspective" (2012)

- I. Goodfellow, Y. Bengio and A. Courville, "Deep Learning" (2016)

- A. Geron, "Hands-On Machine Learning with Scikit-Learn and TensorFlow" (2017)

**Additional References**

- S. Marsland, "Machine Learning: An Algorithmic Perspective" (2009)

- N. Gershenfeld, "The Nature of Mathematical Modeling" (1999)

- D. Barber, "Bayesian Reasoning and Machine Learning" (2012)

**Rossen Roussev - Quantitative Analyst at J.P. Morgan (Physicist)**

Advice:

- He took a course on financial math in school, there he heard notions of Black-Scholes

- Most learning was done on the job

- C++ and systems development

- Ask questions

- Quant industry is reinventing itself because of ML

- ML became powerful when its time came

- Finance has always been a data-intensive industry

- Financial data is not as signal-rich as other types of data

- ML is a tool, not so different from traditional math

- ML is not supposed to do our thinking for us

**Advice for students:**

- Be curious (especially after school)

- In terms of opportunity, the times we live in are great

- Powerful computational libraries

- High-quality online lessons

- With so many available resources it can be hard to stay on topic: the signal-to-noise ratio is low

- Stay focused and at the same time, Be Curious!


# Week 1

## Lesson 1

### 1.1. Reinforcement Learning and Ptolemy's Epicycles

Precursor of the Kalman Filter, which eventually gave ideas that ended up in the General Relativity Theory of Einstein.

### 1.2. PDEs in Physics & Finance

Where do we stand today after the Black-Scholes-Merton model of 1973?

Physics and RL have new findings.

- Mostly about solving diffusion equations with various boundary conditions

- Econophysics

- Works by Paul Wilmott

- In finance, time is measured in discrete ways

- But in physics, continuous time is easier to model

For many options, the Black-Scholes model has no closed-form solution. Many problems require "time discretization".



### 1.3. Competitive Market Equilibrium Models in Finance

"Everything should be made as simple as possible, but not simpler" - Einstein

PDEs help in almost all physical problems.

Several Nobel Prize-winning works in Economics deal with market equilibrium:

1. Modigliani-Miller

2. Capital Asset Pricing Model by William Sharpe (1964)

3. Black-Scholes option pricing theory (1973) (no-arbitrage as a weaker form of market equilibrium)

This is consistent with Soros' idea that these theories "model themselves on Newtonian Physics", although in reality they are modeled on the thermodynamic equilibrium of statistical mechanics, as proposed by Boltzmann.



**Does the Black-Scholes Model Pass Einstein's Test?**

There are 2 key elements:

- Option pricing by replication (dynamic hedging)

- Taken to continuous-time limit as $\Delta t \rightarrow 0$

Together, these 2 steps produce the celebrated Black-Scholes Equation:

$$\frac{\partial C_{t}}{\partial t} + rS_{t} \frac{\partial C_{t}}{\partial S_{t}} + \frac{1}{2} \sigma^{2} S^{2}_{t} \frac{\partial^{2} C_{t}}{\partial S^{2}_{t}} - rC_{t} = 0 \; \; \; \; \; \; \; \; \; \; \; \; Eq.(1)$$

**Black-Scholes Model: the Main Take-aways:**

- **Data requirements:** 2 numbers, the current stock price $S_{t}$ and the stock volatility $\sigma$ (plus parameters for an option)

- The option price is *unique* and given by a solution of the **BS** equation

- The optimal option hedge (the amount of stock in a replicating portfolio) is obtained *after* the option price is computed

- Options are **redundant** (as in perfectly replicable in terms of stocks and cash) and have **instantaneously 0 risk**!

- (What does it even mean? Time in Finance is fundamentally discrete...)

- "When people are seeking profits, equilibrium will prevail" (Fisher Black)
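As a sanity check on the Black-Scholes equation above, the sketch below plugs the standard closed-form price of a European call into the PDE and evaluates the left-hand side by finite differences. The closed-form formula is not derived in these notes, and the strike `K`, rate `r`, volatility `sigma` and step sizes `h`, `dt` are illustrative assumptions.

```python
import math

def norm_cdf(x):
    """Standard normal CDF via the error function."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))

def bs_call(S, tau, K=100.0, r=0.03, sigma=0.2):
    """Closed-form Black-Scholes call price; tau = T - t is the time to maturity."""
    d1 = (math.log(S / K) + (r + 0.5 * sigma ** 2) * tau) / (sigma * math.sqrt(tau))
    d2 = d1 - sigma * math.sqrt(tau)
    return S * norm_cdf(d1) - K * math.exp(-r * tau) * norm_cdf(d2)

def bs_pde_residual(S, tau, K=100.0, r=0.03, sigma=0.2, h=1e-2, dt=1e-4):
    """Left-hand side of the BS equation, estimated with central differences."""
    C = bs_call(S, tau, K, r, sigma)
    # dC/dt = -dC/dtau because tau = T - t
    dC_dt = -(bs_call(S, tau + dt, K, r, sigma) - bs_call(S, tau - dt, K, r, sigma)) / (2 * dt)
    dC_dS = (bs_call(S + h, tau, K, r, sigma) - bs_call(S - h, tau, K, r, sigma)) / (2 * h)
    d2C_dS2 = (bs_call(S + h, tau, K, r, sigma) - 2 * C + bs_call(S - h, tau, K, r, sigma)) / h ** 2
    return dC_dt + r * S * dC_dS + 0.5 * sigma ** 2 * S ** 2 * d2C_dS2 - r * C

if __name__ == "__main__":
    for S in (80.0, 100.0, 120.0):
        # The residual should be close to zero up to finite-difference error
        print(S, bs_call(S, 0.5), bs_pde_residual(S, 0.5))
```

Note that only the current price and the volatility enter on the market side, which is the "2 numbers" data requirement listed above.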

### 1.4. I certainly hope you are wrong, Herr Professor!

There might be some logical loopholes in these ideas:

- Arbitrage pricing gives you an equilibrium price, so that you should not trade below it and you should not trade above it


- It only forgot to explain why you should trade at the equilibrium price itself!


- What is the rationale of having entirely redundant financial instruments?


- Options are **not** redundant because they carry **risk**!

Essentially speaking, the Black-Scholes Model is a model of a fake market!

**BS** is only an approximation to reality. There's a story about a German bank that hired a professor of a leading university to help quantify risk. After some months of analysis, the professor concluded that the bank had "absolutely no risk".

The bank's head of trading responded: "Well I certainly hope you are wrong, Herr Professor. If you are correct then we can't be making any money!"

Risk is a central factor in finance, and its disappearance is the main problem of the **BS** model.

**What is the Main Problem with the Black-Scholes Model?**

- Does not match option price data

- Does not match stock prices (stock prices are not lognormal)

- Transaction costs neglected

- What about discrete hedging?

- Real markets are incomplete

- Risk has disappeared

- What would be a meaningful change to the BS model so that the new model:

    - Will be more useful / meaningful
    
    - Will have the same or similar level of tractability as the BS model.

Before applying models we also have to calibrate them, which can be computationally expensive.

The results of the "match the market" mantra are:

- **Parametric mantras**: Stochastic volatility models, jump-diffusion, Levy models, etc.

- **Non-parametric mantras**: Local volatility models, MaxEnt, non-parametric Bayes.

Still, we're missing risk, and that is the most important thing in the game.

### 1.5. Risk as a Science of Fluctuation

Taking Mark Twain's approach to Quantitative Finance:

Which would be the wrong words that should be crossed out when trying to improve on the Black-Scholes model?

- No arbitrage pricing

- Risk-less hedges

- "Risk-neutral" option valuation

- The continuous-time limit

- PDEs

Let us remember that risk is a story of fluctuations, and one approach to it is variance, for example:

- In the Markowitz portfolio theory, risk is $R_{t} = \lambda \, \mathrm{Var}(\Pi_{T})$, where $\lambda$ is the risk-aversion parameter.


- In statistical mechanics, there are models for both equilibrium and non-equilibrium fluctuations


- The BS model **neglects fluctuations.** This is equivalent to a **thermodynamic limit** in equilibrium statistical mechanics, where all fluctuations die off.

### 1.6. Markets & the Heat Death of the Universe

Market equilibrium models are models where entropy is maximized and does not fluctuate: they are models of a "heat death" of the Universe as an equilibrium system in a thermodynamic limit.

Here comes a **50Bn question (by the size of the option market):** Is it the right limit to use as a reference point (or a "first approximation") to describe a risky business?

Financial markets are in fact very far from a "heat-death" state.

So, to bring more realism to option pricing we need the following:

By prematurely applying a continuous-time limit from the start, the following happens:

1. As pricing by replication becomes exact in this limit, all risk is instantaneously eliminated


2. To re-install option risk as a first-class citizen of the model, we need to go back to a discrete-time setting!


3. This view will show that the BS equation is just a PDE for a mean of the option value in the **mathematical** limit $\Delta t \rightarrow 0$


4. This limit makes perfect sense mathematically but not financially, as it loses risk: the option magically becomes risk-less!


5. But shouldn't risk in the option be the original purpose, a part of the option valuation?
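To see the points above numerically, here is a minimal Monte Carlo sketch of a short call that is delta-hedged only at discrete times. The stock is simulated as a lognormal process and the hedge ratio is the standard Black-Scholes delta; both choices, as well as all parameter values, are assumptions made for illustration and are not taken from the lectures. The spread of the terminal hedging error is the risk that is left over in discrete time, and it should shrink as the number of re-hedges grows.

```python
import math
import random

def norm_cdf(x):
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))

def bs_price_and_delta(S, tau, K, r, sigma):
    """Black-Scholes call price and delta for time to maturity tau > 0."""
    d1 = (math.log(S / K) + (r + 0.5 * sigma ** 2) * tau) / (sigma * math.sqrt(tau))
    d2 = d1 - sigma * math.sqrt(tau)
    price = S * norm_cdf(d1) - K * math.exp(-r * tau) * norm_cdf(d2)
    return price, norm_cdf(d1)

def hedging_error_std(n_steps, n_paths=1000, S0=100.0, K=100.0, r=0.0,
                      sigma=0.2, T=0.25, seed=0):
    """Std of the terminal P&L of a short call delta-hedged n_steps times."""
    rng = random.Random(seed)
    dt = T / n_steps
    errors = []
    for _ in range(n_paths):
        S = S0
        price, delta = bs_price_and_delta(S, T, K, r, sigma)
        cash = price - delta * S            # sell the option, buy delta shares
        for i in range(1, n_steps + 1):
            z = rng.gauss(0.0, 1.0)
            S *= math.exp((r - 0.5 * sigma ** 2) * dt + sigma * math.sqrt(dt) * z)
            cash *= math.exp(r * dt)        # cash account accrues interest
            if i < n_steps:                 # re-hedge at every step before maturity
                _, new_delta = bs_price_and_delta(S, T - i * dt, K, r, sigma)
                cash -= (new_delta - delta) * S
                delta = new_delta
        # Hedge portfolio value minus the option payoff owed at maturity
        errors.append(delta * S + cash - max(S - K, 0.0))
    mean = sum(errors) / n_paths
    return math.sqrt(sum((e - mean) ** 2 for e in errors) / (n_paths - 1))

if __name__ == "__main__":
    for n in (4, 16, 64):
        print(n, hedging_error_std(n))
```

With a finite number of re-hedges the error never vanishes, which is the sense in which a discrete-time setting keeps option risk in the model.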

## Lesson 2

### 2.1. Option Trading & RL

Bates concluded that the BS model doesn't capture the real nature of options in the market.

- Richard Feynman made a great point on Quantum Field Theory: "Renormalization is like brushing garbage under a carpet"

- This means that if you follow rules of an analytical construction and solve for a model, you'll get an infinite value that comes from divergence in integrals.

- Risk in options makes them non-redundant:

    - Risk due to demand pressure in the option market
    
    - Risk due to discrete-time re-hedging: Q-Learning for the Black-Scholes problem
    
    - Risk due to other factors (which is what this lesson is about)
    

**Reinforcement Learning ("Action tasks"):** sequential (multi-step) decision-making by choosing among multiple possible actions. As the state of the environment may change with time, RL involves planning and forecasting the future.

A **feedback loop** is unique to RL; it is not encountered in supervised learning (SL) or unsupervised learning (UL).

![2020-05-29_21-57-09.png](attachment:2020-05-29_21-57-09.png)

### 2.2. Liquidity

There is no single commonly accepted definition of market liquidity, but we can look at it as the ease of trading a security.

In general:

- Liquidity is the ease of trading a security


- One source of illiquidity is *exogenous transaction costs* (brokerage fees, processing costs, transaction taxes)


- Other sources are *demand pressure* and *inventory risk*

    - An example would be owning a car or a house, where turning the asset into cash depends only on demand, and you may need to sell at a lower price.
    
    
- Cost of trading due to *private information*

    - A trading desk may know that a Hedge Fund needs to liquidate a position and may sell earlier at a higher price
    
    
- Search friction in finding counter-party (for OTC trades)



All of these make security valuation harder.

**Standard Asset Pricing**

- Based on the assumption of a perfectly liquid (frictionless) market.

- Frictionless market is combined with one of the 3 concepts:
    1. Competitive Market Equilibrium
    2. Agent Optimality
    3. No arbitrage
    
- **No arbitrage:** one cannot make money in one state of nature without paying money in at least one other state of nature

- In a frictionless market, no arbitrage is equivalent to the existence of a stochastic discount factor $m_{t}$ so that the price process $p_{t}$ of a security with a dividend process $d_{t}$ would be:

$$ p_{t} = E_{t} \left[(p_{t+1} + d_{t+1}) \frac{m_{t+1}}{m_{t}} \right] \iff p_{t} = E_{t} \left[\sum^{\infty}_{\tau = t+1} d_{\tau} \frac{m_{\tau}}{m_{t}} \right]$$

- This is the main equation of standard asset pricing theory. It can also be obtained from agent optimality.


- If investor preferences for a consumption process $c_{t}$ are represented by an additively separable utility function $E_{t}[u_{t}(c_{t})]$, then $m_{t} = \frac{du_{t}(c_{t})}{dc_{t}}$
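The equivalence of the one-step and the summed form of the pricing equation can be checked in a deterministic toy case, where the expectation is trivial. The sketch assumes a constant one-period discount factor $m_{t+1}/m_{t} = \beta$ (so that $m_{\tau}/m_{t} = \beta^{\tau - t}$), a finite dividend stream and a zero price after the last dividend; `beta` and the dividend values are hypothetical.

```python
def price_by_sum(dividends, beta):
    """p_0 = sum over tau >= 1 of d_tau * beta**tau (summed form)."""
    return sum(d * beta ** tau for tau, d in enumerate(dividends, start=1))

def price_by_recursion(dividends, beta):
    """Backward recursion p_t = beta * (p_{t+1} + d_{t+1}) with zero terminal price."""
    p = 0.0
    for d in reversed(dividends):
        p = beta * (p + d)
    return p

if __name__ == "__main__":
    dividends = [1.0] * 50      # hypothetical constant dividend stream
    beta = 0.95                 # hypothetical constant discount factor
    print(price_by_sum(dividends, beta), price_by_recursion(dividends, beta))
```

Both functions should return the same number up to rounding, since unrolling the recursion reproduces the sum term by term.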

**Frictionless Markets Do Not Exist!**

In real markets, frictions are always present, and they impact prices.

- If they did not, liquidity providers would not be present in the markets


- For more details, see "Liquidity and Asset Prices" by Amihud (2005)


- There are examples in the market where securities with the same cashflows have different prices: the difference is due to liquidity.

    - On-the-run (newly issued) Treasury bills have lower yields than off-the-run ones
    
    - The same for credit indexes such as CDX
    

- This means that in a real market, a single stochastic discount factor $m_{t}$ that would apply to **all** securities does not exist:

$$ p_{t} = E_{t} \left[\sum^{\infty}_{\tau = t+1} d_{\tau} \frac{m_{\tau}}{m_{t}} \right]$$

- In a frictionless market, the price depends **only** on the pricing kernel $\frac{m_{\tau}}{m_{t}}$ and cashflows $d_{t}$


- Liquidity effects sometimes are incorporated as modifications of the pricing kernel 

- In transaction cost based models, there is no pricing kernel



- Compared to a liquid market, *pricing now depends additionally on both the investor preferences* **(utility function)** and the **liquidity** of the security


- Also need to relax the assumption that all investors have the same information


### 2.3. Modeling Market Frictions

There are different approaches to modeling market frictions:

- Option pricing models incorporating **transaction costs**

    - Fixed or proportional transaction costs
    - Can be incorporated as modifications to the BS formula


- **Supply curve** approach

    - Traders are not price takers: the price they pay depends on the quantity they trade
    
    - The execution price is driven by a current balance between buyers and sellers
    
- **Feedback effects** of the impact of delta-hedging on the dynamics of the underlying

    - Permanent impact
    - Temporary impact
    
- Modeling liquidity effects as **convex execution costs** (instead of transaction costs linear in the volume)


**Transaction Costs Models**

- These are the costs of buying and selling the underlying, related to the **bid-ask spread**.

- Importance of **transaction costs** for option pricing:

    - In **liquid** markets (e.g. government bonds) costs are low, and frequent re-hedging is possible.
    
    - In **illiquid** markets, costs are high and hedging is done less frequently. This leads to a substantial impact on security prices.

    

**Leland Model**

- Discrete time

- Re-hedging is done every time step, whether it is optimal or not

- Proportional costs model: the costs are $\kappa |\Delta x_{t}| S_{t}$ (proportional to the traded volume $|\Delta x_{t}|$), where $\Delta x_{t}$ is the change of the stock holding in a delta-hedge.


- Hedged portfolio has a risk-free return, as in the original BS model.


- For vanilla calls and puts, pricing uses the BS formula with an adjusted volatility. For long calls and puts, the adjusted volatility is given below, where $\delta t$ is the re-hedging interval (for short options, flip the sign to $1 + \dots$):

$$ \hat{\sigma} = \sigma \left[1 - \sqrt{\frac{8}{\pi \, \delta t}} \; \frac{\kappa}{\sigma} \right]^{\frac{1}{2}}$$


- This induces bid-ask spreads for options

- More details and models can be found in P. Wilmott "Derivatives"
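The Leland adjustment is easy to evaluate numerically. The sketch below assumes the form of the formula in which the factor $\kappa / \sigma$ multiplies $\sqrt{8 / (\pi \, \delta t)}$ from outside the inner square root, with a minus sign for long options and a plus sign for short ones; the function name and the sample parameter values are hypothetical.

```python
import math

def leland_adjusted_vol(sigma, kappa, dt, long_position=True):
    """Adjusted volatility sigma * sqrt(1 -/+ sqrt(8 / (pi * dt)) * kappa / sigma)."""
    cost_term = math.sqrt(8.0 / (math.pi * dt)) * kappa / sigma
    inside = 1.0 - cost_term if long_position else 1.0 + cost_term
    if inside <= 0.0:
        # Costs are too large relative to sigma * sqrt(dt): no real adjusted volatility
        raise ValueError("adjusted variance is not positive")
    return sigma * math.sqrt(inside)

if __name__ == "__main__":
    sigma, kappa, dt = 0.2, 0.001, 1.0 / 252.0   # daily re-hedging, 10 bp cost
    print("long :", leland_adjusted_vol(sigma, kappa, dt, long_position=True))
    print("short:", leland_adjusted_vol(sigma, kappa, dt, long_position=False))
```

Pricing the long option with the lower volatility and the short option with the higher one is what produces the option bid-ask spread mentioned above.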

**Demand-based Models**

- Try to take a more fundamental view of market liquidity and price impact.

    - The execution price is driven by a current balance between buyers and sellers.
    
    - When there are more buyers than sellers, this creates an imbalance that drives prices up, which brings in more sellers until prices go down. And vice versa for the opposite scenario.
    
- What this approach tries to achieve:

    - A richer structural model of transaction costs, rather than a fixed proportional cost model
    
    - Can incorporate information from dynamic signals when modeling frictions
    

- Two types of demand-based models:

    1. Demand for stocks (hedged by stocks)
    
    2. Demand for options
    

**Convex Transaction Costs**

- Proportional cost models do not take into account the depth of the limit order book.

- The difference of the execution price from the mid-price is determined by a function obtained by an inversion of the demand-supply difference.


- A simple parametrization of such a function as a quadratic function produces an effective transaction cost model with convex costs, which is different from the proportional cost model


- Option pricing amounts to solving an HJB equation, see paper for details.
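A small numerical comparison shows why convexity matters. The quadratic form and the coefficients `kappa` and `eta` below are hypothetical choices for illustration only; the point is that splitting a large trade into slices leaves a proportional cost unchanged but reduces a convex one.

```python
def proportional_cost(trade, price, kappa=0.001):
    """Cost linear in the traded volume: kappa * |trade| * price."""
    return kappa * abs(trade) * price

def quadratic_cost(trade, price, eta=1e-6):
    """Convex cost: eta * trade**2 * price (hypothetical parametrization)."""
    return eta * trade ** 2 * price

def total_cost(cost_fn, total_trade, n_slices, price=100.0):
    """Total cost of executing total_trade in n_slices equal pieces."""
    slice_size = total_trade / n_slices
    return sum(cost_fn(slice_size, price) for _ in range(n_slices))

if __name__ == "__main__":
    for n in (1, 10, 100):
        print(n,
              total_cost(proportional_cost, 10_000, n),
              total_cost(quadratic_cost, 10_000, n))
```

Under the quadratic specification the total cost scales like one over the number of slices, which is why trade scheduling becomes a genuine optimization problem once costs are convex.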

### 2.4. Modeling Feedback Frictions

Large trades on stocks move the market, and such effects must be incorporated in our models.

An option derives its value from a stock, and it is also hedged by the stock.

**Stock Pinning (Avellaneda-Lipkin Model)**

- The tendency of the underlying asset price to approach an option strike price at expiration

- The model is based on the behavior of option market-makers that impact the stock price by delta-hedging their positions

- Uses a linear price-impact model or a power-law model

- Impact in this model is measured by:

$$\frac{\Delta S_{t}}{S_{t}} \sim E \left|\Delta x_{t} \right|^{p}$$

where $E$ is an elasticity parameter

In this model, pinning exists if $p > 0.5$

- In terms of physics, this means that there is a phase transition in the system



# Week 2

## Lesson 1

### 2.1.1 From Portfolio Optimization to Market Model

A number of problems in stock trading (optimal execution, index tracking, portfolio optimization) amount to RL.

- When applied to a particular trader, the model needs proprietary data


- It can also be used for the market portfolio


- When rewards are unobservable, one can use IRL
    - Problems in which we only observe actions, but not rewards.
    - We want to find the optimal functions (policy and reward functions)
    
    
- There are also cases in which you can't observe either
    - Intraday trading: you can see market prices, but you cannot see the actions of your competitors
        - You can then use hidden-variable algorithms such as EM
        

### 2.1.2. Invisible Hand

Two types of agent-based approaches to modeling market dynamics:

1. A representative rational or bounded-rational investor (economics) - a 'mean' of all investors.


2. Multi-agent models (physics, computer science)


- To identify an agent whose optimal portfolio is a market portfolio, as in the Black-Litterman (BL) model, we have to use an agent who is a 'sum' of all investors


- Such an agent cannot be a rational agent, but should have a bounded rationality


- Embodies an 'Invisible Hand'-type market mechanism (Adam Smith, etc.)




**True or False**

1. Bounded rationality is a principle saying that rationality in scientific deduction methods has limits due to noise or quantum effects:  **FALSE**


2. The notion of 'Invisible Hand' refers to the observation first made by Adam Smith that agents in multi-agent models are hard to identify, and therefore they should be modeled using hidden variables: **FALSE**


3. Multi-agent models produce identical results to single agent models if all agents are fully rational:  **FALSE**


4. IRL can be applied to both, an individual investor portfolio and a market portfolio:   **TRUE**

### 2.1.3. GBM & its Problems

Also known as the *log-normal asset return model*

$$dX_{t} = (r_{f} + wz_{t})X_{t}dt + \sigma X_{t} dW_{t} \; \; \; \; \; \; \; \; \; \; \; \; Eq.(1)$$

Where:

- $X_{t}$ is an asset price at time $t$

- $r_{f}$ is a risk-free rate

- $z_{t}$ are predictors

- $w$ are weights and

- $W_{t}$ is a Standard Brownian Motion

The GBM model improved over the ABM (Arithmetic Brownian Motion) of Bachelier

This model can be viewed as a model with *linear drift* $f(x) = (r_{f} + wz_{t})x$.


The main difference is that ABM has a constant (state-independent) drift and volatility, whereas in GBM both are proportional to $X_{t}$.


Another important improvement regarding ABM is that with GBM the prices are always positive.

By replacing infinitesimal increments with finite increments we can obtain a **Discrete GBM** as follows:

We have:

$$dX_{t} = (r_{f} + wz_{t})X_{t}dt + \sigma X_{t} dW_{t}  \; \; \; \; \; \; \; \; \; \; \; \; Eq. (2)$$

The GBM in Eq. (2) is a continuous-time limit $\Delta t \rightarrow dt$ of a discrete-time dynamics:

$$\Delta X_{t} = r_{t}X_{t}\Delta t \; \; \; \; \; \; \; \; \; \; \; \; Eq. (3)$$

Where:

- $r_{t} = r_{f} + wz_{t} + \frac{\sigma}{\sqrt{\Delta t}} \xi_{t}$

- $\xi_{t} \sim N(\cdot|0, 1)$

Equivalently we can write:

$$X_{t + \Delta t} = (1 + r_{t}\Delta t) X_{t} \; \; \; \; \; \; \; \; \; \; \; \; Eq.(4)$$
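The discrete-time recursion of Eq.(4) can be simulated directly. In this minimal sketch the predictor $z_{t}$ is held constant for simplicity, and all parameter values are illustrative assumptions.

```python
import math
import random

def simulate_discrete_gbm(x0=1.0, r_f=0.02, w=0.0, z=0.0, sigma=0.2,
                          dt=1.0 / 252.0, n_steps=252, rng=None):
    """Iterate X_{t+dt} = (1 + r_t * dt) * X_t with
    r_t = r_f + w * z + sigma / sqrt(dt) * xi_t and xi_t ~ N(0, 1)."""
    if rng is None:
        rng = random.Random(0)
    path = [x0]
    for _ in range(n_steps):
        xi = rng.gauss(0.0, 1.0)
        r_t = r_f + w * z + sigma / math.sqrt(dt) * xi
        path.append((1.0 + r_t * dt) * path[-1])
    return path

if __name__ == "__main__":
    path = simulate_discrete_gbm()
    print("terminal value of one simulated path:", path[-1])
```

Because the noise $\xi_{t}$ has zero mean, averaging the terminal value over many paths should approach $x_{0} (1 + r_{f} \Delta t)^{n}$ when $w = 0$, the discrete counterpart of exponential growth of the mean.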

Many models use the GBM model including the CAPM and the Black-Scholes option pricing model.

But the model does not incorporate the following:

- Defaults and market crashes


- Rare events of large market moves


- Market frictions (feedback effects or transactions costs)


- Exchange of capital with an outside world (isolated system)


- Volatility patterns


**Traditional Approaches to Improving the GBM Model**

- Extend a set of predictors $z_{t}$


- Include non-linear dependencies on predictors $z_{t}$ (NN, or SVM)


- Include more complex state-dependent or/and stochastic noise coefficients


- All these approaches preserve linearity of dynamics in the state variable $X_{t}$


- (Beyond ML & RL?) We will see that including instead non-linearities in $X_{t}$ may be more important!

**True or False**

1. The GBM model overestimates probabilities of large market moves or defaults:    **FALSE**


2. The GBM is applied for a Brownian Motion in non-trivial geometries e.g. for a diffusion on a finite interval:   **FALSE**


3. The GBM model is incompatible with defaults, because the boundary X = 0 in the GBM model is unattainable:   **TRUE**


4. 'Non-linear' extensions of the GBM may involve non-linearities in space or non-linearities in predictors:   **TRUE**

### 2.1.4. The GBM Model: An Unbounded Growth Without Defaults

Corporate defaults are similar to an absorbing state: once a system gets there, it cannot escape.

- The zero level $X = 0$ could naturally serve as a default/absorbing boundary


- The problem is that in the GBM model, the zero level $X = 0$ is unattainable: defaults cannot happen in the GBM model


- Defaults can be described as level crossing at some $\hat{X} > 0$ (e.g. the Merton model) but this approach has some issues too


- As it is hard to have defaults in the GBM model, we need other state variables such as credit spreads


- But credit spreads and stock prices are not independent - leading to highly complex joint dynamics of stock prices and spreads.

**Unbounded Growth in the GBM Model**

$$\Delta X_{t} = r_{t}X_{t}\Delta t \;, \;r_{t} = r_{f} + wz_{t} + \frac{\sigma}{\sqrt{\Delta t}} \xi_{t}  \; \; \; \; \; \; \; \; \; \; \; \; Eq. (5)$$

This equation has a linear drift $f(x) = r_{t}X_{t}$

Taking the averages on both sides (and setting the predictor term $wz_{t}$ aside), we obtain an equation for the mean $\bar{X}_{t}$:

$$d\bar{X}_{t} = r_{f}\bar{X}_{t}dt \; \iff \; \bar{X}_{t} = \bar{X}_{0} e^{r_{f}t} \; \; \; \; \; \;\; \; \; \; \; \; Eq.(6)$$

We got an **exponential growth** of the mean asset price!

- This is a consequence of the linearity of the drift $f(x) = r_{t}X_{t}$ and resulting scale invariance of Eq.(2) with respect to the scale transformation $X_{t} \rightarrow \alpha X_{t}$

In the GBM world, you can get **infinitely rich** (due to linear drift and scale invariance).

(Eq.(7) = Eq.(6))

But the market is typically considered a closed system without any exchange of capital with an outside world.

How can you get infinitely rich in such market!?

A simple hypothesis is that there are some saturation effects for large values of $X_{t}$, so that you will not get infinitely rich in the end.

**True or False**

1. Effects of saturations in the market, that are produced by interactions and a finite depth of the market, can change returns from unbounded to bounded:   **TRUE**


2. Credit spreads should be independent from stock prices, because doing otherwise produces overly complex models:   **FALSE**


3. The origin of unbounded returns in the GBM model is linearity of the drift and resulting scale invariance of the GBM model:   **TRUE**


4. The origin of unbounded returns in the GBM model is a desire to make the model more attractive to investors:   **FALSE**

### 2.1.5. Dynamics with Saturation: The Verhulst Model

The Verhulst Model is popular in physics, biology and ecology as a model for the dynamics of the size of a population $x_{t}$ that competes for a limited resource such as food:

$$dx_{t} = (\theta x_{t} - \kappa x_{t}^{2}) dt = \kappa x_{t} \left(\frac{\theta}{\kappa} - x_{t} \right) dt \; \; \; \; \; \; \; \; \; \;\; \; Eq.(8)$$ 

This is an ODE, and we can use it to describe stock prices instead.

If $x_{t}$ is used to model a stock price, this means that our model has state-dependent, diminishing returns as an effect of competition for a limited resource (a market value):

$$\bar{r}_{t} = \bar{r}_{t}(x_{t}) = \kappa \left(\frac{\theta}{\kappa} - x_{t} \right) = \theta - \kappa x_{t} \; \; \; \; \; \; \; \; \; \; \; \; \; Eq.(9)$$

This spells a boundedness of the total wealth, as we will see shortly

So consider the 'normal' regime with $\kappa > 0 $. For "small fields" $x_{t} \ll \frac{\theta}{\kappa}$, we have an exponential growth:

$$dx_{t} \simeq \theta x_{t} dt \;  \Rightarrow \; x_{t} \simeq x_{0} e^{\theta t} \; \; \; \; \; \; \; \; \; \; \; \; Eq.(10)$$ 

But this "inflationary" behavior is only approximate: it is valid only for short times (or small fields $x_{t} \ll \frac{\theta}{\kappa}$). In the long term, the system reaches an equilibrium at $\bar{x} = \frac{\theta}{\kappa}$
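Both regimes can be checked with a simple forward-Euler integration of the Verhulst ODE. The step size and the parameter values below are illustrative assumptions.

```python
import math

def verhulst_euler(x0, theta, kappa, t_end, dt=1e-3):
    """Integrate dx = (theta * x - kappa * x**2) dt with the forward Euler scheme."""
    n_steps = int(round(t_end / dt))
    x = x0
    for _ in range(n_steps):
        x += (theta * x - kappa * x * x) * dt
    return x

if __name__ == "__main__":
    theta, kappa, x0 = 1.0, 2.0, 1e-4
    # Small fields: close to the exponential x0 * exp(theta * t)
    print(verhulst_euler(x0, theta, kappa, 2.0), x0 * math.exp(theta * 2.0))
    # Long times: close to the equilibrium theta / kappa
    print(verhulst_euler(x0, theta, kappa, 30.0), theta / kappa)
```

Starting from a small $x_{0}$, the path tracks the exponential for a while and then levels off at $\frac{\theta}{\kappa}$.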

**The Opposite Limit $x_{t} \gg \frac{\theta}{\kappa}$**

1. Let's neglect $\frac{\theta}{\kappa}$ in the parentheses. We obtain:

$$dx_{t} \simeq -\kappa x_{t}^{2} dt \; \Rightarrow \; x_{t} \simeq C + \frac{1}{\kappa t} \; \; \; \; \; \; \; \; \; \; \; \; Eq.(13)$$ 

The drift is now negative, so the process decays. The solution approaches a constant $C$ (which should equal $\frac{\theta}{\kappa}$) in the long run, and the speed of convergence is controlled by $\kappa$.

2. To ensure $x_{t} \gg \frac{\theta}{\kappa}$, we could set $x_{t} = \frac{\theta}{\kappa} + y_{t}$ where $y_{t}$ is large.


Substituting into the Verhulst model, we get an equation for $y_{t}$:

$$dy_{t} = \kappa y_{t} \left( - \frac{\theta}{\kappa} - y_{t} \right) dt \; \; \; \; \; \; \; \; \; \; \; \; Eq.(14)$$

This is the same as the original Verhulst model but with a flipped sign of $\theta$, an interesting symmetry of the model!

**True or False**

1. In the limit $x_{t} \ll \frac{\theta}{\kappa}$, the Verhulst process grows exponentially:   **TRUE**


2. In the limit $x_{t} \ll \frac{\theta}{\kappa}$, the Verhulst process grows logarithmically:   **FALSE**


3. In the limit $t \rightarrow \infty$ the Verhulst process converges to a long-term mean $\frac{\theta}{\kappa}$:   **TRUE**


4. In  the limit $x_{t} \gg \frac{\theta}{\kappa}$ the Verhulst process grows:    **FALSE**

### 2.1.6. The Singularity is Near

The **Full Time-dependent Verhulst Model** has a solution of the form:

$$x_{t} = x_{0} \frac{e^{\theta t}}{1 + \frac{\kappa}{\theta} x_{0} \left(e^{\theta t} - 1 \right)} = \frac{\theta}{\kappa} \frac{1}{1 - \left(1 - \frac{\theta}{\kappa x_{0}} \right) e^{-\theta t}} \; \; \; \; \; \; \; \; \;  Eq.(15)$$ 

The only stable stationary solution for $\kappa, \theta > 0$ is $\bar{x} = \frac{\theta}{\kappa}$

We can get quite flexible solutions by varying $\kappa$

For $\kappa < 0$, the only stable solution is $\bar{x} = 0$
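The closed-form solution of Eq.(15) can be verified numerically: the two ways of writing it should agree, and the function should satisfy the Verhulst ODE. The sketch below does both with a central finite difference; parameter values are illustrative assumptions.

```python
import math

def verhulst_solution(t, x0, theta, kappa):
    """First form of Eq.(15)."""
    e = math.exp(theta * t)
    return x0 * e / (1.0 + (kappa / theta) * x0 * (e - 1.0))

def verhulst_solution_alt(t, x0, theta, kappa):
    """Second form of Eq.(15)."""
    return (theta / kappa) / (1.0 - (1.0 - theta / (kappa * x0)) * math.exp(-theta * t))

def ode_residual(t, x0, theta, kappa, h=1e-5):
    """dx/dt - (theta * x - kappa * x**2), with dx/dt from a central difference."""
    dx_dt = (verhulst_solution(t + h, x0, theta, kappa)
             - verhulst_solution(t - h, x0, theta, kappa)) / (2.0 * h)
    x = verhulst_solution(t, x0, theta, kappa)
    return dx_dt - (theta * x - kappa * x * x)

if __name__ == "__main__":
    x0, theta, kappa = 0.1, 1.0, 2.0
    for t in (0.0, 0.7, 5.0, 50.0):
        print(t, verhulst_solution(t, x0, theta, kappa),
              verhulst_solution_alt(t, x0, theta, kappa),
              ode_residual(t, x0, theta, kappa))
```

For $\kappa, \theta > 0$ the printed values should approach $\frac{\theta}{\kappa}$ at large $t$, in line with the stable stationary solution.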

**Time Asymmetry**

This solution is strongly asymmetric in time. For an arbitrary $x_{0} > 0$, the Verhulst process explodes to infinity at a finite time $t_{\infty}$:

$$t_{\infty} = \frac{1}{\theta} \log \left(1 - \frac{\theta}{\kappa x_{0}} \right) \; \; \; \; \; \; \; \; \; \; Eq.(16)$$

A positive singularity is in the future ($t_{\infty} > 0$) only for 'non-physical' choices $\theta > 0, \; \kappa x_{0} < 0$ or $\kappa x_{0} < \theta < 0$

The model should be 'regularized' to treat such 'non-physical' parameter values!

When $\kappa, \theta > 0$, the singularity is in the **past**: $t_{\infty} < 0 $ ("emergence" from a negative singularity)
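The location of the singularity follows directly from Eq.(16). The helper below is a small sketch (the function name is hypothetical) that returns $t_{\infty}$ when it is a real number and `None` when the argument of the logarithm is not positive, so that no real singular time exists.

```python
import math

def singularity_time(x0, theta, kappa):
    """t_inf = log(1 - theta / (kappa * x0)) / theta, or None if it is not real."""
    arg = 1.0 - theta / (kappa * x0)
    if arg <= 0.0:
        return None   # no real t_inf (for arg < 0 the singular time is complex)
    return math.log(arg) / theta

if __name__ == "__main__":
    print(singularity_time(x0=1.0, theta=1.0, kappa=2.0))    # kappa, theta > 0 and x0 > theta/kappa
    print(singularity_time(x0=1.0, theta=1.0, kappa=-2.0))   # theta > 0, kappa * x0 < 0
    print(singularity_time(x0=0.1, theta=1.0, kappa=2.0))    # theta / (kappa * x0) > 1
```

The first case gives a negative time (singularity in the past), the second a positive time (singularity in the future), and the third has no real singular time.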

**True or False**

1. The Verhulst model has a singularity either in the past, or in the future, depending on the parameters:   **TRUE**


2. It is only singularities for positive times that matter, others are just irrelevant mathematical details, especially for financial models:   **FALSE**


3. Singularities of the Verhulst model point to the need to regularize the model:  **TRUE**


4. If $\frac{\theta}{\kappa x_{0}} > 1$, the solution becomes singular for complex valued times:   **TRUE**

### 2.1.7. What are Defaults?

Technically, a default is a non-payment on an obligation.

Mathematically this can be modeled as the drop of the price of a stock to $0$ and staying there forever.

- When a firm defaults (more precisely, only when it goes bankrupt) its stock drops to zero, the company is then closed.


- So let's just describe bankruptcies / defaults as a drop of the price to zero in the GBM model!


- Oops, sorry, we can't - the boundary $X = 0$ is inaccessible in the GBM model!


- The model is either wrong or incomplete (essentially the same)


- **A smart way out**: let's model an unobservable firm value process, instead of the stock value process. The default boundary is at a non-zero level for the firm value process.

![2020-06-11_00-05-45.png](attachment:2020-06-11_00-05-45.png)
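The level-crossing view of default can be sketched with a small simulation. Here the firm value follows a lognormal process and a default is counted whenever the path touches a barrier at any monitoring date before the horizon; the continuous monitoring rule, the function name and all parameter values are assumptions made for illustration, not the exact specification from the lectures.

```python
import math
import random

def default_probability(barrier, v0=1.0, mu=0.03, sigma=0.3, horizon=1.0,
                        n_steps=50, n_paths=2000, seed=0):
    """Fraction of simulated firm-value paths whose minimum falls to the barrier."""
    rng = random.Random(seed)
    dt = horizon / n_steps
    defaults = 0
    for _ in range(n_paths):
        v = v0
        v_min = v0
        for _ in range(n_steps):
            z = rng.gauss(0.0, 1.0)
            v *= math.exp((mu - 0.5 * sigma ** 2) * dt + sigma * math.sqrt(dt) * z)
            v_min = min(v_min, v)
        if v_min <= barrier:      # level-crossing event
            defaults += 1
    return defaults / n_paths

if __name__ == "__main__":
    for barrier in (0.0, 0.5, 0.8):
        print(barrier, default_probability(barrier))
```

A barrier at zero never produces a default, because the simulated lognormal value stays strictly positive, while a barrier at a positive level does.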

**Problems with the Merton Default Model**

- The firm value process used by the model is unobservable


- The exact default position is unobservable too


- Noise in the default position can be brought in simply by observational noise in the simplest model formulation.


- Explicitly uncertain default barrier models can also be constructed


- As a result of noise in the barrier, the default event itself becomes uncertain - we can't say with certainty if a firm defaulted or not if we stay within the model


- Resembles Schrödinger's cat in Quantum Mechanics

**True or False**

1. When a firm goes bankrupt, its stock price drops to zero:   **TRUE**


2. A stock price cannot be negative because of limited liability of stockholders:   **TRUE**


3. If the default barrier position in the Merton model is not exactly known, at each moment in time we actually do not know if the default happened or not:   **TRUE**


4. The Merton Default Model describes a default as a level crossing event for an unobservable firm value process:   **TRUE**

### 2.1.8. Quantum Equilibrium - Disequilibrium

Let's start with a joke: a biologist, a physicist and a mathematician watch 2 people enter a house in the next street. After a while, they see 3 people leave the house. The biologist says "the population has multiplied", the physicist says "it is a measurement error" and the mathematician says "if one more person now enters the house, there will be 0 people in it".

**Competitive market equilibrium models:**

- Markets are near a state of thermodynamic equilibrium, with zero exchange of money or information with the outside world.


- Produce unbounded growth of asset prices


- **An alternative**: an "equilibrium - disequilibrium" in the market


- "Quantum Equilibrium-Disequilibrium" - to emphasize the role of noise (the same as quantum effects).

Let $X_{t}$ be the total capitalization of a firm at time $t$, rescaled to a dimensionless quantity $X_{t} \sim 1$.

**Discrete-time dynamics:**

$$X_{t+\Delta t} = (1 + r_{t} \Delta t) \left(X_{t} - c X_{t} \Delta t + u_{t} \Delta t \right) \; \; \; \; \; \; \; \; \; \; Eq.(17) $$

Where:

- $r_{t} = r_{f} + wz_{t} - \mu u_{t} + \frac{\sigma}{\sqrt{\Delta t}} \epsilon_{t}$


- $\mu$ is a market impact parameter


- $c$ is the dividend rate

Here, $u_{t} \Delta t$ is new capital injected in the market by investors at the start of the interval $[t, t + \Delta t]$, after which the new capital $X_{t} - cX_{t}\Delta t + u_{t}\Delta t$ grows at a rate $r_{t}$.

When $u_{t} = 0 \; \forall t$ and $c = 0$, we recover the GBM model.

**Capital Supply Function**

In general, $u_{t}$ should be a function of $X_{t}$. We consider a simple quadratic specification


$$u_{t} = u \left(X_{t} \right) = \phi X_{t} + \lambda X^{2}_{t} \; \; \; \; \; \; \; \; \; \; Eq.(18)$$


We assume that $0 < \lambda \ll 1$. Then in a parametrically wide region $|X_{t}| \ll \left| \frac{\phi}{\lambda} \right|$:

$$u(x) \simeq \phi x, \; \; x \ll \frac{\phi}{\lambda} \; \; \; \; \; \; \; \; \; \; \; \; Eq.(19)$$

- $\phi > 0$: capital is injected ('growth')


- $\phi < 0$: capital is withdrawn ('contraction') 

Substituting **Eq.(18)** into **Eq.(17)**, neglecting terms of order $(\Delta t)^{2}$ and taking the continuous-time limit $\Delta t \; \rightarrow \; dt$, we obtain the **"Quantum Equilibrium - Disequilibrium Model" (QED):**

$$dX_{t} = \kappa X_{t} \left(\frac{\theta}{\kappa} - X_{t} - \frac{g}{\kappa} X^{2}_{t} \right) dt + \sigma X_{t}(dW_{t} + wz_{t}dt) \; \; \; \; \; \; \; \; Eq.(20)$$

where we introduced parameters:

$$\theta = r_{f} - c + \phi, \; \kappa = \mu \phi - \lambda, \; g = \mu \lambda \; \; \; \; \; \; \; \; \; \;Eq.(21)$$


If we keep $\mu > 0$, the mean reversion parameter $\kappa$ can be of either sign, depending on the sign of $\phi$ and the value of $\lambda$
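To get a feel for these dynamics, the discrete-time recursion of Eq.(17) can be iterated directly. The following is a minimal sketch, not a calibrated model: it uses the sign convention $u_{t} = \phi X_{t} + \lambda X_{t}^{2}$ (the one under which Eq.(21) holds), and the function names, the parameter values and the optional signal path `z` are illustrative assumptions.

```python
import numpy as np


def qed_params(r_f, c, phi, lam, mu):
    """Map the model inputs to (theta, kappa, g) as in Eq.(21)."""
    theta = r_f - c + phi
    kappa = mu * phi - lam
    g = mu * lam
    return theta, kappa, g


def simulate_qed(x0, n_steps, dt, r_f, c, phi, lam, mu, sigma,
                 w=0.0, z=None, seed=0):
    """Iterate the discrete-time dynamics of Eq.(17) along one path."""
    rng = np.random.default_rng(seed)
    # Hypothetical signal path z_t; defaults to "no signal".
    z = np.zeros(n_steps) if z is None else np.asarray(z, dtype=float)
    x = np.empty(n_steps + 1)
    x[0] = x0
    for t in range(n_steps):
        u = phi * x[t] + lam * x[t] ** 2            # capital supply u(X_t)
        eps = rng.standard_normal()                 # epsilon_t ~ N(0, 1)
        r = r_f + w * z[t] - mu * u + sigma / np.sqrt(dt) * eps
        x[t + 1] = (1.0 + r * dt) * (x[t] - c * x[t] * dt + u * dt)
    return x


# Illustrative (made-up) parameter values.
theta, kappa, g = qed_params(r_f=0.02, c=0.01, phi=0.10, lam=0.01, mu=0.5)
path = simulate_qed(x0=1.0, n_steps=252, dt=1.0 / 252, r_f=0.02, c=0.01,
                    phi=0.10, lam=0.01, mu=0.5, sigma=0.2)
print(theta, kappa, g, path[-1])
```

Setting `phi = lam = 0`, `c = 0` and `sigma = 0` turns the recursion into plain compounding at the rate `r_f`, which is a convenient sanity check of the GBM limit.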

**True or False**

1. If we set $g = 0, \; w = 0$ and $\sigma = 0$ in the QED model, we recover the Verhulst model:   **TRUE**


2. The 'Q' in the name of the QED model stands for Q-Learning:   **FALSE**


3. A steady non-equilibrium state is only possible for open systems that interact with an outside world: **TRUE**


4. If we set $\kappa = 0$ in the QED model, we recover the GBM: **FALSE**


5. The "QED" model is a model with an inflow/outflow of capital into the market: **TRUE**

# Week 3

## Lesson 1

### 3.1.1. Approaches Beyond Reinforcement Learning 

One important topic is regularization, since in physics it is needed for the model to make sense at all.

Interpret the GIGO (garbage in, garbage out) principle. For example, if you only have data from 2009 onwards, you are not taking the Credit Crisis into consideration.

Prior information becomes really important.

### 3.1.2. Market Dynamics and IRL

Let us revisit the **Quantum Equilibrium-Disequilibrium Model**, in particular the **Discrete-time Dynamics**, which is composed of 3 equations:

$$X_{t+\Delta t} = (1 + r_{t} \Delta t) \left(X_{t} - c X_{t} \Delta t + u_{t} \Delta t \right) \; \; \; \; \; \; \; \; \; \; Eq.(1) $$

$$r_{t} = r_{f} + wz_{t} -\mu u_{t} + \frac{\sigma}{\sqrt{\Delta t}} \epsilon_{t}$$

$$u_{t} = \phi X_{t} + \lambda X^{2}_{t}$$

Where $c$ is the dividend rate

Substituting the second and third equations into the first one and taking the limit $\Delta t \; \rightarrow \; dt$ produces the **Quantum Equilibrium-Disequilibrium** model:

$$dX_{t} = \kappa X_{t} \left(\frac{\theta}{\kappa} - X_{t} - \frac{g}{\kappa} X^{2}_{t} \right) dt + X_{t}(\sigma dW_{t} + wz_{t} dt) \; \; \; \; \; \; \; \; Eq.(2)$$

where

$$\theta = r_{f} - c + \phi, \; \kappa = \mu \phi - \lambda, \; g = \mu \lambda \; \; \; \; \; \; \; \; \; \;Eq.(3)$$

If there is no money exchange between the market and the outside i.e. $\phi = \lambda = 0$ and hence $g = 0$ and $\kappa = 0$, we formally recover the **GBM model**:

$$dX_{t} = (r_{f} + wz_{t})X_{t}dt + \sigma X_{t}dW_{t} \; \; \; \; \; \;\; \; \; \;Eq.(4)$$


The same GBM dynamics are obtained if $\phi \neq 0$, but instead we take a limit of zero friction $\mu = 0, \; \lambda = 0$

If $\mu > 0$ and $\phi \neq 0$ but $\lambda = 0$ (and hence $g = 0$), the QED model reduces to the **Geometric Mean Reversion (GMR)** model (with signals):

$$dX_{t} = \kappa X_{t} \left( \frac{\theta}{\kappa} - X_{t} \right) dt + X_{t}(\sigma dW_{t} + wz_{t} dt) \; \; \; \; \; \; \; \; \; Eq.(5)$$

The GMR model without signals $z_{t}$ was studied by Dixit and Pindyck, and by Ewald and Yang.

If we also take the noiseless limit $\sigma = 0$ and $z_{t} = 0$, we then get the **"Verhulst Limit"**.
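In the Verhulst limit the dynamics reduce to the logistic ODE $\dot{X} = \kappa X (\theta/\kappa - X)$, which has a closed-form solution. The sketch below compares a simple Euler integration with that closed form; the parameter values, step size and function names are illustrative assumptions.

```python
import numpy as np


def verhulst_exact(t, x0, theta, kappa):
    """Closed-form logistic solution with carrying capacity theta / kappa."""
    cap = theta / kappa
    return cap / (1.0 + (cap / x0 - 1.0) * np.exp(-theta * t))


def verhulst_euler(x0, theta, kappa, dt, n_steps):
    """Forward-Euler integration of dX/dt = kappa * X * (theta/kappa - X)."""
    x = np.empty(n_steps + 1)
    x[0] = x0
    for i in range(n_steps):
        x[i + 1] = x[i] + kappa * x[i] * (theta / kappa - x[i]) * dt
    return x


theta, kappa, x0 = 0.5, 0.25, 0.1       # made-up values, theta/kappa = 2
dt, n_steps = 0.001, 40000              # integrate up to t = 40
t_grid = dt * np.arange(n_steps + 1)
x_num = verhulst_euler(x0, theta, kappa, dt, n_steps)
x_ref = verhulst_exact(t_grid, x0, theta, kappa)
print(x_num[-1], theta / kappa, np.max(np.abs(x_num - x_ref)))
```

For positive $\theta$ and $\kappa$ the path saturates at the mean-reversion level $\theta/\kappa$, which is the noiseless counterpart of the GMR behaviour.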

**True or False**

1. The GMR model is recovered from QED model in the limit $g = 0$:  **TRUE**


2. The GBM model is recovered from the QED model in the limit $\kappa = 0, g = 0$:   **TRUE**


3. The GMR model is recovered from the GBM model in the limit $g = 0$:   **FALSE**


4. The GBM model is recovered from the QED model in the limit $g = 0$: **FALSE**

### 3.1.3. Diffusion in a Potential: The Langevin Equation

The Langevin equation is one of the most famous SDEs in science.

First, a bit of history:

- Bachelier (1900): a model of Brownian motion (free diffusion) applied to the stock market (ABM model)


- Louis Bachelier $\rightarrow$ Andrey Kolmogorov $\rightarrow$ Paul Levy $\rightarrow$ Leonard Savage $\rightarrow$ Paul Samuelson (GBM, 1965)


- **Einstein (1905):** diffusion for Brownian particles


- **Paul Langevin (1908):** simplified approach to diffusion in a force potential (e.g. intermolecular forces)


- **Fokker and Planck (1920s):** Brownian motion in a force potential

**Langevin Equation:**

$$\ddot{x} + \gamma \dot{x} + U'(x) = \sqrt{\frac{2 \gamma k T}{M}} \dot{W} \; \; \; \; \; \; \; \; \; \; Eq.(6)$$

Here 

- $x$ is a particle position, 


- $M$ is its mass, 


- $\gamma$ is a dissipation constant


- $U(x)$ is a (generally non-linear) potential, whose gradient gives the force


- $\dot{W}$ is a Gaussian white noise

**Example:** a potential $U$ created by light particles of mass $m$ at thermal equilibrium at temperature $T$, with a Maxwell distribution of velocities given by:

$$f(v) = \sqrt{\frac{m}{2 \pi k T}} e^{- \frac{mv^{2}}{2 k T}} \; \; \; \; \; \; \; \; \; \; Eq.(8)$$


**Langevin Equation: The Phase Space Representation**

The Langevin equation

$$\ddot{X} + \gamma \dot{X} + U'(X) = \sqrt{2 \varepsilon \gamma} \dot{W} \; \; \; \; \; \; \; \; \; \; Eq.(9)$$

with $\varepsilon \equiv kT/M$ (compare Eq.(6)), can also be written in a phase-space representation as two first-order equations:

$$\dot{x} = v \; \; \; \;\; \; \; \; \; \;Eq.(10)$$

$$\dot{v} = - \gamma v - U'(x) + \sqrt{\frac{2 \gamma k T}{M}} \dot{W}$$

A solution of the LE is a 2-dimensional process for the pair $(x(t) , \dot{x}(t))$. (See Schuss)

The overdamped limit $\gamma \rightarrow \infty$ (the Smoluchowski limit) of the Langevin equation:

$$\gamma \dot{X} + U'(X) = \sqrt{\frac{2 \gamma k T}{M}} \dot{W} \; \; \;\; \; \; \; \; \; \; Eq.(11)$$

How to obtain: use the phase space representation and scale time $t = \gamma s $ (see Schuss for details).

Example of overdamped Langevin dynamics in ML: Stochastic Gradient Descent

A free Brownian particle corresponds to motion without a potential, i.e. $U = 0$. The Smoluchowski limit of the Langevin equation produces:

$$\gamma \dot{X} = \sqrt{\frac{2 \gamma k T}{M}} \dot{W} \; \; \; \; \; \; \; \; \; \; \; Eq.(12)$$


or 

$$dx = \sqrt{\frac{2 k T}{\gamma M}} dW_{t} \equiv \sqrt{2D} dW_{t} \; \; \; \; \; \; \; \; \; \; Eq.(13)$$

where $D = \frac{kT}{\gamma M}$ is Einstein's diffusion coefficient.
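A quick numerical illustration of free diffusion: for $dx = \sqrt{2D}\, dW_{t}$ the variance of the displacement after time $t$ should be $2Dt$. The sketch below checks this by Monte Carlo; the value of `D`, the number of paths and the function name are arbitrary assumptions.

```python
import numpy as np


def simulate_free_brownian(n_paths, n_steps, dt, D, seed=0):
    """Simulate dx = sqrt(2 D) dW starting from x = 0; returns all paths."""
    rng = np.random.default_rng(seed)
    increments = np.sqrt(2.0 * D * dt) * rng.standard_normal((n_paths, n_steps))
    return np.cumsum(increments, axis=1)


D, dt, n_steps = 0.5, 0.01, 100                 # total time t = 1
paths = simulate_free_brownian(20000, n_steps, dt, D)
sample_var = paths[:, -1].var()                 # Monte Carlo estimate
theory_var = 2.0 * D * dt * n_steps             # 2 D t
print(sample_var, theory_var)
```

The estimate fluctuates from seed to seed, but it should stay close to $2Dt$ for a sample of this size.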

**Example: Smoluchowski Limit of a Free Brownian Particle**

Langevin dynamics:

$$dX_{t} = - \frac{\partial U(X_{t})}{\partial X_{t}} dt + \sigma (X_{t})d\xi_{t} \; \; \; \; \; \; \; \; \; \; Eq.(14)$$

where $\xi_{t}$ is a noise term (a Gaussian noise $\xi_{t} = W_{t}$ or 'colored noise').

Ito diffusion for the GBM model:

$$dX_{t} = \mu X_{t} dt + \sigma X_{t} dW_{t} \; \; \; \; \; \; \; \; \; \; Eq.(15)$$

The classical potential for the GBM model is:

$$U_{GBM}(X) = - \frac{\mu}{2} X^{2} \; \; \; \; \; \; \; \; \; \; Eq.(16)$$


**True or False**

1. Stochastic Gradient Descent (SGD) is an example of free Ito diffusion without a potential: **FALSE**


2. The overdamped Langevin equation $\gamma \dot{X} + U'(X) = \sigma \dot{W}_{t}$ is obtained in the large friction limit $\gamma \rightarrow \infty$ of the Brownian motion: **TRUE**


3. The parameter evolution in SGD is described by the Langevin equation where the potential is given by the loss function: **TRUE**


4. The SGD is better described by a jump-diffusion process where jumps happen on outliers in the data: **FALSE**

### 3.1.4. Classical Dynamics

What happens when there is no noise at all in the system?

We had the SDE for the QED model:

$$dX_{t} = \kappa X_{t} \left(\frac{\theta}{\kappa} - X_{t} - \frac{g}{\kappa} X^{2}_{t} \right) dt + X_{t}(\sigma dW_{t} + wz_{t} dt) \; \; \; \; \; \; \; \; Eq.(17)$$

The classical potential $U(X)$ for the QED model is therefore:

$$U(x) = -\frac{1}{2}\theta x^{2} + \frac{1}{3} \kappa x^{3} + \frac{1}{4}gx^{4} \; \; \; \; \; \; \; \; \; \; Eq.(18)$$

This is the potential of a quartic oscillator.

This potential reduces to one of the most famous problems in physics.

Another parametrization in terms of parameters $a, b$ defining zeros of the potential:

$$U(x) = -\frac{1}{2}\theta x^{2} \left( 1 - \frac{x}{a} \right) \left(1 - \frac{x}{b} \right) \; \; \; \; \; \; \; \; \; \; Eq.(19)$$

The relation between two sets of parameters:

$$\frac{\kappa}{\theta} = \frac{3}{2} \frac{a+b}{ab}, \; \frac{g}{\theta} = - \frac{2}{ab} \; \; \; \; \; \; \; \; \; \; Eq.(20)$$
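The relation in Eq.(20) is easy to verify numerically by evaluating both forms of the potential on a grid. This is only a sanity-check sketch; the values of `theta`, `a` and `b` and the function names are illustrative assumptions.

```python
import numpy as np


def potential(x, theta, kappa, g):
    """Quartic potential of Eq.(18)."""
    return -0.5 * theta * x**2 + kappa * x**3 / 3.0 + 0.25 * g * x**4


def potential_ab(x, theta, a, b):
    """Same potential written through its zeros a and b, Eq.(19)."""
    return -0.5 * theta * x**2 * (1.0 - x / a) * (1.0 - x / b)


def kappa_g_from_ab(theta, a, b):
    """Parameter map of Eq.(20)."""
    kappa = 1.5 * theta * (a + b) / (a * b)
    g = -2.0 * theta / (a * b)
    return kappa, g


theta, a, b = 1.0, 2.0, -4.0                    # made-up values
kappa, g = kappa_g_from_ab(theta, a, b)
x = np.linspace(-5.0, 3.0, 201)
max_gap = np.max(np.abs(potential(x, theta, kappa, g) - potential_ab(x, theta, a, b)))
print(kappa, g, max_gap)
```

Note that for $\theta > 0$ a positive $g$ requires $ab < 0$, i.e. the two non-trivial zeros of the potential lie on opposite sides of the origin.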

Another example would be a classical quartic potential with a metastable state, with $\theta < 0$, in the log-space $y = \log(x)$.

Another parametrization: 

$$U(y) = -\theta y + \kappa e^{y} + \frac{1}{2}g e^{2y} \; \; \; \; \; \; \; \; \; \; Eq.(22)$$

**True or False**

1. Quartic potential is non-singular (i.e. it is finite for any finite real or complex-valued argument): **TRUE**


2. Quartic potential in the log-space is given by a fourth degree polynomial in $y = log(x)$:   **FALSE**


3. If we set $a = b$ in the classical potential $U(x)$, the resulting potential will only have two extrema, instead of three: **FALSE**


4. The only singularity of the quartic potential on the log-space $y = log(x)$ is at a negative infinity:  **TRUE**

### 3.1.5. Potential Minima and Newton's Law

The classical potential (18) has 3 extrema at:

$$\bar{x}_{0} = 0, \; \bar{x}_{1, 2} = \frac{-\kappa \pm \sqrt{\kappa^{2} + 4g\theta}}{2g} \; \; \; \; \; \; \; \; \; \; Eq.(25)$$

where $\bar{x}_{1}, \bar{x}_{2}$ correspond to the plus and minus signs, respectively. The first extremum $\bar{x}_{0} = 0$ is a degenerate state: both the drift $-U'(\bar{x}_{0})$ and the noise amplitude $\sigma \bar{x}_{0}$ of the QED dynamics vanish there. It is called a natural boundary: once the price touches the zero level $x = 0$, the system stays in this state forever.

For small values $g \rightarrow 0$, we obtain the following expressions for the extrema $\bar{x}_{1, 2}$:

$$\bar{x}_{1} = \frac{\theta}{\kappa} \left(1 - \frac{g\theta}{\kappa^{2}} \right) + O(g^{2})$$

$$\bar{x}_{2} = - \frac{\kappa}{g} - \frac{\theta}{\kappa} \left(1 - \frac{g \theta}{\kappa^{2}} \right) + O(g^{2}) \; \; \; \; \; \; \; \; Eq.(26)$$

Note that the first root $\bar{x}_{1}$ is non-perturbative in $\kappa$ and perturbative in $g$
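The small-$g$ expansions in Eq.(26) can be compared with the exact roots of Eq.(25). A minimal sketch, with made-up parameter values and hypothetical function names:

```python
import numpy as np


def extrema_exact(theta, kappa, g):
    """Non-trivial extrema x1 (plus sign) and x2 (minus sign) of Eq.(25)."""
    disc = np.sqrt(kappa**2 + 4.0 * g * theta)
    return (-kappa + disc) / (2.0 * g), (-kappa - disc) / (2.0 * g)


def extrema_small_g(theta, kappa, g):
    """First-order expansions of Eq.(26), valid for small g."""
    x1 = theta / kappa * (1.0 - g * theta / kappa**2)
    x2 = -kappa / g - x1
    return x1, x2


def dU(x, theta, kappa, g):
    """Derivative of the quartic potential of Eq.(18)."""
    return -theta * x + kappa * x**2 + g * x**3


theta, kappa, g = 0.1, 0.5, 0.01
x1, x2 = extrema_exact(theta, kappa, g)
x1_approx, x2_approx = extrema_small_g(theta, kappa, g)
print(x1, x1_approx)
print(x2, x2_approx)
```

The root $\bar{x}_{1}$ stays finite as $g \rightarrow 0$, while $\bar{x}_{2}$ runs off to infinity like $-\kappa/g$, which is why it cannot be reached from a model with $g = 0$.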

A particle with energy $E$ can move in a classically allowed region where the sum of kinetic and potential energy equals $E$.

Newton's second law (mass $m = 1$ times the acceleration $a \equiv \ddot{x}$ equals the force $F(x) = - U'(x)$):

$$\ddot{x} = -U'(x) = \theta x - \kappa x^{2} - g x^{3} \; \; \; \; \; \; \; \; \; \; Eq.(27)$$

**The CPT symmetry of Newtonian mechanics:**

- C-parity: $\kappa \rightarrow - \kappa$


- P-parity: $x \rightarrow -x$


- T-parity (time reversal): $t \rightarrow -t \; \; \; \; \; \; \; \; \; \; Eq.(29)$

Eq.(27) is separately symmetric with respect to the time reversal $T$ and the joint $CP$ - inversion. As a consequence, it is also invariant with respect to a simultaneous $CPT$ transformation.
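Both symmetries can be checked numerically on Eq.(27). In the sketch below, the $CP$ check compares accelerations directly, and the $T$ check integrates forward, flips the velocity, and integrates back. The velocity-Verlet integrator is a choice made here because it is itself time-reversible; the parameter values and function names are assumptions.

```python
def acceleration(x, theta, kappa, g):
    """Right-hand side of Eq.(27): the force -U'(x) for unit mass."""
    return theta * x - kappa * x**2 - g * x**3


def verlet(x, v, dt, n_steps, theta, kappa, g):
    """Velocity-Verlet integration of x'' = acceleration(x)."""
    a = acceleration(x, theta, kappa, g)
    for _ in range(n_steps):
        x = x + v * dt + 0.5 * a * dt * dt
        a_new = acceleration(x, theta, kappa, g)
        v = v + 0.5 * (a + a_new) * dt
        a = a_new
    return x, v


theta, kappa, g = 1.0, 0.5, 0.1                 # made-up values

# CP: x -> -x together with kappa -> -kappa flips the sign of both sides.
cp_gap = acceleration(-0.7, theta, -kappa, g) + acceleration(0.7, theta, kappa, g)

# T: run forward, reverse the velocity, run the same time again.
x0, v0 = 1.0, 0.0
x_fwd, v_fwd = verlet(x0, v0, 0.01, 1000, theta, kappa, g)
x_back, v_back = verlet(x_fwd, -v_fwd, 0.01, 1000, theta, kappa, g)
print(cp_gap, x_back - x0, v_back + v0)
```

Applying $P$ alone (without $\kappa \rightarrow -\kappa$) changes the sign of the $\kappa x^{2}$ term relative to the others, so the equation is not invariant under $P$ by itself.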

**True or False**

1. The word 'non-perturbative' means that a corresponding parameter is fixed and not subject to changes: **FALSE**


2. A model is non-perturbative in a parameter $\theta$ if dependence of observables on $\theta$ cannot be obtained as a result of a regular perturbation theory in small values of $\theta$.  **TRUE**


3. The Newtonian mechanics is invariant under reflection of time because the Lagrangian does not explicitly depend on time:  **FALSE**


4. The Newtonian mechanics is invariant under reflection of time because it contains the second derivative with respect to time. Under the time reversal, it stays the same: **TRUE**

### 3.1.6. Classical Dynamics: the Lagrangian and the Hamiltonian

The total energy $E$ that is equal to the sum of the kinetic energy $K = \frac{m \dot{x}^{2}}{2}$ and the potential energy $U(x)$ is a constant in time:

$$E \equiv \frac{m \dot{x}^{2}}{2} + U(x) \; \; \; \; \; \; \; \; \; \; Eq.(30)$$


Solving for the velocity (taking the positive root), we can write it as follows:

$$\frac{dx}{dt} = \sqrt{\frac{2}{m} \left[E - U(x) \right]} \; \; \; \; \; \; \; \; \; \; Eq.(31)$$

This is a differential equation that we can integrate:

$$t = t(x) = \sqrt{\frac{m}{2}} \int^{x}_{x_{0}}{\frac{dx'}{\sqrt{E - U(x')} }} + \text{constant} \; \; \; \; \; \; \; \; Eq.(32)$$

The classical motion is only allowed in a region where $U(x) < E$

As $K = \frac{m \dot{x}^{2}}{2} \geq 0 $, turning points of a potential $U(x)$ are those points where $K = 0$ and hence:

$$\text{Turning points:} \; U(x) = E \; \; \; \; \; \; \; \; \; \; Eq.(33)$$

If for a given $E$ we have two turning points $x_{1}(E)$ and $x_{2}(E)$, then the period of classical oscillation in a potential well is:

$$T(E) = \sqrt{2m} \int^{x_{2}(E)}_{x_{1}(E)} { \frac{dx}{\sqrt{E - U(x)}} } \; \; \; \; \; \; \; \; \; \; Eq.(34)$$
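Eq.(34) has integrable singularities at the turning points, so a naive quadrature on a uniform grid in $x$ behaves poorly. The sketch below uses the substitution $x = \text{centre} + \text{half} \cdot \sin\varphi$ with a midpoint rule, and checks the result on a harmonic well, where the period is known to be $2\pi/\omega$. The substitution, the test potential and the function name are choices made here, not part of the lecture.

```python
import numpy as np


def period(U, E, x1, x2, m=1.0, n=2000):
    """Period of oscillation between turning points x1 < x2, Eq.(34)."""
    centre, half = 0.5 * (x1 + x2), 0.5 * (x2 - x1)
    # Midpoints in phi avoid evaluating exactly at the turning points.
    phi = -0.5 * np.pi + (np.arange(n) + 0.5) * np.pi / n
    x = centre + half * np.sin(phi)
    integrand = half * np.cos(phi) / np.sqrt(E - U(x))
    return np.sqrt(2.0 * m) * integrand.sum() * np.pi / n


omega, amplitude = 2.0, 1.5
E = 0.5 * omega**2 * amplitude**2               # energy at the turning points


def harmonic(x):
    return 0.5 * omega**2 * x**2


T_num = period(harmonic, E, -amplitude, amplitude)
print(T_num, 2.0 * np.pi / omega)
```

For an anharmonic well such as the quartic potential, the same function can be used once the turning points for the chosen energy have been located.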

**Hamilton's Principle of Least Action**

The action $S$ and the Lagrangian $\mathcal{L}$:

$$S = \int^{t_{2}}_{t_{1}} { \mathcal{L}(x, \dot{x}, t)dt } = \int^{t_{2}}_{t_{1}} { \left[ \frac{m\dot{x}^{2}}{2} - U(x) \right]dt } \; \; \; \; \;\; \; \; \; Eq.(35)$$

All of Classical Mechanics can be derived from a single principle, **Hamilton's principle**:

The motion of a mechanical system should be such that the action along a trajectory is minimized.

$$\delta S = \delta \int^{t_{2}}_{t_{1}} { \mathcal{L}(x, \dot{x}, t)dt }$$

$$\delta S = \int^{t_{2}}_{t_{1}} { \left[ \frac{\partial \mathcal{L}}{\partial x} \delta x + \frac{\partial \mathcal{L}}{\partial \dot{x}} \delta \dot{x} \right]dt } = 0 \; \; \; \; \; \; \; \; Eq.(36)$$

This produces the Lagrange equation:

$$\frac{d}{dt} \frac{\partial \mathcal{L}}{\partial \dot{x}} - \frac{\partial \mathcal{L}}{\partial x} = 0 \; \; \; \; \; \; \; \; Eq.(37)$$

These equations are very general, and in particular reproduce Newton's laws of dynamics.
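As a quick check of the last statement, here is the one-line calculation for the Lagrangian in Eq.(35); the only inputs are the Lagrange equation (37) and the potential (18):

$$\frac{\partial \mathcal{L}}{\partial \dot{x}} = m\dot{x}, \quad \frac{\partial \mathcal{L}}{\partial x} = -U'(x) \;\; \Longrightarrow \;\; m\ddot{x} = -U'(x)$$

For $m = 1$ and the quartic potential (18) this is exactly Eq.(27).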

Using conservation of energy $E$, we can express the momentum $p = m \dot{x}$ (with $m = 1$, and writing the coordinate as $y$) in terms of the energy and the potential energy:

$$H = \frac{1}{2} p^{2} + U(y) = E \; \iff \; p = \sqrt{2 [ E - U(y)]} \; \; \; \; \; \; \; \; Eq.(38)$$

Substituting this into (35), we obtain:

$$S = \int^{y_{f}}_{y_{0}} { \sqrt{2[E - U(y)]}dy } \; \; \; \; \; \; \; \; Eq.(39)$$

**From Classical Mechanics to Quantum Mechanics**

Eq.(40) is identical to Eq.(39).

- In classical mechanics: **one** path from $y_{0}$ to $y_{f}$, determined by the Lagrange equation; the action along the path is given by (40).


- In quantum mechanics: an **infinite** number of paths from $y_{0}$ to $y_{f}$; each path carries the probability amplitude (weight)

$$p(E, path) \sim \exp \left( \frac{i}{\hbar} S(E, path) \right) \; \; \; \; \; \; \; \; Eq.(41)$$

**True or False**

1. The Hamilton principle of the least action produces the Lagrange equation of motion that defines a trajectory of a classical particle: **TRUE**


2. In quantum mechanics, a trajectory of a particle is determined by the quantum Lagrange equation: **FALSE**


3. In quantum mechanics, a particle in a sense moves from one point to another along an infinite number of paths all at once: **TRUE**


4. In classical mechanics, a particle moves from an initial to final point along a single trajectory: **TRUE**

(ALL CORRECT AT 1ST TRY, CIST!)

### 3.1.7. Langevin Equation and Fokker-Planck Equations

The Langevin equation is a path-wise SDE. If we want to study statistical properties of the stochastic system, we can instead use an equation for the probability distribution of the system. This produces the **Fokker-Planck equation (the forward Kolmogorov equation)**.

$$\dot{p}(x, \;t|x_{0}) = \frac{\partial}{\partial x}[U'(x) p(x, \; t|x_{0})] + \frac{1}{2} \frac{\partial^{2}}{\partial x^{2}} [\sigma^{2}(x) p(x, \;t|x_{0})] \; \; \; \; \; \; Eq.(42)$$ 


Initial condition:

$$\lim_{t \to t_{0}} p(x, t|x_{0}) = \delta(x - x_{0})$$

**How to derive the Fokker-Planck equation from the Langevin Equation?**

Calculate the time derivative of the mean value of some function $f(X)$ (using Itô's lemma):

$$\frac{d}{dt} \langle f(X) \rangle = \left \langle \frac{d}{dt} f(X) \right \rangle = \left \langle - U'(X_{t})f_{x} + \frac{1}{2} \sigma^{2}(X_{t})f_{xx} \right \rangle$$


$$= \int \left(-U'(x)f_{x} + \frac{1}{2}\sigma^{2}(x)f_{xx}\right) p(x|x_{0})dx$$

On the other hand, we can compute differently:

$$\frac{d}{dt} \langle f(X) \rangle = \int f(x) \frac{\partial p(x|x_{0})}{\partial t}dx \; \; \; \; \; \; \; \; Eq.(43)$$

Integrating by parts in the first relation (the boundary terms are assumed to vanish), we have:

$$\frac{d}{dt}\langle f(X) \rangle = \int f(x) \left( \frac{\partial}{\partial x}\left[U'(x) p(x|x_{0})\right] + \frac{1}{2} \frac{\partial^{2}}{\partial x^{2}} \left[\sigma^{2}(x) p(x|x_{0})\right] \right) dx \; \; \; Eq.(44)$$

Therefore, the two expressions should be the same:

$$\int f(x) \frac{\partial p(x|x_{0})}{\partial t}dx = \int f(x) \left( \frac{\partial}{\partial x}\left[U'(x) p(x|x_{0})\right] + \frac{1}{2} \frac{\partial^{2}}{\partial x^{2}} \left[\sigma^{2}(x) p(x|x_{0})\right] \right) dx \; Eq.(45)$$

Because $f(x)$ is arbitrary, we obtain the FPE:

$$\frac{\partial p(x, t|x_{0})}{\partial t} = \frac{\partial}{\partial x}[U'(x) p(x, \; t|x_{0})] + \frac{1}{2} \frac{\partial^{2}}{\partial x^{2}} [\sigma^{2}(x) p(x, \;t|x_{0})]$$

An absorbing boundary condition at $x = 0$ (a collapse of the process):

$$\lim_{x \to 0} p(x, t|x_{0}) = 0 \; \; \; \; \; Eq.(46)$$


The FPE in the log-price space $y = \log(x)$:

$$\frac{\partial p(y, t|y_{0})}{\partial t} = \frac{\partial}{\partial y}[U'(y) p(y, t|y_{0})] + \frac{\sigma^{2}}{2} \frac{\partial^{2}}{\partial y^{2}} [p(y, t|y_{0})] \; \; \; Eq.(47)$$

Here the potential $U(y)$ is 

$$U(y) = -\theta y + \kappa e^{y} + \frac{1}{2} g e^{2y}$$

When $\sigma = 0$, the dynamics in the log-space become deterministic:

$$\dot{y} = -U'(y) \; \;\; \; \; \; \; \; \; \;Eq.(48)$$

This produces:

$$\frac{dU(y)}{dt} = U'(y)\dot{y} = -[U'(y)]^{2} \leq 0 \; \; \; \; \; \; \; \; Eq.(49)$$

The particle $y(t)$ always moves to minimize $U(y)$, only stops when $U'(y) = 0$
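The noiseless flow of Eq.(48) can be integrated directly to see Eq.(49) at work: the potential decreases along the path until the particle settles at a point where $U'(y) = 0$. A minimal Euler sketch with the log-space potential given above; the parameter values, step size and function names are illustrative assumptions.

```python
import numpy as np


def U(y, theta, kappa, g):
    """Log-space potential U(y) = -theta*y + kappa*e^y + g*e^(2y)/2."""
    return -theta * y + kappa * np.exp(y) + 0.5 * g * np.exp(2.0 * y)


def dU(y, theta, kappa, g):
    """Derivative U'(y)."""
    return -theta + kappa * np.exp(y) + g * np.exp(2.0 * y)


def gradient_flow(y0, theta, kappa, g, dt=0.01, n_steps=30000):
    """Forward-Euler integration of dy/dt = -U'(y), Eq.(48)."""
    y = np.empty(n_steps + 1)
    y[0] = y0
    for i in range(n_steps):
        y[i + 1] = y[i] - dU(y[i], theta, kappa, g) * dt
    return y


theta, kappa, g = 0.1, 0.5, 0.01
y_path = gradient_flow(0.0, theta, kappa, g)
U_path = U(y_path, theta, kappa, g)
# Positive root of g*x^2 + kappa*x - theta = 0, i.e. where U'(log x) = 0.
x_star = (-kappa + np.sqrt(kappa**2 + 4.0 * g * theta)) / (2.0 * g)
print(np.exp(y_path[-1]), x_star)
```

In price space the resting point $e^{y}$ coincides with the positive root of $g x^{2} + \kappa x - \theta = 0$, since $U'(y) = 0$ is the same quadratic condition written in $x = e^{y}$.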

**True or False**

1. The FPE equation is a second-order equation, therefore it requires two boundary conditions: **TRUE**


2. The Fokker-Planck equation (FPE) is a Partial Differential Equation (PDE):  **TRUE**


3. The Fokker-Planck equation (FPE) is a Stochastic Differential Equation (SDE): **FALSE**


4. The FPE equation is a first-order equation, therefore it requires one boundary condition: **FALSE** 

### 3.1.8. The Fokker-Planck Equation and Quantum Mechanics

The FPE in the log-price space $y = \log(x)$:

$$\frac{\partial p(y, t|y_{0})}{\partial t} = \frac{\partial}{\partial y}[U'(y) p(y, t|y_{0})] + \frac{\sigma^{2}}{2} \frac{\partial^{2}}{\partial y^{2}} [p(y, t|y_{0})] \; \; \; Eq.(50)$$

Here the potential $U(y)$ is 

$$U(y) = -\theta y + \kappa e^{y} + \frac{1}{2} g e^{2y}$$

**The FPE Equation: Stationary and Quasi-stationary Distributions**

Stationary (time-independent) solution of the FPE:

$$p(y) = \frac{1}{Z} \exp\left( -\frac{2 U(y)}{\sigma^{2}} \right) \; \; \; \; \; \; \; \; \; Eq.(51)$$

This is known as a **Boltzmann or exponential distribution**, where $Z$ is a normalization constant. When there is metastability, it shows up as a divergence of the normalization constant $Z$. A metastable state can decay through thermal fluctuations.
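When the potential is confining, the Boltzmann density of Eq.(51) can be normalized numerically on a grid. The sketch below does this for the log-space potential with positive $\theta$, $\kappa$ and $g$ (a stable, not metastable, case). The grid limits, parameter values and function name are assumptions made for illustration.

```python
import numpy as np


def stationary_density(y, theta, kappa, g, sigma):
    """Normalised Boltzmann density exp(-2 U / sigma^2) / Z on the grid y."""
    U = -theta * y + kappa * np.exp(y) + 0.5 * g * np.exp(2.0 * y)
    # Subtracting U.min() rescales Z but leaves the density unchanged
    # and avoids overflow in the exponential.
    weights = np.exp(-2.0 * (U - U.min()) / sigma**2)
    Z = np.sum(0.5 * (weights[1:] + weights[:-1]) * np.diff(y))   # trapezoid rule
    return weights / Z


theta, kappa, g, sigma = 0.1, 0.5, 0.01, 0.2
y = np.linspace(-8.0, 1.0, 9001)
p = stationary_density(y, theta, kappa, g, sigma)
total = np.sum(0.5 * (p[1:] + p[:-1]) * np.diff(y))
y_mode = y[np.argmax(p)]
y_min = np.log((-kappa + np.sqrt(kappa**2 + 4.0 * g * theta)) / (2.0 * g))
print(total, y_mode, y_min)
```

The mode of the density sits at the minimum of $U(y)$, and a smaller $\sigma$ concentrates the mass more tightly around it.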

Assume that the volatility is constant, $\sigma(x) = \sigma$. Make the following ansatz for the FPE:

$$p(y, t|y_{0}) = e^{-\frac{1}{\sigma^{2}}U(y)} K(y, t|y_{0}) \; \; \; \; \; \; \; \; \; Eq.(52)$$


Using this in Eq.(50), we obtain an imaginary-time Schrödinger equation for $K(y, t|y_{0})$:

$$-\frac{\partial K(y, t|y_{0})}{\partial t} = HK(y, t|y_{0}) \; \; \; \; \; \; \; \; \; Eq.(53)$$

where $H$ is the Hamiltonian:

$$H = -\frac{\sigma^{2}}{2}\frac{\partial^{2}}{\partial y^{2}} + \frac{1}{2\sigma^{2}}(U'(y))^{2} - \frac{1}{2}U''(y)$$

$$\equiv -\frac{\sigma^{2}}{2}\frac{\partial^{2}}{\partial y^{2}} + V(y) \; \; \; \; \; \; \; \; Eq.(54)$$

where $V(y)$ is an equivalent quantum-mechanical potential.



**The Schrödinger Equation and Supersymmetry**

Supersymmetry (SUSY) of the Schrödinger equation (53):

$$H = A^{+}A \; \; \; \; \; \; \; \; \; Eq.(55)$$

where

$$A = \frac{1}{\sqrt{2}}\left[\sigma\frac{\partial}{\partial y} + \frac{1}{\sigma} U' \right], \; A^{+} = \frac{1}{\sqrt{2}}\left[-\sigma\frac{\partial}{\partial y} + \frac{1}{\sigma} U' \right] \; \; \; Eq.(56)$$

Operators $A, A^{+}$ are sometimes called supercharge generators, and the function $U'(y)$ is called the superpotential. The supersymmetric partner Hamiltonian $H_{+}$ is obtained by swapping their order:

$$H_{+} = AA^{+} = -\frac{\sigma^{2}}{2}\frac{\partial^{2}}{\partial y^{2}} + \frac{1}{2\sigma^{2}}(U')^{2} + \frac{1}{2}U'' \; \; \; \; \; Eq.(57)$$

**The Unbroken and Broken SUSY**

Due to supersymmetry, if $\Psi_{n}$ is an eigenvector of $H$ with an eigenvalue $E_{n}$, then the state $A\Psi_{n}$ will be an eigenstate of $H_{+}$ with the same eigenvalue $E_{n}$:

$$H_{+}A\Psi_{n} = AA^{+}A\Psi_{n} = AH\Psi_{n} = AE_{n}\Psi_{n} = E_{n}A\Psi_{n} \; \; \; Eq.(58)$$


This means that all eigenstates of $H$, except a 'vacuum' state with energy $E_{0}=0$ (if it exists, see below), are degenerate in energy with eigenstates of the SUSY partner Hamiltonian $H_{+}$.
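The pairing of spectra can be illustrated with a finite-difference version of the operators in Eq.(56). The sketch below is only an illustration under strong assumptions: the staggered-grid discretization (differences and averages taken at grid midpoints), the grid range, the parameter values and the function names are all choices made here. In finite dimensions the matching of non-zero eigenvalues of $A^{T}A$ and $AA^{T}$ is a linear-algebra identity, so this shows the structure rather than proving the continuum statement.

```python
import numpy as np


def susy_pair(y, sigma, dU):
    """Discretised H = A^T A and H_plus = A A^T on the grid y."""
    n = y.size
    h = y[1] - y[0]
    rows = np.arange(n - 1)
    D = np.zeros((n - 1, n))            # forward difference onto midpoints
    D[rows, rows] = -1.0 / h
    D[rows, rows + 1] = 1.0 / h
    M = np.zeros((n - 1, n))            # averaging onto midpoints
    M[rows, rows] = 0.5
    M[rows, rows + 1] = 0.5
    y_mid = 0.5 * (y[:-1] + y[1:])
    A = (sigma * D + (dU(y_mid) / sigma)[:, None] * M) / np.sqrt(2.0)
    return A.T @ A, A @ A.T


theta, kappa, g, sigma = 0.1, 0.5, 0.01, 0.5


def dU_log(y):
    """U'(y) for the log-space potential."""
    return -theta + kappa * np.exp(y) + g * np.exp(2.0 * y)


y = np.linspace(-6.0, 2.0, 161)
H, H_plus = susy_pair(y, sigma, dU_log)
ev = np.linalg.eigvalsh(H)              # n eigenvalues, ascending
ev_plus = np.linalg.eigvalsh(H_plus)    # n - 1 eigenvalues, ascending
print(ev[:4])
print(ev_plus[:3])
```

The extra, unpaired eigenvalue of the discretised $H$ is (numerically) zero, which is the discrete counterpart of the 'vacuum' state annihilated by $A$.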

SUSY can be unbroken or spontaneously broken. 

If the energy of the ground state is larger than zero, then SUSY is spontaneously broken:

$$\text{Unbroken SUSY:} \; \; A\Psi_{0}= 0 \cdot \Psi_{0} =0 \; (E_{0} = 0)$$

$$\text{Broken SUSY:} \; \; H\Psi_{0} = A^{+}A\Psi_{0} = E_{0}\Psi_{0}, \; E_{0} > 0 \; \; \; \; \; Eq.(59)$$

**Escape from a Metastable State**

Large stock drops and defaults can be thought of as tunneling through a potential barrier. The classical transition state theory gives the probability (per unit time) of a particle jumping over a barrier as a product of two factors: the Arrhenius factor $B$ and the pre-factor $A$. 

The Arrhenius factor is:

$$B = exp\left(-\frac{E_{b}}{kT} \right) \;  \;  \;  \;  \;  \;  \;  \;  \;  Eq.(60)$$

Where $E_{b}$ is the barrier height, and $T$ is the temperature. The pre-factor $A$ for a $1D$ well is given by the frequency $\omega_{0}$ of oscillations at the bottom of the well:

$$A = \frac{\omega_{0}}{2 \pi} \; \; \; \;\;\;\; \;Eq.(61)$$
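Putting Eq.(60) and Eq.(61) together gives a one-line estimate of the escape rate. The helper below is a hypothetical illustration with made-up inputs; in the financial analogy one would presumably replace $kT$ by $\sigma^{2}/2$, as suggested by the Boltzmann weight in Eq.(51), but that mapping is an assumption here.

```python
import math


def escape_rate(omega0, barrier, kT):
    """Transition-state estimate: (omega0 / 2 pi) * exp(-barrier / kT)."""
    prefactor = omega0 / (2.0 * math.pi)        # Eq.(61)
    arrhenius = math.exp(-barrier / kT)         # Eq.(60)
    return prefactor * arrhenius


# The rate falls exponentially as the barrier grows relative to kT.
for barrier in (0.5, 1.0, 2.0):
    print(barrier, escape_rate(omega0=1.0, barrier=barrier, kT=0.25))
```

The exponential dependence on the barrier height is what makes such events rare but not impossible.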

Tunneling in QM: imaginary time and imaginary action:

$$S(E) = \int^{y_{f}}_{y_{0}} \sqrt{2[E-U(y)]}dy \; \; \; \; \; \; \; \; Eq.(62)$$

**Escape by Tunneling and Divergence of Perturbation Theory**

- Tunneling is a *non-perturbative* effect: it can't be obtained as an expansion in small values of $\kappa$ and $g$ around a model with a 'trivial vacuum' $\bar{x} = 0$.


- Divergence of perturbative series and tunneling have the same origin


- This is similar to Dyson's argument for the divergence of perturbation theory in Quantum Electrodynamics

**Summary**

- RL/IRL can be used not only to compute specific quantities, but also to build the models themselves.


- The model we presented in the previous course can be both re-derived and improved using methods from physics


- Analysis of the different symmetries of the problem plays a key role


- Symmetries determine the nature of phase transitions


- For your course project: re-estimate the QED model with non-zero $g$

**True or False**

1. The FPE can be transformed to a Schrödinger equation by a substitution $P(y, t) = P_{0}(y)K(y, t)$ where $P_{0}$ is the stationary distribution: **FALSE**


2. The FPE can be transformed to a Schrödinger equation by a substitution $P(y, t) = \sqrt{P_{0}(y)}K(y, t)$ where $P_{0}$ is the stationary distribution: **TRUE**


3. Tunneling is a process of passage through a potential barrier that is activated by noise: **TRUE**


4. Tunneling is a process of random transformation of profits to loss: **FALSE**