# Summary of Potential Microstructure ML Models

## Model Summaries 
1. TWAP
2. VWAP
    - Strategy: 
    ![image.png](attachment:image.png)
    - Pro: **Transparent and easy to compute.** VWAP is straightforward to calculate (ex post) and is attractive to investors because execution prices close to VWAP indicate that they obtained prices similar to what the market bore over the execution window.
    - Con: Requires post-execution data; the benchmark is only known once the execution window has closed.
    - Papers: Cartea, Álvaro, and Sebastian Jaimungal. 2016. “A Closed Form Execution Strategy to Target Volume Weighted Average Price.”; “Volume Weighted Average Price Optimal Execution” (on a daily basis)
    - github: [Volume Weighted Average Price Optimal Execution](https://github.com/cvxgrp/vwap_opt_exec)
    - Simple Execution model: [VWAP Execution](https://github.com/QuantConnect/Lean/blob/master/Algorithm.Framework/Execution/VolumeWeightedAveragePriceExecutionModel.py)
3. Jamuna deep learning 
4. Micro-price 
6. CNN-LSTM
    - **Pros**: Shows high predictive power across a range of horizons, especially with OF inputs, achieving significantly positive forecasting accuracy across multiple stocks. Ideal for capturing temporal dependencies due to its sequential and convolutional architecture.
    - **Cons**: Provides limited improvement over simpler LSTM models and is computationally more intensive. Does not outperform simple LSTM consistently, particularly beyond short-term horizons.

7. NLP 
8. RL
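
As a reference point for the TWAP and VWAP entries above, here is a minimal sketch of how the two benchmarks and the slippage against VWAP could be computed ex post. The function names, the bucketed inputs, and the sample numbers are illustrative assumptions, not taken from any of the cited papers or repositories.

```python
import numpy as np

def vwap(prices, volumes):
    """Volume-weighted average price: sum(p_i * v_i) / sum(v_i)."""
    prices = np.asarray(prices, dtype=float)
    volumes = np.asarray(volumes, dtype=float)
    return float(np.sum(prices * volumes) / np.sum(volumes))

def twap(prices):
    """Time-weighted average price over equally spaced observations."""
    return float(np.mean(np.asarray(prices, dtype=float)))

def slippage_bps(exec_prices, exec_sizes, mkt_prices, mkt_volumes, side=1):
    """Signed slippage of our fills versus market VWAP, in basis points.

    side=+1 for a buy (paying above VWAP is a cost), side=-1 for a sell.
    """
    ours = vwap(exec_prices, exec_sizes)
    bench = vwap(mkt_prices, mkt_volumes)
    return side * (ours - bench) / bench * 1e4

# Hypothetical per-bucket market data (made-up numbers)
mkt_prices = [100.0, 101.0, 102.0]
mkt_volumes = [1000, 1000, 2000]

print(vwap(mkt_prices, mkt_volumes))  # 101.25
print(twap(mkt_prices))               # 101.0
print(slippage_bps([100.0, 101.0, 102.0], [100, 100, 100],
                   mkt_prices, mkt_volumes, side=1))
```

Both benchmarks need the whole window's data, which is the "post-execution" limitation noted for VWAP.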

## Other Potential Approaches (how to deal with the data?)
1. Which variable to forecast
    - market spread 
    - trading volume
    - return volatility
2. How to evaluate 
    - Quantile predictions 
    - training separate models for the bid and ask sides 
3. Combination of ML Models 
    4. MLP
    - **Pros**: Simpler to implement, requires less computational power than LSTM and CNN-based models. Useful in setups where fast predictions are prioritized.
    - **Cons**: Prone to overfitting on complex datasets, lacks the sequential structure necessary for handling temporal data effectively. Performance generally lags behind LSTM and CNN-based models.
    5. ARX
    - **Pros**: Straightforward to interpret and implement. Useful in stationary datasets with exogenous factors where predictions do not require sophisticated modeling of time dependencies.
    - **Cons**: Not suitable for nonstationary inputs like LOB states. Limited predictive power compared to neural network-based models.
    6. LSTM 
    - **Pros**: Performs well on OF inputs, with good generalization abilities. LSTM's sequential structure is naturally suited to temporal data, making it effective in high-frequency trading applications. It achieves consistent performance across different horizons.
    - **Cons**: Performance decreases when using non-stationary LOB inputs. May overfit due to model complexity if not tuned properly.

    7. LSTM–MLP   
    - **Pros**: Combines benefits of LSTM's sequential learning with MLP's ability to handle nonlinearities, offering a robust model for predictive tasks. Shows similar performance to LSTM but with potential improvements in stability.
    - **Cons**: Limited additional gains over LSTM alone, especially in short-term forecasting horizons. Computationally more demanding due to its hybrid structure.
    ![image-2.png](attachment:image-2.png)
4. Input, e.g. LOB vs OF 
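
To make the LOB vs. OF input choice concrete, the sketch below derives best-level order flow (OF) and order flow imbalance (OFI) from consecutive book snapshots. It assumes one common definition of order flow (bid and ask flows built from changes in the best quotes and their sizes); the function names and the sample snapshots are hypothetical, and a deeper book would repeat the same logic per level.

```python
import numpy as np

def order_flow(bid_px, bid_sz, ask_px, ask_sz):
    """Best-level bid and ask order flow between consecutive snapshots.

    Returns two arrays of length n-1 for n snapshots.
    """
    b, qb = np.asarray(bid_px, float), np.asarray(bid_sz, float)
    a, qa = np.asarray(ask_px, float), np.asarray(ask_sz, float)

    # Bid: price up -> new size counts as added; unchanged -> size change;
    # price down -> previous size counts as removed.
    bof = np.where(b[1:] > b[:-1], qb[1:],
          np.where(b[1:] == b[:-1], qb[1:] - qb[:-1], -qb[:-1]))
    # Ask: mirror image of the bid side.
    aof = np.where(a[1:] > a[:-1], -qa[:-1],
          np.where(a[1:] == a[:-1], qa[1:] - qa[:-1], qa[1:]))
    return bof, aof

def order_flow_imbalance(bid_px, bid_sz, ask_px, ask_sz):
    """OFI = bid order flow minus ask order flow."""
    bof, aof = order_flow(bid_px, bid_sz, ask_px, ask_sz)
    return bof - aof

# Hypothetical snapshots of the best quotes
bid_px = [10.00, 10.00, 10.01, 10.00]
bid_sz = [500, 700, 200, 400]
ask_px = [10.02, 10.02, 10.02, 10.01]
ask_sz = [300, 300, 100, 250]

ofi = order_flow_imbalance(bid_px, bid_sz, ask_px, ask_sz)
print(ofi)  # [ 200.  400. -450.]
```

Raw LOB states are price and size levels, while OF and OFI are built from their changes, which is why they tend to be closer to stationary inputs.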

## Paper Summary
### 1. Cartea, Álvaro, and Sebastian Jaimungal. 2016. “A Closed Form Execution Strategy to Target Volume Weighted Average Price.”

  - Target POV (POCV) -> Target VWAP

  - Assumes a general stochastic process for volume and provides a closed-form expression for the optimal execution strategy.

  - Accounts for **the impact of the agent's trading** through both temporary and permanent price impact.

  - Sets up a performance criterion in which the investor seeks to execute a large order over a trading horizon T while the trading speed targets either a percentage of volume (POV) or a percentage of cumulative volume (POCV). After deriving the optimal trading speed for these two general cases, the authors show how these strategies include optimal execution strategies that target and achieve VWAP.
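
The link between POV and VWAP noted above can be illustrated with a deliberately naive sketch: trade a fixed fraction `rho` of each bucket's market volume. This is not the paper's closed-form strategy and it ignores price impact; `pov_schedule`, `rho`, and the bucketed data are assumptions made only for illustration.

```python
import numpy as np

def pov_schedule(market_volume, total_shares, rho):
    """Trade rho * market volume in each bucket until the order is filled.

    Returns the per-bucket trades and the unexecuted remainder.
    """
    market_volume = np.asarray(market_volume, dtype=float)
    trades = np.zeros_like(market_volume)
    remaining = float(total_shares)
    for i, v in enumerate(market_volume):
        trades[i] = min(rho * v, remaining)
        remaining -= trades[i]
    return trades, remaining

# Hypothetical bucketed market data
prices = np.array([100.0, 101.0, 99.0, 102.0])
volume = np.array([1000.0, 4000.0, 2000.0, 3000.0])
rho = 0.1

# If the order size equals rho * total volume, every bucket trades the same
# fraction of market volume, so the execution VWAP matches the market VWAP.
trades, left = pov_schedule(volume, total_shares=rho * volume.sum(), rho=rho)
exec_vwap = np.sum(prices * trades) / np.sum(trades)
mkt_vwap = np.sum(prices * volume) / np.sum(volume)
print(trades, left)
print(exec_vwap, mkt_vwap)
```

In practice the total market volume is unknown in advance, so the order may finish early or leave a remainder; handling that uncertainty is part of what the paper's formulation addresses.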

### 2. Frei, C. and N. Westray (2015). Optimal execution of a VWAP order: a stochastic control approach. Mathematical Finance 25(3), 612-639
  - Models the relative volume curve using a gamma bridge process.
  - Incorporates a linear temporary market impact model for price effects.
  - Assumes independence between the relative volume curve and the total trading volume.
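
A gamma bridge can be simulated by normalising a gamma process by its terminal value, which gives a nondecreasing curve from 0 to 1. The sketch below is a rough illustration under assumptions of my own: a flat expected intraday profile, a hypothetical `concentration` parameter controlling how noisy the curve is, and 78 five-minute buckets. It is not the calibration used in the paper.

```python
import numpy as np

def simulate_gamma_bridge(n_steps, concentration=50.0, seed=0):
    """Simulate a relative volume curve as a gamma bridge on [0, 1].

    Independent gamma increments are accumulated and divided by their total,
    so the path starts at 0, ends at 1, and never decreases.
    """
    rng = np.random.default_rng(seed)
    dt = 1.0 / n_steps
    increments = rng.gamma(shape=concentration * dt, scale=1.0, size=n_steps)
    cum = np.cumsum(increments)
    return np.concatenate(([0.0], cum / cum[-1]))

# 78 five-minute buckets, roughly a 6.5-hour session (an assumption)
curve = simulate_gamma_bridge(n_steps=78)
print(curve[:5], curve[-1])
```

A higher `concentration` pulls the simulated curve closer to the straight line from 0 to 1. For a gamma process, the normalised path is independent of the terminal value, which fits the independence assumption listed above.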

    
### 3. Takashi Kato. VWAP execution as an optimal strategy 
  *This paper mainly presents the mathematical model, without detailed explanations.*


### 4. E. Busseti and S. Boyd. Volume Weighted Average Price Optimal Execution
  - Does not assume that the total market volume is known and instead treats it as a random variable. 
  - Simulates execution on data from the NYSE stock market. Specifically, the authors use the K = 30 different stocks which make up the Dow Jones Industrial Average (DJIA), on N = 60 market days corresponding to the last quarter of 2012. **Raw Trade and Quotes (TAQ) data from Wharton Research Data Services (WRDS)**
  - Reduces the RMSE of VWAP tracking by 10%. This is highly significant and could improve with a more sophisticated market volume model. On the other hand, execution costs can be lowered by around 25%. *I checked their code and could not find where this number comes from, but it is worth testing.*

### 5. Kolm et al. Deep order flow imbalance: Extracting alpha at multiple horizons from the limit order book 
*with very helpful explanations and evaluation of different ML models*
  - The LSTM model, especially when trained on OF or OFI data, consistently outperforms other models in terms of predictive accuracy across multiple horizons.
  - More complex models, like CNN-LSTM, do not provide significant improvements over the simpler LSTM for short-term predictions.
  - Using OFI as input yields better model stability and accuracy compared to LOB data due to its stationarity and ability to capture market dynamics.

### 7. Ning, Brian, Franco Ho Ting Lin, and Sebastian Jaimungal. 2021. “Double Deep Q-Learning for Optimal Execution.” Applied Mathematical Finance 28 (4): 361–80. 
  - Applies double deep Q-learning to the optimal execution problem.
  - Trains with experience replay: past interactions are stored in a memory, and the network is trained on a random sample from that memory, which mitigates the correlation between consecutive data points.
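
The replay mechanism described above, together with the standard double Q-learning target, can be sketched in plain Python. The class and function names, the transition layout, and the toy numbers are assumptions for illustration; the paper's state features, reward, and network are not reproduced here.

```python
import random
from collections import deque

class ReplayBuffer:
    """Fixed-size memory of past transitions, sampled uniformly at random."""

    def __init__(self, capacity, seed=None):
        self.memory = deque(maxlen=capacity)  # oldest entries drop out first
        self.rng = random.Random(seed)

    def push(self, state, action, reward, next_state, done):
        self.memory.append((state, action, reward, next_state, done))

    def sample(self, batch_size):
        # Uniform sampling breaks the correlation between consecutive steps.
        return self.rng.sample(list(self.memory), batch_size)

    def __len__(self):
        return len(self.memory)

def double_q_target(reward, done, gamma, q_online_next, q_target_next):
    """Double Q-learning target: the online network picks the next action,
    the target network evaluates it."""
    if done:
        return reward
    best = max(range(len(q_online_next)), key=q_online_next.__getitem__)
    return reward + gamma * q_target_next[best]

# Toy usage with made-up transitions
buffer = ReplayBuffer(capacity=3, seed=0)
for t in range(5):
    buffer.push(state=t, action=0, reward=-0.1 * t, next_state=t + 1, done=(t == 4))
batch = buffer.sample(2)
print(len(buffer), batch)
print(double_q_target(1.0, False, 0.5, [0.2, 0.9], [3.0, 1.0]))  # 1.5
```

Using the online network to select the action and the target network to value it is what separates double Q-learning from the plain deep Q-learning target, which takes the maximum over the target network's own estimates.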
