# Mathematical Framework
## Contents
1. Model Assumptions
2. Model Processes
3. Modelling Transcription
4. Modelling Translation
5. Simplified Notation for Transcription and Translation
6. Modelling Resource Coupling
7. Modelling Regulatory Interactions
   1. Promoter Activation
   2. Promoter Repression
   3. Combinatorial Promoters
   4. TF Activation/Inactivation
   5. Annihilation
8. ODE System Summary


## 1. Model Assumptions
1. Gene cassettes are in a plasmid and there are no long-range structural interactions.
2. There are no side reactions between species.
3. Changes in gene and promoter concentration are assumed to have a negligible impact on the expression of the gene, so both are modelled as constant.
4. Concentrations of RNA polymerase (RNAP) and ribosome are relatively low.
5. The behaviour of the system can be approximated by the quasi-steady state (QSS) assumption.
6. The timescale of the system is short, so dilution effects on mRNA can be absorbed into the mRNA decay rate.
7. There is no degradation of protein (although it can be easily added to the dilution term).
8. Growth rate is assumed constant, so dilution is modelled at a constant average rate.
9. Each promoter can be activated or repressed by only one TF, whose cooperativity may vary.
10. Combinatorial promoters exhibit competitive binding of TFs but can only bind one activator and one repressor (although this can be extended).
11. Promoters are assumed to have no "leakiness", to minimise the number of parameters.

## 2. Model Processes
The diagram below shows the processes within a single node, $i$. The 'gene' is the whole gene expression cassette, which includes the actual gene sequence and regulatory sequences such as the promoter, RBS and terminator.
<pre>
        1. Transcription       2. Translation  
               (TX)                 (TL)  
+------------+       +------------+       +------------+  
|            |       |            |       |            |  
|  Gene, g   +--+--> |   mRNA, m  +--+--> | Protein, x |  
|            |  |    |            |  |    |            |  
+------------+  |    +------------+  |    +------------+  
                |                    |  
              RNAP, [p]            Ribosome, [r]  
              + [ATP]              + [ATP]  
</pre>

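- In the equations we use Greek letters for the resources: $\pi$ (pi) for RNAP and $\rho$ (rho) for the ribosome (labelled [p] and [r] in the diagram).
- The chemical reaction equations in the following sections are converted into Ordinary Differential Equations (ODEs).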

## 3. Modelling Transcription
Transcription from the gene, $g_i$, to the mRNA, $m_i$:

\begin{align*}
    g_i + \pi &\xrightleftharpoons[k_{1;i}^{-}]{k_{1;i}^{+}} X_i \\
    X_i &\xrightarrow{k^{\theta}_{1;i}} g_i + \pi + m_i \\
    m_i &\xrightarrow{\delta_i} \varnothing  
\end{align*}


- $g_i$ represents the concentration of gene $i$, which is the same as the concentration of its promoter in the model.
- The promoter binds RNAP, $\pi$, at a rate $k_{1;i}^{+}$ to form the transcription-initiation complex, $X_i$.
- The complex dissociates back into $g_i$ and $\pi$ at a rate $k_{1;i}^{-}$.
- $g_i$ is then transcribed at a rate $k^{\theta}_{1;i}$ into mRNA, $m_i$. This is modelled as the complex converting back into gene $i$ and RNAP while producing the associated mRNA, $m_i$.
- $m_i$ degrades at a rate $\delta_i$.

## 4. Modelling Translation
Translation of the mRNA, $m_i$, to the protein, $x_i$:

\begin{align*}
    m_i + \rho &\xrightleftharpoons[k_{2;i}^{-}]{k_{2;i}^{+}} \Xi_i \\
    \Xi_i &\xrightarrow{k^{\theta}_{2;i}} m_i + \rho + x^{\omega}_i \\
    x^{\omega}_i &\xrightarrow{\lambda} \varnothing  
\end{align*}

- $m_i$ binds to a ribosome, $\rho$, at a rate $k_{2;i}^{+}$ to form the translation-initiation complex, $\Xi_i$. $\rho$ dissociates from $m_i$ at a rate $k_{2;i}^{-}$.
- $m_i$ is then translated at a rate $k^{\theta}_{2;i}$ to produce the protein, $x^{\omega}_i$, while returning the mRNA and ribosome.
- $x^{\omega}_i$ is removed by dilution at a rate $\lambda$.
- $\omega \in \{0, +, -\}$, where $x^+_i$ represents a TF that activates another node, $x^-_i$ represents a TF that represses another node and $x^0_i$ represents a protein with no regulatory function or a TF with reduced regulatory functions.

## 5. Simplified Notation for Transcription and Translation
Because transcription and translation have a similar form, we can derive a net rate constant for both of them. First, we simplify the notation:
\begin{align*}
    A_i + R &\xrightleftharpoons[k_{i}^{-}]{k_{i}^{+}} C_i \\
    C_i &\xrightarrow{k^{\theta}_{i}} A_i + R + B_i \\
    B_i &\xrightarrow{\Delta} \varnothing  
\end{align*}

- The substrate is denoted by $A_i \in \{g_i, m_i\}$ and the product by $B_i \in \{m_i, x_i\}$.
- The transcriptional and translational resources are denoted by $R \in \{\pi, \rho\}$.
- $k_{i}^+ \in \{k_{1;i}^+, k_{2;i}^+\}$, $k_{i}^- \in \{k_{1;i}^-, k_{2;i}^-\}$, $k_{i}^\theta \in \{k_{1;i}^\theta, k_{2;i}^\theta\}$.
- $\Delta \in \{\delta_i, \lambda\}$ represents degradation/dilution.

## 6. Modelling Resource Coupling
The effective rate constants follow [Characterisation and mitigation of gene expression burden in mammalian cells](https://www.nature.com/articles/s41467-020-18392-x) by Frei *et al.*

Under the QSS assumption, the effective rate constant is:
\begin{align*}
        k_{\text{eff}, i} &= \frac{k_{i}^{\theta} \cdot k^{m}_{i} \cdot R_T}{1 + \sum_j k^{m}_{j} \cdot A_j} \\
        k_{\text{eff}, i} &\in \{k_{\text{eff}, 1; i}, k_{\text{eff}, 2; i}\}
\end{align*}

This is the effective rate constant for the production of $B_i$ from $A_i$; it accounts for the limited availability of the resource, $R$. The total amount of $R$ present is denoted by $R_T$.

The term $J = \sum_j k^{m}_{j} \cdot A_j$ is the _resource demand coefficient_. 

We define the "binding constant":
\begin{align*}
        k^m_i &= \frac{k_{i}^+}{k_{i}^- + k^{\theta}_{i}} \\
        k^m_i &\in \{k^m_{1;i}, k^m_{2;i}\}
\end{align*}
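
As a sketch of where the effective rate constant comes from, one can apply the QSS assumption to the complex $C_i$ and assume that every resource molecule is either free or bound in one of the complexes $C_j$. That conservation law is an assumption made here for illustration; it is not stated explicitly above.

\begin{align*}
    0 \approx \dot{C}_i &= k_i^{+} \cdot A_i \cdot R - (k_i^{-} + k_i^{\theta}) \cdot C_i \quad \Rightarrow \quad C_i = k^m_i \cdot A_i \cdot R \\
    R_T &= R + \sum_j C_j = R \cdot \Big(1 + \sum_j k^m_j \cdot A_j\Big) \quad \Rightarrow \quad R = \frac{R_T}{1 + \sum_j k^m_j \cdot A_j} \\
    k_i^{\theta} \cdot C_i &= \frac{k_i^{\theta} \cdot k^m_i \cdot R_T}{1 + \sum_j k^m_j \cdot A_j} \cdot A_i = k_{\text{eff}, i} \cdot A_i
\end{align*}

The last line is the production rate of $B_i$, which has the form of a first-order reaction in $A_i$ with rate constant $k_{\text{eff}, i}$.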

Therefore, transcription and translation reactions take the form:

\begin{align*}
        A_i &\xrightarrow{k_{\text{eff}, i}} B_i \\
        B_i &\xrightarrow{\Delta} \varnothing
\end{align*}
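
The resource coupling can be illustrated numerically. The following is a minimal Python sketch, not the project's implementation: the function names `binding_constant` and `effective_rates` and all parameter values are hypothetical and chosen only to show that adding a second node lowers the effective rate constant of the first.

```python
def binding_constant(k_plus, k_minus, k_theta):
    """k^m = k^+ / (k^- + k^theta)."""
    return k_plus / (k_minus + k_theta)


def effective_rates(k_theta, k_m, substrates, r_total):
    """Return k_eff for every node that shares one resource pool.

    k_theta, k_m and substrates are equal-length lists indexed by node.
    """
    # Resource demand coefficient J = sum_j k^m_j * A_j
    demand = sum(km * a for km, a in zip(k_m, substrates))
    return [kt * km * r_total / (1.0 + demand) for kt, km in zip(k_theta, k_m)]


# Illustrative numbers only (not fitted parameters).
k_m = [binding_constant(1.0, 9.0, 1.0), binding_constant(2.0, 9.0, 1.0)]

# Node 0 on its own, then node 0 sharing the resource with node 1.
alone = effective_rates([1.0], [k_m[0]], [5.0], r_total=10.0)
shared = effective_rates([1.0, 1.0], k_m, [5.0, 5.0], r_total=10.0)

print("k_eff of node 0 alone: ", alone[0])
print("k_eff of node 0 shared:", shared[0])
```

Because every node contributes to the same denominator, the effective rate constant of node 0 drops when node 1 is added, even though none of node 0's own parameters changed.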

## 7. Modelling Regulatory Interactions
Regulatory interactions modelled:
1. Activation of a promoter by transcription factor (TF) &rarr; TF that activates denoted by $x^+_j$
2. Repression of a promoter by TF &rarr; TF that represses denoted by $x^-_j$
3. Competitive activation and repression of promoter by TF (*lac* system)
4. Activation of TF by inducer
5. Inactivation of TF by inducer
6. Annihilation reactions (mRNA and protein)

The models are a simplified form of those used in [Resource Competition Shapes the Response of Genetic Circuits](https://pubs.acs.org/doi/10.1021/acssynbio.6b00361) by Qian _et al._, based on the forms in [Biomolecular Feedback Systems](https://www.degruyter.com/document/doi/10.1515/9781400850501/html) by Del Vecchio and Murray.

### 7.1 Promoter Activation
The reactions for transcription of a gene with an activated promoter are:
\begin{align*}
    g_i + n_j \cdot x_j^+ &\xrightleftharpoons[\hat{k}_{a;i}^-]{\hat{k}_{a;i}^+} c_i \\
    c_i + \pi &\xrightleftharpoons[k_{1;i}^-]{k_{1;i}^+} X_i \\
    X_i &\xrightarrow{k^{\theta}_{1;i}} c_i + \pi + m_i
\end{align*}

Here the promoter (connected to a gene) $g_i$ binds $n_j$ molecules of the TF $x_j^+$ to form the activated complex, $c_i$, before transcription occurs.

By using the QSS assumption, the concentration of the activated complex is:
\begin{align*}
    c_i &=  g^T_i \cdot \alpha(x^+_j, n_j)
\end{align*}

Where $g^T_i$ is the total concentration of the gene present and we define the Hill function associated with activation as:

\begin{align*}
    \alpha(x^+_j, n_j) = \frac{\hat{k}^m_{a;i} \cdot (x^+_j)^{n_j}}{1 + \hat{k}^m_{a;i} \cdot (x^+_j)^{n_j}}
\end{align*}

Where we define the binding constant:
\begin{align*}
        \hat{k}^m_{a;i} = \frac{\hat{k}_{a;i}^+}{\hat{k}_{a;i}^-} 
\end{align*}
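
The activation Hill function can be evaluated directly. This is a small Python sketch with a hypothetical function name and illustrative values; it shows that $\alpha$ is zero without activator, half-maximal where $\hat{k}^m_{a;i} \cdot (x^+_j)^{n_j} = 1$, and saturates towards one.

```python
def hill_activation(x_act, n, k_hat_m):
    """alpha(x^+, n) = K * x^n / (1 + K * x^n), with K the binding constant."""
    bound = k_hat_m * x_act ** n
    return bound / (1.0 + bound)


# Illustrative values only: K = 0.25 and n = 2 give half-activation at x = 2.
k_hat_m, n = 0.25, 2
for x in (0.0, 2.0, 20.0):
    print(x, hill_activation(x, n, k_hat_m))
```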

### 7.2 Promoter Repression
Similarly for the transcription of a gene with a repressed promoter:
\begin{align*}
    g_i + n_j \cdot x_j^- &\xrightleftharpoons[\hat{k}_{r;i}^-]{\hat{k}_{r;i}^+} c^0_i \\
    g_i + \pi &\xrightleftharpoons[k_{1;i}^-]{k_{1;i}^+} X_i \\
    X_i &\xrightarrow{k^{\theta}_{1;i}} g_i + \pi + m_i
\end{align*}

Here the promoter (connected to a gene) $g_i$ binds $n_j$ molecules of the TF $x_j^-$ to form the repressed complex, $c^0_i$. The repressed complex cannot bind RNAP, so only the free promoter is transcribed.

By using the QSS assumption, the concentration of the free (unrepressed) gene/promoter is:
\begin{align*}
    g_i &= g^T_i \cdot \beta(x^-_j, n_j)
\end{align*}

Where we define the Hill-function associated with repression as: 
\begin{align*}
    \beta(x^-_j, n_j) = \frac{1}{1 + \hat{k}^m_{r;i} \cdot (x^-_j)^{n_j}}
\end{align*}

And the binding constant is:
\begin{align*}
    \hat{k}^m_{r;i} = \frac{\hat{k}_{r;i}^+}{\hat{k}_{r;i}^-} 
\end{align*}
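
The repression Hill function is the mirror image of the activation case. A minimal Python sketch, with a hypothetical function name and illustrative values, shows that $\beta$ equals one without repressor and falls towards zero as the repressor accumulates.

```python
def hill_repression(x_rep, n, k_hat_m):
    """beta(x^-, n) = 1 / (1 + K * x^n), with K the binding constant."""
    return 1.0 / (1.0 + k_hat_m * x_rep ** n)


# Illustrative values only: K = 0.25 and n = 2 give half-repression at x = 2.
k_hat_m, n = 0.25, 2
for x in (0.0, 2.0, 20.0):
    print(x, hill_repression(x, n, k_hat_m))
```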



### 7.3 Combinatorial Promoters
For a combinatorial promoter with competitive binding of activators/repressors the reactions are a mix of the two:
\begin{align*}
    g_i + n_j \cdot x_j^- &\xrightleftharpoons[\hat{k}_{r;i}^-]{\hat{k}_{r;i}^+} c^0_i \\
    g_i + n_k \cdot x_k^+ &\xrightleftharpoons[\hat{k}_{a;i}^-]{\hat{k}_{a;i}^+} c_i \\
    c_i + \pi &\xrightleftharpoons[k_{1;i}^-]{k_{1;i}^+} X_i \\
    X_i &\xrightarrow{k^{\theta}_{1;i}} c_i + \pi + m_i
\end{align*}

By using the QSS approximation again, the concentration of the activated complex is:
\begin{align*}
    c_i = g^T_i \cdot \gamma(x^+_k, n_k, x^-_j, n_j)
\end{align*}

Where we define the associated Hill function, $\gamma(x^+_k, n_k, x^-_j, n_j)$, as:
\begin{align*}
\gamma(x^+_k, n_k, x^-_j, n_j) = \frac{\hat{k}^m_{a;i} \cdot (x^+_k)^{n_k}}{1 + \hat{k}^m_{a;i} \cdot (x^+_k)^{n_k} + \hat{k}^m_{r;i} \cdot (x^-_j)^{n_j}}
\end{align*}
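
The competitive form can be checked numerically. In this Python sketch (hypothetical function name, illustrative values), the activator and repressor each carry their own concentration, cooperativity and binding constant, and the function reduces to the plain activation Hill function when no repressor is present.

```python
def hill_combinatorial(x_act, n_act, x_rep, n_rep, k_a, k_r):
    """gamma = K_a x_a^n_a / (1 + K_a x_a^n_a + K_r x_r^n_r)."""
    act = k_a * x_act ** n_act
    rep = k_r * x_rep ** n_rep
    return act / (1.0 + act + rep)


# Illustrative values only.
no_repressor = hill_combinatorial(2.0, 2, 0.0, 2, k_a=0.25, k_r=0.5)
with_repressor = hill_combinatorial(2.0, 2, 2.0, 2, k_a=0.25, k_r=0.5)

print("gamma without repressor:", no_repressor)
print("gamma with repressor:   ", with_repressor)
```

The repressor only enters the denominator, which is what makes the binding competitive: it lowers the fraction of promoter in the activated state without changing the activator's own binding constant.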

### 7.4 TF Activation/Inactivation
TF activation/inactivation by inducers is modelled by the following reaction:
\begin{align*}
    x_i^0 + n_i \cdot I_i &\xrightleftharpoons[\kappa_i^-]{\kappa_i^+} x_i^{+,-}
\end{align*}

- Where $I_i$ represents an inducer that activates/inactivates a TF and $n_i$ is its corresponding cooperativity coefficient. 
- $n_i$ number of $I_i$ molecules bind to the TF $x^0_i$ at a rate $\kappa_i^+$ and dissociate at a rate $\kappa_i^-$. 
- These equations change the ODE equations for protein production (translation).

### 7.5 Annihilation
Annihilation is a specific reaction that is necessary to model integral controllers. It is represented by the following reactions:
\begin{align*}
    m_i + m_j &\xrightleftharpoons[k^{-}_{\emptyset_m;i,j}]{k^{+}_{\emptyset_m;i,j}} m_\emptyset \\
    x^{\omega}_i + x^{\omega}_j &\xrightleftharpoons[k^{-}_{\emptyset_x;i,j}]{k^{+}_{\emptyset_x;i,j}} x_\emptyset
\end{align*}

- Where $m_\emptyset$ and $x_\emptyset$ represent the inert dimers that result from the annihilation reaction. 
- $k^{+}_{\emptyset_m;i,j}$ and $k^{+}_{\emptyset_x;i,j}$ represent the forward reaction rates for mRNA and protein respectively. 
- $k^-_{\emptyset_m;i,j}$ and $k^-_{\emptyset_x;i,j}$, similarly, represent the reverse reaction rates.
- These chemical equations modify the ODE equations for transcription and translation. 
- For an "annihilation" reaction, the binding affinity between the two reactants is typically very high, so it can also be modelled as an irreversible reaction with a measured rate.
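
The mass-action terms implied by these reactions can be sketched in Python. The function name and values below are hypothetical, and the rate of change of the inert dimer is an assumption that follows from mass action rather than an equation given in this document. The forward reaction consumes both reactants, so its term enters their rates with a negative sign.

```python
def annihilation_terms(a, b, dimer, k_plus, k_minus):
    """Mass-action terms for a + b <-> dimer.

    Returns the contributions to (da/dt, db/dt, d(dimer)/dt).
    """
    net_forward = k_plus * a * b - k_minus * dimer
    return -net_forward, -net_forward, net_forward


# Illustrative values only; k_minus = 0 gives the irreversible limit.
d_mi, d_mj, d_dimer = annihilation_terms(
    a=2.0, b=3.0, dimer=0.5, k_plus=10.0, k_minus=0.0
)
print(d_mi, d_mj, d_dimer)
```

Each reactant's loss is matched by the dimer's gain, so the annihilation terms alone conserve the total of reactant plus dimer.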

## 8. ODE System Summary
### Constitutively Expressed Node

\begin{align}
    \dot{m}_i &= k_{\text{eff}, 1; i} \cdot g^T_i - \delta_i \cdot m_i \\
    \dot{x}^{\omega}_i &= k_{\text{eff}, 2; i} \cdot m_i - \lambda \cdot x_i^{\omega}
\end{align}
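
As an illustration, the constitutive node can be integrated numerically and compared with its analytical steady state, $m_i^* = k_{\text{eff}, 1; i} \cdot g^T_i / \delta_i$ and $x_i^* = k_{\text{eff}, 2; i} \cdot m_i^* / \lambda$. The Python sketch below is only that: it treats both effective rate constants as fixed numbers (ignoring how they change with resource demand), uses a hand-written fourth-order Runge-Kutta step, and all names and parameter values are hypothetical.

```python
def constitutive_rhs(state, k_eff1, k_eff2, g_total, delta, lam):
    """Right-hand side of the constitutive node ODEs for state = (m, x)."""
    m, x = state
    dm = k_eff1 * g_total - delta * m
    dx = k_eff2 * m - lam * x
    return (dm, dx)


def rk4_step(f, state, dt):
    """One classical fourth-order Runge-Kutta step for an autonomous system."""
    k1 = f(state)
    k2 = f(tuple(s + 0.5 * dt * k for s, k in zip(state, k1)))
    k3 = f(tuple(s + 0.5 * dt * k for s, k in zip(state, k2)))
    k4 = f(tuple(s + dt * k for s, k in zip(state, k3)))
    return tuple(
        s + dt * (a + 2 * b + 2 * c + d) / 6.0
        for s, a, b, c, d in zip(state, k1, k2, k3, k4)
    )


# Illustrative parameters only (arbitrary units).
params = dict(k_eff1=2.0, k_eff2=5.0, g_total=1.0, delta=0.5, lam=0.1)


def rhs(state):
    return constitutive_rhs(state, **params)


state = (0.0, 0.0)
dt = 0.05
for _ in range(4000):  # integrate from t = 0 to t = 200
    state = rk4_step(rhs, state, dt)

# Analytical steady state for comparison.
m_ss = params["k_eff1"] * params["g_total"] / params["delta"]
x_ss = params["k_eff2"] * m_ss / params["lam"]
print("simulated:", state)
print("analytic: ", (m_ss, x_ss))
```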

### Activated Node

\begin{align}
    \dot{m}_i &= k_{\text{eff}, 1; i} \cdot g^T_i \cdot \alpha(x^+_j, n_j) - \delta_i \cdot m_i \\
    \dot{x}^{\omega}_i &= k_{\text{eff}, 2; i} \cdot m_i - \lambda \cdot x_i^{\omega} 
\end{align}
        
### Repressed Node

\begin{align}
    \dot{m}_i &= k_{\text{eff}, 1; i} \cdot g^T_i \cdot \beta(x^-_j, n_j) - \delta_i \cdot m_i \\
    \dot{x}^{\omega}_i &= k_{\text{eff}, 2; i} \cdot m_i - \lambda \cdot x_i^{\omega} 
\end{align}

### Combinatorial Node

\begin{align}
    \dot{m}_i &= k_{\text{eff}, 1; i} \cdot g^T_i \cdot \gamma(x^+_k, n_k, x^-_j, n_j) \\
    &- \delta_i \cdot m_i \\
    \dot{x}^{\omega}_i &= k_{\text{eff}, 2; i} \cdot m_i - \lambda \cdot x_i^{\omega} 
\end{align}

### TF Activation

\begin{align}
    \dot{x}^0_i &= k_{\text{eff}, 2; i} \cdot m_i - \lambda \cdot x_i^0 \\
    &- \kappa_i^+ \cdot x^0_i \cdot (I_i)^{n_i} + \kappa_i^- \cdot x^{+,-}_i \\
    \dot{x}^{+,-}_i &= \kappa_i^+ \cdot x^{0}_i \cdot (I_i)^{n_i} - \kappa_i^- \cdot x^{+,-}_i \\
    &- \lambda \cdot x_i^{+,-}
\end{align}
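
The TF activation equations can be written as a right-hand-side function. The Python sketch below uses hypothetical names and illustrative values, and assumes the inducer concentration is supplied as a fixed input. A useful check is that the binding and unbinding terms cancel when the two equations are added, so the total TF pool obeys $k_{\text{eff}, 2; i} \cdot m_i - \lambda \cdot (x^0_i + x^{+,-}_i)$.

```python
def tf_activation_rhs(x0, x_active, m, inducer,
                      k_eff2, lam, kappa_plus, kappa_minus, n):
    """Return (dx0/dt, dx_active/dt) for a TF activated by an inducer."""
    binding = kappa_plus * x0 * inducer ** n   # x0 + n*I -> active TF
    unbinding = kappa_minus * x_active         # active TF -> x0 + n*I
    dx0 = k_eff2 * m - lam * x0 - binding + unbinding
    dx_active = binding - unbinding - lam * x_active
    return dx0, dx_active


# Illustrative values only.
x0, x_active, m = 3.0, 1.0, 2.0
k_eff2, lam = 1.5, 0.1
dx0, dx_active = tf_activation_rhs(
    x0, x_active, m, inducer=2.0,
    k_eff2=k_eff2, lam=lam, kappa_plus=0.5, kappa_minus=0.2, n=2,
)

# Rate of change of the total TF pool, computed two ways.
total_rate = dx0 + dx_active
expected_total_rate = k_eff2 * m - lam * (x0 + x_active)
print(total_rate, expected_total_rate)
```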

### TF Inactivation

\begin{align*}
    \dot{x}^0_i &= - \kappa_i^+ \cdot x^0_i \cdot (I_i)^{n_i} + \kappa_i^- \cdot x^{+,-}_i \\
    &- \lambda \cdot x_i^0 \\
    \dot{x}^{+,-}_i &= k_{\text{eff}, 2; i} \cdot m_i - \lambda \cdot x_i^{+,-} \\
    &+ \kappa_i^+ \cdot x^{0}_i \cdot (I_i)^{n_i} - \kappa_i^- \cdot x^{+,-}_i 
\end{align*}

### mRNA Annihilation

\begin{align*}
    \dot{m}_i: &- k^{+}_{\emptyset_m;i,j} \cdot m_i \cdot m_j + k^{-}_{\emptyset_m;i,j} \cdot m_\emptyset \\
    \dot{m}_j: &- k^{+}_{\emptyset_m;i,j} \cdot m_i \cdot m_j + k^{-}_{\emptyset_m;i,j} \cdot m_\emptyset
\end{align*}

### Protein Annihilation

\begin{align*}
    \dot{x}^{\omega}_i: &- k^{+}_{\emptyset_x;i,j} \cdot x^{\omega}_i \cdot x^{\omega}_j + k^{-}_{\emptyset_x;i,j} \cdot x_\emptyset \\
    \dot{x}^{\omega}_j: &- k^{+}_{\emptyset_x;i,j} \cdot x^{\omega}_i \cdot x^{\omega}_j + k^{-}_{\emptyset_x;i,j} \cdot x_\emptyset
\end{align*}