by Anna Chechenina, BII, July-September 2023
The goal of the project is to build modelling setup for robust simulations of bulk gene expression patterns given initial parameter values. In order to build such model we designed a set of chemical reactions that describes events leading to gene expression activation or repression. By adjusting rate constants and initial concentration of the species in the system, one can get simulation of number of transcripts over time.
Firstly, we attempted to model expression by modifying transcriptional model based on Gillespie stochastic algorithm presented in [1]. The initial system contains three ordinary differential equations with five kinetic parameters and considers the following events: promotor activation, promotor inactivation, transcription with amplification, and RNA decay:
In our model we want to consider next level of regulation by introducing enhancer and silencer terms as positive and negative regulators of the transcription. We created two modifications of the transcription equation of theinitial model:
First version:
Second version:
For both models we performed simulations and parameter space exploration. However, neith
You can find full code with runs and comments in the notebook ODE_modellng_part1.ipynb.
In order to better visualise the parameter values we made the Ploty application where the user can select initial parameters for the model, run the simulation and check the trajectory. Follow the instruction below to download and run the script:
To get the scripts clone the git repository:
git clone https://github.com/checheanya/DEs_for_expr_modelling.git && cd expression_modelling/part1Create a conda/mamba environment with necessary packages and activate it:
conda env create -f environment.yml
conda activate expr_modellingTo run the script, just call it from the directory where the tool is located:
python lineplots_2equations.py for plotting the line plots and
python scatterplots_3equations.py for plotting the scatterplots. Then proceed to the link that appears on the screen.
Since the model described above was not giving us results close to the real-world data, we decided to build another more complex model taking into account both chromatin state and regulators. In this model we assume four set of reactions leading to the gene expression activation or silencing. First set of reactions corresponds to the chromatin state. For both regulatory regions and promoters we are assuming two dynamic states: open and closed. Closed state correspond to the hetero-chromatin with nucleosomes occupying the region and so no transcripts can be produced by the polymerase. Open state, on the contrary, corresponds to the euchromatin, when promotors or regulatory elements are located in the inter nucleosomic regions accesible fo the polymerase.
Full set of discussed reactions and equations can be found in the file part2/Reactions_and_equations.pdf.
Simulation experiments were performed using Python library GillesPy2 that allows to construct systems of chemical kinetics reactions and converts it the system of differential equations for each species.
For full code, simulations and comments, please consult the notebook in the part2 section. This notebook is quite extensive and covers all analysis from building the model to parameter exploration and actual simulations. You can find the contents in provided notebook helpful for navigating over the document. Since the notebook size is large, you might prefer to view the notebook in Google Colab.
Additionally, you can find a parameter exploration plots here.
Sourses:
- Piras, V., Tomita, M. & Selvarajoo, K. Transcriptome-wide Variability in Single Embryonic Development Cells. Sci Rep 4, 7137 (2014). https://doi.org/10.1038/srep07137