# Tutorial 1: Full reconstruction

In this tutorial we will reconstruct a _Bacillus subtilis_ ME-model from just the input files.

## Import libraries

In [1]:
from IPython.display import display, HTML, Math, Markdown
display(HTML("<style>.container { width:95% !important; }</style>"))

from coralme.builder.main import MEBuilder

## Path to configuration files

For more information about these files, see [Description of inputs](../../docs/BasicInputs.ipynb).

In [2]:
organism = './organism.json'
inputs = './input.json'
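
Before handing these paths to `MEBuilder`, it can help to confirm that both files exist and parse as JSON. The following is a minimal sketch using only the standard library. The helper name `load_config` is hypothetical and not part of coralME, and it assumes nothing about which keys the files contain.

```python
import json
from pathlib import Path


def load_config(path):
    """Return the parsed JSON content of `path`, or raise a clear error.

    Hypothetical helper for illustration; not part of coralME.
    """
    config_path = Path(path)
    if not config_path.is_file():
        raise FileNotFoundError(f"Configuration file not found: {config_path}")
    with config_path.open(encoding="utf-8") as handle:
        # json.load raises json.JSONDecodeError on malformed input
        return json.load(handle)
```

Calling `load_config(organism)` and `load_config(inputs)` returns the parsed content of each file, and a missing or malformed file fails at this point rather than partway through the reconstruction.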

## Create MEBuilder instance

For more information about this class, see [Architecture of coralME](../../docs/coralMEArchitecture.ipynb).

In [3]:
builder = MEBuilder(organism, inputs)

Set parameter Username

--------------------------------------------
--------------------------------------------

Academic license - for non-commercial use only - expires 2024-08-16


## Generate files

This corresponds to the _Synchronize_ and _Complement_ steps in [Architecture of coralME](../../docs/coralMEArchitecture.ipynb).

In [4]:
builder.generate_files(overwrite=True)

Initiating file processing...
~ Processing files for bsubtilis...


Checking M-model metabolites...                                            : 100.0%|██████████|   990/  990 [00:00<00:00]
Checking M-model genes...                                                  : 100.0%|██████████|   844/  844 [00:00<00:00]
Checking M-model reactions...                                              : 100.0%|██████████|  1250/ 1250 [00:00<00:00]
Generating complexes dataframe from optional proteins file...              : 100.0%|██████████|  4554/ 4554 [00:00<00:00]
Syncing optional genes file...                                             : 100.0%|██████████|  4541/ 4541 [00:00<00:00]
Looking for duplicates within datasets...                                  : 100.0%|██████████|     5/    5 [00:00<00:00]
Gathering ID occurrences across datasets...                                : 100.0%|██████████| 10647/10647 [00:00<00:00]
Solving duplicates across datasets...                                      : 0.0%|          |     0/    0 [00:00<?]
Pruning GenBank...            

Reading bsubtilis done.


Gathering M-model compartments...                                          : 100.0%|██████████|     2/    2 [00:00<00:00]
Fixing compartments in M-model metabolites...                              : 100.0%|██████████|   990/  990 [00:00<00:00]
Fixing missing names in M-model reactions...                               : 100.0%|██████████|  1250/ 1250 [00:00<00:00]


~ Processing files for iJL1678b...


Checking M-model metabolites...                                            : 100.0%|██████████|  1660/ 1660 [00:00<00:00]
Checking M-model genes...                                                  : 100.0%|██████████|  1271/ 1271 [00:00<00:00]
Checking M-model reactions...                                              : 100.0%|██████████|  2377/ 2377 [00:00<00:00]
Looking for duplicates within datasets...                                  : 100.0%|██████████|     5/    5 [00:00<00:00]
Gathering ID occurrences across datasets...                                : 100.0%|██████████|  8517/ 8517 [00:00<00:00]
Solving duplicates across datasets...                                      : 0.0%|          |     0/    0 [00:00<?]
Getting sigma factors...                                                   : 100.0%|██████████|     7/    7 [00:00<00:00]
Getting TU-gene associations from optional TUs file...                     : 100.0%|██████████|  1647/ 1647 [00:00<00:00]
Adding protein location...    

Reading iJL1678b done.
~ Running BLAST with 4 threads...


Converting Genbank contigs to FASTA for BLAST...                           : 100.0%|██████████|     5/    5 [00:00<00:00]
Converting Genbank contigs to FASTA for BLAST...                           : 100.0%|██████████|     1/    1 [00:00<00:00]


BLAST done.


Updating translocation machinery from homology...                          : 100.0%|██████████|     9/    9 [00:00<00:00]
Updating protein location from homology...                                 : 100.0%|██████████|   514/  514 [00:00<00:00]
Updating translocation multipliers from homology...                        : 100.0%|██████████|     3/    3 [00:00<00:00]
Updating lipoprotein precursors from homology...                           : 100.0%|██████████|    14/   14 [00:00<00:00]
Updating cleaved-methionine proteins from homology...                      : 100.0%|██████████|   343/  343 [00:00<00:00]
Mapping M-metabolites to E-metabolites...                                  : 100.0%|██████████|   147/  147 [00:00<00:00]
Updating generics from homology...                                         : 100.0%|██████████|    10/   10 [00:00<00:00]
Updating folding from homology...                                          : 100.0%|██████████|     2/    2 [00:00<00:00]
Updating ribosome subrea

File processing done.


## Build ME-model

This corresponds to the _Build_ step in [Architecture of coralME](../../docs/coralMEArchitecture.ipynb).

In [5]:
builder.build_me_model(overwrite=False)

Initiating ME-model reconstruction...


Adding biomass constraint(s) into the ME-model...                          : 100.0%|██████████|    11/   11 [00:00<00:00]

Read LP format model from file /tmp/tmpkmz_3qlj.lp
Reading time = 0.00 seconds
: 990 rows, 2500 columns, 10478 nonzeros





Read LP format model from file /tmp/tmpv5dwkjqf.lp
Reading time = 0.00 seconds
: 990 rows, 2496 columns, 10342 nonzeros


Adding Metabolites from M-model into the ME-model...                       : 100.0%|██████████|  1055/ 1055 [00:00<00:00]
Adding Reactions from M-model into the ME-model...                         : 100.0%|██████████|  1248/ 1248 [00:00<00:00]
Adding Transcriptional Units into the ME-model from user input...          : 100.0%|██████████|  1647/ 1647 [00:09<00:00]
Adding features from contig NC_000964.3 into the ME-model...               : 100.0%|██████████|  4537/ 4537 [00:08<00:00]
Adding features from contig G8J2-183 into the ME-model...                  : 100.0%|██████████|     2/    2 [00:00<00:00]
Adding features from contig G8J2-180 into the ME-model...                  : 100.0%|██████████|     2/    2 [00:00<00:00]
Adding features from contig G8J2-181 into the ME-model...                  : 100.0%|██████████|     2/    2 [00:00<00:00]
Adding features from contig G8J2-182 into the ME-model...                  : 100.0%|██████████|     2/    2 [00:00<00:00]
Updating all Translation

ME-model was saved in the ./ directory as MEModel-step1-bsubtilis.pkl


Adding tRNA synthetase(s) information into the ME-model...                 : 100.0%|██████████|   306/  306 [00:00<00:00]
Adding tRNA modification SubReactions...                                   : 0.0%|          |     0/    0 [00:00<?]
Associating tRNA modification enzyme(s) to tRNA(s)...                      : 0.0%|          |     0/    0 [00:00<?]
Adding SubReactions into TranslationReactions...                           : 100.0%|██████████|  4328/ 4328 [00:01<00:00]
Adding RNA Polymerase(s) into the ME-model...                              : 100.0%|██████████|    38/   38 [00:00<00:00]
Associating a RNA Polymerase to each Transcriptional Unit...               : 100.0%|██████████|  1647/ 1647 [00:00<00:00]
Processing ComplexData in ME-model...                                      : 100.0%|██████████|   312/  312 [00:00<00:00]
Adding ComplexFormation into the ME-model...                               : 100.0%|██████████|  5038/ 5038 [00:00<00:00]
Adding SubReactions into Translation

ME-model was saved in the ./ directory as MEModel-step2-bsubtilis.pkl
ME-model reconstruction is done.
Number of metabolites in the ME-model is 4438 (+348.28%, from 990)
Number of reactions in the ME-model is 7460 (+496.80%, from 1250)
Number of genes in the ME-model is 1105 (+30.92%, from 844)
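
The percentages in this summary are consistent with the relative increase over the M-model counts, that is, (ME - M) / M × 100. The following sketch recomputes them from the numbers printed in the log. The function name `relative_increase` is illustrative only and is not a coralME function.

```python
def relative_increase(new, old):
    """Percent change of `new` relative to `old`."""
    return (new - old) / old * 100.0


# (ME-model count, M-model count) as reported in the log above
counts = {
    "metabolites": (4438, 990),
    "reactions": (7460, 1250),
    "genes": (1105, 844),
}

# Format with an explicit sign and two decimals, matching the log style
summary = {
    name: f"{relative_increase(me, m):+.2f}%" for name, (me, m) in counts.items()
}
for name, change in summary.items():
    print(f"{name}: {change}")
```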


## Troubleshoot ME-model

This corresponds to the _Find gaps_ step in [Architecture of coralME](../../docs/coralMEArchitecture.ipynb).

In [6]:
builder.troubleshoot(growth_key_and_value={builder.me_model.mu: 0.001})

The MINOS and quad MINOS solvers are a courtesy of Prof Michael A. Saunders. Please cite Ma, D., Yang, L., Fleming, R. et al. Reliable and efficient solution of genome-scale models of Metabolism and macromolecular Expression. Sci Rep 7, 40863 (2017). https://doi.org/10.1038/srep40863

~ Troubleshooting started...
  Checking if the ME-model can simulate growth without gapfilling reactions...
  Original ME-model is not feasible with a tested growth rate of 0.001000 1/h
  Step 1. Gapfill reactions to provide components of type 'ME-Deadends' using brute force.
          Finding gaps in the ME-model...
          Finding gaps from the M-model only...
          10 metabolites were identified as deadends.
            cbl1_c: Missing metabolite in the M-model.
            cs_c: Missing metabolite in the M-model.
            cu_c: Missing metabolite in the M-model.
            dad__5_c: Missing metabolite in the M-model.
            dpm_c: Missing metabolite in the M-model.
            fmnh2_c: 
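
For intuition: in constraint-based modeling, a dead-end metabolite is commonly defined as one that is only ever produced or only ever consumed, so it cannot carry steady-state flux. The toy sketch below flags such metabolites in a small hand-written network. It is not coralME's gap-finding algorithm, whose criteria may differ, it ignores reaction reversibility and bounds, and all reaction and metabolite names are invented for illustration.

```python
def find_deadends(reactions):
    """Return metabolites that are only produced or only consumed.

    `reactions` maps a reaction ID to {metabolite ID: stoichiometric coefficient};
    negative coefficients are consumed, positive ones are produced.
    """
    produced, consumed = set(), set()
    for stoichiometry in reactions.values():
        for metabolite, coefficient in stoichiometry.items():
            if coefficient > 0:
                produced.add(metabolite)
            elif coefficient < 0:
                consumed.add(metabolite)
    # Symmetric difference: metabolites seen on one side only across the network
    return sorted(produced ^ consumed)


# Invented toy network: A is imported and converted to B, and B feeds R2 and R3.
# Nothing consumes C, and D is required by R3 but never produced.
toy_network = {
    "EX_A": {"A": 1},
    "R1": {"A": -1, "B": 1},
    "R2": {"B": -1, "C": 1},
    "R3": {"B": -1, "D": -1, "E": 1},
    "EX_E": {"E": -1},
}

deadends = find_deadends(toy_network)
print(deadends)
```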

Copy the troubleshooted model file to the parent directory so that the other tutorials can use it.

In [7]:
!cp ./MEModel-step3-bsubtilis-TS.pkl ../
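
The `!cp` command relies on a Unix-like shell. Where that is unavailable, the same copy can be sketched with the Python standard library. The helper name `copy_to_parent` is hypothetical, and the sketch assumes the file sits in the notebook's working directory, as in the command above.

```python
import shutil
from pathlib import Path


def copy_to_parent(filename):
    """Copy `filename` into the parent of its own directory and return the new path."""
    source = Path(filename).resolve()
    if not source.is_file():
        raise FileNotFoundError(f"Nothing to copy: {source}")
    destination = source.parent.parent / source.name
    # shutil.copy copies the file content and permission bits
    shutil.copy(source, destination)
    return destination
```

In this notebook the call would be `copy_to_parent('./MEModel-step3-bsubtilis-TS.pkl')`.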