# Running the Micom Workflow for the Binary Azotobacter - Rhodosporidium Model

In this notebook we utilize the package `micom` to generate a binary community model for 2 organisms of interest:
- `Azotobacter vinelandii`
- `Rhodosporidium toruloides`

This binary consortium allows us to gain insights into the exchanges between the 2 organisms and run FBA experiments.

First off we can import all necessary packages for this notebook.

In [3]:
import pandas as pd
import cobra

from micom import Community
from micom.workflows import build, grow, tradeoff, fix_medium, build_database
from micom import load_pickle
from micom.viz import plot_tradeoff, plot_exchanges_per_sample, plot_growth

import os
os.environ["GRB_LICENSE_FILE"]

'/Users/mcna892/Desktop/Projects/Digital_Twins/gurobi.lic'

In [4]:
import requests
def load_model_from_git(model_name: str):
    """Helper function for loading a SBML compliant model into CobraPy directly from Git

    Args:
        model_name (str): The name of the model you wish to load

    Returns:
        model (cobra.core.model.Model): A Cobra Model object

    """
    
    model_url_dict = dict([('Rhodosporidium','https://raw.githubusercontent.com/PNNL-CompBio/RToruGEM/main/rtoru/Rhodo_Toru.xml'),
                           ('Azotobacter','https://raw.githubusercontent.com/PNNL-CompBio/iAzotobacterVinelandiiGEM/main/a_vine/azo_vine.xml'),
                           ('Synechococcus','https://raw.githubusercontent.com/PNNL-CompBio/S-elongatus7942/main/syn_elong/syn_elong.xml')])

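    # Download the SBML file and fail early on HTTP errors
    response = requests.get(model_url_dict[model_name])
    response.raise_for_status()

    # Close the file before parsing so that all bytes are flushed to disk
    with open('tmp.xml', 'wb') as f:
        f.write(response.content)

    return cobra.io.read_sbml_model('tmp.xml')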

## Setting Up the Model in MICOM

To begin, we need to import our genome-scale models into `micom`. The models are saved as SBML (`.xml`) files, a format that both `cobrapy` and `micom` read directly.

### Building a Taxonomy

Step #1: Establish a taxonomy table that lists the taxonomy of our organisms of interest.

In [5]:
Tax= pd.DataFrame(columns=['id','genus','species','reactions','metabolites','sample_id','abundance'])
Tax.loc[len(Tax.index)] = ['Azotobacter','Azotobacter','A. vinelandii',2469,2003,'One',500]
Tax.loc[len(Tax.index)] = ['Rhodosporidium', 'Rhodosporidium', 'R. toruloides',2398,2051,'One',500]
Tax

               id           genus        species  reactions  metabolites sample_id  abundance
0     Azotobacter     Azotobacter  A. vinelandii       2469         2003       One        500
1  Rhodosporidium  Rhodosporidium  R. toruloides       2398         2051       One        500


This taxonomy file lists important information for `micom` down the road, such as the number of `reactions` and `metabolites` in the provided models.
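
The `reactions` and `metabolites` counts above are typed in by hand. As a minimal sketch, they could instead be read from the loaded model, assuming the model object exposes `reactions` and `metabolites` collections the way a `cobrapy` model does. The helper name `taxonomy_row` and the stand-in model are hypothetical and only serve to keep the example self-contained.

```python
from types import SimpleNamespace


def taxonomy_row(model_id, genus, species, model, sample_id, abundance):
    """Build one taxonomy row, taking the counts from the model itself."""
    return {
        "id": model_id,
        "genus": genus,
        "species": species,
        "reactions": len(model.reactions),      # number of reactions in the model
        "metabolites": len(model.metabolites),  # number of metabolites in the model
        "sample_id": sample_id,
        "abundance": abundance,
    }


# Stand-in for a loaded model (hypothetical, three reactions and two metabolites)
toy_model = SimpleNamespace(reactions=["R1", "R2", "R3"], metabolites=["A", "B"])
row = taxonomy_row("Toy", "Toy", "T. example", toy_model, "One", 500)
print(row["reactions"], row["metabolites"])  # 3 2
```

With real models, a list of such rows can be passed to `pd.DataFrame(...)` to obtain a table with the same columns as `Tax`.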

### Building a Database

Step #2: Construct a database in which the SBML models are preprocessed and stored. This is done by supplying `micom` with a manifest file that contains the model file paths.

In [48]:
models = ['Azotobacter','Rhodosporidium']
m_names = ['azo_vine.xml','Rhodo_Toru.xml']

for i,j in zip(models, m_names):
    print(f'Writing {i}, from {j}')
    cobra.io.write_sbml_model(load_model_from_git(i), j)
  

Writing Azotobacter, from azo_vine.xml
Writing Rhodosporidium, from Rhodo_Toru.xml


In [49]:
  
db = pd.read_csv('../Manifest_Files/man_av_rt.csv')
print(db)
db['file'] = m_names
print(db)

               file   kingdom          phylum                class  \
0    ./azo_vine.xml  bacteria  Pseudomonadota  Gammaproteobacteria   
1  ./Rt_IFO0880.xml     fungi               a    Ustilaginomycetes   

             order            family           genus        species  
0  Pseudomonadales  Pseudomonadaceae     Azotobacter  A. vinelandii  
1                b                 c  Rhodosporidium  R. toruloides  
             file   kingdom          phylum                class  \
0    azo_vine.xml  bacteria  Pseudomonadota  Gammaproteobacteria   
1  Rhodo_Toru.xml     fungi               a    Ustilaginomycetes   

             order            family           genus        species  
0  Pseudomonadales  Pseudomonadaceae     Azotobacter  A. vinelandii  
1                b                 c  Rhodosporidium  R. toruloides  


In [7]:
db_path = './db_av_rt' 
#build_database(db, db_path)
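
Because the manifest only stores file paths, a quick existence check before building the database can catch stale entries such as the `./Rt_IFO0880.xml` path shown in the first printout. The following is a minimal sketch using only the standard library; `missing_model_files` is a hypothetical helper, not part of `micom`.

```python
from pathlib import Path
import tempfile


def missing_model_files(files, base_dir="."):
    """Return the entries of `files` that do not exist under `base_dir`."""
    base = Path(base_dir)
    return [name for name in files if not (base / name).is_file()]


# Demonstration in a throwaway directory so no real files are touched
with tempfile.TemporaryDirectory() as tmp:
    (Path(tmp) / "azo_vine.xml").write_text("<sbml/>")
    demo_missing = missing_model_files(["azo_vine.xml", "Rhodo_Toru.xml"], base_dir=tmp)

print(demo_missing)  # ['Rhodo_Toru.xml']
```

In this notebook the check could be called as `missing_model_files(db['file'])`; an empty list means every model file listed in the manifest was found.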

### Construct Manifest Object

Step #3: Now that we have the Taxonomy and Database constructed we can build our community model. This is done by using the `build()` method in `micom`.

__Note__: This step builds the manifest object from the taxonomy DataFrame and the corresponding database directory. Skip it if the manifest has already been built and saved to the output folder (`models_av_rt` in this notebook).


__IMPORTANT__: Declare the Solver you would like to use for this Community model here:
- osqp (good for smaller models)
- gurobi
- glpk
- cplex
- scipy


In [8]:
manifest = build(
    Tax, 
    out_folder="models_av_rt", 
    model_db=db_path, 
    cutoff=0.0001, 
    threads=10,
    solver='gurobi'
)
manifest

Set parameter TokenServer to value "leghorn.emsl.pnl.gov"


  sample_id  abundance        file  found_taxa  total_taxa  found_fraction  found_abundance_fraction
0       One        500  One.pickle         2.0         2.0             1.0                       1.0


## Running the Models with FBA

Now that we have the `manifest`, we can load the model as a `Community` object through `micom`. This will give us some functionality similar to that of `cobrapy`. This can be done with the `load_pickle()` method we imported above through `micom`.

In [9]:
community = load_pickle("models_av_rt/One.pickle")
print(len(community.reactions))

Set parameter TokenServer to value "leghorn.emsl.pnl.gov"
Read LP format model from file /var/folders/1f/ksln774x1hd1pzfgsjgpxt7r0000gn/T/tmpsorrjdrm.lp
Reading time = 0.03 seconds
: 4506 rows, 10633 columns, 41351 nonzeros
5316


### Exploring the Model Attributes

Our new variable `community` behaves very similarly to a standard `cobrapy` model, and we can explore its attributes in the same way.

Things such as `reactions` and `metabolites`:

In [10]:
#community.reactions

In [11]:
#community.metabolites

and, importantly, the `medium`:

In [12]:
community.medium

{'EX_pi_m': 999999.0,
 'EX_h_m': 999999.0,
 'EX_fe3_m': 999999.0,
 'EX_mn2_m': 999999.0,
 'EX_fe2_m': 999999.0,
 'EX_glc__D_m': 5.0,
 'EX_zn2_m': 999999.0,
 'EX_mg2_m': 999999.0,
 'EX_ca2_m': 999999.0,
 'EX_ni2_m': 999999.0,
 'EX_cu2_m': 999999.0,
 'EX_cobalt2_m': 999999.0,
 'EX_sel_m': 999999.0,
 'EX_h2o_m': 999999.0,
 'EX_nh4_m': 999999.0,
 'EX_mobd_m': 999999.0,
 'EX_so4_m': 999999.0,
 'EX_k_m': 999999.0,
 'EX_na1_m': 999999.0,
 'EX_o2_m': 999999.0,
 'EX_cl_m': 999999.0,
 'EX_tungs_m': 999999.0,
 'EX_slnt_m': 999999.0}

This behavior mimics the medium in `cobrapy`, but combines both models' media into one.
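
Most entries in the medium carry the same very large bound, so the components that actually constrain growth are easy to miss. A small sketch for listing them follows, assuming that `999999.0` marks an effectively unlimited component, as in the output above. `limited_components` is a hypothetical helper and the dictionary is an excerpt of the printed medium.

```python
UNLIMITED = 999999.0  # bound used for effectively unlimited components above


def limited_components(medium, unlimited=UNLIMITED):
    """Return the medium entries whose bound is below the 'unlimited' value."""
    return {rxn: bound for rxn, bound in medium.items() if bound < unlimited}


# Excerpt of the medium printed above
medium_excerpt = {
    "EX_pi_m": 999999.0,
    "EX_glc__D_m": 5.0,
    "EX_nh4_m": 999999.0,
    "EX_o2_m": 999999.0,
}
print(limited_components(medium_excerpt))  # {'EX_glc__D_m': 5.0}
```

Applied to the full `community.medium`, this returns only the glucose exchange, since it is the only entry with a bound other than `999999.0`.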

### Running Optimization

Now that the model is loaded, we can run standard `FBA` using `optimize()`. By default, `optimize()` does not return any fluxes, so we pass `fluxes=True` to get them. We also pass `pfba=True` so that the returned fluxes come from parsimonious FBA.

In [13]:
community.abundances

id
Azotobacter       0.5
Rhodosporidium    0.5
Name: abundance, dtype: float64

In [14]:
result = community.optimize(fluxes=True,pfba=True)
result

                abundance  growth_rate  reactions  metabolites
compartments
Azotobacter           0.5     0.880683       2469         2003
Rhodosporidium        0.5     0.000000       2398         2051
medium                NaN          NaN        449          449
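
The table reports per-organism growth rates only. Assuming the usual MICOM definition of the community growth rate as the abundance-weighted sum of the individual growth rates, it can be recomputed from the two columns above. This is a sketch with a hypothetical helper name, not a call into `micom`.

```python
def community_growth_rate(abundances, growth_rates):
    """Abundance-weighted sum of the per-taxon growth rates."""
    return sum(abundances[taxon] * growth_rates[taxon] for taxon in abundances)


# Values copied from the result table above
abundances = {"Azotobacter": 0.5, "Rhodosporidium": 0.5}
growth_rates = {"Azotobacter": 0.880683, "Rhodosporidium": 0.0}

mu_c = community_growth_rate(abundances, growth_rates)
print(f"community growth rate: {mu_c:.4f}")
```

Because Rhodosporidium contributes zero here, the community value is half of the Azotobacter growth rate.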


We can see that Azotobacter has a non-zero growth rate (0.88), whereas Rhodosporidium does not grow (0.0) under these conditions, so the community growth is carried entirely by Azotobacter. Let's check the fluxes.

In [15]:
result.fluxes.T.loc['EX_glyc__R_e']

compartment
Azotobacter      -39.180828
Rhodosporidium    39.180828
medium                  NaN
Name: EX_glyc__R_e, dtype: float64
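
To read such a flux row at a glance, the signs can be translated into roles. The sketch below assumes the common COBRA convention that a negative exchange flux means uptake and a positive one means secretion; `exchange_roles` is a hypothetical helper and the numbers are copied from the output above.

```python
import math


def exchange_roles(fluxes, tol=1e-9):
    """Label each compartment by the sign of its exchange flux.

    Assumes negative flux = uptake and positive flux = secretion.
    NaN entries (reaction absent in that compartment) are skipped.
    """
    roles = {}
    for compartment, flux in fluxes.items():
        if math.isnan(flux):
            continue
        if flux < -tol:
            roles[compartment] = "uptake"
        elif flux > tol:
            roles[compartment] = "secretion"
        else:
            roles[compartment] = "inactive"
    return roles


# Fluxes of EX_glyc__R_e from the output above
glyc_r_fluxes = {
    "Azotobacter": -39.180828,
    "Rhodosporidium": 39.180828,
    "medium": float("nan"),
}
print(exchange_roles(glyc_r_fluxes))
# {'Azotobacter': 'uptake', 'Rhodosporidium': 'secretion'}
```

Under that convention, the two fluxes are equal and opposite, which is consistent with the metabolite being passed from Rhodosporidium to Azotobacter rather than exchanged with the medium.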

### Testing

In [17]:
#community.reactions.BIOMASS_Av_DJ_core__Azotobacter.upper_bound
#community.reactions.BIOMASS_RT__Rhodosporidium.upper_bound
#community.reactions.BIOMASS_Av_DJ_core__Azotobacter

1.0

In [18]:
community.set_abundance([1,1],normalize=False)

In [20]:
community.optimize(fluxes=True,pfba=True)

                abundance  growth_rate  reactions  metabolites
compartments
Azotobacter           1.0     0.424491       2469         2003
Rhodosporidium        1.0     0.000000       2398         2051
medium                NaN          NaN        449          449


#### Changing parts of the medium to test its effect on growth

Now that we can successfully optimize the community model, we can begin altering the model's medium and seeing how it changes the (community) growth rate.

First, let's make a copy of the original medium so that we can restore it after making changes.

In [24]:
medium_bkp = community.medium

Now we can make changes to the medium. The following cell is meant to be re-run after each change: it first restores the medium to the original and then applies the new values.

In [None]:
# Restore medium to original
community.medium = medium_bkp

# Set variable to become new medium
medium_to_change = community.medium

#Add or subtract reactions
medium_to_change["EX_xyl__D_m"] = 0
medium_to_change["EX_glc__D_m"] = 5
#medium_to_change["EX_glyc__R_m"] = 0
medium_to_change["EX_nh4_m"] = 0
medium_to_change["EX_n2_m"] = 5


# Set the new medium as the model's medium
community.medium = medium_to_change
community.medium
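
The restore-then-modify pattern above can also be written as a small function that never touches the backup. This is a sketch under the assumption that the medium is a plain dictionary of exchange IDs and bounds; `with_overrides` is a hypothetical helper.

```python
def with_overrides(base_medium, overrides):
    """Return a copy of `base_medium` with `overrides` applied on top."""
    updated = dict(base_medium)  # shallow copy, so the backup stays unchanged
    updated.update(overrides)
    return updated


# Excerpt of the original medium and the changes made in the cell above
backup = {"EX_glc__D_m": 5.0, "EX_nh4_m": 999999.0}
changes = {"EX_xyl__D_m": 0, "EX_glc__D_m": 5, "EX_nh4_m": 0, "EX_n2_m": 5}

new_medium = with_overrides(backup, changes)
print(new_medium)
print(backup)  # still the original values
```

In the notebook, this would be used as `community.medium = with_overrides(medium_bkp, changes)`, which makes each re-run start from the same baseline.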

Now that the medium is changed, we can rerun the model optimization.

In [None]:
result_altered_medium = community.optimize(fluxes=True, pfba=True)

In [None]:
result_altered_medium

In [None]:
community.reactions.EX_glyc__R_e__Rhodosporidium

In [None]:
community.reactions.EX_glyc__R_e__Azotobacter

In [None]:
result_altered_medium.fluxes.T.loc['EX_glyc__R_e']

In [None]:
result_altered_medium.fluxes.T.loc['EX_glc__D_e']

In [None]:
result_altered_medium.fluxes.T.loc['EX_co2_m']

In [None]:
x = result_altered_medium.fluxes.T.loc[result_altered_medium.fluxes.T.Azotobacter.index.str.startswith('EX_')]

In [None]:
from IPython.display import HTML

In [None]:
x = x.fillna(0)
# Show only the exchanges with a non-zero flux in the shared medium
HTML(x[x.medium != 0].sort_values('medium').to_html())

In [None]:
x = x.fillna(0)
# Show only the exchanges that are active in at least one organism
HTML(x[(x.Azotobacter != 0) | (x.Rhodosporidium != 0)].sort_values('Azotobacter').to_html())


### Testing Abundances

The abundances can be changed in place with `set_abundance()`. Passing `normalize=False` keeps the values as given instead of rescaling them to sum to 1.

In [None]:
community.set_abundance([1,1],normalize=False)
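
For comparison, the taxonomy abundances of 500 and 500 appeared as 0.5 and 0.5 in `community.abundances`, which is what rescaling to relative abundances gives. A minimal sketch of that rescaling follows; `normalize_abundances` is a hypothetical helper, not the `micom` implementation.

```python
def normalize_abundances(abundances):
    """Rescale non-negative abundances so that they sum to 1."""
    total = sum(abundances)
    if total <= 0:
        raise ValueError("abundances must sum to a positive number")
    return [value / total for value in abundances]


print(normalize_abundances([500, 500]))  # [0.5, 0.5]
print(normalize_abundances([1, 1]))      # [0.5, 0.5]
```

With `normalize=False`, the values `[1, 1]` are kept as they are, which matches the abundance of 1.0 per organism reported in the optimization table earlier.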

## Running the Models with MICOM Grow

As an alternative to standard community optimization with `optimize()`, we can use the `grow()` function from `micom.workflows`. It simulates growth of the community while accounting for the tradeoff between community growth and individual growth. It does not require the previously constructed `community` object; instead it works from the `manifest` built earlier.

A key difference here, though, is that the medium must be passed to the function as a `DataFrame` listing the reaction, flux, and metabolite.

### Building the Medium

In [None]:
# Restore medium to original
community.medium = medium_bkp

# Set variable to become new medium
grow_medium_to_change = community.medium

#Add or subtract reactions
#grow_medium_to_change["EX_glc__D_m"] = 0
#grow_medium_to_change["EX_sucr_m"] = 1
grow_medium_to_change["EX_nh4_m"] = 0
grow_medium_to_change["EX_n2_m"] = 5

In [None]:
grow_medium = pd.Series(grow_medium_to_change).to_frame('flux').reset_index()
grow_medium = grow_medium.rename(columns={'index':'reaction'})
grow_medium
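
The frame built above has `reaction` and `flux` columns only. If a `metabolite` column is also wanted, one option is to derive it from the reaction ID by dropping the `EX_` prefix. That naming rule is an assumption made for this sketch, and `medium_records` is a hypothetical helper.

```python
def medium_records(medium):
    """Turn a medium dict into records with reaction, flux and metabolite."""
    prefix = "EX_"
    records = []
    for reaction, flux in medium.items():
        if not reaction.startswith(prefix):
            raise ValueError(f"not an exchange reaction ID: {reaction}")
        records.append(
            {
                "reaction": reaction,
                "flux": flux,
                # Assumed rule: metabolite ID = reaction ID without "EX_"
                "metabolite": reaction[len(prefix):],
            }
        )
    return records


records = medium_records({"EX_glc__D_m": 5.0, "EX_n2_m": 5})
for record in records:
    print(record)
```

The list can then be converted with `pd.DataFrame(records)` to obtain a frame with all three columns.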

In [None]:
result_grow = grow(
    manifest, 
    model_folder="models_av_rt",  # folder written by build() above
    medium=grow_medium, 
    tradeoff=0.01, 
    threads=2,
    presolve=True
)
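
The `tradeoff` argument in the call above can be read through the cooperative tradeoff formulation described in the MICOM documentation. The following is a sketch of that formulation, where $a_i$ is the relative abundance of organism $i$, $\mu_i$ its growth rate, and $\alpha$ the tradeoff value (0.01 in this call). The notation is chosen here for illustration.

$$
\begin{aligned}
\mu_c &= \sum_i a_i \, \mu_i \\
\mu_c^{*} &= \max \; \mu_c \\
\text{then minimize } \sum_i \mu_i^{2} \quad &\text{subject to} \quad \mu_c \ge \alpha \, \mu_c^{*}
\end{aligned}
$$

Read this way, a small $\alpha$ only requires a small fraction of the maximal community growth rate, leaving more room to spread growth across the individual organisms, while $\alpha = 1$ enforces the maximal community growth rate.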

In [None]:
result_grow.exchanges

In [None]:
result_grow.exchanges.to_csv('av_rt_out.csv')

## Useful Utility Functions

In [None]:
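def medium2extracellular(medium: dict):
    """Rename medium exchange IDs (ending in `m`) to extracellular IDs (ending in `e`)."""
    return {k[:-1] + "e": v for k, v in medium.items()}

In [None]:
def extracellular2medium(medium: dict):
    """Rename extracellular exchange IDs (ending in `e`) to medium IDs (ending in `m`)."""
    return {k[:-1] + "m": v for k, v in medium.items()}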
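
Both helpers replace the last character of every key without checking it, so an ID that ends in something else is changed silently. A slightly stricter variant is sketched below; `swap_suffix` is a hypothetical name and the suffix check is an addition to the original behavior.

```python
def swap_suffix(medium, old, new):
    """Replace the compartment suffix `_old` with `_new` in every key."""
    converted = {}
    for reaction, bound in medium.items():
        if not reaction.endswith("_" + old):
            raise ValueError(f"{reaction} does not end in '_{old}'")
        converted[reaction[: -len(old)] + new] = bound
    return converted


community_style = {"EX_glc__D_m": 5.0, "EX_o2_m": 999999.0}
single_model_style = swap_suffix(community_style, "m", "e")
print(single_model_style)  # {'EX_glc__D_e': 5.0, 'EX_o2_e': 999999.0}

# The round trip gives back the original keys
print(swap_suffix(single_model_style, "e", "m") == community_style)  # True
```

This is handy when a medium taken from `community.medium` should be applied to one of the single-organism `cobrapy` models, whose exchange IDs end in `_e`.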

## CobraPy Models for Checking

In [None]:
model_azo = cobra.io.read_sbml_model('azo_vine.xml')
model_rhodo = cobra.io.read_sbml_model('Rhodo_Toru.xml')

In [None]:
med = model_azo.medium

med['EX_xyl__D_e'] = 0
med['EX_glc__D_e'] = 5

model_azo.medium = med

model_azo.optimize().objective_value

In [None]:
model_azo.reactions.BIOMASS_Av_DJ_core.build_reaction_string()

In [None]:
model_rhodo.genes.get_by_id('8631').annotation