Skip to content
forked from LMSE/KinMod

The hierarchical ontology of the KinMod database allows flexible exploration of data attributes and investigation of metabolic relationships within- and cross-species. Representing missing experimental values supports the rational experimental design to abridge kinetic parameter measurements. Linking multi-omics data and providing data on the me…

Notifications You must be signed in to change notification settings

Zhiqing-Xu/KinMod

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

23 Commits
 
 
 
 

Repository files navigation

Table of Content

  1. Abstract
  2. Web Application
  3. Data
  1. Sequence Data

1. Abstract

The hierarchical ontology of the KinMod database allows flexible exploration of data attributes and investigation of metabolic relationships within- and cross-species. Representing missing experimental values supports the rational experimental design to abridge kinetic parameter measurements. Linking multi-omics data and providing data on the metabolic regulation network encourages the development of novel machine learning techniques for predicting missing kinetic parameters and promotes accurate kinetic model construction of cells metabolism by providing a comprehensive list of available kinetic measurements. To define a thorough depiction of KinMod data, we develop six analyses to visualize associations between data classes belonging to separate sections of the metabolism. Through these analyses, we demonstrate that the KinMod database provides a unique framework for biologists and engineers to retrieve, evaluate and compare the functional metabolism of species, including the regulatory network, and discover the extent of available and missing experimental values of the metabolic regulation.

2. Web Application

KinMod web application is available on LMSE KinMod Interface.

KinMod Interface

3. Data

To allow a comprehensive analysis of the data, we developed three analysis modules in the KinMod web application. The organism_to_effector module allows flexible exploration of linked effector molecules and available KI parameters to a specific organism, providing the opportunity of investigating the regulatory networks within species. The effector_to_organism module enables the investigation of the regulation effect of a given molecule across species, providing the opportunity to compare the functional metabolism of organisms. the organism_to_reactants module allows exploration of available KM parameters in a given organism, opening up insights into the saturation of enzymes within species.

  • organism_to_effector module:

This module lists available regulatory interactions observed in an organism of interest, such as Escherichia coli.

organism_to_effector('Escherichia coli')
5305 row(s) returned
Duration: 25.828 sec / 0.359 sec

KinMod Interface

The resulting table lists organism name, enzyme EC Number, effector IUPAC name, effector IID, effector SMILES, Effector InCHIKEY, KEGG ID, Inhibitor tag, Activator Tag, KI parameter(s), and additional comments.
Effector IID is the internal KinMod identifier assigned to distinct chemical molecules based on their first layer of InCHI string. The inhibitor tag is one for inhibitor molecules, and for activator molecules, the activator tag is one. Additional comments can include a range of reported kinetic parameters, the experimental conditions or reveal if the enzyme was induced in the organism.


  • The effector_to_organism:

module outputs all observed regulatory interactions across species of a molecule of interest.

effector_to_organism('citrate');
401 row(s) returned
1.609 sec / 0.000 sec

KinMod Interface

This module returns the organism names and EC numbers for which citrate act as an effector. Also, the structure of this molecule, inhibitor tag and activator tag can be obtained.


  • The organism_to_reactant module:

organism_to_reactant('Escherichia coli') 3112 row(s) returned
2.500 sec / 4.375 sec

KinMod Interface

This module returns the KM values reported for EC numbers in Escherichia coli. The output table includes information on the reaction string, the EC number, reactants IUPAC name, and KM values.


  • The organism_to_reactant module:

organism_to_kcat('Escherichia coli', '', '') 1484 row(s) returned
1.500 sec / 3.265 sec

KinMod Interface

This module returns the organism name, enzyme EC Number, substrate IUPAC name, substrate INCHIKey, substrate SMILES, substrate InCHIKEY, molecular weight, KEGG ID, charge, and kcat parameter(s).


Additional Filters and CSV File Downloads

There are also additional filters that can be applied on the above functions. The output of these functions include the query results as well as the required data in a downloadable csv file.

KinMod Interface

You can add filters to the tables or use the functions without filters by entering none in the additional filters if you'd like the output data in csv format.

3. Database Access

If you'd like to directly access the database, you can find the MySQLdump file here: LMSE KinMod Database File.

If you'd like to access the database through Python, you can do so by loading the dump file into MySQL locally and access it through a MySQL-python connector. This process has been tested with Python3 and mysql-connector-python (8.0.28). If you'd like more detail on accessing the database through python, please contact Rana Ahmed Barghout (rana.barghout@mail.utoronto.ca).

4. Sequence Data

Three additional files are accessible in a NoSQL format (JSON files) on KinMod Sequence Data. to provide a linkage between BRENDA, PDB, and Swiss-port.

  • ec_sequence_organism.json provides the sequence data for each EC number of specific species extracted from the PDB database.
  • Pdb_to_uniprot.json links PDB and UniProt Identifiers and lists enzymes' binding sites, descriptions, and particular protein residues.
  • >Annotation.json is a dataset that provides GO annotations for each PDB sequence.

kianahaddadi@gmail.com Kiana Haddadi

About

The hierarchical ontology of the KinMod database allows flexible exploration of data attributes and investigation of metabolic relationships within- and cross-species. Representing missing experimental values supports the rational experimental design to abridge kinetic parameter measurements. Linking multi-omics data and providing data on the me…

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published