# Radhika Mardikar, Xinxin Mo

## Step 1:
Using the KEGG pathway maps, we select four enzymes from each of three pathways: glycolysis, the TCA cycle, and the pentose phosphate pathway.
Glycolysis: hexokinase 1, phosphoglucose isomerase, phosphofructokinase, fructose-bisphosphate aldolase (https://www.ebi.ac.uk/interpro/potm/2004_2/Page2.htm)
TCA cycle: citrate synthase, aconitase, isocitrate dehydrogenase, alpha-ketoglutarate dehydrogenase (https://www.news-medical.net/life-sciences/Krebs-Cycle-Enzymes.aspx)
Pentose phosphate pathway: transketolase, transaldolase, lactonase, phosphopentose isomerase (https://mcb.berkeley.edu/labs/krantz/mcb102/lect_S2008/MCB102-SPRING2008-LECTURE5-PENTOSE.pdf)
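
The selection above can also be kept as a small mapping from pathway to enzymes, which makes it easy to check that each pathway contributes four entries before any queries are sent. This is only a sketch: the names `PATHWAY_ENZYMES` and `enzyme_pairs` are illustrative and not part of the original notebook, and the enzyme names are written with their standard spellings.

```python
# Hypothetical grouping of the twelve enzymes by pathway (names are illustrative).
PATHWAY_ENZYMES = {
    "glycolysis": [
        "hexokinase 1",
        "phosphoglucose isomerase",
        "phosphofructokinase",
        "fructose-bisphosphate aldolase",
    ],
    "TCA cycle": [
        "citrate synthase",
        "aconitase",
        "isocitrate dehydrogenase",
        "alpha-ketoglutarate dehydrogenase",
    ],
    "pentose phosphate pathway": [
        "transketolase",
        "transaldolase",
        "lactonase",
        "phosphopentose isomerase",
    ],
}


def enzyme_pairs(pathways):
    """Return (pathway, enzyme) tuples in the order they were defined."""
    return [(pathway, enzyme) for pathway, enzymes in pathways.items() for enzyme in enzymes]


pairs = enzyme_pairs(PATHWAY_ENZYMES)
print(len(pairs))  # 12
```

Keeping the pathway label next to each enzyme means later results can be grouped by pathway without re-deriving which enzyme belongs where.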

In [None]:
from Bio import Entrez

# NCBI asks for a contact address on every E-utilities request.
Entrez.email = "rmardikar@berkeley.edu"

# Four enzymes each from glycolysis, the TCA cycle, and the pentose phosphate pathway.
enzymelist = ["hexokinase 1", "phosphoglucose isomerase", "phosphofructokinase", "fructose-bisphosphate aldolase",
              "citrate synthase", "aconitase", "isocitrate dehydrogenase", "alpha-ketoglutarate dehydrogenase",
              "transketolase", "transaldolase", "lactonase", "phosphopentose isomerase"]
organismlist = ["Homo sapiens", "Drosophila melanogaster", "Escherichia coli"]

# Search the nucleotide database for every organism/enzyme pair and keep the matching IDs.
idlist = []
for org in organismlist:
    for enzyme in enzymelist:
        handle = Entrez.esearch(db="nucleotide", term=org + "[Orgn] AND " + enzyme)
        record = Entrez.read(handle)
        handle.close()
        idlist.append(record["IdList"])

# Fetch and print the GenBank records for each list of IDs.
for ids in idlist:
    handle = Entrez.efetch(db="nucleotide", id=ids, rettype="gb", retmode="text")
    print(handle.read())
    handle.close()

LOCUS       NR_029671                 88 bp    RNA     linear   PRI 02-SEP-2018
DEFINITION  Homo sapiens microRNA 125b-1 (MIR125B1), microRNA.
ACCESSION   NR_029671
VERSION     NR_029671.1
KEYWORDS    RefSeq.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 88)
  AUTHORS   Matteucci E, Maroni P, Nicassio F, Ghini F, Bendinelli P and
            Desiderio MA.
  TITLE     Microenvironment Stimuli HGF and Hypoxia Differently Affected
            miR-125b and Ets-1 Function with Opposite Effects on the
            Invasiveness of Bone Metastatic Cells: A Comparison with Breast
            Carcinoma Cells
  JOURNAL   Int J Mol Sci 19 (1), E258 (2018)
   PUBMED   29337876
  REMARK    GeneRIF: depending on the microenvironment conditions and
            endogenous miR-

LOCUS       NM_000175               4125 bp    mRNA    linear   PRI 01-JUL-2018
DEFINITION  Homo sapiens glucose-6-phosphate isomerase (GPI), transcript
            variant 2, mRNA.
ACCESSION   NM_000175
VERSION     NM_000175.5
KEYWORDS    RefSeq.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 4125)
  AUTHORS   Yang M, Haase C, Viljanen J, Xu B, Ge C, Kihlberg J and Holmdahl R.
  TITLE     Cutting Edge: Processing of Oxidized Peptides in Macrophages
            Regulates T Cell Activation and Development of Autoimmune Arthritis
  JOURNAL   J. Immunol. 199 (12), 3937-3942 (2017)
   PUBMED   29127146
  REMARK    GeneRIF: The 2 cysteines in hGPIc-c are prone to form disulfides
            upon oxidation. hGPIc-c could induce arthritis in both B10.Q and
          

LOCUS       NM_031284               2605 bp    mRNA    linear   PRI 09-AUG-2018
DEFINITION  Homo sapiens ADP dependent glucokinase (ADPGK), transcript variant
            1, mRNA.
ACCESSION   NM_031284
VERSION     NM_031284.5
KEYWORDS    RefSeq.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2605)
  AUTHORS   Richter JP, Goroncy AK, Ronimus RS and Sutherland-Smith AJ.
  TITLE     The Structural and Functional Characterization of Mammalian
            ADP-dependent Glucokinase
  JOURNAL   J. Biol. Chem. 291 (8), 3694-3704 (2016)
   PUBMED   26555263
  REMARK    GeneRIF: ADPGK is substrate inhibited by high glucose concentration
            and shows high specificity for glucose, with no activity for other
            sugars, as determined by NMR spectroscopy, i

IOPub data rate exceeded.
The notebook server will temporarily stop sending output
to the client in order to avoid crashing it.
To change this limit, set the config variable
`--NotebookApp.iopub_data_rate_limit`.

Current values:
NotebookApp.iopub_data_rate_limit=1000000.0 (bytes/sec)
NotebookApp.rate_limit_window=3.0 (secs)



LOCUS       NM_001362840            3629 bp    mRNA    linear   PRI 12-AUG-2018
DEFINITION  Homo sapiens aconitase 1 (ACO1), transcript variant 3, mRNA.
ACCESSION   NM_001362840
VERSION     NM_001362840.1
KEYWORDS    RefSeq.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 3629)
  AUTHORS   Connell GJ, Danial JS and Haastruthers CX.
  TITLE     Evaluation of the iron regulatory protein-1 interactome
  JOURNAL   Biometals 31 (1), 139-146 (2018)
   PUBMED   29330752
  REMARK    GeneRIF: Evaluation of the iron regulatory protein-1 interactome
            has been presented.
REFERENCE   2  (bases 1 to 3629)
  AUTHORS   Johnson NB, Deck KM, Nizzi CP and Eisenstein RS.
  TITLE     A synergistic role of IRP1 and FBXL5 proteins in coordinating iron
            metabolis

Supplied id parameter is empty.

LOCUS       NM_001365524            8182 bp    mRNA    linear   PRI 02-SEP-2018
DEFINITION  Homo sapiens fibronectin 1 (FN1), transcript variant 19, mRNA.
ACCESSION   NM_001365524 XM_005246414
VERSION     NM_001365524.1
KEYWORDS    RefSeq.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 8182)
  AUTHORS   Nguyen H, Huynh K and Stoldt VR.
  TITLE     Shear-dependent fibrillogenesis of fibronectin: Impact of platelet
            integrins and actin cytoskeleton
  JOURNAL   Biochem. Biophys. Res. Commun. 497 (2), 797-803 (2018)
   PUBMED   29470988
  REMARK    GeneRIF: Fn with its inactive compact structure requires unfolding
            to assemble into active fibrils. Shear stress could induce
            conformational changes of

LOCUS       NM_001145934            2579 bp    mRNA    linear   PRI 02-SEP-2018
DEFINITION  Homo sapiens transketolase like 1 (TKTL1), transcript variant 3,
            mRNA.
ACCESSION   NM_001145934
VERSION     NM_001145934.1
KEYWORDS    RefSeq.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2579)
  AUTHORS   Dong Y and Wang M.
  TITLE     Knockdown of TKTL1 additively complements cisplatin-induced
            cytotoxicity in nasopharyngeal carcinoma cells by regulating the
            levels of NADPH and ribose-5-phosphate
  JOURNAL   Biomed. Pharmacother. 85, 672-678 (2017)
   PUBMED   27916418
  REMARK    GeneRIF: Knockdown of TKTL1 additively complements
            cisplatin-induced cytotoxicity in the nasopharyngeal carcinoma
            cells by inhibi

LOCUS       NM_032680               2307 bp    mRNA    linear   PRI 24-JUN-2018
DEFINITION  Homo sapiens calcium release activated channel regulator 2A
            (CRACR2A), transcript variant 3, mRNA.
ACCESSION   NM_032680 XM_937238
VERSION     NM_032680.3
KEYWORDS    RefSeq.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2307)
  AUTHORS   Srikanth S, Woo JS and Gwack Y.
  TITLE     A large Rab GTPase family in a small GTPase world
  JOURNAL   Small GTPases 8 (1), 43-48 (2017)
   PUBMED   27221160
  REMARK    GeneRIF: Results show the characterization of CRACR2A protein which
            encodes a large Rab GTPase containing multiple functional domains
            contrary to small Rab GTPases. It was found to play an unexpected
            role in regulatin

Supplied id parameter is empty.



LOCUS       NT_033778           25286936 bp    DNA     linear   CON 16-AUG-2018
DEFINITION  Drosophila melanogaster chromosome 2R.
ACCESSION   NT_033778 NW_001844732 NW_001844738 NW_001848856
VERSION     NT_033778.4
DBLINK      BioProject: PRJNA164
            BioSample: SAMN02803731
            Assembly: GCF_000001215.4
KEYWORDS    RefSeq.
SOURCE      Drosophila melanogaster (fruit fly)
  ORGANISM  Drosophila melanogaster
            Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
            Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
REFERENCE   1  (bases 1 to 25286936)
  AUTHORS   Matthews,B.B., Dos Santos,G., Crosby,M.A., Emmert,D.B., St
            Pierre,S.E., Gramates,L.S., Zhou,P., Schroeder,A.J., Falls,K.,
            Strelets,V., Russo,S.M. and Gelbart,W.M.
  CONSRTM   FlyBase Consortium
  TITLE     Gene Model Annotations for Drosophila melanogaster: Impact of
            H

Supplied id parameter is empty.

LOCUS       NM_134950                531 bp    mRNA    linear   INV 16-AUG-2018
DEFINITION  Drosophila melanogaster uncharacterized protein, transcript variant
            B (CG15418), mRNA.
ACCESSION   NM_134950
VERSION     NM_134950.4
DBLINK      BioProject: PRJNA164
            BioSample: SAMN02803731
KEYWORDS    RefSeq.
SOURCE      Drosophila melanogaster (fruit fly)
  ORGANISM  Drosophila melanogaster
            Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
            Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
REFERENCE   1  (bases 1 to 531)
  AUTHORS   Matthews,B.B., Dos Santos,G., Crosby,M.A., Emmert,D.B., St
            Pierre,S.E., Gramates,L.S., Zhou,P., Schroeder,A.J., Falls,K.,
            Strelets,V., Russo,S.M. and Gelbart,W.M.
  CONSRTM   FlyBase Consortium
  TITLE     Gene Model Annotations for Drosophila melanogaster: Impact of
  

LOCUS       NT_033777           32079331 bp    DNA     linear   CON 16-AUG-2018
DEFINITION  Drosophila melanogaster chromosome 3R.
ACCESSION   NT_033777 NW_001844733 NW_001844734 NW_001844736 NW_001844852
            NW_001844855 NW_001844895 NW_001848858
VERSION     NT_033777.3
DBLINK      BioProject: PRJNA164
            BioSample: SAMN02803731
            Assembly: GCF_000001215.4
KEYWORDS    RefSeq.
SOURCE      Drosophila melanogaster (fruit fly)
  ORGANISM  Drosophila melanogaster
            Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
            Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
REFERENCE   1  (bases 1 to 32079331)
  AUTHORS   Matthews,B.B., Dos Santos,G., Crosby,M.A., Emmert,D.B., St
            Pierre,S.E., Gramates,L.S., Zhou,P., Schroeder,A.J., Falls,K.,
            Strelets,V., Russo,S.M. and Gelbart,W.M.
  CONSRTM   FlyBase Consortium
  TITLE     Gene Model 

Supplied id parameter is empty.

LOCUS       KQ961090               26431 bp    DNA     linear   CON 17-FEB-2016
DEFINITION  Escherichia coli strain JPH264 genomic scaffold Scaffold62, whole
            genome shotgun sequence.
ACCESSION   KQ961090 LSNY01000000
VERSION     KQ961090.1
DBLINK      BioProject: PRJNA268327
            BioSample: SAMN03838475
KEYWORDS    WGS; HIGH_QUALITY_DRAFT.
SOURCE      Escherichia coli
  ORGANISM  Escherichia coli
            Bacteria; Proteobacteria; Gammaproteobacteria; Enterobacterales;
            Enterobacteriaceae; Escherichia.
REFERENCE   1  (bases 1 to 26431)
  AUTHORS   Mitreva,M., Pepin,K.H., Mihindukulasuriya,K.A., Fulton,R.,
            Fronick,C., O'Laughlin,M., Miner,T., Herter,B., Rosa,B.A.,
            Cordes,M., Tomlinson,C., Wollam,A., Palsikar,V.B., Mardis,E.R. and
            Wilson,R.K.
  TITLE     Direct Submission
  JOURNAL   Submitted (16-FEB-2016) McDonnell Genome Institute, Washington
            University School of Medicine,

Supplied id parameter is empty.
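
The "Supplied id parameter is empty." messages appear where a search returned no IDs, and the IOPub warnings appear where the printed records were too large for the notebook (some hits are whole chromosomes). A possible variant, sketched below, asks for a single hit per query, skips empty results, and reads only the start of each record. The helper names (`build_term`, `header_lines`, `fetch_first_hits`), the `[Title]` restriction, and the 500-character cap are assumptions made for illustration, not part of the original notebook. Biopython is imported inside `fetch_first_hits` so the pure helpers can be tried without network access.

```python
def build_term(organism, enzyme):
    """Build an Entrez query limited to one organism, matching the enzyme name in the title."""
    return f'{organism}[Orgn] AND "{enzyme}"[Title]'


def header_lines(record_text, n=3):
    """Return the first n lines of a GenBank flat-file record."""
    return "\n".join(record_text.splitlines()[:n])


def fetch_first_hits(organisms, enzymes, email, max_chars=500):
    """Return {(organism, enzyme): start of the first GenBank record, or None if no hit}."""
    from Bio import Entrez  # imported here so the helpers above work without Biopython

    Entrez.email = email
    results = {}
    for org in organisms:
        for enzyme in enzymes:
            # retmax=1 asks NCBI for at most one ID per query.
            handle = Entrez.esearch(db="nucleotide", term=build_term(org, enzyme), retmax=1)
            record = Entrez.read(handle)
            handle.close()
            ids = record["IdList"]
            if not ids:
                # No match: record the miss instead of calling efetch with an empty id.
                results[(org, enzyme)] = None
                continue
            handle = Entrez.efetch(db="nucleotide", id=ids[0], rettype="gb", retmode="text")
            # Read only the first max_chars characters so large records stay small.
            results[(org, enzyme)] = handle.read(max_chars)
            handle.close()
    return results


print(build_term("Homo sapiens", "aconitase"))
```

A call such as `fetch_first_hits(organismlist, enzymelist, Entrez.email)` would then return one short entry per organism/enzyme pair, and `header_lines` can be applied to each non-empty entry to show only the LOCUS and DEFINITION lines. Whether the title restriction returns the intended gene for every enzyme would still need to be checked by hand.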


