## Information about the genes obtained in the optimizations

In [5]:
from Bio import ExPASy

In [6]:
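def file(id_gene, filename):
    '''
    The file function uses the Biopython ExPASy package to fetch records from UniProt.
    It takes the list of gene IDs to look up in the database and the name of the output file
    to create with the results obtained. ExPASy.get_sprot_raw returns the raw Swiss-Prot text
    record, so the file holds flat text even when it is named .xml.
    '''
    # Open the output once; append mode means rerunning the cell adds to an existing file.
    with open(filename, "a") as out_handle:
        for i in id_gene:
            handle = ExPASy.get_sprot_raw(i)
            gene = handle.read()
            handle.close()
            out_handle.write(gene)
    # Only the last record fetched is returned; all records are in the file.
    return gene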
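
Because the file is opened in append mode, running the cell twice stores every record twice. A minimal sketch of one way around that, assuming overwriting the file is acceptable: `fetch_records` and its `fetch` parameter are hypothetical names introduced here, not part of Biopython, and the default fetcher simply wraps `ExPASy.get_sprot_raw`.

```python
def fetch_records(ids, filename, fetch=None):
    """Write one raw record per accession to filename and return them as a list.

    fetch is a hypothetical hook: any callable mapping an accession to the
    record text. By default it wraps Bio.ExPASy.get_sprot_raw.
    """
    if fetch is None:
        # Imported lazily so the sketch can be loaded without Biopython installed
        from Bio import ExPASy

        def fetch(accession):
            handle = ExPASy.get_sprot_raw(accession)
            try:
                return handle.read()
            finally:
                handle.close()

    records = []
    # "w" truncates the file, so rerunning the cell does not duplicate entries
    with open(filename, "w") as out_handle:
        for accession in dict.fromkeys(ids):  # drop repeated IDs, keep their order
            record = fetch(accession)
            out_handle.write(record)
            records.append(record)
    return records
```

Passing a stand-in for `fetch` makes it possible to try the function without network access, and returning the full list keeps every record available rather than only the last one.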

In [7]:
def informations(filename):
    '''
    The informations function takes the name of the file with the UniProt results and prints,
    for each entry, the entry name with its reviewed status and sequence length, the accession IDs,
    the gene name, source, mutations and functions, among other information.
    '''
    with open(filename) as handle:
        file_list = handle.readlines()
    for line in file_list:
        # ID line: entry name, review status and sequence length
        if line.startswith("ID"):
            print('---------------------------')
            print()
            print("Nome de acesso, Status e comprimento da sequência:", line[5:].replace("    ", ""), end="")
        # AC line: accession numbers
        if line.startswith("AC   "):
            print("ID:", line[5:], end="")
        # DE line: recommended and alternative protein names
        if line.startswith("DE   "):
            print(line[5:], end="")
        # GN line: gene names
        if line.startswith("GN"):
            print(line[5:], end="")
        # OS line: source organism
        if line.startswith("OS"):
            print("Source:", line[5:], end="")
        if "SUBCELLULAR LOCATION:" in line[:30]:
            print("Localização subcelular:", line[30:], end="")
        # DR line: cross-references, restricted here to PROSITE domains
        if line.startswith("DR") and "PROSITE" in line[:15]:
            print("Domínios PROSITE:", line[5:], end="")
        # RP line: what each reference reports, kept only when it mentions FUNCTION
        if line.startswith("RP") and "FUNCTION" in line:
            print(line[5:], end="")
        # CC line: comments, skipping the separator and copyright notice lines
        if line.startswith("CC   ") and not line.startswith(("CC   --", "CC   Co", "CC   Di")):
            print(line[5:], end="")
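
The line-by-line tests above rely on the flat-file layout, where each line starts with a two-letter code followed by three spaces. A minimal sketch of the same idea that collects the lines into a dictionary instead of printing them: `group_by_line_code` is a hypothetical helper, and `sample` is an abbreviated record assembled from the output shown in this notebook, not a full UniProt entry.

```python
from collections import defaultdict


def group_by_line_code(text):
    """Group the lines of one flat-file record by their two-letter line code."""
    fields = defaultdict(list)
    for line in text.splitlines():
        code, value = line[:2], line[5:].strip()
        # Skip sequence lines (they start with spaces) and the "//" terminator
        if code.strip() and code != "//":
            fields[code].append(value)
    return dict(fields)


# Abbreviated sample record, assembled from the notebook output (assumption)
sample = "\n".join([
    "ID   BIOA_ECOLI              Reviewed;         429 AA.",
    "AC   P12995; Q2MBJ5;",
    "GN   Name=bioA; OrderedLocusNames=b0774, JW0757;",
    "OS   Escherichia coli (strain K12).",
    "//",
])

fields = group_by_line_code(sample)
print(fields["AC"])
print(fields["OS"])
```

Biopython also ships a `Bio.SwissProt` parser for this format, which is likely a sturdier choice than hand-written prefix checks when more fields are needed.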

### Information about xanthotoxin genes

In [8]:
id_gene = ['P12995','P32173','P30011','P32125','Q46877','P37596','P60546','P0AEX9','P02916','P68183','P68187','P69451','P38135','P0ACD4','Q47146']
filename = "xanthotoxin.xml"
file (id_gene,filename)
informations ("xanthotoxin.xml")

---------------------------

Nome de acesso, Status e comprimento da sequência: BIOA_ECOLI  Reviewed; 429 AA.
ID: P12995; Q2MBJ5;
RecName: Full=Adenosylmethionine-8-amino-7-oxononanoate aminotransferase;
         EC=2.6.1.62 {ECO:0000269|PubMed:1092681, ECO:0000269|PubMed:1092682};
AltName: Full=7,8-diamino-pelargonic acid aminotransferase {ECO:0000303|PubMed:1092681};
         Short=DAPA AT;
         Short=DAPA aminotransferase {ECO:0000303|PubMed:1092682};
AltName: Full=7,8-diaminononanoate synthase;
         Short=DANS;
AltName: Full=Diaminopelargonic acid synthase;
Name=bioA; OrderedLocusNames=b0774, JW0757;
Source: Escherichia coli (strain K12).
FUNCTION, SUBUNIT, SUBSTRATE SPECIFICITY, COFACTOR, AND CATALYTIC ACTIVITY.
BIOPHYSICOCHEMICAL PROPERTIES, ACTIVITY REGULATION, FUNCTION, AND CATALYTIC
FUNCTION, AND DISRUPTION PHENOTYPE.
-!- FUNCTION: Catalyzes the transfer of the alpha-amino group from S-
    adenosyl-L-methionine (SAM) to 7-keto-8-aminopelargonic acid (KAPA) to
    form

### Information about xanthohumol genes

In [9]:
id_gene = ['P12281','P0ADG4','P02943','P06149','P32173','P32125','P0AC84','P77747','P02932','P02931','P06996','P11458','P25437','P40191','P77433','P0AFU8','P00954']
filename = "xanthohumol.xml"
file (id_gene,filename)
informations ("xanthohumol.xml")

---------------------------

Nome de acesso, Status e comprimento da sequência: MOEA_ECOLI  Reviewed; 411 AA.
ID: P12281;
RecName: Full=Molybdopterin molybdenumtransferase;
         Short=MPT Mo-transferase;
         EC=2.10.1.1 {ECO:0000269|PubMed:15632135};
Name=moeA; Synonyms=bisB, chlE, narE; OrderedLocusNames=b0827, JW0811;
Source: Escherichia coli (strain K12).
FUNCTION, AND CATALYTIC ACTIVITY.
-!- FUNCTION: Catalyzes the insertion of molybdate into adenylated
    molybdopterin with the concomitant release of AMP.
    {ECO:0000269|PubMed:15632135}.
-!- CATALYTIC ACTIVITY:
    Reaction=adenylyl-molybdopterin + H(+) + molybdate = AMP + H2O + Mo-
      molybdopterin; Xref=Rhea:RHEA:35047, ChEBI:CHEBI:15377,
      ChEBI:CHEBI:15378, ChEBI:CHEBI:36264, ChEBI:CHEBI:62727,
      ChEBI:CHEBI:71302, ChEBI:CHEBI:456215; EC=2.10.1.1;
      Evidence={ECO:0000269|PubMed:15632135};
-!- COFACTOR:
    Name=Mg(2+); Xref=ChEBI:CHEBI:18420;
      Evidence={ECO:0000269|PubMed:11428898};
    Note=Bin

### Information about prenylnaringenin genes

In [10]:
id_gene = ['P0ACD4','P0AC16','P0AFU8','P0A6V8','P75823','P23847','P0ABM1']
filename = "prenylnaringenin.xml"
file (id_gene,filename)
informations ("prenylnaringenin.xml")

---------------------------

Nome de acesso, Status e comprimento da sequência: ISCU_ECOLI  Reviewed; 128 AA.
ID: P0ACD4; P77310;
RecName: Full=Iron-sulfur cluster assembly scaffold protein IscU;
AltName: Full=Sulfur acceptor protein IscU;
Name=iscU; Synonyms=nifU, yfhN; OrderedLocusNames=b2529, JW2513;
Source: Escherichia coli (strain K12).
FUNCTION AS A SULFUR ACCEPTOR, INTERACTION WITH ISCS, ACTIVITY REGULATION,
FUNCTION, SUBUNIT, AND MUTAGENESIS OF ASP-39; LYS-89; ASN-90; SER-107 AND
-!- FUNCTION: A scaffold on which IscS assembles Fe-S clusters. Exists as 2
    interconverting forms, a structured (S) and disordered (D) form. The D-
    state is the preferred substrate for IscS. Converts to the S-state when
    an Fe-S cluster is assembled, which helps it dissociate from IscS to
    transfer the Fe-S to an acceptor. It is likely that Fe-S cluster
    coordination is flexible as the role of this complex is to build and
    then hand off Fe-S clusters. {ECO:0000269|PubMed:11577100,
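
Some accessions appear in more than one of the three ID lists used in this notebook, so their records are fetched and printed more than once. A small sketch for listing those overlaps: `shared_accessions` and the `gene_sets` dictionary are hypothetical names, while the IDs are copied from the cells above.

```python
from itertools import combinations

# ID lists copied from the three cells in this notebook
gene_sets = {
    "xanthotoxin": ['P12995', 'P32173', 'P30011', 'P32125', 'Q46877', 'P37596',
                    'P60546', 'P0AEX9', 'P02916', 'P68183', 'P68187', 'P69451',
                    'P38135', 'P0ACD4', 'Q47146'],
    "xanthohumol": ['P12281', 'P0ADG4', 'P02943', 'P06149', 'P32173', 'P32125',
                    'P0AC84', 'P77747', 'P02932', 'P02931', 'P06996', 'P11458',
                    'P25437', 'P40191', 'P77433', 'P0AFU8', 'P00954'],
    "prenylnaringenin": ['P0ACD4', 'P0AC16', 'P0AFU8', 'P0A6V8', 'P75823',
                         'P23847', 'P0ABM1'],
}


def shared_accessions(gene_sets):
    """Return the accessions shared by each pair of compound gene lists."""
    shared = {}
    for (name_a, ids_a), (name_b, ids_b) in combinations(gene_sets.items(), 2):
        common = sorted(set(ids_a) & set(ids_b))
        if common:
            shared[(name_a, name_b)] = common
    return shared


for pair, ids in shared_accessions(gene_sets).items():
    print(pair, ids)
```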
 