# Problem 8: Translating RNA into Protein 
The 20 commonly occurring amino acids are abbreviated by using 20 letters from the English alphabet (all letters except for B, J, O, U, X, and Z). Protein strings are constructed from these 20 symbols. Henceforth, the term genetic string will incorporate protein strings along with DNA strings and RNA strings.

The RNA codon table dictates the details regarding the encoding of specific codons into the amino acid alphabet.

Given: An RNA string s corresponding to a strand of mRNA (of length at most 10 kbp).

Return: The protein string encoded by s.

Sample Dataset 
> AUGGCCAUGGCGCCCAGAACUGAGAUCAAUAGUACCCGUAUUAACGGGUGA 

Sample Output 
> MAMAPRTEINSTRING

In [62]:
codonTable = """UUU:F\nCUU:L\nAUU:I\nGUU:V
UUC:F\nCUC:L\nAUC:I\nGUC:V
UUA:L\nCUA:L\nAUA:I\nGUA:V
UUG:L\nCUG:L\nAUG:M\nGUG:V
UCU:S\nCCU:P\nACU:T\nGCU:A
UCC:S\nCCC:P\nACC:T\nGCC:A
UCA:S\nCCA:P\nACA:T\nGCA:A
UCG:S\nCCG:P\nACG:T\nGCG:A
UAU:Y\nCAU:H\nAAU:N\nGAU:D
UAC:Y\nCAC:H\nAAC:N\nGAC:D
UAA:Stop\nCAA:Q\nAAA:K\nGAA:E
UAG:Stop\nCAG:Q\nAAG:K\nGAG:E
UGU:C\nCGU:R\nAGU:S\nGGU:G
UGC:C\nCGC:R\nAGC:S\nGGC:G
UGA:Stop\nCGA:R\nAGA:R\nGGA:G
UGG:W\nCGG:R\nAGG:R\nGGG:G"""

# create a codon:protein dictionary
codon_dict = {}
for x in codonTable.split("\n"):
    temp = x.split(":")
    codon_dict[temp[0]] = temp[1]

In [63]:
from textwrap import wrap
def rnaToProtien(seq):
    protien = ""
    for x in wrap(seq, 3): # split the sequence into codons
        protien += codon_dict[x]
    return protien.replace("Stop","")

In [64]:
testSeq = 'AUGGCCAUGGCGCCCAGAACUGAGAUCAAUAGUACCCGUAUUAACGGGUGA'
rnaToProtien(testSeq)

'MAMAPRTEINSTRING'

In [65]:
actualSeq = 'AUGGGUCGGUUAUUGGCUGGGCAUCCGAGAGGGGAUGGAAUUACCGCUUUAUUUUCGGUCGUGUCGAACUUCCAACACACACGUAAACCCUCUAUGCGCUUGCCAUUGGUCUUUAGUACGCAUUGCCCAACGAAGCGCUUUUUGUCGGUACAUCGGACGAUUCCGGACGAAAUCCCAUAUGAACCUAUUGAUAAGGCUUGCGCCAUGGAAGAUCGUAAAAUGAAUGCGCGCAUAGCAUGUGAUGGACUACCAGACACUAUGUGCUGUUCCGUAAAUAUCAAGCGGUUGCUUCUUCCUCCGACACAUUGGCAUACGUUUAUUCAGGGAUGCUUCACUCGCUCAAUGGCAGUGAGACUUGCAGUAGCUGGUCAAUUCUUCUCCGAGAAUUGCAAAAUUGUUUCGUCUGCUAGUCUUGACGACUGUAGUAAAUACCUGGUCGUGUCUCCAUUUCGUAGACCAGUCGUAAUAGCGCAGUCAGAUGCCCCCAAGACAACACACAUGCGAUGCAGAAUGCGGAAGAGGCUCAUCAAGAAUAUAUGGACACCUUCGCCACAAGCCCAGACUUAUACUUCCGAUCUAGCGACAGGAUCUCCAUACCGUUUCGGUUAUUACGUUUAUACGCGAUACAGACUAUCGGAUAAUCGAUUACAUCUACAAUCAGUGAAGGGGCGCAAAAAUUCUCGUGCCAAGUCGUGCAAUCACGGAAGGUCCUGCGAUCAAGGCAUUCUUAGGGCUGUUCGCGCUGAUCCCGCCGGUUCAGCCUACUGGAUCAGUUCGGCGGCGUCCGGAAACGUCCUGAAUACUACAUCUUUAUGUGUCGCCGGAGGUCUCCCAGCACUAGGGCGCACUUCUAAUGGUAGCAGGUGCGUGCACAAAACCCUAGAGCGGCUUCUUUGUGAUUGGAGGUCGGGUAGCUUGAUGGAUCCACUCACAAAACAUUUGUCCGGGACUAACCAGAUUGAAGACCAAAGUAUGCGUUCAAGAAGGGCUUCUGACUCAGUCAUUAGAACACUAAGACAGCUCGACAUCGUUGAGUAUCCGCCCAAUACCUUAGUAAAGGGCAAUAGUCGGGGACCCGGCUCCGUUAAAACGACUGGCCAGCAAGUCAUUCGUGUAUGCCUCCCACUAAUUCGUCAUGCAGGGAAUAGAAGCGUGGUUGAACCGAAAAAUCCUGCAGCGGGUGACUUGAUGCCCAGAUGGAAGUACCGUAUUCUCCGUCAUGCUUACAGGCUACUAAUUACUAGCGCACCGCAGGAGCUCAGUGCUAUCGUCUUCUCUGAAAUACCGGUCGUACAGUUAUCUCACGACUCGCCUCUGAUCAAGGGUGCUACAAGACCUUACCAACCGAUCAGGUCAGCUCGUACACUGCGCCACAAUGCAGGAUUUGCUUAUGACGAUGAACGGGUCAAGGCAGAAAUGCAACUCGACGUUAACCUCUGUGCUAAAAGCGUGGUCCUUGGUAAUUUUGUGCUGCGGCACCAGAUGCGUCCCCCCUUCCACGAUAGAACUAAAGUGAGAGUCAGACGAGCGCGGCCUUGUGCCAGGUUUGCCCAAGGUGCUCAAGAAUUGGGACAUUGCUUGAAUGAAACGGACCGCAUGUUCGUAUUUACACAUGUACGUCUGAGUUGUAGAGGUGUGUGCCACACCGGUUUAGUGGAGGUGAUCUUGUCACGUACUCGUGGACGAUUGACAACGUAUACGACGGGGUGCUUGGAGGGGAAUCCAAAGGGCCAGAAUUUAACCAUAGUUGGCCGCUCCCAUCGCAAGGGGGAGUUCUUUUGUGAUACCCUCAAUAGCAAUCAGCCCACUCGACACUACGCUUUAGCUAGACCCCUGAAGCGAUCCCGUACCCUCGCUCCUGGCGAGCUCAGCCCCGUUGGUCUUCGGAAUGGUCUACCCACAUGCCGGAAACCUGGGGCAACUUAUUUAUUGGAAUCCAAAUCUGGAAGCGAUGGACGUUGCGGACAUUAUAAGUCCUGCGUGGAAGGGGGGGGCCAUCUCCGUCAGUCGUACACAACGAGUACAUUCCUGGCGAACAUGUCGGAGCGGUUUAAGUCAAUCGGUGCCUGUAGCUCAGAGGCGUUCCUACGAUGUGUAAGGGCGCGGUUAGUAGGGCCCAAACGAUUAUUAUCCCAACCAUUAAAGAUCGGGGCCCACGGCAUGUGUGUUAUUUUAUACCUGGGAGACCCAUGCUUCCCUUUAAUAUGGCCGGAGCUCUGUUUUACACGUCCAAUUAUUCCUGGAAGCCCAGACCGACCUUCACGUAGUCUUAAACUGAUCCGGUCCCUAUUAUGCAUAUCCAUGUACGAAGGCUCAUUGGUCGUCAUGCGUAGGAAGGACUUUGGCAGAGGAGCCCACAGUUUGGAUCUGAUGCCCAAGACCUUGGGACGGCGCAGGGAGUUUAUGAUGUUGGACCAGAAGUACGCAAUGAUCAUCACCAGUGCGCUAUUGAUAACAACUCCGGCAGGCCAUCUGCGUUUAAUAAAAGGGCACCGUGUGCAAGAUUGGGUCCUCUCGUCCCCUCUGACUGAGGCUUUCUCAAGGGGAACUCGACGCGUGAUUUCGCUGGUUAUAUAUUACUUCGCUGAUAAACACCCUCGCGGCUAUAAUAGGGUUCGGCCCGAUUUAGGUUUUCACUUUGGACUCCAACUAACAUUAUGGGUUCGCCCGAAAGGCUUGAUGUUGUCCGAAGGUAAUGCUUGGAGACACAAAUCGACCUCCCUUGAGUGCUCACCAUGCACUCCAAAACAUUGGGGGUAUCGUUCCCUUUGCGCAUGUACAGGAUUCGAUGAAGAAUGCUGGCCUAGUUCCGCGCCACACCUCCCGCUGUCCAAUCUUUACAAGAAGUACGCGGCCAGAUCAUCUCUAAGACCCCUUUUACCCGCUUACAAAGCCAUCUCUAUUGACCUUCCUCGCAAGUCACCGGUAGACGCACUCAUGCAGCGACUGGGGUUCAAGCAUUUCGAGCCCACGGCGCCCUGCACAUCUAGCUGCCGACACGCACAAGAUAUGUCCGUGUUAAUCCCGUUGGUCCAAUUGUUCAGCUGGGGCUUGGCGCUGCAGGUGGUCAACGGAUUAAUAUCAUGUCUCUUUAAGACGUCCAGCAGUGCAGGGUCGAGUGCUCUUGGUCAGCGAAUUAUCGACAACGUCCCUAUAGCCGCCGCAAGCAAACUAAUCGUGGCACCCACUGUUUGGAACAAGGAGAUUGAUUACAGGCAAAGCGAUACCGUGUUGAGUCAAUCGCUAGUUACAGGUGGGAACUUUUUGAGAUUGACCUGCGUCCCAAUGAAGGAUUGCCGGAAAGACGGUUUCCCGCCGCUCAUGAGAAUCUUAAAUAGAAUGACCUGUUGUGACCGAACCCCGGGCGUCAGAUCAGACGCGUCACCACUCUCUUGCCAUCCAAGUGUCGCUUACCGAACCACACACGACUCCCACGCAUCAGGGCGGAUGGGACCAGCGUCACGAAAAUCCUUCAGCGCUGUUGUUUGCUCGUUCUCCGCUAGUUCGUUCCAGUUUUCUGUAUUGGAUGCGAUCGACGAGGGGCGAUCCUGCAUCCUAAGCAGACACACCCUGCGUAUCCAGACGAGGCAGAGAUCAUGCAACUUACCUCUUUCUGCCCGUAGAUACAUUAUUUUUGGUCCCCUAUAUCCCAGCCGGAGAUUCUGGAUCGGUCAGCUUAUGAUCCUCAGGGGCACUUCAUGCAAGAAGGGUAGUCUCCACCACAGUAGUCUCUACAUCAGAUCCCUUAGCUCUGCUCGUGACGAUUGGUACGUUGCAUUGGGCCGCGGGUUCUUCCUAACCUUGGAUGCGGUGCUGCUUUGUCGAAAGCCGAUGCUCAGAAUGCGCUGUUCGGUCUCUACCGGUCUCUUCGAUCCCUUGCACGCCAGACAAUCUCGCAAUUACGUUCCUACAACAAUUGUAGUGGGAUUCGAGACCUGUCUUCCAUUGUGCUCCGGCAGCUCAGAGUCUUUAAGGCCGCAACCCGCUUGCGAUGGAGUCCGAUUGCUGCAAGUCAGGCGAGUAUUGUUGUCUUCAAUCGUUGGGGUCUGUGAAGAAGGACGAGACGUAUUAUUGCGCGGAUUCGCCGAAUUCCAGUGCGAAACUUACAUGCGGAAGGGCUCACGAACAAGGGCUAAUUUUGUUCUCUUAACACUUUGCUCGGGGUUCAAAACAUCUCUAAGCUCCCGCAGGGGCAGUCAAGGUCUUAAUAUUUGUACGCGCGUCGGAGGUACACGAGAACUCCGGUGUGGUCGCUUGGAAUUCAUGUGCUUCUUCGGUCCCUCCAAGUUAUCUACGGAAGUUUGCCUCUGGUUAGAGACCUUAGGGGACACGGUUAAGAGAUCCUGGUAUAUGCGAAUCUUUAGGGGGAGCUCUAUUAGCUGGUCCGCAUUGGAUUUGAGACGCAUAGUUUCUAGGCGGAUGCGGAAUUCUCCAAGUGAAUGCGGGAGUUCACAGGGGUUGCAGAUACGGGAGAUUAAGUGGAAGAUUCGGCAAAAAUUACUAGGUAGAAGUUUCAUCCCGGAUUCUCGUGCUCUGCGACCGAUGCUUGGGAAACCUGUCGGGCAUUUCGUGCAUCGAGUCUCCCCAAAGUACCGCCCCCGGUACGCAUCCGUUCCUAACAGUUGGGCUCCUGUGGAUCGGAGUAGCCGAGAACCCUUUAAGUGUCCGUUAGCCCAGGUCCCAUCUGGGUCCGAUAGUUGUGCUAGUGUAACCCCAAGGGACCGAACUGAGACACGAGGUUUCACCGAUUUCAUCGGUUUGGUUACUCAGGAGGUCAGAGCGUGGGGCGUUUGGUCAAGCGGCUGGUUCACAUUCAUACAUGCUGAAACAAUAACGUGUUACUUCCAACAUAUUAGACGACUACUAUCCCCAUCUCGCGCACCGCAGUCGUUCAAUCGGGUCGUUUCCACGGAUAAGUCUAGCUUAACCUGGAUCGUCGACGGACCGAGUCUGGAAGGAAGGCAGGAUUUGAGGAUCUCGUCAAUCACUCACCCGUCGCGCACCACCAGAAAUCCCUUUGGCUCUACACCCAAAAAUGGGUACGCCGUUGCUGGUGUAACCUUUAUUAUUGCCGCUUGGUAUGUUACAAGAAGGCCAUUGGUCUGGCAGCGUAAAAUACGGUGGUCUCAUAGUAUCCACAAUUCAACCGUGCGAGCUGAUCACUUACACACAUACACUCGAAUUUUUAACAGUAAGGAUUUAGGGCGCAUGAUUCCACAACACCCUAUGGAGACUGCCGAACACACUCAGACUUCAGCGAUGUUUCGUGUCAAUUCCGUGGCAUAUUACCCUAUUCCGGUCGGACCUGGUUAUACGAGUCAUCGAUGUAGAACGUUACUGUAUGGCCCUGAGCGUUGUGGAAUCGGGCGGGCUACCAAUAUGUCUAAAGUAACACUGCUGCUGUAUUACAAUCUGGCCGACGGGUGCCGUUGCACACAACCAAGGAGCCAUCACUUUAAUAUAACGUUCUUGCUUAUUGGUCCAGGCUCUGAAUUUGCAAGCGGGCAUGCUCCCGUGUACCUAGAUGCAUCGGAAUGGUCCAUCAGCGCUUCCCCAGCCCACACGAUCGACGAGGUCCGAGCCGGGGUCCCGGGUGCCCCGCGGGGAGCAAGGAAAAGUAAAGCCGACGUUGAGAUGAACGGCUGUAAAAAUACUGUGUUGAAGCCCGUUAGAAGCUUUACGGUCAUAGAAGGCGCCGACGGUAAGGAGGGUAAGUUCGCUCAUCCUGAAGUAGCCUUCGGUUCAUUUACUUAUUCUCGGGCUAACGUGCGAGUUCUCCACUUCCACGGGGGUACCCCCGGGUCAAGAUCUCCGGUGCUAUCCCCAAUUUUAGUUGCGCUAAAUCCUAUCAUAUUACCUACAUUGAUGGGGGCUAUACGGAAGUACUUUGUGACAACUAGCCUCCGAUCGACAAUAGAGAGCUCAGAGACAUGCAGCCGUCCUUUUUGUGACCGAUGUCCUUAUGCGACUACCGCAGGCCUGCGGUUCGGGCUUUCCAGUCUUCGCGCGGGACUAGAUUGCGUAGGAGCCAAUCAUGCAGUAGCGUUAGCUUGCAAACUCAAUGCGUUCGCGGCGGGAUACUAUAUCCGUCUCACGGGAAUCGCCCUCUAUUGUUCCGAACUAGCUCUAACUGCCUAUGGCUGGAGCCGGACUUCUACUUUCAAAUUAUUUGUUAUAGUCCUAGUCGUCCCAGAUUUGGUUUCCGGAACACAGAUGCUUCCGUGUAGCCCUAACACCUUAGUUUGCAUCGGCCAUAUUCCUUGCGUGUUGGGCGGCUUCGUGACUGGGCCCGCUGGAGUCUUAGGGUGUAGUCGUGGAGUGAAACCUCACGAUGAGGUCUCUAUUGGCCAUUCCAGGCAUCGCUGGGUCUUCUAUUUACAGGCUAGGGGAGGCGGAUAUCAUUCAUGCAUUGACAUGCUGCUGAAUUACCUUAGCUCCGCCGGCGGUUAUCGAGUAGUAGUCAGCAGGGAAGAGUGUAAGUAUCAUGGAACUUUAUCGAGAACGAACGUAACUAAAGCGUACUGGACACAUCGCCACAGAGACUUGUCGCGUCUGAGGCUCCGCCCGUUUUCCGAUAAGAUUAGUAUGGGGUGCCUGAAAAAUGUCACGAGAUUGACCAGAGCUCUCGACACCUAUCAUGAAUGUGUCAAGAGCGCCGAAGAAAGUGAGGCUCUCCGAGGAAUGGGGAAUAAGACGCAUACGAUUAAGCGCUGCAAAAGGCAUCUCGGCGGUUACCCGCUUUUAGAUCCUUUCGCAGGGAAGAGUCAAAGGUCUAAGACAUACUUCAGUAUUACACUUGUGGAAGGUCCGCACCUAUAUAGCUUUUACCGACUAGGCAUUCAGCCCGUUCAUGUUCAACCCGAUACACUGAUUGUGACCGUCGGUUACCUUGAUGACGCACGCCCGAUUGCAGUAGAAAAAAUCCUGACCCGUGUUGGCGGGAAGAGCGCACUAACGGGAUCGGCUUCUAACCGCAUAGUCUGCAUUGUGCCAGCUAUAACCGACCAUGGAUACACAGGCGAUCUGGAAGUCUUACUGGCCGAGGUCCUCUUAGAGUCGAACGUUCCCAGCGAGGCCCGUAUCGAUCUGGGGCCACACUAUGGGCGACAGAGUUCUCAUACCGACUCGUCCAGAGCUACAGUCUGGCUGCCGUGCAAAUCCACCUACUGCGGUAAAUCGACUCACGGAGGUGCGGUAUAUGCGCCCUUUCCCCCGGGCUGUUUCGCAGCGCUCUUAGUGACGCGGCGUCUCGAGCUCCGAGGUGUUUUCUUAACAUGGAAAAUGCCUGCGCACCACUGUUCAAUCGGCGGGGAAGAACAGACUGGGGUAAUCUUAGAGGCAUAUCAGGAGCAACCUCGUAGGCUGGGUCCCCCGUACAGCGACGGCCUACGUGAGGCAUGGAAAGACGAGGAAAGUUAUCUCUUGUCACAAGCACGGUCUGCAUCACUGGAGUGCAGGUCCGAUAAAUGCCGGAUCGGAUUGCUAGCGGUGGAUACUUUUGCCCUGUCUAAGUCCAGGCGAGGUGUGAAUCGCGCCAUGGUGACUGAUCACACAACGGGACGGAUUUCCAGGCGACGAGUGCAUGCACCCCCACCAGCAGCGGGCUGUCAUCGGUUGCGAAUAACAAGAAUGCUAGUGUACAGUUUCCCUAAGGUUUGGAGUCAAGAGUCCCCCGCCACGCCCUUUUACCACCUGAAAAUGAUGUUGCAAUUCGCCAAAUUACCCUGGCAUAUCCGGCGAGAUGCGAGCUCGAGACCCGCAAAAUUGGAUGGGUACGGCCGAUCCCAUCUAUCAAGUUAUCCAGCGGGGAAUCUGGAAAUCGCUUCCUCAAAACUUCAUAUGUUCAGGAGUAGAACCCCUGCAACCUUUGAGCGUCCAACACUGCUAAGCUACUUAGGAUACGAAACAGCUAUUGUACCAAGAAGCAGCCCAAGAAGGUCCCCAAGAAACACGACACAGCCAUCUCUGAUUGGCGACUACAGAACAGGGGUGGCUUUUGGGUCGUACGGCGACCCGUCGUGCGCCCGGGGGACAACCCCCGACAUCCUCGAACGGGGUGAUUGUCACGCUCGUAAUGUCCAUGCGGGGAUGCGCAAUAAGCUAGACCUGACAGUUCCGGGUGAGUUCAUGUCGCGGACGGCAGCCGUGUCACGUGACUCACCAGUUUACAGUUUACUCUGCUCGGUGAAUUACUCGGUAGUCAGGCGGGGCAUGUCUACUGAGCACAUCUCCCUGAGCAGAAAUCGUCUUUCGAACACGGCGACGCCCAUGGCGCCGAGCCUCCUCUCGCGACAGAAAGUCAUUUUCCCCCAAGUUCAGAGUCGUGCGAAGUCAUCUUCUGAACCGCAUCGGUGCAGCCUUCGACCUAUCCUAUCGAAGUCACGGAUAGAAUCGCACCGAGAUUUGAGGACUUCGUCCUUCCCCUUGUCAAAGUGUUUGUGGACUGAAGCGCGAUGGUCAUUUAGUUCGAUUACUCGAUACAUGGAUCCCAGUUGCCUCGCUCGUGGUCUGCCUCGGGUAGCCCAACGGAGGGCCCAAAAACAUGGCUGCACGCCGAUUUGGCGAAGACGACACGGCAACUCCUUGCAGUCUUAUUGCUCCCUGAGAAUUGUACCGGUGCCAAUAACGGAUCCAGUAAGAACAGAUGGAAGGCAUCUGAUAUCAGGCCUUGAGCGAAUAGCAGUGGGACACGACGUACACGGAGGUGUUGUAACGUCCGAGUGGGCGUUUUCCCCCGUCAAUGGAUUGCCGGCACAGGUAAGAUGUGGCUGCUGUUAUGGAGUAUCAUUGAUUGCUUCAUUAGUACAAGCAACCUCUCCUGAGUUCUACCGAAGGAAUUUCAACCAUCUUGUUCUCUUUACAUAUCGGCUGAGCGUACUCGUUUCCGCGGUCCCCGCACUUGUCUGGACAGGAAAUCAAAGCUCCGCGCAAGUAACCACAACACGUAAUACUACAGUAAAGAACUUUGCCAAUAAUGUCAGAAUCGUCCUGAGCACGCAGGGGACCUUACGUCGGGCUGGUGCAACAUAUCCGCAUGUGUGGGCUGCAACCAAACAUUCUGGUAGGCGGGUGACGUUUACUACCUACUGUAUCUACGUCUCCAUAAGACCACACUAUCACGAAAGACCCAGCUACAGCCGUCUAUUGUAA'
rnaToProtien(actualSeq)

'MGRLLAGHPRGDGITALFSVVSNFQHTRKPSMRLPLVFSTHCPTKRFLSVHRTIPDEIPYEPIDKACAMEDRKMNARIACDGLPDTMCCSVNIKRLLLPPTHWHTFIQGCFTRSMAVRLAVAGQFFSENCKIVSSASLDDCSKYLVVSPFRRPVVIAQSDAPKTTHMRCRMRKRLIKNIWTPSPQAQTYTSDLATGSPYRFGYYVYTRYRLSDNRLHLQSVKGRKNSRAKSCNHGRSCDQGILRAVRADPAGSAYWISSAASGNVLNTTSLCVAGGLPALGRTSNGSRCVHKTLERLLCDWRSGSLMDPLTKHLSGTNQIEDQSMRSRRASDSVIRTLRQLDIVEYPPNTLVKGNSRGPGSVKTTGQQVIRVCLPLIRHAGNRSVVEPKNPAAGDLMPRWKYRILRHAYRLLITSAPQELSAIVFSEIPVVQLSHDSPLIKGATRPYQPIRSARTLRHNAGFAYDDERVKAEMQLDVNLCAKSVVLGNFVLRHQMRPPFHDRTKVRVRRARPCARFAQGAQELGHCLNETDRMFVFTHVRLSCRGVCHTGLVEVILSRTRGRLTTYTTGCLEGNPKGQNLTIVGRSHRKGEFFCDTLNSNQPTRHYALARPLKRSRTLAPGELSPVGLRNGLPTCRKPGATYLLESKSGSDGRCGHYKSCVEGGGHLRQSYTTSTFLANMSERFKSIGACSSEAFLRCVRARLVGPKRLLSQPLKIGAHGMCVILYLGDPCFPLIWPELCFTRPIIPGSPDRPSRSLKLIRSLLCISMYEGSLVVMRRKDFGRGAHSLDLMPKTLGRRREFMMLDQKYAMIITSALLITTPAGHLRLIKGHRVQDWVLSSPLTEAFSRGTRRVISLVIYYFADKHPRGYNRVRPDLGFHFGLQLTLWVRPKGLMLSEGNAWRHKSTSLECSPCTPKHWGYRSLCACTGFDEECWPSSAPHLPLSNLYKKYAARSSLRPLLPAYKAISIDLPRKSPVDALMQRLGFKHFEPTAPCTSSCR