# Mass spectrometry data

The objective of this exercise is to read in raw peptide MSMS spectrum information and output a dataframe.
The .msp file can be downloaded [here](ftp://chemdata.nist.gov/download/peptide_library/libraries/cptaclib/2015/cptac2_mouse_hcd_selected.msp.tar.gz).

The information in this ASCII-based text file is organized spectrum by spectrum.
The first line of each spectrum is formatted like this:

&emsp;<code>Name: sequence/charge_nmods_collisionenergy</code>

followed by a comment section, which can be disregarded, and the actual spectrum data, which is tab-separated:

&emsp;<code>m/z&emsp;intensity&emsp;additional_info</code>

Spectra are separated by an empty line.
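
As a minimal sketch of how such a header line could be taken apart, assuming the layout `Name: sequence/charge_nmods_collisionenergy` described above and a collision energy that always ends in `eV`: the helper name `parse_name_line` is hypothetical and not part of the exercise.

```python
def parse_name_line(line):
    """Split 'Name: SEQ/charge_nmods_XX.XeV' into (sequence, charge, energy)."""
    header = line.strip()[len("Name: "):]
    sequence, rest = header.split("/", 1)
    # The charge is the first '_'-separated field.
    charge = int(rest.split("_", 1)[0])
    # The collision energy is the last '_'-separated field; modification
    # names such as 'Pyro_glu' contain underscores, hence rsplit.
    energy_field = rest.rsplit("_", 1)[1]
    if energy_field.endswith("eV"):
        energy_field = energy_field[:-2]
    return sequence, charge, float(energy_field)


print(parse_name_line("Name: AAADDGEEPK/2_0_24.4eV"))
```

Splitting from the right for the energy keeps the parse independent of how many modifications are listed in the middle of the header.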

Code a function that returns two DataFrames or arrays containing the processed and filtered data. The first one should contain the spectrum information (n_spectra, n_m/z_features) and the second one the sequences per row (n_spectra).

Here are some general guidelines:

* The m/z values need to be binned to integer values (mathematically rounded); otherwise the dataframe size would get out of hand. As a result, several peaks can map to the same bin (e.g. peaks at 145.1 and 145.2 both fall into bin 145). In that case, only the maximum of those peaks should be kept in the final dataframe.

* Rows that are all-zero should be dropped.
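
The binning guideline can be sketched with NumPy as follows. This is one possible approach, not the reference solution: `bin_spectrum` is a hypothetical helper, "mathematically rounded" is assumed to mean round-half-up, and the m/z range is assumed to apply to the binned values.

```python
import numpy as np


def bin_spectrum(mz, intensity, mz_min, mz_max):
    """Return one dense row holding the maximum intensity per integer m/z bin."""
    mz = np.asarray(mz, dtype=float)
    intensity = np.asarray(intensity, dtype=float)
    # Round half up; np.round would round halves to the nearest even integer.
    bins = np.floor(mz + 0.5).astype(int)
    keep = (bins >= mz_min) & (bins <= mz_max)
    row = np.zeros(mz_max - mz_min + 1)
    # Unbuffered maximum: repeated bin indices keep the largest intensity.
    np.maximum.at(row, bins[keep] - mz_min, intensity[keep])
    return row


row = bin_spectrum([145.1, 145.2, 146.6, 300.0], [10.0, 30.0, 5.0, 99.0], 140, 150)
print(row[145 - 140])  # the peaks at 145.1 and 145.2 share bin 145
```

`np.maximum.at` is used instead of plain fancy-index assignment because the latter would keep whichever duplicate happens to be written last rather than the largest one.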

Your function should allow for selecting a range on the x-axis (m/z-range). All peaks outside of this range can be disregarded. Furthermore, only spectra within a set collision energy range and a maximum sequence length should be contained in the output dataframe.
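
A compact sketch of how these filters might fit together is shown below. It is written under several assumptions: the function name `msp_to_frames`, its parameters and their defaults are hypothetical, peak lines are assumed to be exactly those that start with a digit, and rounding is round-half-up. It is not tuned for speed.

```python
import numpy as np
import pandas as pd


def msp_to_frames(lines, mz_range=(100, 2000), ce_range=(0.0, 100.0), max_len=30):
    """Build (spectra, sequences) DataFrames from an iterable of .msp lines."""
    mz_min, mz_max = mz_range
    n_bins = mz_max - mz_min + 1
    rows, sequences = [], []
    row = None  # intensity vector of the current spectrum, None if filtered out
    for line in lines:
        if line.startswith("Name: "):
            sequence, rest = line[6:].strip().split("/", 1)
            energy = float(rest.rsplit("_", 1)[1][:-2])  # drop trailing 'eV'
            if len(sequence) <= max_len and ce_range[0] <= energy <= ce_range[1]:
                row = np.zeros(n_bins)
                rows.append(row)
                sequences.append(sequence)
            else:
                row = None
        elif row is not None and line[:1].isdigit():
            fields = line.split("\t")
            b = int(float(fields[0]) + 0.5)  # round half up for positive m/z
            if mz_min <= b <= mz_max:
                row[b - mz_min] = max(row[b - mz_min], float(fields[1]))
    data = np.vstack(rows) if rows else np.empty((0, n_bins))
    keep = data.any(axis=1)  # False for all-zero rows, which are dropped
    spectra = pd.DataFrame(data[keep], columns=range(mz_min, mz_max + 1))
    seqs = pd.DataFrame({"sequence": [s for s, k in zip(sequences, keep) if k]})
    return spectra, seqs
```

Because the function only iterates over lines, an open file handle can be passed directly, e.g. `with open(filename) as fh: spectra, seqs = msp_to_frames(fh)`.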

The faster your function runs, the better. I will time them all in the end.

In [5]:
import numpy as np
import pandas as pd
import timeit
import csv
from itertools import islice

In [4]:
filename = 'cars.msp'

In [27]:
def read_file(filename):
    lines = []
    with open(filename, 'r') as file:
        file_dict = csv.reader(file, delimiter='/')
        for line in file_dict:
            lines.append(line)
    return lines

In [29]:
a= read_file(filename)

In [37]:
a[6]

['167.1190\t2481.4\t"Int', 'PP-CO', '6.6ppm"']

In [54]:
def read_file(filename):
    # Read every line of the file as a comma-split list of fields.
    names = []
    with open(filename, 'r') as file:
        file_dict = csv.reader(file)
        for line in file_dict:
            names.append(line)
    return names

In [125]:
def read_file_1(filename):
    names = []
    locations = []
    with open(filename) as file:
        file_dict = csv.reader(file)
        i = 0  # > 0 while inside a spectrum whose sequence passed the length filter
        j = 0  # number of lines collected since the last accepted 'Name' line
        for line in file_dict:
            is_name = any('Name' in sl for sl in line)
            if i > 0 and not is_name:
                names.append(line)
                j += 1

            if is_name:
                i = 0
                # 'Name: SEQ/charge_...' -> SEQ; keep sequences shorter than 30 residues
                sequence = line[0].split('/')[0][6:]
                if len(sequence) < 30:
                    print(line)
                    locations.append(j + 1)
                    j = 0
                    names.append(sequence)
                    i += 1

    return names, locations

In [55]:
%timeit -n 1 read_file(filename)

3.77 s ± 66.3 ms per loop (mean ± std. dev. of 7 runs, 1 loop each)


In [52]:
%timeit -n 1 read_file_1(filename)

7.35 s ± 190 ms per loop (mean ± std. dev. of 7 runs, 1 loop each)


In [None]:
names, locations = read_file_1(filename)

In [None]:
locations

In [None]:
names

In [None]:
print(names[1195+322+84])

In [None]:
read_file(filename)

In [None]:
a = 'AAFEWNEEGAGSSPSPGLQPVR'

In [None]:
a = ' 01'

In [117]:
len(a)

['Name: AAAAAAAAAAEEAAMQRDLLPPAGR/3_0_34.9eV']
['Name: AAADDGEEPK/2_0_24.4eV']
['Name: AAAEVNQEYGLDPK/2_0_31eV']
['Name: AAAFEDQENETVVVK/2_0_38.7eV']
['Name: AAAFEDQENETVVVK/2_0_40.1eV']
['Name: AAAGELQEDSGLHVLAR/3_0_19eV']
['Name: AAAITSDLLESLGR/2_0_29eV']
['Name: AAAITSDLLESLGR/2_0_33.2eV']
['Name: AAAITSDLLESLGR/2_0_44eV']
['Name: AAALERMPR/2_0_21.2eV']
['Name: AAATLMTER/2_0_22.6eV']
['Name: AAATQPDGKDTPDEPWAFPAR/3_0_34.3eV']
['Name: AACFQLK/2_1(2', 'C', 'CAM)_19.6eV']
['Name: AACLESAQEPAGAWSNK/2_1(2', 'C', 'CAM)_41.9eV']
['Name: AACLESAQEPAGAWSNK/2_1(2', 'C', 'CAM)_43.5eV']
['Name: AADAVEDLR/2_0_30eV']
['Name: AADIPGLK/2_0_18.4eV']
['Name: AADKDTCFSTEGPNLVTR/2_1(6', 'C', 'CAM)_41eV']
['Name: AADKDTCFSTEGPNLVTR/2_1(6', 'C', 'CAM)_46.4eV']
['Name: AADKDTCFSTEGPNLVTR/2_1(6', 'C', 'CAM)_48.2eV']
['Name: AADKDTCFSTEGPNLVTR/3_1(6', 'C', 'CAM)_22eV']
['Name: AADKDTCFSTEGPNLVTR/3_1(6', 'C', 'CAM)_29.2eV']
['Name: AADKDTCFSTEGPNLVTR/3_1(6', 'C', 'CAM)_30.4eV']
['Name: AADVHEVR/2_0_21eV']
['

['Name: AFGENAVVQLISLQK/2_0_37.9eV']
['Name: AFGENAVVQLISLQK/2_0_39.3eV']
['Name: AFGENAVVQLISLQK/2_0_50eV']
['Name: AFGENAVVQLISLQK/3_0_23.9eV']
['Name: AFGPGIEGK/2_0_20.5eV']
['Name: AFGPGLEPTGCIVDRPAEFTIDAR/3_1(10', 'C', 'CAM)_38.2eV']
['Name: AFGPGLQGGNAGSPAR/2_0_35.4eV']
['Name: AFGPGLQGGNAGSPAR/2_0_45eV']
['Name: AFIDCCNHITK/2_2(4', 'C', 'CAM)(5', 'C', 'CAM)_29eV']
['Name: AFIDCCNHITK/2_2(4', 'C', 'CAM)(5', 'C', 'CAM)_32.3eV']
['Name: AFIDCCNHITK/2_2(4', 'C', 'CAM)(5', 'C', 'CAM)_33.5eV']
['Name: AFIDCCNHITK/3_2(4', 'C', 'CAM)(5', 'C', 'CAM)_15eV']
['Name: AFIDCCNHITK/3_2(4', 'C', 'CAM)(5', 'C', 'CAM)_20.3eV']
['Name: AFIDCCNHITK/3_2(4', 'C', 'CAM)(5', 'C', 'CAM)_21.1eV']
['Name: AFIGDIK/2_0_15.9eV']
['Name: AFIGDIK/2_0_18.6eV']
['Name: AFIGDIK/2_0_24eV']
['Name: AFIHWVSQPLVCEIR/3_1(11', 'C', 'CAM)_28.4eV']
['Name: AFLEVLAK/2_0_20.9eV']
['Name: AFLEVLAK/2_0_28eV']
['Name: AFLGHNVTNAVITVPAYFNDSQR/3_0_37.4eV']
['Name: AFLGMK/2_0_16.2eV']
['Name: AFLHIPAK/2_0_21.8eV']
['Name: AFLHIP

['Name: ALANYILFK/2_0_33eV']
['Name: ALASVLLQDHIR/2_0_32.5eV']
['Name: ALAVIEK/2_0_18.1eV']
['Name: ALCSEFQGSVATPR/2_1(2', 'C', 'CAM)_37eV']
['Name: ALCSFQIYSVPWK/2_1(2', 'C', 'CAM)_38.9eV']
['Name: ALDETLENQK/2_0_28.2eV']
['Name: ALDFAVSEYNK/2_0_30.6eV']
['Name: ALDFTQWLMVLK/2_0_35.6eV']
['Name: ALDGDFTEENR/2_0_26eV']
['Name: ALDGDFTEENR/2_0_29.7eV']
['Name: ALDGDFTEENR/2_0_30.8eV']
['Name: ALDGDFTEENR/2_0_39eV']
['Name: ALDGLWHFR/3_0_16.5eV']
['Name: ALDLFSDNAPPPELLEIINEDIAKK/3_0_40.8eV']
['Name: ALDLFSDNAPPPELLEIINEDIAKK/3_0_42.4eV']
['Name: ALDLFSDNAPPPELLEIINEDIAKK/3_0_54eV']
['Name: ALDLTRPVTFVSNAK/3_0_32eV']
['Name: ALDPALWTIQK/2_0_30.5eV']
['Name: ALDPFTTPSPPTSLEITSVTK/2_0_51.6eV']
['Name: ALEAANADLEVK/2_0_29.1eV']
['Name: ALEAANYQDTIGR/2_0_34.6eV']
['Name: ALEEERPSLR/2_0_37eV']
['Name: ALEEERPSLR/3_0_13eV']
['Name: ALEEERPSLR/3_0_18.4eV']
['Name: ALEEEVEK/2_0_23eV']
['Name: ALEEFQSEVSSCR/2_1(11', 'C', 'CAM)_37.5eV']
['Name: ALEELQR/2_0_20.9eV']
['Name: ALEHAFKLEHIMDLTR/4_0_20.

['Name: AQIGGPEAGK/2_0_22.6eV']
['Name: AQIHFPR/2_0_20.3eV']
['Name: AQIHFPR/2_0_21.1eV']
['Name: AQIHFPR/2_0_27eV']
['Name: AQIHNGILR/2_0_24.8eV']
['Name: AQIIHDSFNLASAK/2_0_36.8eV']
['Name: AQIIHDSFNLASAK/2_0_47eV']
['Name: AQITNPSGASTECFVK/2_1(12', 'C', 'CAM)_40.1eV']
['Name: AQLEEQLQEVR/2_0_32.7eV']
['Name: AQLIDMK/1_0_34.1eV']
['Name: AQLIDMK/2_0_17eV']
['Name: AQLIDMK/2_0_19.2eV']
['Name: AQLIDMK/2_0_25eV']
['Name: AQLYIDCDK/2_1(6', 'C', 'CAM)_23eV']
['Name: AQLYIDCDK/2_1(6', 'C', 'CAM)_27.4eV']
['Name: AQLYIDCDKMESAELDVPIQSIFTR/3_1(6', 'C', 'CAM)_45.1eV']
['Name: AQMQEAMTQEVSDVFSDTTTPIK/2_0_59.9eV']
['Name: AQMQEAMTQEVSDVFSDTTTPIK/2_0_62.2eV']
['Name: AQMQEAMTQEVSDVFSDTTTPIK/3_0_37.7eV']
['Name: AQNGEFMTHK/2_0_28.3eV']
['Name: AQNTWGCGSSLR/2_1(6', 'C', 'CAM)_32.5eV']
['Name: AQNVPLPVSTLVEFVIAATDCTAK/2_1(20', 'C', 'CAM)_53eV']
['Name: AQNVPLPVSTLVEFVIAATDCTAK/2_1(20', 'C', 'CAM)_59.6eV']
['Name: AQNVPLPVSTLVEFVIAATDCTAK/2_1(20', 'C', 'CAM)_61.9eV']
['Name: AQNVPLPVSTLVEFVIAATDCTA

['Name: AVLDVAETGTEAAAATGVK/3_0_19eV']
['Name: AVLDVAETGTEAAAATGVK/3_0_26.2eV']
['Name: AVLDVAETGTEAAAATGVK/3_0_27.2eV']
['Name: AVLEDIFKK/3_0_15.7eV']
['Name: AVLFNYR/2_0_18eV']
['Name: AVLFNYR/2_0_21.5eV']
['Name: AVLPLLDAQEPCYLLFR/2_1(11', 'C', 'CAM)_42eV']
['Name: AVLPLLDAQEPCYLLFR/2_1(11', 'C', 'CAM)_47.2eV']
['Name: AVLSAEK/2_0_15eV']
['Name: AVLSAEK/2_0_17.5eV']
['Name: AVLSLLR/2_0_18.1eV']
['Name: AVLTSQETLFGGSDCTGNFCLFK/2_2(14', 'C', 'CAM)(19', 'C', 'CAM)_53.1eV']
['Name: AVLTSQETLFGGSDCTGNFCLFK/2_2(14', 'C', 'CAM)(19', 'C', 'CAM)_59.8eV']
['Name: AVLTSQETLFGGSDCTGNFCLFK/2_2(14', 'C', 'CAM)(19', 'C', 'CAM)_62.1eV']
['Name: AVLTSQETLFGGSDCTGNFCLFK/3_2(14', 'C', 'CAM)(19', 'C', 'CAM)_28eV']
['Name: AVLTSQETLFGGSDCTGNFCLFK/3_2(14', 'C', 'CAM)(19', 'C', 'CAM)_35.4eV']
['Name: AVLTSQETLFGGSDCTGNFCLFK/3_2(14', 'C', 'CAM)(19', 'C', 'CAM)_37.6eV']
['Name: AVLTSQETLFGGSDCTGNFCLFK/3_2(14', 'C', 'CAM)(19', 'C', 'CAM)_39.1eV']
['Name: AVLVPHHK/2_0_21.9eV']
['Name: AVLVPHHK/3_0_13.8eV']
['

['Name: DAPIVNR/2_0_18.4eV']
['Name: DAQDLDSGREHER/3_0_23.4eV']
['Name: DAQEKLEQAEK/2_0_31.3eV']
['Name: DAQILYNAGENK/2_0_32.5eV']
['Name: DAQLAGSPELLEFLGTR/2_0_57eV']
['Name: DAQMVHSNALNEDTQDELGDPR/3_1(3', 'M', 'Oxidation)_37.9eV']
['Name: DAQPQLEEADDDLDSK/2_0_41.9eV']
['Name: DAQPQLEEADDDLDSK/2_0_43.5eV']
['Name: DAQPQLEEADDDLDSK/3_0_27.4eV']
['Name: DAQQESTDPK/2_0_27.2eV']
['Name: DASESSELSR/2_0_26.3eV']
['Name: DASEVVTPR/2_0_22.8eV']
['Name: DASKRQAELEAAIQR/3_0_24.9eV']
['Name: DASVVGFFR/2_0_24.3eV']
['Name: DATNVGDEGGFAPNILENKEALELLK/3_0_42.2eV']
['Name: DATSHVLTDLEPGQEYTVLLIAEK/3_0_39eV']
['Name: DATSHVLTDLEPGQEYTVLLIAEK/3_0_40.5eV']
['Name: DATSHVLTDLEPGQEYTVLLIAEK/3_0_52eV']
['Name: DATSLNQAALYR/2_0_32.2eV']
['Name: DAVDTGDIFESFSSHPPLILPLGSSR/3_0_40.7eV']
['Name: DAVENCCGISR/2_2(5', 'C', 'CAM)(6', 'C', 'CAM)_31.1eV']
['Name: DAVIYPILVEFTR/2_0_32eV']
['Name: DAVIYPILVEFTR/2_0_36eV']
['Name: DAVIYPILVEFTR/2_0_37.3eV']
['Name: DAVKEEDSLHWQRPGDVQK/3_0_44eV']
['Name: DAVKEEDSLHWQRPG

['Name: DICEGQVNSLPGSINK/2_1(2', 'C', 'CAM)_42.1eV']
['Name: DIDITSPEFMIK/2_0_33eV']
['Name: DIDITSPEFMIK/2_0_34.3eV']
['Name: DIDLPETFDAR/2_0_31.4eV']
['Name: DIDLSPVTIGFGSIPR/2_0_39.5eV']
['Name: DIDSLAQR/2_0_22.3eV']
['Name: DIEDESTGLK/2_0_26.9eV']
['Name: DIEIKPSVELPFNTFNVK/3_0_32eV']
['Name: DIELSPEAQSK/2_0_29.6eV']
['Name: DIEVTKEEFAQSAIR/3_0_26.6eV']
['Name: DIFLVMPTGGGK/2_0_28.9eV']
['Name: DIFLVMPTGGGK/2_0_30eV']
['Name: DIFPIAFPR/2_0_25.2eV']
['Name: DIFQEIYDKK/2_0_31.6eV']
['Name: DIGLPNGLTFDPFSK/2_0_33.8eV']
['Name: DIGLPNGLTFDPFSK/2_0_38eV']
['Name: DIGLPNGLTFDPFSK/2_0_39.4eV']
['Name: DIGLPNGLTFDPFSK/2_0_51eV']
['Name: DIGNIISDAMK/2_0_27.6eV']
['Name: DIGNIISDAMK/2_0_37eV']
['Name: DIGNIISDAMKK/3_0_20eV']
['Name: DIHNLEDIKK/3_0_18.8eV']
['Name: DIIFAVK/2_0_19.6eV']
['Name: DIIFLLDGSDNVGK/2_0_35.3eV']
['Name: DIIFLLDGSDNVGK/2_0_36.6eV']
['Name: DIKHDPSLQPWSASYDPGSAK/2_0_55.9eV']
['Name: DIKHDPSLQPWSASYDPGSAK/3_0_35.2eV']
['Name: DIKTENGLK/2_0_24.8eV']
['Name: DILATNGVIHFID

['Name: DQSPASHEIATNLGDFAISLYR/3_0_36.8eV']
['Name: DQSPASHEIATNLGDFAISLYR/3_0_47eV']
['Name: DQSPASHEIATNLGDFAISLYR/4_0_25eV']
['Name: DQSPASHEIATNLGDFALR/2_0_42.5eV']
['Name: DQSPASHEIATNLGDFALR/2_0_47.8eV']
['Name: DQSPASHEIATNLGDFALR/2_0_49.7eV']
['Name: DQSPASHEIATNLGDFALR/2_0_64eV']
['Name: DQSPASHEIATNLGDFALR/3_0_28.3eV']
['Name: DQSPASHEIATNLGDFALR/3_0_30.1eV']
['Name: DQSPASHEIATNLGDFALR/3_0_40eV']
['Name: DQTNDQVTIDSALATQK/2_0_44.9eV']
['Name: DQVASLSSGVIQEALANNMK/2_1(18', 'M', 'Oxidation)_50.9eV']
['Name: DQVDSAVQELLQLK/2_0_37.1eV']
['Name: DRCCELPGLRDEESCPDLPR/4_3(2', 'C', 'CAM)(3', 'C', 'CAM)(14', 'C', 'CAM)_25.8eV']
['Name: DRDVTFSPATIEEELIK/3_0_29eV']
['Name: DRDVTFSPATIEEELIK/3_0_30.1eV']
['Name: DREEALHQFR/2_0_30.5eV']
['Name: DREEEEEPLTEQTEGK/3_0_29.4eV']
['Name: DRIKTIRNQPR/2_0_32.7eV']
['Name: DRLEEVR/3_0_14.1eV']
['Name: DRLEEVREHMEEVR/3_0_36eV']
['Name: DRLEEVREHMEEVR/4_0_19eV']
['Name: DRLEEVREHMEEVR/4_0_25eV']
['Name: DRPHEGTRPVHAVFVSEGK/3_0_42eV']
['Name: DRPHE

['Name: EAPFTHFDPSCLFPACR/2_3(0', 'E', 'Pyro_glu)(10', 'C', 'CAM)(15', 'C', 'CAM)_47.7eV']
['Name: EAPFTHFDPSCLFPACR/3_2(10', 'C', 'CAM)(15', 'C', 'CAM)_22eV']
['Name: EAPFTHFDPSCLFPACR/3_2(10', 'C', 'CAM)(15', 'C', 'CAM)_28.5eV']
['Name: EAPFTHFDPSCLFPACR/3_2(10', 'C', 'CAM)(15', 'C', 'CAM)_30.3eV']
['Name: EAPQDFHPDR/2_0_29.5eV']
['Name: EAPQDFHPDR/3_0_18.6eV']
['Name: EAPQELR/2_0_20.5eV']
['Name: EAQDSSSYR/2_0_25.4eV']
['Name: EAQELGSPEDR/2_0_29.9eV']
['Name: EAQIFDYNEIPNFPQSTVQGHAGR/3_0_41.6eV']
['Name: EAQIFDYNEIPNFPQSTVQGHAGR/3_0_53eV']
['Name: EAQLALGNAAADATEAK/2_0_38.5eV']
['Name: EAQLALGNAAADATEAK/2_0_40eV']
['Name: EAQLALGNAAADATEAK/2_0_51eV']
['Name: EAREEAEEKSEEKQ/4_0_18.3eV']
['Name: EASCSSNCAGCGR/2_3(3', 'C', 'CAM)(7', 'C', 'CAM)(10', 'C', 'CAM)_34.4eV']
['Name: EASDGTDKAPTDSR/2_0_35.3eV']
['Name: EASDGTDKAPTDSR/3_0_22.2eV']
['Name: EASDPQPEDVDGGLK/2_0_37.9eV']
['Name: EASDPQPEDVDGGLK/2_0_49eV']
['Name: EASIYDLTSYFTGSK/2_0_40.9eV']
['Name: EASTLIDRPAPQFER/3_0_26.5eV']
['N

['Name: EGFYDLSAEDPYGCK/2_1(13', 'C', 'CAM)_42.6eV']
['Name: EGGFLLVHTVLK/3_0_18.2eV']
['Name: EGGLGPLNIPLLADVTK/2_0_35eV']
['Name: EGGLGPLNIPLLADVTK/2_0_40eV']
['Name: EGGLGPLNIPLLADVTK/2_0_53eV']
['Name: EGGLGPLNIPLLADVTK/3_0_25.2eV']
['Name: EGGSDGDHPER/2_0_28.1eV']
['Name: EGGVESAFHK/3_0_16.3eV']
['Name: EGHPDTLSK/2_0_23eV']
['Name: EGHPDTLSK/3_0_14.5eV']
['Name: EGIDPAPYYWYTDQR/2_0_45.6eV']
['Name: EGIECEVINLR/2_1(4', 'C', 'CAM)_32.4eV']
['Name: EGIPEDGTIYR/2_0_30.4eV']
['Name: EGIREETVPLRKD/3_0_30eV']
['Name: EGIRGPR/3_0_12eV']
['Name: EGIRGPR/3_1(0', 'E', 'Pyro_glu)_11.8eV']
['Name: EGITVYSTQFGGYAK/2_0_38eV']
['Name: EGITVYSTQFGGYAK/2_0_39.4eV']
['Name: EGITVYSTQFGGYAK/2_0_51eV']
['Name: EGIVGVTEEQVHR/3_0_22.3eV']
['Name: EGIVTTAEQDIKEDIAK/3_0_28.5eV']
['Name: EGKEDEASTDVDEKPK/3_0_27.2eV']
['Name: EGLPVALEK/2_0_23.2eV']
['Name: EGLVALQR/2_0_20.7eV']
['Name: EGNPEEDITADQTNAQAAALYK/2_0_55eV']
['Name: EGNPEEDITADQTNAQAAALYK/2_0_57.1eV']
['Name: EGNPEEDITADQTNAQAAALYK/3_0_36eV']
['N

['Name: ELVSDLDKR/2_0_26.1eV']
['Name: ELVSDLDKR/2_0_33eV']
['Name: ELVSELK/2_0_19.2eV']
['Name: ELVSELK/2_1(0', 'E', 'Pyro_glu)_19.5eV']
['Name: ELVSSTVSGAQEMVSSSVSSAK/2_0_50.8eV']
['Name: ELVVVDTPGIFDTEVPDADTQR/2_0_58.8eV']
['Name: ELWDTLYQLETDKFEFGEK/3_0_35.3eV']
['Name: ELWDTLYQLETDKFEFGEK/3_0_36.6eV']
['Name: ELWLGDCDVTNSGCSSLANVLLANR/2_2(6', 'C', 'CAM)(13', 'C', 'CAM)_67.2eV']
['Name: ELYDAGVK/2_0_21eV']
['Name: ELYDAGVK/2_0_21.8eV']
['Name: ELYDAGVK/2_0_28eV']
['Name: ELYDAGVKR/2_0_25.6eV']
['Name: ELYDAGVKR/3_0_16.1eV']
['Name: ELYIEEALQNER/2_0_35.3eV']
['Name: ELYLFDVLR/2_0_24eV']
['Name: ELYLFDVLR/2_0_36eV']
['Name: ELYLSNNGIEVIEGLENNNK/3_0_34.7eV']
['Name: ELYPDFDLNLND/2_0_35.7eV']
['Name: EMASPSSESNESK/2_0_33.6eV']
['Name: EMDRETLIDVAR/2_1(1', 'M', 'Oxidation)_34.3eV']
['Name: EMDRETLIDVAR/2_1(1', 'M', 'Oxidation)_46eV']
['Name: EMDRETLIDVAR/3_1(1', 'M', 'Oxidation)_16eV']
['Name: EMDRETLIDVAR/3_1(1', 'M', 'Oxidation)_21.6eV']
['Name: EMEENFALEAANYQDTIGR/2_0_46eV']
['Name: 

['Name: ETIPLQESTLYTEDR/2_0_56eV']
['Name: ETLAQTVLAEVPTQMVSYFR/2_0_47.6eV']
['Name: ETLAQTVLAEVPTQMVSYFR/2_0_53.5eV']
['Name: ETLAQTVLAEVPTQMVSYFR/3_0_33.7eV']
['Name: ETLDQQK/2_0_21eV']
['Name: ETLEEGTSEENK/2_0_33.2eV']
['Name: ETLEEVFEK/2_0_26.3eV']
['Name: ETLEEVFEK/2_0_27.4eV']
['Name: ETLEEVFEK/2_0_35eV']
['Name: ETLEQAK/2_0_19.9eV']
['Name: ETLKNEALSTQLR/2_0_36.5eV']
['Name: ETLLDAIR/2_0_19eV']
['Name: ETLLDAIR/2_0_21.8eV']
['Name: ETLLEMFK/2_0_23.7eV']
['Name: ETLQNVWIHLDGPGVMRPK/3_0_32.3eV']
['Name: ETLQNVWIHLDGPGVMRPK/3_0_33.6eV']
['Name: ETLQNVWIHLDGPGVMRPK/4_0_22.8eV']
['Name: ETMCSSMNPIMAQCFDK/2_2(3', 'C', 'CAM)(13', 'C', 'CAM)_48eV']
['Name: ETNLESLPLVDTH/2_0_34.4eV']
['Name: ETNLESLPLVDTHSK/2_0_35eV']
['Name: ETNLESLPLVDTHSK/2_0_40.9eV']
['Name: ETNLESLPLVDTHSK/2_0_53eV']
['Name: ETNLESLPLVDTHSK/2_1(0', 'E', 'Pyro_glu)_39eV']
['Name: ETNLESLPLVDTHSK/2_1(0', 'E', 'Pyro_glu)_40.5eV']
['Name: ETNLESLPLVDTHSK/3_0_18eV']
['Name: ETNLESLPLVDTHSK/3_0_24.8eV']
['Name: ETNLESLPLV

['Name: FEDTSPADER/2_0_28.4eV']
['Name: FEECCQENTPMNIFMCTY/2_3(3', 'C', 'CAM)(4', 'C', 'CAM)(15', 'C', 'CAM)_55.6eV']
['Name: FEHSLGVGYLAGCLVR/2_1(12', 'C', 'CAM)_43.3eV']
['Name: FEHSLGVGYLAGCLVR/3_1(12', 'C', 'CAM)_26.2eV']
['Name: FEHSLGVGYLAGCLVR/3_1(12', 'C', 'CAM)_27.2eV']
['Name: FEISDSNR/2_0_22.7eV']
['Name: FELTCYSLAPQIK/2_1(4', 'C', 'CAM)_33eV']
['Name: FELTCYSLAPQIK/2_1(4', 'C', 'CAM)_36.8eV']
['Name: FENAFLSHVISQHQSLLGNIR/3_0_35.6eV']
['Name: FENAFLSHVISQHQSLLGNIR/3_0_36.9eV']
['Name: FENAFLSHVISQHQSLLGNIR/3_0_47eV']
['Name: FENAFLSHVISQHQSLLGNIR/4_0_25.1eV']
['Name: FENAFLSHVISQHQSLLGNIR/4_0_33eV']
['Name: FEPFSNK/2_0_21.1eV']
['Name: FEPGQSHAGVVQYSHNQMQEHVDMR/3_0_42.9eV']
['Name: FEPGQSHAGVVQYSHNQMQEHVDMR/4_0_30.3eV']
['Name: FEPGQSHAGVVQYSHNQMQEHVDMR/5_0_22.7eV']
['Name: FEPGQSHAGVVQYSHNQMQEHVDMR/5_0_24.3eV']
['Name: FESFCLDPSLVTK/2_1(4', 'C', 'CAM)_37.5eV']
['Name: FESLAEEK/2_0_22.3eV']
['Name: FESLPAGSTLIFYK/2_0_36.8eV']
['Name: FEVSVNVAPGSK/2_0_38eV']
['Name: FEWDLPLD

['Name: FSPGAPSGPGPQPNQK/2_0_49eV']
['Name: FSPLTANLMNLLAENGR/2_0_39eV']
['Name: FSPLTANLMNLLAENGR/2_0_43.6eV']
['Name: FSPLTANLMNLLAENGR/2_0_45.3eV']
['Name: FSQGCAPGYEK/2_1(4', 'C', 'CAM)_29.1eV']
['Name: FSQMLHPIFEEASDVIKEEYPDK/4_0_29.8eV']
['Name: FSQVTPTSFTAQWIAPSVQLTGYR/2_0_62.9eV']
['Name: FSQVTPTSFTAQWIAPSVQLTGYR/2_0_65.3eV']
['Name: FSQVTPTSFTAQWIAPSVQLTGYR/3_0_30eV']
['Name: FSQVTPTSFTAQWIAPSVQLTGYR/3_0_37.3eV']
['Name: FSQVTPTSFTAQWIAPSVQLTGYR/3_0_39.6eV']
['Name: FSSEELDKLWR/3_0_21.6eV']
['Name: FSSSTFEQVNQLVK/2_0_34eV']
['Name: FSSSTFEQVNQLVK/2_0_37.8eV']
['Name: FSSSTFEQVNQLVK/2_0_39.3eV']
['Name: FSSSTFEQVNQLVK/2_0_50eV']
['Name: FSTSQSLPASQTR/2_0_29eV']
['Name: FSTSQSLPASQTR/2_0_34.3eV']
['Name: FSTSQSLPASQTR/2_0_44eV']
['Name: FSTVDLR/2_0_20.4eV']
['Name: FSTVDLR/2_0_26eV']
['Name: FSVEEIIQK/2_0_26.6eV']
['Name: FSVGDAK/2_0_22eV']
['Name: FSVLQYVVPEVK/2_0_33eV']
['Name: FSVLQYVVPEVK/2_0_34.3eV']
['Name: FSVLVPLLAR/2_0_26.1eV']
['Name: FSWGAEGQKPGFGYGGR/3_0_27.6eV']
['N

['Name: GEFLSEGGGVR/2_0_25.9eV']
['Name: GEFQILLDALDK/2_0_28eV']
['Name: GEFQILLDALDK/2_0_31.9eV']
['Name: GEFQILLDALDK/2_0_33.1eV']
['Name: GEFQILLDALDK/2_0_42eV']
['Name: GEFVWR/2_0_16.5eV']
['Name: GEFVWR/2_0_18.6eV']
['Name: GEFVWR/2_0_25eV']
['Name: GEGAEVDVNLQK/2_0_30.6eV']
['Name: GEGPEVDVSLPK/2_0_25eV']
['Name: GEGPEVDVSLPK/2_0_28.7eV']
['Name: GEGPEVDVSLPNADLDVSGPK/2_0_49.1eV']
['Name: GEGQLSAAER/2_0_23.8eV']
['Name: GEGQLSAAER/2_0_24.7eV']
['Name: GEKEEEDKEDEEKPK/4_0_19.7eV']
['Name: GELDPVLEDNSVETR/2_0_39.2eV']
['Name: GELGEIGLDGLDGEEGDK/2_0_43.8eV']
['Name: GELGPVGNPGPAGPAGPR/2_0_37.5eV']
['Name: GELGQFYR/2_0_22.7eV']
['Name: GELGQFYR/2_0_30eV']
['Name: GELSGHFEDLLLAIVHCAR/3_1(16', 'C', 'CAM)_31.5eV']
['Name: GELVPLDTVLDMLR/2_0_32.7eV']
['Name: GELVPLDTVLDMLR/2_0_36.8eV']
['Name: GELVPLDTVLDMLR/2_1(11', 'M', 'Oxidation)_37.2eV']
['Name: GELVPLDTVLDMLR/3_0_23.2eV']
['Name: GEMEADIRAGR/1_0_50.2eV']
['Name: GEMMDLQHGSLFLK/2_0_37.6eV']
['Name: GEMMDLQHGSLFLK/2_0_39.1eV']
['Name

['Name: GLENNVNVELLNALHSHMVNK/4_0_16eV']
['Name: GLENNVNVELLNALHSHMVNK/4_0_24.4eV']
['Name: GLENNVNVELLNALHSHMVNK/4_1(17', 'M', 'Oxidation)_16eV']
['Name: GLENNVNVELLNALHSHMVNK/4_1(17', 'M', 'Oxidation)_24.6eV']
['Name: GLENNVNVELLNALHSHMVNKR/4_0_27.1eV']
['Name: GLENNVNVELLNALHSHMVNKR/5_0_19.5eV']
['Name: GLETIASDVVSLASK/2_0_34.9eV']
['Name: GLETIASDVVSLASK/2_0_36.3eV']
['Name: GLETQAK/2_0_17.5eV']
['Name: GLEVDVK/2_0_16eV']
['Name: GLEVDVK/2_0_18.5eV']
['Name: GLFDVHSVLR/3_0_16.9eV']
['Name: GLFIIDAK/2_0_20.5eV']
['Name: GLFIIDAK/2_0_27eV']
['Name: GLFIIDPNGVVK/2_0_29.8eV']
['Name: GLFIIDPNGVVK/2_0_30.9eV']
['Name: GLFPFHHQQIGYVYR/3_0_28.5eV']
['Name: GLFQVLAGGTVLQLR/2_0_36.8eV']
['Name: GLFQVLAGGTVLQLR/3_0_23.2eV']
['Name: GLGEHEMDEDEEDYESSAK/3_0_33.2eV']
['Name: GLGEISAATEFK/2_0_29.7eV']
['Name: GLGHEADESADVGTVETTMCK/3_1(19', 'C', 'CAM)_33.8eV']
['Name: GLGTDEDAIIGILAYR/2_0_34.9eV']
['Name: GLGTDEDAIIGILAYR/2_0_39.3eV']
['Name: GLGTDEDAIIGILAYR/2_0_52eV']
['Name: GLGTDEDSILNLLTSR/2

['Name: GSPGADGPAGSPGTPGPQGIAGQR/3_0_30.8eV']
['Name: GSPQNLVTK/2_0_22.1eV']
['Name: GSSWHETCFTCQR/2_2(7', 'C', 'CAM)(10', 'C', 'CAM)_40.3eV']
['Name: GSTAPVGGGSFPTITPR/2_0_33eV']
['Name: GSTAPVGGGSFPTITPR/2_0_37.5eV']
['Name: GSTAPVGGGSFPTITPR/2_0_50eV']
['Name: GSTASDPQGDLLFLLDSSASVSHYEFSR/3_0_44eV']
['Name: GSTASQVLQR/2_0_24.5eV']
['Name: GSTDDKGPVAGWMNALEAYQK/3_0_34.3eV']
['Name: GSTFRPHDSFPK/3_0_27eV']
['Name: GSTGPAGIR/2_0_17eV']
['Name: GSTVFEELPNK/2_0_28.6eV']
['Name: GSVFSAPSASGTPNK/2_0_34.2eV']
['Name: GSVFSAPSASGTPNKETAGLK/3_0_30.7eV']
['Name: GSVHDFPEFDANQDAEALYTAMK/3_0_37.7eV']
['Name: GSVHDFPEFDANQDAEALYTAMK/3_0_39.2eV']
['Name: GSVLRYGSR/2_0_23.3eV']
['Name: GSVVMGPWVEGSVVTR/2_0_40.4eV']
['Name: GSWACCQLPHAVCCEDR/3_4(4', 'C', 'CAM)(5', 'C', 'CAM)(12', 'C', 'CAM)(13', 'C', 'CAM)_32.2eV']
['Name: GSYNIK/2_0_16eV']
['Name: GSYTYFAPSNEAWENLDSDIR/2_0_59.2eV']
['Name: GSYTYFAPSNEAWENLDSDIRR/3_0_38.2eV']
['Name: GSYTYFAPSNEAWENLDSDIRR/3_0_39.7eV']
['Name: GTAGVVPVVPGEVEVVK/2_0_

['Name: HPDLSGFFDNHFGLISPNFK/4_0_24.8eV']
['Name: HPDLSTPELLR/2_0_26eV']
['Name: HPDYSVSLLLR/2_0_27eV']
['Name: HPDYSVSLLLR/2_0_30.4eV']
['Name: HPDYSVSLLLR/2_0_40eV']
['Name: HPDYSVSLLLR/3_0_14eV']
['Name: HPDYSVSLLLR/3_0_19.2eV']
['Name: HPDYSVSLLLR/3_0_25eV']
['Name: HPECYVCTDCGINLK/2_3(3', 'C', 'CAM)(6', 'C', 'CAM)(9', 'C', 'CAM)_45.4eV']
['Name: HPMDTEITK/2_0_25.1eV']
['Name: HPQLEAVLMGTR/3_0_26eV']
['Name: HPTDLDASK/2_0_23.9eV']
['Name: HPTPLALGQFHTVTLLR/3_0_28eV']
['Name: HPYFYAPELLYYAEQYNEILTQCCAEADK/3_2(22', 'C', 'CAM)(23', 'C', 'CAM)_53.1eV']
['Name: HQEMREDIK/2_1(3', 'M', 'Oxidation)_28.1eV']
['Name: HQGQNLLTMTTAPR/2_0_38.1eV']
['Name: HQGQNLLTMTTAPR/3_0_24eV']
['Name: HQLKDVEK/2_0_24.2eV']
['Name: HQSEQGNQGQESDSEAEGEDK/3_0_35.1eV']
['Name: HQSLGGQYGVQGFPTIK/2_0_42.6eV']
['Name: HQSLGGQYGVQGFPTIK/3_0_26.8eV']
['Name: HQSQCK/2_1(4', 'C', 'CAM)_18.4eV']
['Name: HQSVFTVTR/2_0_25.2eV']
['Name: HQTVLDNTEGK/2_0_29.1eV']
['Name: HQTVLDNTEGK/2_0_30.2eV']
['Name: HQTVLDNTEGK/3_0_19eV

['Name: IFSENICGLSDSPGVSK/2_1(6', 'C', 'CAM)_44eV']
['Name: IFSFDGR/2_0_20.5eV']
['Name: IFSNIR/2_0_17.6eV']
['Name: IFSQHLQNK/2_0_27.1eV']
['Name: IFSSEHDIFR/2_0_30.4eV']
['Name: IFSSEHDIFR/3_0_18.5eV']
['Name: IFTPLLHK/3_0_10eV']
['Name: IFTPLLHK/3_0_14.3eV']
['Name: IFTVDNNLLPVGK/2_0_29.8eV']
['Name: IFTVDNNLLPVGK/2_0_33.5eV']
['Name: IFTVDNNLLPVGK/2_0_45eV']
['Name: IFVGAR/2_0_15.5eV']
['Name: IFVGSSQVPVVFENTDLHSYVVMNHGR/3_0_46.4eV']
['Name: IFVGSSQVPVVFENTDLHSYVVMNHGR/4_0_32.8eV']
['Name: IFVPNK/2_0_17.5eV']
['Name: IFVPNKGSR/3_0_15.6eV']
['Name: IFYLQNK/2_0_22.5eV']
['Name: IFYTTTPVKK/2_0_29.2eV']
['Name: IGADFLGR/2_0_19.9eV']
['Name: IGADGTQVAMVQFTDDPR/2_0_45eV']
['Name: IGADGTQVAMVQFTDDPR/2_0_46.7eV']
['Name: IGCDQHTSCPVGQTCCPSLK/3_4(2', 'C', 'CAM)(8', 'C', 'CAM)(14', 'C', 'CAM)(15', 'C', 'CAM)_35.3eV']
['Name: IGCVHAISTDSPDLEPVLK/3_1(2', 'C', 'CAM)_31.4eV']
['Name: IGDLQAFVGR/2_0_26.2eV']
['Name: IGDLQSQIVSLLK/2_0_29eV']
['Name: IGDLQSQIVSLLK/2_0_33.1eV']
['Name: IGDLQSQIVSLLK

['Name: INLPIQTFSALNFR/2_0_39.7eV']
['Name: INLPIQTFSALNFR/3_0_24.1eV']
['Name: INLYMSSPCHIEMILTEK/2_1(8', 'C', 'CAM)_53eV']
['Name: INMCGLTTK/2_1(3', 'C', 'CAM)_25.2eV']
['Name: INMELK/2_0_18.2eV']
['Name: INNIQCPMEAVVFQTK/2_1(5', 'C', 'CAM)_39eV']
['Name: INPASMFDVHVK/3_0_20.8eV']
['Name: INQTYQQQYGR/2_0_34eV']
['Name: INQTYQQQYGR/2_0_44eV']
['Name: INSCFSANTVEQIIENLR/2_1(3', 'C', 'CAM)_49.4eV']
['Name: INSVEVYDGTLYR/2_0_37.2eV']
['Name: IPAGTTLTLDMLTVK/2_0_36.9eV']
['Name: IPAHQVLYSTSGGNASGK/2_0_43.4eV']
['Name: IPAHQVLYSTSGGNASGK/3_0_27.4eV']
['Name: IPASMTAEELTLEILDRR/3_0_30.4eV']
['Name: IPCFLAGDMR/2_2(2', 'C', 'CAM)(8', 'M', 'Oxidation)_28eV']
['Name: IPCSQPPTIEHGSINLPR/3_1(2', 'C', 'CAM)_29.7eV']
['Name: IPCSQPPTIEHGSINLPR/3_1(2', 'C', 'CAM)_30.9eV']
['Name: IPDEEDQEDPYLNDR/2_0_44.9eV']
['Name: IPDGIVPK/2_0_19.6eV']
['Name: IPDLLPK/2_0_19.4eV']
['Name: IPDWFLNR/2_0_24.8eV']
['Name: IPDWFLNR/2_0_33eV']
['Name: IPEALAGPPNDFGLFLSDDDPKK/3_0_36.2eV']
['Name: IPEGSAVPATDAAPK/2_0_34.6

['Name: IVNHLEK/3_0_13.1eV']
['Name: IVPNILLEQGK/2_0_29.8eV']
['Name: IVPNILLEQGK/2_0_38eV']
['Name: IVQEIPQLLDAAASPLIAEQEVLEALPK/2_0_72.9eV']
['Name: IVQEIPQLLDAAASPLIAEQEVLEALPK/3_0_44.2eV']
['Name: IVQIGGSNAQR/2_0_27.8eV']
['Name: IVQIGGSNAQR/2_0_36eV']
['Name: IVQSVIQTAVDQFAR/2_0_40.7eV']
['Name: IVRGDQPGASGDNDDDEPPPLPR/3_0_47eV']
['Name: IVSFSK/2_0_15.9eV']
['Name: IVSGAAETDQEYYFGQVVR/2_0_44eV']
['Name: IVSGAAETDQEYYFGQVVR/2_0_49.9eV']
['Name: IVSGAAETDQEYYFGQVVR/2_0_51.8eV']
['Name: IVSGAAETDQEYYFGQVVR/3_0_23eV']
['Name: IVSNASCTTNCLAPLAK/2_2(6', 'C', 'CAM)(10', 'C', 'CAM)_38eV']
['Name: IVSNASCTTNCLAPLAK/2_2(6', 'C', 'CAM)(10', 'C', 'CAM)_42.6eV']
['Name: IVSNASCTTNCLAPLAK/3_2(6', 'C', 'CAM)(10', 'C', 'CAM)_26.9eV']
['Name: IVSNASCTTNCLAPLAK/3_2(6', 'C', 'CAM)(10', 'C', 'CAM)_27.9eV']
['Name: IVSPSGAAVPCK/2_1(10', 'C', 'CAM)_24eV']
['Name: IVSPSGAAVPCKVEPGLGADNSVVR/3_1(10', 'C', 'CAM)_38eV']
['Name: IVSSNDVGHDEYSTQSLVK/3_0_41eV']
['Name: IVWHVIR/3_0_13.6eV']
['Name: IVYGHLDDPANQ

['Name: KLDQDTVFALANYILFK/2_0_63eV']
['Name: KLDQDTVFALANYILFK/3_0_22eV']
['Name: KLDQDTVFALANYILFK/3_0_29.5eV']
['Name: KLDQDTVFALANYILFK/3_0_30.6eV']
['Name: KLDQDTVFALANYILFK/3_0_39eV']
['Name: KLDTYQEYK/2_0_25eV']
['Name: KLEAAATALATK/2_0_25eV']
['Name: KLEAAATALATK/2_0_28.9eV']
['Name: KLEAAATALATK/2_0_37eV']
['Name: KLEDDILVMDDQNSK/2_0_42.9eV']
['Name: KLEDDILVMDDQNSK/3_0_27eV']
['Name: KLEEDQIIMEDQNCK/2_1(13', 'C', 'CAM)_44.3eV']
['Name: KLEEDQIIMEDQNCK/2_1(13', 'C', 'CAM)_46eV']
['Name: KLEEDQIIMEDQNCK/2_2(8', 'M', 'Oxidation)(13', 'C', 'CAM)_46.4eV']
['Name: KLEEDQIIMEDQNCK/3_1(13', 'C', 'CAM)_29eV']
['Name: KLEEDQIIMEDQNCK/3_2(8', 'M', 'Oxidation)(13', 'C', 'CAM)_29.2eV']
['Name: KLEEEEDSETK/2_0_32.5eV']
['Name: KLEEGMDNYK/2_0_38eV']
['Name: KLEIIQSQK/2_0_25.5eV']
['Name: KLEIIQSQK/2_0_34eV']
['Name: KLENCNYAVDLGK/2_1(4', 'C', 'CAM)_32eV']
['Name: KLENCNYAVDLGK/2_1(4', 'C', 'CAM)_35.7eV']
['Name: KLENCNYAVDLGK/2_1(4', 'C', 'CAM)_37.1eV']
['Name: KLENCNYAVDLGK/3_1(4', 'C', 'CA

['Name: LAMQEFMILPVGASSFR/2_0_59eV']
['Name: LAMQEFMILPVGASSFR/2_1(2', 'M', 'Oxidation)_46.5eV']
['Name: LAMQEFMILPVGASSFR/2_1(6', 'M', 'Oxidation)_40eV']
['Name: LAMQEFMILPVGASSFR/2_1(6', 'M', 'Oxidation)_46.5eV']
['Name: LAMQEFMILPVGASSFR/2_2(2', 'M', 'Oxidation)(6', 'M', 'Oxidation)_46.9eV']
['Name: LAMQEFMILPVGASSFR/3_0_21eV']
['Name: LAMQEFMILPVGASSFR/3_0_28eV']
['Name: LANIGR/2_0_15.1eV']
['Name: LANMEAEIR/2_0_24.5eV']
['Name: LAPALSVK/2_0_18.7eV']
['Name: LAPITDLVR/2_0_24.3eV']
['Name: LAPITSDPTEAAAVGAVEASFK/2_0_45eV']
['Name: LAPITSDPTEAAAVGAVEASFK/2_0_50.2eV']
['Name: LAPITSDPTEAAAVGAVEASFK/2_0_52.2eV']
['Name: LAPITSDPTEAAAVGAVEASFK/2_0_67eV']
['Name: LAPITSDPTEAAAVGAVEASFK/3_0_23eV']
['Name: LAPITSDPTEAAAVGAVEASFK/3_0_29.8eV']
['Name: LAPITSDPTEAAAVGAVEASFK/3_0_32.9eV']
['Name: LAPITSDPTEAAAVGAVEASFK/3_0_42eV']
['Name: LAPPLVIK/2_0_19.9eV']
['Name: LAQAPNHVVVSR/3_0_14eV']
['Name: LAQAPNHVVVSR/3_0_19eV']
['Name: LAQAWFNSHR/2_0_29.9eV']
['Name: LAQAWFNSHR/3_0_18.8eV']
['Name: 

['Name: LFQEEFPGIPYPPDAAVECHR/3_1(18', 'C', 'CAM)_36.5eV']
['Name: LFQFFK/2_0_19.4eV']
['Name: LFQLTVER/2_0_23.6eV']
['Name: LFQQIYSDGSDEVK/2_0_51eV']
['Name: LFQTALQEEIK/2_0_32.1eV']
['Name: LFQTMVK/2_0_21.1eV']
['Name: LFQTMVK/2_1(4', 'M', 'Oxidation)_21.5eV']
['Name: LFSFVPSWNLTCK/2_1(11', 'C', 'CAM)_37.4eV']
['Name: LFSGATMASSR/2_0_23eV']
['Name: LFSGATMASSR/2_0_35eV']
['Name: LFSGLAHVK/2_0_23.6eV']
['Name: LFSKPEGK/3_0_13.9eV']
['Name: LFSSHDYEAPQEITLGANK/2_0_49.7eV']
['Name: LFSSHDYEAPQEITLGANK/3_0_32.5eV']
['Name: LFSSQSGK/2_0_20.8eV']
['Name: LFSTSADLSELSAMAR/2_0_39.8eV']
['Name: LFSVVSR/2_0_19.7eV']
['Name: LFVDPSQGLEVTGK/2_0_31eV']
['Name: LFVDPSQGLEVTGK/2_0_34.9eV']
['Name: LFVDPSQGLEVTGK/2_0_36.2eV']
['Name: LFVDPSQGLEVTGK/2_0_46eV']
['Name: LFVSEGSPGSLPVLAAAAR/2_0_43.2eV']
['Name: LFVTNHEK/2_0_24eV']
['Name: LFVVPTDESQAR/2_0_33.1eV']
['Name: LFYAPTSGGPEELVPIPGNTNYAILR/2_0_65.3eV']
['Name: LFYAPTSGGPEELVPIPGNTNYAILR/2_0_67.8eV']
['Name: LFYAPTSGGPEELVPIPGNTNYAILR/2_0_87eV']

['Name: LLPGAPQQPPK/2_0_36eV']
['Name: LLPLYVLALLK/2_0_30.6eV']
['Name: LLPPTQNNR/2_0_25.6eV']
['Name: LLPPTQNNR/2_0_33eV']
['Name: LLPQTWFQGGAPCLR/2_1(12', 'C', 'CAM)_42.4eV']
['Name: LLPQVMYLDGYDR/2_0_37.1eV']
['Name: LLPQVMYLDGYDRDNK/3_0_29.7eV']
['Name: LLPSFVSSENAFYLPPDLR/2_0_50.7eV']
['Name: LLPSFVSSENAFYLPPDLRK/3_0_33.8eV']
['Name: LLQAGEKYGR/2_0_26.6eV']
['Name: LLQDLHEAQAER/3_0_28eV']
['Name: LLQHLYK/2_0_22.2eV']
['Name: LLQNIQENIR/2_0_30.2eV']
['Name: LLQVTPTDSGEYVCR/2_1(13', 'C', 'CAM)_36eV']
['Name: LLQVTPTDSGEYVCR/2_1(13', 'C', 'CAM)_40.7eV']
['Name: LLQVTPTDSGEYVCR/2_1(13', 'C', 'CAM)_42.3eV']
['Name: LLQYADALEHLLSTGQGVVLER/3_0_35.8eV']
['Name: LLSAAAHQR/2_0_23.5eV']
['Name: LLSATPPSSQNPATNIPLQFR/2_0_52.7eV']
['Name: LLSATPPSSQNPATNIPLQFR/3_0_33.2eV']
['Name: LLSDEDVGLMVK/2_0_30.9eV']
['Name: LLSGFGVDR/2_0_23.4eV']
['Name: LLSHCLLVTLASHHPA/3_1(4', 'C', 'CAM)_26.1eV']
['Name: LLSHCLLVTLASHHPA/4_1(4', 'C', 'CAM)_12eV']
['Name: LLSHCLLVTLASHHPAD/3_1(4', 'C', 'CAM)_27.8eV']
[

['Name: LPSLAFLYMEK/2_0_27.3eV']
['Name: LPSLAFLYMEK/2_0_30.7eV']
['Name: LPSLAFLYMEK/2_0_41eV']
['Name: LPSLAFLYMEK/2_1(8', 'M', 'Oxidation)_32.3eV']
['Name: LPSQPQDVQTLWSTEDMTR/2_0_52.3eV']
['Name: LPSVFKK/3_0_12.6eV']
['Name: LPTGYYFGASAGTGDLSDNHDIISIK/3_0_41.5eV']
['Name: LPVALDPGSK/2_0_24.2eV']
['Name: LPVDVAYQR/2_0_25.8eV']
['Name: LPVEEAVGIPNIPVHPIGYNDAER/3_0_39.8eV']
['Name: LPVNFFK/2_0_21eV']
['Name: LPVYEGNFIVK/2_0_31.1eV']
['Name: LPYDVTPEQALSHEEVK/3_0_28.8eV']
['Name: LPYDVTPEQALSHEEVK/3_0_38eV']
['Name: LPYQCPR/2_1(4', 'C', 'CAM)_22.7eV']
['Name: LPYTTPGPPSTPWVSNVTR/2_0_48.5eV']
['Name: LQAAQLPDK/2_0_23.9eV']
['Name: LQAEIFQAR/2_0_26.2eV']
['Name: LQAEIFQAR/2_0_33eV']
['Name: LQALSPELLAPVPR/2_0_31eV']
['Name: LQALSPELLAPVPR/2_0_35.2eV']
['Name: LQALSPELLAPVPR/2_0_36.6eV']
['Name: LQALSPELLAPVPR/2_0_47eV']
['Name: LQALSPELLAPVPR/3_0_16eV']
['Name: LQALSPELLAPVPR/3_0_22.2eV']
['Name: LQCYECYGVPIETSCPAVTCR/2_4(2', 'C', 'CAM)(5', 'C', 'CAM)(14', 'C', 'CAM)(19', 'C', 'CAM)_60eV

['Name: LSSGDDFVLLSLEVPLEDVR/2_0_51.6eV']
['Name: LSSGQISGPEIK/2_0_38eV']
['Name: LSSGYYDFSVR/2_0_31.5eV']
['Name: LSSPATLNSR/2_0_25.4eV']
['Name: LSSPATLNSR/2_0_32eV']
['Name: LSSSDEEDFLYVDIK/2_0_41.2eV']
['Name: LSSSDEEDFLYVDIK/2_0_42.8eV']
['Name: LSTDGSPTR/2_0_22.7eV']
['Name: LSTEPSPEFSNYSEIAK/2_0_46.2eV']
['Name: LSTHLR/2_0_17.7eV']
['Name: LSTHLRK/2_0_20.8eV']
['Name: LSTLPSDFCGLTHLVK/2_1(8', 'C', 'CAM)_43.5eV']
['Name: LSTLPSDFCGLTHLVK/3_1(8', 'C', 'CAM)_19eV']
['Name: LSTLPSDFCGLTHLVK/3_1(8', 'C', 'CAM)_26.4eV']
['Name: LSTLPSDFCGLTHLVK/3_1(8', 'C', 'CAM)_27.4eV']
['Name: LSTPSPLDGTFCVDSIAALK/2_1(11', 'C', 'CAM)_49eV']
['Name: LSVDLRPLK/3_0_15.4eV']
['Name: LSVLCQDNYLTQDSEEMVCK/2_2(4', 'C', 'CAM)(18', 'C', 'CAM)_57eV']
['Name: LSVLCQDNYLTQDSEEMVCK/2_2(4', 'C', 'CAM)(18', 'C', 'CAM)_59.1eV']
['Name: LSVLCQDNYLTQDSEEMVCK/2_3(4', 'C', 'CAM)(16', 'M', 'Oxidation)(18', 'C', 'CAM)_57.3eV']
['Name: LSVLCQDNYLTQDSEEMVCK/3_2(4', 'C', 'CAM)(18', 'C', 'CAM)_35.9eV']
['Name: LSVLCQDNYLTQD

['Name: LYLGHNYVTAIR/2_0_29.6eV']
['Name: LYLGHNYVTAIR/2_0_33.2eV']
['Name: LYLGHNYVTAIR/2_0_44eV']
['Name: LYLGHNYVTAIR/3_0_15eV']
['Name: LYLGHNYVTAIR/3_0_20.9eV']
['Name: LYLGHNYVTAIR/3_0_21.8eV']
['Name: LYLGHNYVTAIR/3_0_28eV']
['Name: LYNLGFGNNLNYNFLETLALENHGLAR/3_0_45.4eV']
['Name: LYNNQISNTHTFCAGPAYLK/3_1(12', 'C', 'CAM)_35.4eV']
['Name: LYPIANGNNQSPIDIK/2_0_41.1eV']
['Name: LYPIANGNNQSPIDIK/2_0_42.7eV']
['Name: LYQGNLAVTTLGSPCLPWNSLPAK/2_1(14', 'C', 'CAM)_60.9eV']
['Name: LYQGNLAVTTLGSPCLPWNSLPAK/2_1(14', 'C', 'CAM)_63.2eV']
['Name: LYQGNLAVTTLGSPCLPWNSLPAK/3_1(14', 'C', 'CAM)_29eV']
['Name: LYQGNLAVTTLGSPCLPWNSLPAK/3_1(14', 'C', 'CAM)_38.4eV']
['Name: LYQGNLAVTTLGSPCLPWNSLPAK/3_1(14', 'C', 'CAM)_39.8eV']
['Name: LYQQVLTQAELR/2_0_35.5eV']
['Name: LYQSCADPTGCGTGSDAR/2_2(4', 'C', 'CAM)(10', 'C', 'CAM)_46.6eV']
['Name: LYQSMNSQYLK/2_0_43eV']
['Name: LYQVQYEMPLCDQDSTSK/2_1(10', 'C', 'CAM)_51.6eV']
['Name: LYSEFLGK/2_0_30eV']
['Name: LYSIHKPCK/2_1(7', 'C', 'CAM)_27.9eV']
['Name: LYS

['Name: MQQVEASLQPETLR/2_0_39.6eV']
['Name: MQQVEASLQPETLR/2_0_51eV']
['Name: MQQVEASLQPETLR/2_1(0', 'M', 'Oxidation)_34eV']
['Name: MQQVEASLQPETLR/2_1(0', 'M', 'Oxidation)_40eV']
['Name: MQQVEASLQPETLR/2_1(0', 'M', 'Oxidation)_51eV']
['Name: MQQVEASLQPETLR/3_0_18eV']
['Name: MQQVEASLQPETLR/3_0_22.7eV']
['Name: MQQVEASLQPETLR/3_0_25eV']
['Name: MQQVEASLQPETLR/3_0_32eV']
['Name: MQQVEASLQPETLR/3_1(0', 'M', 'Oxidation)_18eV']
['Name: MQQVEASLQPETLRK/2_0_37eV']
['Name: MQQVEASLQPETLRK/2_0_41.2eV']
['Name: MQQVEASLQPETLRK/2_0_42.7eV']
['Name: MQQVEASLQPETLRK/2_0_55eV']
['Name: MQQVEASLQPETLRK/2_1(0', 'M', 'Oxidation)_41.6eV']
['Name: MQQVEASLQPETLRK/2_1(0', 'M', 'Oxidation)_43.1eV']
['Name: MQQVEASLQPETLRK/2_1(0', 'M', 'Oxidation)_55eV']
['Name: MQQVEASLQPETLRK/3_0_19eV']
['Name: MQQVEASLQPETLRK/3_0_25.9eV']
['Name: MQQVEASLQPETLRK/3_1(0', 'M', 'Oxidation)_19eV']
['Name: MQQVEASLQPETLRK/3_1(0', 'M', 'Oxidation)_27.2eV']
['Name: MQQVEASLQPETLRK/3_1(0', 'M', 'Oxidation)_35eV']
['Name: MQTYEV

['Name: NKESVVFVQTDKPVYKPGQSVK/3_0_37.9eV']
['Name: NKESVVFVQTDKPVYKPGQSVK/4_0_26.8eV']
['Name: NKLDHYAIIK/2_0_29.5eV']
['Name: NKLDHYAIIK/3_0_17.9eV']
['Name: NKLDHYAIIK/3_0_18.6eV']
['Name: NKLDHYAIIKFPLTTESAMK/3_0_35.5eV']
['Name: NKLDHYAIIKFPLTTESAMK/3_1(18', 'M', 'Oxidation)_35.8eV']
['Name: NKLDHYAIIKFPLTTESAMK/4_0_25.1eV']
['Name: NKLDQVVIHVGALNVK/2_0_42.5eV']
['Name: NKPCITYGLR/2_1(3', 'C', 'CAM)_28.6eV']
['Name: NKPGVYTDVANYLAWIQK/3_0_31.9eV']
['Name: NKRDEEEEEEKLEEK/3_0_29.6eV']
['Name: NKRDEEEEEEKLEEK/4_0_20.9eV']
['Name: NKRDEEEEEEKLEEK/4_0_27eV']
['Name: NLADLCR/2_1(5', 'C', 'CAM)_20.2eV']
['Name: NLAPLVEDVQSK/2_0_27eV']
['Name: NLAPLVEDVQSK/2_0_30.7eV']
['Name: NLAPTWEELSK/2_0_26.8eV']
['Name: NLAPTWEELSK/2_0_30.2eV']
['Name: NLAPTWEELSKK/2_0_34.5eV']
['Name: NLEENYCR/2_1(6', 'C', 'CAM)_25.7eV']
['Name: NLEENYCRNPDGETAPWCYTTDSQLR/3_2(6', 'C', 'CAM)(17', 'C', 'CAM)_48.8eV']
['Name: NLELHYQAFLR/2_0_34.1eV']
['Name: NLELHYQAFLR/3_0_20.7eV']
['Name: NLEPEWAAAATEVK/2_0_35.8eV'

['Name: NSYQDAEDK/2_0_25eV']
['Name: NSYQDAEDKKK/3_0_20.3eV']
['Name: NSYQDAEDKKKEEKEEEEQEK/4_0_28.3eV']
['Name: NSYQDAEDKKKEEKEEEEQEKLK/4_0_30.9eV']
['Name: NSYQDAEDKKKEEKEEEEQEKLK/5_0_23.2eV']
['Name: NSYVAGQYDDAASYK/2_0_40.2eV']
['Name: NSYVAGQYDDAASYK/2_0_52eV']
['Name: NTAMLVCSTPQFPHGVMDPVPEVAK/3_1(6', 'C', 'CAM)_40.2eV']
['Name: NTDGVNFYNILTK/2_0_35.1eV']
['Name: NTDGVNFYNILTK/2_0_36.4eV']
['Name: NTDIDLDK/2_0_21.9eV']
['Name: NTDPGYNCLPCPPR/2_2(7', 'C', 'CAM)(10', 'C', 'CAM)_38.9eV']
['Name: NTDPGYNCLPCPPR/2_2(7', 'C', 'CAM)(10', 'C', 'CAM)_40.4eV']
['Name: NTEEEGPK/2_0_22eV']
['Name: NTELCETPATSDTK/2_1(4', 'C', 'CAM)_38.1eV']
['Name: NTETNVFPQDK/2_0_40eV']
['Name: NTEYTLFESISGK/2_0_36.2eV']
['Name: NTFAEITGLSPGVTYLFK/2_0_40.8eV']
['Name: NTFAEITGLSPGVTYLFK/2_0_45.9eV']
['Name: NTFAEITGLSPGVTYLFK/2_0_61eV']
['Name: NTFAEITGLSPGVTYLFK/3_0_21eV']
['Name: NTFAEITGLSPGVTYLFK/3_0_38eV']
['Name: NTFTESAGSR/2_0_26eV']
['Name: NTIEETGILAER/2_0_32.7eV']
['Name: NTILSAIHNSTK/2_0_31.6eV']


['Name: QADGPDMQSLFTQYFQSMTDYGK/2_2(0', 'Q', 'Pyro-glu)(17', 'M', 'Oxidation)_64.6eV']
['Name: QADGPDMQSLFTQYFQSMTDYGK/2_2(0', 'Q', 'Pyro-glu)(17', 'M', 'Oxidation)_83eV']
['Name: QADGPDMQSLFTQYFQSMTDYGK/2_2(0', 'Q', 'Pyro-glu)(6', 'M', 'Oxidation)_64.6eV']
['Name: QADGPDMQSLFTQYFQSMTDYGK/3_1(0', 'Q', 'Pyro-glu)_39eV']
['Name: QADGPDMQSLFTQYFQSMTDYGK/3_1(0', 'Q', 'Pyro-glu)_40.5eV']
['Name: QADGPDMQSLFTQYFQSMTDYGK/3_2(0', 'Q', 'Pyro-glu)(17', 'M', 'Oxidation)_40.7eV']
['Name: QADGPDMQSLFTQYFQSMTDYGK/3_2(0', 'Q', 'Pyro-glu)(6', 'M', 'Oxidation)_40.7eV']
['Name: QADGPDMQSLFTQYFQSMTDYGK/3_2(0', 'Q', 'Pyro-glu)(6', 'M', 'Oxidation)_52eV']
['Name: QADLSFSSPVEMK/2_0_35eV']
['Name: QAEAMALLAEAER/2_0_34.1eV']
['Name: QAEDTYTDANGDK/2_0_34.7eV']
['Name: QAEEAEEQSNTNLSK/2_0_40.8eV']
['Name: QAEEQLEACGK/2_1(8', 'C', 'CAM)_30.7eV']
['Name: QAELEAAIQR/2_0_26.4eV']
['Name: QAESSQVK/2_0_20.5eV']
['Name: QAESSQVKQ/2_0_24.4eV']
['Name: QAEVTFLGHPGK/2_0_31.2eV']
['Name: QAEVTFLGHPGK/2_1(0', 'Q', 'Pyro-gl

['Name: QKPDGVFQEDGPVIHQEMIGGFR/3_1(0', 'Q', 'Pyro-glu)_37.9eV']
['Name: QKPDGVFQEDGPVIHQEMIGGFR/3_1(0', 'Q', 'Pyro-glu)_50eV']
['Name: QKPDGVFQEDGPVIHQEMIGGFR/4_0_26.9eV']
['Name: QKPDGVFQEDGPVIHQEMIGGFR/4_0_28eV']
['Name: QKPDGVFQEDGPVIHQEMIGGFR/4_0_36eV']
['Name: QKPDGVFQEDGPVIHQEMIGGFR/4_1(17', 'M', 'Oxidation)_18eV']
['Name: QKPDGVFQEDGPVIHQEMIGGFR/4_1(17', 'M', 'Oxidation)_36eV']
['Name: QKRLEDLR/3_1(0', 'Q', 'Pyro-glu)_16eV']
['Name: QKTESELLEISGK/2_0_34.3eV']
['Name: QKTESELLEISGK/2_0_46eV']
['Name: QKTESELLEISGK/2_1(0', 'Q', 'Pyro-glu)_33.8eV']
['Name: QKTESELLEISGK/2_1(0', 'Q', 'Pyro-glu)_45eV']
['Name: QKTESELLEISGK/3_0_21.6eV']
['Name: QKTESELLEISGK/3_0_29eV']
['Name: QKTGLDSPTGFDSSDITANSFTVHWVAPR/4_0_33.9eV']
['Name: QKVAPLGAELQESAR/2_0_38.8eV']
['Name: QKVDSLLESLEK/2_0_33.8eV']
['Name: QKVEGSEPTTPFNLFIGNLNPNK/3_0_39eV']
['Name: QKVTSLTACLVNQNLR/2_2(0', 'Q', 'Pyro-glu)(8', 'C', 'CAM)_44.4eV']
['Name: QKYEETHAELEASQK/2_0_43.5eV']
['Name: QKYEETHAELEASQK/3_1(0', 'Q', 'Pyro-g

['Name: QVSEHIAVYR/2_1(0', 'Q', 'Pyro-glu)_24eV']
['Name: QVSNAIVR/2_0_20.8eV']
['Name: QVSTPTLVEAAR/2_0_26eV']
['Name: QVSTPTLVEAAR/2_0_29.8eV']
['Name: QVSTPTLVEAAR/2_0_30.9eV']
['Name: QVSTPTLVEAAR/2_0_40eV']
['Name: QVSTPTLVEAAR/2_1(0', 'Q', 'Pyro-glu)_26eV']
['Name: QVSTPTLVEAAR/2_1(0', 'Q', 'Pyro-glu)_29.4eV']
['Name: QVSTPTLVEAAR/2_1(0', 'Q', 'Pyro-glu)_30.5eV']
['Name: QVSTPTLVEAAR/3_0_18.8eV']
['Name: QVTQTYWEDKPSR/2_1(0', 'Q', 'Pyro-glu)_38eV']
['Name: QVTQTYWEDKPSR/3_0_25.1eV']
['Name: QVVDSAYEVIK/2_0_29.3eV']
['Name: QVVDSAYEVIK/2_0_30.4eV']
['Name: QVVDSAYEVIK/2_0_39eV']
['Name: QVVDSAYEVIK/2_1(0', 'Q', 'Pyro-glu)_28.9eV']
['Name: QVVDSAYEVIK/2_1(0', 'Q', 'Pyro-glu)_30eV']
['Name: QVVLVGQEPVLFSGSVK/2_0_41.8eV']
['Name: QVVTLLNELK/2_1(0', 'Q', 'Pyro-glu)_26.7eV']
['Name: QVYEEEYGSNLEDDVVGDTSGYYQR/2_0_68.3eV']
['Name: QVYEEEYGSNLEDDVVGDTSGYYQR/2_0_70.9eV']
['Name: QVYEEEYGSNLEDDVVGDTSGYYQR/3_0_43eV']
['Name: QVYEEEYGSNLEDDVVGDTSGYYQR/3_1(0', 'Q', 'Pyro-glu)_42.7eV']
['Name: 

['Name: RVSEMEMASR/2_0_29.1eV']
['Name: RVSEMEMASR/2_0_37eV']
['Name: RVVVIKK/2_0_20.5eV']
['Name: RVYGSFLVNPEPGYNVSLLYDLENLPASK/3_0_48eV']
['Name: RVYLSECK/2_1(6', 'C', 'CAM)_25.6eV']
['Name: RWEVAALR/2_0_24.3eV']
['Name: RWEYCDIPR/2_1(4', 'C', 'CAM)_30.3eV']
['Name: RWEYCDIPR/3_1(4', 'C', 'CAM)_19.1eV']
['Name: RYDDPEVQKDTK/3_0_22.9eV']
['Name: RYEVPLETPR/2_0_30.6eV']
['Name: RYRPIATR/1_0_50.2eV']
['Name: RYTLNILEDIGGGQK/2_0_40.8eV']
['Name: RYTLNILEDIGGGQK/3_0_24.8eV']
['Name: SAAAFKPVGSTSVK/2_0_32.8eV']
['Name: SAAEEVDGLGVVRPHYGSVLDNER/2_0_60.2eV']
['Name: SAAEQLAISIPNDTAGR/2_0_40.1eV']
['Name: SAAESISESVPVGPK/2_0_34.1eV']
['Name: SAAESISESVPVGPK/2_0_45eV']
['Name: SAAEVIAQAR/2_0_23.8eV']
['Name: SAAEVIAQAR/2_0_24.7eV']
['Name: SAAVSFIPFMTFFK/2_0_38.7eV']
['Name: SADESGQALLAASHYASDEVR/3_0_32.1eV']
['Name: SADESGQALLAASHYASDEVR/3_0_33.3eV']
['Name: SADEVLFSGVK/2_0_28eV']
['Name: SADHPTLDK/2_0_23.9eV']
['Name: SADHPTLDK/3_0_15.1eV']
['Name: SADLNVDSIISYWK/2_0_37.7eV']
['Name: SAEEGEV

['Name: SHHPADFTPAVHASLDK/4_0_19.8eV']
['Name: SHKPLDMSK/2_0_25.4eV']
['Name: SHKPLDMSK/3_0_16eV']
['Name: SHLAIHQR/2_0_23.4eV']
['Name: SHLDRIDPPTVTITSR/3_0_27.7eV']
['Name: SHLDRIDPPTVTITSR/3_0_35eV']
['Name: SHSAGVTVGVAPVK/2_0_31.8eV']
['Name: SHVEDGDVAGSPAVPPAEQDPMK/3_0_35.7eV']
['Name: SHVSPIK/2_0_18.7eV']
['Name: SHYSNIEANESEEVR/2_0_42.9eV']
['Name: SHYSNIEANESEEVR/2_0_55eV']
['Name: SHYSNIEANESEEVR/3_0_19eV']
['Name: SHYSNIEANESEEVR/3_0_27eV']
['Name: SIANPEFDQR/2_0_27.6eV']
['Name: SIAQLNAENDHPFYYK/2_0_44.7eV']
['Name: SIAQLNAENDHPFYYK/3_0_28.2eV']
['Name: SIAQYWLGCPTSE/2_1(8', 'C', 'CAM)_36.8eV']
['Name: SIAQYWLGCPTSEK/2_1(8', 'C', 'CAM)_38.4eV']
['Name: SIAQYWLGCPTSEK/2_1(8', 'C', 'CAM)_39.9eV']
['Name: SIASLLTQVLLGAGDSTK/3_0_26.2eV']
['Name: SICINGK/2_1(2', 'C', 'CAM)_19.3eV']
['Name: SIEALLEAGQAQDTQASHAEANQQQTR/3_0_42.7eV']
['Name: SIEALLEAGQAQDTQASHAEANQQQTR/3_0_57eV']
['Name: SIEYSPQLEDASAK/2_0_37.4eV']
['Name: SIGEGQQHHMGGSK/3_0_22.3eV']
['Name: SIGVSNFNFR/2_0_26.7eV']
[

['Name: SPAPKPSDLRPGDVSGK/3_0_33eV']
['Name: SPATLNSR/2_0_20.6eV']
['Name: SPATLNSR/2_0_26eV']
['Name: SPDAGPHGQDR/2_0_27.6eV']
['Name: SPDGNKSPAPK/2_0_26.7eV']
['Name: SPDNEDPGDSK/2_0_28.2eV']
['Name: SPDVGLYGVIPECGETYQSDLAEAK/2_1(12', 'C', 'CAM)_63.2eV']
['Name: SPDVGLYGVIPECGETYQSDLAEAK/3_1(12', 'C', 'CAM)_30eV']
['Name: SPDVGLYGVIPECGETYQSDLAEAK/3_1(12', 'C', 'CAM)_41.3eV']
['Name: SPEAEEEEEQVMVR/2_0_40.4eV']
['Name: SPEAEEEEEQVMVR/2_1(11', 'M', 'Oxidation)_40.8eV']
['Name: SPEGTKPK/2_0_19.8eV']
['Name: SPFICR/2_1(4', 'C', 'CAM)_18.3eV']
['Name: SPFSVGVSPSLDLSK/2_0_32eV']
['Name: SPFSVGVSPSLDLSK/2_0_35.6eV']
['Name: SPFSVGVSPSLDLSK/2_0_37eV']
['Name: SPFSVGVSPSLDLSK/2_0_47eV']
['Name: SPGGPGPLTLK/2_0_24eV']
['Name: SPGGPGPLTLK/2_0_32eV']
['Name: SPLIESTANMENNQPQK/2_0_46.2eV']
['Name: SPLIIFSDCNMENAVK/2_1(8', 'C', 'CAM)_44.7eV']
['Name: SPLLEIVER/2_0_24.7eV']
['Name: SPLLPAVLEGLAK/2_0_27eV']
['Name: SPLVFGFCR/2_1(7', 'C', 'CAM)_22.5eV']
['Name: SPLVFGFCR/2_1(7', 'C', 'CAM)_26.3eV']


['Name: SVIDPIPAPVGDSNVDSGAK/2_0_45.4eV']
['Name: SVIFFER/2_0_18eV']
['Name: SVIFFER/2_0_21eV']
['Name: SVIGMGTGAGAYILTR/2_0_38.1eV']
['Name: SVILHLISQLK/2_0_29.3eV']
['Name: SVIVEPEGIEK/2_0_25eV']
['Name: SVIVEPEGIEK/2_0_29.2eV']
['Name: SVIVEPEGIEK/2_0_37eV']
['Name: SVKVPMMK/3_0_14.1eV']
['Name: SVMQEGAAPLPR/2_0_26eV']
['Name: SVMQEGAAPLPR/2_0_29.4eV']
['Name: SVMQEGAAPLPR/2_0_30.5eV']
['Name: SVMQEGAAPLPR/2_1(2', 'M', 'Oxidation)_26eV']
['Name: SVMQEGAAPLPR/2_1(2', 'M', 'Oxidation)_30.9eV']
['Name: SVMQEGAAPLPR/2_1(2', 'M', 'Oxidation)_40eV']
['Name: SVNCGTMVAQPK/2_1(3', 'C', 'CAM)_31.4eV']
['Name: SVNFAESEEAK/2_0_25eV']
['Name: SVNFAESEEAK/2_0_29.4eV']
['Name: SVNFAESEEAKK/2_0_32.6eV']
['Name: SVNFAESEEAKK/3_0_20.5eV']
['Name: SVNGLAFYDWENTELIR/2_0_49.3eV']
['Name: SVNGLYIQK/2_0_23.9eV']
['Name: SVNQSLLELHK/2_0_29.7eV']
['Name: SVNQSLLELHK/3_0_18.7eV']
['Name: SVPLATTPMAEQRPESTATAAVK/3_0_36.1eV']
['Name: SVPMSTVFYPSDGVATEK/2_0_40eV']
['Name: SVPMSTVFYPSDGVATEK/2_0_44.9eV']
['Name:

['Name: TETTGATGQASSLLSGR/2_0_38.3eV']
['Name: TETTGATGQASSLLSGR/2_0_39.8eV']
['Name: TETVCTFQDGALVQHQQWDGK/2_1(4', 'C', 'CAM)_59.5eV']
['Name: TETVCTFQDGALVQHQQWDGK/3_1(4', 'C', 'CAM)_37.5eV']
['Name: TETVEEPLEEDEAAK/2_0_35eV']
['Name: TETVEEPLEEDEAAK/2_0_39.6eV']
['Name: TETVEEPLEEDEAAKEEK/3_0_30.6eV']
['Name: TEVNTNHVLIYIEK/2_0_40.7eV']
['Name: TEVNTNHVLIYIEK/2_0_52eV']
['Name: TEVNTNHVLIYIEK/3_0_18eV']
['Name: TEVNTNHVLIYIEK/3_0_24.7eV']
['Name: TEVNTNHVLIYIEK/3_0_33eV']
['Name: TEVVCAPPTAYIDFAR/2_1(4', 'C', 'CAM)_42.4eV']
['Name: TFASLSELHCDK/2_1(9', 'C', 'CAM)_29eV']
['Name: TFASLSELHCDK/2_1(9', 'C', 'CAM)_33eV']
['Name: TFASLSELHCDK/3_1(9', 'C', 'CAM)_20.8eV']
['Name: TFCSEPEKVDKDNEDFQESNR/3_1(2', 'C', 'CAM)_39.4eV']
['Name: TFCSEPEKVDKDNEDFQESNR/4_1(2', 'C', 'CAM)_27.9eV']
['Name: TFDAIVMDPK/2_0_26.6eV']
['Name: TFDAIVMDPK/2_0_27.6eV']
['Name: TFDKTDFANWASSLANAPALISQR/3_0_40.2eV']
['Name: TFDQLSPDESKER/3_0_23.8eV']
['Name: TFEGVDPK/2_0_20.9eV']
['Name: TFEGVDPQNTSMR/2_0_46eV']


['Name: TLNFNEEGDAEEAMVDNWRPAQPLK/3_0_57eV']
['Name: TLNFNEEGDAEEAMVDNWRPAQPLK/3_1(13', 'M', 'Oxidation)_44.3eV']
['Name: TLNLAQNLLTQLPK/2_0_33eV']
['Name: TLNLAQNLLTQLPK/2_0_36.7eV']
['Name: TLNLAQNLLTQLPK/2_0_49eV']
['Name: TLNLTCTVFGNPDPEVVWFK/2_1(5', 'C', 'CAM)_54.7eV']
['Name: TLNTEGVVK/2_0_22.5eV']
['Name: TLPLNFDK/2_0_22.2eV']
['Name: TLPQAEALDK/2_0_26.4eV']
['Name: TLQEEHVTVAQLR/2_0_35.7eV']
['Name: TLQISPLDNGDLVR/2_0_36.1eV']
['Name: TLQNILEVEK/2_0_37eV']
['Name: TLQPLLFINSAK/2_0_32.7eV']
['Name: TLQQLQQQSQELQEVLGETDR/3_0_37.9eV']
['Name: TLQQTLLNQERPIK/3_0_24.8eV']
['Name: TLSTVSTQQK/2_0_26.6eV']
['Name: TLSYNLEVSNNSRRVT/3_0_20eV']
['Name: TLTGTTEESKR/2_0_29.7eV']
['Name: TLTGTTEVHVNK/2_0_30.5eV']
['Name: TLTSGGHAEHEGKPYCNHPCYSAMFGPK/4_2(15', 'C', 'CAM)(19', 'C', 'CAM)_32.6eV']
['Name: TLTSGGHAEHEGKPYCNHPCYSAMFGPK/4_2(15', 'C', 'CAM)(19', 'C', 'CAM)_33.9eV']
['Name: TLTSGGHAEHEGKPYCNHPCYSAMFGPK/5_2(15', 'C', 'CAM)(19', 'C', 'CAM)_25.4eV']
['Name: TLTTAAVSTAQPILSK/2_0_39eV']
[

['Name: TVDETYVPK/2_0_33eV']
['Name: TVDFLEAYPGILGQK/2_0_40.1eV']
['Name: TVEDLDGLIQQIYR/2_0_39eV']
['Name: TVEDLDGLIQQIYR/2_0_40.4eV']
['Name: TVEDLDGLIQQIYR/2_0_52eV']
['Name: TVEDLDGLIQQIYR/3_0_24.5eV']
['Name: TVEEAENIVVTTGVVR/2_0_41.7eV']
['Name: TVEEGAETPVYLALLPPDATEPHGQLVR/3_0_44.3eV']
['Name: TVEGAGNIAAATGFVK/2_0_36.6eV']
['Name: TVEMSPAYTEEELK/2_1(3', 'M', 'Oxidation)_38.5eV']
['Name: TVEPMGEMFNK/2_0_30eV']
['Name: TVEPMGEMFNK/2_0_40eV']
['Name: TVEPMGEMFNK/2_1(4', 'M', 'Oxidation)_30.4eV']
['Name: TVEPMGEMFNK/2_1(7', 'M', 'Oxidation)_31.6eV']
['Name: TVEPMGEMFNK/2_1(7', 'M', 'Oxidation)_40eV']
['Name: TVEQIRK/2_0_21.3eV']
['Name: TVEQIRK/3_0_13.4eV']
['Name: TVFDIWVR/2_0_21.6eV']
['Name: TVFTEQPTWVIDPIDGTTNFVHR/3_0_40.9eV']
['Name: TVGATALPK/2_0_20.9eV']
['Name: TVGATALPK/2_0_27eV']
['Name: TVGLLPPQNIHIFDEWYTR/2_0_53.8eV']
['Name: TVGLLPPQNIHIFDEWYTR/2_0_55.9eV']
['Name: TVGLLPPQNIHIFDEWYTR/3_0_25eV']
['Name: TVGLLPPQNIHIFDEWYTR/3_0_33.9eV']
['Name: TVGLLPPQNIHIFDEWYTR/3_0_45

['Name: VDCTADSDVCSAQGVR/2_2(2', 'C', 'CAM)(9', 'C', 'CAM)_42.3eV']
['Name: VDDPAGMLLAAFR/2_0_28eV']
['Name: VDDPAGMLLAAFR/2_0_32.2eV']
['Name: VDDQVTHIR/3_0_21eV']
['Name: VDEETEHTMR/2_0_29.2eV']
['Name: VDEETEHTMR/2_0_30.3eV']
['Name: VDEETEHTMR/2_1(8', 'M', 'Oxidation)_30.7eV']
['Name: VDEETEHTMR/3_0_18.4eV']
['Name: VDEETEHTMR/3_1(8', 'M', 'Oxidation)_18.6eV']
['Name: VDEFVGMLDMIR/3_2(6', 'M', 'Oxidation)(9', 'M', 'Oxidation)_21.5eV']
['Name: VDEFVTGNLSFDQINQAFDLMHSGDSIR/3_0_46.5eV']
['Name: VDETYVPK/2_0_22.3eV']
['Name: VDEYDYSKPIQGQQK/3_0_27.5eV']
['Name: VDFPQDQLATLTGR/2_0_32eV']
['Name: VDFPQDQLATLTGR/2_0_36.5eV']
['Name: VDGALCLDK/2_1(5', 'C', 'CAM)_20eV']
['Name: VDGALCLDK/2_1(5', 'C', 'CAM)_23.2eV']
['Name: VDGALCLDK/2_1(5', 'C', 'CAM)_24.1eV']
['Name: VDGEDTGEK/2_0_23.1eV']
['Name: VDGQSCPVPMMYQEGK/2_1(5', 'C', 'CAM)_38eV']
['Name: VDGQSCPVPMMYQEGK/2_1(5', 'C', 'CAM)_42.7eV']
['Name: VDGQSCPVPMMYQEGK/2_2(5', 'C', 'CAM)(10', 'M', 'Oxidation)_43.1eV']
['Name: VDIDTPQVDVHGPDLK

['Name: VGLTASLAGPHAILGR/2_0_37.3eV']
['Name: VGLTASLAGPHAILGR/3_0_17eV']
['Name: VGLTASLAGPHAILGR/3_0_23.5eV']
['Name: VGLVAVDK/2_0_18.8eV']
['Name: VGLVAVDK/2_0_25eV']
['Name: VGLVAVDKGVFVLNKK/3_0_25.8eV']
['Name: VGLVQFSDTPVTEFSLDTYQTK/2_0_52eV']
['Name: VGLVQFSDTPVTEFSLDTYQTK/2_0_58eV']
['Name: VGLVQFSDTPVTEFSLDTYQTK/2_0_60.2eV']
['Name: VGLVQFSDTPVTEFSLDTYQTK/3_0_36.5eV']
['Name: VGLVQYNSDPTDEFFLR/2_0_42eV']
['Name: VGLVQYNSDPTDEFFLR/2_0_46.8eV']
['Name: VGLVQYNSDPTDEFFLR/2_0_48.6eV']
['Name: VGLVQYNSDPTDEFFLR/2_0_63eV']
['Name: VGNGGCHGLATCK/2_2(5', 'C', 'CAM)(11', 'C', 'CAM)_32.4eV']
['Name: VGPANPSLQK/2_0_23.7eV']
['Name: VGPANPSLQK/2_0_31eV']
['Name: VGPDSVQCYHFGWSPGFPTCK/3_2(7', 'C', 'CAM)(19', 'C', 'CAM)_35.8eV']
['Name: VGPDSVQCYHFGWSPGFPTCK/3_2(7', 'C', 'CAM)(19', 'C', 'CAM)_37.2eV']
['Name: VGPESDK/2_0_17.1eV']
['Name: VGPESDKYR/2_0_24.6eV']
['Name: VGPESDKYR/3_0_16.1eV']
['Name: VGQALLQGNTER/2_0_31.3eV']
['Name: VGQLLACLIGTQFR/2_1(6', 'C', 'CAM)_36.9eV']
['Name: VGQLLACL

['Name: VLRPQVTAVAQQNQGEAPEPQDMK/3_1(22', 'M', 'Oxidation)_40.6eV']
['Name: VLSDDEEDVDFDIIHNANDTFTVK/3_0_40.6eV']
['Name: VLSEEEIDDNFK/2_0_35eV']
['Name: VLSGEDK/2_0_15.6eV']
['Name: VLSGEDK/2_0_18.2eV']
['Name: VLSGEDK/2_0_23eV']
['Name: VLSGEDKSNIK/2_0_28.9eV']
['Name: VLSGEDKSNIK/3_0_17.6eV']
['Name: VLSIQSHVVR/2_0_27.7eV']
['Name: VLSIQSHVVR/3_0_17.4eV']
['Name: VLSMTETCR/2_2(3', 'M', 'Oxidation)(7', 'C', 'CAM)_27.1eV']
['Name: VLSPLDR/2_0_18.7eV']
['Name: VLSPLEYFR/2_0_27.3eV']
['Name: VLSYAPGPLDNDMQQLAR/2_0_46.6eV']
['Name: VLSYAPGPLDNDMQQLAR/2_0_48.4eV']
['Name: VLTADTLK/2_0_20.2eV']
['Name: VLTAMNK/2_0_18.9eV']
['Name: VLTPDLYNK/2_0_22.1eV']
['Name: VLTPDLYNK/2_0_24.9eV']
['Name: VLTPTQVMNRPSSISWDGLDPGK/3_0_36.9eV']
['Name: VLTPTQVMNRPSSISWDGLDPGK/3_0_49eV']
['Name: VLTQLVTGPK/2_0_25.7eV']
['Name: VLTSEEEYSLLSDK/2_0_37.8eV']
['Name: VLTSEEEYSLLSDK/2_0_39.2eV']
['Name: VLTSVDQYLELVGNSLPGTTSK/2_0_56.5eV']
['Name: VLVAQHDAYK/2_0_26.8eV']
['Name: VLVDINNPEPLIQTAK/2_0_41.3eV']
['Nam

['Name: VSWDISDSNVEQFR/2_0_40.9eV']
['Name: VSYGIGEEEHDQEGR/2_0_39.9eV']
['Name: VSYGIGEEEHDQEGR/3_0_19eV']
['Name: VSYGIGEEEHDQEGR/3_0_26.1eV']
['Name: VTAEVVLVHPGGGSTSR/2_0_40.5eV']
['Name: VTAEVVLVHPGGGSTSR/2_0_52eV']
['Name: VTAEVVLVHPGGGSTSR/3_0_25.5eV']
['Name: VTAEVVLVHPGGGSTSR/3_0_33eV']
['Name: VTAHAEGYTSSAK/2_0_31eV']
['Name: VTAHAEGYTSSAK/3_0_20.3eV']
['Name: VTAHAEGYTSSAK/3_0_26eV']
['Name: VTAIYIDPATHR/2_0_31.8eV']
['Name: VTALVDMYPRVFRKK/3_0_26.9eV']
['Name: VTALVPSESAIR/2_0_39eV']
['Name: VTAPLEAEYSGLVR/2_0_36.6eV']
['Name: VTASEPLETMGSEGALSPGGVASLLR/2_0_53eV']
['Name: VTASEPLETMGSEGALSPGGVASLLR/2_0_59.2eV']
['Name: VTASEPLETMGSEGALSPGGVASLLR/2_0_61.5eV']
['Name: VTASEPLETMGSEGALSPGGVASLLR/2_0_79eV']
['Name: VTASEPLETMGSEGALSPGGVASLLR/2_1(9', 'M', 'Oxidation)_59.6eV']
['Name: VTASEPLETMGSEGALSPGGVASLLR/3_0_28eV']
['Name: VTASEPLETMGSEGALSPGGVASLLR/3_0_37.3eV']
['Name: VTASEPLETMGSEGALSPGGVASLLR/3_0_50eV']
['Name: VTASEPLETMGSEGALSPGGVASLLR/3_1(9', 'M', 'Oxidation)_28eV']

['Name: WVTFMLK/2_0_22.5eV']
['Name: WVTFMLK/2_1(4', 'M', 'Oxidation)_22.9eV']
['Name: WVTHAGNK/2_0_22.2eV']
['Name: WWGNKR/2_0_20.6eV']
['Name: WWGNKR/3_0_13eV']
['Name: WYAICISDVGDYEGIK/2_1(4', 'C', 'CAM)_45.9eV']
['Name: WYASLQK/2_0_21eV']
['Name: WYASLQKPSWHPPR/3_0_26.9eV']
['Name: WYDVQSVVPHPGSRPDSLEDDLILFK/4_0_21eV']
['Name: WYDVQSVVPHPGSRPDSLEDDLILFK/4_0_31.4eV']
['Name: WYNLAVGSTCPWLSR/2_1(9', 'C', 'CAM)_44eV']
['Name: WYQMGIVSWGEGCDRK/3_1(12', 'C', 'CAM)_30.2eV']
['Name: WYVVGLAGNAVQK/2_0_32.9eV']
['Name: YAAGHTVHLSSCR/2_1(11', 'C', 'CAM)_35.5eV']
['Name: YADESGNMDFDNFISCLVR/2_1(15', 'C', 'CAM)_52.8eV']
['Name: YADESGNMDFDNFISCLVR/2_1(15', 'C', 'CAM)_54.8eV']
['Name: YAEAVGR/2_0_17.9eV']
['Name: YAEQYNEILTQCCAEADKESCLTPK/3_3(11', 'C', 'CAM)(12', 'C', 'CAM)(20', 'C', 'CAM)_46.3eV']
['Name: YAFTGSHYWR/2_0_31.3eV']
['Name: YAGLKPEELPTCESLK/2_1(11', 'C', 'CAM)_43eV']
['Name: YAGLKPEELPTCESLK/3_1(11', 'C', 'CAM)_27.1eV']
['Name: YAGVFHVEK/3_0_16.1eV']
['Name: YALQACR/2_1(5', 'C', '

['Name: YSESVKDAQEK/2_0_31.2eV']
['Name: YSFCTDHAVLVQTR/2_1(3', 'C', 'CAM)_39.7eV']
['Name: YSFCTDHAVLVQTR/3_1(3', 'C', 'CAM)_18eV']
['Name: YSFCTDHAVLVQTR/3_1(3', 'C', 'CAM)_25eV']
['Name: YSGCLTESNLIK/2_1(3', 'C', 'CAM)_33.7eV']
['Name: YSGELHLVHWNSAK/3_0_24.2eV']
['Name: YSGELHLVHWNSAK/3_0_32eV']
['Name: YSGKEGDKHTLSK/2_0_35.3eV']
['Name: YSGKEGDKHTLSK/3_0_22.2eV']
['Name: YSLEPVAAELK/2_0_25eV']
['Name: YSLEPVAAELK/2_0_28.6eV']
['Name: YSLEPVAAELK/2_0_38eV']
['Name: YSLVTHQR/2_0_24.4eV']
['Name: YSMQSSPK/2_0_22.6eV']
['Name: YSMWFK/2_0_21eV']
['Name: YSNKDCPDNAEEYER/3_1(5', 'C', 'CAM)_29eV']
['Name: YSNSALGHVNSTIK/2_0_34.9eV']
['Name: YSNSALGHVNSTIK/2_0_36.2eV']
['Name: YSNSALGHVNSTIK/2_0_46eV']
['Name: YSNSALGHVNSTIK/3_0_22.8eV']
['Name: YSNSALGHVNSTIK/3_0_29eV']
['Name: YSPHCK/2_1(4', 'C', 'CAM)_18.5eV']
['Name: YSPNTQVEILPQGR/2_0_37.5eV']
['Name: YSPNTQVEILPQGR/2_0_50eV']
['Name: YSPSEAGLHEMDIR/2_0_37.6eV']
['Name: YSPSEAGLHEMDIR/2_0_39eV']
['Name: YSPSEAGLHEMDIR/2_0_50eV']
['Nam
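
The rows above are split at the commas inside the modification brackets, so a single `Name:` line can arrive as one, three, or more fields. A minimal sketch of one way around this is to re-join the fields with `","` and parse the whole line with a regular expression. The helper name `parse_name_line` and the pattern are assumptions for illustration, based only on the `Name: sequence/charge_nmods_collisionenergy` layout described earlier and on the rows shown in this output.

```python
import re

# Hypothetical pattern (not from the original notebook). It assumes the layout
# "Name: SEQUENCE/CHARGE_NMODS(optional modification groups)_ENERGYeV".
NAME_RE = re.compile(
    r"^Name:\s*(?P<sequence>[A-Z]+)/(?P<charge>\d+)_(?P<nmods>\d+)"
    r"(?P<mods>(?:\([^)]*\))*)_(?P<energy>\d+(?:\.\d+)?)eV\s*$"
)


def parse_name_line(line):
    """Parse one raw 'Name:' line; return a dict, or None if it does not match."""
    m = NAME_RE.match(line)
    if m is None:
        return None
    return {
        "sequence": m.group("sequence"),
        "charge": int(m.group("charge")),
        "n_mods": int(m.group("nmods")),
        "collision_energy": float(m.group("energy")),
    }


# A row as it appears in the output above, re-joined into one string first.
row = ['Name: LSTLPSDFCGLTHLVK/2_1(8', 'C', 'CAM)_43.5eV']
info = parse_name_line(",".join(row))
print(info["sequence"], info["charge"], info["n_mods"], info["collision_energy"])
```

With the sequence and collision energy available as plain values, the filters on maximum sequence length and collision energy range reduce to simple comparisons on `len(info["sequence"])` and `info["collision_energy"]`.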

In [119]:
locations

[0,
 1193,
 90,
 81,
 182,
 380,
 118,
 217,
 67,
 105,
 395,
 85,
 78,
 401,
 470,
 103,
 123,
 40,
 149,
 492,
 436,
 195,
 62,
 157,
 214,
 111,
 427,
 321,
 162,
 26,
 580,
 86,
 102,
 79,
 535,
 93,
 58,
 1028,
 473,
 47,
 55,
 94,
 58,
 121,
 30,
 425,
 325,
 120,
 51,
 93,
 531,
 107,
 41,
 499,
 60,
 305,
 660,
 342,
 107,
 149,
 237,
 555,
 316,
 88,
 568,
 114,
 59,
 54,
 26,
 196,
 266,
 134,
 93,
 237,
 342,
 42,
 36,
 720,
 429,
 330,
 91,
 88,
 39,
 256,
 44,
 279,
 165,
 406,
 165,
 238,
 188,
 30,
 57,
 32,
 60,
 196,
 462,
 848,
 215,
 143,
 727,
 167,
 480,
 105,
 253,
 168,
 112,
 31,
 316,
 91,
 67,
 108,
 77,
 662,
 255,
 348,
 528,
 53,
 307,
 75,
 253,
 636,
 75,
 298,
 31,
 137,
 41,
 773,
 194,
 619,
 687,
 260,
 285,
 132,
 214,
 50,
 228,
 246,
 386,
 40,
 461,
 50,
 74,
 44,
 87,
 79,
 235,
 56,
 92,
 45,
 60,
 339,
 84,
 85,
 56,
 65,
 99,
 75,
 74,
 456,
 39,
 140,
 40,
 168,
 723,
 39,
 143,
 228,
 122,
 356,
 47,
 72,
 148,
 91,
 70,
 74,
 46,
 56,
 602,

In [124]:
names[]

'AAAEVNQEYGLDPK'

In [90]:
print(names[1195+322+84])

['MW: 1650.8090']
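
The lookup above lands on an `MW:` comment row rather than a `Name:` row, which suggests that hand-computed offsets into the row list are fragile. As a sketch of an alternative, the rows can be scanned once and the `Name:` rows picked out by prefix. The function name `find_name_rows` is hypothetical, and the code assumes the rows are lists of strings as produced by `csv.reader`, with empty lists for blank lines.

```python
def find_name_rows(rows):
    """Return (index, text) for every row whose first field starts with 'Name: '.

    Fields are re-joined with ',' because modification brackets such as
    (8,C,CAM) are split into separate fields by the CSV reader.
    """
    hits = []
    for i, row in enumerate(rows):
        # Empty rows (blank lines between spectra) are skipped by the truthiness check.
        if row and row[0].startswith("Name: "):
            hits.append((i, ",".join(row)))
    return hits


# Small illustrative input built from rows seen in the outputs above.
sample = [
    ['Name: LSTHLR/2_0_17.7eV'],
    ['MW: 1650.8090'],
    [],
    ['Name: MQQVEASLQPETLR/2_1(0', 'M', 'Oxidation)_34eV'],
]
for index, text in find_name_rows(sample):
    print(index, text)
```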


In [24]:
read_file(filename)

 AAAAAAAAAAEEAAMQRDLLPPAGR
 AAADDGEEPK
 AAAEVNQEYGLDPK
 AAAFEDQENETVVVK
 AAAFEDQENETVVVK
 AAAGELQEDSGLHVLAR
 AAAITSDLLESLGR
 AAAITSDLLESLGR
 AAAITSDLLESLGR
 AAALERMPR
 AAATLMTER
 AAATQPDGKDTPDEPWAFPAR
 AACFQLK
 AACLESAQEPAGAWSNK
 AACLESAQEPAGAWSNK
 AADAVEDLR
 AADIPGLK
 AADKDTCFSTEGPNLVTR
 AADKDTCFSTEGPNLVTR
 AADKDTCFSTEGPNLVTR
 AADKDTCFSTEGPNLVTR
 AADKDTCFSTEGPNLVTR
 AADKDTCFSTEGPNLVTR
 AADVHEVR
 AADVHEVR
 AADVHEVRK
 AADVHEVRK
 AAEGVPK
 AAEIDCQDIEER
 AAEQMSTLPIDAPSPLENLEQK
 AAESSAMAATEK
 AAEVESVK
 AAFEWNEEGAGSSPSPGLQPVR
 AAFEWNEEGAGSSPSPGLQPVR
 AAFEWNEEGAGSSPSPGLQPVR
 AAFEWNEEGAGSSPSPGLQPVR
 AAFGLSEAGFNTACLTK
 AAFGLSEAGFNTACLTK
 AAFLNEVLR
 AAFLNEVLR
 AAFLNEVLR
 AAFTMFSR
 AAFVLPEFTR
 AAFVLPEFTR
 AAGAQIQGMK
 AAGGAGAQVGGSISSGSSASSVTVTR
 AAGGAGAQVGGSISSGSSASSVTVTR
 AAGHLDDLPGALSALSDLHAHK
 AAGHLDDLPGALSALSDLHAHK
 AAGHLDDLPGALSALSDLHAHK
 AAGLLSTFR
 AAGLLSTFR
 AAGPLESSGKEEITQLK
 AAGPLESSGKEEITQLK
 AAGSGELGVTVK
 AAGSIASAQKPPAGK
 AAGSIASAQKPPAGK
 AAGTIQTSVQEVNSK
 AAGVPSASITWR
 AAGVSVEPFWPGLFAK


 ALYQTEAFTADFQQPTEAK
 ALYQTEAFTADFQQPTEAK
 ALYSNILGEENTYLWR
 ALYYDLITNPDIHSTYK
 ALYYDLITNPDIHSTYK
 ALYYDLITNPDIHSTYK
 ALYYDLITNPDIHSTYK
 ALYYDLITNPDIHSTYK
 AMADEVTEK
 AMALPSSGEGLAFFTFPNIDSPTK
 AMAMELGPHK
 AMAVLEPLQVVITNFPAPK
 AMAVLEPLQVVITNFPAPKPLDIR
 AMDECADEGGRPQR
 AMDFDRDVLSALAEVEQLSK
 AMEALALAER
 AMEELEDR
 AMEIAEALGR
 AMFHINKPR
 AMFHINKPR
 AMFHINKPR
 AMGDTEIKDGER
 AMGFTPLDMGSLASAR
 AMGNLQIDFADPQR
 AMGSTLESCWFPR
 AMGVPMMGLDYSDEINQVVEVR
 AMGVPMMGLDYSDEINQVVEVR
 AMGVPMMGLDYSDEINQVVEVR
 AMLQVHGGSGPR
 AMLQVHGGSGPR
 AMLQVHGGSGPR
 AMLSLGSK
 AMLSLGSK
 AMLSLGSK
 AMNQWVSGPAYYVEYLIK
 AMSLEEAK
 AMSTTSVTSSQPGK
 ANAGEESVMNLDK
 ANEDVCQDCMK
 ANEEDEQEEGGDGASGDPKK
 ANEEDEQEEGGDGASGDPKK
 ANEELAGVVAEVQK
 ANFLEKPVLGFVR
 ANFLEKPVLGFVR
 ANFLEKPVLGFVR
 ANFLEKPVLGFVR
 ANGSWELDEDLTK
 ANHMEVLDAGK
 ANIDVSGPK
 ANIIFNTALGTIFGVK
 ANLGGTEILTPLCNIYK
 ANLGGTEILTPLCNIYK
 ANLMHNLGGEEVSVACK
 ANLMHNLGGEEVSVACK
 ANLMHNLGGEEVSVACK
 ANLMHNLGGEEVSVACK
 ANLMHNLGGEEVSVACK
 ANLMHNLGGEEVSVACK
 ANLMHNLGGEEVSVACK
 ANLMHNLGGEEVSV

 DAEDLQKR
 DAEDLQKR
 DAEDQEAQLK
 DAEDQEAQLK
 DAEEAATGECTATVGK
 DAEEAATGECTATVGK
 DAEEAATGECTATVGK
 DAEEAATGECTATVGKR
 DAEEAATGECTATVGKR
 DAEEAATGECTATVGKR
 DAEEAATGECTATVGKR
 DAEEAATGECTATVGKR
 DAEEVISQTIDTIVDMIK
 DAEEVISQTIDTIVDMIK
 DAEEVISQTIDTIVDMIK
 DAEEVISQTIDTIVDMIK
 DAEEVISQTIDTIVDMIK
 DAEEVISQTIDTIVDMIK
 DAEEVISQTIDTIVDMIK
 DAETEQGPTHGHGWLHEK
 DAFERNPELQNLLLDDFFK
 DAFQALEK
 DAFQQIK
 DAFVAIVQSVK
 DAFVAIVQSVK
 DAGEGLLAVQITDQEGKPQR
 DAGVSTYMYEFR
 DAGVSTYMYEFR
 DAGVSTYMYEFR
 DAGVSTYMYEFR
 DAGVSTYMYEFR
 DAIDETK
 DAILDALENLSGDELKK
 DAILDALENLSGDELKK
 DAILFPSFIHSQK
 DAILYYVNLK
 DAILYYVNLK
 DAITAVR
 DAITAVR
 DAIVQAVK
 DAIVQAVK
 DALDMLQDFYDR
 DALDMLQDFYDR
 DALEEEDNVLVLK
 DALEEEDNVLVLK
 DALEEEDNVLVLK
 DALEEEDNVLVLKK
 DALEEEDNVLVLKK
 DALEEEDNVLVLKK
 DALISFLR
 DALISFLR
 DALISFLR
 DALNIETAVK
 DALNIETAVK
 DALVSQPTK
 DAMLAK
 DANEELNCQDPDVGDMEEEER
 DAPDGPSVEAEPEYTFEGLR
 DAPIVNR
 DAQDLDSGREHER
 DAQEKLEQAEK
 DAQILYNAGENK
 DAQLAGSPELLEFLGTR
 DAQMVHSNALNEDTQDELGDPR
 DAQPQLEEADDDLDSK
 DAQPQLEEADDDL

 DTHSLFFR
 DTHVMDYR
 DTLFTSDSGTR
 DTLFTSDSGTRR
 DTLGSTVSQLQER
 DTLLTASQLK
 DTLPESR
 DTLTSRPAQGVITTLENVSPPR
 DTLTSRPAQGVITTLENVSPPR
 DTLTSRPAQGVITTLENVSPPR
 DTLTSRPAQGVITTLENVSPPR
 DTLTSRPAQGVITTLENVSPPR
 DTLTSRPAQGVITTLENVSPPRR
 DTLTSRPAQGVITTLENVSPPRR
 DTLTSRPAQGVITTLENVSPPRR
 DTLTSRPAQGVITTLENVSPPRR
 DTLTSRPAQGVITTLENVSPPRR
 DTLWGLFNNLQR
 DTMASIGQTR
 DTMPADLPAIAADFVEDQEVCK
 DTMPADLPAIAADFVEDQEVCK
 DTMPADLPAIAADFVEDQEVCK
 DTNIVFSPLSISAALAIVSLGAK
 DTNIVFSPLSISAALAIVSLGAK
 DTNLLILFK
 DTNLLILFK
 DTPFPNAWQGQGEETQVEAK
 DTPLTLTVLHK
 DTPLTLTVLHK
 DTPLTLTVLHK
 DTPLTLTVLHK
 DTPLTLTVLHK
 DTPLTLTVLHK
 DTPPKREEMVLDDSAK
 DTQLITVDEK
 DTRGEVQTVTFDTDEVK
 DTSFDLFSISNINRK
 DTTEKELLDSYIDGR
 DTTPLSVLCGADIQVVSVGIK
 DTTPLSVLCGADIQVVSVGIK
 DTTPLSVLCGADIQVVSVGIK
 DTTVSSDSVAK
 DTTVTGLGR
 DTVAHIQEELATLR
 DTVEEHR
 DTVLSALSR
 DTVQLETLELPQGCVR
 DTVQLETLELPQGCVR
 DTVTTGLTGAVNVAK
 DTWVEHWPEAEECQDQK
 DTWVEHWPEAEECQDQK
 DTWVEHWPEAEECQDQK
 DTYDYDIAVLR
 DVAGDIFHQQCK
 DVAPPMEEEIVPGNDTTSPK
 DVAPPMEEEIVPGNDTTSPK
 DVASLGSQ

 ELHDPHYFSPIGFPHK
 ELHDVDLAEVKPLVEK
 ELHDVDLAEVKPLVEK
 ELHDVDLAEVKPLVEK
 ELHINLIPSKQDR
 ELHLNDNPMGDAGLK
 ELHLNDNPMGDAGLK
 ELHLNDNPMGDAGLK
 ELHLSSNR
 ELHLSSNR
 ELHSILQHK
 ELIEQLQNKPSDLGTK
 ELIEQLQNKPSDLGTK
 ELIIHQEYK
 ELILSSEPSPAVTPVTPTTLIAPR
 ELILSSEPSPAVTPVTPTTLIAPR
 ELILSSEPSPAVTPVTPTTLIAPR
 ELILSSEPSPAVTPVTPTTLIAPR
 ELILSSEPSPAVTPVTPTTLIAPR
 ELIPAEEALR
 ELISELDER
 ELISELDER
 ELISELDER
 ELISELDER
 ELKEEIRK
 ELKEENEEK
 ELKEENEEK
 ELLASVTAPEK
 ELLASVTAPEK
 ELLDSYIDGR
 ELLDSYIDGR
 ELLDSYIDGR
 ELLIGDK
 ELLIGDK
 ELLIGDKNALQNIILYHLTPGVYIGK
 ELLPTKDR
 ELLQSFQSK
 ELLSNVDQDVHELEK
 ELLSVGLGFLR
 ELLWGYK
 ELLYYAEQYNEILTQCCAEADK
 ELLYYAEQYNEILTQCCAEADK
 ELLYYAEQYNEILTQCCAEADK
 ELNDFISYLQR
 ELNDFISYLQR
 ELNDFISYLQR
 ELNDFISYLQR
 ELPAEQAEYCIR
 ELPEEQQQR
 ELPSALK
 ELPSGVEELLNLGHDPLADR
 ELPTPELMEAWGDAVK
 ELQEFYK
 ELQEFYKDTYQK
 ELQEMDKDDESLTK
 ELQEMDKDDESLTK
 ELQEMDKDDESLTK
 ELQEMDKDDESLTK
 ELQEQEEVVSCTK
 ELQLYGNNLEYIPEGVFDHLVGLTK
 ELQNTAANLHVR
 ELQQVMEAQER
 ELQSMADQEKVSPAALKK
 ELQVGIPVTDEAGQR
 ELSFAA

 FHVMDVQGSTEASAIK
 FHYGFNSSYLK
 FHYGFNSSYLK
 FICPPNRPFR
 FIDEHATK
 FIDEHATKR
 FIEDYLLPDTTFGADVK
 FIEWTR
 FIGGDAGDAFDGYDFGDDPSDK
 FIGGDAGDAFDGYDFGDDPSDK
 FIGGDAGDAFDGYDFGDDPSDK
 FIGGSGQVSER
 FIHKPSLK
 FIHVSHLNASMK
 FIILVSGFCPR
 FIIPNIVK
 FIIPNIVK
 FIIPNIVK
 FIITALPSIYHCK
 FIITALPSIYHCK
 FIITALPSIYHCK
 FILPDQGR
 FILYGLIGK
 FIMESQK
 FIMVPSGNMGVFDPTEIHNR
 FINEVVK
 FINFFR
 FINFFR
 FINLVAR
 FINTGFIK
 FINYVIK
 FIPFPASAK
 FIPLSEPAPVPPIPNEQQLAR
 FIPLSEPAPVPPIPNEQQLAR
 FIPLSEPAPVPPIPNEQQLAR
 FIPLSEPAPVPPIPNEQQLAR
 FIPLSEPAPVPPIPNEQQLAR
 FIPQMTAGK
 FIPQMTAGK
 FIQDSIFGLCPHMTEDNK
 FIQDSIFGLCPHMTEDNK
 FIQDSIFGLCPHMTEDNKDLIQGK
 FIQDSIFGLCPHMTEDNKDLIQGK
 FIQDSIFGLCPHMTEDNKDLIQGK
 FIQDSIFGLCPHMTEDNKDLIQGK
 FIQLNK
 FISDKDASVVGFFR
 FISDKDASVVGFFR
 FISDKDASVVGFFR
 FISHIK
 FISHIK
 FISHIK
 FISTAGLTVMGTQDTR
 FIVDSSK
 FIVDSSKK
 FIVDSSKK
 FIVFNSK
 FIWMDGSK
 FKAEHDQLLLNYAK
 FKDIFQEIYDKK
 FKGFDPNQISVATLLFEGDREK
 FKGGSPLNTGR
 FKGGSPLNTGR
 FKLEAPDADELPR
 FKLEENYNMNDALYK
 FKLEENYNMNDALYK
 FKLPGQPPASMGR
 FKLPGQPPASM

 GIETGAEDLEILPNGLTFFSTGLK
 GIFNGFSITLK
 GIFVQSVMPYLVATK
 GIGLPCSTTQGK
 GIGTDEATIIDIVTHR
 GIGTDEATIIDIVTHR
 GIGTDEATIIDIVTHR
 GIGTDEATIIDIVTHR
 GIGTDEATIIDIVTHR
 GIGTDEATIIDIVTHR
 GIGWVPIGSLEVVK
 GIHETTFNSIMK
 GIHETTFNSIMK
 GIHETTFNSIMK
 GIHETTFNSIMK
 GIHETTFNSIMK
 GIHETTFNSIMK
 GIHETTFNSIMK
 GIHVEIPGAQAESLGPLQVAR
 GIHVEIPGAQAESLGPLQVAR
 GIIDLIEER
 GIIDLIEER
 GIIDLIEER
 GIILDALEQQEDNINK
 GILDALLQTAR
 GILDALLQTAR
 GILDGNSAPVFPQPFGVK
 GILDGNSAPVFPQPFGVK
 GILLCHVDSDPPAQLR
 GILVQTK
 GILVQTK
 GIMEEDSYPYIGK
 GIMEEDSYPYIGK
 GINSGSDLPEEQLR
 GIPDGHR
 GIQMLPGYR
 GIQTLADPGSFDSNAFALLLR
 GISCMNTTVSESPFK
 GISGEDGYR
 GISHAPNAVK
 GISLNMEQWSQLK
 GITWGEDTLMEYLENPK
 GITWGEDTLMEYLENPKK
 GIVATFYSHPR
 GIVFVAAK
 GIVLLEELLPK
 GIVLLEELLPK
 GIYTGLSAGLLR
 GKADAGKDANNPAENGDAK
 GKAEQQTADQLLAR
 GKDFTPAAQAAFQK
 GKDFTPAAQAAFQK
 GKEAPFTHFDPSCLFPACR
 GKEIVLSAGSTPK
 GKGPQLFHMDPSGTFVQCDAR
 GKKVITAFNDGLNHLDSLK
 GKLDVQFSGLAK
 GKLEEQKPER
 GKLEEQKPER
 GKPDVVVKEDEEYK
 GKPDVVVKEDEEYKR
 GKPIPNK
 GKVNADEVGGEALGR
 GLAGVENVSELKK
 G

 HSLMPMLETLK
 HSLMPMLETLK
 HSLMPMLETLK
 HSLMPMLETLK
 HSLMPMLETLK
 HSLMPMLETLK
 HSLMPMLETLK
 HSLMPMLETLK
 HSLMPMLETLK
 HSLMPMLETLK
 HSLMPMLETLK
 HSLMPMLETLK
 HSLSPEQDIK
 HSQAVEELADQLEQTK
 HSQAVEELADQLEQTK
 HSQAVEELADQLEQTK
 HSQAVEELADQLEQTK
 HSQAVEELADQLEQTK
 HSQAVEELADQLEQTKR
 HSQAVEELADQLEQTKR
 HSQTTDDPLCPPGTK
 HSSASGIGHIQVAR
 HSSQTDAAR
 HSVGMAVLGQIAR
 HSVSLPK
 HTDQEELVK
 HTEAAAAQR
 HTEDVGVVCSEK
 HTESLESLLSK
 HTESLESLLSK
 HTESLESLLSK
 HTIILLTDGK
 HTLAANFNPVSEER
 HTLGHGDES
 HTLIIEGATK
 HTMNEGEPAIYAER
 HTNATAR
 HTNLGPLETK
 HTNLGPLETK
 HTNLVPHGTEK
 HTNLVPHGTEK
 HTNYNMEHIR
 HTQAVEELTEQLEQFK
 HTQAVEELTEQLEQFK
 HTQAVEELTEQLEQFKR
 HTQLAEEK
 HTSSWLVTPK
 HTSSWLVTPK
 HTSSWLVTPK
 HTTIFEVLPEK
 HTTIFEVLPEK
 HTTIFEVLPEK
 HTTIFEVLPEK
 HTTIFEVLPEK
 HTTIFEVLPEK
 HTTIFEVLPEKADR
 HTTIFEVLPEKADR
 HTTIFEVLPEKADRDQYELLCLDNTR
 HTTIFEVLPEKADRDQYELLCLDNTR
 HTTIFEVLPEKADRDQYELLCLDNTR
 HVAGLWAAVR
 HVAYAVYSLSK
 HVAYAVYSLSK
 HVAYAVYSLSK
 HVAYAVYSLSK
 HVDQEDAIEAYHGVCQTNR
 HVDWVR
 HVEEVRK
 HVEPGNAAIQEK
 HVGDLGNVTAG

 ITDVTSGLLGGEDGR
 ITEEPMGITLK
 ITEEYYVHLIADNLPVATR
 ITEGDLSQLTASIR
 ITEQVMK
 ITESIGCVMTGMTADSR
 ITFELIYQELLQR
 ITFQTK
 ITGDPFK
 ITGLVGPR
 ITGPDANVHLK
 ITGTNAEVMPAQWEFQIGPCEGIR
 ITGVEDIISR
 ITGYIIR
 ITLEDVISHR
 ITLNLPASTPVRK
 ITLPDFSGDFK
 ITLWLPR
 ITMYLMK
 ITNFSTDIK
 ITNFSTDIK
 ITPFEEK
 ITPFEEK
 ITPWKPYILMVPSNSDEIQLDIQAR
 ITQSSLTICFPEYTGANK
 ITQVLHFR
 ITQVLHFR
 ITQVLHFR
 ITQVLHFR
 ITSAVWGPLGECVIAGHESGELNQYSAK
 ITSEELHYFVQNHFTSAR
 ITSGIPQTER
 ITSLEVENQNLR
 ITSLEVENQNLR
 ITSVDAAFR
 ITVAALDAANLLASVPASQR
 ITVIAIYEDGDGGHLTGNGR
 ITVVDALHEIPLK
 ITVVDALHEIPLK
 ITVVDALHEIPLK
 ITVVDALHEIPLK
 ITVVDALHEIPLKK
 ITWAPVGNPDK
 ITWDPPSGPVK
 ITWLHTK
 ITYEDLPAIITIQDAIK
 ITYEDLPAIITIQDAIK
 ITYGMQGSSGYSLR
 ITYQPSTGEGNEQTITVGGR
 ITYQPSTGEGNEQTITVGGR
 ITYQPSTGEGNEQTITVGGR
 ITYQPSTGEGNEQTITVGGR
 ITYQPSTGEGNEQTITVGGR
 ITYVPMTGGAPSMVTVDGTDTETR
 ITYVPMTGGAPSMVTVDGTDTETR
 ITYVPMTGGAPSMVTVDGTDTETR
 ITYVPMTGGAPSMVTVDGTDTETR
 ITYVPMTGGAPSMVTVDGTDTETR
 ITYVPMTGGAPSMVTVDGTDTETR
 ITYVPMTGGAPSMVTVDGTDTETR
 IVAATLSDPELFK
 

 LCNECSDGSFHLSK
 LCPTVQLEDICNHQGLTPLK
 LCQLCPGCGCSSTQPFFGYVGAFK
 LCQLCPGCGCSSTQPFFGYVGAFK
 LCQLCPGCGCSSTQPFFGYVGAFK
 LCQLCPGCGCSSTQPFFGYVGAFK
 LCQLCPGCGCSSTQPFFGYVGAFK
 LCQLCPGCGCSSTQPFFGYVGAFK
 LCQPKPK
 LCRPCQCNDNIDPNAVGNCNR
 LCRPCQCNDNIDPNAVGNCNR
 LCSIPIHGIR
 LCTSATESEVTR
 LDANIPEVAVEGPEGK
 LDAPSHIEVK
 LDAPSHIEVK
 LDAPSHIEVK
 LDAQASFLSEELAAQTIK
 LDAQASFLSEELAAQTIK
 LDAQASFLSEELAAQTIK
 LDAQASFLSEELAAQTIK
 LDAQASFLSEELAAQTIKK
 LDAQASFLSEELAAQTIKK
 LDAVNTLLVMAER
 LDDCGLTEVR
 LDDLPGALSALSDLHAHK
 LDDMSSIVQK
 LDEEIAQK
 LDFLRPFSVPNK
 LDFNLVR
 LDFNLVR
 LDFPAMK
 LDGLVDTPTGYIESLPK
 LDGLVDTPTGYIESLPK
 LDGNFLKPPIPLDLMMCFR
 LDGNFLKPPIPLDLMMCFR
 LDGNPLTQSSLPPDMYECLR
 LDGNPLTQSSLPPDMYECLR
 LDGNPLTQSSLPPDMYECLR
 LDGNPLTQSSLPPDMYECLR
 LDGNPLTQSSLPPDMYECLR
 LDGPEEAECTK
 LDGPEEAECTK
 LDIACWVHHK
 LDIDSAPITAR
 LDIDSAPITAR
 LDIDSAPITAR
 LDIDTPDIDIHGPEGK
 LDIDVPNVDVQGPELHMK
 LDILDVLSEIK
 LDISAPDLNLEGPEGK
 LDKMDEEEVEVFLPK
 LDLDLTSDSQPPVFK
 LDLDTSGVGDGRR
 LDNGDCDQFCR
 LDNLVAIFDINR
 LDNLVAIFDINR
 LDNLVAIFDINR

 LPGALSALSDLHAHK
 LPGMIGSLPDPFGEEMR
 LPGMIGSLPDPFGEEMR
 LPGMLVASYTK
 LPGPWLPA
 LPGQPPASMGR
 LPGQPPASMGR
 LPHLGK
 LPIICFDYGMVPISAPR
 LPIICFDYGMVPISAPR
 LPIIDVAPLDIGAPDQEFGLDIGPACFV
 LPIIDVAPLDIGAPDQEFGLDIGPACFV
 LPIIDVAPLDIGAPDQEFGLDIGPACFV
 LPILNQPTSEIVASAR
 LPIPDSQVLTINPALPVEDAAEDYAR
 LPIPDSQVLTINPALPVEDAAEDYAR
 LPIPDSQVLTINPALPVEDAAEDYAR
 LPIPDSQVLTINPALPVEDAAEDYARK
 LPIPPILLAELGSDPEK
 LPITLNK
 LPLAAQAHPFRPPVR
 LPLAAQAHPFRPPVR
 LPLAAQAHPFRPPVR
 LPLAAQAHPFRPPVR
 LPLAAQAHPFRPPVR
 LPLPALFK
 LPLPALFK
 LPLPALFK
 LPLQLDDAIRPEVEGEEDGR
 LPMGSQCSVDLESASGEK
 LPMGSQCSVDLESASGEK
 LPMGSQCSVDLESASGEK
 LPMGSQCSVDLESASGEK
 LPNGLVIASLENYAPLSR
 LPPGEYVLVPSTFEPHK
 LPPIDLVR
 LPPIDLVR
 LPPLNIGEVLTLPEANFPSFSLPNCNR
 LPPLNIGEVLTLPEANFPSFSLPNCNR
 LPPLNIGEVLTLPEANFPSFSLPNCNR
 LPPLNIGEVLTLPEANFPSFSLPNCNR
 LPPLPVTPGMEGAGVVVAVGEGVGDRK
 LPPLPVTPGMEGAGVVVAVGEGVGDRK
 LPPLPVTPGMEGAGVVVAVGEGVGDRK
 LPQENFNNLR
 LPQENFNNLR
 LPQEVAK
 LPQFGISTPGSDLDINIK
 LPQFGISTPGSDLDINIK
 LPQTATCK
 LPQVSPADSGDYVCR
 LPQVSPADSGDYVCR
 LPSD

 MEVPFAVR
 MFAEYLASENQR
 MFASFPTTK
 MFASFPTTK
 MFASFPTTK
 MFASFPTTK
 MFASFPTTK
 MFASFPTTK
 MFASFPTTK
 MFASFPTTK
 MFASFPTTKTYFPHFDVSHGSAQVK
 MFASFPTTKTYFPHFDVSHGSAQVK
 MFASFPTTKTYFPHFDVSHGSAQVKG
 MFASFPTTKTYFPHFDVSHGSAQVKG
 MFESFIESVPLFK
 MFFDGRYI
 MFGCDVGSDWR
 MFGCDVGSDWR
 MFGGSGTSSR
 MFGGSGTSSR
 MFGGSGTSSRPSSNR
 MFGGSGTSSRPSSNR
 MFGGSGTSSRPSSNR
 MFGGSGTSSRPSSNR
 MFGGSGTSSRPSSNR
 MFGGSGTSSRPSSNR
 MFGGSGTSSRPSSNR
 MFGGSGTSSRPSSNR
 MFGGSGTSSRPSSNR
 MFGIDK
 MFGIDK
 MFGIDKDAIVQAVK
 MFGIDKDAIVQAVK
 MFGLMNSHAMLK
 MFGLMNSHAMLK
 MFGNLQGLTMHVK
 MFGNLQGLTMHVK
 MFGNLQGLTMHVK
 MFGNLQGLTMHVK
 MFGNLQGLTMHVK
 MFGNLQGLTMHVK
 MFGVPVVVAVNVFK
 MFIQTQDTPNPNSLK
 MFIQTQDTPNPNSLK
 MFNELLTHQAPR
 MFNQLHNNMLSGAGSR
 MFPSLCQLCAGK
 MFQTLIQK
 MFQTLIQK
 MFSAENGK
 MFVHLR
 MFYESVYGQCK
 MFYESVYGQCK
 MFYGSFHK
 MFYGSFHK
 MGAPEAGMAEYLFDK
 MGAPEAGMAEYLFDK
 MGAPEAGMAEYLFDK
 MGASLVSIETAAESSFLSYR
 MGASLVSIETAAESSFLSYR
 MGDHLWIAR
 MGDSSQGDNNVQK
 MGGSSGALYGLFLTAAAQPLK
 MGHYLHEVAR
 MGHYLHEVAR
 MGHYLHEVAR
 MGHYLHEVAR
 MGHYLHEVAR


 NSFTIPSQPGIPEEVGAGK
 NSGIENGAFQGLK
 NSIAYLDEETGSLNK
 NSIAYLDEETGSLNK
 NSIMNIR
 NSITLTNLNPGTEYVVSIIAVNGR
 NSITLTNLNPGTEYVVSIIAVNGR
 NSITLTNLNPGTEYVVSIIAVNGR
 NSITLTNLNPGTEYVVSIIAVNGR
 NSLCPSGGNILTPLLQQDCHQK
 NSLFDFQR
 NSLFDFQR
 NSLFGSVETWPWQVLSTGGK
 NSLFGSVETWPWQVLSTGGKEDVSYEER
 NSPGSFICECSPESTLDPTK
 NSPGSFICECSPESTLDPTK
 NSQEDSEDSEEKDVK
 NSQLTGGFTVEPVHDGAR
 NSQLTGGFTVEPVHDGAR
 NSQLTGGFTVEPVHDGAR
 NSSASNTQDGVGSLCSR
 NSSVGLIQLNRPK
 NSSYAHGGLDSNGKPADAVYGQK
 NSTFSELFKK
 NSTIVFPLPVDMLQGIMGSNH
 NSTIVFPLPVDMLQGIMGSNH
 NSTIVFPLPVDMLQGIMGSNH
 NSVELLVEDR
 NSWGEPWGEK
 NSWGLNFGDQGYIR
 NSWKDPALCCDLSPEDK
 NSYPHFYDGSEIVVAGR
 NSYPHFYDGSEIVVAGR
 NSYPHFYDGSEIVVAGR
 NSYPHFYDGSEIVVAGR
 NSYPHFYDGSEIVVAGR
 NSYPHFYDGSEIVVAGR
 NSYPHFYDGSEIVVAGR
 NSYQDAEDK
 NSYQDAEDKKK
 NSYQDAEDKKKEEKEEEEQEK
 NSYQDAEDKKKEEKEEEEQEKLK
 NSYQDAEDKKKEEKEEEEQEKLK
 NSYVAGQYDDAASYK
 NSYVAGQYDDAASYK
 NTAMLVCSTPQFPHGVMDPVPEVAK
 NTDGVNFYNILTK
 NTDGVNFYNILTK
 NTDIDLDK
 NTDPGYNCLPCPPR
 NTDPGYNCLPCPPR
 NTEEEGPK
 NTELCETPATSDTK
 NTETNVFPQDK

 QDGTHVVEAVDATHIGK
 QDGTHVVEAVDATHIGK
 QDGTHVVEAVDATHIGK
 QDGTHVVEAVDATHIGK
 QDLATLDVTK
 QDLGSPEGIALDHLGR
 QDLGSPEGIALDHLGR
 QDLGSPEGIALDHLGR
 QDLGSPEGIALDHLGR
 QDMEQAMTPSEMANALGLPALK
 QDNNWGKEER
 QDNNWGKEER
 QDNNWGKEER
 QDPSETSDSGVTLGR
 QDPSETSDSGVTLGR
 QDRVPPSR
 QDSFHTLKK
 QDSTQNLIPAPSLLTVPLQPDFR
 QDTTSTIISIASNVAGHPLVWDFVR
 QEACVFKEEIYCICDIPVMKVYNPAR
 QEALELIK
 QEDEMMR
 QEDEMMR
 QEDFELLCPDGTR
 QEDFELLCPDGTR
 QEDFELLCPDGTR
 QEDFELLCPDGTR
 QEDFELLCPDGTR
 QEDFELLCPDGTRK
 QEDFELLCPDGTRKPVK
 QEDFELLCPDGTRKPVK
 QEDFELLCPDGTRKPVK
 QEDFELLCPDGTRKPVK
 QEDFELLCPDGTRKPVK
 QEDFELLCPDGTRKPVK
 QEDIPEVSCIHNGLR
 QEDIPEVSCIHNGLR
 QEDKEEDLK
 QEDKEEDLK
 QEEDLANINQWVK
 QEEDLANINQWVK
 QEEDLANINQWVK
 QEEETMDFR
 QEESLMPSQAVK
 QEEVYSELQAR
 QEFGWIIPMMLQNLLPEGK
 QEGVSNQVK
 QEHTFSSLFCASDAEISEK
 QEHTFSSLFCASDAEISEK
 QEHTFSSLFCASDAEISEK
 QEHTFSSLFCASDAEISEK
 QEIAQEFK
 QEIAQEFK
 QEIFQEQLAAVPEFQGLGPLFK
 QEIFQEQLAAVPEFQGLGPLFK
 QEITDKDHTVSRLEETNSVLTKDIEMLR
 QELDEQEK
 QELLEQIAEQQK
 QELQEAQER
 QELSCPLCLQLFDAPVTAECGHS

 RVAEQELLDASER
 RVDSVNPPYPR
 RVDSVNPPYPR
 RVDSVNPPYPR
 RVDSVNPPYPR
 RVEDKDGYR
 RVELAGAK
 RVFQNYGK
 RVFQNYGK
 RVHPISTMIK
 RVHPISTMIK
 RVHPISTMIK
 RVHPISTMIK
 RVHPISTMIK
 RVPAPTNLQFTEVTPESFR
 RVPAPTNLQFTEVTPESFR
 RVPAPTNLQFTEVTPESFR
 RVPAPTNLQFTEVTPESFR
 RVPAPTNLQFTEVTPESFR
 RVPAPTNLQFTEVTPESFR
 RVPVIPPR
 RVSEMEMASR
 RVSEMEMASR
 RVVVIKK
 RVYLSECK
 RWEVAALR
 RWEYCDIPR
 RWEYCDIPR
 RYDDPEVQKDTK
 RYEVPLETPR
 RYRPIATR
 RYTLNILEDIGGGQK
 RYTLNILEDIGGGQK
 SAAAFKPVGSTSVK
 SAAEEVDGLGVVRPHYGSVLDNER
 SAAEQLAISIPNDTAGR
 SAAESISESVPVGPK
 SAAESISESVPVGPK
 SAAEVIAQAR
 SAAEVIAQAR
 SAAVSFIPFMTFFK
 SADESGQALLAASHYASDEVR
 SADESGQALLAASHYASDEVR
 SADEVLFSGVK
 SADHPTLDK
 SADHPTLDK
 SADLNVDSIISYWK
 SAEEGEVTESK
 SAETEKEMANMKEDFEK
 SAEVDDGASEK
 SAEVDSDDTGGSAAQK
 SAEVELQSK
 SAGTSLVNFFSSLMNLEEK
 SAGTSLVNFFSSLMNLEEK
 SAGTSLVNFFSSLMNLEEK
 SAGTSLVNFFSSLMNLEEKPAPAA
 SAGTSLVNFFSSLMNLEEKPAPAA
 SAGTSLVNFFSSLMNLEEKPAPAA
 SAGTSLVNFFSSLMNLEEKPAPAA
 SAGTSLVNFFSSLMNLEEKPAPAA
 SAGTSLVNFFSSLMNLEEKPAPAA
 SAGTSLVNFFSSLMNLEEKPAPAA


 SSLQTMESDVYTEVR
 SSLQTMESDVYTEVR
 SSLQTMESDVYTEVR
 SSNLDEQQELVERDDEK
 SSPCSTDKQNV
 SSPFQLFGSPHGEDLLFTDAAHGLLR
 SSPFQLFGSPHGEDLLFTDAAHGLLR
 SSPFQLFGSPHGEDLLFTDAAHGLLR
 SSPFQLFGSPHGEDLLFTDAAHGLLR
 SSPFQLFGSPHGEDLLFTDAAHGLLR
 SSPIYVGR
 SSPLIQFNLLHSK
 SSPLIQFNLLHSK
 SSPLPLQEGPGPEGGR
 SSPLPLQEGPGPEGGR
 SSPLSVVVR
 SSPVEYECINEK
 SSPVIIDASTAIDAPSNLR
 SSPVIIDASTAIDAPSNLR
 SSPVIIDASTAIDAPSNLR
 SSPVIIDASTAIDAPSNLR
 SSPVIIDASTAIDAPSNLR
 SSPVIIDASTAIDAPSNLR
 SSQDMLSVMEK
 SSQDMLSVMEK
 SSQQAASHVAPK
 SSQQAASHVAPK
 SSQQAASHVAPK
 SSQSQPSFPWQVR
 SSQVRPVSTVTPGSSGK
 SSRPASLQILYAPR
 SSRPEFYK
 SSRPEFYK
 SSSDACGPCVPASCPALPR
 SSSEQELR
 SSSGLLEWDSK
 SSSIASFK
 SSTASGTQVLK
 SSTIATK
 SSTNVEEAFFTLAR
 SSTPLPTVSSSAENTR
 SSTQPFFGYVGAFK
 SSTTNGFDDGIIWATWK
 SSTTNGFDDGIIWATWK
 SSTTNGFDDGIIWATWK
 SSVAVPYVIVPLK
 SSVAVPYVIVPLK
 SSVAVPYVIVPLK
 SSVAVPYVIVPLK
 SSVDDLVGIDYSLLK
 SSVETQPAEEVR
 SSVFVVDAK
 SSVFVVDAK
 SSVFVVDAK
 SSVSQVESDLK
 SSVSQVESDLK
 SSYPGQITGN
 SSYSLMK
 STATINNIK
 STATINNIKPGADYTITLY
 STATINNIKPGADYTITLYAVTGR


 TMPADLPAIAADFVEDQEVCK
 TMQALEFHTVPVEVLAK
 TMQTLLSLVK
 TMQYVPNSHDVK
 TMQYVPNSHDVK
 TMQYVPNSHDVK
 TMSGLDCQAWDSQSPHAHGYIPAK
 TMSGLDCQAWDSQSPHAHGYIPAK
 TMSGLDCQAWDSQSPHAHGYIPAK
 TMTIHNGMFFSTYDRDNDGWVTTDPR
 TMTIHNGMFFSTYDRDNDGWVTTDPR
 TMTIHNGMFFSTYDRDNDGWVTTDPR
 TMVVHEK
 TMVVHEKQDDLGKGGNEESTK
 TMVVHEKQDDLGKGGNEESTKTGNAGSR
 TMVVHEKQDDLGKGGNEESTKTGNAGSR
 TNCDLYEK
 TNCDLYEK
 TNELGDGGVGLVLQGLQNPTCK
 TNELGDGGVGLVLQGLQNPTCK
 TNELGDGGVGLVLQGLQNPTCK
 TNFPICIFCCK
 TNFWIGMFR
 TNLIVNYLPQNMTQEELR
 TNMAFSPFSIASLLTQVLLGAGDSTK
 TNMAFSPFSIASLLTQVLLGAGDSTK
 TNPEAQALWQVVGSSVIMR
 TNQMNLQNTATK
 TNSDLVSLAK
 TNTNVNCPIECFMPLDVQADRDDSRE
 TNVDQTMCLDINECER
 TNVDQTMCLDINECER
 TNVDQTMCLDINECER
 TNVNEPASDTLQEVK
 TNVSGGAIALGHPLGGSGSR
 TNVSGGAIALGHPLGGSGSR
 TNYDIEHVIK
 TNYDIEHVIKK
 TNYDIEHVIKK
 TPAAQAAFQK
 TPAGLQVLNDYLADK
 TPAGLQVLNDYLADK
 TPAGLQVLNDYLADK
 TPDEEFK
 TPDEEFKEVEVDR
 TPDEEFKEVEVDR
 TPDLSVQLPSADLELK
 TPDLSVQLPSADLELK
 TPDQIMELFDSITYSK
 TPDQIMELFDSITYSK
 TPEDIGEEQR
 TPEEANAGEK
 TPEEESIDWTK
 TPEELSAIK
 TPEELSA

 VESDVGPK
 VESSDVSDLLYK
 VESTEQLIEIASR
 VESVFETLVEDSPEEESTLTK
 VESVFETLVEDSPEEESTLTK
 VESVFETLVEDSPEEESTLTK
 VETLSQVEVILQQSAADIAR
 VETLSQVEVILQQSAADIAR
 VETLSQVEVILQQSAADIAR
 VETLSQVEVILQQSAADIAR
 VEVGKDQEFAIDTNGAGGQGK
 VEVLPVSLPGEHGQR
 VEVLPVSLPGEHGQR
 VEVLPVSLPGEHGQR
 VEVLPVSLPGEHGQR
 VEVLPVSLPGEHGQR
 VEVLPVSLPGEHGQR
 VEVLPVSLPGEHGQR
 VEVLPVSLPGEHGQR
 VEVPVGSFK
 VEVQIGIPGHLR
 VEVSAPDVSIEGSEGK
 VEVSAPDVSIEGSEGK
 VEYGFTVK
 VEYGFTVK
 VEYLNNR
 VFADYEEYIK
 VFAENAAGLSLPSETSPLVR
 VFAENMCGLSEDATMTK
 VFAILENKR
 VFAILENKR
 VFANAPDSACVIGLR
 VFANPEDCAGFGK
 VFAVHQGR
 VFAVHQGR
 VFCIGPVFR
 VFCIGPVFR
 VFDAIMNFR
 VFDAIMNFR
 VFDAIMNFR
 VFDAIMNFR
 VFDAIMNFR
 VFDPQSDKPSK
 VFDPQSDKPSK
 VFDPQSDKPSK
 VFEHIGKR
 VFEHSSVELK
 VFEHSSVELK
 VFEHSSVELK
 VFEHSSVELK
 VFFEQGATR
 VFFEQGATR
 VFFINHNIK
 VFFINHNIK
 VFFSNKPTR
 VFGFQPLAVR
 VFHGLK
 VFHGLK
 VFHILDK
 VFHILDK
 VFHILDK
 VFHILDKDK
 VFHILDKDK
 VFHLPTTTFIGGQEPALPLR
 VFIEDVSK
 VFIGNLNTAVVK
 VFIGNLNTAVVK
 VFIPHGLIMDR
 VFIPHGLIMDR
 VFIPHGLIMDR
 VFIPHGLIMDR
 VFIPHGLIM

 VTGLQEGNTYEFR
 VTGWGNLR
 VTGWGNLR
 VTHAFR
 VTIAQGYDALSSMANISGYK
 VTIGLLSLDDPQR
 VTIGLLSLDDPQR
 VTIGLLSLDDPQR
 VTILGR
 VTIMWTPPDSVVSGYR
 VTIMWTPPDSVVSGYR
 VTIMWTPPDSVVSGYR
 VTIMWTPPDSVVSGYR
 VTIMWTPPDSVVSGYR
 VTIMWTPPDSVVSGYR
 VTLASHHPADFTPAVHASLDK
 VTLASHHPADFTPAVHASLDK
 VTLEDEKK
 VTLNDVPCEVTK
 VTLSNQPYIK
 VTLTLPVLNAAQSIIFVATGEGK
 VTLTLPVLNAAQSIIFVATGEGK
 VTLTLPVLNAAQSIIFVATGEGK
 VTLTPEEEAR
 VTLTPEEEAR
 VTLTPEEEAR
 VTLYDPMSGILTSVQTK
 VTMTSEGGR
 VTNPVGEDVASIFLR
 VTNPVGEDVASIFLR
 VTNPVGEDVASIFLR
 VTNPVGEDVASIFLR
 VTPNLMGQLCGSGR
 VTPPEGYDVVTVFRE
 VTPPEGYDVVTVFRE
 VTQTFGENMQK
 VTQTFGENMQK
 VTQTFGENMQK
 VTQTFGENMQK
 VTSAALDLVK
 VTSAGPEESDGDLSCVCVK
 VTSGTK
 VTSLTACLVNQNLR
 VTSLTACLVNQNLR
 VTSLTACLVNQNLR
 VTSLVVDIVPR
 VTSSGTSTTHR
 VTSSGTSTTHR
 VTSTTNMAYNK
 VTTFLK
 VTVGLYGPK
 VTVNTNMTDLNDYLQHILK
 VTVTPVYTVGEGVSVSAPGK
 VTVTPVYTVGEGVSVSAPGK
 VTVVDVNEAR
 VTWAPPPSIELTNLLVR
 VTWAPPPSIELTNLLVR
 VTWAPPPSIELTNLLVR
 VTWAPPPSIELTNLLVR
 VTWAPPPSIELTNLLVR
 VTWAPPPSIELTNLLVR
 VTWAPPPSIELTNLLVR
 VTWCAVGSEE

 YHVPVVVVPEGSTSDTQEQAILR
 YHYVGTFLDGQK
 YIAPCLDSELTEFPLR
 YIATPIFSK
 YIATPIFSK
 YIDYEK
 YIESAGAR
 YIESAGAR
 YIESAGAR
 YIFTLR
 YIGYVIR
 YIINIFR
 YIINIFR
 YIISTPIVFR
 YIIWSPVCR
 YIMSYVSR
 YINENLIINTDELGR
 YIPQVK
 YIPVQQGPVGVNVTYGGDHIPK
 YIPVQQGPVGVNVTYGGDHIPK
 YIQLPFGDEDALK
 YIQLPFGDEDALK
 YIQLPFGDEDALK
 YIQLPFGDEDALK
 YIQLPFGDEDALKEAVATK
 YISLEGFEQPVAVFLGVPFAKPPLGSLR
 YISLEGFEQPVAVFLGVPFAKPPLGSLR
 YISVGYVDNK
 YISVGYVDNK
 YITPDQLADLYK
 YITPDQLADLYK
 YITPDQLADLYK
 YIVNVYQISEEGK
 YIVNVYQISEEGK
 YIVNVYQISEEGK
 YIVPMITVDGK
 YIVPMITVDGK
 YIVPMITVDGK
 YIVPMITVDGK
 YIVPMITVDGK
 YIVPMITVDGK
 YIYDNVAK
 YIYGKPVQGVAYTR
 YIYGKPVQGVAYTR
 YIYGKPVQGVAYTR
 YIYGKPVQGVAYTR
 YIYTGR
 YKAEDEVQR
 YKAEDEVQR
 YKPESDELTAEK
 YKPESDELTAEK
 YKPESDELTAEK
 YKPESDELTAEK
 YKPESDELTAEK
 YKPESDELTAEK
 YKSFVQNYPVVSIEDPFDQDDWGAWQK
 YKTDNGDFASFR
 YKTPDEEFKEVEVDR
 YKTPDEEFKEVEVDR
 YKTPDEEFKEVEVDR
 YLDFFK
 YLDFIFAVK
 YLDQTEQWEK
 YLDSIPPGQFMDSSLVK
 YLEGSVVR
 YLETLLHSQQQLAK
 YLGAEYMQSVGNMR
 YLGAEYMQSVGNMR
 YLGAEYMQSVGNMR
 YLGAE

In [112]:
a = 'AAFEWNEEGAGSSPSPGLQPVR'

In [43]:
a = ' 01'

In [114]:
len(a)

22

In [2]:
def msp_to_df(
    input_file,
    max_seq_len=30,
    min_ce=36,
    max_ce=40,
    mz_min=135,
    mz_max=1400,
):
    """
    Function to read spectrum data from .msp file and convert to dataframe.
    Args:
        input_file (str): path to .msp file
        max_seq_len (int): maximum acceptable sequence length
        min_ce (int): minimum collision energy of spectra to be included in df
        max_ce (int): maximum collision energy of spectra to be included in df
        mz_min (int): lower boundary for m/z to be included in df
        mz_max (int): upper boundary for m/z to be included in df

    Returns:
        df (pd.DataFrame or np.array): spectrum information within defined parameters [n_spectra, n_features]
        seqs (pd.DataFrame or np.array): sequences
    """
    
    
    
    df = None
    seqs = None
    
    return df, seqs
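
One possible way to fill in the stub above is sketched below. It is a minimal, unoptimized sketch and not a reference solution. The helper name `parse_msp_lines` is hypothetical, and it takes an iterable of lines instead of a file path so that it can be tried on a few hand-written lines. It assumes the header layout `Name: sequence/charge_nmods_collisionenergy` described in the exercise, assumes the collision energy may carry a unit suffix such as `eV`, and assumes every peak line starts with a digit. "Mathematically rounded" is read here as round-half-up, which differs from Python's built-in `round` (round-half-to-even).

```python
import re

import numpy as np
import pandas as pd


def parse_msp_lines(lines, max_seq_len=30, min_ce=36, max_ce=40,
                    mz_min=135, mz_max=1400):
    """Hypothetical helper: bin the spectra found in an iterable of .msp lines."""
    n_bins = mz_max - mz_min + 1
    rows, seqs = [], []
    row = None  # intensity vector of the current spectrum, None if it is skipped

    for line in lines:
        line = line.strip()
        if line.startswith("Name:"):
            # Assumed layout: "Name: SEQUENCE/charge_nmods_collisionenergy"
            seq, _, rest = line[5:].strip().partition("/")
            ce_text = rest.rsplit("_", 1)[-1]
            # Assumption: keep only the leading number, dropping a unit like "eV"
            match = re.match(r"\d+(?:\.\d+)?", ce_text)
            ce = float(match.group()) if match else float("nan")
            keep = len(seq) <= max_seq_len and min_ce <= ce <= max_ce
            row = np.zeros(n_bins) if keep else None
            if keep:
                rows.append(row)
                seqs.append(seq)
        elif row is not None and line and line[0].isdigit():
            # Peak line: m/z, intensity, additional info (ignored)
            parts = line.split(None, 2)
            if len(parts) < 2:
                continue
            mz_bin = int(np.floor(float(parts[0]) + 0.5))  # round half up
            if mz_min <= mz_bin <= mz_max:
                idx = mz_bin - mz_min
                # Keep only the largest peak that falls into this integer bin
                row[idx] = max(row[idx], float(parts[1]))
        # Comment lines and blank separator lines are skipped implicitly

    data = np.array(rows).reshape(-1, n_bins)
    df = pd.DataFrame(data, columns=np.arange(mz_min, mz_max + 1))
    seqs = pd.Series(seqs, name="sequence")

    # Drop spectra with no peak inside the selected m/z range
    nonzero = (data != 0).any(axis=1)
    return df[nonzero].reset_index(drop=True), seqs[nonzero].reset_index(drop=True)
```

With a real file, the call could look like `with open(input_file) as fh: df, seqs = parse_msp_lines(fh)`. The per-line Python loop is easy to follow but is unlikely to be the fastest option, so it is best treated as a correctness baseline to time faster versions against.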