test.fasta
70 lines (70 loc) · 5.34 KB
>637077918 mll5855 mll5855 nitrogen fixation protein nifB [Mesorhizobium loti MAFF303099: NC_002678]
MSAPMISLQGLSGTTVFDQSPARAKSGCASSSCGSSAKPHEMDSAVWEKIKDHPCFSEEAHHYFARMHVAVAPACNIQC
NYCNRKYDCANESRPGIVSERLTPDQALRKVIGVANEVPQLSVLGIAGPGDACYDWKNTKATFQRVAKEIGDIKLCISTN
GLALPDRVAELAEMNVDHVTITINMVDPRIGAKIYPWIFYGNRRYTGVEAATILHERQMSGLEMLTARGILTKINSVMIP
GVNDEHLIEVNKWVKERGAFLHNVMPLISDPAHGTHYGLSGQRGPKAMELKALQDRLEGGAKLMRHCRQCRADAVGLLGE
DRGQEFTLDRLPDKVTYDASKRETYRAVVARERGDRLAAKSEALGMVKTAGSGKSLLVAVATKGGGRINEHFGHAKEFQV
YEASPKGISFVGHRKVEQYCLGGRAEDKSLDGVISTLEGVDIVLCARIGNCPEDRLKEGGIRATEAYGYDYIETAIGALY
AAEFGSEPLAATA
>GAO28256.1 thioredoxin family protein [Geofilum rubicundum JCM 15548]
MDDAEVKGSELSTLFASFNEEVPHMEQVEKMREEFFAAQSQGDQATMESIMADMETIIEEQQAYYRNFVKENSDNVVGAF
LALNMAQSLEFEELEEITTNLEANLSTHPYVVQLKEMMEPIKAQKEAEAALNVGNEAPLFTLPNMEGSEVSLDDFKGKYV
FVDFWAAWCRPCREENPILKRAYDRFGGENFEIVSVSLDQTAEAWQQAVAEDELNWTLLRDSAGTVAQTYGVQSIPNTWL
LDKDGKIMQKQIRGEELITVLEDLLQ
>GAO28255.1 N-acetyl-L,L-diaminopimelate deacetylase [Geofilum rubicundum JCM 15548]
MHNIDRETIQKLTQEIYKDLLTYRRQLHQNPELSFEENETARYISQILRNHDIPIREGIAGTGIIATIQGERGTGRTIAL
RADMDALPIREATGLPFASQNNGVMHACGHDAHSASLLGTGIILNRLKKQWGGTILLIFQPGEEKFPGGASLILKEGALN
HPRPDLIIGQHVLPEMASGNVGFKKGMYMASGDEVYITVNGKGGHAAMPHTLNDTILAASQVIVNLQQIVSRIVPANIPT
VLSFGHIEGKGATNIIPEEVAIAGTLRTMNEDWRSRIKDKIRHIAETTAATYGCRAEVDIKDGYPMVLNNEEVTARAQEL
ATEYLGKAHVETMETRMTAEDFGFYSQEFPATFYRFGVRQNDGETGALHTPRFNLNEKSLETSCGLMTWVALNMLQP
>GAO28254.1 ferredoxin [Geofilum rubicundum JCM 15548]
MAYVINDDCIACGSCIDECPVDAISEGDIYVIDAEACTDCGACADVCPTEAIHPA
>GAO28253.1 5-formyltetrahydrofolate cyclo-ligase [Geofilum rubicundum JCM 15548]
MRKMEMIQKKKALRKHITYLKSVVPLLQMQEESRHVVAAIEALAVFKQARTVLAYWPMFKELDLSSLLNKWQAEKVFLLP
VVQGDSLEIRRFQGETSLAPGPSFGIMEPVGPAFNAFGDIDLVLVPGVAFDEEGRRIGHGKAYYDRLLPQLTKAFKIGVG
FSFQLVEDVPSEAHDVRLDLVVVPPMKPKDIGK
>GAO28252.1 GldB protein [Geofilum rubicundum JCM 15548]
MKLVFVSSIVTLLLMTMACSSGSKAPDVSHVDVDFELIPFYEDLFAIHPDSFAGEAEALKAKYGQYLEAYSLGVIAAGST
EDEDFVENMQFFLSYEPNQEVLDTCRLVFGDTAPLEEELESAFKYYRYYFPEAEVPDVYLHISGFNQSVVVDSSWISVSV
EKYLGRGCLFYEWLSIPVYLRRRMSPEKVVPDVMMALAMTEFAYNDSVDDLLNQMIYEGQLRYFVKQLIPDIPDTTLLDF
STEQMSWVENNEERMWSTIVENKHLFSNDRMTLQRFVGKSPFTYYFGEESPGGTGIYLGYQIVKAYMERHPETPLASLMQ
MNDGHAFFRSARYQP
>GAO28251.1 hypothetical protein JCM15548_1317 [Geofilum rubicundum JCM 15548]
MRKTALLILLFISAQLLTAQSAGDYRFSFVLTPQISWVKSDHTDVDNKGSQFGYNFGIIMDRFFSHNYAFSTGLTINTTG
GKLAYPSVTTNGTETFSAMSQTYQLKYIEIPLGLKLRSEDMHRTNIYGRFGLSPQINIQAQNSGGKSINEEVRLFDLGYH
LGGGIEYSLGGRNALMIGVLFNNGFMDVTDHDYFDDKAILNRLVFEFGFIF
>643560964 Dhaf_1050 DSY4270 Mo-nitrogenase MoFe protein subunit NifD precursor (EC 1.18.6.1) [Desulfitobacterium hafniense DCB-2: NC_011830]
MSISEMVQARKELVNQVLEVYPEKAKKNRRQHLSVKESDCSSCAVKSNGKTVPGIMTARGCAYAGAKGVVWGPVKDIVHI
SHGPVGCGFYSWANRRNLAEGEVGIDNFVPFQFTSDFQESDIIYGGDKKLEKIIEEVVELFPNAKGVSVLSECPVGLIGD
DIESVARRMTEKTQRPVVPVRCEGFRGISQSLGHHIANDAIRDHIIGKGPEREIGPYDIGIIGDYNIGGDAWASKKILEE
IGLNVVNIWTGDSTLEMLQNGHLVKLNLIHCYRSMNYMANYMEETYGTPWLEFNFFGPTKIKESLLNIAAHFDDSIRENT
QRVIAKYEAQMQKVIDIYRPRLAGKKVMLYVGGLRPRHVVGAYEDLGMEIIGTGYEFAHKEDYERTYPQLKEGTLIYDDV
SALELEEFVKDLKPDLVGSGIKEKYVFEKMGLPFRQMHSWDYSGPYHGYDGFPIFARDMDMAVNSPTWKSIKAPWMK
>646614562 Alvin_0903 nitrogenase MoFe cofactor biosynthesis protein NifE [Allochromatium vinosum DSM 180 chromosome: NC_013851]
MKPKDLSALLDEPACAHNAKSKSGCAKPKPGATAGGCAFDGAQIAMLPIGDVAHIVHGSIACAGNSWDNRGTRSSGPRLY
KIGMTTDLTEQDIIMGRGEKRLFHSIKQAVDSYQPAAVFVYNTCVPALTGDDVEAVCRAAEQRWGTPVIPIDAAGFYGAK
NLGSRISGETMVKRVCGGREPDPIPEGIERPGFKVHDICLIGEYNIAGELWHVLPLFDELGLRVLCTLSGDARFREVQTM
HRSEVNMMVCSRALVNVARRLKETYGTPWFEGSFYGVRDVSQALRDFARIIDDPDLTARTEVVIAREEAKAEAALAPWRE
RLQGRKVLLYTGGVKSWSVISALQDLGMTVVASGTRKSTEEDKARIRELMGEDAVMIEEGNPRTLIDMVHDQGVDILIAG
GRNLYTALKARLPFLDINQEREFGYAGYAGMEELARQLCLTIESPIWEAVRRPPPWARSAATTRSPAVTTPMVSESTALE
VRHA
>640487283 PST_1326 nifH Mo-nitrogenase iron protein subunit NifH (EC 1.18.6.1) [Pseudomonas stutzeri A1501: NC_009434]
MAMRQCAIYGKGGIGKSTTTQNLVAALAELGKKVMIVGCDPKADSTRLILHSKAQNTIMEMAAEAGTVEDLELEDVLKTG
YGDIKCVESGGPEPGVGCAGRGVITAINFLEEEGAYEDDLDFVFYDVLGDVVCGGFAMPIRENKAQEIYVVCSGEMMAMY
AANNICKGIVKYANSGSVRLGGLICNSRNTDREDELIMALADKLGSQMIHFVPRDNVVQRAEIRRMTVIEYDPAAKQADE
YRTLAKKIVENKKLVIPTPISMDELEALLMEFGIMDEEDMTIVGKTAAEEVVA
>637127509 GSU2819 nifK Mo-nitrogenase MoFe protein subunit NifK (EC 1.18.6.1) [Geobacter sulfurreducens PCA: NC_002939]
MSNQLGLAVKPVTEYDDAEVKRVAEWINTEEYKEKNFARQALVINPAHACQPLGAELVAHAFEGTLPFVHGSQGCASYYR
STLNRHFREPAPAVSDAMTEDGAVFGGQNNLHEGLENAIALYKPKMVAVFTSCMPEIIGDDLTAFLKNARNKGIIPADMP
TPYANTPSFNGSHIHGYDAMLLSILQTLTAGKQVEGRCTGKLNLIPGFDANTGNFREYKRILEAFGIPYTILGDISDVFD
SPLDGTYRPYPGGTTLDDAADSINGKATLNLGPYSAAKTFSWVKDSYSGKHASLPMPMGVTKTDDFLKKLSELFGKPVPE
SLKEERGRAVDAMTDAHQYIHNKKFAVYGDPDQLLGYVSFLLEMGAKPYHILCSKGTKKLEKEIQALLDTSPYGAGCKIY
INKDLWHMRSLLMTDPVDAMIGDTHGKFAARDAGIPLFRFGFPIFDRVNKHRYPIIGYQGVVNMLTEICNKFLDITDETC
EDRFFEMMR
>641334397 GDI0440 nifN Nitrogenase iron-molybdenum cofactor biosynthesis protein nifN [Gluconacetobacter diazotrophicus PAl 5: NC_010125]
MATIVKPRKAASVNPLKSSTPLGAALAYLGIDGAVPLFHGSQGCTSFALVLTVRHYKEAIPLQTTAMDEVATILGAAGNL
EEALLNLQRRMKPRFIGIASTALVETRGEDYAGDLKLILQRQPELADTRIVFASTPDYAGALEDGWAAAVSAIIESVVAP
WSPTVTSFQQVNVLPGVHQTPADIEALRDLIESFGLYPVILPDLSGSLDGHVAENWCPTTQGGARMEEVAQMARAVHTIA
IGEHMRAPADLLGSVTGVPVTLFPTLTGLAANDRLMALLSRLSGRAVPGRYRRQRSQLLDAMLDGHFHFGGKRIAIAADP
DLLYGLSAFFAGMGARIVAAVASTSNAPNLDSIPADSVIVGDLTDLEDAVHAAGGADLLVTHSHGRQSADRLGIPLMRVG
FPIFDRLGTAHAQTIGYRGTRDLIFRVANLFLGQMHEHTPDDFGHVPSAHTIEEIVHDSASLAAH