forked from biopython/biopython
nucl2.fa
135 lines (135 loc) · 7.97 KB
>isotig35100 gene=isogroup05278 length=3411 numContigs=3
GAAAGGCAGGGTCGctGGTGCGTGCCCGGCGAGGCTGTGGAGGCCcgTGTGTCTAGAAGC
TGTGTGAGAGAGATGGCGGAACCCGGGAGGAGACGGGGTCCGAGGTCCCGCGGTGGCGGC
GCCGGCCGAGGCgCTCGAAGAGCCCGGGTCGCCCGTGGCCGGCGTCCTCGCGCCCCGCAG
TCTCTGTCCCGGCTCATTCCAGACACGGTGCTTGTGGACTTGGTCAGTGACAGCGACGAG
GAGATCCTGGAAGTCGTCGCGGACCCGGTAGAAGCGCCCGCCGCCCGGGCcCCcGCGCCG
GCCGCACATGGGCAGGACAGCGACAGCGACAGTGCAGGGGCGgACGAGGGGCCTGCAGGA
GCCCCTCAGACCTTGGTCCGGCGGCGGCGCCGGCGGCTGCTGGATCCCGGCGAGGCACCG
GTGGTTCCTGTGTACTCCGGGAAGGTACAAAGCAGCCTCAACCTCATCCCAGATAATTCA
TCCCTCTTGAAACTTTGCCCCTCAGAGCCTGAAGATGAGGCAGATGTGACAGATTGTGGC
AGTCCTCCTCCTGAGGATGCCCTAATTCCAGGTTCTCCCTGGAAGAAGAAGCTGAGGAAT
AAGCATGAAAAaGAAGAGATGAAGATGGAAGAGTTtCCGGACCAGGACATCTCTCCTTTG
CCCCGACCTTCATCAAGAAaCAAAAGCAGAAAGCATACCGAGGCACTCCAGAAGTTGAGG
GAAGTGAACAAGCGCCTCCAAGATCTCCGATCCTGCCTGAGCCCCAAGCAGCACCAGAGT
CCAGCCCTTCAGAACCCAGATGATGAGGTGGTCCTCGTGGACGGGCCTGtCTTGTCACAG
AGCCCGAGACTCTTCACCCTCAAGATCCGGTGCCGGGCTGACCTAGTCAGATTGCCCGTC
ATGACATCGGAACCCCTTCAGAATGTGGTGGATTACATGGCCAATCATCTTGGGGTGTCT
CCAAGCAGGATTCTTTTACTCTTTGGAGAGACAGAACTGTCCCCTACTGCCACCCCTAGG
ACCCTAAAGCTTGGTGTGGCTGACATCATTGATTGTGTGGTGCTGACAAGTTCTTCAGAG
GCCACAGAGACAACCCAGCAGCTCTGCCTCCGGGTGCAGGGGAAGGAGAAGCACCAGATG
TTGGAGATCTCACTCTCTCCTGACTCtCCTCTTGAGGTCCTCATGGCGCACTATGAGGAG
GCCATGGGACTCTCCGGACACAAGCTCTCCTTCTTCTTCGATGGGACAAAGCTGTCGGGC
AAGGAGCTGCCAGCTGATCTGGGCATGGAATCCGGGGATCTCATTGAAGTTTGGGGCAGC
TTCCTCCTCCTGTTTGGATGCAGAGCCAAGACTTGGGGACAACAGCTCCCACTTTtATTA
TTATTTTTtGCCCcAGGGCTAACAGAAACCGAATTAGAACTCGTTTATTTATTTCCGGCA
CTGGGGATTGAACCCAGGGCTGTGCATATGCTAAGGATGTGTGAAGTTGAGGTAAAaCCA
AGGCATGACCTTTGCCcTGTCTCGTTGACCGTAGTCTCAAGCGGTCTGATTGGTAATTGT
GTGACTGTGGCTGCCCTGGGTGgCCTGTGGCTGTGTGTGTTGGTGCTGTGTACAGCAGCT
CCTGGGGCATGGAGAAGGGGTATTGGCTTCCCTACCATTTCGTTCAGTAGTGCACAAATG
AGCCTTGCATTTGGGTGCCCAGGCTTGTTTATGCCACATTGGGGACCAACAGGGATTTTA
ATTCTCATTTGGGGGCTGAGATGCAGGTACCTTCCCTGGTGGGGATCGGTTGCAGGGAAA
ACAAGCAATGACATCTATGCTGAAGCTGAGGACGTAGACAGTGGGTTTTTAAAGGCTAAC
AGGAGTGGTGTCTTCAGCCCTGGGCATCCACTCTCCCAGGACTCGGCCAGCATCTGGTAC
ATACTTCCTGGCATGCACATCCCACATCTGAGCACATGCAGCTGTGTTTGTTTACTCTTC
cGTCCTCCCGCCCTCTCCTGTCAGCACCCACTTGGTATATTTGTATCTGAATGTCTTTAT
AAGATGATTTTCATGTGTGTGATTTTAGTGTATGTACATGGTACTGTGCAAACATTCTGT
CTTCACTCAGTGGTGACAGGCCCATGTCACCTTCTAGATCTGTTGTTGTTCCGATCTGCT
GTGGCTGCCCCATGGCATATGATATTACTCACACATTGCCTTAATTGGAGGGATACCGTC
GCCAGCTCCGAACCAAATTCCAGACCACAGCAAACATCTTTTTGTGTGTTCCCTTCTGGG
CCAGGGGTAATTGTCTGGCATACTTGCCCAGGATGGGACTgTGGGTTACGCAGGTTCATT
TcTCTGGTTTTTCTAGGgTTTTCTTCAGAATGTGGACTGCTTTTCTTGGAGCTTCTcTTT
CCTTcTCTTGACCAGTATTTAGTGTTGttgTACTTTCTGATTTGCCAGTCTACTGTGTAT
AAAGCACTGCATATCTTAATATCCTTTATCCCACCATTTCTTATCAATGTCCACCTTCTG
GTTTTTCTTTGTCTGCTTGTTTTCTTGGGCTTCCTTTTGTGTGTTTTTtCTGTGAGGAGT
TGCTTGTGTTCTaGgCAATTGCAAATAACCTTCAGAATAGCACTTTGTGTACCCAGGGGG
ATTTTCACCAAACAGACATTCTTAACTTTAATGCAGACAGCTGGCTTTTGTAAAGGGTTT
TTTGTTTTGTGTTTtTTTTTtttttGCTTTTGTTTTtGcTTTTTAAGGGAAAATGAGAAA
GAACTTCCAGAATGGCAGGGGTTCCGGGGAGTCTCAGGATTGCTACTTTTGGATCTCACT
AGTGAAACTTTCATTGTCTTCTGTCTTGCTAGTTTTTTGTTCTTTTGTTTTTATCTCTCA
TATGACCACATGAGAATAGAGTTCCTGATCTGCTGTGGTGTGGATaCTGACTGTTCTGAa
CTTTCCACTCTGCTTCTCACTGGTTACAGCACTGTACTGGTTCCTGACTATTGTATGTtA
CGTCTAAACCATCTGAAATGCACCTTTGTGTGGGAGAGCAAATTTTATTGTGTTTTAATT
ATAATGAGCCAGTTTTCtAAGCTTTGTCTaCTAAAAAATTATGTAGTCCTTTCACAAGAA
GTATGTCAACTTCTtAAAAATTTTTTTtGtGAGAATATGTTATACCTTATTTTAATATTT
AAAAATTCCATGACATATAGCTTCCAGGTTTCTCAGCTCTTGTATGACAATATCACACAT
TaCTATTATGTATTCAATCTCAGCCAAAGGGAGATGCCTTTAAC
>isotig35101 gene=isogroup05278 length=3391 numContigs=3
GAAAGGCAGGGTCGctGGTGCGTGCCCGGCGAGGCTGTGGAGGCCcgTGTGTCTAGAAGC
TGTGTGAGAGAGATGGCGGAACCCGGGAGGAGACGGGGTCCGAGGTCCCGCGGTGGCGGC
GCCGGCCGAGGCgCTCGAAGAGCCCGGGTCGCCCGTGGCCGGCGTCCTCGCGCCCCGCAG
TCTCTGTCCCGGCTCATTCCAGACACGGTGCTTGTGGACTTGGTCAGTGACAGCGACGAG
GAGATCCTGGAAGTCGTCGCGGACCCGGTAGAAGCGCCCGCCGCCCGGGCcCCcGCGCCG
GCCGCACATGGGCAGGACAGCGACAGCGACAGTGCAGGGGCGgACGAGGGGCCTGCAGGA
GCCCCTCAGACCTTGGTCCGGCGGCGGCGCCGGCGGCTGCTGGATCCCGGCGAGGCACCG
GTGGTTCCTGTGTACTCCGGGAAGGTACAAAGCAGCCTCAACCTCATCCCAGATAATTCA
TCCCTCTTGAAACTTTGCCCCTCAGAGCCTGAAGATGAGGCAGATGTGACAGATTGTGGC
AGTCCTCCTCCTGAGGATGCCCTAATTCCAGGTTCTCCCTGGAAGAAGAAGCTGAGGAAT
AAGCATGAAAAaGAAGAGATGAAGATGGAAGAGTTtCCGGACCAGGACATCTCTCCTTTG
CCCCGACCTTCATCAAGAAaCAAAAGCAGAAAGCATACCGAGGCACTCCAGAAGTTGAGG
GAAGTGAACAAGCGCCTCCAAGATCTCCGATCCTGCCTGAGCCCCAAGCAGCACCAGAGT
CCAGCCCTTCAGAACCCAGATGATGAGGTGGTCCTCGTGGACGGGCCTGtCTTGTCACAG
AGCCCGAGACTCTTCACCCTCAAGATCCGGTGCCGGGCTGACCTAGTCAGATTGCCCGTC
ATGACATCGGAACCCCTTCAGAATGTGGTGGATTACATGGCCAATCATCTTGGGGTGTCT
CCAAGCAGGATTCTTTTACTCTTTGGAGAGACAGAACTGTCCCCTACTGCCACCCCTAGG
ACCCTAAAGCTTGGTGTGGCTGACATCATTGATTGTGTGGTGCTGACAAGTTCTTCAGAG
GCCACAGAGACAACCCAGCAGCTCTGCCTCCGGGTGCAGGGGAAGGAGAAGCACCAGATG
TTGGAGATCTCACTCTCTCCTGACTCgCCTCTTGAGGTCCTCATGGCGCACTATGAGGAG
GCCATGGGACTCTCCGGACACAAGCTCTCCTTCTTCTTCGATGGGACAAAGCTGTCGGGC
AAGGAACTGCCAGCTGATCTGGGCATGGAATCCGGGGATCTCATTGAAGTTTGGGGCAGC
TTCCTCCTCCTGTTTGGATGCAGAGCCAAGACTTGGGGACAACAGCTCCCACTTTTATTA
TTATTTTTTGCCCCAGGGCTAACAGAAACCGAATTAGAACTCGTTTATTTATTTCCGGCA
CTGGGGATTGAACCCTGGGCTGTGCATATGCTAAGGATGTGTGAAGTTGAGGTAAAACCA
AGGCATGACCTTTGCCCTGTCTCGTTGACCGATTGGTTGTGTGACTGTGGCTGCCCTGGG
TGGCCTGTGGCTGTGTGTGTTGGTGCTGTGGACAGCAGCTCCTGGGGCATGGAGAAGGGG
TATTGGCTTCCCTACCATTTCGTTCAGTGCACAAATGAGCCTTGCATTTGGGTGCCCAGG
CTTGTGATTTATGCCACATTGGGGACCAACAGGGATTTTAATTCTCATTTGGGGGCTGAG
ATGCAGGTACCTTCCCTGGTGGGGATAGTCGGTTGCAGGGAAATAAACAAGCAACATCTA
TGCAGCGGACTAGGTAGACAGTGGGTAATTTTTAAAGGCCAGGAGTGGTGTCTTCAGCCC
TGGGCATCCACTCTCCCAGGACTCGGCCAGCATCTGGTACATACTTCCTGGCATGCACAT
CTAGCCACATCTGAGCACATGCAGCTGTTGAGTTTGATGTTTACTCTAGTTCcGTCCTCC
CGCCCTCTCCTGTCAGCACCCACTTGGTATATTTGTATCATGTCTTTAGATGATTTTCTG
AATGTGTGTGATTTTAGTGTATGTACATGGTACTGTGCAATAGACATTCTGTCTTCACTC
AGTGTGAGTGACAGGCTAGCCATGTTAGCACCTTCTAGATCTGTTGTTGTTCCGATCTGC
TGTGGCTGCCCCATAGTGGCATTAGATGATATTACTCACACATTGCCTTTGGAGGGATAC
CTGAGTCGCCAGCTCCTAGGAACCAAATTCCAGACCACAGCAAACATCTTTTTGTGTGTT
CCCTTCTGGGCCAGGGTAGGTAATTGTCTGGCATACTTGCCCAGGATGGGACTgTGGGTT
ACGCAGGTTCATTTcTCTGGTTTTTCTAGGgTTTTCTTCAGAATGTGGACTGCTTTTCTT
GGAGCTTCTcTTTCCTTcTCTCCAGTATTTAGTGTTGttgTACTTTCTGATTTGCCAGTC
TACTGTGTAAGCACTGCATATCTTAATATCCTTTATCTGACCACCATTTCTTATCAATGT
CCACCTTCTGGTTTTTCTTTGTTAGCTGCTTGTTTTCTTGTGAGGCTTCCTTTTGTGTGT
TTTTtCTGTGAGGAGTTGCTTGTGTTCgCAATTGCAAATAACCTTCAGAATAGCACTTTG
TGTACCCAGGGGGATTTTCACCAAACAGACATTCTTAACTTTAATGCAGACAGCTGGCTT
TTGTTAGAAAGGGTTTTTTGTTTTGTGTTTtTTTTTtttttGCTTTTGTTTTtGcTTTTT
AAGGGAAAAGAAAGAACTTCCAGAATGGCAGGGGTTCTAGCGGGGAGTCTCAGGATTGCT
ACTTTTGGATCTCTAAACTTAAGAAACTTTCATTGTCTTCTGTTAGCTTGCTTTTTTGTT
CTTTTGTTTTTATCTCTCATATTGAGACCACTAAATGAGAATAGAGTTCTAACTGATCTG
CTGTGGTGTGGATaCCTGTTCaCTTTCCACTCTGCTTCTCACTGGTTACAGCTAGACTGT
ACTGGTTCCCTATTGTATGTtACGTCTAAACCATCTGAAATGCACCTTTGTGTGGGAGAG
CAAATTTTATTGTGTTTTAATTATAATGAGCCAGTTTTCGCTTTGTCTaCTAAAAAATTA
ATATGTAGTCTGACTTTCACAAGAAGTATGTCAACTTCTAAATTTTTTTtGGAATATGTT
ATACCTTTGAATTTTAATATTAAATTCCATGACATAGTACTTCCAGGTTTCTCAGCTCTT
GTACAATTAATGAATCTGAACACATTaCTATTATGTATTCAATCTCAGCCAATAAAGGGA
GATGCCTTTAAC
>ENSG00000176953:ENST00000320805 cds:KNOWN_protein_coding
ATGGCGGAGCCTGTGGGGAAGCGGGGCCGCTGGTCCGGAGGTAGCGGTGCCGGCCGAGGG
GGTCGGGGCGGCTGGGGCGGTCGGGGCCGGCGTCCTCGGGCCCAGCGGTCTCCATCCCGG
GGCACGCTGGACGTAGTGTCTGTGGACTTGGTCACCGACAGCGATGAGGAAATTCTGGAG
GTCGCCACCGCTCGCGGTGCCGCGGACGAGGTTGAGGTGGAGCCCCCGGAGCCCCCGGGG
CCGGTCGCGTCCCGGGATAACAGCAACAGTGACAGCGAAGGGGAGGACAGGCGGCCCGCA
GGACCCCCGCGGGAGCCGGTCAGGCGGCGGCGGCGGCTGGTGCTGGATCCGGGGGAGGCG
CCGCTGGTTCCGGTGTACTCGGGGAAGGTTAAAAGCAGCCTTCGCCTTATCCCAGATGAT
CTATCCCTCCTGAAACTCTACCCTCCAGGGGATGAGGAAGAGGCAGAGCTGGCAGATTCG
AGTGGTCTCTACCATGAGGGCTCCCCATCACCAGGCTCTCCCTGGAAGACAAAGCTGAGG
ACTAAGGATAAAGAAGAGAAGAAAAAGACAGAGTTTCTGGATCTGGACAACTCTCCTCTG
TCCCCACCTTCACCAAGGACCAAAAGCAGAACGCATACTCGGGCACTCAAGAAGTTAAGT
GAGGTGAACAAGCGCCTCCAGGATCTCCGTTCCTGTCTGAGCCCCAAGCCACCTCAGGGT
CAAGAGCAACAGGGCCAAGAGGATGAAGTGGTCTTGGTGGAAGGGCCCACCCTCCCAGAG
ACCCCCCGACTCTTCCCACTCAAAATCCGTTGCCGGGCTGACCTGGTCAGATTGCCCCTC
AGGATGTCGGAGCCCCTGCAGAGTGTGGTGGACCACATGGCCACCCACCTTGGGGTGTCC
CCAAGCAGGATCCTTTTGCTTTTTGGAGAGACAGAGCTATCACCTACTGCCACTCCCAGG
ACCCTAAAGCTCGGAGTGGCTGACATCATTGACTGTGTGGTACTAACAAGTTCTCCAGAG
GCCACAGAGACGTCCCAACAGCTCCAGCTCCGGGTGCAGGGAAAGGAGAAACACCAGACA
CTGGAAGTCTCACTGTCTCGAGATTCCCCTCTAAAGACCCTCATGTCCCACTATGAGGAG
GCCATGGGACTGTCGGGACGGAAGCTCTCCTTCTTCTTTGATGGGACAAAGCTTTCAGGC
AGGGAGCTGCCAGCTGACCTGGGCATGGAATCTGGGGACCTCATTGAGGTCTGGGGC