Gene: ABCB11
Gene source: Homo sapiens (human)
***mRNA INFORMATION***
Molecule type: mRNA
mRNA accession number: NM_003742
mRNA length: 6934 bp
Gene description: Homo sapiens ATP binding cassette subfamily B member 11 (ABCB11), mRNA
Number of features: 60
Notes:
- progressive familial intrahepatic cholestasis 2
- ABC member 16, MDR/TAP subfamily
- sister p-glycoprotein
- ATP-binding cassette sub-family B member 11
- ATP-binding cassette, sub-family B (MDR/TAP), member 11
Resulting product: bile salt export pump
Comments: REVIEWED REFSEQ: This record has been curated by NCBI staff. The
reference sequence was derived from AC008177.3, AF091582.1,
AC069137.6 and CB161524.1.
This sequence is a reference standard in the RefSeqGene project.
On May 8, 2018 this sequence version replaced NM_003742.3.
Summary: The membrane-associated protein encoded by this gene is a
member of the superfamily of ATP-binding cassette (ABC)
transporters. ABC proteins transport various molecules across
extra- and intra-cellular membranes. ABC genes are divided into
seven distinct subfamilies (ABC1, MDR/TAP, MRP, ALD, OABP, GCN20,
White). This protein is a member of the MDR/TAP subfamily. Members
of the MDR/TAP subfamily are involved in multidrug resistance. The
protein encoded by this gene is the major canalicular bile salt
export pump in man. Mutations in this gene cause a form of
progressive familial intrahepatic cholestases which are a group of
inherited disorders with severe cholestatic liver disease from
early infancy. [provided by RefSeq, Jul 2008].
Sequence Note: This RefSeq record was created from transcript and
genomic sequence data to make the sequence consistent with the
reference genome assembly. The genomic coordinates used for the
transcript record were based on transcript alignments.
Publication Note: This RefSeq record includes a subset of the
publications that are available for this gene. Please see the Gene
record to access additional publications.
COMPLETENESS: complete on the 3' end.
Number of exons: 28
Exon 1: [0:100](+)
Exon 2: [100:203](+)
Exon 3: [203:225](+)
Exon 4: [225:277](+)
Exon 5: [277:516](+)
Exon 6: [516:604](+)
Exon 7: [604:738](+)
Exon 8: [738:910](+)
Exon 9: [910:1035](+)
Exon 10: [1035:1210](+)
Exon 11: [1210:1324](+)
Exon 12: [1324:1435](+)
Exon 13: [1435:1561](+)
Exon 14: [1561:1765](+)
Exon 15: [1765:1936](+)
Exon 16: [1936:2138](+)
Exon 17: [2138:2202](+)
Exon 18: [2202:2305](+)
Exon 19: [2305:2470](+)
Exon 20: [2470:2575](+)
Exon 21: [2575:2737](+)
Exon 22: [2737:2941](+)
Exon 23: [2941:3183](+)
Exon 24: [3183:3340](+)
Exon 25: [3340:3538](+)
Exon 26: [3538:3745](+)
Exon 27: [3745:3892](+)
Exon 28: [3892:6934](+)
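
The exon intervals above look like zero-based, half-open ranges (start included, end excluded), the same convention Python slices use. That reading is an assumption, but the record supports it: each exon starts where the previous one ends, and the last one ends at the reported mRNA length of 6934. The sketch below uses only the standard library to parse the intervals and check this. The names `parse_exons` and `is_contiguous` are illustrative and not part of the original file.

```python
import re

# Intervals copied from the exon list above (labels omitted; the parser
# only looks at the "[start:end](strand)" part of each entry).
EXON_TEXT = """
[0:100](+) [100:203](+) [203:225](+) [225:277](+) [277:516](+)
[516:604](+) [604:738](+) [738:910](+) [910:1035](+) [1035:1210](+)
[1210:1324](+) [1324:1435](+) [1435:1561](+) [1561:1765](+)
[1765:1936](+) [1936:2138](+) [2138:2202](+) [2202:2305](+)
[2305:2470](+) [2470:2575](+) [2575:2737](+) [2737:2941](+)
[2941:3183](+) [3183:3340](+) [3340:3538](+) [3538:3745](+)
[3745:3892](+) [3892:6934](+)
"""

INTERVAL = re.compile(r"\[(\d+):(\d+)\]\(([+-])\)")


def parse_exons(text):
    """Return a list of (start, end, strand) tuples found in the text."""
    return [(int(s), int(e), strand) for s, e, strand in INTERVAL.findall(text)]


def is_contiguous(exons):
    """True when every exon begins exactly where the previous one ends."""
    return all(prev[1] == cur[0] for prev, cur in zip(exons, exons[1:]))


exons = parse_exons(EXON_TEXT)
lengths = [end - start for start, end, _ in exons]

print(len(exons))            # number of exons parsed
print(sum(lengths))          # total exon length in bp
print(is_contiguous(exons))  # whether the exons tile the mRNA without gaps
```

With half-open coordinates, the length of an exon is simply `end - start`, and the bases of exon `i` are `mrna[start:end]` once the sequence lines are joined into a single string.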
mRNA sequence:
AGAATGATGAAAACCGAGGTTGGAAAAGGTTGTGAAACCTTTTAACTCTCCACAGTGGAGTCCATTATTTCCTCTGGCTTCCTCAAATTCATATTCACAG
GGTCGTTGGCTGTGGGTTGCAATTACCATGTCTGACTCAGTAATTCTTCGAAGTATAAAGAAATTTGGAGAGGAGAATGATGGTTTTGAGTCAGATAAAT
CATATAATAATGATAAGAAATCAAGGTTACAAGATGAGAAGAAAGGTGATGGCGTTAGAGTTGGCTTCTTTCAATTGTTTCGGTTTTCTTCATCAACTGA
CATTTGGCTGATGTTTGTGGGAAGTTTGTGTGCATTTCTCCATGGAATAGCCCAGCCAGGCGTGCTACTCATTTTTGGCACAATGACAGATGTTTTTATT
GACTACGACGTTGAGTTACAAGAACTCCAGATTCCAGGAAAAGCATGTGTGAATAACACCATTGTATGGACTAACAGTTCCCTCAACCAGAACATGACAA
ATGGAACACGTTGTGGGTTGCTGAACATCGAGAGCGAAATGATCAAATTTGCCAGTTACTATGCTGGAATTGCTGTCGCAGTACTTATCACAGGATATAT
TCAAATATGCTTTTGGGTCATTGCCGCAGCTCGTCAGATACAGAAAATGAGAAAATTTTACTTTAGGAGAATAATGAGAATGGAAATAGGGTGGTTTGAC
TGCAATTCAGTGGGGGAGCTGAATACAAGATTCTCTGATGATATTAATAAAATCAATGATGCCATAGCTGACCAAATGGCCCTTTTCATTCAGCGCATGA
CCTCGACCATCTGTGGTTTCCTGTTGGGATTTTTCAGGGGTTGGAAACTGACCTTGGTTATTATTTCTGTCAGCCCTCTCATTGGGATTGGAGCAGCCAC
CATTGGTCTGAGTGTGTCCAAGTTTACGGACTATGAGCTGAAGGCCTATGCCAAAGCAGGGGTGGTGGCTGATGAAGTCATTTCATCAATGAGAACAGTG
GCTGCTTTTGGTGGTGAGAAAAGAGAGGTTGAAAGGTATGAGAAAAATCTTGTGTTCGCCCAGCGTTGGGGAATTAGAAAAGGAATAGTGATGGGATTCT
TTACTGGATTCGTGTGGTGTCTCATCTTTTTGTGTTATGCACTGGCCTTCTGGTACGGCTCCACACTTGTCCTGGATGAAGGAGAATATACACCAGGAAC
CCTTGTCCAGATTTTCCTCAGTGTCATAGTAGGAGCTTTAAATCTTGGCAATGCCTCTCCTTGTTTGGAAGCCTTTGCAACTGGACGTGCAGCAGCCACC
AGCATTTTTGAGACAATAGACAGGAAACCCATCATTGACTGCATGTCAGAAGATGGTTACAAGTTGGATCGAATCAAGGGTGAAATTGAATTCCATAATG
TGACCTTCCATTATCCTTCCAGACCAGAGGTGAAGATTCTAAATGACCTCAACATGGTCATTAAACCAGGGGAAATGACAGCTCTGGTAGGACCCAGTGG
AGCTGGAAAAAGTACAGCACTGCAACTCATTCAGCGATTCTATGACCCCTGTGAAGGAATGGTGACCGTGGATGGCCATGACATTCGCTCTCTTAACATT
CAGTGGCTTAGAGATCAGATTGGGATAGTGGAGCAAGAGCCAGTTCTGTTCTCTACCACCATTGCAGAAAATATTCGCTATGGCAGAGAAGATGCAACAA
TGGAAGACATAGTCCAAGCTGCCAAGGAGGCCAATGCCTACAACTTCATCATGGACCTGCCACAGCAATTTGACACCCTTGTTGGAGAAGGAGGAGGCCA
GATGAGTGGTGGCCAGAAACAAAGGGTAGCTATCGCCAGAGCCCTCATCCGAAATCCCAAGATTCTGCTTTTGGACATGGCCACCTCAGCTCTGGACAAT
GAGAGTGAAGCCATGGTGCAAGAAGTGCTGAGTAAGATTCAGCATGGGCACACAATCATTTCAGTTGCTCATCGCTTGTCTACGGTCAGAGCTGCAGATA
CCATCATTGGTTTTGAACATGGCACTGCAGTGGAAAGAGGGACCCATGAAGAATTACTGGAAAGGAAAGGTGTTTACTTCACTCTAGTGACTTTGCAAAG
CCAGGGAAATCAAGCTCTTAATGAAGAGGACATAAAGGATGCAACTGAAGATGACATGCTTGCGAGGACCTTTAGCAGAGGGAGCTACCAGGATAGTTTA
AGGGCTTCCATCCGGCAACGCTCCAAGTCTCAGCTTTCTTACCTGGTGCACGAACCTCCATTAGCTGTTGTAGATCATAAGTCTACCTATGAAGAAGATA
GAAAGGACAAGGACATTCCTGTGCAGGAAGAAGTTGAACCTGCCCCAGTTAGGAGGATTCTGAAATTCAGTGCTCCAGAATGGCCCTACATGCTGGTAGG
GTCTGTGGGTGCAGCTGTGAACGGGACAGTCACACCCTTGTATGCCTTTTTATTCAGCCAGATTCTTGGGACTTTTTCAATTCCTGATAAAGAGGAACAA
AGGTCACAGATCAATGGTGTGTGCCTACTTTTTGTAGCAATGGGCTGTGTATCTCTTTTCACCCAATTTCTACAGGGATATGCCTTTGCTAAATCTGGGG
AGCTCCTAACAAAAAGGCTACGTAAATTTGGTTTCAGGGCAATGCTGGGGCAAGATATTGCCTGGTTTGATGACCTCAGAAATAGCCCTGGAGCATTGAC
AACAAGACTTGCTACAGATGCTTCCCAAGTTCAAGGGGCTGCCGGCTCTCAGATCGGGATGATAGTCAATTCCTTCACTAACGTCACTGTGGCCATGATC
ATTGCCTTCTCCTTTAGCTGGAAGCTGAGCCTGGTCATCTTGTGCTTCTTCCCCTTCTTGGCTTTATCAGGAGCCACACAGACCAGGATGTTGACAGGAT
TTGCCTCTCGAGATAAGCAGGCCCTGGAGATGGTGGGACAGATTACAAATGAAGCCCTCAGTAACATCCGCACTGTTGCTGGAATTGGAAAGGAGAGGCG
GTTCATTGAAGCACTTGAGACTGAGCTGGAGAAGCCCTTCAAGACAGCCATTCAGAAAGCCAATATTTACGGATTCTGCTTTGCCTTTGCCCAGTGCATC
ATGTTTATTGCGAATTCTGCTTCCTACAGATATGGAGGTTACTTAATCTCCAATGAGGGGCTCCATTTCAGCTATGTGTTCAGGGTGATCTCTGCAGTTG
TACTGAGTGCAACAGCTCTTGGAAGAGCCTTCTCTTACACCCCAAGTTATGCAAAAGCTAAAATATCAGCTGCACGCTTTTTTCAACTGCTGGACCGACA
ACCCCCAATCAGTGTATACAATACTGCAGGTGAAAAATGGGACAACTTCCAGGGGAAGATTGATTTTGTTGATTGTAAATTTACATATCCTTCTCGACCT
GACTCGCAAGTTCTGAATGGTCTCTCAGTGTCGATTAGTCCAGGGCAGACACTGGCGTTTGTTGGGAGCAGTGGATGTGGCAAAAGCACTAGCATTCAGC
TGTTGGAACGTTTCTATGATCCTGATCAAGGGAAGGTGATGATAGATGGTCATGACAGCAAAAAAGTAAATGTCCAGTTCCTCCGCTCAAACATTGGAAT
TGTTTCCCAGGAACCAGTGTTGTTTGCCTGTAGCATAATGGACAATATCAAGTATGGAGACAACACCAAAGAAATTCCCATGGAAAGAGTCATAGCAGCT
GCAAAACAGGCTCAGCTGCATGATTTTGTCATGTCACTCCCAGAGAAATATGAAACTAACGTTGGGTCCCAGGGGTCTCAACTCTCTAGAGGGGAGAAAC
AACGCATTGCTATTGCTCGGGCCATTGTACGAGATCCTAAAATCTTGCTACTAGATGAAGCCACTTCTGCCTTAGACACAGAAAGTGAAAAGACGGTGCA
GGTTGCTCTAGACAAAGCCAGAGAGGGTCGGACCTGCATTGTCATTGCCCATCGCTTGTCCACCATCCAGAACGCGGATATCATTGCTGTCATGGCACAG
GGGGTGGTGATTGAAAAGGGGACCCATGAAGAACTGATGGCCCAAAAAGGAGCCTACTACAAACTAGTCACCACTGGATCCCCCATCAGTTGACCCAATG
CAAGAATCTCAGACACACATGACGCACCAGTTACAGGGGTTGTTTTTAAAGAAAAAAACAATCCCAGCAGGAGGGATTGCTGGGATTGTTTTTTCTTTAA
AGAAGAATGTTAATATTTTACTTTTACAGTCATTTTCCTACATCGGAATCCAAGCTAATTTCTAATGGCCTTCCATAATAATTCTGCTTTAGATGTGTAT
ACAGAAAATGAAAGAAACTAGGGTCCATATGAGGGAAAACCCAATGTCAAGTGGCAGCTCAGCCACCACTCAGTGCTTCTCTGTGCAGGAGCCAGTCCTG
ATTAATATGTGGGAATTAGTGAGACATCAGGGAGTAAGTGACACTTTGAACTCCTCAAGGGCAGAGAACTGTCTTTCATTTTTGAACCCTCGGTGTACAC
AGAGGCGGGTCTATAACAGGCAATCAACAAACGTTTCTTGAGCTAGACCAAGGTCAGATTTGAAAAGAACAGAAGGACTGAAGACCAGCTGTGTTTCTTA
ACTAAATTTGTCTTTCAAGTGAAACCAGCTTCCTTCATCTCTAAGGCTAAGGATAGGGAAAGGGTGGATGCTCTCAGGCTGAGGGAGGCAGAAAGGGAAA
GTATTAGCATGAGCTTTCCAGTTAGGGCTGTTGATTTATGCTTTAACTTCAGAGTGAGTGTAGGGGTGGTGATGCTACCATTACTGTGAGGACCTACCAG
TGTGGCTGGAGCAGGGACTCTCTCCCAGGCCTTTTACTCCTCAGCACCTCCCTGCATACTGATTGTTGTTTTTAGTTTCTGTGAAATTATATTCATGAAA
TGAAAATAGCGCATTTTACTTTGCTGTAGTTTCATAAGGTTTTATACAAAAAAGCAAGTAAATATGGCAGAAAAGCACTCATTTGCCCCTGCTCCCTCAA
AACACCACAGAATGACATAGAACTAAAGGCGGCAGGAATCTACAAGAATGAAGAAAACACAGTGATGCCACCTGCAAAATCTTGGGAGCCAGAAAGCAAA
TGGACAATTGATAATAGAGTTACAAGATGAGAGAAAACAAAAATGTAACCTGTTAGTTGGGGGAGCCTAGAAACATCCTGTTTTGTACCACAGACCCCTA
GAAAGTTTCAAGATGTAAAAACACTGGATCCTTCTGGAAGGAGGAGACAAGGGGACAGAGGGACTGAAGACAGAAGAATGGACCAAAAGCCTGTATGGAA
ACAGAATTGCAGAGCCCCGCCTGCAGCAGGATGGCTGCCTTTCCCCAGTTCCAGCGAGAAACTGCATACTCCCTCTCTGAGGAGGCTCACAGGGAGGTTT
GCACTTAGAGACACAAGATGAAGTTGAAGGGGCAGCTAACATACAAGAGACAGGGGGATTGAATGTAAAGCTGTGTATGAACAGAAGAACTCCCAGCCCC
TCTCTTCTGGCAGGAGATTGAAGAATGATTTTCTGGGGAAATTGGCCAATCTTAGAAAAGAGGTCGGGCCAGGCGCTGTGGCTCACTCCTGTAATCCCAG
CACTTTGGGAGGCCAAGGCGGGCGGATCATGAGGTCGAGAGATTGAGACCATCCTGGCTAACATGGTGAAACCCCGTCTCTACTAAAAATACAAAAAAAT
TAGCCGGGCGTGGTGGCGGGCGCCTGTAGTCCCAGCTACTCAGGAGGCTGAGGCAGGAGAATGGCGTGAACCCGGGAGGTGGAGCTTGCAGTGAGCCGAG
ATCGCGCCACTGCACTCCAGCCTGGGCGACAGAGCGAAAAAAAAAAAAAAAGAAAAAGAAAAAGAAAAGAGGTCAAGGACTCCGTCCTTGGAATCCTAAG
AAAATTTTCCAGCCGTATTACCCTTCTATGAAGCCCACCTGTCAACCAACAAGCACCCACTCGATCAGAGCTTCCCCAGGCTTTTTGGTGTCTCCTCCTT
GCATGGGAATTGACTTCCAAGGACCACCAGACACTGAGGAAGTATTTTAACATATAAAGCAAAAGCAACAATAGGGCAGCTGGAGAAAGGAAATTAGAAG
TAACAGAGCCAATGCAGTGATTAGAAAAACACTCAAAAAATGGTAATAAATGTGTTCAATGGGTCAAGAGAACATATTTCCATCTATTTAAATAAAAACA
GGAATCAATAAAAGTGAACATACAAAAAAGAGCTCTTGTCACAGAAATGAAAACCTTGGGAGCAGGACTGCACCCACGGTTCCCACCAACCTATCCTGCC
ATCATGTTTTCTCCCTACTCCAGCTTCCTCCCAAGCTCACTCTACGTAGATGGTTTTTCTTCCCTTAGCCAACAAACGAAATTCTGCCTATAAATTAAGC
CAGGACACTAGGATATGGAGGTATATTATTTATGTCTCTTTTCCAGTGTTTCCTGGTATATAACAGCAAAGATGGTTTAGCAAGTTAACTGGTCAGGCCA
GAATTCTTGATGGTTGCCAAGTCAGGAGGCAGACGGTGAAGGAATGCATTCTGGTTATGAAAAATGGCAATGACGAGTAAGTTGCAGAGTAAAGATGCCT
TTTAGAGGAAATTAAGGTTTGCAATAAATGCATGCTAATCCTCTAATTTTGATGAAAGACTAAAGCTCTTGTTTTGGTTTGCATTTCAAAGAGCAAGAGC
ATAGGCTCTTGTGCTGGGAGGTGGCCAGTTTCCTGGAAAGGAATTTATGGAGTCTGAAAGAGGAATTAGTACAAATTATATTTTCAAATAAGATACTTGA
AAGTTATTGATGACAGGGAAAAGATTGTCTAAAGAGGGCTTGAAATTGGGGAAATAATTGAGGCTCAGTGACAAAGGAGAAGATTACAACTTCAAAAATG
AATAAATAAATAACATTGCATGTTTATTTTACAA
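
The record does not list the coding-region coordinates, but they can be located by matching the mRNA against the start of the protein sequence given further down (`MSDSVILRSIKK...`). The sketch below does this under stated assumptions: it uses the standard genetic code (NCBI translation table 1), takes only the first 200 nucleotides of the sequence above as input, and uses hypothetical helper names (`translate`, `find_cds_start`). Taking the first `ATG` would not work here, because two `ATG` triplets appear near the 5' end before the one that matches the protein.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product("TCAG", repeat=3), AMINO)
}

# First two lines (200 nt) of the mRNA sequence above, joined.
MRNA_HEAD = (
    "AGAATGATGAAAACCGAGGTTGGAAAAGGTTGTGAAACCTTTTAACTCTCCACAGTGGAG"
    "TCCATTATTTCCTCTGGCTTCCTCAAATTCATATTCACAG"
    "GGTCGTTGGCTGTGGGTTGCAATTACCATGTCTGACTCAGTAATTCTTCGAAGTATAAAG"
    "AAATTTGGAGAGGAGAATGATGGTTTTGAGTCAGATAAAT"
)

# First residues of the protein sequence reported in the protein section.
PROTEIN_HEAD = "MSDSVILRSIKK"


def translate(nt):
    """Translate a DNA-alphabet string codon by codon ('X' for unknowns)."""
    usable = len(nt) - len(nt) % 3
    return "".join(CODON_TABLE.get(nt[i:i + 3], "X") for i in range(0, usable, 3))


def find_cds_start(mrna, protein_prefix):
    """Return the 0-based offset of the ATG whose translation matches
    protein_prefix, or None if no ATG in mrna does."""
    pos = mrna.find("ATG")
    while pos != -1:
        window = mrna[pos:pos + 3 * len(protein_prefix)]
        if translate(window) == protein_prefix:
            return pos
        pos = mrna.find("ATG", pos + 1)
    return None


cds_start = find_cds_start(MRNA_HEAD, PROTEIN_HEAD)
print(cds_start)  # 127
```

The returned offset uses the same zero-based coordinates as the exon list, so it falls inside the second exon interval, `[100:203]`.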
***PROTEIN INFORMATION***
Molecule type: protein
Protein accession number: NP_003733
Protein length: 1321 aa
Protein description: bile salt export pump [Homo sapiens]
Molecular weight: 146277 Dalton
Number of protein features: 30
Protein sites of interest:
Site 5:
-Location: [62:83]
-Notes: propagated from UniProtKB/Swiss-Prot (O95342.2)
-Type: transmembrane region
Site 6:
-Location: [108:109]
-Notes: N-linked (GlcNAc...) asparagine. /evidence=ECO:0000269|PubMed:17082223; propagated from UniProtKB/Swiss-Prot (O95342.2)
-Type: glycosylation
Site 7:
-Location: [115:116]
-Notes: N-linked (GlcNAc...) asparagine. /evidence=ECO:0000269|PubMed:17082223; propagated from UniProtKB/Swiss-Prot (O95342.2)
-Type: glycosylation
Site 8:
-Location: [121:122]
-Notes: N-linked (GlcNAc...) asparagine. /evidence=ECO:0000269|PubMed:17082223; propagated from UniProtKB/Swiss-Prot (O95342.2)
-Type: glycosylation
Site 9:
-Location: [124:125]
-Notes: N-linked (GlcNAc...) asparagine. /evidence=ECO:0000269|PubMed:17082223; propagated from UniProtKB/Swiss-Prot (O95342.2)
-Type: glycosylation
Site 10:
-Location: [147:168]
-Notes: propagated from UniProtKB/Swiss-Prot (O95342.2)
-Type: transmembrane region
Site 11:
-Location: [215:236]
-Notes: propagated from UniProtKB/Swiss-Prot (O95342.2)
-Type: transmembrane region
Site 12:
-Location: [240:261]
-Notes: propagated from UniProtKB/Swiss-Prot (O95342.2)
-Type: transmembrane region
Site 13:
-Location: [319:340]
-Notes: propagated from UniProtKB/Swiss-Prot (O95342.2)
-Type: transmembrane region
Site 14:
-Location: [353:374]
-Notes: propagated from UniProtKB/Swiss-Prot (O95342.2)
-Type: transmembrane region
Site 15:
-Location: [585:586]
-Notes: Phosphothreonine. /evidence=ECO:0007744|PubMed:24275569; propagated from UniProtKB/Swiss-Prot (O95342.2)
-Type: phosphorylation
Site 16:
-Location: [586:587]
-Notes: Phosphoserine. /evidence=ECO:0007744|PubMed:24275569; propagated from UniProtKB/Swiss-Prot (O95342.2)
-Type: phosphorylation
Site 18:
-Location: [689:690]
-Notes: Phosphoserine. /evidence=ECO:0000250|UniProtKB:Q9QY30; propagated from UniProtKB/Swiss-Prot (O95342.2)
-Type: phosphorylation
Site 19:
-Location: [700:701]
-Notes: Phosphoserine. /evidence=ECO:0000250|UniProtKB:O70127; propagated from UniProtKB/Swiss-Prot (O95342.2)
-Type: phosphorylation
Site 20:
-Location: [703:704]
-Notes: Phosphoserine. /evidence=ECO:0007744|PubMed:24275569; propagated from UniProtKB/Swiss-Prot (O95342.2)
-Type: phosphorylation
Site 21:
-Location: [755:776]
-Notes: propagated from UniProtKB/Swiss-Prot (O95342.2)
-Type: transmembrane region
Site 22:
-Location: [794:815]
-Notes: propagated from UniProtKB/Swiss-Prot (O95342.2)
-Type: transmembrane region
Site 23:
-Location: [869:890]
-Notes: propagated from UniProtKB/Swiss-Prot (O95342.2)
-Type: transmembrane region
Site 24:
-Location: [890:911]
-Notes: propagated from UniProtKB/Swiss-Prot (O95342.2)
-Type: transmembrane region
Site 25:
-Location: [979:1000]
-Notes: propagated from UniProtKB/Swiss-Prot (O95342.2)
-Type: transmembrane region
Site 26:
-Location: [1011:1032]
-Notes: propagated from UniProtKB/Swiss-Prot (O95342.2)
-Type: transmembrane region
Site 27:
-Location: [1213:1214]
-Notes: Phosphoserine. /evidence=ECO:0007744|PubMed:24275569; propagated from UniProtKB/Swiss-Prot (O95342.2)
-Type: phosphorylation
Site 29:
-Location: [1320:1321]
-Notes: Phosphoserine. /evidence=ECO:0000250|UniProtKB:O70127; propagated from UniProtKB/Swiss-Prot (O95342.2)
-Type: phosphorylation
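
The site locations above appear to follow the same zero-based, half-open convention as the exon list, so a location can be applied directly as a Python slice on the protein sequence. This is an assumption, and the sketch below tests it on a few of the sites: it takes the first 200 residues of the protein sequence reported in this record and checks that each annotated N-linked glycosylation position holds an asparagine (`N`). The `SITES` list and the `residues` helper are illustrative names, not part of the original file.

```python
# First two lines (200 residues) of the protein sequence in this record.
PROTEIN_HEAD = (
    "MSDSVILRSIKKFGEENDGFESDKSYNNDKKSRLQDEKKGDGVRVGFFQL"
    "FRFSSSTDIWLMFVGSLCAFLHGIAQPGVLLIFGTMTDVFIDYDVELQEL"
    "QIPGKACVNNTIVWTNSSLNQNMTNGTRCGLLNIESEMIKFASYYAGIAV"
    "AVLITGYIQICFWVIAAARQIQKMRKFYFRRIMRMEIGWFDCNSVGELNT"
)

# (start, end, type) for the sites that fall inside the first 200 residues,
# copied from the site list above.
SITES = [
    (62, 83, "transmembrane region"),
    (108, 109, "glycosylation"),
    (115, 116, "glycosylation"),
    (121, 122, "glycosylation"),
    (124, 125, "glycosylation"),
    (147, 168, "transmembrane region"),
]


def residues(protein, start, end):
    """Return the residues covered by a half-open [start:end] location."""
    return protein[start:end]


for start, end, kind in SITES:
    print(f"[{start}:{end}] {kind}: {residues(PROTEIN_HEAD, start, end)}")

glyco = [residues(PROTEIN_HEAD, s, e) for s, e, k in SITES if k == "glycosylation"]
print(glyco)  # ['N', 'N', 'N', 'N']
```

Because the intervals are half-open, a single-residue site such as `[108:109]` corresponds to residue 109 in the one-based numbering used by UniProt.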
Protein sequence:
MSDSVILRSIKKFGEENDGFESDKSYNNDKKSRLQDEKKGDGVRVGFFQLFRFSSSTDIWLMFVGSLCAFLHGIAQPGVLLIFGTMTDVFIDYDVELQEL
QIPGKACVNNTIVWTNSSLNQNMTNGTRCGLLNIESEMIKFASYYAGIAVAVLITGYIQICFWVIAAARQIQKMRKFYFRRIMRMEIGWFDCNSVGELNT
RFSDDINKINDAIADQMALFIQRMTSTICGFLLGFFRGWKLTLVIISVSPLIGIGAATIGLSVSKFTDYELKAYAKAGVVADEVISSMRTVAAFGGEKRE
VERYEKNLVFAQRWGIRKGIVMGFFTGFVWCLIFLCYALAFWYGSTLVLDEGEYTPGTLVQIFLSVIVGALNLGNASPCLEAFATGRAAATSIFETIDRK
PIIDCMSEDGYKLDRIKGEIEFHNVTFHYPSRPEVKILNDLNMVIKPGEMTALVGPSGAGKSTALQLIQRFYDPCEGMVTVDGHDIRSLNIQWLRDQIGI
VEQEPVLFSTTIAENIRYGREDATMEDIVQAAKEANAYNFIMDLPQQFDTLVGEGGGQMSGGQKQRVAIARALIRNPKILLLDMATSALDNESEAMVQEV
LSKIQHGHTIISVAHRLSTVRAADTIIGFEHGTAVERGTHEELLERKGVYFTLVTLQSQGNQALNEEDIKDATEDDMLARTFSRGSYQDSLRASIRQRSK
SQLSYLVHEPPLAVVDHKSTYEEDRKDKDIPVQEEVEPAPVRRILKFSAPEWPYMLVGSVGAAVNGTVTPLYAFLFSQILGTFSIPDKEEQRSQINGVCL
LFVAMGCVSLFTQFLQGYAFAKSGELLTKRLRKFGFRAMLGQDIAWFDDLRNSPGALTTRLATDASQVQGAAGSQIGMIVNSFTNVTVAMIIAFSFSWKL
SLVILCFFPFLALSGATQTRMLTGFASRDKQALEMVGQITNEALSNIRTVAGIGKERRFIEALETELEKPFKTAIQKANIYGFCFAFAQCIMFIANSASY
RYGGYLISNEGLHFSYVFRVISAVVLSATALGRAFSYTPSYAKAKISAARFFQLLDRQPPISVYNTAGEKWDNFQGKIDFVDCKFTYPSRPDSQVLNGLS
VSISPGQTLAFVGSSGCGKSTSIQLLERFYDPDQGKVMIDGHDSKKVNVQFLRSNIGIVSQEPVLFACSIMDNIKYGDNTKEIPMERVIAAAKQAQLHDF
VMSLPEKYETNVGSQGSQLSRGEKQRIAIARAIVRDPKILLLDEATSALDTESEKTVQVALDKAREGRTCIVIAHRLSTIQNADIIAVMAQGVVIEKGTH
EELMAQKGAYYKLVTTGSPIS