Gene: EMCN
Gene source: Homo sapiens (human)
***mRNA INFORMATION***
Molecule type: mRNA
mRNA accession number: NM_001159694
mRNA length: 3927 bp
Gene description: Homo sapiens endomucin (EMCN), transcript variant 2, mRNA
Number of features: 19
Notes:
- isoform 2 precursor is encoded by transcript variant 2
- endomucin-2
- MUC-14
- mucin-14
- gastric cancer antigen Ga34
Resulting product: endomucin isoform 2 precursor
Comments: VALIDATED REFSEQ: This record has undergone validation or
preliminary review. The reference sequence was derived from
BG506923.1, AK304568.1, AK291831.1, AC097459.3 and AL133118.1.
On May 31, 2019 this sequence version replaced NM_001159694.1.
Summary: EMCN is a mucin-like sialoglycoprotein that interferes
with the assembly of focal adhesion complexes and inhibits
interaction between cells and the extracellular matrix (Kinoshita
et al., 2001 [PubMed 11418125]).[supplied by OMIM, Mar 2008].
Transcript Variant: This variant (2) lacks an in-frame exon in the
middle portion of the coding region compared to variant 1. This
results in a shorter protein (isoform 2) compared to isoform 1.
Sequence Note: This RefSeq record was created from transcript and
genomic sequence data to make the sequence consistent with the
reference genome assembly. The genomic coordinates used for the
transcript record were based on transcript alignments.
Publication Note: This RefSeq record includes a subset of the
publications that are available for this gene. Please see the Gene
record to access additional publications.
COMPLETENESS: complete on the 3' end.
Number of exons: 11
Exon 1: [0:172](+)
Exon 2: [172:295](+)
Exon 3: [295:367](+)
Exon 4: [367:484](+)
Exon 5: [484:577](+)
Exon 6: [577:637](+)
Exon 7: [637:733](+)
Exon 8: [733:758](+)
Exon 9: [758:820](+)
Exon 10: [820:894](+)
Exon 11: [894:3927](+)
mRNA sequence:
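The exon coordinates above look like 0-based, half-open intervals on the plus strand. This is an assumption, based on the first exon starting at 0 and the last ending at 3927, the stated mRNA length. A minimal Python sketch, using hypothetical helper names, that parses lines in this layout and checks that the exons tile the transcript without gaps or overlaps:

```python
import re

# Exon intervals copied from the record. Only the "[start:end](strand)" part
# is parsed, so the text of the label before it does not matter.
EXON_LINES = """\
Exon 1: [0:172](+)
Exon 2: [172:295](+)
Exon 3: [295:367](+)
Exon 4: [367:484](+)
Exon 5: [484:577](+)
Exon 6: [577:637](+)
Exon 7: [637:733](+)
Exon 8: [733:758](+)
Exon 9: [758:820](+)
Exon 10: [820:894](+)
Exon 11: [894:3927](+)
""".splitlines()

INTERVAL = re.compile(r"\[(\d+):(\d+)\]\(([+-])\)")


def parse_exons(lines):
    """Return one (start, end, strand) tuple per line that holds an interval."""
    exons = []
    for line in lines:
        match = INTERVAL.search(line)
        if match:
            exons.append((int(match.group(1)), int(match.group(2)), match.group(3)))
    return exons


def tiles_contiguously(exons, total_length):
    """True if the exons start at 0, abut one another, and end at total_length."""
    position = 0
    for start, end, _strand in exons:
        if start != position or end <= start:
            return False
        position = end
    return position == total_length


exons = parse_exons(EXON_LINES)
lengths = [end - start for start, end, _strand in exons]
print(len(exons), sum(lengths), tiles_contiguously(exons, 3927))
```

Under the half-open assumption, each exon length is simply end minus start, and the lengths sum to the mRNA length.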
ACATTCACTACACCTTTTCCATTTGCTAATAAGGCCCTGCCAGGCTGGGAGGGAATTGTCCCTGCCTGCTTCTGGAGAAAGAAGATATTGACACCATCTA
CGGGCACCATGGAACTGCTTCAAGTGACCATTCTTTTTCTTCTGCCCAGTATTTGCAGCAGTAACAGCACAGGTGTTTTAGAGGCAGCTAATAATTCACT
TGTTGTTACTACAACAAAACCATCTATAACAACACCAAACACAGAATCATTACAGAAAAATGTTGTCACACCAACAACTGGAACAACTCCTAAAGGAACA
ATCACCAATGAATTACTTAAAATGTCTCTGATGTCAACAGCTACTTTTTTAACAAGTAAAGATGAAGGATTGAAAGCCACAACCACTGATGTCAGGAAGA
ATGACTCCATCATTTCAAACGTAACAGTAACAAGTGTTACACTTCCAAATGCTGTTTCAACATTACAAAGTTCCAAACCCAAGAGTAGTGTTCTACAACC
AGATGCATCACCTTCTAAAACTGGTACATTAACCTCAATACCAGTTACAATTCCAGAAAACACCTCACAGTCTCAAGTAATAGGCACTGAGGGTGGAAAA
AATGCAAGCACTTCAGCAACCAGCCGGTCTTATTCCAGTATTATTTTGCCGGTGGTTATTGCTTTGATTGTAATAACACTTTCAGTATTTGTTCTGGTGG
GTTTGTACCGAATGTGCTGGAAGGCAGATCCGGGCACACCAGAAAATGGAAATGATCAACCTCAGTCTGATAAAGAGAGCGTGAAGCTTCTTACCGTTAA
GACAATTTCTCATGAGTCTGGTGAGCACTCTGCACAAGGAAAAACCAAGAACTGACAGCTTGAGGAATTCTCTCCACACCTAGGCAATAATTACGCTTAA
TCTTCAGCTTCTATGCACCAAGCGTGGAAAAGGAGAAAGTCCTGCAGAATCAATCCCGACTTCCATACCTGCTGCTGGACTGTACCAGACGTCTGTCCCA
GTAAAGTGATGTCCAGCTGACATGCAATAATTTGATGGAATCAAAAAGAACCCCGGGGCTCTCCTGTTCTCTCACATTTAAAAATTCCATTACTCCATTT
ACAGGAGCGTTCCTAGGAAAAGGAATTTTAGGAGGAGAATTTGTGAGCAGTGAATCTGACAGCCCAGGAGGTGGGCTCGCTGATAGGCATGACTTTCCTT
AATGTTTAAAGTTTTCCGGGCCAAGAATTTTTATCCATGAAGACTTTCCTACTTTTCTCAGTGTTCTTATATTACCTACTGTTAGTATTTATTGTTTACC
ACTATGTTAATGCAGGGAAAAGTTGCACGTGTATTATTAAATATTAGGTAGAAATCATACCATGCTACTTTGTACATATAAGTATTTTATTCCTGCTTTC
GTGTTACTTTTAATAAATAACTACTGTACTCAATACTCTAAAAATACTATAACATGACTGTGAAAATGGCAATGTTATTGTCTTCCTATAATTATGAATA
TTTTTGGATGGATTATTAGAATACATGAACTCACTAATGAAAGGCATTTGTAATAAGTCAGAAAGGGACATACGATTCACATATCAGACTGTTAGGGGGA
GAGTAATTTATCAGTTCTTTGGTCTTTCTATTTGTCATTCATACTATGTGATGAAGATGTAAGTGCAAGGGCATTTATAACACTATACTGCATTCATTAA
GATAATAGGATCATGATTTTTCATTAACTCATTTGATTGATATTATCTCCATGCATTTTTTATTTCTTTTAGAAATGTAATTATTTGCTCTAGCAATCAT
TGCTAACCTCTAGTTTGTAGAAAATCAACACTTTATAAATACATAATTATGATATTATTTTTCATTGTATCACTGTTCTAAAAATACCATATGATTATAG
CTGCCACTCCATCAGGAGCAAATTCTTCTGTTAAAAGCTAACTGATCAACCTTGACCACTTTTTTGACATGTGAGATCAAAGTGTCAAGTTGGCTGAGGT
TTTTTGGAAAGCTTTAGAACTAATAAGCTGCTGGTGGCAGCTTTGTAACGTATGATTATCTAAGCTGATTTTGATGCTAAATTATCTTAGTGATCTAAGG
GGCAGTTTAGTGAAGATGGAATCTTGTATTTAAAATAGCCTTTTAAAATTTGTTTTGTGGTGATGTATTTTGACAACTTCCATCTTTAGGAGTTATATAA
TCACCTTGATTTTAGTTTCCTGATGTTTGGACTATTTATAATCAAGGACACCAAGCAAGCATAAGCATATCTATATTTCTGACTGGTGTCTCTTTGAGAA
GGATGGGAAGTAGAAAAAAAAAAAAGAAAGAAAGGAAAGGAAGAGAGGAGAGAAGAAGGCAGGGATCTCCACTATGTATGTTTTCACTTTAGAACTGTTG
AGCCCATGCTTAATTTTAATCTAGAAGTCTTTAAATGGTGAGACAGTGACTGGAGCATGCCAATCAGAGAGCATTTGTCTTCAGAAAAAAAAAAAATCTG
AGTTTGAGACTAGCCTGGCCAACATGTTGAAACCCCATATCTACTAAAAATACAAAAATTAGCCTGGTGTGGTGGCGCACGCCTGTAGTCCCAGCTACTC
TGGAGCCTGAGGAACGTGAATCGCTTGAACCCAGAAGACAGAGGTTGCAGTGAGCTGAGATGGCACTATTGCACTCCAGCCTGGGTGACACAGCAAGACT
CTGTCTCAAAAAAAAAAAAAAAAAAAAGGAAAAAAAAGAAAGAAAGAAAGTCCCAGCACACCTAGATAATTTACCGAGCTCTTCAGCAAAAACCATGTTA
CATACAGCATATTCCAAAGAAATGAACTCTTCTGCAATTTAAATTATAAGTAATATGTTATTTTGGATCCTAGAGAAACCATTTTCTCTACATTTCATGA
GCATGGTTAGAAAAGAGTTTACAAGAATTAGGAAGAGGGAACAATTTTAATGGTCAGAAAAGAATAAAATTTATTCTAGTTCAAGAAGTGCACACAAAGA
ATATGCATTAATCTAACAACTATGAGATTAAATCTTTCAAAAAGGTCAAAGGAGGATTGAGAAGTTTACAGAGATGTCCACGGCATTTTATATCAATCTC
AAAGGTAAGGTCTGCATTTTTATAAACCAACTTAAACTTCTGTTGAGATAGGATATTTTGTTTTCAAGCCAAAATTACCATTAATCAAATATGTTTTAAT
TATCTGATTTAGATGATCTACTTTTTATGCCTGGCTTACTGTAAGTTTTTTATTCTGATACACAGTTCAAACATCATTGCAACAAAGAAGTGCCTGTATT
TAGATCAAAGGCAAGACTTTCTATGTGTTTGTTTTGCATAATAATATGAATATAATTTAAGTCTATCAATAGTCAAAACATAAACAAAAGCTAATTAACT
GGCACTGTTGTCACCTGAGACTAAGTGGATGTTGTTGGCTGACATACAGGCTCAGCCAGCAGAGAAAGAATTCTGAATTCCCCTTGCTGAACTGAACTAT
TCTGTTACATATGGTTGACAAATCTGTGTGTTATTTCTTTTCTACCTACCATATTTAAATTTATGAGTATCAACCGAGGACATAGTCAAACCTTCGATGA
TGAACATTCCTGATTTTTTGCCTGATTATTCTCTGTTGAGCTCTACTTGTGGTCATTCAAGATTTTATGATGTTGAAAGGAAAAGTGAATATGACCTTTA
AAAATTGTATTTTGGGTGATGATAGTCTCACCACTATAAAACTGTCAATTATTGCCTAATGTTAAAGATATCCATCATTGTGATTAATTAAACCTATAAT
GAGTATTCTTAATGGAGAATTCTTAATGGATGGATTATCCCCTGATCTTTTCTTTAAAATTTCTCTGCACACACAGGACTTCTCATTTTCCAATAAATGG
GTGTACTCTGCCCCAATTTCTAGGGAA
***PROTEIN INFORMATION***
Molecule type: protein
Protein accession number: NP_001153166
Protein length: 248 aa
Protein description: endomucin isoform 2 precursor [Homo sapiens]
Molecular weight: 24076 Da
Number of protein features: 5
Protein sequence:
MELLQVTILFLLPSICSSNSTGVLEAANNSLVVTTTKPSITTPNTESLQKNVVTPTTGTTPKGTITNELLKMSLMSTATFLTSKDEGLKATTTDVRKNDS
IISNVTVTSVTLPNAVSTLQSSKPKSSVLQPDASPSKTGTLTSIPVTIPENTSQSQVIGTEGGKNASTSATSRSYSSIILPVVIALIVITLSVFVLVGLY
RMCWKADPGTPENGNDQPQSDKESVKLLTVKTISHESGEHSAQGKTKN
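
The record does not list coding-sequence coordinates, but the mRNA and protein sections can be cross-checked against each other. The sketch below is illustrative only. It assumes that translation starts at the first ATG in the transcript and that the standard genetic code applies. It translates the first 200 nt of the mRNA sequence (copied from the record) and estimates where the coding region should end from the stated protein length. The helper name `translate` is hypothetical.

```python
from itertools import product

# First 200 nt of the mRNA sequence, copied from the record.
MRNA_PREFIX = (
    "ACATTCACTACACCTTTTCCATTTGCTAATAAGGCCCTGCCAGGCTGGGAGGGAATTGTCCCTGCCTGCTTCTGGAGAAAGAAGATATTGACACCATCTA"
    "CGGGCACCATGGAACTGCTTCAAGTGACCATTCTTTTTCTTCTGCCCAGTATTTGCAGCAGTAACAGCACAGGTGTTTTAGAGGCAGCTAATAATTCACT"
)
PROTEIN_LENGTH = 248  # residues, as stated in the record

# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G
# with the third codon position varying fastest.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}


def translate(seq, start, n_codons):
    """Translate n_codons codons of seq, beginning at 0-based offset start."""
    codons = (seq[i:i + 3] for i in range(start, start + 3 * n_codons, 3))
    return "".join(CODON_TABLE[codon] for codon in codons)


# Assumption: the first ATG in the transcript is the start codon.
start = MRNA_PREFIX.find("ATG")

# Translate every complete codon available in the excerpt.
n_codons = (len(MRNA_PREFIX) - start) // 3
peptide = translate(MRNA_PREFIX, start, n_codons)

# Coding region = one codon per residue plus one stop codon (half-open end).
cds_end = start + 3 * (PROTEIN_LENGTH + 1)

print(start, peptide, cds_end)
```

With this excerpt the start codon sits at offset 108 and the translated peptide matches the first 30 residues of the protein sequence above. The predicted end of the coding region, 855, falls inside exon 10 ([820:894]).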