Login
Help

TRANSCRIPT CARD

Submit your Data

  1. Transcript 'Harore.CG.MTP2014.S1...'

Transcript Model

Transcript Id

Harore.CG.MTP2014.S1132.g00063.01.t

Possible name(s)

MUC5AC; MUC5B; OGFR

Location

S1132 [0 / 7,567]

Sequences

Amino acid sequence

Length: 2,114

>Harore.CG.MTP2014.S1132.g00063.01.p
DRSCYSLEISVFCLCSPTTASSTTEQTEPTETVPTTTTEPEIDTTTGKTSTVKSTPVTTT
TGIGESTTGPTTPGCSEGWTDWINNGEPTNDMENGGDFEDISAACEMGVPADIQCVCAAS
GENSSTTGDVVTCDTDVGLSCENSEQPDGVPCLDYEIKVYCVCDNTTTTTGVDTEPTSAT
TATPSTEETEPTQTVPTTTTEPKIDTTTGKTSTVKSTPAVSSTTSETVESTTAPTTVSTT
TPCMDGWTDWEEVNPDDFNSEGPNEEIFDEFQNEYAMCVDEKIIRIECRITDLTQSAVSA
RVEYPFVCQNINDLICSGDESCFLLEIRLFCICGTTPAPTTKEPTTTEITETSSTVGITT
TPQKADTTTPKKTTPVVTSTTRTTEGTTTGMTSPVGITTTLPVGETTTSGITSPVGSTTT
LPVGDTTTKEPTTTEDTELTSTAGTTTPQKVDTTTPKKTTPKIETTSKYPENCTNSGWSG
WMNTNNPEDNIIQFDDDTFDMLRQKYMFCADDQITNVKCRHANTESNDIPNAHSANGAVC
SIDFGSTCSDVDQPDGKTCYDFEIKNRPPQKNPPQLQSALKVKYGTNVHTIAPHIVNMLS
CNSKYIIVECKDGWKLNDNGVCVKPEDCGCVNDDGTIIKPGETVISDNGDKCLCFGNELD
CTSSTTPTSSTTPPNVDTTTIKKSTPVTTKPLSTTSSTIDTSTPTEKTTSSISPTTTTAE
TTSGSTTATPPPTTAPCTQTGCLNSGWTHWMTVENPGTFQPIVDGSFTSPQGIRDEYEEF
CDDEFVTNVLCRVTDITDVPKKRTNYGYICTEVSNLNCGNSEDPYGCFKLEIKVECNCCI
TQTTVPQVYTTTSKSTPKSTPKTTPSSSTPVETTTTPPSSSPGSTPSSTPVETTTTPPSS
LPGSTHSTTPVETTTTPPSSSPGSTTSSTPVETTTTPPSSSPGSTPSSTPVETTTTSPSS
TPGSTPSSTPEKTTTTPSGSTPSSTPGESTTTIPVTPTTVPDCDVYDQEWTDWMEVKNPS
DFEPVDGEFISPEGLRDDYSFCADEYVTEVMCRVTDNGAFTNNRAKYGYVCSQIASLDCS
GYDSPCYDVEIKVYCDCTSTSTTTTPLKNPTTTPKKSTSTSSICIDGWTDWYNENHPNNS
LSSGDYESIEEITECSSGMITEVKCVCASSDEDYTETDDVMKCDTTIGLECNNADQIHDL
ACLDYKLSAKCQCSSTTPIVTTTTTPKPTTTPEITTPYDDCIEWFEWNSVSNPSSETGND
IEDINDLIAESECKDPVAVECRVVEFNISSSSSGQAGVTCDLENGLICLSSKLRGVIRKC
YDYELRLGCLKEECKSTTSSVTSIPTTPSSESTTEYTTTTPETTTFWSNPCPNMPPDNSC
LECNATTHCYKGQCILPEDCPCVIDGKIHEEGTLWNEDCSVCTCFNSAKKCQPKTCSLVD
SSDCPSGEVYQEPKEGECCGTCVACICTENERQCVDTCECYPTEFWCDGEPDCSGGEDEM
DCTSTTTTPPKTTPPKESTFCIVDGKEYPVGEVIYIQNCFSYICIEGGEIEKTPIPGCGS
TTTTHSTSTEKTTPNTTTAPLKGCIYDEHYYPIGAVVDKGECFELICEESGKTVNHTYLD
CGYTTTRQTPTTTPNGCIYEGQMYPPGVTISKDVCSVTVCTDNSEVMTTDTCTTYTTLSS
TTTTPDHSSTTTSSGCVYEGNMYPIGTVIEQGNCYEITCNTQGNVIRDDYMDCTTGTSTT
PQKEPSTPSIPYCIYEGEKYPLNTTIVDDECNGVYCDVNGNVHYRTCISTTTTTGPTTTG
PTTTPHGCEYNGQMFPPGSMIDEGLCYIVICDSNSEVIHGDKPCTTTAESPTTTPVSPTM
TPISCEYNGQMYPPGTTIEKGECHIVLCDDSGNVIVGDFICTTTGKSTTGPSTTTSVVFC
IYNNTMYEPGSLIEEGACYTIVCLDDGTVEHEEKECTTTGTSTTPQKEPSTPSIPYCIYE
GEKYPLNTTIVDDECNGVYCDVNGNVHYRTCISTTTTTGPTTTGPTTTPHGCEYNGQMFP
PGSMIDEGLCYIVICDSNSEVIHGDKPCTTTAESPTTTPVSPTMTPISCEYNGQMYPPGT
TIEKGECHIVLCDD

Nucleotide sequence

Length: 6,344

>Harore.CG.MTP2014.S1132.g00063.01.t
AAGACAGATCTTGCTATTCATTGGAAATCAGTGTATTCTGTTTATGCTCTCCGACAACTG
CTTCCTCGACTACTGAGCAAACAGAACCAACAGAAACTGTTCCCACAACAACCACAGAAC
CTGAGATCGACACCACAACAGGTAAAACATCCACTGTGAAATCGACACCAGTAACCACTA
CTACAGGAATTGGTGAATCGACTACTGGACCTACAACGCCCGGCTGCTCAGAGGGATGGA
CCGACTGGATCAACAACGGAGAACCAACTAACGATATGGAGAATGGAGGTGACTTTGAGG
ATATTTCTGCTGCATGTGAAATGGGTGTTCCAGCTGACATACAATGTGTATGTGCTGCAA
GTGGAGAGAATTCATCTACTACTGGGGATGTGGTAACGTGCGATACTGACGTTGGTCTTT
CCTGTGAAAATTCCGAGCAGCCTGACGGGGTTCCGTGTCTGGATTATGAAATTAAAGTTT
ATTGTGTATGTGATAATACGACTACGACAACTGGGGTAGATACTGAGCCGACTTCAGCAA
CCACAGCTACTCCAAGTACAGAAGAAACAGAACCTACGCAAACTGTTCCTACAACAACCA
CAGAACCTAAGATTGATACAACAACTGGGAAAACATCCACAGTGAAATCAACACCAGCAG
TTTCTAGCACAACCTCAGAAACTGTTGAATCGACTACTGCACCAACAACAGTATCTACTA
CAACACCATGCATGGACGGTTGGACAGACTGGGAAGAAGTGAACCCAGATGATTTCAATT
CGGAAGGACCAAATGAGGAAATCTTTGATGAGTTTCAAAACGAATATGCTATGTGTGTTG
ATGAGAAAATTATTAGAATTGAATGTCGTATTACTGACCTAACCCAAAGTGCAGTGAGTG
CCCGCGTTGAATACCCATTCGTTTGCCAGAATATCAATGACTTGATTTGTTCCGGGGATG
AATCTTGCTTCCTTTTAGAAATCAGGTTATTCTGCATCTGTGGAACAACACCTGCCCCAA
CTACTAAAGAGCCGACCACTACGGAAATCACAGAAACCTCTTCCACCGTTGGAATAACCA
CCACACCGCAAAAGGCTGATACAACAACACCCAAGAAAACTACTCCTGTAGTGACGTCCA
CCACTCGTACCACAGAAGGTACTACAACAGGAATGACAAGCCCAGTTGGAATTACCACAA
CACTACCAGTAGGTGAAACAACTACATCGGGAATAACAAGCCCAGTTGGAAGTACAACAA
CACTACCAGTAGGTGATACAACTACTAAAGAGCCTACAACTACAGAAGATACAGAACTTA
CTTCTACCGCTGGCACAACAACCCCGCAAAAGGTTGATACAACAACACCAAAGAAAACTA
CTCCTAAAATAGAAACGACATCTAAGTACCCTGAAAATTGTACAAATAGTGGTTGGTCTG
GATGGATGAATACAAACAACCCAGAAGACAATATTATACAGTTTGATGATGATACATTTG
ATATGTTGAGACAAAAGTACATGTTTTGTGCAGATGATCAAATTACTAACGTGAAGTGTC
GCCATGCAAATACAGAATCTAATGATATTCCAAATGCCCATTCAGCAAATGGTGCTGTTT
GTAGCATTGATTTCGGTTCGACATGTTCCGACGTAGACCAACCTGATGGGAAAACGTGTT
ATGATTTTGAAATCAAAAATCGACCACCTCAAAAAAACCCACCACAACTTCAATCTGCCC
TGAAGGTGAAATATGGAACCAATGTGCATACAATTGCACCTCATATTGTCAATATGTTGT
CATGCAACAGCAAATACATTATTGTGGAATGCAAGGATGGGTGGAAATTGAATGACAATG
GTGTCTGTGTGAAACCAGAGGACTGTGGATGTGTGAATGATGACGGTACAATAATTAAAC
CTGGAGAGACCGTAATATCAGATAATGGTGATAAATGTCTCTGCTTTGGTAATGAACTGG
ATTGTACATCATCTACAACACCAACAAGTTCTACAACTCCACCGAATGTTGATACGACTA
CCATAAAAAAATCAACCCCAGTTACTACGAAACCGCTCAGTACCACATCAAGTACAATTG
ATACAAGTACTCCAACAGAAAAAACAACATCTAGTATTTCACCAACAACGACAACTGCTG
AAACCACATCAGGCTCTACTACAGCCACCCCACCCCCTACAACTGCACCTTGTACACAGA
CAGGCTGCCTAAATAGTGGATGGACACACTGGATGACAGTAGAAAATCCAGGTACATTCC
AGCCTATCGTTGATGGGAGTTTTACGTCACCACAAGGTATTCGGGATGAATATGAAGAAT
TCTGTGATGATGAGTTTGTAACCAATGTGCTGTGTCGTGTTACTGATATAACAGATGTTC
CCAAAAAACGCACCAATTATGGTTATATTTGCACAGAAGTAAGTAATTTAAATTGTGGAA
ATAGTGAAGATCCATATGGATGCTTCAAACTTGAAATAAAGGTTGAATGCAACTGTTGCA
TTACACAAACAACGGTGCCACAAGTATATACAACTACATCAAAAAGTACACCAAAATCTA
CGCCTAAAACTACTCCAAGCTCTTCTACTCCTGTGGAAACGACTACTACTCCACCAAGCT
CCTCGCCAGGCTCAACTCCCTCTTCTACTCCTGTGGAAACGACTACAACTCCACCAAGCT
CCTTGCCAGGCTCAACTCACTCTACTACTCCTGTGGAAACGACTACAACTCCACCAAGCT
CCTCGCCAGGCTCAACTACCTCTTCTACTCCTGTGGAAACAACTACGACTCCACCAAGCT
CCTCGCCAGGCTCAACACCTTCTTCTACTCCTGTGGAAACGACTACAACTTCACCGAGCT
CCACACCAGGTTCAACGCCCTCATCTACTCCCGAGAAAACAACTACAACTCCATCGGGCT
CAACGCCCTCTTCTACTCCAGGAGAATCGACTACTACAATACCAGTTACACCAACTACTG
TTCCAGATTGTGATGTGTATGATCAAGAATGGACTGATTGGATGGAAGTGAAAAATCCAT
CAGACTTCGAACCAGTCGATGGTGAATTCATTTCCCCAGAAGGATTAAGAGATGACTATT
CTTTCTGTGCTGATGAATATGTCACTGAAGTTATGTGTCGCGTTACAGACAATGGTGCTT
TTACCAACAACAGGGCAAAATATGGATACGTTTGCTCTCAAATTGCTAGCCTTGATTGTT
CAGGATATGATAGTCCATGCTATGATGTAGAAATAAAAGTCTACTGTGACTGCACTTCTA
CATCAACAACAACAACACCTCTGAAAAATCCAACAACAACACCCAAGAAATCCACATCAA
CTTCATCGATCTGCATCGATGGGTGGACAGACTGGTATAATGAAAATCATCCAAACAACT
CATTAAGTTCTGGAGATTATGAAAGTATTGAAGAAATTACAGAATGTTCAAGTGGAATGA
TAACTGAAGTTAAATGCGTATGTGCAAGCAGTGACGAAGACTATACTGAAACTGATGACG
TCATGAAATGTGATACAACCATTGGTTTAGAATGCAACAATGCCGATCAAATCCATGACC
TTGCATGTCTTGACTATAAATTAAGCGCTAAATGCCAATGCTCGTCTACAACACCAATAG
TTACGACAACAACAACACCAAAACCAACAACAACTCCTGAAATTACGACGCCATATGATG
ACTGCATTGAATGGTTTGAATGGAACAGCGTCTCTAATCCCAGCTCTGAGACAGGCAATG
ATATTGAGGATATAAATGATCTCATAGCAGAGTCTGAATGTAAGGATCCCGTTGCAGTGG
AATGTCGAGTTGTAGAATTCAACATTTCCTCATCCAGTTCGGGACAAGCTGGCGTAACTT
GTGACTTGGAGAATGGGTTAATATGTTTATCGTCGAAACTAAGAGGAGTCATTCGAAAGT
GCTATGATTATGAACTACGACTAGGATGCCTCAAAGAAGAATGCAAATCAACCACTTCAT
CCGTTACATCAATTCCTACAACACCATCAAGCGAATCAACTACTGAATACACAACAACAA
CACCAGAAACAACAACATTCTGGTCAAACCCATGCCCGAACATGCCTCCTGACAATTCCT
GCCTTGAGTGCAACGCAACTACTCATTGCTATAAAGGTCAATGCATTCTACCAGAAGACT
GTCCTTGTGTGATAGATGGCAAAATACACGAGGAAGGAACACTTTGGAATGAAGACTGCA
GTGTCTGCACATGCTTCAATAGTGCTAAAAAATGCCAGCCTAAAACATGTAGCTTGGTAG
ACTCTAGTGATTGTCCTAGTGGGGAAGTATATCAAGAGCCAAAGGAAGGTGAATGCTGTG
GTACTTGTGTTGCTTGTATTTGCACTGAAAATGAACGACAATGCGTTGACACTTGCGAAT
GCTATCCAACTGAATTTTGGTGTGATGGAGAACCAGACTGTTCTGGAGGAGAAGATGAAA
TGGACTGTACATCAACAACAACAACACCACCAAAAACAACACCTCCAAAAGAATCAACTT
TTTGCATTGTTGACGGCAAGGAGTATCCTGTCGGTGAGGTAATATACATACAGAATTGCT
TTTCATATATATGCATAGAAGGTGGAGAGATAGAAAAAACACCAATACCTGGATGTGGAA
GCACTACCACAACACACTCAACTTCAACTGAAAAGACAACACCAAACACAACTACAGCAC
CTCTGAAAGGATGCATATATGATGAACACTATTATCCAATCGGTGCAGTAGTCGACAAAG
GCGAATGCTTTGAATTGATTTGCGAAGAATCTGGTAAAACAGTAAATCACACATATTTGG
ATTGTGGATATACAACAACAAGGCAAACACCAACAACAACTCCAAATGGCTGTATTTATG
AAGGTCAAATGTATCCACCTGGCGTGACTATTTCTAAAGATGTATGCAGTGTTACTGTTT
GTACTGATAATAGTGAAGTAATGACTACAGATACATGTACCACTTATACCACACTCTCTT
CCACAACCACAACTCCAGATCACTCAAGCACTACAACCTCAAGTGGTTGTGTTTACGAAG
GAAATATGTATCCTATCGGTACTGTTATTGAACAAGGTAATTGTTATGAAATAACATGCA
ATACACAAGGGAATGTTATCAGAGATGACTATATGGATTGTACCACAGGAACAAGTACAA
CACCACAAAAAGAACCATCCACTCCATCCATCCCATATTGTATCTATGAAGGTGAAAAAT
ACCCACTCAATACAACAATTGTTGATGATGAATGTAATGGGGTATATTGTGATGTGAATG
GAAATGTTCATTATAGAACATGTATTTCGACAACAACCACAACAGGCCCAACCACAACAG
GCCCAACAACAACTCCACACGGATGTGAATACAATGGTCAAATGTTTCCTCCAGGTTCCA
TGATTGATGAAGGATTATGCTACATCGTTATCTGCGATTCTAACAGTGAGGTCATACATG
GTGACAAACCATGCACAACTACAGCGGAATCACCAACTACGACACCGGTATCACCAACTA
TGACACCTATATCTTGCGAATACAATGGTCAAATGTATCCACCTGGTACTACCATAGAGA
AAGGAGAATGTCACATAGTTTTGTGTGATGATAGTGGTAATGTAATTGTTGGAGATTTCA
TATGTACAACTACTGGAAAATCGACAACAGGTCCTTCAACAACCACTTCTGTAGTTTTCT
GCATCTACAATAATACAATGTATGAACCTGGATCATTAATCGAAGAAGGTGCTTGCTACA
CTATTGTCTGTTTGGATGATGGCACAGTCGAGCATGAAGAGAAAGAGTGTACTACCACAG
GAACAAGTACAACACCACAAAAAGAACCATCCACTCCATCCATCCCATATTGTATCTATG
AAGGTGAAAAATACCCACTCAATACAACAATTGTTGATGATGAATGTAATGGGGTATATT
GTGATGTGAATGGAAATGTTCATTATAGAACATGTATTTCGACAACAACCACAACAGGCC
CAACCACAACAGGCCCAACAACAACTCCACACGGATGTGAATACAATGGTCAAATGTTTC
CTCCAGGTTCCATGATTGATGAAGGATTATGCTACATCGTTATCTGCGATTCTAACAGTG
AGGTCATACATGGTGACAAACCATGCACAACTACAGCGGAATCACCAACTACGACACCGG
TATCACCAACTATGACACCTATATCTTGCGAATACAATGGTCAAATGTATCCACCTGGTA
CTACCATAGAGAAAGGAGAATGTCACATAGTTTTGTGTGATGAT

InterProScan

Pfam
WxxW_domain (IPR025155) - T[78-161] 7.8E-16 - T[246-331] 1.4E-4 - T[477-565] 3.8E-10 - T[746-836] 1.3E-5 - T[1009-1095] 2.9E-7 - T[1127-1211] 2.1E-12 - T[1243-1329] 4.0E-15
SMART
VWF_dom (IPR001007) - T[1402-1462] 3.0E-9 - T[1402-1467] 0.0074
Pfam
VWF_dom (IPR001007) - T[1402-1462] 9.3E-8
ProSiteProfiles
VWF_dom (IPR001007) - T[1402-1463] 11.982
ProSitePatterns
VWF_dom (IPR001007) - T[1419-1462] .
ProSiteProfiles
LDrepeatLR_classA_rpt (IPR002172) - T[1466-1503] 11.375
SUPERFAMILY
LDL_receptor-like_sf (IPR036055) - T[1466-1507] 6.78E-6
SMART
LDrepeatLR_classA_rpt (IPR002172) - T[1466-1504] 4.7E-5
CDD
LDrepeatLR_classA_rpt (IPR002172) - T[1467-1502] 9.42858E-6

Best Blast Hits in UniProt
Protein Name Identity Bit Score e-value
MUC5B_HUMAN 25 % 85.5 3.88E-16