Login
Help

TRANSCRIPT CARD

Submit your Data

  1. Transcript 'KH2012:KH.C10.190.v1...'

Transcript Model

Transcript Id

KH2012:KH.C10.190.v1.A.SL1-1

Possible name(s)

HMCN1; PRSS33; TPSB2

Location

KhC10 [3,700,359 / 3,708,535]

Sequences

Amino acid sequence

Length: 1,153

>KH.C10.190.v1.A.SL1-1
YRAFYTDFVCSIAPKKALVLVFITAVPVDKMGKYGFLLITSAIVVLYNTVEVSSQCQSIG
CGDCYTWCAPVHCSYPTGQLYCRATCGSCPNEQCRAAGCSHTCNPNTNPVTCTCNAGYEL
QADGKTCLDINECERSSNLCTNPTSPRCKNTPGSYICTGCGADSNTVWQYYNRTECCKID
TASQCGTSSTTGGRIVGGELAAYIKIGSTLCTGALIKRNVVLTAAHCLANRVTNLDELRI
YLGVQNISDTGNIHSQHIAALRYVQHPNFDSSTLENDIAIIFLRTEATIGDYVNTICLPN
GEQIAEGTKCWATGYGAISEGGPLSQTLRQVALPIANSQTCVQNYARISRTVNPVKTMCA
GYEQGGRDSCQGDSGGPLVCQRCNSCNWFLAGLVSFGRGCARVGMFGIYSRMTYFEQWIA
SQTGMNHSPRSCVRPSWSNWSAWSQDCPSCGAGTRTKTRSCTNGVVGDPGCDGSATMTGS
CPNNPCSSGSWGDYQQWSTCTVSCGGGTRTRTRSCNGGSVGSIGCPGEESQTEPCNTYNC
PGWNNWGNWGECTASCGGGTREATRTCNTFGQAGATCSGDATKSEACNTTPCPTWGEWQW
DSCSNTCGGGTRTGTRTCNKHGGTLECTGSATTSESCGNAACVGASWGDFGAWSACTASC
NGGTRTRTRSCNQGSIGSNGCPSGGESELEPCNTFGCPTWNDMVWGDCSVTCGGGTRTGT
RTCNRNGGTADCVGSNTVTGVCGAAQCATSSCVDTLGDCSDYSSLCSSLAHQSLLQSICP
QTCGFCGSSTGSWDEWTNSGGCSLTCGGGTQQQTRTCTGGTAGAGGCPGSSTQTIACNQQ
ACPPQGSWSGWSNSGTCTVTCGGGTQQQIRTCNGGTAGAGGCPGSSTQTIACNQQACPPQ
GSWGGWSNSGTCSLTCGGGTQQQIRTCNGGTAGAGGCPGSSTQTIACNQQACPPQGSWGG
WSNSGTCSLTCGGGTQQQIRTCNGGTAGAGGCPGSTTQTIACNGQACPSSGSWGGWINVG
TCSTSCMQAQTRQCNGGTAGQGGCSGLSSRIQSCTGGACPTQTGSCSNLRDLQAPATCRN
WASFNYCTNYAGYMLANCAKSCCERNAGASSTACSTIFDSFGTWCTATTLDCSNAYILYV
CARTCNPLCSAAG

Nucleotide sequence

Length: 3,592

>KH2012:KH.C10.190.v1.A.SL1-1
ATATAGGGCGTTTTATACTGACTTTGTCTGCTCCATTGCACCAAAAAAAGCGTTAGTCTT
AGTATTTATAACTGCTGTCCCAGTTGACAAAATGGGAAAATATGGATTTTTATTGATCAC
CTCAGCTATTGTTGTGTTATATAACACGGTTGAAGTTTCTTCGCAATGCCAATCAATTGG
ATGTGGCGATTGTTATACGTGGTGTGCGCCTGTACACTGCAGTTACCCAACCGGTCAGCT
ATATTGCAGAGCTACGTGTGGAAGCTGTCCAAATGAACAATGCAGGGCAGCTGGATGTAG
CCATACTTGTAACCCCAACACCAATCCAGTAACCTGTACATGTAACGCGGGGTACGAGCT
GCAAGCGGACGGAAAAACTTGTCTTGATATAAACGAATGCGAGAGAAGTAGTAACCTGTG
TACCAATCCAACGTCGCCACGATGCAAGAACACACCGGGGTCCTACATCTGTACAGGTTG
CGGGGCAGACTCGAACACTGTTTGGCAATATTATAACAGAACAGAATGCTGTAAGATAGA
CACAGCTTCACAATGCGGTACAAGTTCAACAACAGGTGGTCGAATCGTAGGAGGCGAGTT
AGCAGCGTACATTAAAATCGGTTCAACATTGTGCACGGGCGCTCTAATTAAACGGAACGT
CGTATTGACAGCAGCTCACTGTCTCGCAAACCGCGTTACAAATCTAGACGAGCTTAGAAT
ATATCTTGGGGTCCAGAATATTAGCGATACGGGAAACATTCATTCTCAACACATTGCTGC
TCTGAGGTATGTGCAACACCCTAACTTCGACTCGAGTACTTTGGAAAACGATATCGCAAT
AATTTTCTTGCGGACGGAAGCGACCATTGGCGATTACGTCAATACTATTTGTTTGCCCAA
CGGAGAGCAAATCGCGGAAGGAACAAAATGTTGGGCAACTGGTTACGGAGCTATAAGCGA
AGGTGGCCCTCTTTCTCAAACGCTTCGACAGGTTGCTCTGCCCATAGCGAACTCCCAAAC
ATGCGTTCAAAATTACGCCAGGATTTCGCGAACCGTTAATCCAGTCAAAACTATGTGCGC
TGGATACGAACAGGGTGGAAGAGACTCGTGCCAAGGTGATTCGGGCGGCCCATTAGTATG
TCAGCGGTGCAACAGTTGTAACTGGTTTCTGGCTGGTTTGGTATCATTCGGCAGAGGATG
TGCCCGAGTAGGGATGTTTGGTATATACAGTAGGATGACATATTTTGAACAGTGGATCGC
ATCTCAAACAGGAATGAACCACAGCCCTCGATCGTGCGTTAGACCATCATGGTCTAATTG
GAGCGCATGGTCCCAGGATTGCCCTAGTTGTGGGGCAGGAACGAGAACTAAAACCAGGAG
TTGTACAAATGGTGTGGTTGGTGACCCAGGGTGCGATGGGTCCGCTACTATGACCGGAAG
TTGCCCAAACAACCCGTGTTCAAGCGGCAGCTGGGGCGATTACCAACAGTGGTCCACATG
CACGGTTAGCTGTGGAGGAGGTACAAGAACCAGAACTCGAAGCTGTAATGGTGGTTCGGT
TGGTAGCATCGGATGTCCGGGAGAAGAATCTCAAACCGAGCCATGCAACACTTACAATTG
CCCGGGTTGGAACAACTGGGGCAATTGGGGTGAATGTACTGCATCTTGCGGAGGTGGGAC
AAGAGAAGCGACCAGAACATGCAACACCTTTGGACAGGCGGGGGCTACCTGTTCTGGCGA
TGCCACAAAATCTGAAGCATGTAACACAACACCTTGTCCAACTTGGGGTGAATGGCAATG
GGATTCATGCAGTAATACATGCGGTGGGGGCACAAGAACTGGCACACGGACGTGTAATAA
ACACGGTGGGACATTGGAATGTACTGGATCAGCAACTACATCGGAGAGTTGTGGAAATGC
TGCATGTGTAGGTGCTTCGTGGGGCGATTTTGGTGCGTGGTCCGCTTGTACAGCCAGTTG
CAACGGTGGTACACGAACGCGCACAAGGTCTTGCAACCAAGGGTCAATCGGGAGCAATGG
ATGTCCTTCTGGCGGCGAATCTGAGTTGGAACCATGCAATACGTTTGGTTGTCCAACTTG
GAACGACATGGTATGGGGCGATTGTTCCGTAACTTGTGGGGGTGGAACACGGACAGGTAC
AAGGACTTGCAACAGGAACGGCGGGACAGCTGATTGCGTGGGATCGAATACAGTAACAGG
AGTTTGCGGCGCTGCACAATGTGCAACCTCGTCATGTGTGGACACATTGGGTGATTGTAG
TGATTACTCCTCTCTTTGTTCTTCTCTAGCTCATCAGTCATTATTGCAAAGTATCTGCCC
ACAAACATGTGGATTTTGTGGATCTTCAACTGGTAGTTGGGATGAGTGGACAAACAGTGG
AGGTTGTTCACTGACTTGTGGAGGCGGTACACAGCAACAGACAAGAACCTGCACGGGTGG
AACTGCAGGCGCTGGTGGTTGTCCTGGATCTTCAACACAAACGATTGCTTGTAACCAACA
AGCTTGCCCCCCACAGGGCAGTTGGAGTGGCTGGTCAAATAGTGGGACTTGTACAGTAAC
TTGTGGTGGTGGCACTCAACAGCAGATAAGAACATGCAATGGTGGAACTGCAGGCGCTGG
TGGTTGTCCTGGATCTTCAACACAAACGATTGCTTGTAACCAACAAGCTTGCCCCCCACA
GGGCAGTTGGGGTGGCTGGTCAAATAGTGGGACTTGTTCATTGACTTGTGGTGGTGGCAC
TCAACAGCAGATAAGAACATGCAATGGTGGAACTGCAGGCGCTGGTGGTTGTCCTGGATC
TTCAACACAAACGATTGCTTGTAACCAACAAGCTTGCCCCCCACAGGGCAGTTGGGGTGG
CTGGTCAAATAGTGGGACTTGTTCATTGACTTGTGGTGGTGGCACTCAACAGCAGATAAG
AACATGCAATGGTGGAACTGCAGGCGCTGGTGGTTGTCCTGGATCTACAACACAAACGAT
TGCCTGCAATGGGCAGGCTTGCCCATCAAGTGGGTCCTGGGGTGGCTGGATAAATGTTGG
GACTTGTTCTACAAGCTGCATGCAAGCACAGACGCGTCAGTGCAATGGTGGAACAGCAGG
TCAAGGTGGATGTTCTGGCTTGTCCTCACGAATACAATCATGTACGGGTGGAGCATGTCC
AACACAGACCGGATCATGCAGCAATTTAAGGGACTTACAAGCACCTGCAACCTGCCGTAA
CTGGGCTAGCTTCAACTACTGTACGAATTATGCGGGTTACATGTTAGCTAACTGCGCTAA
ATCCTGCTGTGAACGAAATGCAGGTGCATCTTCTACTGCTTGTAGTACAATCTTTGATTC
ATTTGGTACTTGGTGCACGGCAACAACTTTAGACTGTAGTAATGCATATATATTGTATGT
GTGTGCAAGAACATGCAATCCACTCTGCAGTGCTGCTGGTTGAAGTGTCTTCTATTCAGA
TTAATAACTGTTTTGAACTGCTGACCACATCCTCTGAAGCACAAACACAAACATATTGCT
TTGGAAGGAATAAGAGCGCTAAACTATATCCTCCCAATTTTAACATTGCCAT

InterProScan

SMART
ShKT_dom (IPR003582) - T[60-90] 11.0 - T[751-787] 8.1E-5 - T[1065-1104] 2.9
ProSiteProfiles
ShKT_dom (IPR003582) - T[61-89] 6.729 - T[752-786] 8.144
Pfam
cEGF (IPR026823) - T[111-132] 2.6E-8
ProSitePatterns
EGF_Ca-bd_CS (IPR018097) - T[129-157] .
SUPERFAMILY
Peptidase_S1_PA (IPR009003) - T[184-428] 2.85E-67
SMART
Trypsin_dom (IPR001254) - T[194-419] 1.1E-72
ProSiteProfiles
Trypsin_dom (IPR001254) - T[195-424] 30.44
CDD
Trypsin_dom (IPR001254) - T[195-422] 7.44089E-65
Pfam
Trypsin_dom (IPR001254) - T[204-419] 1.4E-52
PRINTS
Peptidase_S1A (IPR001314) - T[212-227] 1.6E-11 - T[273-287] 1.6E-11 - T[367-379] 1.6E-11
ProSitePatterns
TRYPSIN_HIS (IPR018114) - T[222-227] .
ProSitePatterns
TRYPSIN_SER (IPR033116) - T[368-379] .
Gene3D
TSP1_rpt_sf (IPR036383) - T[433-486] 1.3E-6 - T[487-540] 1.3E-11 - T[542-593] 4.2E-10 - T[594-643] 9.8E-6 - T[644-698] 1.8E-9 - T[699-748] 2.0E-6 - T[789-842] 1.9E-8 - T[843-897] 2.1E-9 - T[898-952] 3.9E-9 - T[953-1007] 1.6E-8
ProSiteProfiles
TSP1_rpt (IPR000884) - T[434-487] 10.273 - T[488-539] 11.393 - T[540-593] 13.021 - T[595-643] 9.162 - T[644-698] 11.727 - T[700-748] 8.527 - T[790-843] 11.511 - T[845-898] 11.873 - T[900-953] 11.605 - T[955-1008] 11.597 - T[1010-1060] 8.733
SMART
TSP1_rpt (IPR000884) - T[437-487] 6.0E-5 - T[491-541] 2.6E-11 - T[543-593] 4.1E-8 - T[595-643] 2.7E-4 - T[647-698] 2.7E-10 - T[700-748] 0.0059 - T[793-843] 1.9E-5 - T[848-898] 8.2E-7 - T[903-953] 2.9E-6 - T[958-1008] 3.0E-6 - T[1013-1060] 0.023
Pfam
TSP1_rpt (IPR000884) - T[438-486] 0.012 - T[494-540] 1.8E-8 - T[545-592] 4.6E-8 - T[599-642] 4.8E-4 - T[649-697] 1.6E-5 - T[705-747] 1.6E-5 - T[793-842] 6.5E-6 - T[849-897] 1.9E-7 - T[905-952] 6.9E-7 - T[960-1007] 1.2E-6
SUPERFAMILY
TSP1_rpt_sf (IPR036383) - T[489-537] 4.45E-10 - T[540-589] 6.54E-10 - T[595-642] 4.32E-5 - T[645-693] 2.75E-9 - T[696-742] 5.62E-5 - T[791-839] 5.1E-7 - T[842-897] 6.8E-8 - T[897-952] 5.36E-8 - T[952-1007] 5.23E-8
Pfam
ShKT_dom (IPR003582) - T[751-786] 7.4E-7 - T[1066-1102] 0.014

Best Blast Hits in UniProt
Protein Name Identity Bit Score e-value
HMCN1_HUMAN 35.389 % 195 2.61E-50
KLKB1_HUMAN 34.8 % 138 1.3E-33
PRS33_HUMAN 34.274 % 132 4.64E-34