Login
Help

TRANSCRIPT CARD

Submit your Data

  1. Transcript 'Harore.CG.MTP2014.S8...'
  2. Transcript 'NCBI:KH.XM_018816709...'
  3. Transcript 'KH2012:KH.L18.37.v1....'
  4. Transcript 'Boleac.CG.SB_v3.S110...'

Transcript Model

Transcript Id

Boleac.CG.SB_v3.S1104.g01075.01.t

Possible name(s)

GAA; MGAM; SI

Location

S1104 [1,452 / 4,107]

Sequences

Amino acid sequence

Length: 884

>Boleac.CG.SB_v3.S1104.g01075.01.p
MSAGELICGYFLSFLTIQLALCIQCDVSTEQKIDCYPESYPLKQADCEQRGCCYSPATEN
DDVPWCFFPRSYSYGYVLTELKEVSYGYEGVLKISDVKAPYPINALPSLKLSVFLEAADR
IRFKITDEKDTRYQVPLDVPKVLTKTIDRVKYSWQLSSVNETFNIKISRADTKTVLFDTS
IAPILFGDQFLQISTSLASSNLYGLGERHSSLRPDVNWNRVTFWAADRAPQVPEKENLYG
DHPFYMVVEPDGKSHGVFFLNSNAKEAILQPTPALTWRSIGGIMDFYVFLGPNPDQVIQQ
YSTVVGTTFMPPLWSLGYHLCRWGYKTANNTLNYVEKMRSAKIPQDVQWNDIDYMEKKFD
FTYDHANFDTLPNVVKNLHTHGQHYIMIIDPGISDQAPKGTYSPYDDAVDMGILVKDHKT
LQPVRGKVWPGEVVFPDFTDPKVYDYWTKQLSRYHDKISFDGVWIDMNEYSNFMDGSPES
CYQNNTYDYPPYVPNVSGGRLFSRTVCPSSVQHASINYNVHSLTGLFEMKATSNALKKIR
NKRPFVISRSTFPSAGKYGGHWTGDVRSSWSHLKVSVAGILSFNLFGIPMVGADICGFGG
FTTEELCIRWTQLGAFYPFSRNHNDIAGSPQAPVDFSEKTQDILRTALHIRYAFLPYLYT
LFYNSHVNGSTVARPLFFEFPLDKVTYSIDTQFMWGSGIMISPVLDKGSLSVNAYFPAGV
WYDMYLQVGNRIESTGGSLMTPAPIDRMPIHIRGGHIIPCSHAGTVTNVSNTFYVIAAAD
SEGKANGNLYWDDGDSLNTIESNNYAYLKLVMEKNSXVISIYEGSKQVASKMQMDLLIIF
GMQCKPKQMTANGXVVSFDVGSYGEVFVNMKPLSMNNINFTWTC

Nucleotide sequence

Length: 2,655

>Boleac.CG.SB_v3.S1104.g01075.01.t
ATGAGTGCTGGAGAGTTGATCTGCGGGTATTTCCTCTCTTTTCTCACCATTCAACTTGCA
TTGTGTATACAATGTGATGTTTCAACAGAACAGAAAATCGACTGCTACCCGGAATCATAT
CCATTGAAACAGGCAGACTGCGAGCAAAGAGGCTGTTGTTATTCACCAGCTACAGAAAAT
GATGATGTGCCTTGGTGCTTCTTTCCCCGTAGTTATTCTTATGGATATGTTTTGACGGAA
TTGAAAGAAGTTAGTTATGGATATGAAGGTGTGCTGAAGATATCTGATGTGAAAGCTCCA
TATCCAATAAATGCATTGCCTTCATTGAAATTGTCTGTATTCCTCGAAGCGGCAGATCGT
ATTCGTTTCAAAATTACTGATGAAAAAGATACAAGGTATCAAGTTCCTTTGGATGTTCCC
AAAGTCCTTACAAAAACTATTGACCGAGTGAAGTATTCATGGCAATTGTCATCGGTGAAT
GAGACTTTCAACATAAAAATAAGTCGAGCCGATACAAAAACTGTGCTTTTTGATACGTCT
ATTGCTCCAATACTCTTTGGTGATCAATTTCTACAAATTTCTACATCACTAGCTTCTTCC
AACTTATACGGTCTTGGAGAAAGACATTCGTCACTTCGTCCTGATGTCAACTGGAATAGA
GTCACATTTTGGGCTGCAGATCGTGCTCCACAGGTGCCAGAAAAAGAAAATCTCTACGGT
GATCACCCTTTTTACATGGTTGTTGAACCAGATGGAAAGTCACATGGTGTTTTCTTTCTC
AATAGTAATGCAAAAGAGGCAATTTTACAGCCAACGCCGGCTTTAACATGGAGAAGCATT
GGTGGCATCATGGATTTTTATGTGTTTCTTGGGCCTAATCCTGATCAAGTTATACAACAG
TATTCTACAGTTGTTGGGACAACTTTCATGCCCCCGCTTTGGTCTTTAGGTTATCATTTA
TGTCGATGGGGTTATAAAACAGCAAATAATACATTGAATTATGTAGAAAAAATGAGGAGT
GCAAAAATTCCTCAAGATGTACAATGGAACGATATTGATTATATGGAAAAGAAGTTTGAC
TTTACATATGATCATGCAAATTTTGACACCTTGCCTAATGTTGTTAAGAATCTACACACT
CATGGTCAACATTATATCATGATTATTGACCCTGGAATTTCTGATCAAGCACCAAAAGGA
ACATACTCTCCCTATGATGATGCTGTTGATATGGGTATCCTTGTGAAAGATCATAAAACA
TTGCAGCCTGTTCGTGGAAAAGTTTGGCCGGGAGAAGTTGTGTTCCCTGATTTCACTGAT
CCTAAAGTGTATGATTATTGGACAAAACAACTGAGTCGATACCATGATAAAATTTCTTTT
GATGGAGTATGGATTGATATGAATGAGTATTCTAATTTCATGGATGGCTCCCCTGAATCT
TGTTATCAGAATAACACCTATGACTATCCTCCATATGTGCCCAATGTCTCTGGAGGTCGC
CTGTTTTCAAGGACTGTATGTCCATCGTCTGTTCAACATGCTTCCATCAATTACAACGTT
CACAGTTTGACTGGGTTGTTTGAAATGAAAGCGACAAGTAATGCTTTGAAAAAAATTCGA
AACAAAAGACCGTTTGTGATATCACGATCAACATTTCCCAGTGCTGGAAAGTATGGAGGT
CATTGGACTGGAGATGTACGAAGCAGCTGGTCACATTTAAAAGTATCTGTCGCGGGTATT
TTGTCATTTAATCTGTTTGGAATCCCAATGGTCGGTGCGGACATTTGCGGTTTTGGAGGG
TTTACCACAGAGGAGTTGTGTATACGATGGACTCAATTAGGTGCCTTCTATCCTTTTAGT
CGGAATCATAATGACATTGCAGGAAGCCCTCAAGCTCCAGTTGATTTTTCTGAGAAAACA
CAGGATATCCTGAGGACTGCGTTACATATTCGTTACGCTTTTTTGCCGTATCTGTACACA
TTATTCTACAATTCACATGTCAATGGCAGCACAGTAGCTCGGCCTCTGTTCTTTGAGTTT
CCTTTAGATAAAGTCACATATTCTATCGACACTCAATTTATGTGGGGATCGGGTATCATG
ATTTCACCTGTCCTTGATAAAGGAAGTTTATCTGTGAATGCATATTTTCCTGCTGGTGTT
TGGTATGATATGTACTTGCAAGTAGGTAACCGTATTGAATCTACGGGTGGATCATTAATG
ACACCAGCTCCAATAGACAGAATGCCAATCCACATTCGAGGAGGCCATATTATCCCATGT
TCTCATGCAGGGACAGTCACGAATGTCAGTAATACTTTCTATGTCATAGCTGCTGCTGAT
AGTGAAGGCAAAGCTAATGGAAACTTGTATTGGGATGATGGGGATTCTCTCAATACAATA
GAATCCAATAATTATGCATATCTGAAGCTTGTAATGGAAAAGAATTCCNTTGTTATAAGC
ATCTATGAAGGGTCCAAACAGGTAGCATCTAAAATGCAGATGGACCTACTCATTATCTTT
GGCATGCAATGTAAACCAAAGCAAATGACAGCCAATGGAAANGTGGTGTCTTTTGATGTT
GGTTCCTATGGTGAAGTTTTTGTCAACATGAAGCCTCTTTCTATGAATAATATCAACTTC
ACATGGACGTGTTGA

InterProScan

ProSiteProfiles
P_trefoil_dom (IPR000519) - T[23-70] 15.78
SMART
P_trefoil_dom (IPR000519) - T[23-73] 5.1E-10
CDD
P_trefoil_dom (IPR000519) - T[24-70] 4.64462E-13
Pfam
P_trefoil_dom (IPR000519) - T[25-69] 3.6E-12
SUPERFAMILY
Gal_mutarotase_sf_dom (IPR011013) - T[47-294] 2.26E-40
Pfam
Gal_mutarotase_N (IPR031727) - T[98-197] 4.3E-23
Pfam
Glyco_hydro_31_N_dom (IPR025887) - T[201-263] 6.9E-10
Pfam
Glyco_hydro_31 (IPR000322) - T[287-758] 3.8E-154
SUPERFAMILY
Glycoside_hydrolase_SF (IPR017853) - T[296-470] 1.58E-77 - T[506-669] 1.58E-77
ProSitePatterns
Glyco_hydro_31_AS (IPR030458) - T[462-469] .
ProSitePatterns
Glyco_hydro_31_CS (IPR030459) - T[592-622] .
Gene3D
Glyco_hydro_b (IPR013780) - T[655-754] 6.7E-34 - T[755-882] 7.2E-18

Best Blast Hits in UniProt
Protein Name Identity Bit Score e-value
MGA_HUMAN 41.219 % 673 0
SUIS_HUMAN 40.777 % 626 0
LYAG_HUMAN 45.814 % 789 0