UHGP-MC 18678


Information


Number of sequences (UHGP-50):
100
Average sequence length:
63±7 aa
Average transmembrane regions:
0.01
Low complexity (%):
0.46
Coiled coils (%):
0
Disordered domains (%):
0.5

Pfam dominant architecture:
PF00232
Pfam % dominant architecture:
7200
Pfam overlap:
0.14
Pfam overlap type:
reduced

Downloads

Seeds:
MC18678.fasta
Seeds (0.60 cdhit):
MC18678_cdhit.fasta
MSA:
MC18678_msa.fasta
HMM model:
MC18678.hmm

Sequences list (filtered 60 P.I.)

Protein Range AA
GUT_GENOME267292_01357364-424WPIDPVGFRTALNRIYDRYDLPIFIAENGLGSTDHPDKNGYVRDDDRIEYMKGHVREMREA
GUT_GENOME242764_02315334-393WIIDPKGLRLAAKELFDRYKLPVFTVECGIGVKETLNAEGTVEDDYRIDYLRDHVEQLRL
GUT_GENOME104704_01989322-385KTAFHWTIDPLGFRIMLNNMYERYEIPIMVVENGLGTYDRVENNQIHDNYRIDYLQKHIEQMKL
GUT_GENOME243156_01450333-406YQGTGNAALEKSDWDWTIDPEGLRISLRMLYDRYAVPVMIVENGLGAYDQVEDGKVHDPYRIEYLKKHVEQLQL
GUT_GENOME011451_00776334-389PLSLYYAPFFRYERYHLPIRITENGGCYKDTLTDGKIHDGGRISFLSEYLKQVERT
GUT_GENOME008187_00228334-410ESARDSNVPRNKWGREILPSCMYSTIMDVHERYDAPRIFITENGHGAYEQPDADGMIHDDDRIELLGQFIEHMMRAK
GUT_GENOME244140_01104346-430DGIPFTIESLKMEDRTSFDWAIYPEGLTEILVQLRERYGSALPPIIVTEGGAAFTETVTTDDSGQARVHDPRRIDYLARHFRAVA
GUT_GENOME000812_0198268-146GKPSPLPKNTFLQRTEWDWTIDPVGLRGGLNKLSSRYKLPIFIMENGMGVVEQLNDEGTVTDSYRVDFFKSHVKQMMKA
GUT_GENOME066362_02220376-441ETSYGMGIDPVGLRITLRELWERYSLPLLITENGCGVPDTLTEDGKVHDDYRIDYLRKHIKACQQA
GUT_GENOME141655_02339364-422WPIDPLGFRYVMNELTDRYELPLFIVENGIGLDEAPDELNRIHDPKRCEYLKIHLQEIA
GUT_GENOME164461_00306348-423QHAVRNKYLEQTPWGWQIDPQGLRVALNTLWDRYRVPLFVVENGMGTYDKVAEDGKIHDQYRIDYLRAHIEQMKEA
GUT_GENOME070710_00504352-415TPWDWRMDPLGFYHSISEYWDRYQKPMIIGENGFGAIDQVVDGRVHDDYRIDYFRQYIAQMKRA
GUT_GENOME282619_00647384-443WTIDPVGLRYMLNSLWRRYNKPVIITENGIGAYDKLENDTVNDEYRIAYLREHFIEMKKA
GUT_GENOME053971_00022309-372LGWENNGKGLYWTIRFIHERYSKDKPIMVTENGLCLKDQLEDEKVHDLERIRYLKEYLLNLERA
GUT_GENOME096391_00716357-417WSFDPIGLRITLNLLYERYQKPMMIVENGSGSFDELTADHKIHDPYRIAYYKTHIEQIKEA
GUT_GENOME103803_02298359-438DGNAFETLKNPYLEASEWGWQIDPLGLRITMNALYDRYQKPLFIVENGLGAVDVPDANGYVKDDYRIDYLREHIKAMKEA
GUT_GENOME061359_01910367-431SAWDWAIDPIGMRIALNHMYDRYHLPIFISECGFGAYDQVEADGMVHDDYRIDFLQKQLTQVKEA
GUT_GENOME183526_00808345-423EAGYYKGYANPNLKKTEFGWEIDPDGFLATMREMYSRYHLPMIVTENGLGAYDTLEDGRVHDNYRIAYLRAHIEQLHKA
GUT_GENOME053566_01112291-371KLGEEGVYRAAQNNYLEQTEFGWMVDSIGMRVTLRRIYDRYHLPLLIVENGVGLDENIDEKGEIHDDFRVHYIEEHLKNVN
GUT_GENOME044922_00914375-433WQIDPIGLRIMLNKMYDRTQKPIFISENGLGARDQLNSDFSIHDPYRIDYLKQHFKQIE
GUT_GENOME188373_01474375-438NAWGWATDPDVLRIALNDLWDRYHKPLWVVENGLGSADTLEKDGKVHDDYRINYLRSQIKSMRD
GUT_GENOME260772_00283343-402WQIDSEGLRYSLVDFYHRYHKPLFIVENGIGIDEKLVNGKVYDDERISYYQKHITSIKRA
GUT_GENOME031890_00649686-761NDLCDNPYIQKTQWGWPIDAKGLRYTLNWLYDRYQLPMFIVENGFGAIDQKEVDGSVHDQYRIDYLKEHIQEMKKA
GUT_GENOME096533_03783344-411LPANPWGWTIDPIGLRTLLNLFYDRYQCPIYITENGIGYHDTLEDDDSIHDAYRVDYFRAHIEQMKEA
GUT_GENOME059484_00384378-438WGIDPLGLRTVLNYYYDRYQKPIIIAENGFGTFDEISSDGCIHDERRIAYLREHIRNMKEA
GUT_GENOME038944_01659353-417TPWEWRADPLGFYNCLNQLWDRYQVPLMIAENGLGAEDVVEEDGSIHDEYRIEYLKAHTKAMKQA
GUT_GENOME005699_00643358-417NWTIDPDALRLSLIQLHERYHLPIFIVENGIGIKEEMINGKIYDDNRIDYYQGHIQAMKT
GUT_GENOME231525_00150367-426WAVDPLGLRISLCQLSDRYNMPLFVVENGFGAVDTLQEDGSIVDDYRIDYFKQHIEAMRT
GUT_GENOME073297_01771331-404QALRNPKLKQSSFGWTIDPLGLRILMNDLYDRYNLPLMIVENGLGVKDDNLTKDKKVHDDYRIAYLKQHIQAII
GUT_GENOME212132_00119328-392TEWKWPIDPYGFKHYLHDYYHRYQLPILITENGMGARDTLLPDGTIDDEYRIKYMADHIASMKEA
GUT_GENOME265488_00016336-408QLCYNPFVPYTDWDWEIDPVSMRYVMRYLWDHYHLPMMVTENGYGAHEHKDANGEIHDQYRIDFLRDTIYQLG
GUT_GENOME284695_00717314-377TSMGWIIDGKCLYYSAKFFYDRYKKPIIITENGVAFNDVVDNGAAHDQNRIEYIEEYFSNLMKA
GUT_GENOME177456_01707344-426LGEKPGVGKACVNEFFEKNSVDGNTFDPYCIRVTARRLTDRYHLPLIITENGYGRPDKFEDDGTIHDTYRIEHLRRMIEEMKI
GUT_GENOME016827_01450337-422GDQAKNPEVPMFNGVENDYVNKTNWGWEIDPTGLRIALRQVYEKYQLPIMITENGLGAKDIVENGKINDQYRIDYLADHVLAMKEA
GUT_GENOME181479_01179347-415EFLPTTDWDWTIDPMGLRMCCRQITSRYDLPIVISENGLGAFDKFEDGEIHDPYRIDYLKHHIEELKKA