UHGP-MC 8217


Information


Number of sequences (UHGP-50): 75
Average sequence length: 150±14 aa
Average transmembrane regions: 0.14
Low complexity (%): 2.13
Coiled coils (%): 0
Disordered domains (%): 14.11

Pfam dominant architecture: PF18810
Pfam % dominant architecture: 22.67
Pfam overlap: 0.73
Pfam overlap type: extended

Downloads

Seeds: MC8217.fasta
Seeds (0.60 cdhit): MC8217_cdhit.fasta
MSA: MC8217_msa.fasta
HMM model: MC8217.hmm
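
The average sequence length reported above could, in principle, be recomputed from the seeds file. The following is a minimal sketch, assuming MC8217.fasta is a standard FASTA file; the function names `read_fasta` and `length_summary` are hypothetical, and the use of the population standard deviation is an assumption, since the page does not state how its ± value is derived.

```python
import io
import statistics


def read_fasta(handle):
    """Yield (header, sequence) pairs from a FASTA-formatted text stream."""
    header, chunks = None, []
    for line in handle:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # A new record starts: emit the previous one, if any.
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        else:
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def length_summary(handle):
    """Return (count, mean length, population SD) for all records."""
    lengths = [len(seq) for _, seq in read_fasta(handle)]
    mean = statistics.mean(lengths)
    # Assumption: population SD; the source page does not say which it uses.
    sd = statistics.pstdev(lengths)
    return len(lengths), mean, sd


# Made-up two-record input, used only to show the call shape.
demo = io.StringIO(">a\nACDE\n>b\nACD\nEFG\n")
print(length_summary(demo))
```

For the real file, the same function would be called as `length_summary(open("MC8217.fasta"))`.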

Sequences list (filtered at 60% identity)

Protein Range AA
GUT_GENOME018893_01259 349-485 GKRHLLKKEVQEQWKSIFNLQKLTDTYIPQHSLAIKTALKNKEVKLTYNNLFKLVEYKRTAFIPHIKQTLDTPDVIYKIEKENKFIFAKSYKENTVLVSVGVDYHTYITVISNYPKEFREIKNQLRRKEAKLIYNKP
GUT_GENOME147788_00310 1715-1871 DKGINQYDKTKETFTDSYGKTHEIPKDIADNWKNTFNLKSLDEAYIPSFTPEVKQALDSILQGEDIKLYAGSLVKLIKENRLKYLDRIKPTLEQPQRIILQNDGALIFARNFGEEKYFTSVARNDNGEWIIRSNAPKAENGLNNKISAGGKEIYNSQ
GUT_GENOME232734_01029 2168-2338 DEEVVAITDAMKANATVAPTVEINDANWKESVDTPIGAVKMGENQKTKLFAKGREQQYGMLLETLSNPDVVLEEKDKEQNMFHERPSSYLFVKTFQKEDGSKYVHFESVTVSQEGMEVSISSHIIRENQLKNKLKSDRLLYKATALDAPANTSAEQPIVGGSLSSDGKDTE
GUT_GENOME217315_01931 1087-1241 LTDEEARQVIGQMEAAAEPAPELELTPENWEAEFGEEGKVQTPVGEVKMGENQLAKLFLKGRSAQFGMVKPTLETPDCVIEIPTQSKDGNTERPSSLLFVKAFTGKDGQKHYFFTSVTVQKDDMEVSVSNHLENRKRIIDFLKKGKLLYRVYGGA
GUT_GENOME252025_00181 4395-4544 MQDNAETLVPRQLTKEIWDNEIKGKVFSTPIGVVKLGENQYEKNIKKGRELEFGMLIPTIERPDAILEEDAPEAGAERQTKYVFVKSFVENGVKKINYASVSVRKEGMEVVASSHYLRDKQMLNKIEKERMLWNRFASGSELSAQGGSTA
GUT_GENOME233183_01115 1-178 MEKSTQGFEMNNFDEYEIISIIFKDKNNKEHLLTKQTQELWKATFGLQSIDDNFIAKIPQELKEILGKDIQVKKGSLFKIVSQKRENFIPQIKEVLESPEIAIKDGNNAYLLAKHLKNDDYFINVSVDKGGDYLISISNGIKELNNLKNKIKDGAEILYQSPNANSNLQTLLQASRYS
GUT_GENOME143418_00774 364-485 DIIFTDTKGKEHTLSKEAREQWLKAFNLKSLDEEAIVEIPVDLKERLGKEIKLNKKDFEKLVKNKREKYIPQIKETFKEPEAVFIDENEDLIFAKSFNDKLFFVNVNRDYGEAFKGLSLAPK
GUT_GENOME235886_01356 431-571 LDEKEKTHLHNLLKINAVEFPVVEFNEENYKKYLGSPIETPMGKVKLGENQFEKLKSKNRQNLIGAIHDTLTNPCFIAEEKGGTTLYVKSFIQNDKQKNIMSVVIKRDGLNISISTHEEREPQILSKIKKAGVLRETASDD
GUT_GENOME283137_02360 202-378 DDFSSSNIGKSIDGEENINKFLNTFKHTADVEPAEIPFTENNYKRLFGNGVDTPIGHVKMSENQEKKMVDKSRTKYFSLARPTLENPDFILEKPSKANEGQATERPSSYIFIKTFRNADGSYVTYYNAVSVMQDGLEVIMSNYIPEEREVKRDIRGGTLAYIKKVTVPSASDTTVHG
GUT_GENOME000578_00525 1767-1942 LSEQEATDLIAQMEANAEVASTIELTPENWIAQFGEDGTVETPVGVVKMGENQYFKIAQQGRNGKFGMIKPTLQNPDVVIEDYRPAEAGNSERDTSLLFVKTFIKEDGSRYYHFTSVTVKKEGREVVISNQERSSKRISKLLQQGKITWIKDDSLYPTAQVEKSVSLNDSNNLTKS
GUT_GENOME142988_00412 865-1002 LRFIGKNDKEYTINKDVRNEWMKTFNLKNIDDEYIPNIPKEAKIALKDREIKLTKGSLLKLIEKDRIKYIPHIKETLESPQAILKDKDDFIFIKNIDNQTYFTSIGKDYETHLTIISNSPKKQNNIRNKIKNAEVVYY
GUT_GENOME096041_01444 33-186 KNDGLSSIKETIIAHATPWRHLDFSKEAWKQEFPDGGVHTPIGFVKLGENQTEKLLDRKRKTYFGLIKPTLEKPLYIVSERETPEKIQERKDKGEGVERENVIHFIKPFIVADGNVQCFCCISIKRGEFEVAISSSPRKIKQVFNAIRKHALII
GUT_GENOME142596_02453 1936-2111 LNDDEANELLSRMEDNTSEIPQIELNPTNWIEQFGENGMVSTPLGKVKMGENQIAKLFEKGRSEQFGMIKPTLEHPHVIVEVPSEASDGNTERASSLLFIKTFNGTNGKKVYYFKSVTVKKDGLEVSVSSHYDRAKRVKEALKKGKLLYRFDGGAQTEHHPAGVSVTTSPNITQGK
GUT_GENOME196787_00047 2981-3136 MTKDEAVSLRQQMADNAEPERILEHTEDNWLQDFGKDGRVNTPIGSIKLGENQYKKAGREDRIKRFGLLKPTLERPDVILEKPAPKEGAERQTKYLFVKSFKKVDGTKILNFESITVKQGEDEVSISAHQIEPSKLLKELTESKMLWNRFRGDSNS
GUT_GENOME237872_00474 68-199 KEAEEAWIKAFNLKDINEPYIPQFSKEIQDALNPILKGEQIKLTRGSFEKLANKNRLDFIPKIKETIEKPNYILDDGQGILFIKEFVENDKEKHFLSVAKNYDGEWIFSSHTRREANNIKNKIENSKMLYKG
GUT_GENOME237441_01191 81-234 EPHVVSLSELSAEELREGVKYALNEVAQPLPKIDFTRENYNKLFPYSKIDTPIETVKIGAHQFEKLEEKDRKTLLQAVHDVLSNPDVIINEEKKSVFGDTENSHIYAKSYVINENTKAVQSVVVNIEDEYVSISTHKRDISNVVNKIKKPDQLL
GUT_GENOME100343_01039 2308-2478 ENGMIGRSLSEQEATELITRMEANAEVAPEMELTIENWDAQFGEDGIVNTPVGDVKMGEHQFLKMMRQGRNSKLGMEKPTLEQPYVIVEVGSEAIEGDSADRDSSYIFTRAFIKSDGSRFYYFTSITVSKDGKEVVVSNQEKSRNRILRLLKEGSVVWRTPKDATTPSAEE
GUT_GENOME096364_01879 427-560 QKVDSSDIIFTDTKGKEHTLTKEVQQQWCETFNLKSLDESYIPQLSQELQEAIGKEIKLTKGSLYKIIEKGREQYIPQIKETLEKPQIVLQDESEFIFAKQIKDDLYLTSIGKDFDTHITIISNSPKTDKTIQI
GUT_GENOME129169_00476 1200-1364 MTEEEAEAFLTAISNNHEVAPELELTPENWYAEFGEDGVVHTPIGDAHMGENQFLKMMRDGRKSKLGMIRPTLEAPHAIVEEPSMAKEGQETERDSSYIYIRVFEKEDGSRHYHFTSVSVQRDGGEVIVSNQEKSRNQVKRLLTEGVVLWMRADNAPDTSDVDQD
GUT_GENOME167108_00683 2046-2178 AIPIPEMELTHENWVNEFGNGVLITPLGEIKLGENQFSKMIDKGREKEAGMIKPTLTDPDFVIEEASMATEGETERPASHLYVKSFIGIDGRKRYFFKSVTVKKDGMEVNVSNHFDRVKRLREALKDGKLLYR
GUT_GENOME270036_00269 1422-1573 SLNEDESELHVAAMEYMAAQAPTEEFSRENYNNLFPNGRVDTPVGTIKLGEHQFEKLNSKGRQAQLGMMSETLRDPDVILYEEDLAAPEDAERKGVLLFIKTFTKEDGGKYTNFESVTIRREGEEIAISNHILGRNAYKGKLESDLIVYKKT
GUT_GENOME147138_04576 181-318 ADMLAANAHTIDPEITEEQLTELMGSTIDTVVGPVRFGANQHDKLGSKGRTPYAGWIRPTLENPLVVLKEHRGNQGDEREFSYLFVNAIAKPDGKVRGFVCVTVQKSGGEVVISSHHLKATQIGRKLQGGEVVYRRPG
GUT_GENOME214832_01814 1801-1969 LSKEDATDIIAKMEMSAVNDPQISLSPESWQNSFGLSNSIDTPLGKVKMGEGQYQKLVDKKRSAEFGMVVQTLQDPDVVFIEPSEAKEGQTTERDFSYVFVKTFIRNGQKFKYYTSVSVLKDGMEVSVSSHIASKTAIMKKLQGMERAYTKQSLLPNSSEWHLAEHPTD
GUT_GENOME141093_00742 1410-1572 AAVRKEAEPFIEKEYSLENFKAEFPNGKVDTPIGEVAVSNYQFEKLNFKGREKYLGLIKPTLERPAFVVDFEDTTFFFKPFRDKDGIVKFASVIKERDGGLDVTSNYPMANRKFELITREGKVRYVQGSVAKPVEHSLDNNVRSSATKSPIEHSSDSKGIIPQ
GUT_GENOME280978_02175 1332-1489 LSGEEADLLLSRMESAAEISSEKELTPETWAETFDENNFIATPIGSVKMGGNQITKFFEKKRTKEFGMVGPTLSNPDVIIEEASEAKDGNAERGSSFLFIKTFNRNGEKVKFYASITVKQDEMEVSVSSHYMNKNKVKRALQESGVLYIREALLSNSS
GUT_GENOME022912_00474 2403-2538 TIKESLAQNAEPLVEVDFSKANYDRLFPRAIVQTPVETVKLGENQFEKLDARDRKRFLLATFQTLATPDLVIDEEREGKHSHNYIKSFVFDEKTKTIQDVVVNINGENVSITAHPRDINNIVNKIKMPDQLVYAAA
GUT_GENOME139983_01000 390-537 GQLLIKDEKGKNHFINEKLVKAWQDEFKLANAQDDFLPEFSPQIKDILAKNGVDEIHLKIGSLVKINARDRGEFIKHIKPTLQEPDFILKDESGVLFVKDIGKDKAIFTSIAKNENGEWIISTNSYKRIKTLKDKIESGNTTILHQSK
GUT_GENOME218324_00862 28-173 VVSDDKINYAHLKNIAQELKEVKDETEFYALFGFKNGKTRGNVKTPIKEVEIDLQEAWQHLQDNSKRENRLKLAGGILPTLQTPNVVSVDKNGTYFFHKAFKDEKGVLNLVSVELPKSNRLQYKTSYIASKQRLLKIFNEYKVIYE
GUT_GENOME157190_01844 1437-1593 RSLTSDEADALLAQMENSAEVAPEIELTPDNWLSQFGEDGLVDTPIGKVKMGENQYFKIILKKREKEFGMIYPTLSNPDIVIEEPSTSDKAERNTSLLFVKTFVVNGEKVKYYASVTVSKDNKEVVISNHFIEKAAFKKKLLADSTLYLKPSLSNSS
GUT_GENOME018349_02092 384-515 LEKNVSPLKELELNRENWNKMFPDGTVKTPVGTVKLGENQFDKLRRNDRNNLLAAMYETLSNPALILEKETLDEKSGEFKPVNVYGKSFIHEDSNHKRAVESVIIFKDGENISISTHNKNIKDFVKQIKTAD
GUT_GENOME096039_01655 1475-1615 FTDKKGKEHTLTKEVQEQWCETFNLKSLDEAYTPKHSDEIREALGGKEIKLQLGSLKKLVAQGREKYIPQIKEVLDSPEAILKDSDNAFLFIKHLKDDDYFVNVGVDKGEYLVSISNGIKETNNIKNKLDFGAKVIYQSPN
GUT_GENOME143153_00904 337-465 QAEETWIREFGLSDINQSKKVEIHPLIREALGKDLEITPRDFEKIIEKGRDKYIKEIMPTLKNPDYAFIDSGGDLIVAKDLGDSLFFTSVSRDYGEAFRNLSLSPKKENTLKNRLASAREIILEPASEL
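
Each row of the list above carries a protein identifier, a residue range, and the amino-acid sequence. The sketch below shows one way such a row might be split into its three fields and sanity-checked. It assumes identifiers follow the pattern `GUT_GENOME` + six digits + `_` + five digits, which matches the rows shown here but is not stated by the source. The names `ROW` and `parse_row` are hypothetical. The pattern tolerates rows with or without whitespace between the fields.

```python
import re

# Assumed identifier shape: GUT_GENOME<6 digits>_<5 digits>.
# Whitespace between the fields is optional.
ROW = re.compile(r"^(GUT_GENOME\d{6}_\d{5})\s*(\d+)-(\d+)\s*([A-Z]+)$")


def parse_row(line):
    """Split one table row into protein ID, range, and sequence."""
    match = ROW.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognised row: {line[:40]!r}")
    start, end = int(match.group(2)), int(match.group(3))
    sequence = match.group(4)
    return {
        "protein": match.group(1),
        "start": start,
        "end": end,
        "aa": sequence,
        # An inclusive range should span exactly len(sequence) residues.
        "consistent": end - start + 1 == len(sequence),
    }


# Made-up row, used only to show the call shape.
row = parse_row("GUT_GENOME000001_00001 10-14 ACDEF")
print(row["protein"], row["start"], row["end"], row["consistent"])
```

The `consistent` flag gives a quick check that the identifier and range were split at the right position: a wrong split would usually produce a range whose span no longer matches the sequence length.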