UHGP-MC 100314
Information
- Number of sequences (UHGP-50): 111
- Average sequence length: 82±8 aa
- Average transmembrane regions: 0
- Low complexity (%): 0.65
- Coiled coils (%): 0
- Disordered domains (%): 0.47
- Pfam dominant architecture: PF13274
- Pfam % dominant architecture: 180
- Pfam overlap: 0.1
- Pfam overlap type: shifted
Downloads
- Seeds: MC100314.fasta
- Seeds (0.60 cdhit): MC100314_cdhit.fasta
- MSA: MC100314_msa.fasta
- HMM model: MC100314.hmm
Sequence list (filtered at 60% P.I.)
Protein | Range | AA |
---|---|---|
GUT_GENOME134296_02131 | 2-72 | KKFDLEAALKGAPVITGGNHPARIICTDAKGNLPIIALISNNGVENPVKYTLDGRYYTKGTSGYDLYMDVV |
GUT_GENOME233197_01707 | 104-207 | IQTKYKRIPFNIELAKKITNKEVKGSIVTRNGLKVRIICFNRKERKDIFDPLRNIVALIQYKVGDENGDEGVLTFRNNGMYLLNEETDYDLLIEIPKYSEYSNF |
GUT_GENOME070654_02613 | 129-213 | MEKRMITKPFDLELAKKISNGEYDGEIVTVGHNHKVELVYYNKDRGMFNTLGVIYSDSGIISDWFSDNGLGARGCRLCINIPEYT |
GUT_GENOME115866_01571 | 126-203 | LKKFDLEAAKAGKPVCTRDGRKARIICFDRKLLFKGISYPIIALVEDTAKEETIYGYNEKGKVMIEDDATYKDDLMML |
GUT_GENOME153797_00297 | 4-88 | EFVKVPFDIEKAKKIQAGEMEGRIVTKGNYAVKILCMDANIANTPIVALLTVNFKQKAATYTKEGKYYIDGYDCDENLMLELPSS |
GUT_GENOME110985_02135 | 3-93 | TKYVRVPFEVEIAKEITNGAKDGKIVTRRGDNVRIVCWDYKSMSGNYPILALVDEGRQEEAIKYSEDGRYNVYGYFSESCLDIMLEIPEYM |
GUT_GENOME095119_02486 | 1-101 | MTDTKMIIVPFAIELAKKIINKEIEGRIYDTYYKCPARIVDFNFQTIQGKFNVVISEIGKGKERFALCNDDGIIYLDREGKFDEKPVFMLEIPEYVTFKDG |
GUT_GENOME108107_00459 | 1-71 | MKEFDLEKAKVGHPVCTRDGKEARILCFDRIGHHPIVALVKEAGDETIFSYNKKGRFSNDGRGCMCDLFMK |
GUT_GENOME068883_00507 | 7-88 | RISFDIEIAKKIINGKIKGKIITREGQDVRILCFDAKGDNPIVALVKKANNTEEVVTYPEDGCIFKQGVSTLDLKLEVPEYV |
GUT_GENOME241027_01293 | 5-87 | FIRVPFDLELAKKITEKSIEGRIVTRDGRNVRVLCWDAKADQCIVALLLGEFYEECGTYTKDGRIFMNGESPADLMLEIPEYM |
GUT_GENOME046187_02009 | 5-88 | MKQIPFTLDMAKKIHGGLIKGEIKTREGGNVRGLIFNVNNEKWPLCAVVTKNNGAEIVLSFNIDGSAPPYMDTSVFDLTLYVED |
GUT_GENOME093198_00822 | 1-72 | MKNFDLAAAKRGATVCTREGWPVRILCFDCRMKRKGGISCPIVAEVFFKDADFKNGNWAVAEYDLNGKYRGE |
GUT_GENOME155461_00303 | 2-88 | ENKMVRVPFDVEMAKKITNGEVNGNIVTRNGRNARVICFDAHSDDNIVALIEDEKGVEYPKSYVSDGMTLLTGECDCDLMLEIPEYM |
GUT_GENOME252025_01438 | 4-92 | MEQFNLAEYLKNPDRQVVTRDGRAVRILCTDRNYDNYPVVALVQVQLRGGQLRDDPYCYTKDGLYLDHSENSKDLFFAPEKKVGWINIY |
GUT_GENOME020087_02312 | 2-77 | KPFDIELAKAGHPVCTRDGRAIRILCYDFITQEDTPIIALVRLSEKQEGIICYKADGRNFAPGMAELDLVMVPEKR |
GUT_GENOME141464_01490 | 1-100 | MSTQLIKVDFNIELAKKISAGEISGKITTRDGQDVEIIKFNKKGDYPIVALVGKDETIRCYSTNGDWDMKYNHGAGNNYAPLDLVLQIPQRERFKQGDII |
GUT_GENOME039096_00172 | 5-88 | QLVIVPFDLELSKEITNGEVEGRIITRVGNSVRILCYDVIGNEYKICGLVHYGKSEEPAVFTEKGLFYENQTDDLDLMIEIPEY |
GUT_GENOME090574_00780 | 3-77 | PFNLEEAKAGKSVCNRDGNDVRIICFDSRNNSNNPIVALHTEGDIEEIFFHDINGKSQYFSRFDLFMKSEKKEGW |
GUT_GENOME018155_00090 | 2-74 | KQFSLKEYLKNPNKKVVTRNGEKIRIICTDRKSTDFPIVALCTSITGQEACRSYDMNGKYNVKEESILDLMFA |
GUT_GENOME110646_01867 | 120-199 | EKKLSLKPFDLEAAKAGKPVCTRDGRKARIICFDAKRKDGKNIIALIPSKEYSGFEDLVAYPNNGNYHGGHENDGDLMML |
GUT_GENOME268572_01606 | 4-90 | KLVKVPFEVELAKKITNGECDGRIVTRDGKHARVVCWDKIDENSPIVALLKYREKEVVEIFMIDGRWHCDDGEESNLDLLLEIPEYM |
GUT_GENOME017743_00029 | 2-80 | KQFDLQEYLKNPEKKIVTRKGKSVRIICTNRLDDRYPVVAFIRLGDDYEDIWAFTKDGESGEHGENDYDLFFEPEKKTG |
GUT_GENOME019331_00021 | 2-73 | KQFSLEEYFKTPNRKVITRDGKEARIVCTDKKGRYPIIGLCQVLDDDEEIHSYTKSGKLFMDRDSNADLFFV |
GUT_GENOME228018_01050 | 2-92 | MGKKVVRIPFNLELAKKIMNGEIEGRILRNDGANVRIVSWNYISMSEKYPLACFVENGISEQSALYTNDGKYKSWEDKDIRNREDLSIEIT |
GUT_GENOME239076_00451 | 4-78 | PFDIDEARRIHSGESDSVIVTYRGDRAIIVQIDDDGDFPVMCIIIYGNGSQRVQRYTLKGKIRPITRSADDIFIY |
GUT_GENOME194788_00139 | 1-96 | MMETNMVIVPFDLELAKKIANGEIEGRITTRVGTDVRILVFDVKREVFPVSAVVEIDNNKEAVYAYTDKGLLNNINGRTEDFDLMLQVPEYMTFKD |
GUT_GENOME049701_00341 | 1-93 | MIQKNTIIPFSVELSKKIQNHDLPGRIITASGNMARIIDYTYKATEMLVMVMDKEGIERNRVYLISGEPVYGNDNLLIQVPEWATYKNGDVVC |
GUT_GENOME023059_00539 | 5-90 | KFKRVPFDLELAKKITKKEVKGRIVNGDGNEARIICWDKKCDCRKYPIIALVDRGDGEHIYTFTEKGIESIGYKTFKDLHIEVPTY |
GUT_GENOME052502_00112 | 4-92 | KMTKIPFDIELAKQITSGEIKGRVVTLEHLSARIVCWDARGDLPIVALINGDDEEYSCKYTEKGLVVDGYPSSDDLMLEVPEYITFKDG |
GUT_GENOME259276_01635 | 2-74 | KPFDFEKAKAGAPVCTREGFKARIICFDANNNRFPIVALLKDSNSSKEYPASFTKEGRFSDGEVDSSNDLLME |
GUT_GENOME018155_00098 | 1-80 | MEQFNLEKYLEDPNRKIITRDGKSVRIICTNRKSEDSPIIVLIQDSTNNYEEAYYYTIDGRWVIGGNYSMDLFFATEKQE |
GUT_GENOME275409_00015 | 4-95 | TKFKKIPFNLELAKKIMSKEVKGRIVSEDGRKVRIIYIDNESFTETTFLALYKDKDFNIERDYRLNKDGRYFRGERSDLDLHIEVPKYRDYS |
GUT_GENOME239469_02012 | 10-91 | LKNPTRPIVTRDGNSVRIICTDRDDEDEPIVALVRNEEGEAVMKFSENGEYFWMDEGPCPCDLFFAPEPKVKKVGWMNVCKY |
GUT_GENOME068924_00160 | 4-81 | EFDFEIYKNGDYDKVYLRNGKEARVLCDNGKGNSPMVVMIEDDKADDYIILRYNETGRRNINGQSGLDLMLSVKEREP |
GUT_GENOME046353_00215 | 1-71 | MKPFNLEEAKKGKPVTTKDGNPVRILCFDRDNFTDTPILALVNIEGVEMARYYHDDGSHNDQPSLDLVMAG |
GUT_GENOME022369_02164 | 1-82 | MKQFDLQEYLKKPQRKIVTKGGERARIICTDRNPCYPIVALVGEATEVCCYTKKGQYFGGATPSLKDLCFEPMKREGWLNIY |
GUT_GENOME195541_01513 | 2-91 | ESNLIKVPFYLEMAKRIINGEVKGRVVTREGNNVRILCWDKKDKTYHIAALVDDGDEENFKTYTNEGVWNTDKTCSIIKYDLMLEIPEYI |
GUT_GENOME217030_01627 | 1-77 | MKNFDLEAAKRGAAVCTRGGLPVEFSHITNSAYLPVRVLVYGDPKKLYSEIGAYLENGQMYPDIASEDDLMMRDDDY |
GUT_GENOME009523_01624 | 89-173 | NNITQISFDLKTAKKISNGEINGKIRTKGGSNAKIICWDCKGDWPICALIEIIGKNEEIPMQYYIDGIYSHAISESDYDLILEVS |
GUT_GENOME251418_00347 | 130-204 | LRPFDLQKAREGKPVCTRDGRKARIICFDAKGEHPIIALVTDGVQESPYNYTKEGYYYIEGVETMADLMMLPENK |
GUT_GENOME035859_00016 | 2-85 | EHKMVTIPFDLETAKKIRKGERLGQIVAEKGRNRAEIVYEDDLCNAYPLLVVIHSIPVLADWFSSTGKAFNDANRLLLEVPEYV |
GUT_GENOME262934_01441 | 31-101 | LAAEFDLEAAKRGAAVCTRDGSCVRIVCTDCRGEDPIVALVNYKIGEDVYTYNSRGRYYRSADSDLDLFMR |
GUT_GENOME093310_01983 | 1-72 | MRKFDLEAAKRGAAVCTRDGRNARIIAFDCKGCGRKPILALIDMGDWEQSASWTERGEIIEDFKDASDLMMR |
GUT_GENOME102303_01350 | 4-93 | VTIPFDLELAKKIQSGEVEGKIVNEEGLEYEIIKYDAEGDFPLIASFFNEKSNITEVDTFTAQGIYNMEKKSCLDLRLEVPEYLTWKEGD |
GUT_GENOME259590_01273 | 1-79 | MKDFDLSAAKAGAAVCIRSGENARIICYDKKDITPCLSNRIIYLADYDRYERIYSCRSDGRYTSCGEDDADLMMRDDDY |
GUT_GENOME110568_00197 | 241-319 | DDKLTFAPFDINLAKAGKPVRTRDGHKVRIICFDKKGGTPIVGLIQEDGFETVATFNESGLCLINGAYNLMMLPEKKEG |
GUT_GENOME233411_00800 | 179-274 | KKRATMTRTTFKKIPFDIELAKKITNKEAKGRIVTQEGKKVRIICWDKKPVDEEAHEYPIVAIIQNDYNGEMLQTFTAEGAACYPNYKSRYDLIIE |
GUT_GENOME245991_01097 | 6-90 | QVPFDIELAKKTTNGEIEGEIKTRGGLDARIICFDAISGDFPIVSLVKEFGIEEPFNYAQSGSAIKSMHSELDLVLYVPVYLRWK |
GUT_GENOME206585_01774 | 7-89 | KVPFDLEMAKKITNGELEGRVVTRDGKKARIVCWDKKSDSTYYIVALIDVNIMEKINTYTINGLEVEGLERDNDLMLEIPEHL |
GUT_GENOME158651_01574 | 28-99 | TMKAFDLTAAKAGAPVCTRGGVSARIICTDRHGCSGRIIALLDHGGREGIEFYYEDGRFGGYGNNERDLMMR |
GUT_GENOME048588_01279 | 1-72 | MREFDLSAAKAGVAVCTRNGAEVRIICFDRISTKFQIVGLRRNVDNEEGIVTFTTDGRRFFTSKDGGDLMMR |
GUT_GENOME221716_02680 | 4-85 | NLKEYLKNPKRKVVTRDGRDVKIIRIDKKGNYPVMGLFHGNFNEWCTYTKDGKQVIGEDSDADLFFASEKKEGWINLLRSSC |
GUT_GENOME153294_01690 | 3-100 | MEQKKVVVPFDLYLAKQIHGNNKEGRIITRTGGYVRNIVFNVKDENYPISAIIEVENDRELTASYTCNGCYRLNEKIKDPFDLMLEIPESLQFKPFDN |
GUT_GENOME219830_00766 | 2-93 | VQTKYKRIPFNIELAKKIAKGEMKGRITTLDGREARIVCWDKKPIEEYPIVILATNDCGSEMLYTCTEEGLIISCPSYKSCCNLILEIPTYH |
GUT_GENOME155012_01090 | 10-82 | KPFNLEEAKAGRPVCTRNGQEVRIICFDAKSENYPIVALVKEGYSTQEYLRTYTNEGEVCRNGLMHTLDLVMP |
GUT_GENOME203552_00659 | 1-92 | MKVPFDLEMAKKITSGEMEGKIVTRKGYPSKILMFDMANIDYPICATVLIPGVSEDCYHFAANGAHFKKFNPVSDFDLMLEIPEVNTFTPGM |
GUT_GENOME244014_00164 | 1-64 | MKDFDLEKALQGEPVMLRNGDKAFIFKNILGTPILDFKPDYPLIGMIHNHPVTQTWSLDGRISS |
GUT_GENOME079298_02231 | 3-77 | PFDIELAKQGKPVCNGYGMDVRILCYDLKDDKYPIAGAITDPETNRETLQKYTLDGYVIGDGANNNGDLFMKPEK |
GUT_GENOME258647_00301 | 1-76 | MKEFDLEQARAGKPVCTRHGNKARIICFDKRDSKHPIVALIDNKFSGESTRCYSIDGKDSISDQYDLFMAEERREA |
GUT_GENOME213587_00447 | 2-87 | EHKLVKVPFGVELAKKITNKECEGKIVTRSGRSARIVCFDMKSDSCIVAIIQDEFDEHVYSYPKDGCIILNKQSGSDLILEIPEYM |
GUT_GENOME017743_00045 | 1-76 | MKQFNLQAFLKNPTRPIVTRDGHAARIICTNRVDKTHSILALLFEDEDRDREEVYQYTSKGEYYPNAISPHDLCFA |
GUT_GENOME054660_00359 | 156-235 | LKEFNLEAAKAGKPIYTRDGRKARILCYDLKGVEEYPIVAAIETHDYLAENISVYDRNGRFDHDKENNNDLMILLEKKEG |
GUT_GENOME071371_00242 | 1-101 | MENPIIVPFDLNTARKIKSGEIEGSVLIDNIEIEFVYESKDCAGPYNSLFVRKDGYGISSIYANTEGCTIGGTTLELKVEAGAYFKKGDVLTSTNGYQFIY |
GUT_GENOME259745_00808 | 17-94 | MKPFNLEEAKVGKPVCTRNGKRVEIISFENPSNNNYPILAKVFFGKDDYEEFTFTESGTFFVADKESEADLMMTEDET |
GUT_GENOME153215_00705 | 5-94 | FIKIPFELELAKAISNGKAEGNIVTCNGRKARVVCWNYKSLSGTYPILALVENGIFEEPTLYTIDGRFKSWKSKGEQQKEDLMLEVPEYR |
GUT_GENOME272780_00343 | 59-145 | IQFKRVPFDLELAKKITNHEIKGQIVTEAGFNARIICFDRKCYSLVQLLALIKEYDYDYVVAYKLDGKALGDNVYGKNLHIEVPTYY |
GUT_GENOME025004_01033 | 1-66 | MKPFNLEEAKAGKPVRTWMSRYKVEIISFDDHQIPNMPILAKVFINNQSPVLFHFKEDGTHLLHNG |
GUT_GENOME175737_02426 | 3-93 | TKYVRVPFDVEMAKKITNGEVEGRVVTRDGRSVRILCFDRKGKSPIITLVLVDGNEEGFTPYLLDGRLKPEKEDRFDLMLEIPEYMTYKDR |
GUT_GENOME279626_00206 | 1-71 | MKDFDLAAAKRGARVCTRGGLQARILCFDLYPNENLVVAVLRQQGFETVEICRNNGENIHVDQSMDNLMMA |
GUT_GENOME260651_01040 | 1-75 | MREFDLSAAKAGAPVCTRDGMEARIICFDRVDLKYPIIALYKTESGIEHMRSFSPNGLQFDGVISAEDLMMRDDD |
GUT_GENOME276621_01816 | 4-74 | KDFNLNEALSGAKLRTRDGFPVTIYTFSRRFPAYPIVGVINYPDHDFVTTWTPQGRATKLHRAHGNDLMIC |
GUT_GENOME055703_00178 | 121-203 | EKKLNLKPFDLQKAREGKPVCTRNGRRARIICFDRKFYHDGYNYPIVAMVNDNDNELVHPYTQDGLLVGNMEDELDLMMLPEK |
GUT_GENOME259846_01246 | 1-73 | MKPFDIELAKAGYPVCTRDGRPVRILCFDRKSVDGYSILALVDEGDHESFIVCTSRGKFYKYGKDNPYDLFMS |
GUT_GENOME245702_00516 | 1-85 | MKPFDLELAKAGHPVCTRDGRRVKILSFHRKNKINKPIISLVEDGEEEIIMFHSNNGAALEDGDELMCDLMMSPEKKEGRMNIYK |
GUT_GENOME261201_00083 | 7-94 | NKFKPFNLEEAKAGKPVCTRGGHKAKIICFDARTFGNYPIIALIEDANNPIYEAAYYFSDKGEYKRGDICNMDLVMPLEEHEGWINIY |
GUT_GENOME245262_01104 | 2-92 | EKKFVKVPFDLEMAKKITSGDIEGRIVTDDGRNARIVDWNYKTTSNEPILAIISDRQSSYENTYSFNDKGIDKINDNNGISPLMLEVPEYL |
GUT_GENOME025902_00336 | 4-87 | TRFKRVPFNLELAKKITNNEIKGQIVTMDGRKAQIICFDWQSTGYPIAALVITNGGEDLHSFSEGGCFFIHHGHALDLYIELPI |
GUT_GENOME220271_01296 | 3-91 | QTRFKKVPFDIELAKKITNNEMKGRIVTRNGRQARIICFDLKNPVWGIIALVLNNINSEDVLEYQNNGCYSTNIGWHELDLLLEVPTYY |
GUT_GENOME234298_00468 | 4-85 | FNLEEYLKNPNQKVVTRDGRPVRIICTDRKHERPIIALIKEKDGTEETIHTYNTQGEFWANNEFSNLDLMFAPTKKEGWVNI |
GUT_GENOME275023_00016 | 2-93 | ETQMITIPFEVEKAKRIQAGEEPGKIVTRGGRNVRIVCWDRKTEHKYKIVSLLDTGKFELVLYNDKTGESDSFTESNLVLQVPEWTQYKEGD |
GUT_GENOME133801_00012 | 6-84 | ITIPFDVERAKRISNGEEEGKIVTRDGKEVRIVCFNARHNNYPIIGLIDEGDYESAESFSKNGSYSIEDGELDCDLFLK |
GUT_GENOME153176_04084 | 1-91 | METKMITIPFDVELAKEIQAGAKPGKIITKDGRDARIVCWDMESEMNYPYIILVKGEKFESVYQCNDDGKCKPSLNMLEIVLEIPEYAHHM |
GUT_GENOME119077_01192 | 24-98 | LKPFDLEASKAGAPVMTRDGRPVRILAFDVEGARYPVVAAVKTLDGKREAIQMYTESGEYSYVVAKHDYDLVMAR |
GUT_GENOME240232_02408 | 8-88 | IPFDLELAKKINNGERNGMIVTDGDNYRVEFVYHREESFPILGVIHTDHGIISDWFSNNGFGGKNYRLKLKVPEYTTFKDG |
GUT_GENOME069753_02105 | 19-108 | VPFEEKTAKAITIGAIPGAIVTRYGYEVRIIAWDRLSKDDKSSIVALVKDEAERIEHVFYYDNSGMSHLNDNRKLGLMMIVPKEMTYGDG |
GUT_GENOME274817_01192 | 132-218 | LKEFDLEAAKSGKPVCTRDGRKARIICFDRISGDDYYKIVACVTAFDGDFEEVLFYGIDGYIVDSQNPKDEDLMMLPEKRSGWINVK |
GUT_GENOME282039_01757 | 3-74 | QFNLEEYKKNPSRKVITRDGKSVRIVCTDMMGAIYPVLAVCKEDPTHESWNSYTTDGKLYTEGDTQNDLFFA |
GUT_GENOME133439_00161 | 179-254 | LKPFDLEAAKAGKLVCTRDGRKARIISFDRHGEDCPIIALVVDSKNAECEEVIDYTLDGICNENIINHNKYDLMMF |
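
The rows above can be sanity-checked programmatically. The following is a minimal sketch, not part of the original resource: it parses one `Protein | Range | AA |` row and checks that the span given in Range agrees with the length of the listed fragment. The names `Row`, `parse_row`, and `span_matches` are hypothetical, and the sketch assumes that Range is 1-based and inclusive, which the page does not state explicitly.

```python
from typing import NamedTuple


class Row(NamedTuple):
    """One table row: protein ID, range bounds, and the listed fragment."""
    protein: str
    start: int
    end: int
    seq: str


def parse_row(line: str) -> Row:
    """Parse a single 'Protein | Range | AA |' row into its fields."""
    # Drop the trailing pipe, then split the three cells on '|'.
    cells = [c.strip() for c in line.strip().strip("|").split("|")]
    protein, rng, seq = cells
    start, end = (int(x) for x in rng.split("-"))
    return Row(protein, start, end, seq)


def span_matches(row: Row) -> bool:
    """True if the range span equals the fragment length.

    Assumption: Range is 1-based and inclusive (not stated on the page).
    """
    return row.end - row.start + 1 == len(row.seq)


# Example using one row copied from the table above.
row = parse_row(
    "GUT_GENOME108107_00459 | 1-71 | "
    "MKEFDLEKAKVGHPVCTRDGKEARILCFDRIGHHPIVALVKEAGDETIFSYNKKGRFSNDGRGCMCDLFMK |"
)
print(row.protein, len(row.seq), span_matches(row))
```

Running the same check over every row would flag entries where the range and the fragment disagree, which may indicate a different range convention or a truncated sequence.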