UHGP-MC 101547
Information
- Number of sequences (UHGP-50):
- 80
- Average sequence length:
- 80±7 aa
- Average transmembrane regions:
- 0.05
- Low complexity (%):
- 1.66
- Coiled coils (%):
- 0
- Disordered domains (%):
- 5.03
- Pfam dominant architecture:
- PF05869
- Pfam % dominant architecture:
- 125
- Pfam overlap:
- 0.17
- Pfam overlap type:
- shifted
Downloads
- Seeds:
- MC101547.fasta
- Seeds (0.60 cdhit):
- MC101547_cdhit.fasta
- MSA:
- MC101547_msa.fasta
- HMM model:
- MC101547.hmm
Sequences list (filtered 60 P.I.)
Protein | Range | AA |
---|---|---|
GUT_GENOME000443_01387 | 8-94 | QELERHRNGITDSIRRWKHLSINGGSDPFWCDGCNMNLVRNHIIYHKQMMIEICDTENQDLPAEYYLPLPPVVPNGYMANAKQEERI |
GUT_GENOME157319_01262 | 10-83 | YAAALENDYARWYELFTKGGFDPSWADGSNLNLVRNHILYDKEQLAKQENSLLGLPEVYYRETPPEVDPGYMAP |
GUT_GENOME040562_00384 | 15-94 | DEIAEIRNEYKMWDWIAEHGCSDPFWTDGANMNIIRNHIIYYKKQLISKAEEENAPIPQEFYWALPPKVPNTFMVKHGQY |
GUT_GENOME012428_01157 | 500-578 | EPTVESLTAELERDYARWDDLLENGGSDPTWADGVNMNLVQGRIVANRRILTELCGDGMRPAILEREEPREMPNDYMAK |
GUT_GENOME220812_00397 | 35-106 | REYVRRETERWRDMFTHGCSDPAWPDGCNLNLTRNHIISALRGLQDLGEDISSKYIPPKVASGLMIPAGRWF |
GUT_GENOME152310_01853 | 9-96 | ENQIEFVASEIVKELERWEYLRKYGGCDPFWPDGVNMNLIRNHIISYKRQLEELCETDVLPDEYYLSIPPEVDDQYLAKNGQYFAVRK |
GUT_GENOME252309_02376 | 186-264 | MSKETLKERLEDSFCRWDKELLSGGSDPYYTDGQNMNLLRNHIISAKYDMKEAGEFPEIYHRKTPEELPEHFMVQAEKI |
GUT_GENOME193870_00366 | 6-92 | KTPQEQMQEAVAELVERYNRWQDLYKNGCFDPNYCDGVNLNFVRNHIYFAKRKIEKLVEEHKELSFPEEYEKIEIPQEVSNDYMANP |
GUT_GENOME092637_01158 | 28-111 | EDHRKYIQRELAHWHFLRDHGGQDPFWPDGVNMNLTRNHIIYARRQIVEICEENDWALPAEYFLPLPPEVPDGYMANLDQEKRV |
GUT_GENOME063505_00030 | 10-100 | ADALKELERWNAIRKNGSNDPFWPDGVNLNLVRNHIIYANRRLKELSSAPVQLSMFSDCERLIDDGSVDLVPLPPEAPNDFMARKGEILQG |
GUT_GENOME252156_00320 | 12-100 | KQSPEELLNEAVANHESSFERWQSYHDFGGQDPFWADGCNMNLIRNHIIYYKRQISELCEQNGLEIPKCMERELPPQMDVDYMAQPDKI |
GUT_GENOME194265_03444 | 11-81 | AEELKKDYECWFTRWKEGWSDPNYPDGTGINGARGRIVYDKKELEKAGGEMPEEYFRPLPPEVPESYMARP |
GUT_GENOME212940_02998 | 4-77 | DLREEYARWHKLYEQGGSDPFYSDGYGLTLVRNHIIYTKRQIEKELDEKDYPKEYYDDLPPEVDQEYIARADEI |
GUT_GENOME232003_03991 | 70-148 | SIAQSYEERLQELYDRWNLWRQTGALEAELSDGIYLNNLRKGIEAFLRQIEKALPEEHYPECYYSPLPPVMDESYMANE |
GUT_GENOME071302_01623 | 9-95 | TLEQQIKQLSKELVEELKHWQYLREHGCQDPFWADGVNMNLTRNHIIYYKMRLRELCPDGNLPEEYYLPTPPEVDDNYLARRNEYFE |
GUT_GENOME066743_02128 | 11-91 | AVECGRLLDAHARWQFINEHGCQDPFWPDGVNMNLVRNQMIIHRREMAALCEDNGLALPSVYYLPVPPEVPGNYMAKLDQQ |
GUT_GENOME018258_01006 | 6-99 | KTVTKEDLRQQIDDEFRRWDHIHLHGCSDPGWEDGINMNLVRNHIIYHYRQIAEIMDGIQMSLFAAAGFEPEQYGMRPIPPEFPMDWMCPNGDY |
GUT_GENOME112658_01406 | 112-186 | EWELIKFYSQLLAAVADWCIVFLKGAHGPKWTDGQELNYKRRRIEYQQEEMIAHGFFIPSEFADLPPEMDVNYMR |
GUT_GENOME269869_00535 | 7-90 | DKVKEYCQCIHREIEHWKDINQNGCNDPFWSDGCNMNLTRNHIIYYQSKIREACTENQLPLPEECYLSIPPEVDNNYMANLKQK |
GUT_GENOME153238_01355 | 4-89 | KEKSPEQQLKELCQEIKSEIDRWNNLKIQGGNDPFWEDGYNMNLTRNHVIYDKGKIKEICEENSLDLPEEYHLQTPPEVPKYYMAK |
GUT_GENOME079750_01725 | 6-85 | ARLIKKMMEADFERWNELYSKGGRDPLYADGANLNLVRNRIIYERMRCEAELQPDEYPKEYFEPIPPTVDNEYMARADEI |
GUT_GENOME119307_00488 | 9-97 | LAICAASLTREFNRWDEIYKNGTYDPFWPDGVNLNLVRNHILYYKRQIKELVDRDNEELSLFTSEYPDIYFRETPAEVPANYMARADEI |
GUT_GENOME256299_00736 | 5-93 | KKDTLQDAREGIQDSLNRWQHIRTYGCSDPAWEDGCNMNLCRNHMIYWRRKLIELCDSQEIGYPEEYYLPIPPEVPNQYMSDGADQRRV |
GUT_GENOME265465_00453 | 9-84 | WEEELRKSYQNWEYLSHFGGHDPGWPDGVNMNLVRNHIIYEKRMLIESKPDRLPDIYFRETPPEVDSNFMARKKEI |
GUT_GENOME172949_02125 | 17-102 | QIMDEVDREFRRWNLLADGGCQDPGWPDGVNMNLVRNHIIYWYGLLDERGLVEVQLSLFPGEDVARDRRPIPPEVPDRYMVRDGKY |
GUT_GENOME246224_00748 | 7-80 | IKQKMAEAFQHWEKLKTYGGNDWWWSDGWTMNEERKNIMDLRWRCEHELKPEEYPEEYNRPVPDLVDNDYVARP |
GUT_GENOME136422_00830 | 7-80 | YSKDFQDEYRRWEYLKDHGGSDPFWPDGCNMNLVRNHIIYLRKEVEENLSPGEYPEEYFKDLPHKVEEGYMVNP |
GUT_GENOME081382_01368 | 5-89 | KTPEELLMQYSAGILKSIEQYKSIIEHGCSDPSWPDGCNANLCRNHVLAYKRYILDICADNDLEIPQEYYLPTPPEQDNRFMADK |
GUT_GENOME081016_00511 | 7-86 | LEELVRELEKDYARWEQVYMAGSKDPFWPDGVNANLCRNHILCGKRRIRELYPDAEMPEIYYRPLPQELPAEYMARKEEL |
GUT_GENOME043764_00496 | 45-119 | MSSLESLCEDIRTNVTVWKRYRDKGGSDPSWSDGDNMNLCRNHIIDDRRRIEEIIARDGIEPPADLFSRLDNEFH |
GUT_GENOME100569_02149 | 17-104 | SAEERLKDAIASCEERHRHWHELYENGTSDPFWADGVNLNLVHNHIRYYNKIIKECCDELGQDYPEVYCREIPPKVDPSYIAQKDRIV |
GUT_GENOME106452_00336 | 9-89 | TERISEEAHEIRREIKHWRYLREHGGNDPQWPDGVNMNLTRNHVIYGKRRLEDLCAEAGVDLPGEYYLPVPPEVPNNYMAD |
GUT_GENOME017499_00855 | 9-90 | KIEKFAKQIVRELATWENYRVYGGQDPFYSDGVNMNLIRQHIISYKNDIRDLCAENNIDLPIEYYLPTPQKVNDDYMVKNNP |
GUT_GENOME023928_00061 | 9-86 | DLKECLEERFERWDRLKTEGGYDPFWADGSNMNMVRAQIKYYKGKIEKKFSDGNYPEIYYRETPPKTDTKYMAQPDKI |
GUT_GENOME096448_02399 | 4-87 | QTPESQIAEYALCIKKELEHWDHLYQIGGQDPFWPDGTGLNLTRNHIISYKMDIQKLCEEYALPVPPEADYPTPREVDNNYMAQ |
GUT_GENOME148378_01726 | 9-88 | IDEYRKMIVESVGRWNYIAEHGCSDPFWSDGCNMNLVRNHTIYAKRNIEKLCEENGLEFPIEYNIPLPPEMDDNYMARAD |
GUT_GENOME126377_00201 | 7-83 | LLEELKKAYAQWESLYKQGGSDPFYADGVNLNLVRNHILYFKRQIEETQPLYKNTELFQRELPPQVEDGYMARAEEI |
GUT_GENOME242977_03453 | 14-94 | ELESLIEQKFALWDDIFSNGHSDLSFIPDGMRLNTLRNDIKQLRARLERLFGTDLAGQLSLFGGEVKPERPLPDIMPANWN |
GUT_GENOME104593_01917 | 9-82 | LLKELSISFTRWDALYSEGGSDPFYSDGANLNLVRNHIIYYKRMIEETQPELMESDTYKRQTPKEVSNDYMAKA |
GUT_GENOME000103_01056 | 5-91 | QRSRKEQLAEVIRESHEQWKRLWENGGSDPFWTDGVNLNLVRNHIIYGRRLCEEELQEWDYPEEYYLPLPEKVPPNYMVNSDEIRRK |
GUT_GENOME257773_04307 | 70-154 | KRLADEFGKRLQILYDRWNMWKIRGCPEADVPDGEYLNRLRSGIEAMMRQIENTFVEADYPECYYAPLPPVMDVDYMANCQQIKE |
GUT_GENOME230542_02293 | 9-77 | NLLFHLSGDLLEWCRVFLKGGKSPTWTDGKELNYIRKKIIRQIAELEEKGFDVKSLVPLPPEMNESYMK |
GUT_GENOME251663_02362 | 6-85 | KDETVQELNTGFERWDILYNYGGNDPMWSDGVNLNLVRNHIIAYKKRIEETFSKEEYPDIYYRDTPLEVNDDYMANPNEI |
GUT_GENOME080630_02953 | 9-87 | EELEKSYRDWESTRETGCSDPFYDDSVNLNLLRNHIIYWKQELYKKYGEDRSKYPGSYFKELPPEVNKGFMVRAAEIRD |
GUT_GENOME007982_00750 | 12-90 | LDEYIEQCEKSYVRWQDVFDNGCFDPTWADGVNLNLVRNHILIAKKNISKLCEQEGFESPPILLREVPPKVDTDYMAKA |
GUT_GENOME098721_02126 | 14-99 | KEKLIKELPLEWARWQYIKDHGCRDPSWPDGMNMNLVRNHIIYDKQRLLELCEQLNESLPEEYYIQTPPEVDDNYMCKAGKYFKKR |
GUT_GENOME005485_00084 | 56-158 | VAADEQKIENRAIAETLGKELQSLHDIWNQVYACGGADFNCPDGVELNLLRSRMMSQRRRIRSCLLEADYPASYKLEIPDMITSQYMARREEIEAAAKKALQI |
GUT_GENOME014858_01128 | 5-79 | SELEKDFARWDHLYQHGGQDPCWADGVNLNLVRAHIIRRKQQIEEECPLFAGMGICTREVPPVVPEEYMARSDEI |
GUT_GENOME031648_00713 | 18-97 | AEELGKNIRDSFARWNYIYQNGAGDPLWEDGVNLELVRNHIIYDKRRCEEELLLEQYPPEYHMELPPVVDRLYMARADEI |
GUT_GENOME158420_00676 | 13-98 | EQIQANIDERFRAWDKIAQNGCSDPFWPDGVNLNLIRNHIIYYYGLLHERQAGQVQISLFDAPASVQERPIPPEVPDNYMVAGCEH |
GUT_GENOME192986_00247 | 60-157 | GATVITSKKQEQDKLLAELDHRMEVLFDRWLLWKEQGAPGFDATDGKYLNRLRSGLERLRQKMEACSSEEDYPENYYAPLPPKMEESYMANEEQMKQK |
GUT_GENOME263375_00144 | 8-91 | KDELNRLVYELMDDRDRWNEIYENGTSDPFYCDGTNLNLVRNHIIYGKRQIKAFVTEHPELTIPEEVNSVFDPDEVPDNYMANP |
GUT_GENOME090354_01893 | 8-89 | PDQKKRSKQLGKEIRDSRKRYEDLRVHGGSDPFWSDGANMNLCRNHVMYFRKQVETELDPENYPEEYFLEIPAEVSVHYMAD |
GUT_GENOME167380_01957 | 5-85 | EKVKTYLKETYQMWKKIYKEGSSDPHYTDGYLLNGCRTKILGYKEYLSTTHTVEDELLPEEFFMETPPEVPEEYMARKKEL |
GUT_GENOME251854_02025 | 10-87 | EAQLIREYEHWEYLKEHGGSDPNYDDGVNMNLTRNHIIYYKNELEDLYGEDMSKYPEVYFRELPPEVEGQYIARADEI |
GUT_GENOME168295_01990 | 69-149 | MAEEFEQELQVLHSRWEQIYKYGGQDFNGPDGFELNLLRERIISVKRQITRCLREQDYPSSYQLETPSMVDNRYMAQRQEI |
GUT_GENOME104736_00953 | 10-83 | YGAELRSEYARWRELRDRGGTDPFWPDGVNMNLVRNHILYWKRMIEENLQEAEYPEEYFLDIPPEMDNNFMAHP |
GUT_GENOME096797_01144 | 18-95 | KQYAAQIVHESKDWEYINKNGCNDPNWPDGENMNLVRNHILYFKNEIFEICVKNDIALPDAYFVPTPPEVDSYYMANL |
GUT_GENOME172934_00305 | 4-81 | QSLDEQLKQEYDRWNHLYTYGGQDPNWPDGCNMHLVRNHIIHIKKDMEEAGQLTETYYRELPPEVDREYMARADEIRE |
GUT_GENOME091480_02138 | 12-90 | RLKKLCKEIIFEREVWKHINEEGCNDPFWPDGTNMNLTRNHILSYRNEIAEICGECGFNLPEEYFLKVPPVVPDNYVAV |
GUT_GENOME119280_01074 | 10-91 | EQIKAETARLVHSFSEWEHMRTKGCSDPFWPDGTNMNLVRNHIINGKRRLEKLCVGIPLPAAYYIPTPEEVDENYMASHGEH |
GUT_GENOME237094_00168 | 14-91 | LAETVLLEAFARWNYIRTKGCRDPFYPDGENMNLVRQHIMSYKEELENLCKDRELPDSYYIPTPGVVNPDYMAPDGKH |
GUT_GENOME122176_01154 | 24-104 | NLLLEAIKSCEERYQHWQNLYEEGGSDPSYADGVSLSLVRNHIIWRKKQIETACQALGCGLPDIHSRELPPEVAPGYMAKA |
GUT_GENOME155633_00742 | 11-89 | QSLKEDFLRWEHMSQYGGSDPFWSDGVNMNLLRNHIIHNKRELEKIISSKEELPEIYYRETPPEADENYYVHSGRIRDE |
GUT_GENOME270232_01648 | 15-96 | LLGKELADSYSKWERRYRGEGTIDPFYADGYNLNMIRNQICIQKKRIEKELPRELYPDEYLLPIPDLMNNEYVALQDEWVEV |
GUT_GENOME108684_00917 | 11-82 | SQLITLYEHWEDLHDNGGHDPLYADGVNLNLVRNNILYYRDQLKELDYFQEIMERPVPPEMENTYMARAEEI |
GUT_GENOME064200_00083 | 27-106 | RQLLAENLEKAYERWDLLYKNGGSDPFWCDGTNLNLVRNHILYYREQCEEVLEPKDYPEAYHKELPPVVPDDYMAKKGEI |
GUT_GENOME153286_01051 | 8-82 | QIEKEMALDYARWDYLYTEGGSDPNWADGCNLNLIRNHIIYHKRKMEELQYFPDIYYRKLPPMVDNSYMAHADKI |