UHGP-MC 101547


Information


Number of sequences (UHGP-50): 80
Average sequence length: 80±7 aa
Average transmembrane regions: 0.05
Low complexity (%): 1.66
Coiled coils (%): 0
Disordered domains (%): 5.03

Pfam dominant architecture: PF05869
Pfam % dominant architecture: 125
Pfam overlap: 0.17
Pfam overlap type: shifted

Downloads

Seeds: MC101547.fasta
Seeds (0.60 cdhit): MC101547_cdhit.fasta
MSA: MC101547_msa.fasta
HMM model: MC101547.hmm

Sequences list (filtered at 60% identity)

Protein Range AA
GUT_GENOME000443_01387 8-94 QELERHRNGITDSIRRWKHLSINGGSDPFWCDGCNMNLVRNHIIYHKQMMIEICDTENQDLPAEYYLPLPPVVPNGYMANAKQEERI
GUT_GENOME157319_01262 10-83 YAAALENDYARWYELFTKGGFDPSWADGSNLNLVRNHILYDKEQLAKQENSLLGLPEVYYRETPPEVDPGYMAP
GUT_GENOME040562_00384 15-94 DEIAEIRNEYKMWDWIAEHGCSDPFWTDGANMNIIRNHIIYYKKQLISKAEEENAPIPQEFYWALPPKVPNTFMVKHGQY
GUT_GENOME012428_01157 500-578 EPTVESLTAELERDYARWDDLLENGGSDPTWADGVNMNLVQGRIVANRRILTELCGDGMRPAILEREEPREMPNDYMAK
GUT_GENOME220812_00397 35-106 REYVRRETERWRDMFTHGCSDPAWPDGCNLNLTRNHIISALRGLQDLGEDISSKYIPPKVASGLMIPAGRWF
GUT_GENOME152310_01853 9-96 ENQIEFVASEIVKELERWEYLRKYGGCDPFWPDGVNMNLIRNHIISYKRQLEELCETDVLPDEYYLSIPPEVDDQYLAKNGQYFAVRK
GUT_GENOME252309_02376 186-264 MSKETLKERLEDSFCRWDKELLSGGSDPYYTDGQNMNLLRNHIISAKYDMKEAGEFPEIYHRKTPEELPEHFMVQAEKI
GUT_GENOME193870_00366 6-92 KTPQEQMQEAVAELVERYNRWQDLYKNGCFDPNYCDGVNLNFVRNHIYFAKRKIEKLVEEHKELSFPEEYEKIEIPQEVSNDYMANP
GUT_GENOME092637_01158 28-111 EDHRKYIQRELAHWHFLRDHGGQDPFWPDGVNMNLTRNHIIYARRQIVEICEENDWALPAEYFLPLPPEVPDGYMANLDQEKRV
GUT_GENOME063505_00030 10-100 ADALKELERWNAIRKNGSNDPFWPDGVNLNLVRNHIIYANRRLKELSSAPVQLSMFSDCERLIDDGSVDLVPLPPEAPNDFMARKGEILQG
GUT_GENOME252156_00320 12-100 KQSPEELLNEAVANHESSFERWQSYHDFGGQDPFWADGCNMNLIRNHIIYYKRQISELCEQNGLEIPKCMERELPPQMDVDYMAQPDKI
GUT_GENOME194265_03444 11-81 AEELKKDYECWFTRWKEGWSDPNYPDGTGINGARGRIVYDKKELEKAGGEMPEEYFRPLPPEVPESYMARP
GUT_GENOME212940_02998 4-77 DLREEYARWHKLYEQGGSDPFYSDGYGLTLVRNHIIYTKRQIEKELDEKDYPKEYYDDLPPEVDQEYIARADEI
GUT_GENOME232003_03991 70-148 SIAQSYEERLQELYDRWNLWRQTGALEAELSDGIYLNNLRKGIEAFLRQIEKALPEEHYPECYYSPLPPVMDESYMANE
GUT_GENOME071302_01623 9-95 TLEQQIKQLSKELVEELKHWQYLREHGCQDPFWADGVNMNLTRNHIIYYKMRLRELCPDGNLPEEYYLPTPPEVDDNYLARRNEYFE
GUT_GENOME066743_02128 11-91 AVECGRLLDAHARWQFINEHGCQDPFWPDGVNMNLVRNQMIIHRREMAALCEDNGLALPSVYYLPVPPEVPGNYMAKLDQQ
GUT_GENOME018258_01006 6-99 KTVTKEDLRQQIDDEFRRWDHIHLHGCSDPGWEDGINMNLVRNHIIYHYRQIAEIMDGIQMSLFAAAGFEPEQYGMRPIPPEFPMDWMCPNGDY
GUT_GENOME112658_01406 112-186 EWELIKFYSQLLAAVADWCIVFLKGAHGPKWTDGQELNYKRRRIEYQQEEMIAHGFFIPSEFADLPPEMDVNYMR
GUT_GENOME269869_00535 7-90 DKVKEYCQCIHREIEHWKDINQNGCNDPFWSDGCNMNLTRNHIIYYQSKIREACTENQLPLPEECYLSIPPEVDNNYMANLKQK
GUT_GENOME153238_01355 4-89 KEKSPEQQLKELCQEIKSEIDRWNNLKIQGGNDPFWEDGYNMNLTRNHVIYDKGKIKEICEENSLDLPEEYHLQTPPEVPKYYMAK
GUT_GENOME079750_01725 6-85 ARLIKKMMEADFERWNELYSKGGRDPLYADGANLNLVRNRIIYERMRCEAELQPDEYPKEYFEPIPPTVDNEYMARADEI
GUT_GENOME119307_00488 9-97 LAICAASLTREFNRWDEIYKNGTYDPFWPDGVNLNLVRNHILYYKRQIKELVDRDNEELSLFTSEYPDIYFRETPAEVPANYMARADEI
GUT_GENOME256299_00736 5-93 KKDTLQDAREGIQDSLNRWQHIRTYGCSDPAWEDGCNMNLCRNHMIYWRRKLIELCDSQEIGYPEEYYLPIPPEVPNQYMSDGADQRRV
GUT_GENOME265465_00453 9-84 WEEELRKSYQNWEYLSHFGGHDPGWPDGVNMNLVRNHIIYEKRMLIESKPDRLPDIYFRETPPEVDSNFMARKKEI
GUT_GENOME172949_02125 17-102 QIMDEVDREFRRWNLLADGGCQDPGWPDGVNMNLVRNHIIYWYGLLDERGLVEVQLSLFPGEDVARDRRPIPPEVPDRYMVRDGKY
GUT_GENOME246224_00748 7-80 IKQKMAEAFQHWEKLKTYGGNDWWWSDGWTMNEERKNIMDLRWRCEHELKPEEYPEEYNRPVPDLVDNDYVARP
GUT_GENOME136422_00830 7-80 YSKDFQDEYRRWEYLKDHGGSDPFWPDGCNMNLVRNHIIYLRKEVEENLSPGEYPEEYFKDLPHKVEEGYMVNP
GUT_GENOME081382_01368 5-89 KTPEELLMQYSAGILKSIEQYKSIIEHGCSDPSWPDGCNANLCRNHVLAYKRYILDICADNDLEIPQEYYLPTPPEQDNRFMADK
GUT_GENOME081016_00511 7-86 LEELVRELEKDYARWEQVYMAGSKDPFWPDGVNANLCRNHILCGKRRIRELYPDAEMPEIYYRPLPQELPAEYMARKEEL
GUT_GENOME043764_00496 45-119 MSSLESLCEDIRTNVTVWKRYRDKGGSDPSWSDGDNMNLCRNHIIDDRRRIEEIIARDGIEPPADLFSRLDNEFH
GUT_GENOME100569_02149 17-104 SAEERLKDAIASCEERHRHWHELYENGTSDPFWADGVNLNLVHNHIRYYNKIIKECCDELGQDYPEVYCREIPPKVDPSYIAQKDRIV
GUT_GENOME106452_00336 9-89 TERISEEAHEIRREIKHWRYLREHGGNDPQWPDGVNMNLTRNHVIYGKRRLEDLCAEAGVDLPGEYYLPVPPEVPNNYMAD
GUT_GENOME017499_00855 9-90 KIEKFAKQIVRELATWENYRVYGGQDPFYSDGVNMNLIRQHIISYKNDIRDLCAENNIDLPIEYYLPTPQKVNDDYMVKNNP
GUT_GENOME023928_00061 9-86 DLKECLEERFERWDRLKTEGGYDPFWADGSNMNMVRAQIKYYKGKIEKKFSDGNYPEIYYRETPPKTDTKYMAQPDKI
GUT_GENOME096448_02399 4-87 QTPESQIAEYALCIKKELEHWDHLYQIGGQDPFWPDGTGLNLTRNHIISYKMDIQKLCEEYALPVPPEADYPTPREVDNNYMAQ
GUT_GENOME148378_01726 9-88 IDEYRKMIVESVGRWNYIAEHGCSDPFWSDGCNMNLVRNHTIYAKRNIEKLCEENGLEFPIEYNIPLPPEMDDNYMARAD
GUT_GENOME126377_00201 7-83 LLEELKKAYAQWESLYKQGGSDPFYADGVNLNLVRNHILYFKRQIEETQPLYKNTELFQRELPPQVEDGYMARAEEI
GUT_GENOME242977_03453 14-94 ELESLIEQKFALWDDIFSNGHSDLSFIPDGMRLNTLRNDIKQLRARLERLFGTDLAGQLSLFGGEVKPERPLPDIMPANWN
GUT_GENOME104593_01917 9-82 LLKELSISFTRWDALYSEGGSDPFYSDGANLNLVRNHIIYYKRMIEETQPELMESDTYKRQTPKEVSNDYMAKA
GUT_GENOME000103_01056 5-91 QRSRKEQLAEVIRESHEQWKRLWENGGSDPFWTDGVNLNLVRNHIIYGRRLCEEELQEWDYPEEYYLPLPEKVPPNYMVNSDEIRRK
GUT_GENOME257773_04307 70-154 KRLADEFGKRLQILYDRWNMWKIRGCPEADVPDGEYLNRLRSGIEAMMRQIENTFVEADYPECYYAPLPPVMDVDYMANCQQIKE
GUT_GENOME230542_02293 9-77 NLLFHLSGDLLEWCRVFLKGGKSPTWTDGKELNYIRKKIIRQIAELEEKGFDVKSLVPLPPEMNESYMK
GUT_GENOME251663_02362 6-85 KDETVQELNTGFERWDILYNYGGNDPMWSDGVNLNLVRNHIIAYKKRIEETFSKEEYPDIYYRDTPLEVNDDYMANPNEI
GUT_GENOME080630_02953 9-87 EELEKSYRDWESTRETGCSDPFYDDSVNLNLLRNHIIYWKQELYKKYGEDRSKYPGSYFKELPPEVNKGFMVRAAEIRD
GUT_GENOME007982_00750 12-90 LDEYIEQCEKSYVRWQDVFDNGCFDPTWADGVNLNLVRNHILIAKKNISKLCEQEGFESPPILLREVPPKVDTDYMAKA
GUT_GENOME098721_02126 14-99 KEKLIKELPLEWARWQYIKDHGCRDPSWPDGMNMNLVRNHIIYDKQRLLELCEQLNESLPEEYYIQTPPEVDDNYMCKAGKYFKKR
GUT_GENOME005485_00084 56-158 VAADEQKIENRAIAETLGKELQSLHDIWNQVYACGGADFNCPDGVELNLLRSRMMSQRRRIRSCLLEADYPASYKLEIPDMITSQYMARREEIEAAAKKALQI
GUT_GENOME014858_01128 5-79 SELEKDFARWDHLYQHGGQDPCWADGVNLNLVRAHIIRRKQQIEEECPLFAGMGICTREVPPVVPEEYMARSDEI
GUT_GENOME031648_00713 18-97 AEELGKNIRDSFARWNYIYQNGAGDPLWEDGVNLELVRNHIIYDKRRCEEELLLEQYPPEYHMELPPVVDRLYMARADEI
GUT_GENOME158420_00676 13-98 EQIQANIDERFRAWDKIAQNGCSDPFWPDGVNLNLIRNHIIYYYGLLHERQAGQVQISLFDAPASVQERPIPPEVPDNYMVAGCEH
GUT_GENOME192986_00247 60-157 GATVITSKKQEQDKLLAELDHRMEVLFDRWLLWKEQGAPGFDATDGKYLNRLRSGLERLRQKMEACSSEEDYPENYYAPLPPKMEESYMANEEQMKQK
GUT_GENOME263375_00144 8-91 KDELNRLVYELMDDRDRWNEIYENGTSDPFYCDGTNLNLVRNHIIYGKRQIKAFVTEHPELTIPEEVNSVFDPDEVPDNYMANP
GUT_GENOME090354_01893 8-89 PDQKKRSKQLGKEIRDSRKRYEDLRVHGGSDPFWSDGANMNLCRNHVMYFRKQVETELDPENYPEEYFLEIPAEVSVHYMAD
GUT_GENOME167380_01957 5-85 EKVKTYLKETYQMWKKIYKEGSSDPHYTDGYLLNGCRTKILGYKEYLSTTHTVEDELLPEEFFMETPPEVPEEYMARKKEL
GUT_GENOME251854_02025 10-87 EAQLIREYEHWEYLKEHGGSDPNYDDGVNMNLTRNHIIYYKNELEDLYGEDMSKYPEVYFRELPPEVEGQYIARADEI
GUT_GENOME168295_01990 69-149 MAEEFEQELQVLHSRWEQIYKYGGQDFNGPDGFELNLLRERIISVKRQITRCLREQDYPSSYQLETPSMVDNRYMAQRQEI
GUT_GENOME104736_00953 10-83 YGAELRSEYARWRELRDRGGTDPFWPDGVNMNLVRNHILYWKRMIEENLQEAEYPEEYFLDIPPEMDNNFMAHP
GUT_GENOME096797_01144 18-95 KQYAAQIVHESKDWEYINKNGCNDPNWPDGENMNLVRNHILYFKNEIFEICVKNDIALPDAYFVPTPPEVDSYYMANL
GUT_GENOME172934_00305 4-81 QSLDEQLKQEYDRWNHLYTYGGQDPNWPDGCNMHLVRNHIIHIKKDMEEAGQLTETYYRELPPEVDREYMARADEIRE
GUT_GENOME091480_02138 12-90 RLKKLCKEIIFEREVWKHINEEGCNDPFWPDGTNMNLTRNHILSYRNEIAEICGECGFNLPEEYFLKVPPVVPDNYVAV
GUT_GENOME119280_01074 10-91 EQIKAETARLVHSFSEWEHMRTKGCSDPFWPDGTNMNLVRNHIINGKRRLEKLCVGIPLPAAYYIPTPEEVDENYMASHGEH
GUT_GENOME237094_00168 14-91 LAETVLLEAFARWNYIRTKGCRDPFYPDGENMNLVRQHIMSYKEELENLCKDRELPDSYYIPTPGVVNPDYMAPDGKH
GUT_GENOME122176_01154 24-104 NLLLEAIKSCEERYQHWQNLYEEGGSDPSYADGVSLSLVRNHIIWRKKQIETACQALGCGLPDIHSRELPPEVAPGYMAKA
GUT_GENOME155633_00742 11-89 QSLKEDFLRWEHMSQYGGSDPFWSDGVNMNLLRNHIIHNKRELEKIISSKEELPEIYYRETPPEADENYYVHSGRIRDE
GUT_GENOME270232_01648 15-96 LLGKELADSYSKWERRYRGEGTIDPFYADGYNLNMIRNQICIQKKRIEKELPRELYPDEYLLPIPDLMNNEYVALQDEWVEV
GUT_GENOME108684_00917 11-82 SQLITLYEHWEDLHDNGGHDPLYADGVNLNLVRNNILYYRDQLKELDYFQEIMERPVPPEMENTYMARAEEI
GUT_GENOME064200_00083 27-106 RQLLAENLEKAYERWDLLYKNGGSDPFWCDGTNLNLVRNHILYYREQCEEVLEPKDYPEAYHKELPPVVPDDYMAKKGEI
GUT_GENOME153286_01051 8-82 QIEKEMALDYARWDYLYTEGGSDPNWADGCNLNLIRNHIIYHKRKMEELQYFPDIYYRKLPPMVDNSYMAHADKI
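
Each row of the sequences list carries a protein identifier, a residue range, and the amino-acid string for that range. The sketch below shows one way such rows might be split into fields and sanity-checked. It assumes the identifier always has the form GUT_GENOME, six digits, an underscore, and five digits (inferred from the rows listed here, not from any documented specification), and that the range is 1-based and inclusive. The names `ROW`, `parse_row`, and `range_matches_length` are hypothetical helpers introduced for illustration.

```python
import re

# Assumed row layout: protein ID, range "start-end", then the residues.
# Whitespace between the fields is optional, so rows whose columns have
# run together are handled as well.
ROW = re.compile(r"^(GUT_GENOME\d{6}_\d{5})\s*(\d+)-(\d+)\s*([A-Z]+)$")


def parse_row(line):
    """Split one row into (protein_id, start, end, sequence)."""
    m = ROW.match(line.strip())
    if m is None:
        raise ValueError(f"unrecognised row: {line!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)


def range_matches_length(start, end, sequence):
    """True when the inclusive range spans exactly len(sequence) residues."""
    return end - start + 1 == len(sequence)


if __name__ == "__main__":
    row = ("GUT_GENOME194265_03444 11-81 "
           "AEELKKDYECWFTRWKEGWSDPNYPDGTGINGARGRIVYDKKELEKAGGEMPEEYFRPLPPEVPESYMARP")
    protein_id, start, end, seq = parse_row(row)
    print(protein_id, start, end, len(seq), range_matches_length(start, end, seq))
```

A length check of this kind is a cheap way to confirm that the identifier and range were split at the right position before the sequences are used downstream.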