UHGP-MC 98233


Information


Number of sequences (UHGP-50):
72
Average sequence length:
104±11 aa
Average transmembrane regions:
0.04
Low complexity (%):
6.05
Coiled coils (%):
0
Disordered domains (%):
4.49

Pfam dominant architecture:
PF01546
Pfam % dominant architecture:
6944
Pfam overlap:
0.33
Pfam overlap type:
reduced

Downloads

Seeds:
MC98233.fasta
Seeds (0.60 cdhit):
MC98233_cdhit.fasta
MSA:
MC98233_msa.fasta
HMM model:
MC98233.hmm

Sequences list (filtered 60 P.I.)

Protein Range AA
GUT_GENOME252545_01783192-289IACRRWELTFRGPGGHSWQAWGLPSPLHAAGRAVSVIAALQPPADPKTTLNVGTIHGGTSVNSIANETTVHVDIRSVSESLCEEYAIKIREIAEQAVA
GUT_GENOME115357_00371208-309TGSRRFRVTFSGPGGHSWSNFGIPNANHALGRAIAKIADLQVSESPKTTFSVGTVQGGSSINAIPAKAQLELDTRSINNDELNKLVERILPIFEEACEAENK
GUT_GENOME047501_00077172-282GMAVGSLRYKVTVKTTGGHSFADFGNINAIERLSAIITELYQYKVPAGSAKTTFNVGTISGGTTVNSIAAQAEMLYEMRSDDYACLMDAKAYFEKILEEFKAEGVDVTAEL
GUT_GENOME088558_02532198-308GIQTYEVNFHGIGGHACGMFGKVANPLHAAARAISKIADFQVPKEPMTTFAVTNFHAGSFESVHAIVPTAQIRFNFRSNSQEELEKLRDRIFAAIEEACKEETDRWGQDTI
GUT_GENOME199135_00915196-297ACKRQEIIVTGPGGHSWSDFGLASPINALGRAIARIADLHPVASPKTTYNVGLVSGGTSVNAIAHEARMHVDIRSTDTAERNKLEALIMDQIRLAVDEENAA
GUT_GENOME006036_00434201-306SHRYRFTITGPGGHSWTNFSECPSAVHAMCLAGAKVAHVKVPEGPRTTFTIGTIKGGTSVNTIAASCQVDVDMRSLDDGNLAALEAMIFKCFEEGVAEENAIWGVT
GUT_GENOME031993_0055728-152QSVSVSIIGPGGHSNGDYGNVNAVHAAARSIMLIEKSVPDAVVTAVTGGNSVNSIAAYANFRVLLEGDDAALKAKADKVKAAVEAKADKVKAAVEEGCKAENAFRGVKTGEVRDGLATDIRWTIK
GUT_GENOME029960_01841187-287TGSKRYKITYTAEGGHSFGAFGKPNPIHAMGRAIAAISELHGLKEPKTTFNVGVVSGGTSINSIPCETHMLVDIRSNSAQELEKLESTILDMARKAAQEET
GUT_GENOME199376_01276195-300GGHSFSAFGNTNAIEVIARLVQDIYALPVPQHAGSTTTYNVGLIEGGTSVNAIAQQASCVCEFRSDDEACMAQMERSFETLFARARARCLSLEVELLGTRPAGKNV
GUT_GENOME064313_00543183-279IRIRTTGGHSYADFGATNAIAVAADMIHDFYKIDTNTMPGKTTYNVGLIEGGTSVNTIAQDVKILYEYRSDRMMGLELMEREMSVIVDKYTDREDAR
GUT_GENOME192738_00592196-299MEITYTGTGGHAMRMFGYPNCNNAMGRAISKIAELKVPDEPLTTFNIGTVRGGTVTTAIPTESTMSIDVRSLSDEILQKVKKEIEMLCIRACEEENTYWNHPTE
GUT_GENOME044673_01756183-279SHRYQIDVQAEGGHSWGNFGNDNAIAIASGIIWNLYNLDVPASPKTTYNVGTIQGGSTVNSIAEHASFTIDLRSESKEELQRVDQKFHELLDAARDP
GUT_GENOME016892_01327210-307WLISYDGPGGHSFHKFRIAPSAIHAMGRAIAKFADLPVPEDPRTTYNLGTIKGGTIVNAVAAHCEAQLDLRSGGNDELLKLEKTVLACFDEALEEENR
GUT_GENOME006400_01085202-311SERFRITVRTQGGHSFNDFGRDNAIEKLAEVISLLYRIPVPENGRTTFNVGQIQGGTSVNTIAQEAHMLYEFRSDRKENLDYMHKQFDAVIEEARAAGIEIQTELLGLRP
GUT_GENOME034170_00654235-336YRVSFDGPGGHSYLGFGLPSAIHAMFRTGETLCNLEVPKDPKTTYTVGVVKGGTTVNAIAAHASMDIDMRSTENSSLCALRDKVLPCFEKAAADENAHWGVT
GUT_GENOME195585_01099346-448DVTYTGPGGHAYLAFGTPSAVQAMGRAIAALSDIQVPETPKTTFTVSLANGGQAIHGIAQSANFKINMRSDSAEVLAQMEQQALAIFQQGADAENARWQKPGA
GUT_GENOME095248_02340240-342HEVTFKGPGGHSYAAFGEVPSAIHGTGRAIAKIAEIRTPKSPKTTFTVGTVGGGTSVNTIAPDARMAVDIRSDEMAPLLATEKQVLAAVDSAMTEENARWGVN
GUT_GENOME023794_0017927-136QTLAVTILGPGGHSNGDYGNTNALHAGARAVLLIEKAVPTAMISDMKGGVSVNAIAADCHFLVTVADDDQALKVKKAVDEGVAAENAFRGVKPGQTRNGLQIDVRAEVKA
GUT_GENOME286511_00940211-306VEYSGPGGHSYLNFGQYPSATQALCRAGALLANLQVPAEPRTTFTLATLNGGSSVNAIAEHAECEIDLRSEDAQMLDGLVQEVLPLFAFGAQLENQ
GUT_GENOME041792_00198196-285TTCGGHSFGAFGKPNAIHHMARLIGRLYQQPLPQRHESKTTYNVGTIQGGTSVNTIAQQVEMTYEYRSDNRDCLQQMRKQFLDMVDEANG
GUT_GENOME096283_00281200-300SYRYKVTYKGPGGHSFGAFGLPSAIHALGRAIAEIADIQTKADPKTTFTVGEISGGTSVNAIAAEAHMYVDLRSNDPQELVKLEQQFLTIVEKACNDENAR
GUT_GENOME138466_01480580-713ALVGSFAASVAMADVSSVLNVSIQGPGGHSNGAYGNTNAVHAAARAILEIEKAIPSQKQCVVANFNGGNSVNSIAADGHFRVSLRAKDDKEMADLKQKVEAAVKKGVDAENAFRNAKPGDLTGGVPAQVVYKIK
GUT_GENOME126022_00477180-269AVGSHRYAVHVVTPGGHSFANFGEKNAIAAASAIIEEFYGLRVPHTPKTTYNVGTIQGGRTVNAIAADVEFTVDMRSVSMTELQKLDAAF
GUT_GENOME092065_02666189-291AIASMVYHVTYRGVGGHALGMFGIPNANNAMGRAIAKISDLQVPKKPVTVFNVGIARGGTSTNGIPTKSELMIDMRSMQDAELDRLADRIAACCREACEEENA
GUT_GENOME008300_00883199-305LYGAIGSHRWRFTVKGPGGHSYGAFGVTPSAIHAICIAGAKVAYLKAPDEPKTSFTIGTIRGGTSVNTIAAECSVDVDIRSLAMEPMLALEKEIREAFAAGVEEENR
GUT_GENOME116081_01900213-317GGTGSHRYEVTFRGPGGHSFGAFGLVNPIFAMGRAISYISELRTPREPKTTFSVGVVNGGTSINAIAYECSMLVDIRSNGLKELEELDAKLQECIKRAVEDENNR
GUT_GENOME237348_01556192-298SSRYRMIFRGPGGHSFADFGRPSAIHAMGRAIAKIADFRVPETPKTTFNAGVVNGGTSVNTIAAESSFLLDIRSDSPEELARLEGEARAAVQAAVEEENRRGTQQGA
GUT_GENOME086180_01250184-272ALGSVRYKITITGLGGHSYNDFGRKNAIVAMSELIRLFYQINPEEYSGAVTYNVGRIQGGTTVNAIAEECQADFEIRSDDSESLGKLQE
GUT_GENOME248214_0105734-141FEGPGGHSNGNYGNVNAVHAGARAVMELTRTAPDAVIAGMKGGNSVNSIASDCTFTVTAAGDEKAIAAAKSAVDAAVQKAVKAENDFRGVKAGDTVRGAPAEIRATVK
GUT_GENOME239698_00378296-399FRIIFEGPGGHSLHKFGIVGSAIHGLSRAIVRVDELKVPTDPKCTFNFGVIKGGTSVNAIAARAEAELDIRSFNQPALEAFVKTVLETIEAACDEENMRWGLEG
GUT_GENOME044199_00758194-292SARYRVTVRTEGGHSFGAFGNRNAIHLLASMISTLYAVKPPAEGDSKTTYNVGTIEGGTSVNTIAQEASMLFEYRSDSRACLEKMKAMFESIVAAYRSM
GUT_GENOME269290_00661198-306ANATGMIDLEVTIEGPGGHAWTACERVSAIHTAGLAIAEIAKIVPPKDPKTTLTVSLIEGGQAIHAIAQKAVFKINARSNSQAALNEIEEQIYHAIRRGVEIEQKKEGG
GUT_GENOME103749_01491306-418SRRYRISVHSEGGHSYKDFGKENAIVRLAEVITELYQVALPQNARVTYNVGRIEGGTTVNTIAQEASMLYEFRSESEEAMQFMENFLQNTLQRYQKKGVRIEAEVLGIRPGNG
GUT_GENOME024043_00777171-255VTVTAPGGHSFHAFGQKGAIETAAEIIHEIYRIKPCEGTQTTYNVGMIEGGTSVNTIAQKCTFLAEVRSDDAQALQKIDGQLCNI
GUT_GENOME032138_02004233-343GSHSWRFIVEGPGGHSWGAFGAVPSATHAVARATAEIADFEVPSDPRTSFTVGMITGGTSVTTIAPRCEAEVDFRSLSNDELLKIEKRIFRAFEKAVDAENARWGMTAPEK
GUT_GENOME158633_00031209-315VSVRAEGGHSYSKFGNSNAIVTLSEIICAMYRLEVPARPGRTTYNVGVVEGGTSVNTIAQQAHMLCEIRSDNRQSLEEMKASFEQVIASFRQRGCDVSLELLGLRPC
GUT_GENOME015937_0124218-134AGGAMAQTYVVNITAPGGHSFGSYGNTNAVHAAANAIREIARTVPEAVVSDFNGGATVNAIATDATFRVTVPDSKDLRTQIEKAVETGVKRENDFRGVKAGDLAAGGTPAHVRYTIK
GUT_GENOME281508_0194914-133LTMTMAASAAVVTFEFTGPGGHSLRAYGNTSALHAAARAALLVQKAVPEAVISDLNGGNSVNSIAADASFRVTLTGDAEKKLDQVKAAVQKGVDQENAFRNVKKGDMVKGAPAEVQWKLK
GUT_GENOME229823_0104613-142AFSGLANASMTDVLTVNLVGPGGHSFRDYGYTNAIHAGARAVAEIQKTIVDPNKYEIYGFNGGVSVNAIAGDAQFKVLLKANDKSELNTLKQQVISAVQKGVKAENDFRGVKTGSLTPDGASAEIQCTIK
GUT_GENOME122014_00646217-304AVGSLRYEISIQTKGGHSYNDFGNDNAIEAMAYLIQKLYQLKVPAHGKNTYNVGTIQGGTSVNTIAQDCKCLVEYRSDKKEYLEQMKE
GUT_GENOME088945_01563173-298ICCIHHCVAVSGDAVKITVHGKDAHGSLPENGVDAIHIAAHIILALEELTARELPMEQDSIVLVGRIAGGTTCNSVAGEAVLEVSIRTNSPKEREFLLKRVEEIAVGVAATFRGCATVEHLYGVPP
GUT_GENOME007523_00733206-307GSRRYKIEFDGIGGHSYKMFGIAPSANHALCRAVALFADIVPPVNPFTTFTVGTVNGGTSINSIASHAEAQLDMRSNAEESLTELESRIMPCFKEGARLENE
GUT_GENOME189606_01576176-293VTARAVGSQRYKVTVKTEGGHSYGKFGNKNAIYYLSSMIKDLYTKEVPKTEKTTYNVGTIQGGTTVNSIAQEATMLYEFRSVDRDCLKEMEEFFYSVIEMYRHMGIAVEVEVKGIRPC
GUT_GENOME281463_01505180-261QKYKVHIHAQGGHAYLNFGNPSAVVCAASMINEIYALPLPEGCKSSYNVGLISGGTTVNAIPQTVDMAVELRSEIEANLNTL
GUT_GENOME207678_02441202-298YAVKYEGPGGHAYGAFGTPSPLHAAARSIAKIAAIKPPVVPKTTYTVSFVKGGHAIHAIAQEASFTINMRSDSADELETLTEQVRQCIEAGADEENG
GUT_GENOME077989_00595174-293LCCSSAVGSYRYKITCKTPGGHSYANFGDPSAIQLLCGLVNELYQIQPPVRSRTTYNVGRIEGGTTVNSIAQQASMLYEFRSTAQDCLEEMEEKFRRAVAHWNGRGGDFEVELLGIRPGN
GUT_GENOME015072_00917208-312FAFEGPGGHAWTAFGEPSAVHAACRAAAKLADLPLTVNPKTTLTVSLIEGGQAVHAIASKASFKVNARSNSQEELEKLQRVMLDCVKEAVADENRRWGKTDVVKL
GUT_GENOME264140_02031181-272VGSRRYRVSIEAQGGHSWMHFGHTSAIAAAASLISKLYALPVPSAPKTTYNVGTICGGTTINSIAASAEFTVDLRSESHDELERLDKAFLRL
GUT_GENOME056137_00058206-302YRVTFHGPGGHSYMHFGTPNAIHAMGRAIAKIDDIQVPDRPKTTFAVGVVEGGTAITAIASEATMFVDLRSEDSAEMKKLEQGFQQAVDAAVQEENR
GUT_GENOME096283_00273204-305VGSKRYSITFKGPGGHSYGAFGTPSPIHALTRAGAKIADLQVPNEPKTTFNIGLISGGTSVNTISETANMVIDLRSSSQEQLLSLESTVLAILNGAVEEENK
GUT_GENOME225912_02235186-294YKVTVKAQGGHSYADFGNENAIITMSSLIQELYKIQVPKEAKTTYNVGCIEGGTTINSIAQECSILYEYRSSSDKCLKTMENNFYKIINDFKEKGKMIEIEVLGVRPGM