UHGP-MC 1116


Information


Number of sequences (UHGP-50):
51
Average sequence length:
198±18 aa
Average transmembrane regions:
0.11
Low complexity (%):
2.31
Coiled coils (%):
0
Disordered domains (%):
11.31

Pfam dominant architecture:
PF03067
Pfam % dominant architecture:
9020
Pfam overlap:
0.88
Pfam overlap type:
equivalent

Downloads

Seeds:
MC1116.fasta
Seeds (0.60 cdhit):
MC1116_cdhit.fasta
MSA:
MC1116_msa.fasta
HMM model:
MC1116.hmm

Sequences list (filtered 60 P.I.)

Protein Range AA
GUT_GENOME096291_0132425-173ALPASAHGWVTDPPSRQANCATGATPFDCGSVKYEPQSVEAPKGSMQCSGGNAAFSILDDNSKAWPVKNVGSSVNIQWKLTANHATSLWEYFVDGKLHQTFDQGGAQPSPTISHTLTNLPAGKHTILARWNVSNTVNAFYNCIDVFVGP
GUT_GENOME096384_0427611-204AGLALMAASSLSYAEQAKPFHGYVSNPPSRAFLCTSQAGNLNKNCGAIQYEPQSLEGPKGFPANGPADGHIASAGHDNYAQLDEQSPTRWHKVELNTGKQSFTWTLTAAHKTERWRFFITKKDWDPSKKLTRAQFDLDKPICDQDGKGEVPANSITIKDCTIPSDYKGYHVILGVWDIADTGNAFYQVIDTDIK
GUT_GENOME151655_0055264-275VDAVPAPRHGTSEMPPSRQYYCYSQQDYGGGSGGIKNAACKAAYEAAGSENWQRERLFNNWNAYSQNAPKDQWQTKVPDEQLCSAGIENYKSINLPSADWYTTELEVKEGRVALDYKASQMHDPSEFTVYLSKPDFDPATSKLKWSDLQESPEVKFVGIPDGTKAVPGHYKLDVKLPEGYTTGKKAIIYIAWARHEDPARETFFSCSDVILK
GUT_GENOME096544_0139027-222LVAAPASAHGSVTDPPSRNWGCLDRWGDRHQAPEMAQQDPMCWAAFQDNPAAMWNWNGLFREGVAGRHEQVIPNGQLCSGGLTEGGRYRSMDTPGNWTAKNVPTNFTLTLTDGAQHGADYLKVYITKPGFDPTTQSLGWGDLTLLKETGRYASAGQYQTDLNLSGRSGRAVIYTIWQASHLDQSYYICSDVNIGGT
GUT_GENOME142419_0252211-212ATSWAAAGLLLAGDAFAHGTMTTPVSRVYACFQGNPENPTNPACAAAKAIGGSQAFYDWNGINQANANGNHQAVVPDGKLCSGNNPTFRGLDVNRSDWQTTPIQPDANGRFTFVFKATAPHATRDWRFFVTREGWQPGSPLRWADLQEFCTLGNTPLSADGTYKLQCTLPQRTGQHVIYNTWQRSDSTEAFYTCMDVRFQGG
GUT_GENOME096559_0232814-208LAAVAIFTLLSVALMILLSDKASAHGYVEGPASRAALCKSGQNTDCGAIVYEPQSLEAPKGFPAAGPADGKIASAGGAFPKLDEQSASRWTKVNISSGTNTFTWKFTAAHATSNWKYYITKANWNPNAALTRDSFDLTPFCSVNYGGKQPPTSYTDTCNVPARSGYHVILAVWDIADTANAFYNVIDVNFSGSTP
GUT_GENOME176628_0248221-210LLTIVLAFSFVLGGAALAPTVSEAHGYVASPGSRAFFGSSAGGNLNTNVGRAQWEPQSIEAPKNTFITGKLASAGVSGFEPLDEQTATRWHKTNITTGPLDITWNLTAQHRTASWDYYITKNGWNPNQPLDIKNFDKIASIDGKQEVPNKVVKQTINIPTDRKGYHVIYAVWGIGDTVNAFYQAIDVNIQ
GUT_GENOME096544_0148332-221ALAHGGLTYPATRTYACYVNGIENGVGGGLNPTNPACVNMLAENGNYPFYNWFGNLLSNAGGQHRTIIPDGNLCGPLASFSGARKGGTDWPTTTLQSGSTITMQYNAWARHPGTWSQYVTKDGWNPSQPLKWSDLEAAPFDEVTNPPLREGGPQGAEYYWQAKLPNKSGRHIIYSIWQRSDSPEAFYNCS
GUT_GENOME096513_0245220-209MVTLGISVLLFALSIMFADKASAHGFVENSRADLCAKGINTDCGYVKYNPADLEAPGGFPNGGPEDGKIASANGRYPEMDQQWTDRWEKINIASGLNTFTWVHTANHNTANWRYYITKPDWNPNQPLTRDSFDALSFCTVSHDGSLPEFKDNHLCNVPSDRSGYHVILAVWEVAYTGSTFYQVIDVNIDG
GUT_GENOME141727_0493418-216GIGACVLAAGVTAITMIPQSAYAHGFVEKPSSRAALCSQNYGALNLNCGNVMYEPQSLEAPKGFPDGGPIDGKIASAGGLFGGTLDQQTSNRWFKNTIKGGANTFTWKYTAAHSTSKWHYYITKKGWDPNKPLTRAELEPIGTVNHDGSAASNNLTHTINVPTDRNGYHVILAVWDVADTSNAFYNVIDVNLVNNETPD
GUT_GENOME147132_0066218-212SSFHGHVFSPASRVYFAWLAGQIDEGALNQREAGKFFPETAEGLSDPFAPDDQPNSLPPLDGKIASANQLTGQFLDEPGTHWQKHDVLAGDILEISWGYGQKHKTRRWSYFMTRSDWNPNLPLSRAQFEKEPFYMVQLNEQPHWSVNSKFALMPIEPTVHEVILPHRSGYHVLLAVWDVADTGNAFYNVIDLDFV
GUT_GENOME213384_0106516-221AHGAVDTPIARQVYCKTLPDFWSGNPSDAGCAALARKSGQYPGQQWNEVAHLIPKPGYEDLEIVKREVPDGKLCSAADSKKDGLNLVSNDWYRTDVTPTNGTMEVRIIGTAPHVPSFAKVFLTKPGFDPTRSPLTWNDLTLIHEEKLEVAQTNWGTRPPVISASGFFKFPVPIPPEQSGKATLFVQWQRDDPAGEGFYNCSDINIV
GUT_GENOME143718_0050810-211LCSSVLLGVGLSIGGQDASAHGYVQSPISRSYQGHLDRNVNHNAALAKYGPVIYEPQSLEALKGYPAAGPADGQIASASGAVGNNFNLDRQTSTMWTKQDLNTGPNTFTWRYTQSHSTTKWHYYITKADWNPNDQLDRSDFELLTTINHGGAQASTNPSHSVNIPNDRLGYHVVLAVWDIDDTRMAFYQVIDVNLKGDSAIP
GUT_GENOME096513_0523423-206AALMLIVCSLLFADRASAHGWIENSRSDLCYDGINTGCGAVMYEPWSIEGRGDFPEIGVPDGQIAGGSKYPELDVQTENRWNKINMKGGPFTFHWDMVANHSTNYWDYYITKKGWDPNSPLKRSDLELFCRYEDNGALPPMAVEHDCFIPNDRTGYYVIVGVWDIFDTVNAFYQVIDVNLTIDP
GUT_GENOME231993_014406-191IALMMATLAASSAAWSHGYIEVPESRAYKCKLGSNTDCGRAQWEPQSVEQVSGFPGGATPLDGQLASGGVNGFESLDRQGVNVWALNTMKPGPQTFTWYHTAKHKTNNWRYYITKQDWDVNKPLSREAFEKEPFCEIDGHAKPPKDREVHQCVVPERTGYQVIYGVWEINDTVNSFYQVVDVNFED
GUT_GENOME141422_0086041-281HGSPAIPISRQYQCYKEGGYYWPADGSGIPQSDCKAAYQLVYHKYLDKSGIVVPKEEGKKSAINSKKNAVDIIEQSNYQFRQWNEASKNVADYNNPDAVKAAIPDGQLCSAGNVGAEWDERSKVWNDKSGLDAVTAWRATDIQKTSEGKIDIVYDATATHDPSFFEVYISRPDYNAATAPLKWSDLVLLEKVENVKPENQQYKFAVNAKDYTGKHVIYIRWQRIDPVGEGFYSCSDVNIKG
GUT_GENOME143709_0555823-211IVLGLAASAIFADSASAHGYIESPASRAYQCKLGMNTNCGQVQYEPQSVEAKGNFPVSGPADGHIAGGGIFAPLDEQSADRWNKVKMQGGTNTFQWHLTAPHATSEWKYYITKKDWDPNKPLTRADLDPVPFCTIQDGGKKPPATVTHECSVPTDRSGYHLILGVWEIADTGNAFYQVIDVDLVNDGSE
GUT_GENOME171567_004506-224SALAVALIGMYSTGALAHGYVFEPKSRSVIHFPPMNDKGEHSNVWPADAVEAPKLPAGNDMSPTPIANEQLFQFPPDGKLASGGSTATPAFSKLDDEKQNTYNNPMSAGPHQFKWHLPAKHRTTYFTYYITKPDWQSVPGADSRLTRAMFEDKPFCHKVYTYAPGNPSADISLPTGFETHTCDVPEREGKQKIYAVWRVRDTDNAFYQMIDVDFGGEII
GUT_GENOME143421_015826-216LGLALMSTIILGGSGLLSAEEASAHGYVEKPAARGYQGSLDKNTIGWSAAMEKYGMVITNPQSLEFDKGFPQAGPADGQIASAQGGKGQITDSVMDSTGLNRWTKQDINTGVNTFTWHYTAPHSTTKWHYYMTKVGWNPDKPLSRADFDFLGEVKHDGSAASNNKSHQITVPENRSGYHVILAVWDVADTTNAFYNVIDVNVKGGGEITPP
GUT_GENOME098296_0104430-240LLAVPQFAAAHGYVEYPAARQEICDKDGGYWDSQDGSTIPNLACRQAYQQSSWYPFVQKPEFSRLVSNYRELTAVKAAIPDGTLCAAGDTKKAGINMPSAAWKKTLIDVTQGGKVTVRFLAATPHNPSFWQFFLSKPGFDAATQKLAWADLELIANFNDVAVTTLDGKKYYQMEIQLPTDRTGDAVLYTRWQRVDPAGEGFYNCSDITFTG
GUT_GENOME062684_000583-192KHIIITAVLASLFSASVLAHGYVTKPASRATNCKLNNNPPEMCGHAKWEPQSIETLSGFPGGKFPPDGSLASGGIERFIPLDLVKDKIWTKEKVKPGKMDFEWTLTVVHSTRNWRYYITKQDWNHEVPLNREAFESEPFCYFEDYGRVPPYTVTHQCEVPERKGYQIIYAVWEIADTPNSFYQVIDADFN
GUT_GENOME096381_011321-228MTVRRTAAGIVALGIVPLALAVAPAGPAAAHGSMADPVSRVSTCFAEGPESPRSAACKAAVAASGKQAFYDWNEVNIPDAAGKHRQIIPDGKLCSAGRDKYKGLDLARTDWPATAMKAGKHTFRYKATAPHKGTFELYLTKDGYDPSKPLKWSDLEAKPFAKATDPQLRDGSYVIDATVPQRSGRHLVYSIWQRSDSPEAFYTCSDVTFGGKGTAAGQQTPAAPDEQQ
GUT_GENOME143517_015146-199KIGMFFAVFTLAVVLFQTTASAHGYISKPASRVYLANKGINVGVGSAQYEPQSVEAPKGFPASGPADGSIAGGGKYSLLDAQTANRWAKVDIQSGPLTVEWTLTAPHRTSSWQYFITKKGWDPNKPLTRASLEPLTTIEADGSVPNALTKQEINIPDDRSGYYVILGVWNIADTGNAFYQVIDANIINSSVAPA
GUT_GENOME231375_0558422-214AAHAHGSMETPPSRVYGCFLEGPENPKSAACKAAVAAGGTQALYDWNGVNQGNANGNHQAVVPDGQLCGAGKALFKGLNLARSDWPSTAIAPDASGNFQFVYKASAPHATRYFDFYITKDGYNPEKPLAWSDLEPAPFCSITSVKLENGTYRMNCPLPQGKTGKHVIYNVWQRSDSPEAFYACIDVSFSGAVA
GUT_GENOME096435_0077110-204TLKITLGALIMLLCSVAFANSVSAHGYVNNPESRNLLCADGANYDCGAVIYEPQSLEAPGNYPEGGPPDGKIASANIFHELDAQSEDRWAKVPMSSGAFTFEWTLTAAHATDKWEYYITKEDWDPNDPLERSDFERFCSIDDNGDRPPFTVNHDCVIPDRIGYHVILAYWEVADTANAFYNVIDANFDGEYVEPG
GUT_GENOME143421_0112413-227SIILGGLGVFTSQDASAHGYISSPPSRAYYGALEKNTIGYQAAQQKYGAVINEPQSLETGKGFSKDFGINLYSWETGASGNGPADGKIVSANDALGSQISVQKDNYWKMNDINSGKLNITWHYTATHATSRWTYYITKNGWNPNSPIKRSDLQLISNVPFTGAQANTNLTHTVDIPADHKGYHVILGVWDVSNTGAAFYQAVDVNIKNGAETTPP
GUT_GENOME095980_0211412-209SILGTVLTLFSFYASAHGYIRMPESRAYLCSFNDYLKEPKTPLNNDCGLGPEYEPQSIETQKGFPESGPPDGLIASGGGIERFSPLDVQTPTRWHKIDMKTGLNTFTWYLTMAHATDNWRFFITKNGWDQSAPLTRSAFELTPFCQRDDHGAMPLIEVDINFICDVPADHEGYHVILATWDVDNTQNAFYQVIDVNLS
GUT_GENOME142467_0169915-200IIVGMMIAFLGCGVIQVSAHGFVTNPGGRAYLGSTWYPGGPLNTNIGSVMYEPQSIEAPQNTFIDGKIASAGIANFAPLDEQNAKRWYKTPVKAGNLSVTWQLTARHKTSTWDYYITKPGWNPNAPLKFSDFKKIASYNDNGAIPSEFVTHQVNISANEKGYQVLLSVWNIADTGNAFYQVSDIDV
GUT_GENOME143498_0333522-244LASQLLVMLPVDKAEAHGAVGFPIARQYQCQLEAGFWGDPANIPNSDCRQAIENPGDPSNPQLPFTQWNELSANPTNPSIQATVELAVPNGLLCAGGDPRKAGLDNVPATKWRKTLITPDENGHMQLRWENTTAHNPAYMKVYITKPSYDSTKALRWEDLELLYADKAPTPTAGTGLSPSTNSFYFLNVPLNGHTGDAIIYSYWQREDAGNEGFFNCSDVNIS
GUT_GENOME147335_002151-204MKKQPKMTAIALILSGISGLAYGHGYVSAVENGVAEGRVTLCKFAANGTGEKNTHCGAIQYEPQSVEGPDGFPVTGPRDGKIASAESALAAALDEQTADRWVKRPIQAGPQTFEWTFTANHVTKDWKYYITKPNWNPNQPLSRDAFDLNPFCVVEGNMVQPPKRVSHECIVPEREGYQVILAVWDVGDTAASFYNVIDVKFDGN
GUT_GENOME147131_023986-215LQLRHGRVTAPQTRGLVATDLGLIAEWENNEMEGGKNFPDLTGGSFPPPYEMDSWSNPPPPDGLILSGGHRGNREVVNFTDKEMQHKLRSIGHPNDNFTWPTIMVNPGSDLDIYWAYTVAHVTRGYRWFITKDDWNPAERITRAQLEATPFYEDIYPYVPYYSNNDKLVAKIEHNVHLPKGKKGHHVIILAWIVADSGNAFYQACDVDFG
GUT_GENOME176555_0235526-232MLFIIAVGATMISVFALGDAQQSYAHGYVDSPVSRVKNAEANGFGWGPGQDSSQPEIITTPQGIEAPTKLLDTGQLNGRLPSAGLASYSKLDEQTATRWVKSNITTGENNFKWVMTAKHKTNRFRYYMTKPGWNPNAPLTMDEMELIGIVGQPIGQDLPPGQGFMVNDTETHRIKIPADRKGYHVIYAVWDINDTINSFYQAIDVNV
GUT_GENOME143489_016917-201MLAMVVMSISGSALAHGYIENPPSRNLLCHVDGKNLNKDCGAVQYEPQSSGETADGFPEKGPADGQLASGDNWLSKDLNQQTAERWAKTKMKAGVQPFTWTFTQSHPIASFKYYMTKQDWNPNAPLTRDSFDLTPFCVLPPAAAIPQTEGKVRASHDCDVPERTGYQVIYGAWDVADTAGTFYQMIDADFGSATG
GUT_GENOME096287_0043331-179ASGASAHGYVSAPLSRQALCANGTVKDCGQIQWEPQSVEGPKGLRSCSGGNAQFAVLDDDSKGWPATSVGTTTTFTWVFTARHRTASFEYWIGGTRVASISGNDAQPPATLSHTVSLAGFTGRQKVLAVWNVADTTNAFYACIDVQIGG
GUT_GENOME141041_0109512-202VAGMAIAAVALFSFAGADAASAHGYIEEPKARGLLCQEGANQDCGGVVWEPQSLEGPKGFPGLSVADGEIASAGGVFPQLDQQSSNRWAKVDMNSGLNTINWHLTARHSTSKWHYYITKPDWNPNLPLTRDQFELVPFYEIYDGGARPEQKVSHEVTVPQRTGYHVILGLWDVADTENAFYNVIDVNFGGG
GUT_GENOME103882_0248342-269NKVLQCGALSALMLASTVSLPVAAHGWVEFPSARQNTCYQDGGFWDNTIPNAACQAAFDESGAFPFVQRNEISANVPNYQDMAHVQAIVRDGSLCSAADSAKSGLNVASPDWQKTAITLDANQQIELVFNATAPHSPSYWQFYLTTPEYDQAAPLTWADLELIDTAGNVAVGDDKKYRIAVTFPAGRSGDAVLYTRWQREDAAGEGFYNCSDISFDGGAAPTDPTDPT
GUT_GENOME095980_0331322-202SQFALAHGYTTEPPSRAYLCSVRNTTELKNTGCGNQVPYDPHSFEAPKGFPEAGPADGVIASAAIAAHSEMDEQSPTRWVKSELKTGKNTFAWYLTVVHSTDSWRYFITKPDWDPSQPLTRDSFDLTPFCVKYENGVRPPVVTEIECDVPKDRSGYHVILAVWDVYDTSNAFYQVSDVNIS
GUT_GENOME096246_0386420-269TLSLPSAGAASTEKEKAPRHGTPSEPISRQQLCFDGQDYHWPVDGSNIKNAACKAAYQVNYNKYKDTVKYPEFKDEQKLIEQSNYPFVQKNEFAHLIPAPDYNDQEKVKAAIPDGTLCSGNNVGQAKDSKKPYLYNDKSGMDIPAPWTASKVHLNAKGEIEITYHASATHDPSFFEVYLSNADYKASERALKWSDLTLLAKVDKPQLVGSDYKFVAPAKEAKGPRVLFIRWQRIDPAGEGFYSCSDIDIQ
GUT_GENOME176464_012048-219RRSVVATGVWMAIAGAGLQVQVAQAHGHLADPPSRAALCHANHKNLNSNCGGAQYEPWSVGEAIGRFPAAGPVDGKIASGGIRGDFGALDEQSANRWHLTPIVDRNIQFDWHYQAPHPVTTWEYFITKADWNPNTALSRASFDDAPFCVVDGNNQVPASGTGTNPKHSCTLPADRAGQHVILGVWKVGDTDKAFHSVADVDIQLDGGPAPEW
GUT_GENOME025899_0009618-204LTTLSSVNVSAHGYVISPISRAYQGQLDKENIGYDFAVQKYGPIINEPQSLEAPQGFPKGGPADGKIASANGARGFELDKQTSSLWTKQNLTGGATKFTWKYTQSHPTSKWHYYITKPGWNQNKPLSRNELQLIGEVQGKGAQASTSPTHTITIPNDRIGYHVILAVWDVSDTSNAFYNVIDANIQP
GUT_GENOME142583_0524126-220SQQAFAHGYIESPKSRAFMCSAKGGNLNANCGSVTYEPQSIEYAPGVNHHYPSSYCPGDFTQCGPADGTIAAGGIANFAPLNEQTATRWQKTTIRPGVNEFKWRYTAGHASAYYQFYITKKDWNPNQPLTRDSFELKPLLHQDAGGVRPQSGQTSSYNVNIPSDRSGYHVILATWKIADTAATFYQVVDVNIDNN
GUT_GENOME143633_031515-203KLMVFTATLLLSSAGFISTASADVTLKHGYIDVPPSRAFLCSSKGNNLNKNCGSIQYEPQSIEGPNKFPQEGPVDGKIASGGNAAFSELNEQSADRWHKVAMKSGENTFKWTLTAKHSTKSWRFFITKPGWDVNKPLTRDDFDLTPFCQQNDNGEIPKETVEINCNIPERSGYQVILGVWDIADTKNAFYQVIDAEFKN