UHGP-MC 102442


Information


Number of sequences (UHGP-50):
51
Average sequence length:
201±21 aa
Average transmembrane regions:
0.07
Low complexity (%):
1.47
Coiled coils (%):
0
Disordered domains (%):
7.26

Pfam dominant architecture:
PF00884
Pfam % dominant architecture:
196
Pfam overlap:
0.18
Pfam overlap type:
shifted

Downloads

Seeds:
MC102442.fasta
Seeds (0.60 cdhit):
MC102442_cdhit.fasta
MSA:
MC102442_msa.fasta
HMM model:
MC102442.hmm

Sequences list (filtered 60 P.I.)

Protein Range AA
GUT_GENOME121810_0089323-223VFQEELDGLGAIPFDYIKIPSGGGIAFEVPGDDPDSPDLEKKLKGIMLHHHPINSYWMSSELSNEAPDCQSNDGKLGIVKETGEVRDCATCPHNQFKEDGSGKECKNSHRIYFLREGEPMPVIITLPPTSLRPLKDYIAKRLLLKGRRPIEVVTEISLKKEQNKDGVTFAMAQFLNIGLLDPTQKEKALNMQGYIKEIAGL
GUT_GENOME259472_005193-163KNEMIPATNYTALKDFNLAGAMASEMNGMDVSFDRVTIPAAGGTTFELPGELPGETDAAKEFTGVILYHHPLFAYYRERFTGGNNAPDCGSYDGVTGAGNPGGVCAQCPLNQFGSGENGGKACKNKRRIYILREGELIPILLTLPTGSMKEFAVYVKRLLA
GUT_GENOME116633_0075518-229MRQDSSNSDLAKHMSAGFSSISIKGKVFSLVKTGERKVIPNPRDPESPATFIDIVLLKVSPNKSKSYYANGFNENAEDQRPTCYSNNGITPDPSVEHPQCKCCATCPHNAWGSARGVNGTVGKGKACADYVRVALAEPTNLDEPIMLRVPPASIRAIGDYGALLNKHKAPYQGVITRIAFEPKEATPRLVFTPRAFLDQETYLRVKDIAESE
GUT_GENOME200638_010292-229TNAITTFENASVPAFLQAQQQTAHEMNATLAALATGFPTISIKGKVFTIVRDGERTIMTRPDDPQAPVSSILVIIANASGIVKSYYESGYTEGSDDNKPTCFSNDGLHPDPQVEHKQCDNCRLCKRNVFGTARNENGQAGKGKACSDAIRLAVMSDPSGDAYLLRVPPASLRALGQYTQQLMKRNVPYNAVVTRISFDIQAATPKLVFQFHSFLNEAQYRAVADAAKS
GUT_GENOME020687_0014124-235EDLAAIREELSDMDRVPYGRIKIAAGGVNIFQVFEPGEEEATSAQTVEGVIMLSHKSNGLWSKPFGRGDSKVPDCSSIDGVYGTVTETDEIVECASCPCNAFGSAKGGEGRGKACKNMRRLYIMRRGDIFPMVLTLPPTALSAYDSYRTKVMLGRKKMANVMTRISLKSAQNKDGVAYSTPIFEAVGVLDGVEAAAMRAYSEALNSSAQRVG
GUT_GENOME254456_0004919-218NNEIPVDELDGFTLSFDKVKIPAGGTTAFEVPGDDPENPDIEKELKVIIVDQYASNAFYKNAYDGTENAPDCFSNDGHQGINKDGEIINCDNCPNNRYGSAIDGIGKACKNMRKVYILRSGDTFPMLLTLPATSLAPFGKYLQRIVSKGLRPCDVITKISLKKAESKGGITYAQATFTMEEVLPPEMREKVRKYATGMKK
GUT_GENOME181255_01163254-410GVNAYNTCDVREKYGTSIDYWWPLNSYYAQGFRDGAEGDEAKPTCFSNDGITPDASAREPQCAKCAACPHNVFGTARTQDGVGGKGKACSDFVRVVLCTPDDLDTLYLLRVPPASIRNLGSYGDLLNKRKVPYQGVITKISFDMKEATPRLLFEPVG
GUT_GENOME084427_0171527-242LSSEDIAEELDGLGTIPFDRVKIPSGGAKTFEIPTEDGDDTESVNEITGVIVYHHAANAYWENEYSGAIEDPLCSSMDGKTGINRKTGEVINCATCPLNQYGSTQNGGKACKNVHRCYIMRNGNPIPLLLNLPPTSLNSFRNYLGKKVLLKGYKASDIVTKITLKTDSNKGGIKYSKTVFENLGPLNKEEKEKIHAVKESVKSIARSESIVADDKM
GUT_GENOME056639_0185928-212EIMSSEDFSASDLDRIKVPGAGGLMWEIPTAEGSRDSREIRGIILLSKTGRAYWQSKYSGENSKPDCASLDGQCGTGNPGGQCAACPFNQYESAAEGSGKACKETRTLLVLQEDELIPTVLRVPPSSIKELKQFLLRLAKKGLRLSHVASVLTLERDKSAGGITYSKIKFAAADALTPEQRKAID
GUT_GENOME141754_024351-221MSNLQTFSQAALPAHLATTQSSLTSAMNAGFSGGFNRISIRNSKWNAIKNGQVIQQSSAEFIDVVIVDIQSAIGRIYYNTQYVRDSKIKPACFSNDGITPDPRAETKPVVFLSDGTSRPVKSCAECPNNIRGSGQNGGRACGYVRRIAVVLADDLAGDVYTLDVKALSLFGDGNPALHQYSLRGYANFLATIPGTKGVPPNALVTRLSFDPSESVPVLRFG
GUT_GENOME173036_0275722-226LTQEEIAEELGGDIPQYPRIKIPAGGGIAFEIPGEDPENPEIEKEIIGVVVWHHKSNAYWAVSSDDNTPPDCLSHDGVQGIGSPGGSCKECPLNAFGSGEGGKGKACKNMELLYILQPENLLPVVVSLPPTSLNNWRTYKTLLISRGKKVNAVVTKITLSKKNTGGNDYSVANFKIAGNLNPETVGAATIYRQSIKDMLAQQAAE
GUT_GENOME103729_0215318-230LTGDLAEAVAEEMDGLGSIPFDRVKIPSGGGLAFEVPGEDEEDAESATELIGVILDHNPVNSYWANKFTGGNEQPDCSSFDGKQGVVRETGEVRSCETCPYNKFRSDNAGKACKNIHRVFMLREGSPVPLILSLPPTSLRYMRNYIAKRVLLKGYRCWQVLTKITLKKEKSKDGITYSRAAFTFLDALTPEQAQQAEAMRDMVKSIYRTIDIN
GUT_GENOME193240_0081927-230EDLADDMEGLTPQYPRIKIVSGGANQFDVGTDPENPQLERYLDGVFLFQHPSNAYWQGGNEDEGNAPICQSFDGKIGFGEPGGACISCVNNQFGSAVKDGKPAKGKLCKNTVMLYIQFSGEPIPYVIYLPPTSLGAFRKFCNDTFLVRQRAMFGSLVRIGLTKRVDGPNIYSVATFKLLRHFCGDELAEIKQYANPFREKAKAA
GUT_GENOME152418_0036878-217VPKFTAKAYYSGTYNADNEEAQTPICFSSTGIIPDQGCAQPQSACCKTCPKNAWGSAVDAKGIAGKGKACRDAVRLAIAPRGSLKETMLLRVPPTSIKALGDYGSKLTKHNVPYMAAVTRIKFDPNETAQKLVFEPVGFL
GUT_GENOME213779_0150237-223GGTPHITAKVGTGGVNMFILSNGNKDVTVEKFSGVIIGNHKCNAWFPKGELNSSPICSSNDGIIGINRQTGEYLNCMTCPNNEFGSDGNGKLCKNMHKLYILAENCAVPITLSLPPTSLENWRNYVLSGVAVAEKEISEVVTEFSLSGETSQTGNKYSVLNFKMTGYVNDDVNKFCSGMSELIQTYP
GUT_GENOME268555_013939-205ENFNLVSFTEDAMQNLNGIQLQCPQLKIPSGGNVYFDIDEEPYKELIGVIVDHGPMNVYFAGDFDGSSQPPDCFSKDGINGMHRIDNGAGDDIAYEPVQCEGCPYAEFGSGKNGGKACKEKHQLFIQLSGEMLPYSLLLPVSSTGVLNAYATKLFTKGMFLNDVVTSFTLEKAQNKTNIVYSKIVMKAIRPLTPEEK
GUT_GENOME121425_0013423-223HNNERTAREMNGLPMTFDRIKPPSSGGTSFEVPAIEGDDTETVKTFSGVILYHHPIQSYYKKTYTGGKEKPDCASMDGEFGRGNPGGECAACPLNKFGTGVNGSKACKEKHRLYLLKKGEIFPMILDLPTTSVREFGGYVKRLLSRGIFVDTVITKFALRKTQSTTGMEFSQVVFSMEERLGSGDTLLLSDLAEQARVYAQ
GUT_GENOME257287_0072312-232FELSTNTDIMEALIEELGEDAKIPYDRVKIPSGGTTAFEVPSDDPDNPDIEKEIEGIIVYAQKINAYWKDAYSGDNTSPDCSSNDGKVGVEFETGEIHKCVDCPFNEFKSGPNGVGKACKNMRRLYIIRTGCTLPIVLTLPPTSLRAYEDYVGRQIVTKGLRTHFVITKISLKKAQSKDGITYSTAQFTKVGRVPDEMKPELTAYYNSMKTNVSSMAIDDD
GUT_GENOME024712_013978-147IDHHTVNAYWESKYDGQNTPPSCSSLDGKTGINAETGEIHNCKVCPFNEFGSDGEVKACKNMHRLYILPENSPLPYILTLPPTSIKAWKEYLGKRIVVKGLRPHHVITKVTLKKQENKGDISYSSAQFSIVGRVDDKFKA
GUT_GENOME127231_0085325-227DLAEFEDIAPEFPRVKIPSGGQLSFEVPNPERPDDPDPCKALVGVIVMQHTANSYWAESDTNGTPPDCSSDDGVTGYGTPGGKCASCPLNEFGSGEGGAGKACKNMKNLYLLRDGDIMPLLLSLPPTSLKAFRQYANNLRFTGRGLSAVVTRIGLKRQESGGNAYSVATFSMEAPLAPELAEAGRAYARSMRENIAAMSAART
GUT_GENOME010448_0023913-222IAKVEEDVAEMIREEMRGLGTVAFDNVKIPAGGGLAFEVPGDDPENPDVEKEITCVILAHVPVNVFFEKDYDGEAVTPDCVAYDGTTGYNANTGEITSCKECPFNQFGSGKNGGKACQNRHELYILREGQPLPFRLTLPATSLKNFKEYMFKRVLMKNKKLHNVVTKITLKKAQNSGGIAYSQACFAPGGDLSEEQVKALQGSVKLVESV
GUT_GENOME096866_0062619-228ALKDFDFASVISEEMEGLTASFERIKIPAGGTTIFEIPGDDPNEPEAVKEFSAVILFHHPLNSYYKDAYTGGSNPPDCGSFDGITGIGTPGGNCKTCSYNEFGSGENGNAKACKNRRRIYLLREGEIFPMILSLPTGSLKDFSRYLMRQLSKGNKSNAVVTRFSLKKATNNSGILYSQAQFAVDRKLTSEEYVLISNLSEQVKARSRNVG
GUT_GENOME041845_0053424-232DLIKEELDGLGQIPFDTVKIPSGGGLAFELSGDDPDTPETVQSLTGVILHHHAVNSYWPGEFDGSNNVPDCSSADGKQGLDIKTGEVRDCSTCLFNQFGSSSKGNGKACKNGHRIYLLRSGEVLPVLISLPPTSLRAFKDYVAKRLVVKGKRTSSVLTTIKLKREKSADGITYSSCVFSKAGDLTPAQIEQVKPTVAWIKSVASTVPVV
GUT_GENOME024062_0158727-245DEELVAELEDEMEDLDDVKGISCKHIKIPSGGGKAYEIEGDDPDDPEIAKEIEAVIVFTHRMNSYWEGEFGATAEDGSPNFPKCSSMDGKVGVEFESGEVKNCENCPLNQYAEDGSGKQCKNIRRLYLLISGKPGIYLLSVPPTSIKDVNKQLAKIMGIQKIPYSKMVIRLKLEVTKNRNDIKYSKVVIEKAGILPREVWPVTSAMRRELKEKYKDVAI
GUT_GENOME085240_0165222-225FSTSAAAKELAGFNMRFDRVKVPSGGGLTFMLQEPDANGNTDFREINAVILTHHPMQSYFQGKYTGGNARPECSSMDGSFGVGSPGGNCARCYLNQFGTAENNSKACKRKHRIYILKEGEIFPIILDLPTLSVEGFGKYLRRLLTVGKDPGAIVTRFALRKAANKGGMDYSQVTFTEGRDLLPEELTAVRRLAAQVQAYSSSVD
GUT_GENOME009669_0226265-240FDTVKAPSGGATVFTVPSISGDEAEKSITGIILDYTTPRAYWETSDPVEGTPPACYSKDSIISFDGKACCSCPYNTYGSKDDESNAKACKESVSLFMLRPGNIMPIIVRIPVSSKVIFQRYLTRLISKMIPISSVITKITLSKTTNKSGQPYSIYNFEAASVLPPNEASKANDFSK
GUT_GENOME141754_0046614-245YLANTQDTLAQTMAHGFSSGQYNRISLKGGRFHLIENGTVVTTSRAAFIDVVIVDAQPNNGRIYFDKQYSADEKIKPACWSSNGVTPDAPIATRPTIKIHDATPRAVNSCAECPKNIKGSGQNGGRACGFTRRIAVVPASDVSGTVYTMDIKAMSLFKDDDPQNNLYSFGGYARFLTTPRNGLPHGISPSAIVTRISFDDTESVPVVRFGVTPNDGTGVGGYLSAEDYATVL
GUT_GENOME046065_0030430-238EAIAEDCVGLEFQFDRIKIPAGGSTAFELPGEDEDDTQMAKEVVGVILYNHPAYAYYTQKYTGGSNPPDCGSFDGVTGIGTPGGACASCPHNQFGSGEGQSKACKNRRMLYILQEGELFPMVLSLPTGSLKEFTKYVKRQLSKGRKLNQVVTKISLKKSTSTTGIAFSQAVFSMTRVLEQAEKVAIAKMSEQMKDYAANLTTSALIEND
GUT_GENOME018310_0112313-214PAFLQNTAATNDDLDAHAVSSFSVMSIKGKVFTLVKDGERRRVPNPKDPSSPASNIDVIVLRVSPFTSKSYYATGFSENAEDQKPACFSYDGVRPDPSIEHPQCATCAACKWNAFGTARGDNGGLGRGKACADSIRMAIADPTNIDEPIFLRLPPLSMRSLGEYSNTLKRHKAPYQGVLTNIAFDQDLATPRLLFRPVGFLQ
GUT_GENOME222069_017097-235LTTLDSFVLPSMTGERAEQQIDDLNDFEGIMMTFPRIKIPGGGSTLFEIPGASPDKPDYVPAIEGILIYNHNTNAYWEEGSEYDMNTSPVCSSPDGKTGYGTPGGACVDCPYNQYESDPNGGKGKACKNMRSLYILESGKPMPINLLLSPTSLKAYSNFVQSAFLMRNRPIWGSLIHIELRKETNGPNNYAVAVFKLLGDFTGEKLAEIKRYAIEARAGIKEMLEQRAI
GUT_GENOME149600_0095730-234LQEALSGGECTGLTFRFDRIKISSGGSLAFEVPGEEEETEMAKTITAIIAYHHPAFGYYATKFQGGSNPPDCGSFDGIHGTGNPGGLCRACPYNKFGSGGNKSKACKNRHMLYLVREGEIFPVVISLPTGSLKSFTDYVKHQLTKGRKLSEIVTQISLEKATNEEGIVYSRARFKFVRVLDPGEKTFIGEMTGLIRDYANSLTVG
GUT_GENOME237714_0086819-211LSEMRAIYTEILAAQEEVGGTILYRAKIPSGGAKSFEIVTGNDDTDTTVQKLVGVVIHSQKCNARFDEDTRGLPPVCASSDGVIGLEGDVEHVCADCPFNRFKTAKKGGGKACKNMIRLYMMVEGSPIPIVLSLPPTSIEGWRNYRLGVLGPRQLKPYEVVTELSLTAETNRAGDRYSVVRPRLIGRLSDADK
GUT_GENOME139460_0002020-229ELSGMNFLAEAMSDECAGLEFSLDRVKIPAGGMTAFEVPTGDGETSELEKEIDCVILLSHPANAYYRDVYKGGSNPPDCGSFDGVTGSSGQLCKTCPYNQFGSGEGKAKACKNRRMLYILREHELFPMILNLPTGSLRPFTKYVQSLLTMRKRPHQVVTRISLRKANSSSSIEYSQAVFKCLRALTPEEQTGIDNMVAQVRGIAAGMTVT
GUT_GENOME091490_0010832-248ELQDEMDDLDPESGITCLKIKIPSGGGLAYEVQTDEEDDAEYMKQIDGVVIFTHRANGFWPGAYGSGEDQNQPPACASMDGKTAIWTDTGELRSCEGCPYNEYGSGADQTGKQGRGKACKNMRRLYLMMSGDPNLYLLTVPPTSIKDVNRQLAKILAGGVPYTGMILRFTLEKATNANGVAYSKVVIKKGGILPTAIAAQAIALRRQVKEQYQSVAI
GUT_GENOME221626_0073023-242PTMAGLDFGNDDLADDMEGLTLSFPKVKIPSGGALQFELPTGDPENPEYTRFLQGVILYHHASGAYWPEGCEYDDNTIPLCSTVDGKQGYGTPGGACAACELNRYGTATDGKGKACKNMRILYLLQDGDYIPLQLSLPPTSLRPFNDFMNAAFVARRRPAWSSVVQIGLKRVDNGNNTYSVATFRKVEDLQPEQVGEFRAFVESFRQQAKEMLKNRAELS
GUT_GENOME159031_0151216-226NMSNDVTSDIMEELNGLNVSLDKIKISAGGGLAFEVPGEDPDSPDSAKEIIGVIVDHYPLNSFWTEKYNGQNVAPNCYSTDNRIGIGTPGGECAKCPYNKFGSGEDGQSKACKNAHRLYILCSGELYPVVVTIPPTSLKSLSDYLAKRIVTKGLRSYGVVTKLTLKKATNNTGIAYSQVQFAVVEKLSPENAEILKKFGESIRPITRNVDF
GUT_GENOME128508_0008542-241GGALTFRVKTPSGGGKAFDILTGDEEQDTSVPSFKGVIVFQHKCNALFDEEMSGNTPPLCSSIDGIRGLDSGTGEVRSCRGCPHNEYGTAKKGRGKACKNMQKLYIMTEWAPVPLVLCLPPTSLKNYQTYCLSSLAMRGLKAEEVLTEFGLTVDQNSDGIKYSKVKFKLSGKLTEEAKAAAGFFAAGLKEAASVSAEDYD
GUT_GENOME000086_0379712-238QERFMLPTAMPEAEFTQEELAEDMDGLQISFPRVKIPAGGALQFEIPSDDPENPDYAKTLVGVILFHHPNNAYWPEGSEYDDNATPLCSSVDGKLGIGEPGGSCAVCALNQFGSAAEGNGKACKNMRVLYLLRSGEFMPLQVTLPPTSLKPFREFMNQSFMLRRRAAYGSVVQIGLKKMSNGKDDYSVATFRRLHDFSGEELAQIRAYADGFKEQARMMLQQRATVN
GUT_GENOME139826_0003421-218TVEIMRRTVEALHGVSELPMTRVTVPSAGGRYFMFSDDEAGDMPPVNAFEGVILSANFVNAYWEHGFGEGGEKTPNCMSTDGISGWDRDGLEHVCKTCPRNRMGSGDGGRGKACQNNVQLMVLLEGEPLPVALKVPTMSVPNYVRYVAGVLTPRGLQPYQVTTRFGLMKATNSNGVDYSQIMFNCTGRVNDEEIKALM