UHGP-MC 98938


Information


Number of sequences (UHGP-50):
132
Average sequence length:
327±50 aa
Average transmembrane regions:
0.06
Low complexity (%):
1.75
Coiled coils (%):
0
Disordered domains (%):
5.58

Pfam dominant architecture:
PF00438 - PF02772 - PF02773 (architecture)
Pfam % dominant architecture:
5303
Pfam overlap:
0.92
Pfam overlap type:
equivalent

Downloads

Seeds:
MC98938.fasta
Seeds (0.60 cdhit):
MC98938_cdhit.fasta
MSA:
MC98938_msa.fasta
HMM model:
MC98938.hmm

Sequences list (filtered 60 P.I.)

Protein Range AA
GUT_GENOME045516_003923-328LNNYYTVETVLSGHPDKVCDQICDAILDSFIAVDKYANVTVECMGTNDTLIIGGEVGSDAIVDIEQVAHKTYAEIGYSTPLNIINKLHRESIQTNAPLKNNLAGDQGIMYGYACQNKYNFLPYGVYLVNSIAKEIDNFRRESQLFLPDGKVQLTLLGQKIETLVVCVQHKKNTNIEVLKEQILRKLFLAFPELVTATNIFFNHNSNFINGGFSIDAGLSGRKIIADTYCGLIHHGGGSFSGKDPYRLDRSAAYMARFVAKNLVANNLCKQCLVSVAYVFGHEKPIMLTVESDNPSQDNALLQIIHKKFDFTPNGIVKMLNLYNTQF
GUT_GENOME058404_005454-346FFTSESVSEGHPDKVCDQISDAVVDLFLRADPRSRTAIETLATTNQVVIAGETKCNEYISAEMIEQTVRDTIRRIGYDQKGFSWQTVKIKNYLHHQSCDIALGVDRDGAGDQGIMFGYAKKEKGFDSDYMPLAIYWADRILQNLAQARHSGEIIGIEPDAKSQVTLEYDEDGTPKGIKTIVLSTQHNENLSQEEVRRIACRYIEQTLPAGWMPADKDILINPTGRFVIGGPDGDTGLTGRKIIVDTYGGYAPHGGGAFSGKDPTKVDRSAAYMLRYLAKNIVAAGLADECLLQISYAIGLAEPLSLYINTNKTNKTDESKILAFIRNNIDLTPQGIINYLRLR
GUT_GENOME224708_0091368-433MKRFYTAESVTEGHPDKLCDLIADSILDACLKEDENSRVACEVLATKRNIIVAGEITSRFEPQVFEIIKKVLESAGYEAEEIHMDALIHKQSPDIAGAVERSRERRAGTVSVQSGLASGAGDQGIMIGYACDETPQLMPMPVVLANRIVRELSASRRSGYITGILPDGKAQVTVEYEDDRPARLDAVIVSCQHEKEKSLRKLEHEIREKVLRPALRMLPPDEDTKILINPSGRFVCGGLDADTGLTGRKLMVDTYGGLVPHGGGAFSGKDCSKVDRSGAYMARYIAKNMVAAGLASRCQVSLAYAIGVAQPVMVQVDTFGTGKICADDCLAAAIPLVFGLTPSQICDTLHLKRPIYRQSAVFGHFG
GUT_GENOME221321_0428121-347LYSNEIVFRGHPDKICDQISGAILDECLKQDKYTRAGIECALKNNRIYIFGEITTNAKIDKAEIARRVLKDIGYKEEFQVVENISEQSRDIAIGVDRLGAGDQGMMFGYACNDTKELLPLAQVILIKFAKEYDKLVHINPNVFYPDGKAQITGYYDQNFKLKGIKDFTISYQNNEKNRFVTDAEIKAIAMNICREYGIHIQSFLINPTGKFLTGGPYADSGLTGRKIVVDAYQSFANVGGGCMNGKDPTKVDISGAHKARELAKRILKEKQLTWCELQLSYAIGLEKPLAIYIDSDKGNLPVSTNMIEECKPARIIRDLHLLEPRYE
GUT_GENOME078956_015926-365SAEAVCIGHPDKLCDLIADQILDEILYADPNARVAVEVMATGRRIIVTGEINANARVDLRDCVRTALTAAGYKPWRFLVYVWVRRQSNDINDGVSTSLEARHGDESAYCLQGAGDQGTVYGYACTDTPERLPLPLVLAHEICKRLDDARKQRTITGIFSDGKAQVSVRYNDAGKPQAVETVVVSVQHDKSKDFEVLRREITSLIVGPACQPYLPVDADTVVLINPSGRFVEGGPKADTGLTGRKLMVDTYGGLAGHGGGAFCGKDASKVDRSGAYMARLIAKTIVDADLASRCQVAISYAIGKADPVAFSVDTLGTGQYTDQILTAAAQDVFNLRPAAIIDQFGLRAPGYVRYSTYGHFG
GUT_GENOME000279_019944-343LQTARAFCRGQSDSLCGQMADRILDAALSQDPGARVRVDVALAGARVWIGGELSADAQIDYATLARQTLLQAGYPDLAQHCQVEVALRPFAPEAEAGLRRADAGDSGALCQGLYAGYACREGEARFPLAAALARGLCGRLDALRAKDALPGLLPDGGAFVALRDGRVTALVLGAQHGAQIALDQVRALLEQKVVAPCLPGRLLEEDAQVFINPAGRFTCGGPAAFAGNSSHLESDYGDALDQPSLAGRDATRTCGAYMARYIAKNLVAAGLARRCRVKLCYALGLADPVALEVETFGEGKGAQLERLVSETLDLRPAAIIRRFGLRRPLYTPLAVCGRFG
GUT_GENOME236333_014814-340TSEYVSLGHPDKTADYISEYLLDQYLKEDANVRYAVECQVKDNVVNLAGEVTSTYMPSNEQIKSFVVKAVKDIGYTEEYAKKWDEQCINPAKLQVNIYISRQSPEIGVGVDSEGWGDQGIFFGYAVNNPDIDYMPVDYTMAKDINTTLYQQVKSGNLDAGLDIKTQITIKEENGGSLLDHVIVAIPLLSNDISDVADEVFSCIKSYINAHFIDGYTNDFSLTINGTGCYKIHSSVGDCGTTGRKLAVDFYGGNGHIGGGSPWTKDGTKADLTLNLYARKLAVDYARQCDHGDVVETRLSCCIGRKTVNVCILKNNKVVQTYQAILSTSDLIKMFDLD
GUT_GENOME070661_012053-339TSEYVSLGHPDKIADFISEYILDRILEQDKNARYALEVQIKDKFVTLGGEISTKASISAEKWVKEAVEEIGYTKKYAEKWGKENTICSDDLQVRVLFSTQSPDIAQGVNRDGWGDQGIFFGYAENNPDTNYMPLDHFLAKELNNILYNKAKKDGIGGLDIKTQITLDDDDMIEEIIVAIPAKDKKEFTKIKRIVEAWAGKKNITPTTVTIINGTGKYIKHSSMGDCGTTGRKLAVDFYGGNSPIGGGSPWTKDGTKADLALNIAARSAARYELINSTHPFKFSAECRMSCCIGKSEINASLIIKDKYGQILEKKSNKADLYPTEVIEELGLQNPKFA
GUT_GENOME251523_011628-374LVTAESVRAGHPDKFCDQVADAILDAHLRQDPNARVAVEVFATAGKIVVGGEITSKAKVSYDKIVAGVINRIGYTMSDLCGDPHGLLELEVCLHEQSPDISAAVSKTGLDTGDAGDAGKALGAGDQGIMVGYAASETAEYMPLPVVLAHRICKRLDELKPITPWLGADGKAQVTVAYENGVPVAVTAVVVSLQHDAEIDHKSIRRFIGKEVLGKVLPRELLKEDTSILINPSGKFVLGGPAADAGLTGRKLAVDQYGPVAHIGGGALSGKDPTKADRSGAYAARWVAMNIVAAGMAKRCEVQIAYAIGKSEPVSVTVDTFGTGVIPDSRIEAAVKAVFDLTPAGIIRDLKLNRPIYSRLARYGHFGR
GUT_GENOME059913_001078-367IFTSESVGEGHPDKVADYISDSILDACLAQDKTSRVACETLVKSNMVIIAGELTTKAVIDPEKIARQAIREIGYCNRQDDDVFHADTVFFTNLLTEQSPDIAQGVDAREAEGKGHAEQGAGDQGIMFGFATNETPELLPAPIVFAHKLLIELARRRKRGHVDWLRPDCKSQVAVAYDEDGRPAHIENVVISTQHTEDVDHDTIYSYCVKLIKNVLPAELLDERTEYFINPTGKFVVGGPHGDSGLTGRKIIVDTYGGMGRHGGGAFSGKDPSKVDRSAAYMCRWVAKHIVAAGLADKCELQVAYAIGYPAPVSIRVDTFGTGKVEEISIENALENIFSFKPADMVKQLNLLQPIYRKTTH
GUT_GENOME257744_003143-327FTSEQVSCGHPDKICDQISDAIVTDCLAHDKRSRVAAECMIKDFDIIIAGEITSSHTPDYEALAREVLKRIGVFSAEDFRIQTFISKQSADIALGVDGNAGAGDQGMMFGYATNETPEMLPIPYAVATHALQLLREVGCPILLPDAKSQVSYDYESGRITTFLISTQHLEGTTVEDIRPIVEAVMKTAAQDYGLNTDFEKLVNPTGRFVVGSSFADSGLTGRKIIADTYGGMCRHGGGAFSGKDPTKVDRSGAYAARYIAKDIVRQQYADRCEVQLAYAIGVAEPVSVYVDCFGTNRIPEADIVDYILTEYDLTPRGIITALGLL
GUT_GENOME234131_007555-374LISESVTEGHPDKVADKISDAIVDAYLAKEPDARVAVETVVKEGLISLQGEISAPFTLNHGEIARKAAEEIGYTDADSGLDPRGAGIINNVWPRDFSSDLGDFGQEDASSEEEKEHIYDTFVGDQGLVVGYATDETAEYLPLPYVAATKLAQRLAYVRKNGIIPQLRPDGKTEVVVEYNEDLSRALRFAAVLISTQHSPDIALEDVRSQVREQVLEPVLDEFGLDYSRARIVINRTPFIQGGPHSDAGLTGRKVVVDTYGGIGSHGGGAFSGKDPHKADRSEAYAARWAAKNIVAAGLAHRAQVTLGFILGQGAPVSIDVETYGTEQVPREIIQKAVEQVFDFRLLSIIDQLDLERPIYYKTAAYGHFGH
GUT_GENOME092749_010719-260LTSEAVTEGHPDKICDQISDAIVDAYLTQDPASHVAVEAFVSGNVLTLAGEVSSSATVDVTALARKVIADIGYTDPALGFTADACLVFTNIHQQSNDIKQGVTQAEGDRIVGSGDQGIMYGYATNETVNYMPLTALGRFLRPDGKSQVTMRFNDEGHPVGIHSLILSAQHSEAIGQDDLRQLLQKTVIEPVCSSWIGGPVGDTGLTGRKIMVDTYGTLAKHGGGAFSGKDATKVDRSAAYMARYVAKNVVAE
GUT_GENOME195813_009835-361LFTSECVTNGHPDKVADSISDAILDACLAQDPHSRVACETMVTTDFCIICGEITTKATVDYAAVAREAIRKIGYVYPGDGFDADTVEIQCRIHTQSADIALGTNDEVGGAGDQGMMFGGACTQTPELMPLPAALSRALCSRLTQCVHETDLLRPDGKTQVTVEFDEQGNVVGIDTVVVSIMHSADFPIEALRKYVRENVIAPVLERYGFHIENVAHIHINPTGNFVIGGPNGDTGLTGRKIIVDTYGGYFSHGGGAFSGKDPTKVDRSAAYMARYMAKNLVAAGLATKVQVQLAYAIGVAQPVSLRVDSYGTGKISDEKMTELLRQTCDMTPAGIIRKLDLRRPIYADTAAHGHFGI
GUT_GENOME203596_007347-362FTSESVSEGHPDKVADQISDAVLDACLEQEPMSRVAAETLCTTGLVVLAGEITTGANVDYIGLTRDVLKRIGYDNTEYGIDHKGCSVLVGYDKQSQDIAQGVDSAADDELNLGAGDQGLMFGYACDETPTLMPAPIYYAHRLVERQSIVRKNGSFPTLRPDAKSQVTLRYRDGKPIGCDTIVLSTQHAPDMSEGKHMKAEFIDEVIEKIIRPVMPAEWLKDTRFLINTTGRFVDGGPQGDCGLTGRKIIVDTYGGYCPHGGGAFSGKDPTKVDRSAAYACRYVAKNIVAAGLASQCQVQVAYAIGVAEPMNITVFTNGTGVIPDEQIAKLVRAHFDLRPRGIIQMLDLRRPIYSKT
GUT_GENOME055600_00134194-505DKIADYISSALLDECIKQDSGIRYAVEVMVKNNHIILGGEISGNIKLHNLPDCVRQALRDIGYDEEYSQIWQQNAIDINRIEITNLISAQSCEIGRGVEADGWGDQGVFVGYAQKGDGLINRELYLARKLNCALYEKARVSDNLGLDIKTQITLNTEGKILTAIVAIPMQKPESLSAFIAETLEQVPENLIVNGTGAYTCHSSIADCGITGRKLACDFYSAACPIGGGSPWTKDPSKADLTLNLYARKLAIQYLEDNNECFVYLSSCIGKSRLLSAVVKTRRNGNEKIFDICPDCSPQILINELRLRRPIYK
GUT_GENOME148171_008008-381SAESVTEGHPDKVCDQISDAILDDMLAQDPQSHVAVETCATVGQFFVFGEVTSEGYSDIQSIVRSVVRNIGYTSSRVGLDADSCGVTVSLTEQSSEINQGVARLSGEAESKASREQRYEAQGAGDQGVMFGYACDETDVLMPLPIYLAHRLAYRLTEVRKNGEVPHLRPDGKTQVTIEYDEHDAPVRLDTVLVSTQHDPQVDQAWLKEQLTEHVIRPVLDDVLADRVAHDEYRVLVNPTGSFVLGGPAADAGLTGRKIIVDTYGGAAHHGGGAFSGKDPSKVDRSAAYAARWVAKNIVAAGLAHKVEVQVAYAIGVADPVSINVETYGTENGVTREQIQQAVRKVFDLRPAAIIDELDLKRPIYSKTAAYGHFG
GUT_GENOME175028_011536-339TAKSVARGHPDKLCDQIADAILDDCLSHDRHAHVACEVLATQGHIVVAGKITCTHPPNVEAIARAVLNEIGYDGDSFQVTCLLHSLERDEGRAATGQGIVVGYACDETWQYLPLPVVLAHRLTQQLDTARRELPFLGPDGNAQVTVEYDELGRPSRIDTVVLSAQHSQDVPQDELFFELTDKVVMPALSVLPPDENTKLLLDPAGGGPDAGTGITGRKVAADTYGVFAPQVGSLSGRDHTQVERSGAYMARYIAKNLVAARLCKRCQVTLAYAMGREKPIAVEVDTFGTGKYCTDDCLAGAVQLLFGLTPGEIIQELDLLTPRYLRTAAYGHFG
GUT_GENOME162805_01330305-669MNTFIFTSESVTEGHPDKVSDIISDSILDAYLEKDKNARTAVETMVKNNTVILAGEISSSATINIEQTVRNAIYKIGYNHSELGFDSHSVEIIVRLDKQSADIAQGVNNALETRNFDTKAETGAGDQGMVFGYASDETEEYMPLAISLAHSLAKRLSEVRKNNTLPYLRPDGKTQVSIRYENDAAVFADTVLVSTQHNPDVSQEQIWEDIIREVITPVIPEKLLSKETKILINPTGRFVIGGPVGDSGLTGRKIIVDTYGGSAHHGGGAFSGKDPTKVDRSAAYAARYAAKNIIAAHLAKKAEIQIAYAIGVAAPVSVYINTFGTGIIPDSDIQKIVTETIDFRPGAIIDKLNLRQPIYKETAAY
GUT_GENOME194793_011032-245RTLFTSESVTEGHPDKICDQISDGILDALLAIDPNSRCACETTAEPGAVHIMGEITTAATVNYIEIARRIIREIGYTKPEYGFDADTCEISCSLHTQSADIAQGVDMALEAKEAIDTDGLGAGDQGMMFGFACDETPERMPLPITLAHRITRRLAAVRKNGELSYLRPDGKAQVTVEYENGKPVRVNTVVLSTQHDEDADPLTLRRDMIERVIKPCIPTEMLDDETKYYINPTGRFVLGDNATL
GUT_GENOME241315_0292811-369SKGSADRVADRIADSIIDKYIFKDHDAMIKCQAILAPGYVTVYIEGYTSEYDVNIDQTVRAVLTSVNSPNFSDIDPDSYSINIIRKTRYPLRSTWDSTIDRGASCSCTVEGYASSETEEMVPLHEKMAYLILDTMDRIREEGKLMKYLGHERKCQVGVTYDSDNGHPVIDSIHVNTQARFEYSNYNTDEEDIKAKLKNDLIFIVMDRVLSEHPELKEYISPNIQYNIDRCSCYSINGVFQTIGVSGKNLGSGFRYCGLTHNDTRRSASLAARNVAKNMVAAGVADKVQVELTYMQGVAAPVGIEVNTMGTNRTNMTEESLAKMVDAIFDLRHQANIARLRMENPVYSEYAEHGLDGFNS
GUT_GENOME238261_008354-333TSEWVSLGHPDKMADYIACYLLDRYIEKDPMVRFAVEVQIKDWFVTLAGEVSSTVHFSDRQLREFVKEAVYEIGYTAQYQRAWYSCNTICSADLKVAIHISQQSPDIGQGVDFGGWGDQGIFWGMAVRKSPTMMPLDYQIAKTLGRRLFDHRRDYNVGLDIKTQVTLEGTKVRQVIVAAPYLGEEKMKNHWVLDPIRQWLGEAGKGAELIFNGTGRFTIHGPMGDAGVAGRKLAVDFYGGNCRIGGGSPWTKDPSKADLTLNLYARKLAMKKLGSCYADIVYCAISCCIGRPEIDVVFYNQNLVQIGAHREKRPPAEIIAELRLRQPTYT
GUT_GENOME142516_019527-360TAEAVTPGHPDKLCDLISDMVLDDMLARDPHAHVAAETCAAGDTVMVFGEINSHDGYKPDVESVVRRAFRTVGYTDPSYGATADGVRVLDGMHAQSAEIDAAVGDADGAGDQGVMVGHATDATETMLPPETELARQLAARLWAARANGIGWLRPDGKTQVTLQVDGGRTVLDSILISTQHDPGMTPAAIRGCVSMYIVDPVLAAHPELDATDWRLDVNPSGSFVLGGPEGDAGLTGRKIVVDQYGPSIPVGGGAFSGKDPSKVDRSAAYMARHIAVSLVHARLCHEATVRLAYGIGVAEPTDVSVDAHGTGIADDAILSAAVRDVFDLTPRGIIDALDLRRPIYADTALHGHFG
GUT_GENOME011980_00495108-443IITSESVNIGHPDKTCDTIADAFLDEALRQDKDSQMAVECAIKNDKIFIYGEATTKAKIDYERIAKEVLKDIGYKCDFKVIKEISEQSPDINQAVIKTKLCANDQGIIYGYATNETKEYMPLPILLAHKLMQRYDQFRRKTDNFYADAKSQVSVEYVSDKPKHITTILISVSHSDKLSLEEIKRTIKKEVIDVVLKDYTNMVNENTNYIINPSGKFTIWGSFGDSGCVGRKIVVDTYGGVGKVGGGCFSSKNATKVDRSGAYYARYVAKNIVANGYADKCEIQVSYGIGLAKPISLNIDTFGTEKVALSEIYNYVNNNYDFSPQNIIDELGLLKPI
GUT_GENOME273472_002675-356FTSESVTEGQSDKICDKVADAILDAYLEKDPTSHVDCEVIAYENHLKLKGNIISAATIDHEKTARDVIAEIGYMDPDLKFDANHCGIDVDINRKLSKMPETFQKRGFPDYGVVFGYACKETPSYMPAGTEFANQLAKRLTYVRKMCIIPGLLPDGKIQVTMEYENGNPKRVDAVFISTQHRKEIRNGTLKYALNDEVISKVIPEKYIDENTKCFVDPIGDCLCGGPAVYLGATGRKIVSDTYGGYARSSGNFMAGKDALKAERSGAYMARYLAKNIVASEYADRCEVQLCYAMGLTGPVSVHIETFGTGKMSQEEFIRMIYKKEDLSLQSVIKKFQLNQPIYSNISCYGYFG
GUT_GENOME011293_001612-328ILTAEFVSKGHPDRLCDAIVNEIVNYVVLKDKDALCGLECAVHTNKVFVDGRIAAGKDEKVIDEEKIKEIVREVYHNAGYGKGSYGDASWNPFPENLEIILDVCIELLSDDERKLRQYSDDQNVVNGYAIDSKETDYLPVEHYVALRLGQTLNHNLLTSWYDRKYFGPDFKILVQLNKKNGKYSWERLTLSVQHIRGREFRKIYENRIWEINLELERLFENTRFSSLAEIDEEHFFLNGAGDFVQGGPEGDNGLSGKKLCIDFYGPSIPIGGGAIYGKDPHKIDVCGALRARELAVKLVKEHGYHSVFTTLAWSPGEMAPHIIEAYE
GUT_GENOME171657_040538-397FTSESVSEGHPDKVCDRISDEIVDMIYKEAKKTGVDPWTVRIACETLATTNRVVIAGEVRVPDTLLKKDKNGAVVHDAKGHPVINPSRFRAAARRAIRDIGYEQEGFSWKTAKIDVLLHPQSADIAQGVDNASDRQGEEGAGDQGIMFGYACRETPDLMPAPIYYSHKILELLAAARHKGEGEAGRLGPDAKSQVTVRYVDGVAAEATQIVLSTQHLDASWDSKKVRKVVEPYIREALGDLKIADNCVWYINPTGKFVIGGPDGDAGLTGRKIIVDTYGGAAPHGGGAFSGKDTTKVDRSAAYAARYLAKNVVAAGFADRCTIQLSYAIGVAQPLSVYVDLHGTGKVNEDAVEAALRKVLDLSPPGIRKHLDLNKPIYAKTSAYGHFGRK
GUT_GENOME057799_002313-336LFSTEQVSKYHPDKVADQISDAILTECLKQDPTSHVAVECLIKSNLVVLAGEVKTKANINYKDVASKVLRKLHYLDEYEYRFEINISEQSPEIDGAVINQEEVDELGAGDQGIMIGYADRTTQSKLPWPFELANKIINLIETDVEKPNSILKGDAKTQVTADMDNSEQKAKLIIVSVCHKSTVAFDDVKKYIKKIIDDNSIQYESLIVNPGGPWTIGGPTADCGLTGRKIVCDSYGGFIPVGGGAFSGKDPTKVDRSASYMARHIALRTLEKFPQYKTCEIRLAYGIGIPQPVSVAINVTPQFADMIEENRVKRFVLAHFDLTPKGIIETLNLL
GUT_GENOME087598_019477-353TAESVTEGHPDKLCDLIADTILDAVLTADPDAHAAIEATCTGGECHVFGETTLIPPDVETLVRGVYRRVGHPQPDMVRVTLDRQSPEIRAGVGDGETQGAGDQGVMVGYACDETPQLMPLPVVLAHMLARRMDTLRHGGVLPMLGPDGKTQVTVLYGDDGPRAVTGVLVSTQHAADTHVDMVRGLVERMVVRPVLADAASLLDVDVSGARVTVNPAGAWTLGGPWADSGLTGRKLAVDQYGPSAPNGGGALSGKDPSKTDRSGAVMARLIARTVVASGLASRCTVTLAYMIGRPDPVQVDVDTHGTGDVDDMLIRDGVLRLFDLTPHGIIRTLGLKRPIYAPTAVYG
GUT_GENOME259484_012792-205GEITAKTEQSIPYQEIARETLRFIGYDNDEAGFNCDTCSYIINVVNQSPDIAAGVDRDGAGDQGMMFGYATNETENYMPIAQQLANNISVIMFSKYKNDLLPWALPDGKCQITVQYDDNGNFEGVDTIVVSAQHKDKYSIKEITPDITNICNIALQGYAIKKDCKFYINPTGKFVKGGPAADTGLTGRKIVVDTYGGYAPHGGG
GUT_GENOME170401_00231804-1156RFYTAESVMRGHPDKLCDLIADSVLDACLQHDPASRVACEVMATHGHIIVAGEITTSAKPDVFNIVRDTLRDVGYDPKDYQIDCYIHDQSPDIAGAVEPELAEGEDEDTLGAGDQGVMVGYACNETPEYLPMPMVAAQRLVTLLEISRMTGVIPDIGPDGKVQVTMEYNGDTPVRITTVVVSVQHKEDTDINKLADLLDEYVFPLAFDGMPADDAEIILNPSGKFVQGGPDADTGLTGRKLMVDSYGTFAPHGGGAFSGKDATKVDRSAAYMARFIAKNIVAAGFAQRCQVTLAYAIGEKEPVMVDVNTFGTGGPCEDDCLSAAVRKAYDLTPAGIIKQLDLLNPIYSRTAAG
GUT_GENOME277287_008245-345SSEAVFKGHPDKICDQISDAILDACLKQDKASRVAIETLIKNDLVVIAGELTTNAVIDYKEIVKEVLTSLGYENLANLKVQVEVSKQSNDIALGVNKDGAGDQGIMYGYATNETKELMPLPIVLARRIAIKMDELTRPIREMFGADGKCQVSVAYDDYGNPLKVTTIVVSQQTRYNLDREFYTKFIINECILKVIPSYLIDEETEVLINPTGEFVKGGAYADSGLTGRKIICDSYGGVGRHGGGAFSGKDCTKVDRIGAYYARYVAKNIVASGIASKCEVQVAYAIGVAKPVSIYVDTFGTSKYSNDQILEAINKFFNFKPKAIRNEIINDEVSFKALAEY
GUT_GENOME058018_004664-346YITCESVFRGHPDKLCDQISDAILDEYLLKDKDSRVAIECSIKDNLVIIFGEVTSKAHVNLEKVAKRVLRDIGYFDNFVVITKVSTQSWDIAKGVDKLGAGDQGIMYGYATNETKECLPLPYVIARDISKAVENIRKEKYMDVLMPDGKCQVTVRYEDNKPKDIKTIVVSAQTKKGVKLEEVKRIIKEEVLIPMLSSTLDAIEILVNPTGAFFRGGPYADSGLTGRKLMVDTYGGVAHHGGGAFSGKDYTKVDRSGAYYARYVAKSIVEAGLADKCEVAVSYSIGVESPVSVSIDTFGTGKLSDDELLKLINEHFDFSVGNIIKELKLKDISYQRLAEYGHFG
GUT_GENOME142011_0210226-393FTSEVVSPGHPDKCADIIADSIVDRLIIEDSNSRVASEVFVAGKHIVIGGEVKSNAKLSQNDYEKIVKNALAKIGYDGKSAFTKEQALHPDDVKVQVLLNQQSPDISQGVDQTTGEIGAGDQGIMFGFASNEAAEFMPAAIVYARRLCDTVYNYALKNNQKLGVDIKTQVTVDYGTKENFENCKPQKIHTIVVSAPSVEGMPIEEVRTLIQGLIDNSGLPDKLYDKNSTIIHINPTGRYVNHSSLHDSGLTGRKLIVDSFGGYAPIGGGAQSSKDYTKVDRSGLYAARWIAKHIVASGLAKKAIVQISYAIGVARPTSVAVDTMGTYTKHNDDVLSAFVMENFPLTPRWITQKFALDKPSANTFLYAD
GUT_GENOME031228_005366-331EYVSLGHPDKIADYISQYLLDRYIEQDPMTRYAVEVQIKGNKVVLGGEITSNAYFSENMITVFVRQAVCQIGYTKAYQEFWGRKNTICGENLSVTVNISQQSPDIAQGLTGWGDQGIFFGMATNDPENCGMPYDHTIAKRLCKVLFESGVGGLDIKTQVVTEDNAMKKVIVAIPLLHEANIDTVKGIVWATIDGDYELIVNGTGRYMQHSTIADCGTTGRKLAVDFYGGNCVIGGGSPWTKDGSKADLTLNLLARRLAKKYAVKHGCDTFVRLACCIGRQEVDMCVLDAVGNVLEEDVLVIDPAKLRREFKLDTPIYASMCRWGLF
GUT_GENOME252376_001576-390ILTAESVTEGHPDKVCDAIADGVLDACLAEDPGARVACEVLATAGKITVAGEITAKTMPDISAIVARTVREIGYPSSDYEIEVITHDQSLDIAAAVGGGRQTGAGDQGVVYGYACLETDALLPLPVVLAHRLTRMLANARRLGTIRGLGPDGKAQVSVEYLFGLPSRIASVVVSCQHDEDKDLEELRREVLERVITPALEELPPDRETEILINPGGRFVLGGFEADTGLTGRKVMADTYGGLVPHGGGALSGKDGKGGPNRSPAKRVRLGEEAQGSERSFRRQAETETSGLCADEDRSGAYMARYIAKNLVAAGSAEKCTVSLAYAIGRAQPVAVDVDSHGTGEYPDAVLEQAVRGVFDLTPEGMIRTLGLDKPIFARFCNYGHF
GUT_GENOME181684_015271-160MTEGHPDKLCDIIADSVLDECLAHDALSRVACEVLATKGQVIVAGEITSLFEPDIPSIVHSVFRKAGYDPARFSVQCLLHRQSPDIAAARKTGMVRGLRPDGKAQVTVEYGEDGKPLRLDTVVLSAQHSPEIPADELCFELTDKVIAPALQVLPPDEVLS
GUT_GENOME017832_017945-338SNEIVFRGHPDKVADQISDALLTEYLKDDPNSRCGIEVAGGKGIIFITGEVTSASYVNVEKIVKSTLFNIGYDPSKYTIINNTGRQSQDIALGTNDEVGGAGDQGMMFGYACDDTEFYVPVAMNILQRLSIWYNDIVHEDEDFLPDGKAQITGVYDDDFKLVKIKDFTISYQNREVNRERTDKIIRDKILELCDGYEIESFHINPTGKFLVGGFDGDAGLTGRKIVVDSYQSFSNVGGGCYSGKDCTKVDRSGAYKARQLAIRMLKEYNLKWCEVQVSYAIGIANPLAIYIDSNIGNIIVDDKVYDEFKPTNIIKEFGLKHFDFTKTAMYGHFG
GUT_GENOME177550_0061223-412VFTSESVTEGHPDKIADQISDAVLDAILAKEAELEAKGYIAPNGTPAKLENVRCACETFVTTGTVIVAGEIRTEAYIDVQNIAREVVRRIGYDRAKFGFDCDTCGVLNLIHEQSPDIAQGVDESFDAQAGRTTDPLDLIGAGDQGMMFGYASNETPTLMPMPHYLSSRLAERLAAVRHDGTLPYLRPDGKTQVTVRYEGGKPVAVETIVISTQHAPEIEDMTIIEADMREHVIAPVMEAAGIPWEGADLYVNPTGRFVVGGPMGDTGLTGRKIIVDTYGGMGRHGGGAFSGKDCTKVDRSAAYAARWVAKNVVAAGLAERCEIELAYAIGVSHPLSIMVDTFGTGVVDEAAITEAVKRVFDLRPGAIIRDLDLRRPIYEKTAAYGHFGRE
GUT_GENOME024243_013184-379LFTSESVTEGHPDKIADQISDAVLDAILAQDPKGRVACETIVTTGQVHIFGEISTQCYVDIAHIARETINNIGYNRAKFGFDGNTCGVLISIDEQSPDIAMGVDKALEAKEGQAAEEETGAGDQGMMFGYATNETESYMPMPAYLANKLALQLTKVRKEGVLPYLRPDGKTQVTVEYDDGKPVRVDTIVISSQHAPEVSNEQIRKDIIEKVIAPIVPAEMLDAETKYYINPTGRFVIGGPQGDAGLTGRKIIVDTYGGMARHGGGAFSGKDPSKVDRSAAYAARHVAKNIVAAGLADKCEIQLAYAIGVAHPVSVLVETFGTGKISEDKIAELVRKNFSLSPTGIIRELDLLRPIYKQTAAYGHFATGIIRELDLL
GUT_GENOME035917_0265417-336YSVEGVLRGHPDKICDQISDALLDEFLSQDEDSKVAIECLGSGNTIVVAGEINSKAKIDIEDSVKRAYRKIVWHANRDINVINLLSVQSQQLQTNVKNGNAADQGIMYGYACNNEYNFLPYGYYIVNKICKRLDEYGEKSKLYYSDGKVQVVVKNEKIKQLYINVQHEKNADLNYIETKIKENVLYDIDCQEIIINKDKWFIKGGIENDTGVTGRKIMIDSYGGLICHGGGAFSGKDPSKLDRSAAYMCRYVAKNLVANNLANDCLVTVAYMFGEKEQIMLKVFADNNYSEELTNFVNKKYNFESSAIVDFLNLKNVKYL
GUT_GENOME203824_01038195-531DRQRVLSGILRSERFPNDIYPRLQLMIKDVNSLINHADFSFQRLDYIQDAALGLINIEQNGIVKIFSVAAVIFMPATLILSHVILKELAVIRREGQVMTYLRPDAKSQVTIEYDEQTHRPLRVHTIVVSTQHDDFIPTSKGVTEKMAEKRMQEKIREDVRTILIPRVKARLERAGDQLAELIGDDYILHVNPTGKFVIGGPHGDTGLTGRKIIVDSYGGRGAHGGGAFSGKDSSKVDRSAAYAARYIAKNLVAAGLCRKCQIELAYAIGVAHPVSVLVDTYGTGVLDEEKLSAIVNECFDMRPAKIIEHLNLRRPIYEQTAAYGHFGRTDVDLPWEH
GUT_GENOME097151_030298-337YVTESVLRGHPDKICDQISDAILDEYLKCDKESHTAVECMGFQNNVVLAGEVNSKAEIDVVDVAKKVYKSIGYQEDINVQNYISKQSCQLSCVVKKGEAGDQGIMYGYACNSQYNYLPTGIYISNMLAKKIDELRKEKYICLPDGKVQVTIKNSNIEKIIINVQHEKNANMELLREIILKEAVSKVVDSNNQIIINKDLGFIKGGFTNDTGLTGRKIMVDTYGGLVPHGGGAFSGKDPSKMDRTGAYMARFVAKSIVANGIAKECLVMLAYEFGEEKPAAVKIICDGNSCKSILREKILDEFDLRPEAIIERLGLRKIKFINTSTYGHFG
GUT_GENOME234993_001096-331TEQVSKFHPDKYADQISDAIVSACLEQDPQSRVACEVMVKDKTIILGGEIRTNAKVNYKAITRRVAKKLHYRVDKIINLIGQQSNEIYNGTSNDLCSVGAGDQGIMFGYACNETDSYLPFAFDLANRIISLIEQDVENNKKSILKGDAKTQITTDLDMQQNIGAIHTILISVCHKKHKSLEYVQHYVKNLLQPILRGFNGKLIINPAGTWTIGGPTADCGVTGRKIVCDQYGGFCPVGGGAFSGKDPSKVDRSAAYMARNMAVDIVKTFRTVKTCQIQLAYAIGVVEPISVNILTDGVNAPDNIYKYIKGKYRLTPTAITNELGLL
GUT_GENOME278617_0049139-265IYMIEKVNPGHPDKVADRIAGAIVDLAYKNNENPKIAVEVLIGHNECNVIIETNVGIKRIDIVNILERITGSNKIKLNLKIVPQDEKLAKNQSERIKCGDNGIFKGVPLNNLEKQISKLARKLYDNIPYDGKYILTEDHLIVCQSNATNEKIQNIINEVMDTTNLKIIINPLGEWEGGTNVDTGATNRKLGSDMAQSVTGGGIHGKDLSKADVSINIYAFKKAQETG
GUT_GENOME024247_002086-341AEYVSPGHPDRLADVLVESIVDLAVGRDPDALVGVECAVHTDHVFVDGRIAAGKGTCAVASDEIEAIARRVYRAAGYGAGWIPAPEKLKVIQNVCLEALSDEERAIRNVSDDQNVVGGYACNRPETDYLPPAHFIANYIGRAVDAWRWTVPGKFGPDFKVLVDITVARDARSSHHAGRGMRPASPPPYRWNRLTLSIQHRAGVFAAEQHAQILPVVRAALAELEGRGLAGIAALPLEKFILNGMGDFIVGGPEGDNGLSGKKLAVDFYGPEIPIGGGAICGKDPHKVDVAGAFRARQLALKLLKERGGSAVTVRLGWTPGDATPAFREAESVDSLC
GUT_GENOME183613_000574-317TSEYCAVGHPDRTCDYIASYILDRYLERDTNARVALEVQLKDCFCTLSGEVTSAYRFTNSELAEFCREAIRKVGYTDDYLAKWGEGNVISGSGVEVTIHISQQSEDIAQGVNREGWGDQGIFWGLAVNDVRRGFMPLDYWLARGTANALYRSGFGGLDVKTQVTTEDGRAVECVVAIPLDPVTEEDTKRVIADYVKSAVGSDCRVIVNGTGRYVKHGSIGDCGTTGRKLVADFYGGNSKIGGGSPWGKDPSKADVTLNVLARMKALEYLKAHGLDEVRCGISCCIGRSEIRVSYFDAADNLLETHTENDPPSHV
GUT_GENOME233451_008239-351TAESVTLGHPDRASDFIADRLLDAYRQKDPNAHVAAEVMLSHEVCTIAGEVTGKDSDKVNAQAVAEDALERIYGKTPDSFDVDLFPQSPDIEAAVDSGADQGAGDQGIVYGYATNATFCRLPIATMLARRITDRIDRYGDIGNLGPDGKAQVTIAKNPDGTFDHIKTVVVSVQHTEELTHEQVEAKVRALIAPVFEDYNLTETEFLINPSGRFVLGGYEADTGLTGRKLMVDTYGGLAHHGGGAMSGKDPSKVDRSGALMARWVAVQIVEAGLAEEAEVSIAYAIGKAEPVAVDVRTNGTWNTAPDRYIAEAVRRAFDFRPKAIENALGLQMVVWADTARLGA
GUT_GENOME052876_0163891-420LDEALKQDKYSRVAGETFASANQITIAGQLTTNAKLDIEKIVRDRIKEIGYDNAELDMDYRICKIDINITKQSSDIAQGVDVGGAGDQGIMFGYASDETEEYMPFAISMAHKLSRRLAEVRKTGILPYLRPDGKTQVTVEYEGNTVKRIDTILISTQHNDGIDMDALITDIKDKVINEVIPQKYLDENTKYYINPTGRFVIGGPLGDTGLTGRKIIVDTYGGYARHGGGAFSGKDATKVDRSAAYMLRHIAKNIVANGYAKKCELQVAYAIGVEQPMSIFVDTFGTNAIPEEEIEEKIRNKFDLTPKGIIEYLGLREPIFTKTTNYGHFG
GUT_GENOME056551_00156258-606MTKRVYTAESVTSGHPDKLADLIADSILDECLEQDPDSRVAVEVMLAHNKCFVSGEITTNAKVDYEYVAKTVIAQVGYNPNNLEFEVRIHEQSPDIAGAVNRAEQGAGDQGIVYGYATDETLNYMPLPAELAHRLTYRLEECRRNGLIKGLLPDGKSQVTVQFNGDRFDKILSVLVSAQHEEWKELGELKSEIKAKVIGYVFAEYDMSDVEILVNPSGRFVLGGYEADTGLTGRKLMVDTYGGIAHHGGGAMSGKDASKVDRSGAYLARYIAKNIVTAGLAEKCEVALSYAIGLPTPTSIDINTFFTGAVNERLIERAVSKVFDLRVRTVIEYLGLKKPVYAGTAVGGH
GUT_GENOME114528_011438-326YTVESVLRGHPDKICDQISDSLLDLFLQNDADARTAIECLGTKDTLVVAGEVSSTAVLPIEQSCEEIYNQITGYSGLRVINLLSQQSPQLAKAVIGGGAGDQGVMYGYACDNQYNNYLPYGYWLVNMLAKRLDILRTQTHSFEPDGKIQALVFEDRIIKLTINVQHVVDADLSKINSLIRQHVLSDIGNENIAINPDKGFIRGGFDNDTGLTGRKIIIDTYGGLAPHGGGAFSGKDPSKVDRSAAYMCRYVAKNIVANGLAKECTVSVAYNFGEAQPAMLCVVADGNDSTKIKSFVEQTFDFRPQAIIERLELKQPIYF
GUT_GENOME243183_015094-335TVENVLQGHPDKICDQISDALLDECLKLNKYAHTAIECLGTGHNVVVGGEIDGIEDLELDVVKIVNDLYHSIGYEEFLSVQNLLRCQSSQLKMGLIKDGAGDQGIMYGYAVNNKYNYLPYGVYLSSLIAKEIDKYRLNSDYLLPDGKIQIIVNDNRLEKLSVNIQHKGKVDKCFMENDIVDNVLSKFIDPSETEFNINQNSNFLQGGFLNDTGLTGRKIIVDTYGGIAPHGGGAFSGKDPTKMDRTAAYMARFVAKNIVANGFANECTISVAYAFGEERPIMLQVVTEKRDNHKMLTQLINERFDFRPEAIIERLGLRQFSFLPTSRYGHFS
GUT_GENOME028266_015214-352IITCEQVSNGHPDKICDQIADAIVTDILQHDRNARVAIECLLKKSQLFIAGEVTTDYRPNYNQIVHDVFNRIGAEKLGWNLTELLRIGILVDKQSPDIAMGVDKGGAGDQGIMYGYATNETAEQMPIPYMVATKFLQLLKNHPSKMFRADAKAQVSYDYDTGRITTFLCSVQHSPDVEVSDFRHIIESMMVLAACEYGLDGDFTKLVNPTGRFVLGGSYADCGVTGRKLACDTYGGIGRMGGGALSGKDPTKVDRSAAYMARKIAKDIVQAGYADKCEVQLAYAIGVAEPVSVYVNTYGRSHVAMSDGEIAAKIKEMFDLRPKAIERKFKLRQPMYLETAAYGHMGRKN
GUT_GENOME278725_02546138-533YLFTSESVSEGHPDKVADQISDALLDEFLAYDPNSKVACETLVTTGQVVLAGEVKSNAYVDLMEVARRVIHRIGYNKSSYKFDADSCGIFSAIHEQSADINRGVERAEAMSQGAGDQGMMFGYACNETDNYMPLSLDLSHLLLTELAVIRREASAMTYLRKGKVTSYLRPDSKSQVTIEYNERNEPVRVHTIVLSTQHDEFVTPADDSKEAQEAADREMLQTIYNDVKQVLLPRVIAQLPERVRTLFDDQLILLVNPTGKFVIGGPHGDTGLTGRKIIVDTYGGKGGHGGGAFSGKDPSKVDRSAAYAARHIAKNLVAAGVSDEILVQVSYAIGVAEPVSVYVNTYGKSHVSMSDAEIARRVCEIFDMRPKAIEERLKLRNPIYEETAAYGHMGRK
GUT_GENOME092646_00280117-342MFEKVNPSHPDKVADRIAGAIVDLAYKQDDNPRIAVEVLIGHGKCHIIAESSVYIDKADIKLAVKRIAENVDVDYVEVPQDKHLADNQEGQVRCGDNGIFKGVPLTREQKELSKIAQDIYAKYKFDGKYILNGKRLIICQSNAASKDLKAIYPNAEINPIGDWTGGTNVDTGATNRKLGSDMADSVTGGGLHGKDLSKADVSVNIYAFLKAQATGTVVELCCAIGD
GUT_GENOME037932_014376-327FTSESVTCGHPDKICDQIADRILDALLEQDPQSRVACEVTCTTDAVHIFGEVTTKAEVDHAGLARRVIEEIGYTEPNRGFDAETCRIKVDLHEQSPDIARGIARRNEAERINTGAGDQGILFGYACKETVNRMPLPIELAHALAKRLELVRRSNELQYLLPDGKTQVTVEYDGQDRPCRVAAVIVSSQHREGVDIHTLRDDILGHVILPTIPSALIDKDTQFFINPTGRFVMGGPAADSGVTGRKPISDTYGGCARHGGGSFSGKDASKVDRSGAYLARYIAKNIVAADLADRCEVQLAYAIGLAEPVSFRLDTFGTERLKK
GUT_GENOME050028_0177516-364YIFTSESVTEGHPDKICDQIADAIVDAALAQDPYSNMAVECTIKDDFVLIYGECNTKAELDYETIALNEIKRIGYTEEYHVHTVVNQQSPEIHNAVNRDEVCAGDQGIMFGYACDENDEYMPLTIYYAHLLARQLTKVRREHPDLLKPDGKTQVSVEYENDEIKRIDTIVVSTQHSKDITQEEIKKLIMDEVIKPCIDEKYLDENTKYFINPSGSFVVGGSWGDSGTTGRKIVCDTYGGRGRIGGGCFSSKDPTKVDRSAAYAARHIAKNMVAAGVADEMLVQVSYAIGVARPINIYVNTYGRSNVKLSDGEIAKKIDELFDLRPKAIEERLKLRNPIYEETASYGHMG
GUT_GENOME011972_0113253-406TCESVRIGHPDKLCDFIADFILDSCLEVDPDSRVACEVMATGGKIIVAGEITCELRPYIESDVKKALQICGYSPDDFRVFVNCHRQSKDIASGVSNSLEERNGCEDRYSSLGAGDQGTVYGYATNETEEMLPLPLVLAHNICREIDIQKREHRLRGVMPDGKAQVTVEYENGKAKRIKTVVVSIQHSWKKPIKELEDEILNDVLPVALSVFPADENTEILINPSGCFNIGGPEADTGLTGRKIMVDTYGGLAAHGGGAFSGKDATKVDRSGSYMARYIAKNLVKAGLADCCQVGISYAIGKAEPLAVSVESFGTSNLSDEELAEIVTKVFDLRPAAIIERFDLKKPIYKKTSAN
GUT_GENOME199909_00772196-546KYVLTAESVTEGHPDKLCDTIADAVVDACLEHDPTAHVACEVMATAGKIIAAGEITAAWLPDIPAVICRTVREAGYGGSDYEVEVITREQSPDIAGAVNSGAETGAGDQGIVYGYACDETPELLPLPVVLVHELTRLLTQARKLRTIPGLKPDGKAQVSVEYQFGVPKRVSAVVVSCQHEDGVELAELRHAVMRHIIQPAFQDFPLDAETEILINPSGRFVEGGFEADTGLTGRKLMVDTYGGLAPHGGGALSGKDGTKVDRSGAYMARYIAKNIVAAGLAERCTVSMAYAIGRAKPVMVEVDAHGTGKYSNAALEQAVRTVCQLTPNGMIDTLGLDKPIFRKFCNYGHFT
GUT_GENOME044882_008866-287KYFFTSESVTEGHPDKVADQISDAVLDTLLAQDPNAHVACETLVTTGMAVIAGEITTTGYADLPHVVRETIREIGYTSSDMGFDADTCAVISSIDHQSPDIAQGVLRAAPEDQGAGDQGMMFGFACNETPTLMPAPIFWAHQLSQQLTKVRKDGTVPFFRPDGKTQVSFEYLDGKPVRINNVVVSTQHAASASQADIIDAVKKHVIRPVLEPSGYFDEKDCEIFINTTGVGEIFASPVPSGHAARPGDAVLVSGALGDHGLTVMGSREGLSFLTDVASDSAP
GUT_GENOME275843_005535-349YTCSSSFSGEYNKVCDKIVESITDYILHFDCDALINIHCSIFPGVVILSGVVGQINKTIPYEEIAKETLRFIGYDNNKTDFNCNTCTFINNIRTDMDWFLEEDMGSGEYGPVYGYACNETIDFMPIAQTVAQNISVITFSKAQKGELYWSMPYGKCQVGARYENGKFKCIDSIIYSTRYKDVKDSKDIIPDITNICNIATLGYKVDENCRLYVNPLTEISNAFGTSGVKAGENAYGPYCPIAISGVGKSNNYCARYGSYMARYMAKNFVAAGACDAMTVQLIYTAQWDKPFEIDLNPVNSEVSVEELKKIADITFNYNGKAIINNLNLRGTSYLPYACFGQYGRP
GUT_GENOME096387_019921-323MFTTESVTAGHPDKLCDAISDAILDTALTQDPNARTAVETLATPGRITLAGEVTTTATLDTARIVRDTLTHIGYNPDQYVVDDLIASQSPDIAQGVDTGGAGDQGHMFGYAVDAPGYLPAPVTLAHRLARAVHANPFGLGPDGKTQVTFDGDQLRTVLVSVQHPSSLEQAEVREQVQQVIAPHVDLTGVTLLVNPTGRFVLGGPDADTGLTGRKIIVDTYGGYARHGGGAFSGKDPSKVDRSAAYAARQAALSLIANGYATECEVQLAYAIGIPEPVSIHVDTKGTGNTKAAEKALTNIDFTPTGITQRLNLRQPIYTNTAAY