UHGP-MC 35103


Information


Number of sequences (UHGP-50):
270
Average sequence length:
220±19 aa
Average transmembrane regions:
0.02
Low complexity (%):
1.75
Coiled coils (%):
0
Disordered domains (%):
1.58

Pfam dominant architecture:
PF01255
Pfam % dominant architecture:
8370
Pfam overlap:
0.88
Pfam overlap type:
equivalent

Downloads

Seeds:
MC35103.fasta
Seeds (0.60 cdhit):
MC35103_cdhit.fasta
MSA:
MC35103_msa.fasta
HMM model:
MC35103.hmm

Sequences list (filtered 60 P.I.)

Protein Range AA
GUT_GENOME207783_0237715-234GLHVGIIMDGNGRWATRQGLSRLRGHEAGVEAIRRVVEAAPDQGVGTLTLYAFSSDNWRRPRTEVMGLMALLRVYLASEVESLVRNGVRLSVIGRRDRLPGGIADAIGRAEAATRDGTTLHLRIAVDYSARDAILHAAAKARDVDALTREAFSDLVTGEAGLRDVDLIIRTSGEQRLSDFLLWEGAYAELHFTQRMWPDFDAEDLAQALTSFRGRERRFG
GUT_GENOME242956_0001616-243AQSLRHLAIIMDGNGRWAEARGLARTEGHRRGAEVMRDVARACADRGIRILSLYTFSTENWRRPTSEVNALMKLIATFFKKYQKELIEEGVVCRFAGDLSGLSPQVQKIIREVKELTLEKPRMQLVILLNYGGRDEIIRAVRKLAKSVVEGELKAEQLNEAHLSQQLDLPDLPDPDLIVRTGGELRLSNFWLWQSAYSELYFTDTLWPDFSGSDLDEALRAYAGRKRR
GUT_GENOME147629_015828-232PAHVAIIMDGNGRWAQQHLRPRAFGHKAGRLAVDRVVTACAQRGVKTLSLYAFSTENWRRPPHEVKVLMELIEAALREEAAKMQRNDIRLRVLGDRLPLSGSLRDAIDAAESLTAACSRMDVVLAINYSGQWALIETARRLAQAAAAGKIVPTTLDEKAFAAYRPMPDLPPVDLLIRSSGEMRLSNFHLWELAYAELWFTDVLWPDFGEAQLDAAFAAYHHRDRR
GUT_GENOME235296_0080060-284ADCGLRHIAFIMDGNGRWAPRRGLLREAGHSVGARVFKRTVTTCRDWGIPTVTAYAFSTENWKRPQKEVSEIMRLLDAYIKEAREDNEKNRIRYIFLGDKERLDDELREKCESLEALTADNRLTLNIALNYGGRDEIVHAVNALISEGAEHVTEGDISRHLYTAASPDPDLIIRPSGEYRLSNFLLWQAAYSELYFTKCLWPDFDEAQLQLAVTEFSKRKRRYGG
GUT_GENOME143124_0093112-240TDAVPRHVAIIMDGNGRWATQRFLPRVAGHKKGVDAVRAVVEACAVRGIEYLTLFAFSSENWRRPPDEVSLLMRLFVHALEREVARLARNGIKLKVVGDISAFEPRLRELIADVEARTRDNDRLTLTIAANYGGRWDILEATRRLLAARPDLAAHPEQLDEAMLTPHLSMAYAPEPDLFIRTGGEQRISNFLVWQLAYTELYFTDCFWPDFDADQLDLACKSYRSRERR
GUT_GENOME025979_010615-219RLAIIMDGNGRWAKKRGFPRSAGHREGVRNIGRVIDRCIALGIPVLSLYVLSTENRKRPREEVEGLFNLAKSYFRKISEFNRKGVKVVVSGGREDLPSDIIAEIVEVQRKTAGNGILTLELCFNYGGKREIIRAAERFAESGEKVLTEESFERFLYNKLPPVDFIIRTGGEMRLSNFLLYQSAYAELYFTDVLWPDFSEEELDKALTEFGRRKRN
GUT_GENOME026833_008297-236HIGLIIDGNRRWSKNNGVSIMDGYKSGLIALENVIHHFAGSEVKCISVYAFSSENAKRPQAEIDAIMSIIGGFIDKHPKNVALSFVGDEKYIPKELLEKIHSFDNENKEFDMLVNICIGYGARNDIVRGARRLLERSIDTLALCQEEGLTEDETSKQIDALFTEDGFKACLSTSFLPPLDMIIRFGGERRLSNFMLFEAAYSEIYFSDTLWPDATSAEIEEFIESFKNTK
GUT_GENOME212726_0145822-247PENLPQHVGFIMDGNGRWATKRGMPRKFGHREGAKNFKDLSLYCRDIGLRNASFYVFSTENWKRPQDEVDAIMDLFWDYLCDADQYLERRVRMVFVGDRSRLQDRLQKRMAELEHDSKDFEKMNLILAINYGGRDEIKHATQEIAKEVQTGVLSPESITEQTIADHLYTAGLPDVDLLIRPSGELRLSNYLIWQCAYAEYYFTNVLWPDFHANELNKALADYAQRG
GUT_GENOME141763_0247965-291DRLPNHVAIVMDGNGRWATQRGLARTEGHKMGEAVVIDIACGAIELGIKWLSLYAFSTENWKRSPEEVRFLMGFNRDVVRRRRDTLKKLGVRIRWVGSRPRLWRSVINELAVAEEMTKSNDVITINYCVNYGGRTEITEATREIAREVAAGRLNPERITESTIARHLQRPDIPDVDLFLRTSGEQRSSNFMLWQAAYAEYIFQDKLWPDYDRRDLWAACEEYASRTR
GUT_GENOME114554_0056212-235NNIPKHIGFIMDGNGRWAKKRGLPRTAGHREGAFALKRVCEACRNFKIKTITVYALSTENLMREKKEVDYIFELIEEFISKELSNAEKNGVKINIIGDLNHERIPLSCKEVILNAMEQTKNNTNFILNIALVYGGRDEITRAVNKILASGKKSITKEEFSNYLDTAGQEDPDLIVRASGEQRISNFLLWQLAYSEFYFPKEYWPEFNKDLVKKCILEYQTRTRK
GUT_GENOME069333_005132-212VPRHIGFIMDGNGRWAALRGLPREAGYAEGLKAMMKVIARLAERGVEAVTVYAYSSENFARPSSETAAILSVVTGWNAAYDGNMKVTYMGDFSRFPQAVADSIAEVERRTAANTGMRLNIALGYGGRDDIVRAAAAAARRGEITRQTFESELSSNTLPPLDLIVRSGGEKRLSGFMLYEAAYSELLFSDKLWPDMTEEDADGALDEFARRT
GUT_GENOME103624_0001888-313EFLPRHIALVMDGNGRWAQQRGMTRTEGHRRGEAVLLEACRACLEAGVPWLSAYAFSTENWTRSYEEVRFLMSFSRDVLRWQRDELNEMGVRIHWAGRRPRLWPSVIKELEIAEELTQHNDNMHLVMCVNYGGRAEITDAANRLAMAVKEGVLDPRDISEKTLSDLMYVPEMPDVDLFLRPSGEKRTSNFLLWQSAYAEMVYQDVLFPDFDRSHLWAALDEYARRD
GUT_GENOME065756_015665-230NVPQHVGIIMDGNGRWAKKRGLPRSFGHKQGAAVFKKTINWARELGVKCVTFYAFSTENWKRPADEVEGIMNLLRQYIKDIRAAAREDIRFIFLGDITRLPEDITAELLDIQQSTVNNTGFIAGVALNYGGRDEITRAARELAREAASGEISPDGITEDSIEKLLYTAEMPPLDMIIRPSGEQRLSNFLIWQAAYAEFVFMDVLWPDFTKKHLEYAIQIYNDRDRR
GUT_GENOME143283_0049026-255KESVPNHIAIIMDGNGRWAKKRALPRAAGHREGMKNIRTITRCANKIGVGALTLYAFSTENWKRPALEVDLLMRLPEEFLNSFLPELMELNIKVQMIGEMDGLPDHTQRAVQNAINQTAVNTGMVLNFALNYGGRRELLLSVKEIAHQVQNGTLNLDDITEETISSRMMTATLPDPDLLIRTSGELRLSNFLLWQLAYSEFWFTDVHWPEFKEDDLLQAIESYQSRNRRY
GUT_GENOME095357_0035356-324LEGKPVPRHIAVMCDGNRRWAREAGFIDVSHGHRVGAKKISELVQWSGERGIELVTIYLLSTENLHRDKAELQLLCRIIGDVSDELAKSKLNARLRVVGHRDILPDALTDRLATNEKATANHTGVCVNMAVGYGGRQEIVDAVRSLLEDRLKEREEASRDSSTKDDERAHQVELVSSSRGNELGEDYDDDQRFSLEDIAEAITVQGISDHLYTSGQPDPDLVIRTSGEQRLSGFLLWQSAYSEIWFTDTYWPAFRRVDFLRALRDFSRR
GUT_GENOME095596_0101913-246ENIKLPRHLAIIMDGNGRFAQKRMLPRSYGHRQGTINLTNVCRIVDAIGINYLTVYAFSSENWRRPAAEVNELFSLVATYFNKYIDELMQMQVKLNFIGNLTQLPEDLQKIIHSAVTRTAENKRLQLTIALNYGSRLEIAEAAAATALLLQKRNDLSHLSLTEVEKIFEANLQTAKLNLPDPDLIIRTAGEMRLSNFLLWQAAYAEFYACDCLWPEFGEKELLAALLAYNQRTR
GUT_GENOME008120_001487-230LPNHIGFIVDGNRRWARERGLPTYNGHKRGFEVLKQMAYVAQARGIKHVSAFIFSTENWRRSEDEVGYLMKLFLSAFQNDMRQMVKDGFRIIFLGRRDHLSKQILDAIAETERDSANNTSTTLALCFNYGGRAEIVDAAKHLAAAVQQGDVSLKDFADSDLDKYMYHPELPDLDMVVRTSGEERISGFMLWRSAYAEFLFLRKNWPDMTEADFDACLDEFAHRQ
GUT_GENOME109809_0018110-235ENKMPLHVAFIIDGNGRWAKKRGLSRSMGHKVGFNTLLKMVQEVRALGIKYMSVYCFSVDNWNRPQAEVDYLMQLFRDILNPRKYKKFKEGARINIMGDMSRFPEDIQANAKKAVLDSQNNTDLVLNLGLNYGGREEIVYAVNKLIAEGKTHITKEDISNNIYTAGQPDPDFIIRTSGELRLSNWMPWQSTYSEFYFPKIHWPAFNKKHLITALVEYQSRNRRFGA
GUT_GENOME121462_0077965-287FSVLPRHLGIIMDGNGRWAKKRALPRTAGHKVGADVFKKISKECGRIGIEEVTFYAFSTENWKRPKEEVETLMHLFYDYLIEAKNDLQTSGNLRVRFIGEEEGMHPKLLELMRDAEKETASRTGTLINIAVNYGGRQEIISAVNRLIAQGKTSITEQDISENVYTAPDCDLIIRPSGEERLSNFLLWQSAYSEFWSSDVLWPDFTEQDLHLALAAFEKRNRRF
GUT_GENOME255028_009719-239IAAGLPEHIAIIMDGNGRWAEQRGLDRVYGHKQGVDTVKSVVEASGEIGLKYLTLYTFSIENWNRPRAEVDALMGLMTDAIIRETSELMRQKVRVKVIGNTGDLPEEVWHKLEGLIHRTAANTGVTLVLALSYGARWEIVEAARRLIADYKTGRVADTEEVTDELFARYLTTAGMPDPDLLIRTSGECRLSNFLLWQCAYTEFYFIDKFWPDFEKDDLYLAIRNFQQRERR
GUT_GENOME076999_0097527-260FPKNKVPRHVGVIMDGNGRWAQQRGLVRTDGHQAAEPVVFDTIAGAIEAGVRYLSLYTFSTENWKRSPQEVRFLMGFSREIIHRRVAQMDDWGVRVRWSGRRPKLWKSVIDELEVAMERTKHNKVIDVVFCINYGGRAEIADACAEIAREVAAGRIKGDRVTEKMIADHLYNPDIPDCDLVIRTSGEQRTSNFLPWEAAYAELDFVPELFPDCGRDVLWRSIDHYIHRDRRFGG
GUT_GENOME143018_005113-219ELKHLAVVMDGNRRWARAKGFLAKLGYSQGVKTMQKLMEVCMEENISNLSLFAFSTENWKRPKDEIDFIFELLDRCLDEALEKFEKNNVRLRAIGDLSRLEDKVREKITLVEEKTKHCDALCVNLAISYGARDEIIRAVKRVIEKKLELNEENLTQNLDLPLDVDLMLRVGNAKRLSNFLLWQCSYAEIYFSETLFPSLTKREFKRIIKEFRNRERT
GUT_GENOME248056_004999-225LTRVPKHIAIIMDGNGRWAKERGLERSEGHRKGLDVVHEITDAAAEVGVQYLTLYAFSTENWNRPQAEVDALMALVVFGIERETPGMVEKGVRLSVIGDLDRLPSDVKARLLKCIRDTSGGERIVLNLALSYSSRWEIARAARCIAQEVQKGRLSVDDITETTVDEFMTTVNMPNPDLLIRTGGEIRLSNFLLWQLSYAEFYFTEEYWPDFTGESLY
GUT_GENOME211787_0015513-233LNNENIPKHIAFIADGNGRWATERGLPRFEGHKAGRETIKRVLDRCFERGVETVSLYCFSTENFNRPKEEVEYIFNLFRSMKDIGLSLIKRDARFHLMGDLSMLPSDIQEQMISITNQTKDCKSHILNFGFAYGGRHEIVSAVNNLIKDGVEVVTEETFEKYLYTAGIPDPDLIVRASGEQRLSNFMLYQAAYSEFYFPSKYWPDFDENVVDECIEAYQKR
GUT_GENOME155396_001754-234LQHIAIIMDGNGRWASAKGKNRTFGHRQGAKVVREITKYCAGQNIKYLTLYAFSTENWLRPKSEVDFLMKLLEHYLQSELEVYQNNNIRFVVIGDVEGLSKKLQKLIHAMQDATAQNTGLTQFLALNYGAKNEIARSALSLYDRNVFAQAKTLKREQAIEYIAKNIENTLDTRGASDVDLLIRTGGEMRLSNFLLWQCAYAELFFTPTLWPDFSAKELESILQDFYHRIRR
GUT_GENOME071644_005968-216DKISHIAFIMDGNGRWATARLLPRSAGHRAGINVMVDIADACFKRDIHLVSFYAFSTENKFRPVDEVNGLKTLIKTKLPSLSKRLKKNNVRLNVIGDRSYFEPDVRTAIDESESLTADSDGGILNLALNYGGRQEILQAAGSEDFEGSLYTAGQPDPDILVRTGGEKRLSNFMLYQLAYTELFFLDTLWPDFTETQLDEVIREYYNRDR
GUT_GENOME098572_0369113-244DLNIPEHVALILDGNGRWAKKRGLPRTAGHKQGCVTVEQTVENAARLGIKYLTVYGFSTENWKRSTEEVGALMQLFRYYLKRLLKIAKANNVRVKMIGDRTRFDRDIIEGIDTLERETKDNLGMTFVIAVNYGGRDEITRAVKKMMEDCREGRLEADSVTEQVVSSYLDTAGIPDPDLLIRTSGEIRLSNYLLWQLAYSEIIVTDCLWPDFNKEELLKAIAQYNKRDRRFGG
GUT_GENOME026433_009645-220VTHIAFIMDGNGRWAKARRLPRIEGHRRGLNTAERTIDACLALGVKYVTLYVFSTENWKRDKKEVDGLFDLAYRYIDRFEVFCRDKARVVVSGRRDRLPDRLLGKIVEIEQKTAHFDKICVNLLIDYGGQNDIADACAKVCASGVPTVENIRGQLYNAFIPDPDLIVRTGGQKRLSNFLLFQAAYAELQFSDTLWPDFSEEELRSIVADYAARTRN
GUT_GENOME159536_0127737-271GLRHVAFIMDGNGRWATARGLPRESGHKEGMVSFEQVAEYCADIGIRYVTVYAFSTENWKRPKHEVNAILMILRRYLKRAMKLIDQKRVHFVVIGDISPFDEKTQQLIRDLDEKSACYDRVLNIALNYGGRAELVHAMNEAVREKAEQGVSSAESMLPGMGELLTEDDIERHLYTYPCPPPDMIVRTAGDKRLSNFLLWQSSYSELIFSPVLWPDYRQGDVNDSIRDFLSRQRRY
GUT_GENOME103750_0036753-286LDYDRMPKHIAIIMDGNGRWAKERGLPRMMGHREGMKTIKKILDRSAHLGVKYLTLYAFSTENWNRPKDEVNALMNLIIEFAQKEGENLHKNNVKFNTIGDISKLPSNSLKAIKDLKELTKNNEGIVLNIALNYGGRNEIVYATKKICEKVCKGDINIEDINENLMSMNLYTKGMPDPDLIIRPSGEQRLSNYLLWQSAYSEFWYSNIKWPDFSERNLEKAIYDYQNRDRRFGK
GUT_GENOME116534_00980161-391NEPRFPTHVAIIVDGNRRWAKQRMLPVSMGHKAGFDRLREITRYAVGDRGVFVLSLYVFSTENFSRDAKEVGYLMDLFANNFRTIADEMNERGCKVLFSGRREGLQQRVLDAMDYMMDLTKDNEKGIVNFCLNYGSHAELTDATRAIAAEVKAGTLEPEAIDEKVIEKHLYRDLPPVDLMIRTSGEERLSNFLLWQCAYAEFYFTPTLFPDFTKECFDKALAEYARRDRRM
GUT_GENOME140362_010546-223HLAIIMDGNGRWAKNHGLIRTKGHEVGAKAVTDIAKFCIENSIKNLTLYAFSTENWKRPKKEVDFLMKLLKDFLVSKRDLFIQNSIKFHTIGDIEPFSKDLKDEILNLKNLTSKFDKLNFILAINYGGRDEIIRAANKALGHSKDGKLSEKSLSQNLDTSEFGDVDLLIRTGGEQRLSNFLLWQASYAELCFTPTLWPDFKSSELKDIIQKYQKTHRK
GUT_GENOME009159_0075213-241TSALPRHIGIIMDGNGRWAKKRGLPRSAGHKAGAESLKKIITEANNLGVEYITVYAFSTENWKRPKEEVDYLMGLLMDYLINAEKTLAGENVVIRAIGSRKELSEEMQRQIVKTEEFTKNNTGIVMNIALNYGGREEILHAVKDISRAVKAGEISADDISERNISDALYTGGQPDVDLLIRTSGEMRLSNFMLWQVSYAEMWFTDKLWPDFKPADLRKAIYDFQNRGRR
GUT_GENOME259529_001964-225NNLPTHIGIILDGNGRWATKRALPRLAGHEEGAKAIKRMLVASKKLGIKAVSVFAFSTENWKRPKDEVDGIFKILENTIDENFDDFVKDGYRLEICGNLEKLKKPLKEKIEKLKDASKNNNSLIFNVCLNYGGRAEIVDACNKVLKSKKKSVDEKTFEEFLQSSILPPLDFVIRTSDEQRISNFMLWQIAYAELYFPKFFWPQFNEKKLVKCLKEYSKRKRR
GUT_GENOME274362_0032316-245DLARLPQHVAIIMDGNGRWARQRAQERSAGHAEGVNSVNRITRFSSDLGIKYLTLYAFSTENWNRPQGEVDTLMHLIGWTIRRETPELKANNVKIHILGETDRLPAEVRESLFEGVRETAECTGLNLLLCLSYSSRWEITRAAREMAEKARKGELDPAAIDEKTVSDHLSTHSIPDPDLLIRTGGEQRISNFLLWQIAYSELYFTDVLWPDFDNSAYLDALIDYQSRERR
GUT_GENOME096381_06346293-521GHAMPGHVAVIMDGNRRWARRRGLLAWQGHHAGGRALIRLVNAALRLGIRHLTVYAFSTENWNRAQQELVTLFAAAADAALQGAAWLHELGVRVRWCGRRDRIEQSLAAALALLESMTFGNDVLNLTVCVDYGGRDELAAAARALAAEAVAGTLRPEDIGPDDLARCLYAPGLPDVDLLIRTSGEQRLSNFLPWQLAYAELLFTPDQWPDFGLAQLREACAVYASRERR
GUT_GENOME050656_005435-233RFPRHVAIILDGNGRWAQQRGRERTYGHQVGAANVKNIVRAAGHMGIRCLTIYAFSTENWKRPAFEVNFLMHLFKNYLIGQLRELITDNVQVHIIGDTMELPSSLQEEIKVCEKDTANNDGLILNVAINYGGRMELVNAVRDISQKVAEGRMEADSVTEETITNHLYPSASSDVDLLIRTGGESRISNFLLWQVSYSELYFTPVLWPDFSSEELEKAVFWFTGRDRRFG
GUT_GENOME067983_000483-228VPNHIAIILDGNGRWAKAKGMPRGYGHIKGCANLETICDDIKDLGVKYLTVYAFSTENWKRSREEVEGLMKLFRSYLKKCMKSAEKNHMRVKVIGDPSAFAPDIQEKIRQLEEFSKDYDELYFQIALNYGSRDEIVRGVRKLAQDVADGKLVPDAIDESCFENYLDTAGVPDPDLMIRTSGELRLSNFLMWQLAYTEFYFTDVPWPDFHKEELIKAIEKYNERDRR
GUT_GENOME052754_0085311-180VLPKHVGVIMDGNGRWAKKRDLPRYKGHIEGAKTFRKIGEFAADLGIKYITFYAFSTENWKRPKEEVSAIMNLFREYLKEALGRVEEDRKNGIRISFIGSREGIPADIRLLMKQVERIGRKEDRIMLNIAVNYGGRQEIAESVREIAEDVKKGKIKPSEIDMDTISSHLY
GUT_GENOME236867_018275-223NHLACIMDGNGRWAEQRGLKRTDGHKEGAYAAERVAKGCIKNGIRFLTLYCFSTENWKRPKDEVNYLMGLLSMQVQANIPKFNKLDIRILFLGNKEGMPESAYKGLMKAVEATKDNSTLTVQLAINYGGWDEIARAANKAVESGITTFTPETLLSFVDNPTVPSPDMIVRSAGEKRLSGFLLLQSNYSEFGFYDELWPDWNETMVDRIVDDYLSRTRKY
GUT_GENOME056856_0116111-173RIPQHVAIIMDGNGRWAKLRGQERSYGHQAGAETVHVIAEEASKLGIKYLTLYTFSTENWNRPSNEVAALMALLFDSIEEETFMKNNISFRVIGDLTKLPDNVRERLETCIAHTANNTGMSLVLAISYSSRWEITEAARRLAALVQKGELTPEQAQDPNNIEG
GUT_GENOME136911_006421-212MIMDGNGRWAKKHFLPRIEGHRRGLSVAERIISHSLERNIRFLSLFVFSTENWKRPNGEVSSLFSLAEKYVSKLEEFCKDGIRIVVSGEMKGLPDSLVEKIRLVCERTKYNDAICVNLCINYGGQAEIVHAAKALAEKGRPITIEGISENLYNPFIPAPDMIVRTGGQIRLSNFLLFQSAYSELFFVDTLWPDFSNAEYDSLIEEYSARTRN
GUT_GENOME009122_0063712-233MPKHIALIIDGNGRWAKQRGWARTIGHKYGFENLKKQIEFVRDLGIKNLSVYCFSAENWNRPQKEVNYLMELFDQMLDEYERDYLDKDIRIIISGDMADPRLPEKVKAKALALMEKTKSKAGFILNPCINYGGKQEILKAVNDCISEGLTQISEQDMTNHLYSAELLPLDFVIRTSGEKRASNFMLWQSAYAEWYYPKTYWPAFTKHGLVKALKNYMSRERR
GUT_GENOME282926_0040910-235AKVPRHLAIIMDGNGRWAQNQGLNRVIGHQRGAVTVERIIKHAAVRGIKVLTLFAFSSENWKRPSYEVQALMSLFARSLKKESKSLQENGIKTRIVGDISRFSESLQRSIADVESKTASGSAMVLNIAANYGGRWDILQASQAIYEDIASGKLDKRTLDENIFKKYLIVKDDVDLLIRTGGEHRLSNFLLYQASYSEIYVTDKLWPDFDEHDLDTALEFYAHRERR
GUT_GENOME232847_0125021-194LCLMLTDRELAADSAKLVSMLKWVKEFSEIKRFIIHISSDSEYIPLPLSEMEKHASVRVSSSHGDERYGSGALDVLIVIGKCGRQEICDAVSKIAESGISPEDITEEIIEANLLYKVTPDFIIKTGGNHLTDFLIWQSVYSELFFCDVNWNRFRYVDFLRALRDFQSRQRRFGK
GUT_GENOME013834_003238-226VKHIGFILDGNGRWAKKRGEERLYGHKMGVEAVRRTIDACLKYEIKYVSLYCFSTENWKRDKYEVDGIFKLLENFIDEHTQDCINKGVRVTFLGERDKFDDLLREKMAKLEYATKDFDNIFVNIMLNYGSRDEIVRAINLIIHDNIDKIDYNTLRTYLWTKDMPDPDLIVRTSGEQRLSNFMLLQSAYSELYFPKVFWPSFSERWLKKSLKVYSKRHRR
GUT_GENOME096069_0173112-236MPSHVAIIMDGNGRWARQRHLPRIMGHYAGVKAVEKTVMAALEMGIRYLSLFAFSTENWRRPKGEVLGLFGLFRYYIKCKVKKLVEENIRLRFSGRIDKLPEDLRDLALWAEKETEAGDKLDLILCLNYGGRQEIIDAVNKILAAKTDCDDVLEIDEDCFRGHLYLPDVPDPDLIIRTSGEKRLSNFWLWQSSYSELYFTDCLWPEFDEGKLREAVEEYQRRERR
GUT_GENOME237439_0044712-244KPLRHIAFIMDGNGRWAKKRLLPRSAGHKAGVKNIRNIVDECFFTYDIFACSLYVFSTENWNRPDKEISYLFNLLKVFFEDNIEDFVKKDVKILVSGDLEDPRIPKEVLDVINEAKARTSLCKSHIFNVLFNYGGRREIVTATKRIVKDIEEGKIGVDDINESSFGDYLYSSELKDVDLLVRTSGEERLSNCVLYEIAYGEFIFNDEYWPSYSKEDLKNDLIKYSSRNRRFGG
GUT_GENOME127787_0051235-254KNIKHIAFIMDGNGRWAKRRGMPREYGHKKGADTFRKIVTYCFEAGLQHVTVYAFSTENWKRSEREVRALMRLFTIFIEDAFEDIRKNDVRIVFIGDRSVFREDMQRRMAALERESAANRNILNVALNYGSRSEIVHAVNTLIERGEKKISVEMLNACLYTRLSPDPDLIVRTAGEQRLSNFLLWQAAYSEFYYTDTLWPDMNGEEVDRALLSFGNRQRR
GUT_GENOME128039_004475-224RHAAIITDGNGRWATRRGLPRTAGHTKGRDNFQRICDWCLELGIPYLSFYTLSLDNLKRDKAELDHINALAHELCSEDGIRRMREKGIRVLLCGRRDLVAESDLAAYEQAERETADCGKLTVMLQVYYGGRDEIVRAARQCVADGAEISEEAISARLYAAAVDAPDPDLILRTGGYSRLSGYLPWQSIYSELYITPTLFPDLTREEFAWACRWFESIERT
GUT_GENOME014890_015467-226IPRHVGIIMDGNGRWAKLRGKPRSYGHKKGAEVIDEIVSECFDKGIEAVSLYAFSTENWIRPKEEVNAIFGLLGLLLNSKFKKLMREKVRLIVSGDLSAIPENLRKRCEEVMALTADFTGRTLNVAVNYGSRAEIVRACNAIAQDGIKTVTEEEIEKRLYTAGLPDIDLVIRTSGEMRLSNFFLWQCAYAEFYFTDVLWPDFHANELDEALEWYSGRKRR
GUT_GENOME013578_0115931-263KIDWEKLPRHVAIIMDGNGRWAKGKGMIRTAGHSAGVKTLKNILKTAIGLKLDALTVYAFSTENWKRPHAEVDFLMNLFSEYLKKEVQEMHEDNVQIHFIGRIDGLPKALQQQMHDAEELMKHNTGVKFNVAANYGGQDELVRAMQAIAAESINGKLQAADINEAVIEEHLDTKGNLPVDLMIRTSGDVRVSNFLLWQAAYAEFWFTDTNWPDFTPECFVDAIVDYAKRDRRF
GUT_GENOME217388_004936-221PRHLGIIMDGNGRWAQKNGKKRQYGHRKGAMNVERIVEHAFLRGVSVVSLFAFSSENWSRPQEEVDAILNLIRDYFDRYLKRLVKNGVSVRVMGDVSALPEDLRESVARVTAETAGGKNILNIGINYGGRQALVRAAEKLRRGSVPVTDATLAGALDTAGLPALDFLIRTGGEKRLSNFMLYESAYAELYFTDVYWPDFDEGELDRALEEFSCRSR
GUT_GENOME029027_0124978-297LPHHVGIIMDGNGRWAKKRLLPRSAGHRAGMESLHQIVRTTHSLGIEALTVYAFSTENWARPAAEIDALFKLLSEYFYKEIDELNLNGVRIRTLGELSAFKPGLRSLMEDAVERTKHNTGLCFNIAVNYGSRDEIVHAVRRIAAEGIAPDAVTEQTIADRLYTSGLPELDLIIRTAGEERLSNFLLWQAAYAEFVFSKTLWPDFTPEVYFGCLREFLNRT
GUT_GENOME038979_0114427-252MKIPQHVAIILDGNGRWAKSKGMPRNYGHTVGAKNVEIVCKAAHDMGIKYLTMYAFSTENWNRPEGEVAALMKLLESYLKNCIKTADKNNMRVRVIGDTTRLSARFQKQIVELEAASAKNDGLNLQIAINYGSRDEMIRAMKKMYHDMENGSRQVSELNEDLFASYLDTAGIPDPDLLIRTSGEQRLSNYLLWQLAYSEFYFTDVLWPDFNKKELKKAVEYYNGRE
GUT_GENOME213858_0026318-246KAPLPQHIAIILDGNGRWAKKRGLPRTAGHQEGAMNVREITKLCGKLGVKALTVYAFSTENWKRPEEEVKFLMKLPLKFFDEFAPELIENDIRLKVIGNTEDLPTELQEKIRELSHQTKDNQTMTLTIALNYGSQDEIKQAVQQIAKEVKAGELKVEEITEDVIENHLMTHDLPPLDLMIRTSGEQRISNYLLWQLAYAELYFTSVAWPDFKENQLYEAIYDYQKRNRR
GUT_GENOME096459_0020768-294SEVPAHVALVMDGNGRWANSRGLTRTEGHRAGEYALMDTVAGAIDAGVRYLSVYTFSTENWKRSPAEVRFIMSYASDVLRRRTAQLDEWGVHVRWSGREPRLWKSVIHALRQAEATTRHNDTLDLVMCVNYGGRAEIADAARSIAEDVAQGRLKPGGVTERTFARHLYLPDVPDVDLMVRTSGEQRISNYLLWQLAYAEMMFVDTPWPAFDRESLWDCLLGYAGRER
GUT_GENOME114281_003282-216KHLAVIMDGNGRYAELKGFERSKGHEAGAIAFEKACRDFATLPLDVLTVYAFSTENNSRPKEEVSNILSVIAYFLEKRIYPFAAENGINVRFIGKIDDLPQNLKKTISIAPVFEKGKTIVIAINYGGTDEICRAFKKMSENKSDFTPENLINSLDTAGLPLPEAVLRYGGYKRLSNFMPVQTVYSELFFTDKYWPEYEREDFCKVINDFEKIKRN
GUT_GENOME245220_00705215-442REVPKHIGIIMDGNGRWAKKRFLPRAMGHRQGAQALDKLTKEAEGLGVEYITVYAFSTENWKRSKDEVDSIMGLLREYINRFFNDKKSILKIRSAGDLTALSPDLQADIEKLKEQSKDRPGMTLTIAINYGGRDEIRRAVTKIAEDVKAGKIQPEDINDQMISDNLDTAGIPDPELIIRTSGEFRTSNFLMWQSAYSEYYITDKLWPDFNVEDLKDAIKAYQTRDRRF
GUT_GENOME021799_0089123-252KKCDHVAFIMDGNGRWAKQRGLARHFGHHEACKRIIEIFEACREFNIRVMSFYAFSTENWNRPQEEIDHLMDYLEEFFQKEIDYLDSVGTKIMISGDLSRIRPQTREVCLQAVERTKNNDRWVINVCLNYGGRGEIVRASKVFARDVTEGKKNLEDLTEETFKSYLYTAGLPDVDLMIRTSGEERLSNFMLYENAYAEFVFTSTKWPDFRHDDFVDCLKEFNARSRRYGG
GUT_GENOME237836_006011-160MELFVDKFTQMLSDPRVVDNKINVTVVGDRTLIPPHVLEKIEAVEEKTKDFNDYYVHFALAYGGRNEIVETAKRVVSAYQNGEITLEEITPEKITESMYPAGDMPPVDLILRTANDRRTSNFLPWLANGNEAAVCFCVPTWPEFRYVDMVRALRTYDERM
GUT_GENOME025013_0009910-241RHLAIIMDGNRRWARRRGLPAAAGHRKGAETLEQVCRDAKDLGIKFLLNRSSDLSALYAFSTENWQRDAEEVATLMGLLREYLKNDLQEIQKNNVRIIFIGERGMLDKDIADSMQRIENQTAANTDLTLCLAVSYGARQEILNAVKRTAKLVRAGEIKLDEVDEHLFSNLLYTKDIPDPDLVVRTSGERRISNYLLWQVAYAEFYFSEVLWPDFNRAELEKIIADFNTRERR
GUT_GENOME237840_0138120-248KPIPKHIGIILDGNGRWAKQQGLPRIMGHAQGIKRIKKIIPEVKKLGVEVLTLFVFSTENWNRPSKEVDFLMDEAKKNYEKIIDSVTNLGYNVRVIGEEIKNNLELNNKINELNFLGSKGEKFTVVFAFNYGSKKEIVKATQNICQKVVNKELEIKDIDEAIIENNLYTKGLPPVDLMIRTSGEERISNFLLWQIAYAELYFPKIYWPDFDTEALYTAIENYQQRNRRF
GUT_GENOME258880_0105817-249GQKRLPRHVAVIMDGNGRWAKQRGLDRSMGHIEGVNAVRRVTEIASDLGVGFLTLYTFSTENWNRPQEEVDALMHLISVAIERETPDLIKNNVRLRLIGNLTRVPEYALQRLNNCVDATASCSGMTLILAISYSSRWEITEAARRIAKEVADGKLDATQVDQELFADHLTTKGIPDPDLLIRTGGDYRISNYLLWQIAYSELYFADIFWPDFSKEDFCDAILHYQSRERRFGK
GUT_GENOME275250_0126137-259NYIVPRHVGIIMDGNGRWATARGRERSYGHKKGAAIIPDVIDEIFKKGTYCLSLYALSAENLARPEKEVKTLLSILQKGIKKEGERAVKNGVRFVVSGDKSVLGDKARAEVEQLENQSAACEGHILNICFNYGSRQELCRAFSLMQKNGETEVAPETVNKYLYTAALPDVDLVIRTGGEKRLSNFLLWQSSYAELYFCDVLWPDFSSSDINAAYEWFSSRKRR
GUT_GENOME243601_009295-217KHVAIIMDGNRRWAKNHALSAHLGHQKGSENIDKVIKFCAYNGVKTLTLYSFSTENWKRDKKEVDFLISLFKKFIGDFKDSFVKNGVKFDTIGDLDSFDDELRFKIYDCKKATENGKFTLILAVNYGSRDEITRACQKVVRAGLEISQENIANALDSAPYGDIDLLIRTGGERRLSNFLLWQSSYAELMFSDTLWPDFDDIELAQLFECYKKI
GUT_GENOME174916_0000852-283IDLDRIPRHISCIMDGNGRWATSRGLSRTEGHKQGIVALRDVITTAVRLGVDVLSAYAFSTENWGRPQHEVNVLMHLFAKTFIDELPLLKRENVRVVFLGDISSLPKKTRDVFEFGLSECQDHTGMVLALAVNYGARAEIARAARLLAEKVRTGELNPGSIDEDAVARELYTGGLPDPELVIRTSGEMRVSNYLLWQIAYSEFYITETYWPDFDRWGMLRAIASFQKRDRRF
GUT_GENOME103730_0326066-293AHGCRHVAIIMDGNGRWAKKQGKIRAFGHKAGAKSVRRAVSFAANNGIDALTLYAFSSENWNRPAQEVSALMELFVWALDSEVKNLHRHNVRLRIIGDISRFNSRLQERIRKSEALTAQNTGLTLNIAANYGGRWDIVQGVRQLAQQVQEGLLRPEQIDEEMLGQQICMHELAPVDLVIRTGGEHRISNFLLWQIAYAELYFTDVLWPDFDEQDFEGALHAFANRERR
GUT_GENOME017173_0063418-223KERLPRHVAVIMDGNGRWAQKNKVSRLAGHNAGMLAMKEIIKRADVLGIKYLTVYAFSTENWKRSQEEVGGIFGLLVKYVASELKELNENNVKVAVLGDLKKIPKSAQTSIDKALSTTRENDGLHFNIALNYGSRQEIARAARRLAGRVLSGEMDLSEIDEAAVSRDLYTGEENGFIPDPDLIIRTSGEERISNFLLWQAAYSELI
GUT_GENOME118098_0108912-231SIGIIMDGNGRWAKSRGLPRTAGHKKGAQVFQDIADYCDELGVESVYFYAFSTENWKRPAEEVGAIMRLFGEYLIKGFDYENRNVRICFMGDRTVLEPKLQELMNKLERDSACKTGLTINIAINYGGRPEIVRAAQRLAAKAAAGELKPEDITEDALSAQMYTAGQKDPDFILRPSGEKRLSNFMLWQAAYSELVEMDVLWPDFTRGDLDAAITEFNRRS
GUT_GENOME203592_0122123-246ISGIKHIAIIMDGNGRWAKKRHMSRECGHSYGAKNFKKIVRYLGDIGIENVTVYAFSTENWKRPKKEVDEIMRLMREYVDQMKKEFEHEDVNIHFIGGREPFDTELRADMDYLERVTAGRKCRLCIAVNYGGRAEIVDAVNKLIASGKTEITEDDITENIYSGICPPPDIIIRTGNEYRLSNFLLWQLAYAELFFSPTLWPDFDKSEINSIIKEFHKRKRRFGG
GUT_GENOME176071_00721160-391GLKIPQHVAVILDGNGRWAKQRHMPRTYGHKVGSKVVEDMLTVVDDLGVKYFTVYAFSTENWKRSTEEVSTLMTILRTYLKDCVKKSMKNNVRCRVIGRREELSADIIESIENLEQKTKNNTGLQFTIAINYGGRDEITRAARKLALKVSNHELAPEDITEEMISKNLDTWELPDPDLLIRTSGEQRLSNYLPWQLAYTEFYFTDVHWPDFNKEELIRAFEKYNKRERRFGG
GUT_GENOME066321_0144916-245DRNTVPSHIAIIMDGNGRWAQKQHLGRVKGHVAGAKIIRHLTNIASDIGVKYLTLYCFSTENWRRPLEEVNFLMDLIRNYLINMAEDMVAEGVRLTAIGDMESLPEKVKRELYRVMEITKNSTKINLNLALNYGGRDEIIKAVNEIAADVKAGKLAPGEVDSDIFSRYLYTAAMPDPELLIRTSGEIRLSNFLPWQTAYTEFYFTEIQWPDFDDAEFYKAIITYQHRHRR
GUT_GENOME149797_0062919-260MLDPNNIPRHVAIIMDGNGRWAKRKGLPRSYGHRAGADTLKRIVVAADDLGIKVLTVYAFSTENWKRPDEEVSYIMKLMKSYLSKNLLELKEHNVQLHVIGDMKRLHPDLQDAFSQAEAEMAANTGLILNVAVNYGGRGAAAHGAAPRREITQAARKVAEAVRDGQIRPEDISEATLAQYLYTEPENDVDLLIRPGADKRVSNFLLWQMAYAEFWYTQLCWPDFSKDTLVEAILSFQGRERR
GUT_GENOME244339_006201-163MNLFDNELTRALKTKKQNEKDGICMNFIGDIDSLPKTLQTLINKTKSEKKETTRTVSTIAINYGSQQEILNATKQIHKDISNEKINVDNFEAADFNKYLYTAQMPPVDLMIRTGGEKRLSNFLLWQSAYAELYFTDVYWPDFSTDELDKAIISYANRNRRFGD
GUT_GENOME244358_005822-228DKENIPEHVAIIMDGNGRWAKARMMPRTYGHKVGMEKVIETIEWADERGIDVLTLYAFSTENWKRPESEVSFLMNLLVTYVDKELKKLHEKNAIVRVLGDMSELPEIVQKKLGESIELTKNNTGLVVNFAINYSGRAEIVHAVNGIIEDAKAGKIDKVDYDTLSNYFYTHGEEDPDLILRTSGEMRLSNFLLLQAAYSELMFIDESWPEINREILDRCMDDFKKRNR
GUT_GENOME252386_002628-235MPEHIAFIMDGNGRWATSRGLKRSAGHYAGAETLDKVTRLCYKKGVKTVTFYAFSTENWRRPAEEVAALTNILTEYLKKMIGYFEEDSDEIYTHTRVRFIGDTKVFNLIQRNMIDKITRVSEKSEKKMEMNIAVNYGGRAEIVAAVNSFIKEHNGKKITEQDICDRIYTAGQPDPDLIVRTAGEMRLSNFLIWQAAYSEFVSVPVCWPDFEEKDLDAAIEEYSHRTRK
GUT_GENOME242999_001909-234MKNIPTHIGIIMDGNGRWAKKRFLPRTVGHKAGVEAVREIVRKCASLNVKYLTLYAFSTENWSRPKDEVSALMSLIVTYLRSELEELNKEQVVIKTIGDMSALPDLPRQELEKAEEKTKDNEGLVLTLAINYGFRADLLQAINKLALENVKEVDDKKLRSYMYTSFLPDIDLLIRTSGEKRLSNFMMYEASYSELYFTDVLWPDFGPDNLVEAIIDYQSRQRRYGG
GUT_GENOME048537_0079876-304PPEAIPAHVAIIMDGNGRWAQARGLPREAGHRAGAETVRTIVTECRRLGIRHLTLYTFSSENWSRPKAEVSALFSLLLEFLGQEVPRMEREGIRLNILGDMEALPLAARTALRHGLRRTAANTDMVLNLALNYGGRAELVRAVRAMMAEGLRPGDVTEQSLADHLYTAGQPDPDFLIRTSGEQRLSNYLLYQCAYSEFYFTPTPWPDFGVEALHEALAAYAGRSRRFGK
GUT_GENOME011289_0027616-238RLQSVAVIMDGNGRWAKRRMMPRTYGHAQGAAKIEHILGIFREIGVHHMTLYAFSTENWKRPKEEVDMIMDLVYRYLDTVVIRRIREDPTFSMKFLGDKSALPEKLREKCIEVENMAKDRPFVCNVALNYGGRDEIVHAAREAVAAGEEITEESISRHLYTSHSPDPDLIIRTGGDFRLSNFLLWQSAYSEFVFTDTLFPDFDREDIMKAVHEFYRRKRRFGG
GUT_GENOME194823_014145-235NRIPQHVAIIMDGNGRWAELRGKERYEGHVAGVEPVRASLRAAARWGVKYLTLYAFSTENWGRPTHEVDALMELFCKSVVNETPELIRQGVEIRMIGDRTRFSEKVQRYLCEAEQRTAGGKTLTLILALNYSSRSEITRAVQRIAARVAAGELAPGEISEGTVSASLDTAPYPNPDLVVRTSGECRLSNFLLWQASYAELYFPEVLWPDFTEEEFDRAIEEYARRDRRFGL
GUT_GENOME215117_0035661-293LVPPRHIAIIMDGNGRWATKRGLPRPAGHKAGAETFRRIATYCKNIGVKYLTVYAFSTENWKRSEVEVSAIMALLKRYLLEAVDTMEKDHIRLHFFGDMTPIAPDLRALARETDEITEHLSKDAFQANVCLNYGGRDEILRAARRFATDCAAGKKRPEELDDALFSSYLDSAGIPDPELIIRPSGEQRLSNFLLWQCAYSEFYYTDTLWPDFDEAELDRAIAAYQQRDRRFGG
GUT_GENOME064225_012384-177CEDADALGIKYLTVYAFSTENWKRPEDEVVGLMDLFAKTMLAEVDGLHEEGVRVRTIGDLSALPAETREAFDEAWEKTRDNDGMTLVVAVNYGGRQEILRACRGCKREAVLVAAAGRSVVDALTARMKGAVGAGGPARVSLGEVSEMIDVVRLGLSAGLSFDAALEIFCANRRS
GUT_GENOME057820_008265-226SGIPVHIAFIMDGNGRWAKKRLLPRKLGHREGVKTIDRIADCVFERGIKYLTLFAFSTENWKRPADEVSGLLSLFATYLRRKIPKMVKNGIVLRVIGSARGLDEKLHKLIADAYERTKYGTRGNLTVCFDYGGRADIVAAVNKCVEEGRTVDEETFGGMLQTADLPDPDIIVRTGGEKRISNFLLYQMAYSELYFSDTLWPEFSEKELDDILAAYARTDRRF
GUT_GENOME112022_000568-226IPAHVGIIMDGNGRWAMQRKKPRSYGHKAGSDNVDKIVTYAFKNGIKVLSLYAFSCENWSRPKEEVDELMSLLEKYFKKFISKILKKNVRLSVMGDVSVLSDKLKKVISEGVEKSAANDEFVLNIGINYGGRQEIVKAVKEIVSAGEEITVENISTHLYTSSFGDPDLIIRTGGELRLSNFMLYQGAYSELYFTDVLWPDFNESEFDKAVSDFGERHRR
GUT_GENOME247680_0063133-282NMPAHVAIIMDGNGRWAKKKGLPRLAGHNAGMKAMITATRAASDWGIKYLTVYAFSTENWKRSNEEVSGIFKLLVKYVESELKELHRNNVKVNAIGDFEKIPKPAFEKLLETLETTKNNTGLTLNIAINYGSRAEMIRAARNMFSEALKKTYDLSDSENVIISELSYIKINSMVEDLITEDNFSKYLYTGDETGNIPDPDLIIRTSGEERLSNFLLWQAAYSEFAFTPVLWPDFTREEFWRIIEEYANRD
GUT_GENOME242426_0134334-263LKKVPRHVAIVMDGNGRWANARGLARTEGHRVGGQAMMDVVAGAIEIGVEELSVYAFSTENWRRSPAEISFLMGFSRELIRSNRDVLNSWNVCVNWVGRPQRLWSSVLKEIREARLLTAQNTGLRANVCINYGGRTEIVDATRRIAEEVASGKLRPADITESLIADHLYVPDMKDVDLLIRTGGELRTSNFLMWQSAYAEFYFTDVMWPDFTREDFWRACEDYANRDRRF
GUT_GENOME103718_0231624-248VPTHVAIIQDGNRRYARERGDDAPDGHRAGAATTERVLDWCADLGVAELTLYAFSTENFERPDDELVPLFDLLEDKLRDFADADRVHEQGVRVRAIGDVDRLPERVRDAVAYAERRTADNDRFTLNVALAYGGRTELLDAATAIARDVDAGDLDPSDVDVETVEDRLYDRPVRDVDLIVRTGGDERTSNFLPWHANGNEAAVYFCAPYWPEFSEVEFLRAIRTYQ
GUT_GENOME156675_0131525-230IPKHIGVIPDGNRRWAVENGLEKDQGYLHGIKPGLELFKLCQKLDIKEITYYGFTTDNTKRPSYQKKAFIDACIKSVELIANEDCELLVIGNADSPVFPEDLKKYTKRTTIGKGGIRVNFLVNYGWEWDLNLLKACDIKSKNIYDNIQSKDISRIDLLIRWGGRRRLSGFLPVQSVYSDIYIIDDLWPNFNKDHFFNALEWYKDQD
GUT_GENOME199712_00616214-437DGRLRHIAFIMDGNGRWAKKRGMPREYGHKRGAENFRRIGRYCESIGLQYMTVYAFSTENWKRPEHEVKALLGLLDDYIEEFFREIDKEKIHLCFVGNLSAFPDSLREKMERLDRETADRAFRLNIAVNYGGREEIAHACEALIREGKQTVTEDDISAHLYTAGCPDPDLIVRTGGDLRSSNFLLWQSAYAEYYFTDTLWPDYRERDVDEAIRSFLGRKRRFGG
GUT_GENOME009037_0128712-233SGLLPKHIAVIMDGNGRWAKKRLMPRSFGHQQGMNRMIGLLEHAFDVGIDYVTVYALSTENLKRPKEELDGLFNLLRKHFVDCMRRICARGVRLKILGDVSLLPQDVQGLLKKAEEDSVMYLGRGVNVALGYGSRAEIVRAAKLAAERGKTLTEENFSDFLYTAGQPDPDLVIRTGKEQRLSNFLLFQSAYAELYFSDKMFPEFSDKDLDEAIAEYGRRTRR
GUT_GENOME096560_005375-194RNIIEDCHKMGLKHLTLYAFSSENWKRPGGEVDYLIKDLPSEIFKSEILKVYHKKNIKINFIGDINSLPPQTVELLKVAEKKTSKNNGMHVHFALNYGGRKDILQAAKQILINESKETILNISEETFTNYLYTKGVEDPELIIRTGGDLRISNFLLWQISSSEIWFTKKYWPSFNAKLLMKAMEEYDIRK
GUT_GENOME054365_000024-212KHIGFIMDGNGRWAVAHGMSRQEGYAHGLVALYKVAKRCSERGVEAVSVYALSTENLNRPQGELDSIFKVVEKFNLTYDGEYKISYMGDFDSLDDKLAYSIEQVEEKTRDNKGMWLNIAFNYGAKADILHAAKVAYDHGEFLEDTFEKHLSSSHLPPLDLIVRSGGEMRLSNFMLYEAAYAELMFLDKLWPDIDESDVDKILDDFDKRV
GUT_GENOME283282_010939-235GGLPEHVAIIMDGNGRWAKQRGLARSQGHKEGLNSAKRIVAQAASLGIKYITLYTFSTENWKRTQEEVGYLMTLIKGHLRAEFQFYKDNGIKIEHIGDLSGLPKDVQKEILNAKKDTEHFTGTTCVLAINYGSRDEIVRGIKKLVENNVKTEDITEKLISDSLDIKDLPDVDLMIRTGGEERLSNFLLWQCAYAEFLFTDTLWPDYTENEFIEDIIKFQKRNRRFGA
GUT_GENOME074383_0091114-191LPDHVAFIMDGNGRWAKKHGLPRKAGHEEGVRALKKVVKCLSDYGIKAATFFAFSTENWSRPKDEVDALFSLVEKFSESAGRFCMKNNLKIRFLGFEDGLSESLVQKIRTVEAETAGNSGMIFAIALNYGATEEIVRAAKGAALSGNITKDSFEKHLLTKDLPPLDLLVRTGRKTALK
GUT_GENOME012640_0105720-240VNTFPYHVAIIMDGNGRWASASGKPRSYGHKIGAENVERIVSHAFERGVKALSLYAFSSENWSRPEKEVDKIFSLIVSFLKKYIKKAEDNGVKVVFSGDLTALPKKLLSEAGEVINGTKCNDKFVLNIALNYGARAEIARAASLAAANGEPVTPETVSKYLYTADIGDPELIIRTGGELRLSNFMLFQAAYSELYFTKVMWPDFNEEEFDKALEDFSKRKR
GUT_GENOME133816_009696-233HLGLIIDGNRRWAKLHNADTATGYKAGLDKLVEAVNALSGKGVKCISVYAFSTENEKRPAEEKAQIMSLVDYFIENHPSGVKLCFIGDFDFLPKTLKEKILSYKCADNSYKLQMNIMLGYGARHDMVTAAKKLNNQARELFENEMANGQSYDRALKLSEEIFTEDNFLNNLSTSNLPPLDLIIRYGKANRLSNFMLYEAAYSEIYFVDKLWPDASALEIAEIVDGFKL
GUT_GENOME117585_006988-232IPTHVGIIVDGNGRWAQARGLSRSEGHRAGFNRLKEICIYAGDLGVKYLSLYVFSTENFKRAKPEVDFLMNLFVKMFHKEFKEVMDRNFKIVFSGRRDPLPKKVLDEMDQITEESKNNTGTVLNFCLNYGGQTEMLDMTKKICQEVIDGKLNIEDISLDTLQHHLYQDLPPLDFVIRTSGELRLSNFMLYQASYAEFYFPKIYFPDFTEHDFDDAIVEYNNRNRR
GUT_GENOME071233_0082812-237DPNNIPKHLAFIMDGNGRWATKQGLPRTAGHRAGVDALTRVAEASRDFGVQYVTVYAFSTENYNRSEKECNYIFELTRRFAKSKLKTFLKNDTRFVIFGDLDYSDKLDDETKEALLNLQEATKNCKSHTLNMCFSYGGKHEIVQAVNKLIKAGEREITAEKIAKNLYSAGMPDPDMIIRASGEQRLSNFLMWQSAYSELYFCDTYWPDFNKETVKECIIEYQKRNR
GUT_GENOME183596_0017612-239SKKMPTHIAFILDGNGRWAKNKGLPRTAGHHKGANTLQEIVKACSNLGIKYCTAYVFSTENWSRPESEISFIMKEIKKICKDYQKFVKLNIKVKIIGVRDYLTKDIIEMLDLVTQKTSNCTGMTLLLAFNYGARREMIDCVKEIVNKINNNEIKIDDLNEKIIEDNLYTKDIPSVDFLIRTSGEIRVSNYLLWQIAYAEMYFTKVYWPDFHTKELYEAIYEYQNRHRR
GUT_GENOME285273_005544-233LKIPNHVAIIMDGNGRWATNRGLKRSEGHKEGSKTLEKVAIHAINKGVKVLSVYAFSTDNFKRTKEEVDYLMNLFILMFKTKFKVIDKENIKVIFSGRREPLSKDVLDAMDSIVKKTENNTKGILNICLNYGGQEEIIDGTKRIIDDINNGKISKDDITKEKFYEYLYKDLPPIDLLIRTSGEYRVSNFMLYQMSYSEFYFTNTLFPDFDEKEFDKAIESFNHRDRRFGK
GUT_GENOME237518_0047510-237ENVKIPTHVGVIMDGNGRWAKKRFMPRKYGHREGAKTFKKISRHARDIGISYMTYYAFSTENWRRPQDEVNSIMKLFENYLDDVSSFTSENIRLRFIGDRTKLSPVLQEKMHNAEELSKDFDSMTVVLAVNYGGHEEITHAVRSIARDVKSGKLDTESIDEQLVQSRLYTEEIPPVDLIIRPSGEQRLSNFLIWQSAYAEYYFTDILWPDFTNKDLDAAVLEYSERNR
GUT_GENOME109396_0003212-236KIPNHLALILDGNGRWALKRGLKRYEGHQKGIETLGEIARAAFNKGIKYLSAYVFSTDNFKRSSIEVNFLMSQAKNYFKKYLDTPQEKEEYTLHIIGELTNLDDELKLLIEKVNNKNSKNSYHLILAFNYGSQEEIVNVTKKIAYQVKEGNLDVNDITKDIISKNLYTNAFPPVDFLIRTSGELRLSNFMLWQLSYSELYFTDVLWPDFNSLELEKALIDFEKRE
GUT_GENOME043946_0092824-252NVMHKLPQHLAVIMDGNGRWAQRRGLPRSEGHARGVDVVRSLLTRCRALGIPYVTVYALSRENLKRPPQEVRFLFDLFIRFLKSELPELERQGIRLGMIGDRNALPLPVRTALDYGIKKTSGGDMLFTLALAYSGREEIMRAARTLAASLPNGDTSPEEQEAVFRRGLYNPELPDPDLIIRTGGELRLSNFLLFQSAYAELYFSDRLWPDFDDAELNRALAHYAGRTRR
GUT_GENOME255497_0088122-254DQSRIPRHVAVIMDGNGRWAKQRAMNRLNGHKAGINAVRETIRCANDVGVDYLTIYSFSTENWKRPQDEVNGLMNLFAKTMLAELDGLHEEGVRVATIGDTSALPEKTARAFSDAWEKTKNNTGMTLVIAVNYGSRAEILEAAQRSIDGALAAVAEGQQPELLTETAFESRLYTADIPDPDLLIRTSGEMRISNFLLWQIAYTEFYFTDVLWPDFDRYEFLRALLYYQHRDRR
GUT_GENOME253122_0054342-264KEAMPRHIAFIMDGNGRWAKKRGLPREYGHKEGAAVFRRLTEYCGSIGINTVTVYAFSTENWKRPEREISAIMKLLRDYLNDALREMEKNRIRFCFLGDRSRFDGETRRLMQEAEERSVGYTLRLNLAINYGGRDEIVHAVNACLREGITEISEDDISRHLYTKDSPEPDLIVRTGGEQRLSNFLLWQSEYAELYFSPILWPDFSERDVDAAVADFCRRNRRY
GUT_GENOME096381_01017631-857HLRPRHVAIIMDGNGRWAVRRGLPRTAGHDAGQRAVRETVYGALEIGLPCLSLYSLSTENRERPAAEIEAILRLFQDGLDTETEEVWRRDVRLRWSGVPEGLPPDLVRSLRRTEHLTRDRTGLTLNMCVNYGGRDEIAHAARALARQVADGALSPDAITPRHVAAQLHLPDLPDVDLLIRTSGEQRTSNFLPWQSTYAELVFLDTLWPDVDRTHLWQAVAAYARRDR
GUT_GENOME009619_005503-230IRHIAFIMDGNGRWAKARGLPRTMGHKAGIKRLKEIITFSLFNCGIFCCSFFLFSCENFNRPQDEINFLFSYFEEYLKNDIDYFLKNKIAVKVVGDLNDKRIPDSLKNTIALTLKKTEKFENNKIVNLMFNYGGKQEIIHAINEIITINNTRKNTLLINENNIKNFMYTNKLPDVDLLIRTSGEKRISNFMLYYLAYTEMFFIDTYWPDFTKDDLEKIIDDFYKRNRR
GUT_GENOME211297_000995-218QHIAFIMDGNGRWAKLRGKPRNYGHREGLKTVEKVLNWTMSKEIPYISLYVFSTENWKRSATEVNGLFALAERYLSDFKKFCKDRIRVVVSGRKDGLPDKLVKCIEDVTAITANCNKLTCNLCINYGGRQEIADAVAQLAQTGDFSLSALQKHMYNDLPDPDIIVRTGGHKRLSNFMLFQCAYSELYFTDTLWPDLSLEEYNKIVDEYNSTVRN
GUT_GENOME136346_0139413-241LSPLPKHVGIIMDGNGRYATRRGLPRSMGHRAGTERLRGIIKLSSDLGIEALSLYAFSTENWKRPRIEVDTLFAIFMEYFTKEVEDLHKNNVAIRAMGNIAALPEKVYAQCMAAMEKTKNNTGLKLNIALNYGSRAETVNAVKSIALDAKEGRLNIEDIDEECVMSRLYTRGLSDVDLVIRTGGEQRLSNFMLLQSAYAEFVFTDTLWPDFSDECYIAALGEFAHRNRR
GUT_GENOME273588_013435-232DNNIPKHVAIILDGNGRWAERQGKSRSEGHKAGLDRLKSLSEYIINKGIKVLSVFAFSTENFKRDKKEVDYLMNLFSNGLKSSIKFFGDRNIKVIVSGRKDNLPKKVINTINTLENKTKDNTLGILNICLNYGGKAEIIDASKEITKDVIAGKISVDDINEELFKRYLYNNLIDVDLLIRTGGELRISNFMIYESAYAELYFTDTYFPDFDELEFDKALDSFNKRDRR
GUT_GENOME072384_03750468-640GLQECAQELKRIIHARYYNPAAGLYAIDLKKRYYSEHPQVLALLVDPYAPVIPGLIDKTDTIAARLPGCFQANVCLNYGGRDEIVRAVNRFAAEHPGETITEAAMAEYLDTAGIPDPDLIIRTGGEYRISNFLMWESAYSELYFTDVLWPDFSEKDIDEAVAEYRHRDRRFGG
GUT_GENOME112053_002464-223NKIPRHIGFIIDGNGRGAKKRLLPRKLGHNQGVKAVKKTINACLELGINYCSFFVFSTENFKRSKEEIDNIFELLDNYIKSDLQEFDNKNLKLVISGDLSKLPNHLQESLVHCMDKTKNNTKMIVNMCLNYGGRQDIVYACNKAINMGLKNVDEKTFKELLFTKDLPDLDFVIRTSGEERLSNFMLYDLAYAELYFTKTLWPDFNKKALIKALKNYNKRD
GUT_GENOME243377_007976-242TNNLPRHIAIIMDGNGRWAKHRGKPRLFGHRAGATSMKKIVIATAKMGIEYLTVYAFSTENWKRSTEEVSGLFKLIIQYVASELNELIDANVHINVIGDYKKLPDASVKALDKMIDRTKDNDGLKFNIAINYGGRDDITRAVNQIIREREINNLSFKTGKTEVTEDDISSHLYTGKMNFDIPDPDLLIRTSGEQRLSNFLIWQSSYSELIFTKTLWPDFDEEEYRSLIEAYSNRDRR
GUT_GENOME112773_0120329-255MPKHIGIIMDGNRRYAREFLGDDINAGHKAGEKKIHELLDWCLDLDIKYVTVYAFSSENFSRDEDEVNFLMEMAEGSLREIVEDPRIITNRVRVRVLGDRSKLPDYVCEAIDYADEKTKDYDDFMFSICLAYGGRQEIVNAVKEIARKVQDGDVLPDEITEDMLSKHLYTSDMPDPDLILRTSGEVRISNFLLWQLAYSELYFTDVYWPGFRHIDLLRAIRTYQQRV
GUT_GENOME029114_011734-228INHIGIIVDGNGRWAQEKGKIRSEGHKAGADALEKIILYTSKNKIANYLSLYVFSTENFNRSSEEVNYLMELFMKWFKKAKSKYDSENIKVLFSSQKSFLKPEIVDAINELEEATKNNTGLVVNFCLSYGGRQEIVDATKKIASDVKEGYLKIEDINEKMYSEYLYNDLPDVDFLIRTSGENRISNFMLWQISYAEFYFPKVYFPDFTPKCLEEAIEVYKTRDRR
GUT_GENOME159423_00906194-425PVQVPAHIGIIMDGNGRWAKKRGLPRTAGHTVGAQNFRTITRYASSIGVKYLTLYAFSTENWSRPAEEVSALMKLFHQYLEEALRDFMDENIRVRFIGDISAFAPDLQALIHRVEEASSVKTGMVLNLAMNYGGRAEITRAARILAERVKNGELSPEDITEETLSGAMYTAGQPDPDLIIRPSGEERISNFLLWQSAYTEFEYFDILWPDFKPKHLDEAIEKFNSIQRRFGG
GUT_GENOME134621_0126211-253LDMNRIPAHIAFIMDGNGRWANKRKMPRTYGHHEGTKTIRNLALEANNLGVKAMTVYAFSTENFKREQSEVEYIFKLPKEFFSLYFQELMDNNVRIMTIGHLELAPQETQDIINDAITKTKDNTGLKLVFAFIYGGRDEIVEATKAICEEYKVPLIEDAAESKEGKIDLDNITEDYFESHLMTKELPPVDLMIRSSGECRLSNFLLWQLSYAEFIFDEAYWPDFNAEKLHECIWKYQNRDRRY
GUT_GENOME243511_0073720-248FSGKLPEHIAIVMDGNGRWANERGLARTEGHRAGELALMDVIAGAIESGISVLSVYAFSTENWKRSPREVSFLMNYSRQVIHRRCAELKSWGVKIVWSGREARLWKSVRQELENAARETENNQGLILNFCMNYGAQAEIADAMRQIGAEIEQGQLRAKDIKESLISAHLYQDLPNVDLFIRSGGEQRISNFLLWQSAYAELMFVPELWPEFDRCTLWRCLAEYGKRERR
GUT_GENOME096480_0249330-261SAHVPAHVAIIMDGNGRWAKERSLPRIMGHRQGMQTVKRIVRKADAMGIKILTLYAFSTENWKRPKDEVDYLMSLPQEYLATELDELIQRNVQIRMVGKEERIPSSTLEAIVTAREKTKENTGLILNFALNYGGRNEIIDAVQQIATLIQDKQLTIDEITEENFSHFLYTANLPDPDLLIRTSGELRISNFMLWQLAYTELWFTDCYWPDFTEDDLEEAVRAYQGRSRRFGA
GUT_GENOME003220_002186-222VPNHIGIIMDGNRRWAKEKKKKTIEGHLAGANRIISLAKYIFDKGVKYLSIYAFSTENFNRSAEEVSYLMGLIIKFFNERVNELHDYNIKIVVSGLRDNLSKEVLKCIDNVVDLTKDNTGGVLNVCLNYGGRREIVDAVNKIKEANVNITEETFGKYLYNDLPDLDYVIRTSGEERISNFMLWQISYAEFYFPKVYFPDFDEKEFDKALEIYNNRNR
GUT_GENOME109374_006669-225AVPVHVGIIMDGNGRWATKRGKPRSFGHAEGVKTMEKVIGYADEAGIKVLSLYVFSTENINRPEEEVSGLFSLAEKYFSRTKELVRRNYRVTVSGSVTKLPYDLVIKFREAERLTSCNTGLTVNFCFNYGGRDEIVHAFKRMAAEGLRDIDEEKVSEYMYSRLPDPDFIIRTGGMKRLSNFLIYQAAYAELYFTDVLWPDFSREDFFAAIEDYGKRK
GUT_GENOME199785_0148425-258LDMQNIPQHIAIIMDGNGRWAKAQGKVRTFGHQAGAETLKTIVRAADKLGVKVISAYAFSTENWKRPVTEVNFIMELLSRYLTNEIDEFNENNVQVRFIGSRKGLPEIVQQKMEHAIEATKNNTGIILNLAINYGGQAEILHAVRTIAAEAANGTLAVDDIDNNVIENHLYTKGLPAPDLLIRPGGDLRISNFLLWQIAYAEIWTTKVFWPEFTPDHLVEAILAYQGRERRFGG
GUT_GENOME128859_001462-211KHLAIIADGNRRWAASQNLPKEAGHAQGLNVIERCCEWAISRDVKMLTVYCFSTENWGRESGEVDQIMDLARWYFRERREWFTSRGIRVRFAGRRDRLAPDLVKSMGVMEDETRGCDALTLTICADYGGHDAIARAVQSGAKTVHEIDEALTAEIPTPDAILRTGGEMRLSNFLLWQAAYAELFFSPTMFPALEDSELDALLEEYSTRTR
GUT_GENOME017712_007547-224LPRHIAIIMDGNGRWAKQKNLPRSAGHNAGAKAVERTIRAAEKLGIEFLTFYAFSTENWSRPKEEINGLMNLLEKTLDKYMREAKTNNLRILISGRREPLPPHLLAKIDQLTAETAHKTGLTVVLALNYGSRAELLDAVQKLVQDGIKNPTQADLQARLYQPAVPDPELLIRTSGEKRLSNFLLWQCAYTEFYFTDTLWPDFSEKDLSAAVEDFSRRT
GUT_GENOME057036_0156225-256IKNAPAHVAIIMDGNGRWAKERGKPRIFGHKEGANRIREVMDAAREAGVKFLTLYAFSSENWNRPQDEVEALMNLLVAAINEYGGGLVKNKIRFRTIGDISALPQKCQDAVADLSKKTENFSESTLVLALNYGSRDEIAMAARKIAEMVLKGDLDPSSINWEKISGQLYTSDIPDPDLIIRTSGEQRLSNFLMLQAAYSELYFTDIYWPEFGRKEFLKAVEEFKLRERRYGK
GUT_GENOME009216_0037512-235VLNKLPNHIAFIIDGNGRWAKKRGLPRTAGHRQGVFAVKDTIQNCYNYGIKQVSFFCFSTENWNRPQDEIDALFGMLRDFIHDDIVPYKEQGIRFLISGDITKLPQDLQQAIQNAVNETQDCNKMLVNLCINYGGRLDILRAVNSLLKQGKTEVNYLDFEKELYTKNLSELDLVVRTSGEQRISNFMLYQMAYAELYFTKTYWPDFDKKALDEALLDFQSRNRR
GUT_GENOME111314_0069618-244ELLPRHIAIIMDGNGRWAKKRLMPRSVGHRAGVEQVKRVITMSSDIGLSALTLYAFSTENWKRPKDEVGTLMSLLLEYLKREIGELHRNNVKLCTLGDIENLPREVYEAIVSAQQKTKDNTGLVVNIAVNYGARAEMVRCTKKIAQLAGEGKLKPEEIDEQLISSYLYTSHVPDPDLIIRTSGELRISNFLLYQMAYSELYFTDALWPDFDEQEYLKALLDYKNRKR
GUT_GENOME103718_0017420-232PEHVAIVITERDLLVDGAFDTLSAALGWAFEYGAARVTVSVSVLDAAVVPTLVRELRQVDAPRRLVVRGPENAAIGGGEGAVDSESDEEDGDESAPGRRARDAEPGDAPIRITVGLGGKAEFATAVRELAADVDAGDLDPESIDAADVSDRLVFPEEPDLVIKTGAERLSDFAIWQSVYAELYFTDVNWRDFRKRDYLRAVLDFQDRQRRFGR
GUT_GENOME061470_0163319-250IAHGSIPRHVGVILDGNRRWARSIGTTASHGHRAGAGKIAEFLEWAEEVGVEIVTLWMLSTDNLERDQHEIDELFDIIGQAVRALAAQRRWKIGVVGDLSLLSEDLADELRSAQDATKDMDGLSVNIAVGYGGRHEIADAVRSLLRDAAAQGRTLEEVADSLTDEDITNHLYTKGQPDPDLVIRTSGEQRLSGFMIWQSVHTELYFCEAYWPDFRRVDFLRALRSYAQRERR
GUT_GENOME207750_0272224-255LKMRQIPEHVAIIMDGNGRWAKKRTLPRIAGHHEGMKCVKRITTSANDLGIKVLTLYAFSTENWKRPKSEVDYIMSLPEQFLGTFLPELIEKNVRVTAIGYTEGLPESTLNSLNHAVEQTKYNTGLQLNFALNYGSRAEIIDAVKNVVKDSETGKIEIDNFSEEMFSTYLMTGKIIEPDLLIRTSGEVRISNFMLWQIAYSELYFTDVLWPDFSELHLIEAIEEFQNRQRRF
GUT_GENOME236132_0044610-225NIPSHIAFIMDGNRRWAKKRGLYKLLGHKRGAKALKDIVKVLLDLKQIKYASFFGFSTENWNREKKEVDYLFDIAYNMLVENEAELIKDNIKFSVMGDVSPLPENLRNLILKVQDETKDNSALCLYIGINYGGRDDIVQAVKKLDHDQISVENIKKNLYCPYEIDLLIRTSGEQRVSNFMLYQMAYSEFYFAKCDWPDFDKKQLYKAIKSFSKRHR
GUT_GENOME111393_0098115-240LPKHIGIILDGNGRWATKRMLPRNLGHKKGVQTLKEIVKEVKNLGIPNLTVFAFSTENWKRPKDEVDYIMDQLKKYYQTSLTKLIENQIKVKFIGTKKNLSSDLLMIIEDIEYQTKNFKEFTLSIAFNYGSREEITEAVKKITTKVINHEIMVNDINEELISKNLYTAELYPLDLIIRTSGEMRLSNFLLYQAAYAELYFPKTLWPDFHKKELYLAIKEYQSRNRR
GUT_GENOME013038_0078811-242NIPEHVSIIMDGNGRWARQRGLERVSGHYNGVESVRACVEAANEAGVRYLSLFAFSEENWNRPESEVSSLMSLMLKSIADEIDSLAAKGVSFRVIGNLSRLDDNLVEAIKKAERLTAPASGKEPLLTLVVFLSYSGKWDILQAAKRMAKEYADDPAGLESVGIEDFDRYLATAGIPDPDLIIRTSGEERISNYLLWQAAYSEFLFVDTLWPDFRREDFRAALEEYSHRNRRY
GUT_GENOME237422_0104712-244SEKMNIPEHVTIIMDGNGRWAKQRGQERLFGHKEGVESVRACTEMAVEKGIRYLSVFAFSEENWDRPVEEIEGLMQLMLKAITMETPTFQKNGVRFRVIGDFSRLSQKLRAEIDDCMKLTENNSNLQLIIFLSYSGKWDIVQAANKFIKENSHKLAEGEMPQMSAAELAGNLSTAGIPDPDLLIRTSGEQRISNYMLWQTAYTEYYFTDVLWPDFRKTEFQHALDVYSKRERR
GUT_GENOME141494_0180724-258GPIPQHIAIIMDGNGRWAQNRRLPRVAGHKEGMETVKKVTKKASRLGVKVLTLYAFSTENWKRPKDEVSFLMQLPVDFFDTFVPELIKENVKVHVMGYENVLPEHTQDAVRRAIEQTKNNTGMVLNFALNYGSRAEIVTAVQEIAEEVAKGEIHAEEIDDELIAKHLMTGFLPKELQDPELMIRTSGEERISNFLLWQIAYSELYFTKALWPDFDGAHLEEAIASYQNRDRRFGG
GUT_GENOME249905_0143010-233THAPRHVAIIMDGNGRWAKARHLPRAAGHKKGVDALNRVVETAARAGVGNLTVFAFSSENWKRPQAEVDTLMRLFAEGLLLWEKPLTDAGIRLRVIGERSAFPEDVQSAITHAEAATAAGTGMVLNIAANYGGRWDMTEAAKKCVAEGVALTPENIAGRVALADAGEVDLLIRTGGEQRISNFLLWQSAYAEIYFTDELWPDFGGEGLLEAFAWFAGRERRFGM
GUT_GENOME053900_0030811-237KHIAFIMDGNGRWAKEKGLPRSSGHYAGVRRVKPIAEEVFFKYHISTMSLFVFSTENWNRPKEEIDYLFVLLKRFFSSFLKYFGKRGVRLCVSGDLKDPRVPQDILDSIQNALEKTKNNKDYNFNVLFNYGGQKEIVAAAYNIAKKVKEGTLDLEKIDYNIFYENLYQKDLPPVDLLIRTSGEERISNCMIYELAYAEFVFEPTYWPSYTPKILEKNLEEFLKRNRR
GUT_GENOME285651_0083616-240AAPRHVAIIMDGNGRWAERRFMPRVEGHRRGVQTVRRVIEAAATAGIKYLTLFAFSSENWRRPADEVSALMRLFATALKREAEAMREHGVRLRVAGDLTAFSDEIRASIAASEALTEHCDKMVLTICANYGGRWDVVEAFRKVLKKHPDVVNDPSLITEAMISDNLAFDWAPEVDLMIRTGGEQRISNFILWQAAYAELFVSHALWPDFDAEDLQAALQWYQGRE
GUT_GENOME188928_01789277-459ARMKERGVFIAHCPDSNTNLSSGVAPVRRYLEEALRDMEKNRVKFRFFGDLTRLSPELQTLCHDAESRSEEYDVQVNFCLNYGGRDEIIHAVKAYTADVAAGRADPEELTEDVFSGYLYSAGVPDPELIIRPSGEMRVSNFLLWQSAYSEYVIMNVLWPDFTPEHLDSAIEEFHRRSRRFGGT
GUT_GENOME234136_00146256-477LRHLPEHIAIIMDGNGRWATERGKRRGEGHIAGAKTLGEVLKWCRARDIRYLTVYAFSTENWKRPKTEVAGLMRLFARTLRSKADEFVKNEVRFRMIGRRGDLPAKVLSEIERLEKLTRAFAREFIVAISYGGRAEIVDAVNAAIATGEKVTEETFRNYLYAPDVPDPDLIIRTSGEIRTSNFLLWESAYSEYHFTDVLWPDFSEKELDRALESYAARHRRR
GUT_GENOME040646_000481-182MDGNGRWAAKRGLPRPAGHKAGADNLRKIMDACRNRGIKYATFFAFSTENWKRPKEEVDALMRLFYDYLDEADRFTGKKARIMFLGSKEPFPEKLRERMIKLEQNSAGYDSMTIMFAMNYGGRDDIVYAARRLAEAAADGDIKPSDIDEKLFSGMLYTGSAPDADLIIRTSGELRLCGRRRM
GUT_GENOME033881_017713-173IPNHVAIILDGNGRWAKAKGMPRNYGHVQGAKTVEQICEDAWNLGIHYLTVYAFSTENWNRPKDEVSALMTLLRNYMKSCLKRATQNNMCVRVIGDKTGLDDDIRKRIAELEEATKDNTGLHFQIAINYGGRDEIRRAVTNLAEQVQAGTLDFARINTSALSSTADEVGVF
GUT_GENOME058442_00481242-468KAANLQHIAIIMDGNRRWAKEKNLPSAFGHKKGVDALKAAMRACDDFGVKYLTVYAFSTENWNRKKEEVDFLMNLLGETIKNELKEMHENGVVINFIGDLTKLSSKLQEILAHAVEVTKNNTGVHLQIAFNYGSRDEIVHAAKLIAEKVKNGNIRPEEITEEMISQNLYTKDIPDPDLLIRTSGELRLSNYLLWQIAYSEFLVTKRYWPEFDKNALSEAIIEFNHRQ
GUT_GENOME127598_0045834-262HTALPRHIAIIMDGNGRWATRQGLPRTAGHKAGVEALREIIRECDHIGIEALSIYAFSTENWKRSAEEVGALMGLLLAYFSSEIDELDEKNVCIRILGDVDGLPQPQRDAVNAAMARMKDNAGLKLNIALNYGGRDELLRAAKVLAARAAAGEIRPEDIARSDLENALYTHGLPDVDLLIRTSGEMRISNFLLYQIAYAEFVVTDTLWPDFDVRALHEAIAAYQKRDRR
GUT_GENOME046070_0239736-262KIPRHVAIIMDGNGRWAQKRGLPRPVGHRAGVEALREIVRMSSNVGVEVLTVYAFSTENWSRPQDEVGAIFRLMLEYLNREVAALVENEVRIRIIGRRDNLNETLLRAIDKAETASAHCRGLTFNVALNYGGRAELVDAARALARAVQRGELSPEAIDEQTISQYLYTAGQPDPDLLIRTGGDQRLSNFLLYQSSYAELYIPDTLWPDFTPEAYDQALRSYAARQRR
GUT_GENOME013431_0170232-255AAQTAKIPRHIGIIMDGNGRWATARGKKRAYGHEAGAKNIETVVDCLFEKGVENVSLYAFSTENFSRTKDEVDELMSLLKKGLKKYGNYAVNKKVRLIVSGDMSLLNTGLSREIAKEVQKTAKFGNRTLNICIAYGGRQEICRAAETLRQKGEPITAEALEKQLYTADLQPLDFIIRTGGEYRLSNFLLWQSAYAELYFTPVLWPDFNQEEVERALSAFSSRSR
GUT_GENOME143153_003565-252KTLKHLAIIMDGNGRWAQMQGKSRQVGHRKGAQNVRAITDWCAKHDISYLTLYAFSTENWNRPKKEVDFLMKLLEKYLRDEASTYHKNKIRFRAIGNIATFSQPLKSMILELESQTAHYTNLTQALALNYGGRNEIARAYAKIVGECVGGVDSGAESKNADFALSVVKMLDSDRDFKQMEALVQRHLDTADMPDVDLLVRTGGEQRLSNFLLWQASYAELAFSKTLWPDFGSDELAEIIEGFYQRQRR
GUT_GENOME111711_0022220-242IDYENLPAHIGFIMDGNRRWARKRGLPTLAGHREGAEALKRIVKRASEIGIVNLSFFCFSIENFKRSKEEVDYLFKLINEILDLVPDLIKIGYKFHHSGDKSLLPKETQEQIEKIEEKTKDCNKGVLNLLIAYGGRHDIVCATKKIIEQNVKAEDITEETFKNFLTTSPLPDLDLLVRTSGEHRISGFMLYDMAYAEIYFVNKHWPDFKGSDLDDCVIEFQKR
GUT_GENOME278116_0122811-248DMKRIPQHIAIIMDGNGRWAEARGEKRTYGHQAGVDTVRHITAECARLGVKYLTLYTFSMENWNRPTSEIQALMGLVLSSLKDDIFMNYNVRFQVIGDVKRLPAEVQDKLNETIETTAANSGMTMVVALSYGSRWEMTKAVKDIVRDLQKKGLDKYSDQDLDQLITEDTVCSHLETRFMPDPDLLIRTGGELRVSNFLLWQIAYTELYFCDTYWPDFREQNLYKAILSYQKRQRRFGK
GUT_GENOME086973_0091620-270DPATVPHHVGVILDGNRRWAKSMGFGAAQGHKRGADKIEEFLGWAEQMGVQVVTLWLLSTDNLSRDPAELSPLLDIIAHAVDELAETGRWSLRLVGAVDLLPEPLAERLRAAVVRSRPTSAEASGADGGAGEAVETAEAEDRRMQVNIAVGYGGRQEIADAVRELLREQAAAGASLEEVAASLSEEDITEHLYTKGQPDPDLVIRTSGEQRLSGFLLWQSVHSEYYFCEVNWPAFRRVDFLRALRDFASRE
GUT_GENOME041030_0026219-244DEKNVPQHVAIIMDGNGRWAKQRNMPRTMGHKAGVETIRRIVKEADRLGIKYITLYAFSTENWKRPKDEVNALMKLLVQYLKSEVSELHKNGVILRVLGDISALQEDCKKEIEDSIELTKNNTGLVLNVAFNYGGRDEIIRAVQNIVTDVESGKINKADITKERFSDYLYTNNCPDPDLIIRPSGEQRISNFLLWQCAYSEFWYSNVNWPDFSEKDLQKAIGRSYP
GUT_GENOME256537_0174110-234MPNHIGIIMDGNGRYAERRGLPRTMGHKAGGENLKTITRFAKDIGVKNLTVYAFSTENWKRPTAEVSGIMKLLGFYLGDWRHQLGDDSLKIRVIGDTSEFSPSLQKKIEVIERETANEEGLNLNIALGYGGRDEIVRAARALASLCEKGEMKSSDITEEEMSKHMYTNYVQDPELIIRTGGEIRTSNFLLWQSAYSEYYFTDVLWPEFTGDDMLKAIASFQHRDR
GUT_GENOME051219_002809-179INMQALPEHVAIIMDGNRRWAKKNNLSTPQGHKEGAENLKRIAKFANKIGIKHLTVYAFSTENWKRSQEEVGAIMKLLKFYLLDFFNWSDENIKINVLGRIAELPKDLKDQIHKIEEKTKNNTGLVLNICFNYGGRDEIVTATKNIVQKVLDGELKIEIDGTLQQKYDMIT
GUT_GENOME139736_0063016-241PVRVPQHIAFICDGNRRWAEARGLPPLMGHKAGIANFENLVDWYIARGVSTVTFFIFSTENWNRSKEEVDYLMDLFYTELKKNMKHALEKNLRYRVVGSRDRLPKRLAHMCDKLEETSAENTGGTVVFALNYGGQGEIIDAVNAAVAAGEPVTRETFETFLDTGDLLPIDLMVRTSNEFRISNFLLWKLAYAELMFVPEHWPDLVKDERMWQRTLDEYARRDRRFG
GUT_GENOME053282_011407-232IPYHVGIIVDGNGRWAEARGLSRSKGHEAGFQNLQKLTAYMYKKGVKYISAYLFSTENFKRSDSEVSFIMNNLLVDRLKDILAFCHEEKMQAFFSGRHNKLSDKVLKAMNKIEEETKDYQGRVFNICFNYGSHAEIIDATKKIVKDVSENKLNIDDLNEEVFSTYLYHNLPPVDLLIRTSGEERISNFMLWQCSYAEFYFPKTLFPDFKEKDFDEALTIYTKRDRR
GUT_GENOME096389_0206025-291PKNQIPQHIGVVVDGNRRWAKLAGTPTADGHLAGANKIVEFIEWCAELSIPTVTLYMLSTDNMNRSAAELEQLTEIIADTLDRLAESRPGGRPVRVHPVGQPELLPESLAARLRSLSGSGAPSTHDDDAGETVSTATGSIRRIAAKAVGRSSAAEDQPCVHVNVAVGYGGRQEIVDAVKDLLRDAESQGRTLSEVADELSPGSISRWLYTRGQPDPDLIIRTSGEQRLSGFLMWQSAYSEFYFCEALWPDFRRVDFIRALRDYAQRQ
GUT_GENOME238203_0101110-233HVAVILDGNGRWAKERGLERVEGHIAGADNVVNFMSWMERYPEIRYTTLYAFSTENWKRSQEEVNALMELLCRFLDEKLPILKERRTRLLASGDLSGLPLVCQEKLEHVRCETSVDYERTLILALNYGSRNELVHAMRKIGEKVRDGQLNVEDITEQTVCEHLYLPEIPDPDLLIRTSGECRLSNFLLWQLSYSEFYFTDTYWPDFGIEDFDKAVQSYYHRERR
GUT_GENOME007515_002685-221HWAFIMDGNGRWAKLHGFPRKSGHVEGVKTVEKVVDFCLNCGEVGVVSLYVFSTENWDRPAAEVKGLLSLAEKYVSRVKTFLEKDVRVVFSSSYDKLPVNLVKSMRECEEKTKNCKKMVLNLCFNYGGREEILHTAKVLAEKGELKDADEEVFLKNMYNDLPCPDLILRTGGQMRLSNFLLFQSAYTELFFSDTLWPDLTEDELNGVLDAFKHRVRN
GUT_GENOME260315_019269-226IPRHIAIIMDGNGRWARQRGLDRICGHIQGVESVRKVVRAAADCGVEYLTIYAFSTENWGRPSREVEALMKLLCESTEKETPALLSEGIRMRFIGDTEALSADVQEAIRRSEKTTADNTGLTLQIAVNYSSRWEITRMAQRVAAEAVRGGLKVEDITPEVISGHLVTAGVPDPDLLIRTSGERRLSNFLLWQLSYSELYFTDVYWPDFDEQEFARAIE
GUT_GENOME088253_016848-238LNHLAIIMDGNGRWAKKRHKPRFVGHREGMDNVERITLAADKLGIKVLTLYAFSTENWARPKEEVAYLMNLPVRFFDKYMPTLMENNVKVNIMGYLDELPEKTYQIVQRAMAETANNTGLVLNFAFNYGSRREITSAMQEIGGLIEAGELKSEDITEKMISDHLMTGHFGKYQDPDLLIRTSGEQRISNFLLWQLAYSELAFSDKNWPDFDADDLKQFVNEFKHRNRRFGK
GUT_GENOME256238_014717-227TDPRHIAIIIDGNGRWAKKNGKDRNYGHLIGSRVVFDTLKTITELGVEFFTVYALSLENWKRPVEETSYLLDLFMHSMDEETIMNCNVRFRLLGRLDTLPQEVSDYLLGLEERTKANTGTVFTLCISYSGRWEIMEAFRKALEIGKTAEPVSEQAIARLMPSSYLPNPDLIIRTGGESRLSNFMLWQSAYAELYFTDVLWPDFKREDLLYAIGYYQARERR
GUT_GENOME020782_0082827-256QMPKHVAIIMDGNRRYAKDVLKTDDTSEGHRRGKDKVEEVLNWCLKLNIRVLTIYALSTENFSREDSEVDYLLKLIIETMYELADDERVHKNRIHVRMIGDRELAPEQLMDAVKCVEERTKDYENYRFNIAVAYGGRQEIVSAVKEVARKVKDGAIDISDIDESSISRHLYTSDDPDPDLVLRTSGELRLSNFLIWQLAYSELYFTDVYWPGFRYIDFLRAIRSYQQRSR
GUT_GENOME128361_005758-233EHLPKHVAIIMDGNGRWAKQRGLPRGEGHKQGAKVFKRICEYAADRGIRYVTFYAFSTENWRRPPEEVSGIMDLFREYLREAEEREQENAQKGLRIRYIGEREGLADDILELVDELERGSADKVRTTVNIAINYGGRNEILSACKRLAEQCARGERTPESLTEQDISDNLYTAGQPDPDLIIRPSGEFRLSNFLIWQAAYSEFIYSDILWPDYTEQDFDAALEEYA
GUT_GENOME057827_005718-234KKLPLHVGIIMDGNRRWAKNQRLPVSSGHSRGAEVFKNLALYCNKIGLKNLTVYAFSTENWKRSATEVSALMFLFKKYLKSVMQDFKNENIKLKFIGDVSKFSSDIQKLINTIETETKSRTGMRLNIAMNYGSRAEIINAIKNMQKDIDSPLNKNICDLTEGKFGSYLYTENQPDVDFLIRTGGEKRISNFLLWQSAYAELYFADVLWPDFSEKIFDEAIQEYYNRN
GUT_GENOME065105_0111716-197PDLTRLPRHIGIIMDGNGRWAKKRGLPRTAGHAAGAETFRTIAYRCRDLGIPYLTVYAFSTENWKRPVEEVSAIMDLLRKYLLEALEKMEKDGFRIRFFGDLEALPPELRTLCLETEETGAAVAGAKYQINVCLNYGGRDEILRAARAWGRDVLAAARGWRTSPTRSSPGTWTAGPCPTPTS
GUT_GENOME140605_0109316-238DHGPRHVGIILDGNGRWAQQRGLVRTEGHKAGAGKVIEACRWADELGIEVLSLYAFSTENWKRSKEEVAALMSLLIEVVRIHFKELMDEGAKITVMGDITRLPLPTRKAVEYAIEKSKNNQGLVINIGLNYGGRDEIIRAFRRAGQAGISPHDLTAEDLDAFLDTRDFPPLDLIIRTGGEIRLSNFMIWQAAYAEFAFLDCYWPDFSKEDLVQACFSFSQRNR
GUT_GENOME113544_008329-237KINHIAFIMDGNGRWAKRRLMPRTYGHKVACKRIIEIVRFLRTLNVKVVSLYAFSTENRNRPKDEITKLFEYLDDFFKDYIDEFVENKCRIHVSGDLSKLPQKSQKTIQEAINLTSSFDDFVFNICLNYGGRQEIVRAAKLAYQDIKEGKINEEDLTIDTFKNYLYTSGLPEIDLMIRTSGEERVSNFLLYQLAYSEFIFTKTCWPDFTPEKLVECLKEYEKRDRRFGA
GUT_GENOME108707_011053-158IPEHIAVIPDGNRRWAVAHGMAKGDGYHYGLRPGLLLLRRAKELGIREITYYGFTVDNCKRPSEQVGAFRRACVDAAELLKAEGADFLVLGNTKSRCFPEELKEYRVRKKVGDGGIRLNFLVNYGWEWDLAHMKKDGRPESWDVSRIDLVLRWGGR
GUT_GENOME154659_003504-213LNHVAVIADGNGRWAERRGLERSAGHEQGLNKVEDMMHWCVDMGIPALSVYCFSWENWNRPKEEVDALFSMANRYFERYREFVENNIRVLISGTDKRVPPESVEKMERIQRETAHCDGLTLNLCCNYSGRMEIVDAIAKGARTEEEIAAALYQNLPEPDLIIRTGGFQRLSNFLLWQSAYSELYFTETLFPDFSIGEFRHAVKRFGDTKR
GUT_GENOME100613_009858-232IPRHVALIMDGNGRWAKKRMMPRTYGHREGVKALHTVIRSLERLGVEYATFYAFSTENWSRPDDEVSELMRLFNEQLDSLTKYVGDNIRLRFIGDRTKLSAELQDKMAYYEQQSAEKTGMTVIIAINYGGYDEICRAAGKICELANEGYVLPKDVTPEFFQSFLDTKDFPNVDLLVRTGGEKRLSNFLLWQTSYAELYFCDTLWPDFNEKSVLAAIKEYSSRDRR
GUT_GENOME218106_0143012-233VPSHIGFIMDGNGRWATAKGLPRTAGHLAGLKACRKVIAESARLGVRYVTMYAFSTENWKRSQEEVSYLMTLIANKLHGELSFYNQYGIRILVRGDISALPKKAAQTIVSTVEETAGNDGLVCVLAINYGGQDEIARAVNRWKVSVDDPSRAITPDDIANHLDNPQVPAPDIIVRSAGEQRLSNFMLWDSAYAEFAFYDTLWPDWGAKEVQTVCADYAKRTR
GUT_GENOME119618_0096275-320APENRPVPRHIAVIMDGNGRWAKKRGLPRKAGHKVGAETFRTIATYCKDIGVQYFTVYAFSTENWKRPQDEVDALMNLFRSYLKEAAETMFERGVAVRVETYTKKIQIEGRVLGDLTVLPEDIRAQIAEVDEIADRLGPDAATASLCINYGGRDEIKNAVRAIAQRVKNGELAPEDITEDTITANLYTAHMPDPDLIIRPSGEIRTSNFLLWQSAYSEYYFTDVLWPDFKTTDLDAAIDNFNNRNR
GUT_GENOME096439_0237131-257ENAPNHVAIIMDGNGRWAQKRGLPRFAGHKEGVSTVNKIVKTAVKANVKVLTLYAFSTENWKRPKQEVEYILKLPKEFLHVYLPSLMENNVRIETIGDFSALPASTQEAVNYAKEKTKNNDGLILNFALNYGSRYEILKAVKQIAEEVKEDAIDMNSLDEETFSKYLYTEDVSDPDLLIRTGGEYRLSNFMLWQMAYTEFWFTDKLWPEFTEETFYEALLAYQQRKR
GUT_GENOME236221_001198-228IQHLGIIMDGNRRWARQHKLESVVKGHEKGGDKFIETCMWCQEAQIPYLTVYAFSYDNWNRSTDEVDGIFELLESFFKKRIDTCIQRDIRLIPIGNMDMLSEKSRNTLNRAAMLTKDCKNLTVNIAISYGGHDETLRAARRIAADAASGTIKPEDIDSELYRSYLDEYESPEMQLVIRTGGDKRLSGFFPWETAYAEFAFLDVLWPDFSHEQFDNTIEQYY
GUT_GENOME110181_006333-238PKTVAIIMDGNGRWAKKRGLPRTAGHKKGADTIRTVALAAQDMGIQKLILYAFSTENWKRPEDEVSYLCKLPGLFFNKFINELMEKNIQVTFAGELEKFPKATQDVINKAVNMTSKNTGLELCLAINYGSRREMLLSIKKYADEVASGKRENDLTEEEFSKYLFVPEDIDLMIRTSGELRISNYLLWQLAYAELIFTPVAWPDFDEKELQKAIYTYQNFDEKALKDCLDEFASRDR
GUT_GENOME243094_0102914-241DRVPRHVAVIMDGNGRWAKKRKRPRLVGHAKGLERLEEIVGSCPRLGVRELTVFAFSSENWRRPAEEVDFLMGLFSRTIDQKIGALRDNGVRLKIIGSRAKLGDALAKKIEQAEAATAGNDLLKLNIAVDYGGRWDILQAVKSLLADGMTNPEDVTEEALQSRLCLGAESEPDLYIRTGGEMRVSNFLLWQMAYTEFYFSPTPWPDFGQEEFARALRSYGTRERRFGR
GUT_GENOME214923_0173513-238KPLPSHIALILDGNGRWAKKRMLPRTFGHRKGAETFREIIHTSADLGIKYLTAFTFSTENWSRPKEEVDFIMNEIVHLCKDWEKLEKQNIKLSVIGTRKNVPIYVQNALDDVCEKTKTCNGMHLVVAFNYGSKEEIVHATKEIATLVINGVLQIEDINEKNFENHLYTKGIPPVDLMIRTSGEIRLSNYLLWQNAYAEMVFTKTLWPDFHTKELYEAILEYQKRNR
GUT_GENOME026735_006817-226NAVKNLAIIMDGNGRWAEQRGLPRSDGHTAGIRRMLSLTSHAFDKGVRSVICYSLSTENLARAKEEVDHIFTLIPQYADMFVSACQKLGVAVRFVGDLSLLPEEVRGSMANTEQRTQSCSADGKTLYVAVAYGSRAEIVNAVNAAVAKGQPVTEQDFLQSLYYPVEPDLVVRTGGRHRLSNFALYQLSYAELYFSDKLFPDFTEDDLDAALDYYADVERT
GUT_GENOME147473_0016010-235SLPKHIAIIMDGNGRWAKSKGKPRVFGHKKGVNAVRKTVAAASKLGIKAMTLFAFSSENWRRPEEEVGLLMELFITVLSSEVKKLHKNNLQLRVIGDTSRFSERLQKKIVEAENLTASNTGMVINIAANYGGKWDITEAAKALALKARNGEIRVEDINEQLITEHLTMADLPEVDLLIRTSGECRISNFMLWQMAYAEMYFTPEFWPEFDEDSLVEAVTWFINRER
GUT_GENOME011447_0032153-282IKERLPKHIAVIMDGNGRWAKKRGEERIMGHRNGVNAVRQICEACAELGVGYLTLYTFSTENWNRPKEEIDALMSMLVTTIAQEEKTLMKNNIKLACIGDLKQISSETRNALLECIDRTSGNTGMVLILALNYSSRWEITEMVKEIAGKAKRGEINIEDINQEYISSQLTTAEYPDPDLLIRTSGEYRLSNYLLWQLAYSELYFTPVLWPDFSKQNLYEAIVDYQKRERR
GUT_GENOME053651_001964-225IPCHVAFIMDGNGRWAVKRGLSRLKGHREGIKKAELVIDYCLRRGIKVVSLFVFSTENWKRAEREVAGVVEAGLFKLASRYLDNIADFSKRGIRVVVSGDLSRLPEELVLKINKAETATSSNRVMTLNLCINYGGRAELTHAVNEIIRSGKNSVSEKELGEFMYQNLPDPDLIVRTGGQMRLSNFLLFQSAYSELVFSDTLWPDIGEKELDGFLNIYQHRNR