UHGP-MC 51211


Information


Number of sequences (UHGP-50):
171
Average sequence length:
240±40 aa
Average transmembrane regions:
0.15
Low complexity (%):
3.75
Coiled coils (%):
8.33
Disordered domains (%):
14.94

Pfam dominant architecture:
PF12083
Pfam % dominant architecture:
1754
Pfam overlap:
0.28
Pfam overlap type:
shifted

Downloads

Seeds:
MC51211.fasta
Seeds (0.60 cdhit):
MC51211_cdhit.fasta
MSA:
MC51211_msa.fasta
HMM model:
MC51211.hmm

Sequences list (filtered 60 P.I.)

Protein Range AA
GUT_GENOME014504_004013-171GRASSTAKEPDALVDLKGKLEEEKTRHAHMIAVNKYFRERGTVKGCEGVGSCEKAMIEGRLKMGDRSPFLPWQFSSSKDRIRRLQARIRDAETAIAAGSQPVKVEGLPGVTYYENGALMRVQLTFENDPGPEVRGILKGNAFRWSYSQRAWQRILNENGKRAARAALEE
GUT_GENOME098668_0096187-354LTSFSPDERGESDIASHEKELHEDLQSMPEQQRERYMENYKRYFSAMIAANSRCASAMITGPARFNTGRNEKACNSHAKSVTAFREWRERALEAIRKATEAAKPEEQRLEEEWQKVKAFIDDAASTIHGIDTGTARGYSRALFVSNLAGRLSTYVNHGNVEIIDRAVARLREWNDKVKKPVVTARHSIFKYPELVRKVREKQQERASRENREIPFDGGKVVYNFEEDRLQILFDKIPDTDMRTTLKRNAFKWAPRNQAWQRQLTRNAE
GUT_GENOME232186_0241860-328YKPGSTTASYRAAVDAFAAKCEAAKQGCRPGHEAKLDALADRYARRLAAYYNNSAANNARHVSWFVAGPSNYNMRAHEKWSRREEKLREDWNAIQRMEDEISRAAGDPSTILSGDPEAVEQLQAKVERLTLAQDTMKAANVHYRKHGTMDGFEMSPEAAKAVKEVTLYARGPFPTYALTNNRANLKRTQERLSALLAAKETAPVEQQAAEGCTYREDTEIMRVQLIFEDKPDADTRELLKANGFRWAPSQNAWQRQLTDNGKRAAREVI
GUT_GENOME279529_0025992-370AFSGTSHVPEQRSMQYIRMYETLLLSDLEKIPEAEREAYYDKFKNWVGILFSKQSSILSPMITGPARFNNRRNTSANNAYDKAVEDFNKWRGNYAKGVLRRIEAAKTPEQRAAEEWENFRKELLPTMTSIVDIDEGRARGYNRALFVSSLYGKIERKAHNGQSALVVAALDYIKEYSAGLRKPIFTPRHKVWKLAETCQWREAVRKKNAERESAEFPVDGGTIVVNYAENRLQIVYEEKPSAAVRDSLKKCAFHWAPSEGAWQRQLTTSAISAAVHVLF
GUT_GENOME000880_0156539-298DKAYELGENVILKKPSEEERVANLCERYSRRLAQNINRDIQIGMMCPSVMISGAGNFPVKKKEKQVAAWDKNREDYEQVERILWKIGNILSGKGVIKSDDENAIEKLQEKVDILKEKQEQMKEVNKAVRLKDTEKGNEILRNMGYTDKQIEQLRIPDYCGIIGFPSYLLSNNNANIRRLEWRIKSLQATKSNGTQESENEFFNVREDVDSMRIQLFFEGKPEPEVREILKRNGFRWAPSVGAWQRQLNENGKYAVKSVLR
GUT_GENOME007232_0030797-366ERARFYIHSYEDTLQNDLKQILGEERERYTEKFREWVRTLFERHSRILSPMITGPARFPTSRNNKANNAYDNALNEFENWRVKAIKAINRRMEEAKPEEQKRSEEWLRLRMEIISTANTLKDIDTGVNKYSMRSLFISSLYGKLERIANNGKADLIQKATEYIKELNGTLPKPIFTARHKFWRLSEVVQASIKRESEIRGRDDAELAFDGGKVVKNYAEDRLQILFDEKPDQETISKLKHNGFRWSPRFTAWQRQLTSNAFYACARVISV
GUT_GENOME217295_0212333-259AFRRSEDAVRGIPAGQPILPGARGIPQRRAQERSWKLIGASVKNDRKADYLRSRADAVRENTAISGDDPEALEKLEKKLVTLQAAQERDKALNTYYRKNKTVKGFEGISDEGAAKIDVGLAGLREALRHPVPAFQLSNRNAEISRLKKRIEQLRRVDGMEHVEIPFAGGVLMTNEEINRVQIIFDDKPDETLRARLKANGFRWAPSEGAWQTQRTPAALRRARLILG
GUT_GENOME225391_0124893-320AFSATSFDPEKRAAQYIREYEKLLLDDLKEIPQDEQGQYIAKFKEWVATLFAKHSRIMSAAITGPARFPTERNRKANNSYESAVAEFQSWRERTQKAIARRIEAAKPQEQKTAEAWERIKEDIDRFVDWNLCSTNLYNRLETIARKGEVELMQQAIDYVRELNKGRKRPIYTERHKFFKLAELAAVIRGRRAATVTKENKDVPFDGGIVRYNFAEDRLQILFNEIPDK
GUT_GENOME277992_0076347-262SYGVDPERFIEKRFNLLLKYLNSESRCMSAAVVGFSKFPAARNEKRTNWAHNHLTRIVQFDETLEKKLKRITRPRLSQTEKSDVWEKKIEALKSLQELMKKCNKLIRSGKKAEAEQMAGFKFVPDYAGRIGFPDYKLRNNLANIKRLEKQIEDVKRMTTKPKEEISFDFNGGRVEYDADEIRFNIFFDEVPEVEKRSKLKSHGFKWSPRRSAWTRG
GUT_GENOME147338_0345085-341DAYVIDFKKDYNERLFSLFNDYLGSESRCANWAVTGPARFPVERNRRRMESAQNKYDAINAAEEKALKKAKRELFPNGDGSFINSASDNAVEQLEQKIAETKAAHEKMKQINTIARKYYPKGSTEQATSKTKQRCIDELKEQCNLTQEEAEKLLKPCQYRAVVIPFETYQLQRSLQEIARLESRLAEIARLQSQKLEGQFKNGERFFVIDNRIAIDFGYKPSDEIRHLLSKNAFKFSPSRGNLWVRKLTANAKFAFE
GUT_GENOME113543_0032181-317SWLVAGRSNYNFARAQKQLDAERRAEESFAAKMAAYLDHTANQLARLVSPARQIEAYANGCDDPIASGDPLALEKLQARLDYLTATQERMKRVNAWYRGHGTALGCPDLSASAALKIDDVLAEEVARGCKSPVPFEPWALRNNAANITRLRTRMDDLSNLKAARENQKNSASEFENPCGLEVVENVETMRLQLLFEAVPSAEVRALLKQNGFRYAPSCHAWQRQLTDNARRALSSLL
GUT_GENOME131331_01677110-374EYESLVIEDLKKLPPEEHDEYVAKFREWVGTLFDKHSRILSVMITGPANFPVGRNEKSNNSFDRAVEEFSEWRAKYAKRVAKRIEAAKSPEEKEAEEWHGLKRDIDYNVQTCVEIDTGKNTYSYRTAFTNSIFGKIERLANSRKAALVLKALAYIKEVQESDTTGLKKPLFTSRHKIWKLQEVCEQAIQRQAERENRESVKIPFDGGKVVKNYADDRLQIFHDKKPDSNVIFSLKRNGFKWSRFNGCWQRQLTSNSYYGAARVII
GUT_GENOME039366_0105962-259SWNALGKSVKLGEKAEYFERKAEAAENNNSIYLGDDDAVDRLQEKVDALEKAQGMMKAANKIVRSKKLNDIAKVEQLQTLGFSENKAIELTKPDRYGEYGFPSYMLSNNNARIRDAKQRRDRARKLKETEDKEYTISGVRVVENAKENRLQLFFAGIPSKEIRSQLKENNTFRWTSSIGCWQSYLNHWCIERAKVILN
GUT_GENOME082440_00479406-703IDEETARNAHYCIHMGDYKSGSATASYRNSVNKAAQMVEQQKARVSAFYHDKLDALLNSYARRLAQWTNDYNRNQASYPSQFIAGAGNFNMRKHNRQMSREDSLWEEYRQIEAILDKIRSVGTGPVDLADPHAREMLTERLNSQRQMLEDAKTANAYYRKHKTLEGCPGLSEKNRAWLTRPGVFASGDGSPISQYGSPFPAYELASLRGKIERTEQRLAELDKREQQAAEPQTGTAFDGGQIVRNIDLNRLQILFDAIPDADTRAALKQNGFRWSPKNQAWQRQLTDNAERAARQVLR
GUT_GENOME255140_01400574-783KYFDIDEEQARRGRESYSLFGYKEGSETAYYRQQVDYAFKLAAVAKEKTADPELQARADMLAERYAKKYAEWLNEKNKIDASYPSWIVAGPANYNTKKNDKKNARLDAHFKKLDEIEGIKSEIQAVPYYKPREEKIYTVKERQHEGADKYFKIERNKDVNRLQLKFDGKPPQEQREILKKHGFRWSPTQEAWQRQLTPNAEYSIDYVIRD
GUT_GENOME111442_0008339-297VDEAYDLADKVAESRPEEADRVYGIADRYSKKMAANLNDRSRIGCMCPSVMICGPANFPVRKKEKQNAASDRNYQEFQEIQKMLSRIRSIGNGKGIIKSGDVDAVERLEKKLNGLKELQEKMKAANAYYRKNKTLDGCPVLTQEQIDNLKEKMQKDWHYEDKPFATYQLSNNNAEIHRIEGRLKKLRAAKTEGNTESENTYFRTVENAEIMRLQLFFDDKPESNVRDVLKKNGFRWSRKNNCWQRQLTDNARYSLERVK
GUT_GENOME204256_0002629-297FVKDFDDLLLRIKNKIFSDKILLDGQKQEIYERIASKLKVLAEKYLSAESRSVSVMIAGPAKFPVARQQKVMRSVEKNLNEYVDRIEWLQNGGLSKILYRNYPDEAKKDLDDKEDEKKLMVNAKMFVENKISEKDPEVFTFNAFPHLKGCFETCAKHKSFNACRKCIEFLESQLDFCRIQRNITILKRILSEAEEKSAQTPEKSVKQQDFKHFTVIENEQINRYQIIFGGKPAPEIIKNLKENGFKWAPSQKAWQRQITINGKLAIKRF
GUT_GENOME179869_018452-168INNKPIKSDDPQAVEKLNAKLEACQKQHDFMKSVNSYYRKNGTVVGYPGIVESQAKAIDDKIQNGRYWDLPFPSYTLQGSNQEIRRLKQRIAELTHNREVGFVGWNFEGGKAVANRENHRLQLLFDEKPDETRRIELKRNGFRWSPSEQAWQRLLNDSAIYSSARID
GUT_GENOME229724_0121293-252IYAEDDDAVENLTERVAALESLQERMKAANRIIKNLKQTQEEKIEALCRLGFERRNAEELFVPNCFGQIGFADFTIRNNGANIRRLKKRLESVARLKSTPTKQYTIGEVRIVENTEANRLQVFFPEKPSETVRKELKSNGFRWASIAACWQSYLNERQKY
GUT_GENOME236267_0128237-256CEVCKQYGIAPDRLLQKHTHLSEAYLSAESRCASWAIVGPAKFPSAKMQKRAEYAHAHLDRLCRFVENIEKIVQRLAHRSESQDDKCARWNKELARRMCLQEKMKEVNRLIRQGRKAEAEALAGHTLKPDFLGRVGYADYELRNNLAAIRRLEKQIKLVDLAREKKADSGFEFAGGRVEFDPAEIRYNIYFNDIPAEELRRTLKHAGFKWSPRRKAWTRG
GUT_GENOME108494_011501-148MKEINAYYRKYKTMKGYANLTDEKAKEYDKAIEESWYKVPYARFELTNNNAKIRNAKERLASLEKVKATAEENNDEEKFSDLPFTVVRNTEIMRLQLFFEGKPSAEIREVLKKNGFRFAPSQNNAWQRHLNNNGLYALKRVAQTLREC
GUT_GENOME157329_037223-167IKTDDPQALEKLEAKLKNLTDRQEMMKIVNTILRNTSTSKEEKIKLLGAKYQLSPETVEKLMTTQYSFEKPGFQSWELSNNNQEISRIKKRIKLILHYQEEVRKVEETGELLEIEFDGGKIVDNVPENRLQIFFDSKPNAEVCAKLGRNGFRWTPSKDAWQSYRH
GUT_GENOME137885_0097962-305TDEQKSECLAYWSDRLYKNAVDYLNAESRCISWAITGPAKFPVKKAEKAQASAEAKLNDYVYTLDKMKSAEIFNRYLTAEQKQNRVDEQKWREIKYTLSEFVAYKKHGLTLDPNNWTFNAFPHLKRAFTTQAQNGNFAMCDKILEYLHERQSEGLKVEKNIKILTDILDEWKAKLATPQESTEKEVDGVRIVENTEENRLQLFFDGKPEADIIALLKQHAFKWSPRFKAWQRILTPNAKFALNE
GUT_GENOME242980_0151531-253VEAFSNARKIQDNIPLGQPILIGHHSEKRSKKDRDKIDASIKKSVEKQEKAEYYDEKIKAAEKNTKISSDNPEAIELLEEKISKLKARQQKYKDMNKYYRKNKTMVGFEDLSDEKAEEINRRIDEDYSFNKKPAPSYILSNLNAMIKSAEKRLEQLKELDAMEYEEVEFDSCTVISNDITNRVEMHFGYKPNEDMRTMLKRKGFKWSRNNGCWQRLRNKNSLS
GUT_GENOME106598_0192380-260YEAKAEAAENNQAISSDDPEAIQKLTDKLEQLIEEQSYKKRVNAYYRKHKTCKGCDGVTDERAEMLDSMVKDESSYEKSPFRRFELTNLNANINRIRKRIEVLKARKEAPPESWEFEGGYVYMNLEENRVQIFFDDIPSEEFRQFLHRNLSFHWSRYHGAWQRQISDAAIRAAHKATDRFL
GUT_GENOME018891_00994238-462KKSKKSVANSIEKPTRHYYEINEQRAKYAHQMYSMRDYKENQTTNMYKAEVDGVYEMALEAKTKCNNDQEKIKKIDYLTDKFSKQYAEWLNKESELKLRYPSQMITGAGGWTPNKIARKESAFNNLFNELKTINKIKEDIKNEPFKVKRNETKGVAVQSDRYESKYFKVIQNEQENRLQLKFDGKPDDATRTLLKKNGYKFSPSRGVWQRQLTNNARYSVRLINE
GUT_GENOME149831_01089107-386AYRWSSFEPEARAETDIMQYEKQLVEDLKQIPEEKQNEYVSAYHSKFSALLGSLSRCASPMVTGPAKFNCQRNNKALDAYQNRFDEFQDWRNRFKAAMERMKEAAKPEEQKQEEAWNRLKRDIASSAQTIHDIDTGKARGYSRALFVSSILNKVSTYAGKGEVEIVQKAVDFITDFNAQCKKPVITPRNRFFQLPEMARQARLKLQEIRERENRELKFEGGTLVWNYEADRLQILFDSIPDDQRRKELKSYGFKWSPRYQAWQRQLTQNAVYAVKRVLNL
GUT_GENOME011457_0128586-322RRRAIFVPVNVAGPAKYNFEKANKQAERDLHNSNEWDDKMAAFLSNTAKELERLTPIDLVIEELRSGMRGDEVITADDKYALDKLNARLEYLLETQADMKAANKHYRKYGTMKGFYKDEKTAEEKDKEILTHYSWEQQPFESFELTSINGKIKRVKERIAHITKLRTEQPFEDFEFEGGKVIANYTEDRLQVIFDDKPDSTVRSEMKSNGFRWSPRNGVWQRQLTKNAYATAKQLFS
GUT_GENOME274039_0175131-243KSEKAVEGISFGQPILVGHHSEKRHRAAVARAQAAGTRAVAESNLAKHHQKKAAGLSDYLENTIFDDDPDAIEKIEQKIARLEKEHEFMLAVNKICRNRKLNEAEKISAIVALGASEESARKILAPEYSWQSAGFESWALSNNSANIRRYKERLATLTARRERQAAAENAVNGVLIQDRPDGYLAVTFAEKPDYSVISALKTAGFRWCKGSWY
GUT_GENOME078715_0184611-151ARTAHYMVHMGDYKPGSATEGYRAAVDEAAARMDKAIAEATAFAQRPMPDFELTSLRGKIKRVQARLEELEKLRAGDAPEGWKFDGGEVVINTDLNRLQIVLDGRPDDNMKQVLKSRGFRRAPSQGAWQRQLTDNAIYAAK
GUT_GENOME051352_0090424-162SEDNKKTKWIAKLEQLKSRQEMMRDVNSLVRKGKVEEAEKKYNIELRPNIFGTIGFESYELRNNLANIKRLEQQIAQIDRVRESKAETGFTFDGGRVEFDAEEIRYNIFFDEKPSEEMRGRLKHSGFKWSPRRNAWTRG
GUT_GENOME278758_01172163-364PGQPILVGHHSEKRHRKALDTSWRKMGESVKESEKAEEYDRRAEASATNDAVFASDADAVELLEKKIAVLERVQDKMKKANAVIRKTKDEAERTARLTEIGFTEAQIKEILTPDCFGGIGFQSFTLKNNNANIRRLKERLETVKRNKEAEPSEYEINGVRVEENPPENRIKLFFPGKPDEQTRQAVKRMGYRWTPSQGCWQC
GUT_GENOME258945_0004974-305YLERANILLRAISRTASSAVVGRSNFPVRQNEKRLQSEQKWRVEYIKYPANRMACFKRHWGIRKSTAIRTEDDDAVIRMEAKIKKMEAIQERMKRVNAIIRKAKGNDNQAREEISKEFPKLSHKEITLLLMKNPNRGQFRKGYQAYEMSNNNANIKRCRERLAKLKRAKETQDQEFACQDDISVEISHSENRVKIFYPDKPDLETRQNLKSSGFRWAPSEGCWKGYVNRNTL
GUT_GENOME256840_0280663-248RSMEIREKAEYYRQKAEAMEKNTAIYSDDPNAITKLQEKLAQCEAKQEYMKAANRYYRKNGTMQGYEEIKDEEAGQLDQNIRDDYSRENIPYPSYKLSYNNANIRRLRERIKSLSQNMKFVGWEFSDGKAVANTEIMRLQLIFDERPDEEKRCILKQNGFRWSPSEGAWQRLLNDNAIYAAGRIDF
GUT_GENOME109374_00192112-284EGQNLNKTIYSTDADALEKLKERLEEQEKKYAEMKKLNKYFRENCTVKGYPGIDDEKAAKIDERISNAYSWCKAPYPQYEMQSMSQKIRATKERIKSLETEQTREETEYDTDGLGFDVVENKEIARLQIIYPGGGRVDNETYKRLRENGFVFSRTNGAFQRQLNENSRYAVRR
GUT_GENOME090750_0132925-300YKWGTVTARYQELADGAWELAERVARECPEKAEAAHKMAERYARRMAENLNKANRIGTRCPSILIAGPAGVDSKKKERQCAAYMRNETEYHELQKYLEKIAGLLDEKTSVLSKDADAVERLEKKVKALSDLQEHMKAVNAYFKKHGTLQDCPDCTAAEGLQLMQDMEKDWHLEKRPFQTFELRNNSQNLRATQKRLEGLKSAKNAGNGEIITDFFKVVKNAEIMRLQLFFDEKPDTDTRMILKRYGFHWSPSNSCWQRQLNQNGLFAIKKVLEALK
GUT_GENOME259118_0053247-276IPEELRADYEAKYLQKWCEWLAALSRCYSVLVTGPARFNNRRHDRMNDYERAAKQRLQDWRDKVVKRINRQERLTGWQEVERLQNKLDTLTEWHEQMKAANNIIKNKSLTEDEQCEELAAIGLDKREITDVMGKGALPWKGYPTASLSNNLTKIKATQAAIERHKAMAEAEDKEITFNGGRVVMCNSDERMRFYFDAIPSIEVRNLMKRHAFKWSPKNGAWQRQLTNNCK
GUT_GENOME233284_0045028-312YKMNSATNEYKSILAEFSEDINLLINAHPNNATSDNMELINQLADRYSKKLADAINELNRIDSSCVSWMLSGPANYPMKKHQKQQQARDNFYAKNQSLFDPFNNRFYKKIRNLLTNATISSSDALAVEKLKLRIEELEEKQDLMKRSNAWYRENGTMIGFENLTDEQAKILDEAIKNAPTCKTMPYAPFKLTNNNQNINRLKGRVKKLEAMKERTADSVNEYEQIEGLQVEEDKETMRIKLIFDSIPSKEERDILKLWGFKWSPSNSAWQRMLNHNGIYATKQVI
GUT_GENOME026153_0215053-262EKRHRRDLDKSWNAMGKSVQESEKAEYYRKKAEAAENNNAIYTEDEDSVERLEEKIARLEKLQQAMKDRNKIVRDKKLTEEEKIAKLIETGMTEKAAQGLIVPDVCNDIGYAACYLTNNSATIRNAKKRLERVKRLKSQEEKTYEVNGIRVVENPQENRFQIFFVKKPAEEIRQKLQHVGYRYSYGNGCWQCYLKRWNIEAGKQILMSLG
GUT_GENOME263819_0059342-321YVENTATNEYKYYCDKVYDVLEKIIEQKPNLAEKATYKVDRYCRKLADYYNAYYKNEASCPSILITGAGNFPIKKKNAQNKRREKLHETWKYLEQQSEQIKNLLIMDQPILSKNQDAVELLEEKIAKLEEEHKQKLYWNKYYKKNGTLKGAEGLSDKQIEIVEDFVRRNPSFAPFSVTNDTANIRRYKQRLEKMKEAKATGTKIETVNDENNNKLFKVVKNTELMRLQLIFTDKPNDEVRTILKKNSFRWSPKNNAWQRQLTENGMFALKRVVNEINKLS
GUT_GENOME234507_0111423-291YKPGRATKEYRNQIDEAGKELDYVLERCKTPSQMEHAADLFDKYCKTLAFAINEENRIGCMCPSVLITGSGNFPTRKKEKQVAAWGANRSNFEKADYYLRLMNGVHLQGIQSNDPCAISALKEKLAKLKSKQEKMKGVNAYYRKHHTLEGCSLLSSEQIAEITSDMGGWNDKPFPSYSLQNNNANIRSTEKRIKELEATKATESSETEYDGFSYVENTDIMRVQFIFDGKPDDETRNILKEHGFRWAPSQGAWQRQLTSNGKYAAKEVI
GUT_GENOME172984_0164935-306NSETESYRKVVDEVHEIGQAAKERTIEDKHEYIDYLCDKFAKKYAEYVNKGLSIKMQCPSVLVCGPANFPTRKKEKQNIARDNHRKYYDDKLAPIVDKIQKLGTGTEVIRAGDPLVIEKLNDKLESLLEEQASMKQENAYYRKNKTMVGCGGISDEEAEKMDRYIAEYNGSRAPHMSFSLTNINNKIKAARQRIQELSKIKEKGTEETKTEYFKVVENAEIMRLQLIFDGKPDDKIRAILKSNGFRWSPKNGAWQRQLTNNARWSVKRVISA
GUT_GENOME045107_00125109-364QLHDDLMNMPEAEQEQYMQNFKSYYSNMLSASSRCASTFVTGAANFNHRKNEKANASYQKRINDFTEWRERALKAIAKRVEDQKPEEQKREKEWKRLRNDIGSSASVIHAINTKASKCYSKALFVSSIYNKVSTFASHGEVEIVDKALAYIREWNIRVKKPIITERHKFFQLSEVAHKARAQQEKLAGTENKEYSFNGGKVVLNYEKNRIQIFFDEKPDRAMINRLHYDCSFNWAPTCGAWQRIITYNAVDVTKRA
GUT_GENOME101195_0007660-226RKHLERIDNEMRKSIQESEKADYYRNKLDNIDNNKVISSDDPKAIEKLQAKLKKLEEAKIEVKARPHEWYELPYLNADIKRTKDRIKEIQELEELQFEEITFTGGKAILNRDINRLQLLFDAIPDEETRTLLKGHGFKWSRYEQAWQRLYNKNGIYAVRYVVKLINE
GUT_GENOME083214_00828104-366TMYERELHEDLQTMPEEQREQYLSNYKSHLSGVWASESRVASAFVTGPAKFNYRRNEKAENAYRNKYEAFRQWRERAFKSIEKYKESLKTDEQRAEELWTKVLADIDSTAETIHSLDMGKERGYSRAIFVSNLFGRIATHAKNGNVEIVDRAVARVREWNAKCRKPIITERHSFFKLPDVARSVRQKAEETAGRENREVAFEGGKVVWNYEADRLQILFDAMPSEEMRSKLKTRAFKWSPRFQAWQRQLTDNAVDAARQVLNL
GUT_GENOME282970_0276170-275YNTILQKDSEMVSVAVAGPSKYDHAVNQKKQQRLWKYEKEAFGKINRFIENTHKRLKDLEPLVVTLAKIAAGEFEIISSDDPHAIEKLEVKAIYLRELQDKMKTQNKAARQRGEKAPYPPFSLSNNHQNIKATEERIAVIKARNEKTLEGCEYENCKVAVSKEDNRVRILFDGKPNEIVRTRLKQNGFHWSPKNGAWQRQLTEAAM
GUT_GENOME130229_0058030-239NASNDAVADIPLGQPILVGHHSEKAHRRALERSNSAMIRSVHESEKAAYYARKAEAVENNDNIYIGDDDAIERLKKKIAELTAVQEQMKATNKIIRAKNMTDVEKVEALVHIGFSAPYAQRYVANGTQFPAYALTNNKAKINAAKKQLAKAEALANKEDREYTIDDVTIEECYSENRVRIYFPGKPDDEMRENLKRNGFRWAPSMGCWQA
GUT_GENOME015514_0049386-245ISAENNNSISSDNPDALKLLREKLEKLTNNQNLMKAINKIVKNKKSTDAEKIEEILKLGVSKETAQKALEPDFCGRIGFPAYALQNNNANIKRVKERIIELEMKENETTTEVEVNGVKIIDNIDANRLQLFFPDIPSEEIRNTLKHSGFRWSRYNGCWQS
GUT_GENOME147482_0356197-343RKLKRYFDEYISAESKCISWHVTGPARFPVAKAEKANKNSRDKLNQYADYPQRALKAIAKKLFPDGDGSVVKMSSDNPVEQLRIKISEAETMHGYMKIANRLVPAAYKASENGELTDKSAAVLENKMLERGIPQNLLKTFLEPNPIVNTWGRFSLANSNANIRRLKQRLVEAERVEESRNLNSLEGELENGIKHGVIDGRIGIWLGGRPPKEVTQRLRKFSFKFSPTRNNAWVRAHTVNAEAVFKRD
GUT_GENOME101405_0021994-347LDELRRYYEGYRAKVLAHLSALSRCMSAYIAGPSNFPVHRAEKANAAERKRMEELTEWQERAQHAVKRNLGLLPPSGVITSDDPEALEKLRKKLADREKMQEVMKAANKIVRDRKMTDAEKISALMKCGLTEKNASAVLHPSESYLSPGFDSYMLSNNNQEIHRLRGRLVQLERLQAEAAKQAENGGAPELDFDGGRIVDNAAANRVQIFFDGKPGAELRAELKSRGFRWAPSVGAWQAFRNWHAMSSARQICG
GUT_GENOME206704_0048330-270YRASVDEAYALAETAAEARPEQKERALALADRYARKLADWTNKKYRIDSMCPSVLISGSGNFPVRKKEKQNRAIDAHWQEYEKLKAMKESIGRLGDESSIIKSGDADAIEQLRKKLDGLTAKHQRMKDANAKARKEGKPAPYAPYALSNSNQNIRATRQRLERLQTAKEKGTSAQHIEFMGESVEVIENAEAMRLQLVFTGKPAEDVRTTLKKHGFKWSPKNSAWQRQLTDNARFALRQML
GUT_GENOME067377_0142344-299DLADRVAAERPEEAERAYRLAARYAKKMADYYNREASIGMMCPSVMISGAGNFPVRKKERQVAAWERNHQFYEEAQKILGKIESILNGKGIIKSDDERAVEKLEEELEDMKTLQEQMKAANRAIRLKDTEAGDDLLREMGYSEEAIKELRKPDYCGRVGYPNYALSNNNANIHRVEERIKKLKTIKERGSSEVEYKTFKVVENTEAMRYQIIFDGKPEPEVRDLLKSNGFKWAPSQGAWQRQITSNGRYALGKVVE
GUT_GENOME076094_0077648-223PIIIGHHSEKKSRKLHERAWQDIGKSIKEDKKSQYYKNKAESVENNKVIYNDDPNAIQKLKDKLEYLEKQRELIKADKNHKTWELQNIGARIRETKRRIARLEKLDEIEFKDIEFAGGKAIHNKEINRVQLIFDNIPDESIRTALKSKGFHWSRKEGAWQREFTENAIKATNILIR
GUT_GENOME278552_00457236-537YQQGSATSSYNASVDQFDKNVNELIQRYGNNATLTDKDWEEVYSIADRYAPNLEKYTDETNRNEASYPSWFISGPARYNTKKNEALMSRSRALYENNADRINPDDNVYLKKIKSILTNASIKSNDDAATAKLQDKYDKLKAELENGKAMNAYFRKNKTLVGFPGISEQSAKAFDRANASGDYFSRQPFMSYRLQNGNAELHRIQARIDTLNKAKAQAEAAKANPQAAAEATAAKYPKVDGVEVQENAEEMRVQLRFPGKPDEQTRSLLKSHGFRWSPTQGAWQRQLNGNGTYAARQVMNTLA
GUT_GENOME063466_0109545-266CFRQSESMASVIPMGQPVHGKADRNYREKIWNKMGQSVKASEKADYYERKAEAAENNNAIYLDDDNAVEKLERKLAELVKAQEDMKAANKVVKNKKLTEEEKKVRLMELGYSETSAVELLTPCYGHIGFPSFSLSNNNANINRIKKRLELAKRMKGTPEKEYTINGARVVENYPENRLQVFFDDIPAKEIRDSIKQHGFRWSRYHSCWQSYMNRRNIDFIKE
GUT_GENOME147139_00133639-906DISKERAYQAYYWSSMSPEKRGEAFRTDYADTVNNIYDELKKQVQSPEQEAILNDEMNRFKKGYLQRSLKLLNATSNTASPMVTGPAKFNVRRNNKMLDRKYSALEELTAFSDKAKAAILKKIRTGTPQEITSENEWRDLKRELDATHMKENLFGRIETLARNGKVELLGKALEHIKNKNVFTDRHRIWKLPELAKKTADALATKSAYQGAEIVSVPADNRLRILFDARPDQDVITDLKKSGWKWSPANNAWQRQLTPTAMSSAKNIL
GUT_GENOME168542_01538983-1269YRPGSATAEYRHYVDQAVEIAERQKKRVEPEYHEKIDSLLDTYARKLAENMNKGYEITARVPSVLIAGPSNFPVRAKEKQNAASDRNMEEFQYIQGLLDKIRSTGMGGISADDPQAVSKLEKKLEKLEASQELMKAVNAYYRKHGTLDGCPHLTERGIENLKADMASGWHYEKKPFQSWQLSNNNAEIRRLKGRIEELTRQKEAAYVGWEFDGGTVEINRVANRLQIFFEGKPDAAVRDELKSNGFRWSPKAGAWQRQLNDTTIRVADRIKCIQPLSGEKPSALQLA
GUT_GENOME145926_04190173-349NKAAGVGRGGISSDDPDAVFKLLCKLQSCMKSQLSMKAANKAIRSHKKDRNQQVSALIDLGFTFEDTNVLLDGDFCGRIGFAPYALQNNNAEIKRLQTRIKQLESVKAVAEPQRKEYDGFDLEIDPEDNRILFYFDGKPADEVRSVLKSHSFKWSPTRKAWVRKITPNAVAQAERVK
GUT_GENOME089143_0010423-297YAEGSATSEYRREVNRAATLAEECKKGKTEAQQEKIDYLLDRYARRLADNMNASNRNRASCPSVMVAGWSNFPVRKKQQQLSRDDTLMREWRDIQGILDQIRAVGHGGISGMDADARERVQAKLTEREAMQEKMKSVNAYWRKQGALVGCPGLSDKEVARLTASISQGASTGRSEPPYPRWALDNNGAEIRRLRSRLAVLDAQQAQGDSEQTFTGGVLRITPERVQLVFDDKPAAEIRDIVKQWGFRWAPSQGAWQRQNTANGRYAAKQVVKAIE
GUT_GENOME129596_0154458-258NYRKRIDDKFESAFREQSKADELRAKAEAAENNTAISSDDPEAIQKLKDKLLGLEESQEKMKAINKYYKKNGTCIGCDGINDESAEKLDNAAKKNYEGRPFPTYSLTNNNQNLRSIRLRIEELEKLRELDFEPVEFEKGKVIVNKEINRIQFFFDGKPDEETRGVLKSWGFHFSRYNNNAWQRQLNSNGIYATKKVLEKLD
GUT_GENOME215400_007713-165IKLDDPNALEQLTEKLSKRRQLQEMMKEINAVIRSSQSDDAKIDFIMKKCGKSWEEARNFLHPSESWCDPGFAAWELANNNQEIGRLRKRIREVERYREAAARAEEEGNSEFPFDGGRIVDNVPANRLQIFFDGKPDADVRTRLKQNGFRWAPNSGAWQSYRH
GUT_GENOME239585_0019930-214NQYSNSNANRILQIAPGQPILIGHHSEKKHRRLIKKAQDDIRRSIEEDNKSKFYQERAQTAENSKVIYSDDPQAINKLKEKLERLEYEKSVIKAREHYTWELTNIGAIIRETKKRIERLENLEKIEFKEINFPNGKVIHNKEINRIQFLFDDIPNEETRKILKSHGFRWSRYEKAWQREFNQNCI
GUT_GENOME036193_0163455-269PAGQPILPGRRGITHRNTLEKSWNALGKSVKLDEKAEYYKSKAEAAAKNDNIYLGDEDAVERLTEKLEGLKRSQEIMKFVNKIVRDKKKTREEKIKAIQENGLSENQAEKILTSGYVGFASYSLTNNNANIHRVEDQLKRAIALRETETTEETINGIRVVRNTEENRLQLFFPDKPDKETRSKLRHNGFRFAYSNECWQSYLNNRQIYRAKELLK
GUT_GENOME085736_0162659-248YRDRAWGKMEKSVEAGQKADYYRSKAEAAENNSAISSDDPEAVTKLKEKLQQRQEQQSYMKKVNAYYRKNGTMKGFEGISDEKAAEIDEAVKSDYSWINAPYAPYELSNNNAEINRLKKRIESLERREETGFVGWQFEGGEAVANTEENRLQLLFDEKPSEEQRSKLKGWGFRWSPSNKAWQRQLNSNAI
GUT_GENOME155367_0145955-270RDRRYRDRIHNTFGKAFATMDKADYYEEKAAKVGSGGISSDDPDALDKLNDKLENLRRNHEFMKAANAAIRKGKTPEGHPFFPNIPTEEVFTSPEAKLANLMALGVSEGMAQDILKPDFLGRAGFAPYSLQNSNANIRRVEQRIRELERAAAAESREEEGQGYTYREDTEENRIMFLFPGKPDDDTRQMLKGYGFRWSPTRKAWVRMLNNPGRYAA
GUT_GENOME001449_0124687-257IYLGDDDCVERLQVKLDELVKLQENMKAANKIVKSKKLSGEQIRQQLSELGFSDEKVNEILTPSFTGRIGFASFTLTNNNSRINNTKKRLDQAKEMKATENKEYYIGKVRFVENSRENRLQLFFPEIPDKALRKQMSNNGFRWCSSNGCWQSYLKRYNIRFAKELLAPKES
GUT_GENOME124593_0091829-289AEYRQMVDDAAALAERCKQGRGEAAAAKIDALLDRYARRLADNINARNRNTASCPSVMIAGPANFPTRKKSRQNAREDSLMRDYQEIQHILHQIRTTGTGGIQSGDPEAIRQLEEKLARLEKDHSAMKAADAYYRKHKTLDGCPGLTPELARQVNSFRADGAAPFSGYPMQLSLSNIKRTRQRLEELKAAKSAAPVEQETPTGVVYREDPDAMRVQLVFSGKPDADTRALLKSNGFRWAPSVGAWQRQLTESGKTAARRVL
GUT_GENOME112835_0098038-265TEQNEIKNLCDKYGVDSERILKKHYDLTMKYLCAESRCASAAITGPAKFPVAKMEKRNNIAHNHLQKLCDYTKNVEKMLIRITRAKKTEAEKIQEWEEQVKILKNRHEIMKKYNKGALAYDELPPDMKKHIDFVKRNYPRLKANFTGYKLTNNLANIKRLENQILLAQRTKEIKKDTDFIFDGGRVSFDDVGIRYNIYFDNIPDVEIRTRLKQNGFKWSPKRKAWTRG
GUT_GENOME058625_00763461-682EMVDRYASIAESMIDKGDKALFAGGCDKSASLYTAKNAFDLNDVTVIGSGFNEEGAAYSVVKEYGTAAAACLKDFKGASIYNAGCANKAITCNLPDSEKKEELQKKLDGLEKAQETMKAVNAYYRKHGTLDGCPHLSPETLENLKADMASGWHYEKKPFQSWELSNNNAEIRRVRQRIESLTRANEVAYVGWEFDGGHVEANRDQGRLQVFFDGKPEADARS
GUT_GENOME018722_0048349-241ERDRRFRNRAGAKMDKAMEASKKAEYYEQKAESVGKGGISGLDPEAITKLEKQLEQLTAHQERMKAANKAVRMKNQEKGNLKLKELGYSDKEIEVLRTPNFLGRLGYPNYELTSNNANIRRIKRRIEELKELSEMKVETETNSLYEFFMDEGRLQFSFDGKPSGEIRDILKSYGFKWSPSRSTWVRQANRNGF
GUT_GENOME234141_0129556-308TEASEEELCAEAERYRKNMKDAFMRYFSTHSHCISTFITGASNFPVSRAQRANNAADNALKAIIELDKNARASVKRRFYKDPNAPIRSDDPDALEKLQKGLAARMRYQERMKQANKAIRSAKGDNAVAREKLRELGYSEATAEELLTPDFAKRTGFPDFRLTNNNKEIHRLKARILAIQKLQTRPEESGENADGVRYEVDTAANRVKLFFPDKPSEEVRAKLKRNGFRWAPSNGAWQAFIHNYTISIAKELVG
GUT_GENOME265631_0098237-293VDEAYRMAEEQAERFPELAEKAYALADRFARKYAEWLNEGYRIDAMCPSILVSGGGNFPVRKKERQNARRSSHMERYEKIMGIKRRIASVGTGGIQAGDPNALERLEAKAKRLEDRQDMMKRANAHYRKHGTLEGFDGVDHDEAERVRHDMERFGMNQPFPSWQLSNNLATIKRTRARIAELQREKESDVEDRETEINGEPCTVVENADIMRLQLVFGGKPDEETRSKLKANGFRWSPKNGAWQRQLTENARRALRA
GUT_GENOME030570_00029477-753SFSKYETGSATRIYHEKCDFAYSILDKILDEHPEQAESTAQKIDYYCKKLAEYYNDYYRNEASCPSVLICGPANFPSKKKERQNERRTLLIERWNYLENYLSKIQNFFNVSRPIKSGDPDAIEQLEKKITELESEHKLHLSANKYYKKHKTLQGFEGLTSAEISQIESLIRDPSRFVPFYVSNETTNLRRYKARLERLKKEKNLVGTCESVTIPYQGVVFKMVENTENMRLQLFFEGKPSEEIRALLKSHSFRWFGKNKCWQRQLTNNARFSYKRLK
GUT_GENOME214339_0092522-302YIPNSATNEYRAAVDRAAAVLEEVKAKCKTQAQRERAEYYFDRYAKKLAQAINQENAIGTRCPSVLISGASNFPVRKKEKQVAAWEANRANFEKADHYLHLLKTAHIQGVKSSDPEALEYLQEKLARLEASHADMKQANAYYRKHKTLDGCPGISQATREWLTRPGVFAKGDGTPLALYGCPYPAYALQNSNANIKRIRERLEALKAVKASPTQEEQHNGCTYLENSEIMRVQLVFDGKPDEATRALLKSNGFRWSPSQGAWQRQLTENGKAAARRVLSAL
GUT_GENOME143499_03870200-456YEQRLEEKRDRLESRALKSRRLSNEAYEQSHRLTEMIPFGQPILVGHHSERRHRAALDKSWNKMGQSVALSEKAEYYDRRAAAVGSAGISSDDPSAIEKLEEKLAKQKKAHSQMIAANKVIRSKSDDESKRLSLFEIGFSEASINASLKPDCMGVIGFASYQLTNSNQRIKATEKRLNELKKVRTIDVPAMQEFNGFAVTYDMDENRILIKFDAKPSAEICKLMRRYSFNFSPTRNNSWVRKITGNALASTKYLIAE
GUT_GENOME110648_0009721-299SDYQKGEKTLEYRGLCDFAAEMAEREKKRKPEYAEAIDALLDRYARKLAEWMNTESRIGTLCPSVFIAGGDGVSAKRKAKQNARMDAHMKKRAEIDKLLDKICKIGTGGIKAGDAGVLVKLESKLADLREAQETMKAVNAFYRKHKTLEGCNLLTTEQIEKLKVAMTSPWRTNSAPFAPYQLQNNSAEIRRIEKRIESIEAIKEEGDKESEVDGVDGLRVVENTDIMRVQLFFPDKPDADVRDILKAEGFCWAPSKGAWQRQLTENGRFAAKRAIEQIK
GUT_GENOME079230_00681569-853YEAGSATSEYQKLVNRAAEIARNQKEQVDPIYHPKIDALVDRYARKLDENMNRQFEIDARMPSVLISGGGNLSVRRKEKQNAARYKNAHEREKIDHILQKIQSTGKGGISADRPDAVLKLQQKLEKLEKEQAMMKAVNAYYRKYKTLDGCPELTKDQIKQLQQEMEHPWHREEKPYASFQLSNHNANIRRLKQRLETLTVHQSMNYPEWEFSGGKMVANKEVNRLQIVFDEKPEETTRSELKKNGFRWSPKMKVWQRQLNQNAYRAANHISAIQPVTGEKPSQLY
GUT_GENOME283956_0008827-295ATAEYQRQVDEARRIAEEVKAQCKTTAQRNRVDGMLDKYELTLAFAINRDNEVGTWCPSILIVGGANFPVEKKKRQGEAWSANLMNYNKAAEILDSIRDYGHNAPINSRDPEALTALRKKLERVKGQHEHMKAVNLFYRENDTLDGCPDIGPLEKAAIEGRMRQWNDRKPYLTWQIGNVRKQIQQIEKRIAEMEAAVQENAQPVAMEDLPGITYHENSSTMRVQLIFEGKPEPDIRAILKSHAFKWCPSQGAWQRQLTENGKRAAREVL
GUT_GENOME007920_0068448-280QQLEIVKKRTEQFKELITNIYNEYLSISANFVPVNVAGPAKYNSNKFKKIADRMDKKIDEINDKINKFYDNTESMLKNAYSKDEIILKYKNGYNEPISSDDPLAREKLEAKLEYLQTKHQSYLDFNKKARLNKTEQLPPYVLANSNQNMKAVKDRLQTLDKIDNIETGRYYFDNGEVSFDKEDMRVKVFFDENPDEKTRDELKSRGFKWSPKNSAWQRKLTPNAIYITKRMFK
GUT_GENOME095032_0134731-293TAHYRDMCDDVYTAAEDIAQRLPYLAEKAEAKAQNYAKKLAEYYNDYYRNEASCPSVMICGPANFPTRKKEKQNSRRDTLQQEFQNLQDYAASIKAMGTGGISADDPNALERLRQKLDILEAARERMKKINAHFRKYQTMEGYPEPLTEQEKQHISFMVNSGFSKYGIYDTSNISATIRSTKERLERLEAQKAQETKETETEFCTVMENTEIMRLQLIFPDKPDEETRNILKRNGFRWSPKNGAWQRQLTNNAKYAAQNVLNA
GUT_GENOME112053_0057611-166KNRADHLANLYAKKYANWINEKNKIEASVPSILITGAGNFPVKKKEQQNARREKNWQEFNKIKKIKEDIENLKYYKPKVEKQGKAVYGHYNFENKYCEVIQNEEANRLQLKFNDKPDEKMREILKHNAFKWSPTNGVWQRQLTPNARYTTERMLKQ
GUT_GENOME231248_0040390-359EGSATAGYRSQVDDAAYTAFRQKQRVDPMYHEKIDRLLDAYARKLAENTNAGNSIAASCPSILIAGGSNFPVRKKEKQDAGADRNMAEYNEIQGLVDKIRSTGTGGISSDDPQALEKLRSKLDGLVKLQERMKAANAAIRMKDTAKGDAKLAELGYTPEEIKQLREPDFCGRIGSPAFELSNNNANIHRIQGRIAELEKRASTPAPDGWEFDGGKVVMNTGENRLQVVFDGKPDADIRDELKANGFRWAPSQNAWQRQLTNNAIYAAKRI
GUT_GENOME255682_0032273-318ELNKDLIHRANSNSFGGKRGDISEHEYQVYVERVLSWPISEEKKQSILDKLHEKWSEMLKYEAQHVSVMVAGPAKYNSRKLDKSDKILELSAAFSEWFKDLEDQIKIGQQKNDKAEKLLKLIEFCRKPDNPCDPTNHLAELAMYDNASFVQIYEELYPEYKWRKNSTIAKLYQKSLAGEIKEIRKEVFFEDANMTAYKEADRVYIKFLMRPKRQLIVALKSRGWWWNSYKNAWSTYPDKLDPEWVA
GUT_GENOME032409_0016481-359SYRGISQSPEERGQSTIADYEKELHEDLSKMPEEQKEKYIQNYKKYFSAWLSAQGRCLSAFITGPSGFNTRRSEKANTSSDNRYKEFREWRERAIKAIKELIERNKPEAEKINEEWESVRRGIDRSATIIHDINTGKDFGNKALFVSNLVNRIGTHANNGNVEIVDKAIAYIREWNAKVKKPIVTERNKLFKFPEVARNVRGREEAKANQENKEVRVGNVTVVWNYEIDRLQILFDEIPDKDTRTRLHREFSFNWSPTNTAWQRKLTDNAVRAAKRFLN
GUT_GENOME119507_0123748-255PNHYSYKSDKRYRDRIDSKMRQSISADEKADYYENKAHAAANNRAISSDDPDAAQKLQEKLAALEKMQTKMKTINAYYRKHQTCFGCEGVTEEEAARLDKRVEEGYSWETAPYPAYALSNNNQEINRIKKRLETLTKNREVGFRGWEFDSGKVVANDDNNRLQVFFDEIPAPEVREALKGHGFRWARSEGAWQRQLSSNAIYAASRIA
GUT_GENOME022022_0024928-299EAFNEDFEKTLSDFAERVNQQNITKEQKKECFDYWSSRLHREAEKYLAANSSCISAMIVGPAKFPVARAEKRQRSAEKALNSYCYTQNKLKDDNIFNRYLTAEQKQNRIDEAKWREFEHTIGQFLRFHNKENLPAGEWAFNAFPHLKGMFETQAKNENFAICEKVLAELSARAITFPGIDRNIKILSALLLKYREKSENRKTPETSEQEINGVRVVKNTDENRLQLFFDGKPDAEMITKLKGAAFRWSPKNKAWQRVLTDNAIAAANRILAQ
GUT_GENOME000729_0118525-303YVAGSATAEYKKYVDRIYAVVDQIKERKPHLTEKAVKMADRYSRKLAEYYNNYYRNEASCPSVMISGPANFPVRKKEKQNRRREKLMKDWEYLQSYAGKIEYLLTMEQPILSGDENAVMLLEEKLKKLESDQAMMKAVNSYYRKNQTLKGCPELTPEQTERLKKEMGRDFHYENKPFMSYQLTNQNATIRNTRFRLEQLKKEKDAGTTERENRFFQMVENKELMRLQLIFDGKPEPEVREILKTNGFKWSPKNGCWQRQLTGNARYAVRRVIQALEKME
GUT_GENOME204020_011016-169VIRSDDPQAVEKLQAKLDKLTKQHARMKEINAYFRKHATALGCPGLSDEDAAKLDRRVQEGYSWEKQPYPSYVLSGNTAEMRQLRQRIEEVSRTQSTEYVGWDFPGGRAEADKEDNRLRLYFDEKPSEEQRSTLKCNGFKWAPSVGAWQRQLNDNAIYAAARMD
GUT_GENOME111744_0269890-372FYSTSHTPEVRAAQYIRDYERQLQEDLKGLEPDFHFDYISRYRDWVRELFDRHSRIMSAMITGPARFPTARNAKANSAYDTACVKFNEWRENFKKRALKRIEEQRSPEEKADREWTVIKRDIFSTASTLLGIDLKKPEYRGYSRTCFVTNLASRMETLAKNGKVEMLRRAAEYIKSLNAQFKENGAKEIFTSRHKFWKLVEEAEAQVNAQEERANRENVEIEFCGGTVVKNYAENRLQIVYDDKPDRATIDKLKKHGFRWSPSNGAWQRFLNVNSYYACADIV
GUT_GENOME112692_0277858-247YRNKIWNTIGKSVKAAEKAQYYEQKAKAAENNNAIYLEDEDCIEKLEEKLRNHIHLQEQMKAANKIVKSKKISDSDKIKKLKDIGLSEENAIKLINPDQIHGPGFAPYQLSNNNAIIRNTKHRLEQAVKLKNTESKEYMIANVRVVTNTEENRLQLFFLDKPDEETRTKLKKNGFRWSPLNGCWQSYLTR
GUT_GENOME161540_0032826-302VMGSKTEEYKKAVDEAYDLVDKIKDQRPKQAEKAENIARRYAKKLADNYNKGFRIELMCPSILISGAGNFPVKKKEKQNEARDRNLKEYNQILGYIDKLKDILHGNEVIKSKDADAVEMLQEKLDKLVENQNMMKAVNAYYRKHKTLKGCPEFSEETAIKLEESLDDRKKEQHRVFMRYELQNNNSNIRSIKKRLEGLKKEKEQGDSEISLDKFGLDIKENKEIMRIQFFFSGRPKPAVRDVMKSKSFRWSPKNGCWQRQLTSNGRYAAKKAVEEIE
GUT_GENOME157329_032973-165IMTDDPQALEKLNKKLNALTMYQETMKAVNTVIRSSKSEEAKIEFIMQKCSYSRDEAYQMLHPRESWQEVGFPSYRLTNNSAEIRRLKQRIAAVSKYQKEVETVEANGELPEFPFPGGKIVDNAPENRLQIFFDGKPEKEVRDKLKHRGFRWAPSCGAWQSFR
GUT_GENOME277800_0035417-299MNSMRDYCDGEKTARYKRRVDEATEIAEARKKRYPDEADRIDLLLDRYARKLAEWYNKESEVESMCPSVLISGAGNFPVRKKEKQNERRDALMREWEAIEVLLRRIASTGSDAIKSGDAQALEKLRQKLERLESAQETMRETNAYYRKEKTLDGCPCLSLDDIATIKAEMAKPWFLGDVPFAPYSLSNNSAKIRQVKERIALLEKAKATGTQEHAADDIGIDGLRIVENAEAMRIQLLFDEKPNEEVRDILKSNGFRWSPSAGAWQRLLNGNGKFAAKYVIQK
GUT_GENOME051534_00070187-368KIRSLNTINFKKSMHYNPFAYIRSETEKLHKCETEQEFMKKVNVYYKKNGTCVGCEGVSDELAAKLDENIKQAYSWDKQPFPSYRLTNNNAEIRRLKKCIENLTATQNTEFVGWKFNGGEAVINEDKNRLQLLFDEKPSKEQRETLKANGFRWVPSDTAWQRQLNPNAFYAANRIDFIKPEN
GUT_GENOME205974_0082842-309SKTAEYKAQVDKCYSLVDKLPDDLKEKGATMADRYAKRLADWYNKQFRIERMCPSVMISGGSNFPVRKKEKQNAARDKHYQLYDEIQKIPNKIKTLVNGTNIIKSGDADAIEQLRNKLAKAEALQTEMKATNAYYRKHKTMKGYKDYTDERATELDKAIKESFDGVPFASYTLTNNNAKIKNTRKRIAELERLKETATEQTNETYNTDLFEVIENADIMRLQLRFDGKPDADTRTVLKQNGFRWSPSNGVWQRQLTDNAKFALERVIE
GUT_GENOME214909_0116496-346RRGVRTVMNAETELNKDLKDMPEEEKERYIQNYKKHFSAWLTACSNCASSVITGGSNFNVRRAEAANNRESARLKEFAEWREKALKAIARRVQEAKPQEQKIEEAWKSIKNDIDRHFLPANLYNRLETIARKGEVELMQRAIDYVRELNQKRPRPIFTERHKFFKLAEVANSVRQQQADMSEKDNCDIPFAGGVVRLNYAEDRLQILFDEKPAADMISKLKSSGFRWSPRFGAWQRQLTGNAIYAAQRLLN
GUT_GENOME206730_0206050-245IVGRPALPRLREKSINLMNKSIEADKKADYYDSRVEAAENNTSISSDDPDSLKKLQNKIECLRKQQERMKSINKYYRKYKTCVGMDGLSDEEAARLDQAVKDGYSWETAPFPSYELTSINQRIKAAEARVRKLEAVDEMPAEIIQFDGGEIESDPVTNRVMIRFDERQGEEIVEKLKGRGFRWAPSVKAWQRLRNP
GUT_GENOME259246_0257269-269RDRNFRSKIHSIMGKSVQEMKKAEYYQNKADSVGKGGISSDDPNAIEKLKSKLEKLQQAQELMKKANKLIKKFPEHNARLEGLIELGFSEEKAIDVLNPKYGSIGFASYSLQNNNAEINRLKKRIAELQTLENRTSNEVENDLYKYTECKIENRCMFIFDGKPTEEIRQILKSNGFKWSPSRGAWVRQLNANGIYASKRVI
GUT_GENOME008297_01409106-299RYRERIGQTMDKAIGLDDKADYYAEKAKTVGRGGISSDAPDAIVLLEHKLAEREAKQARMKEINAAFRKGDAALLALGMTQAEIDKMRENMPSYFGQPFPSFSLSNNGAEIRRLKKRIEALKATALYETARTAFDGGVIVDNAEENRLQIIFDSKPDKAVREALKGRGFHWSPNNRAWQRKRSHDATYCARLIC
GUT_GENOME011738_00935165-461SFSDYKDGSESGSYRSRVTAFEANVEELRSRYKDKNYTQEELDEVEHLTEAYARNLANYTNENNRKEASYPSWAIAGPANYNTRKNDAKWAAIRSLYENNKDRIDPDSNIYLKKIDNILSQRVIKSGDSQAVSRLQTKYDNLKEELETGKAMNAYYRKHGTMAGFDGISEEQAKKYDTELTSPNRLSRQPYPAYALQNGNAELHRIQNRIDILKRAQERAAEAKANPEAERQRIESRYPKVDGVEVSENGEAMRLQLKFPGKPDDKTRTLLKSNGFRWSPSQGSWQRQLTDNARYSA
GUT_GENOME251539_01245118-330IDARVPSILITGGGNFPVAKKAKQNAARDRNYGEYAEIEKLLDKIRSTGRGGISADDDLAVEKLTKKLEGMESQQAMMKAVNAYYRKHKTLEGCPELTAEQVEKIKASMSQDWRKDPVPFPSYLLTNNNANIRRVRQRIEDLSHKAEFVGWTFPGGEAKVNEAENRLQLIFEDKPDADTRQALKSEGFKWAPSQGAWQRQLNQNAIRAAARLD
GUT_GENOME210473_007661249-1501ADTAPEQAAVRYYTINEGAARRANDANSYRDYAAGSATAEYRQMVDQAAAIAQRQKERVDPMYHEKIDALLDTYARKLAENLNDHYAIEARVPSILVAGGSNFPVRKKEKQNAARDRNMQEWQDVQGILDKIRSTGMGGISMDAPQAAAKLEAKLVKLESAQETMKAVNAYFRKNKTLEGCPSLTPEQITKLQQEMAQSWHLDKSRPYPAYMLSNNNAEIRRIRGRIEQVRQHEDTNFAGWEFDGGRVEANKA
GUT_GENOME096214_0012044-289AQVQKFEQLCTNNEQRTYLAEQLTRFKVRFLELRRRLTAARSNCISTMIAGPSNFPVRRAEKANRAELRCMNECEAWANKAIAAIQKGIFARKTAVQIEQESMDAVKDMIMRSYLDTPFGRQNCYGRLQTWAKHNTPEMVQNTLNFLKKWQSEHLDGKGFTQRHKVWSLTGMMPQEPQESTSEIHDGIEIMRNVELDRVQIFFPGKPDSDTITTLKQYGWKWSPKNGAWQRKNTANAYISAKEIIS
GUT_GENOME194746_013191-247MQYYEINEQTARRANDVNSMSDYRPGSATEEYRAAVDKAAALVQARKAKISPYYHDKLDALLDRYARRLADYYNAYYRNESACPSILVCGGSNFPVRKKQKQNARRESLWQEYKEIDAILDKIRSVGTGAVDLTDPHARELLQDRLQQEQNALDYCKAANAYYRKHKTLRGYASLTDQQADALTDPDAFSVKLYGKPYGDFELSSLRGKIKRIQARLADLDKLQAAAQQPDNTTKFDGGRTGRSKEE
GUT_GENOME092477_0094332-305AFNGTSFSPDRRGESLRREYANDLASFQSVLEKYMKGEDEDKIDEEFERFRSGLKQRYLAYCSSHSRCMSAFIVGPARFPSARMQKYSGWADNKMREISSFIERAEKSVKRRYFKDPNGPIKSSDPDAVERLEAKLSACRKTQETMKAANAVIRKSKGDKDKAMAGLIEIGLSEQTVAKILTPDYCGRIIGFQSFRLSNNNAEIKRLEGRLRKIKTAKETTPEKIETETGIIIEKCPEENRIRLYFPDKPDETVRASLKANGFRWSPRLKAWQS
GUT_GENOME014667_01116281-538AHESHSFSDYKTGSATAGYNAQVAEVAELAEKVKSSAPEEFHSEIDSLVSRYASRLAEWTNRRNRIENSCPSWFVSGPANYPMRKFEKQQAALKNCYDDLEKIEHIKARIKNFANRPIMSGDESAIDRLTEKVEKLRKKREEMKQENAEARKAGKTAPYGSYTLSNSLANLRTAEKRLEELKKAKETPTAAADDLKNEFCEVVRNTDIMRLQLIFSGKPDDETRNILKSNGFKWAPSQKAWQRQLNNNAVYAAKRVFS
GUT_GENOME285659_0036846-293YLAKLPAEMHEHFEQRYLELYAKWLSACSRCISSFITGPAKFPVKRAEKLNNWERGAREKLDNWAKRFVDRVCNPTQHRLTGWAEIERLQDKVEKLTALQEAMKSANSVIRKKKLTEEEKLDELVALGFSEANAKTLLEEPQYSFMRKGFQQYQLTNNLAKIKDAQQRIAHLQNMVDREDAEGMVNGVRVVYCYSEERLRLYFDGKPDGETISKLKKAAFKWSPRNGAWQRQLTSNAIYAAERVLGVE
GUT_GENOME070861_0031543-301NEYRTMCDRAQEIADEQKQKHPECAETIDMILNRYCRKLAEWINRENAIGTMCPSVMIAGPAGINRAKKEKQIAAYKRNAEKYEEISGLIRRIQSVGTGGIKAGDANALEKLKNKLENMEECYQTMKKANAYYKKNGTLDGFYLFDEITEEKMHEYKHSGQPFSAYTLQNCIAEIRRIKARIASITAVKEEGGGEKVIKGIRVVEDTDDMRIRLIFPDKPDEETRNALKANGFRWSPKNSAWQRMLNANGRWAAKNFLA
GUT_GENOME258715_0157660-263LDRSWNKLGESVRLEEKAEYYRRKAEAVASNDAIYTEDEDAEERLEQKIAMLTQLQEKMKAANKIAFSKKLTEEQKIAALVQQGYSEEKATTLATGRGAVYGHIGFPSFTLSNNNACIRNAKQRLKHVQTLKNTETKEYTIGEVRVVENTKENRLQLFFSGKPADEIRAQLKHMAFRWSPSNECWQSYLNRWQIDRAKALLNTL
GUT_GENOME143410_0364292-347SFHARLRAAYMGYLSSISGCASAMITGPANFPVERMRKRNGIANDKYRQIKEYSQKAPERFLRRIMPFGDGTIIQSNAPNATELLNSKIARMEKTHAEMIGANKIVRKTYKGGSAEGVSPEAQQKCALALAEFLSIPLDKALNILQPSPYYSHGKVVPFYSYQLANHNAEIARLKQRLAEVNKLQNDSLFIEQTLTNGIEIKLSEDGKIEIHFGYKPDEQTRRMLCDNSFKFSRYRNNAWVRKFTPNAQAVFSRIV
GUT_GENOME259941_0010031-282NPAESERIDYLLNKYSQKIAQNINAVSANESKYPAWFVSGPARYNVKKNDALMRRTMDLFNQRESYEKILDKIQRIGNGTEAIKSSDANAVEKLTEKLEKLKRQHSQAKLANAYYKKNGSLEGFHPVGLTRGQEKQMFEGARWMKEHYPTVTAPFNLTNSNAEIHRLEDRIKNIQSAKEAGNKEVVANSNYRVVKNAEEMRLQYFFDGKPDREVINIMKSNGFKWSPKNKAWQRQLTDNAEYAGRKVNEALN
GUT_GENOME009819_0001131-295YRAKVDEAAAVLERVKPLCATDAQRDRAEWLLDRYAATLAAAINKDNEIGTRCPSVLIAGPSNFPARKKAAQVAAWDANRDMYSRAEHYLNLLRRAHCQPIKSNDPEAIEALTYKLNRLQAERDRMKATNAYYRQHKTLEGCPDLDPEERRNIESHWAAGWYTGTPYPPYALQNSLANVKRLQDRLNSLQTAKSATHDDEEHDGYTYHEDTEQMRVQLIFDGKPDDDTRALLKSYGFRWSPRNSAWQRQLTDNGRRSARQLMREL
GUT_GENOME110373_012314-166NNLIRANDPMAINMLKENLAHFENNIAYMQAVNDYYKANGTMVNFEGIDYAEAVRLDERVNGYQSAPYPGRFFKENYEKIGRIKSNIDRLENRPKTMFNGWQFVGGEAIINLANNRLQLIFEEKPTSEQRAILKQNGFKFAPKATAWQRPLDYKTMAAANRID
GUT_GENOME130378_0095540-279SFLAQIPEELQNEYEKRYISKYSEWLQALSRTFSVMVTGAGNFNNRRHQKMNDYEQSARERFETWKEKVVKRVNRQQRLVGWEEVERLQSKLDTLTELQEKMKAVNKIVRNGKLSDEEQRKELEALGLSESSINGFMAEPPYSFMKKGFQTYQLSNNLAKIKDIEQAIKHHTVMATTEDKEYKFDGGKVVICNSDERIRIYFDEIPNSETRSMLKGNAFKWSPKNKAWQRQLTPNARFAL
GUT_GENOME025239_0146178-248KTEAVGLGGISSDDPDAVKKLTAKLEERTEKQEYMKAVNAALRLKDTAKGDARLAALGMGPDEVAARRKPNEFGRIGVFSRWMLSNNSAEIRRLKQRIKDLRKAQEREDAVEDVETDLYTYRAKENRVQFVFDGKPEPAVRGILKRYAFRWSPSRGAWVRQATANGIYAAK
GUT_GENOME157258_0007895-369SMDPERAAKVEMVSYEEGLNADLANIPTEKQEEYYNKYHSWVSTILAKESRIASSFVTGPAKFNNSKNEKANASYAKACEDFSAWREKTLKSIRRAMEEAKPLEQREDERWDKIRNDLDLTASSIRDIDENNAPYHRSLFVNSLYGKAETLAKNGEKTLLDRYVKRVVEWNDKLKKPLITQRHKFWKLQELCERCVEKAEARSGKESVTIEKDGCSIVKNYSIDRLQLVFDGKPKAEVISKLKSNGFRWSPSNTAWQRQLTSNALYAAARVVDVT
GUT_GENOME053596_0105123-302SDYQEGSKTASYRAQVNEAYDLAEKVIEKRGERFEKRARILADRYARNLGKYYNEDARIGCMCPSVLISGAANFPVRKKERQVAAWDKNQEYYKYCDKILSQLKRLMTCSEVIKCDDENAIEALKEKIEAKEESQERMKEVNKYWRKNGTMTGCGFLTDEQIEKINRSMAEDTWRLSGVPFGSYHLQNNLANIKRLKGRLEELQNTKEKGSSEADYGEFKVIENTDIMRIQIMFDGKPEETVRNVLKANGFRWAPSQEAWQRQLTTNGKYALKRVLEALQ