UHGP-MC 4433


Information


Number of sequences (UHGP-50): 70
Average sequence length: 461±28 aa
Average transmembrane regions: 0.02
Low complexity (%): 3.2
Coiled coils (%): 0
Disordered domains (%): 1.54

Pfam dominant architecture: PF01170
Pfam % dominant architecture: 71.43
Pfam overlap: 0.32
Pfam overlap type: extended

Downloads

Seeds: MC4433.fasta
Seeds (0.60 cdhit): MC4433_cdhit.fasta
MSA: MC4433_msa.fasta
HMM model: MC4433.hmm
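
The seed and MSA downloads are FASTA files, so a minimal reader is enough to load them for inspection. The sketch below is a generic FASTA parser, not a tool provided by this resource; the function name `read_fasta` is hypothetical, and it assumes the record identifier is the first whitespace-delimited token of each header line.

```python
from typing import Dict, Iterable


def read_fasta(lines: Iterable[str]) -> Dict[str, str]:
    """Parse FASTA text into {identifier: sequence}.

    Assumption: the identifier is the first token after '>' on a header line.
    Sequence lines belonging to one record are concatenated.
    """
    records: Dict[str, str] = {}
    header = None
    chunks = []
    for raw in lines:
        line = raw.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            # Store the previous record before starting a new one.
            if header is not None:
                records[header] = "".join(chunks)
            parts = line[1:].split()
            header = parts[0] if parts else ""
            chunks = []
        elif header is not None:
            chunks.append(line)
    # Store the final record.
    if header is not None:
        records[header] = "".join(chunks)
    return records
```

Assuming MC4433.fasta has been downloaded to the working directory, it could be loaded with `with open("MC4433.fasta") as fh: seeds = read_fasta(fh)`.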

Sequence list (filtered at 60% identity)

Protein Range AA
GUT_GENOME060816_010862-490IREYLEQIENGMDVRKNLIALKAACKDSAGRTAVLYALNNQLQVFYDLLKSEEPKIRKNVALILGQLGVQDSLVPLFDAYEKEETLFVKSSYLTAIRELDYRQYLEVFKKRLQELQSQEIEESNRKHIKEECKLLQDMILALEGYQKHTFTGWKVPSRIILMTNRNFREITAEQIKSHHPKVFNAGVQARTSDLKEIFEVRTFKELLFVLEDKSSLTKEGKTVDEMARMLAKQISDSSLLQFLQERHQGETPFYFRIECKNKMDLKQKSVFTKKLGSYLQEQSENQLINSTSHYEVELRLIENKSGNFNFLIKLFTLKDSRFSYRREALSSSIQPVNAALVMHLAKEYLSENAQVLDPFCGVGTMLIERNRFLPTGDMYGIDLFGKGIEAARRNTELAGVRINYINRDFFDFTHAYLFDEVISNMPRAIGQKTKADITLLYRKFWTKIREHIGKGSVLVLYCYDREILKETLPRQFFTIKEEFEISKKE
GUT_GENOME220532_0170264-498IIQHNIELHPEAFPRLLQSEDPKVRKTCAQLLGRVCPDAFAPHLLHALAAEKTDFVRPSILLALGNARHAEGVREALRAFVIPEGDEKQVKEQQLALQKALSSLSSLEKPALPLAGACKGKILYLSCPNARVTALEMQELGFSPSAPDHPRGYLSLQGVPSLERLFEARTFFDAGFFFGKFDSLSSAVEAVSSHKLVGEVESLYGRSDLTYRVETICDFEPMGQKERKATAEAIAVNLGTSSRLQNSPSAYSLEIRLIVSQKYVYVLLVPACGELDARFTYRAQAISASIHPAVAASCVYFCKSYLKENAHILDCFCGSGTMLFERARYPYRSLTGTDISHPALKAARANERLARTGAHFLIKNAVTPFEKRYDEVICNMPFGLRVSSHGENRRLYMDFLDNLARTLAPGGTAFLFTHEKKLIAGLLKNRYRLLE
GUT_GENOME245676_0036917-447QALSALCTLAKSKEGRREIEAALGGHTLLRECTCSREPKVRKNAYRLLGALGDITDGSCLAEALRQEETFFAIPSLLLALGSLGQEAALRAYVPPVSTGEAMDKHVAEIALARQKALQTFEKREREPIDRLDQPREVLCFAPQGFLFCLISELRALGFAPEERGEAALVTTDRMDILYQADCLTEALLPIAREVPLEVDRLATAAGPMPSTPYRIELRGFTRDRRKLIRLLSDALEGENNPSDYDWELRVDCHMDTADLYWKLCNGVDARYPWRVRSLPASMHPALAACLSRYAMGLGRVGKPRVLDPFCGSGSLLFEREKWGNCKALIGVDKSGSAVEAARENARAGGSKASFVTKDILRFECREGFDLILSNMPFGNRVGNHQSNKELYQRFVRRLPLLLTEKGTAVLYTMEYRLLEACLKNVRGLKLR
GUT_GENOME094643_0185314-462DVRKNLIELRKLLKTEPGSAAWQRDRQRCLSLMLKLLKHEDAKVRKNAALILGEMGCQDALDALFYAYECEEKLFVKSAYLTAMSQLDYRTYLNAFRERMEELMQMEMTPENQKHLNEELKLLRDMLLIVEKPMKHTFTGYSVPSEMILLTSPGMEQLTIDVMPRNVRDAAKAMRGGVRILAERPGELFGIRTVKGFMFRFCANPLKATDYQAVAAAIHDAGLTDYLKKRHEGDGPFYFRIDLRTKLVLNEKSQYVKRLGAELERLSGHHLQNSASNYECELRITENKQGQYSVYLILHTIADSRFSYRRNAIATSMHPVKAAEVVSIASEYLADDADVLDPFCGTATLLIERYRKRKAAHLYGVDIFGEAIDGARENASLANVPSYFINKDFAEFSHSYTFDEVLTELPAGSDKMTPDVLFTLYKNFVRKLNTWMTPGGYVIVVTTEK
GUT_GENOME226213_0013014-485EDVRSLLSNLRSQIKEDKKAVEVYADAAFVGRLGDLLSHADAKTRKNAALLLGDLSDRVQTLDLTDKVTQALWTAYKREDTKFVLASYIKGLSAYDCGERLESLQAERKRLGAEEIWEEDKKHMRLLLEQMDLLLEAYQEKENSRYQGIHKKHALILTTDPYMKEALLAQVQDLGYADARMVGRGVRLLTDSLDKLSQIRIYREMLFVIRFRGQTLAAEENLAQAIVLSELLPLLEEVYGKKKKYPFALRMQMDADSKTTKRLAYAIEEGSQGRLANRPKDAVIELLPRQKKDGTYVVYARVLGQADRRFTYREHALPTSMAPVVAAEMVELVRPYLKEEAHVIDPFCGLGTLLIERMQAGRTRDVYGIDSFGEAIFYGRQDAEKAKKNIYFINRDYFDFTSSYLMEEVITEFPRMEHKEREEVDQFYRRFFDKTGEITADQAMVFALSTEEGILKKQLRLHEEFQLVRQIP
GUT_GENOME103953_0185380-498TPLYAQLSASEPKLRKNAARLLGALGDGRDAAALAAALPAERTLFVVPSLLLALGRVGGSAAEAAIAAFPVPEPRDETEQKHVEEIRRALETAKNSFASEAFPPLRRLPAQRTILLLAPKGFLAELRQELESRGFSPDPVPCRTPDGAEGLTVRADDLRGLYRCRTAMEILLPVADGIPASPEAVARAFSPAPALPYRLELRGYAGDRRAFLLETVRLLGGKNNPSHYASELRVVVSGNSAALYEKPCNVADTRYLYRKQAIPASIHPATAACLVRYARTHAGTTNPHPVVLDPCCGSGTLLFERERLSPCKRLFGVDLTPLAVRAARENAEAGDSRARFIQKDCLSFTPREPVDELYANLPFGNRVGTHEDNTALYAGLVHRLPDWLSASGFALLYTMEYRLLETCLKREPRLRIIDR
GUT_GENOME088994_0143917-488TDTRACLIALKKEIREEGGKRALAYLLAGDYSEFALLLKDEDAKVRKNTAFILGEMECEDMLPRLWDAYQTEEQLFVRPAYLRAMSAYDCRVYVPELKKRRDYLLNAPVTEAAVKHVREEAAALRALLLKYEKEKKHVFTGLARETDIYLMTNREHREITAGQLKEEKPALFPGGVRIRTRDLQKLDRIRTWSEMLFPIRGAASIPAEPQEAAKVLVKSELMDFLKSCHEGSMPFYFRLEVRGRMEQARKNAFIKRFTAALELATGRELVNSASSYEIELRLLQKKSGDFVPLLKLYTRKDRRFSYRRNVLAASIQPVSAALIMELLRPYLKEHAQVLDPFCGVGTMLIERVYLMAADPVYGIDIYGEAIAKARENAETAKMQINFINRDFLDFHHEYRFDEIITNMPTVTGTKSMEEITQLYEKFFLKAREVLKEDGVLALYTTEESILLHCLKLYDFLKVEKKWVIREKE
GUT_GENOME033208_0077315-477IRQTLSKIRQAIKDEEAFSDLYNVMWDQTNLLIPLLQSEDAKTRKNTALLMGDLAMDDFLEPLFTAYQKEETLFVRGSYLTAMKNFDYEELLPKLHKQYDKLCTQKTAEENKKHIEEEIRLLRELIIEEEDIPMHTFTGMNRKHQCVLLTNRNHADFVATQLTELGLKPTVFSAGVHVVTDALEPLLSLRTYQEMLFEPDMLKPCSFDAKTIVSMLLQSDLLTFLQQDHKGDTPFYYRIELKSNKDLRFKSDLTKKIASTLELESNRMLLNSPSHYEIELRIIENEEGNCSIMVKYFTLPDHRFSYRTESVAASIKPTDAALLAALAQPYMAEDAQVLDPFCGVGTMLIERQKIQKANTSYGIDIYPTAIAKAQENTQNAGQIIHYINKDFFNFTHEYLFDEIFTNMPFAAGKTTQQEIVQLYEQFFKKIPSLLTKEGKVILYTHNATEAKKIAAKNSYKLLA
GUT_GENOME029131_0178546-477LCDCLSHEDPKVRKNAALLLGYTKNSMMASVLLDAYKNEEKDFVKDAYLKGMSHYDCKPYIKELQEIQNELMNSEVVDSKHIKAQLKVLNPMIRSYSYHKKKSIRLLHKEVDVILTTLPYYQYTLFEQVKHLRYKPVGQGVLVRTKAIYDLLGLRNYREMIFPLSGCSGLAQDGHLIGKRIAQSNLVDVLHRLYDAEGCFYYRLVDQMREKKSAVINDVVQELIEEMPSHLQNVTSDYDIEILIRELKPGTVNVYLKLSSLDNPRFNYRREIISNSMQPYVAATLMEVAKPYMGMHSRVLDPFCGSGVTLIERCMAKPVKFAMGLDIYGEGLEAAKKNAIAANQAIHFVNKDANRFVNNEMFDEIFTDMPTYAQMKDTRALKNLYDNFFKRIHRHVLPEGYVFIYTTEISLVEKNLRLNSDYLTLIEHYDVP
GUT_GENOME015529_0055840-520ENIDVRQNVSKIRALIKQEACFDEFVKCILGREDILIKLLISDDAKTRKNVALLIGDIAGAYLGDGMADTFMSELYKAYKDEMQLFVKASYLQALSNFDYRYYVDSLKSELERLTTFSCDVADKKHIDEQIRVISDMIVSIEGVRKHEFTGFDVLSNVVLTTNRRYVDTVKEQLLELDGVDEKCVKTFSAGVMMATDRIAEIMTIRTFDELLFRVAGITVLNSDVNKAAEQVANSALLKFLNKRHNNKDNRPYYFRVELKSKMDMKAKSNFVKKFSSQLEGFTNRQLINSKSDYEFELRLIENKEGSFNCMIKLYTIKDERFAYRTESTSTCLKPVNAALIVELAKAYMKSDAQVLDPFCGSATLLIERNKKILANTSYGVDISGDNIVKAKINTENAGQIIHFINKDFFEFTHTYLFDEIITDMPFETSHKSKGDIKQIYKQFFANARNYLKGDGRIIMYTHNREFVKEFANVSGFYVLE
GUT_GENOME222296_0003462-491AACRGHIAHQFALRPALAGQLLANPDAKVRKNTAELLGRMDADAHADALIAAYRAEEIDFIKPSLLLALGSAHKNEAARAFLASLPPLETGEAKHQRAQRDALSKAQDSLSPRAFVPVPDAPMQKPRDLLLTCPNARLAAEELSSLGWDARAFDAKSGLTLVKNARAFLPVYRARAFYEAGILLFESDSLFHAVHGLTQPLVCRNVKNLYGRLDLSYRLDVVGPTITRDERLRAYRAVQDALATSPLHNSPSSYDFTLSILCLGRRFAAVLFPGSQNDTRFAYRKQAVSASMHPAVAAACCRVLAPYAQENSRVLDCFCGAGTFLLERARLPHASLAGSDISPAAVRIAQANAEAAGVSADLFVKDASHPFRHTYTEVIGNLPFGLRVSNHAGNERLYAAFLQNLRHMLAPGGRAFLFTNEKKLLLSEID
GUT_GENOME108528_0105616-477RAALVDIKTGLKENGAVKAFKENPLYDIMVFRALLGNEDAKVRKNAALIMGMINEPSCADDLMKAYMNEDKLFVKSSYLTALKKYDCSKYKDELINRRDELENGCFDDADMKHISAELKELYSIFPHSGLIKHKFHNPAQPVEVIFTTGRDTVEALMGAVGEFKNGAAVKQIFCGVSFKTKEIRAVSSIRIYREMLFPVNGLAPSAKSEIASDIMSGNLIHLLDEMHDDADRAFRFRVTSKNDTADIASRIQAASGGRLINSPSDYEIEIKLIASKSGGYGIMLKLHTWSDRRFAYRRDYVAASMKPVNAAMMIYLVRDYLKEGAQILDPFCGVGTVLIERNKAVRASHMYGIDTFGEAIAKARVNTAAAGVNINYINRNFFDFRHEYKFDEIITEMPDDTGVYDAFLDKCAELLNEDGLIIMLSKEKNLIKKQLRLSDKFSLLREFSFNSKENLNIYIIKG
GUT_GENOME189606_0155221-459DVRKNLIELKKKLKENGGLSDRLSQENLTVLTDCLSHEDAKVRKNAAQIIGELGDDRAGVSLYTAYEKECTLFVRPAMVHALGMLDYRDFSRGMKQRLQVLLKAPVTDENRKHIDQERHELAELLRLIEKDSVHEFCGWGLKERILLLTGPGFEKTVAAGLRAEHAVPVAGGVLVTASPGRLMENRLWKNMLFLFARHNGFEPEPEKIAAGLMDHQLLSWLDLRHRGSGPYTFRLDLRTRKNDADKNRLLKGTALALEGASHGWLINSASDYEIELRLIEGKDGRFLACVRLCTFEDPRFKYRKNFLPVSENPVTSAQMLAFAAKWLRDGANVLDPFCGVGTLLAERRYFGSVRGLYGVDISAQAIDGGRKNLAAAKMAANFIRRDFFDFRHDYLFDEIISNLPDHGFDEADAKAFYGRFLGKANTLMASNAVLVLYVR
GUT_GENOME215504_0040854-491DLSSDKQVSAIVAHYIDINSNKLVLLMENAEDAKLKKSCAQLLGKLRPDEFSQTLMECLSREKTEFVKPSIILALGNAKNSEGVYDFLKDYRITAEEKKHVDEQRLALDKAMSMLKSDKLEYKMINPLEKGTKLLLNCPSVRVSQREITALGYDAAIISERDNYLLVKGVLKYRRIFAARSFYKAYIVFNEFTNFQAALTAAASNKFINFVRNLYGEGELEYRISITGEASREERKKFSESLASKIRQKGFMNSPSAYMFEIAIVEAGRGYFLTILPQNQLDDRFDYKKMSISASIHPAAAASCVSFISPYLKDEAKVLDCFCGSGTMLFERAKRPYGSLMGTDIAKEALKAARLNEKFAKTGAHFYLKNALSNFEDQFDEVICNLPFGLRVGSHMKNEQLYSRFLSNLKGIVKKGGYAFLFTHDKKLLLELIDRVGI
GUT_GENOME041759_0190366-492LGDRLSLHGMLESEDVKLRKNVARLMGRLGDARDAEPLMAALTREERRIARPSMLLSLGALKGDAVARFLADYRPQPAGSADSAKHYNEEQEALKMARRSCQDVRRHAFTGLKRAHELEIRTPDGMAHVTAAELKELGLEPYNVRGGFLHVKTKDWDRLFLARSFKEALFPIFRGAGPECADIALKAKGPLTALLEECAQGEPPYGYRVEVRGFSDRGEVSRKLASALDGQSLINSPSRYDAELRIEKRGRMVDVYIKLTCYEDTRFRYRVGSVPASIHPATAAAVLGYASKYREEGARVLDPCCGSGTLLIERELLSPCISLTGVDIDGEALEIARRNAQAAGSEARFIRCDCAKYKTKELYDEIVCNLPFGNRVGSHQDNERLYAAILDNIPAWLRPGGFALMYTMEFTLLKRLVRERPCLSMLA
GUT_GENOME139649_017727-483EAIKNGEDVRQNLSQFRQCMKEEAVPEQADTSVFIGLLSAEDPKERKNAALVLGDLGAQEALKPLYEAYKKEEKKFVKSSYLSAIGKLDAGGLIEELKHIYRELGECSPNEEDKKHVNAQLRELDRILAKYGERKKHTFTGYDRENVVILTTGKDFYDVTARQVVYGKTKPHPLGVLVATRDLKPLLSIRTYRELLFVVPHKREIGTDAKEAGEILADCGLLPLLLECHDSGAPFFFRIEVRSTMSPDEKTAFVKKVAAVIEEKTDRQLINSATDYEVELRLIQTKSGRFFPCVKLQTIPMKRFAYRKMSIAASIHPAQAALIMQLVKPYLKEDAQIMDPFCGVGTMLIERNKLVPAREMYGIDLFGEAIAKARENTQNAGDRIWYINRDFFDFTHDYKFDEIITDMPPRGKKTKEEQDAFYARFFAKAKEHLEREAVLILYSNEIGFIKKQLRLDGSYKLLQEYCMRKKDGYYVFV
GUT_GENOME230902_025041-495MLRATFQNIVENRDVRKNLIFLKEMLREDIKKGSHNKEALLYVIAGRYEVFRNLLQDEDAKVRKNAALIMGELGVSEFAEILFDAYDEETQRFVKSSYLSALKNMNYSELLPKLRTSYEIVKSTPITDENKKHLGEELRLLEELLILSGGEKKHTFVGYEMVNEMVLLCNRNHIHVTMEQLEKIPKKEFTAGVMVKTKELKEVLKIRTYRELLFAVDGVRTVPNDVTKAAELLTDGKLYQFLKERHKGDEPYYFRLEIKGKMDDEKRIAFVKKLSVEIEHLSERKLLNSVNHYEVEIRLIENKEGKFNVLLRLYTLPEERFLYRKEVIASSIHPANAALTAQLVKPFLIEDAQVLDPFCGVGTMLIERDLCVRAKTMYGLDIYGEAIEKARINTEAVNRCGLKKEDGGHLVINYINRDFFDFKHNYLFDEIITNMPTVSRSMDEGELFSLYQKFFKRAYELLKDQGIIVLYSHNREYVRKFANMKPYHIEKEFEI
GUT_GENOME159714_0037770-484QEIVPLLASEDAKTRKNAAIVLGQIGQRAVVPALCGALSKETQQFVRPSLILALGQLGGTEAQQAVKALPLPEGTDKNTLAERDAIQKALSRLAPQKSRTFTGFAAPRPVWLMPVNGLIGSLLAEARAKHIEVKQKGSVVEAVTRSLPSLYALRGFYEALLPFAVGVKAEPEALAEAVEAGGLYALLCQMHDGSGAFPTRLEVRGPGIDRAAFASKFFARVNGAHFVNSPSSYDIELRCWIKNGQATLSVRLHTFQDPRFAYRLESVPASMHPAVAAALLYDHRRYMKQTHTVLDPFCGAGTLLIERKKLMGAKAFTGLDISPKAFSIARANCHEAGLHAKVFNRDCRGFHSDFGYDEILCNLPFGHRVGSHEQNERLYDDILQQWPKILRPGGFVLAVTNDKQLFSALATRHGW
GUT_GENOME000604_0366812-490GRDTRANLIAIRQQLKDGADGGNLCSELLEHTDFLCGLMGAEDAKIRKNAALILGEIGAQKAMDALCQAYVSENTLFVRPAYLEAMGQLDYRSAVPVLTQRMEIIDRMVMAPEEKKHLTQERHLLAELVDLVSGNGGHTFRDKAAPSKVILTVRRGREDILYDEIVRKLPGITPVKMAGGVSLAVKRPSVLLRLRTFSGMLFCFCRENGFPKDGEVLAGAVVDAGILSYLNARHEGEGSWRFRIDVHTRDDAKQKAALAKRMAAGLEGLGHGRFVNSVSDYEIEFVLMEGRDGRVFVYLKLHTMDDTRFDYRKNFVAASMNPVTAAQVIACALPYFKENANVLDPFCGVGTLLIERKLQSEGLRALYGVDIYGKAIEGGRENARLAGIPVNFIQRDFFDFTHEYLFDEIITDMPDKFDTQEKKEQFYLEFFNKCREHLSDGGRLFIYCDDAALIRRVCRRYPEYSILFSVTLSEKTKTG
GUT_GENOME040780_0082955-495KKQYRDEINLRLGGRGMIRDALLSGDAKLRKNAARLAGALKDPLDAAPLAEALLRERRRLVRPSVILALGAIGTREAKKALAEYKIEPPADESERKHFNDEQEALRLALSRLQAPPSRSFTGFASPVDVELRSARNLGAQLAQELDELGFGVKKAWNDGALVTVTDPMPLFEARCFSEMLIPIKKNVPFNAKEIAAAAKGSMFGLLSSSCGGEAPYGYRVDVRGMDAGRSDFIKELANALDGGGLKNSPSAYDAELRVERHRETCRLYLKLYTIEDGRFDYRVGSLPASMHPATAAAILRLASSSLRTNARVLDPFCGSGTMLIEREKLSRCSALTGVDITPKALDIAVKNAEAAKAGIEFVCKDCLKFRASRPYDEVISNLPFGNRVGTHSSNERLYAGLLPKLGEWLRPGGVAVLYTMEFTLMKRLVAENPRLELIRQS
GUT_GENOME007506_0101656-486SLKENQKFKEFISSAINSQDEKIRKNAYVILGNVESDFCHDLLIKAIDKETVFYCYPCIILSLGNFKDLSNCERFFKKCQELFEKGQMPEKFFLLTKDAFEKAFPKKTYPVKNVKINAEDKILLTTQKSYFDLFIKNLGKKCEKTKFGITLRNINQDDFSEILNRKDYYDLFFLYSITDDFSQETISSAVTAFKEFVKRNASLPVGYRIECKSDIKEKEKITKLVKNLCSSSNLLVNNPSDYSFTFYVQTANNEEKGFIAFKCDFLFKNRFPYRKNCLPASINPVTANIIANVAKSYSAPNSVCDCFCGTATMLIERARTINANFYGSDINEKAVEMAKENCELANVQAKIIRKNVALLSGQFDEIISNLPYGLRVGNHENNFEIYSALAKKCKNVLNKNGFAFLYTADKACLRKLLKQNGLTLINEMPME
GUT_GENOME104227_0008814-472DVRTNLISLKDKLSKDLSLTELKQDINYNTEIFAKLLLDEDSKVRKNAALIIGMIDEPGLVKNLYESYEKENTLFVRSAYLKSLRKYDFSAYKDSLTERLNNLKKAEYEQSELKHIAEELQELKTMVGTKGEREAVSFRNPKEPVLIFLTCKKEMADILAEQVKEMTGLVTKKVFCGVALKTADIGKVSCIRTYKELLFPLNGLTAYDGADVIREIIKGDLFKLLDSMHDKKDAVYNFRVSGNIDAMKFGREIETASYGRLVNSVSDYDIELRFIQNKESKSACLLKLFTKKDNRFAYRKNHVATSLTPVNAAGIVKMSEKYLEKYAQVLDPFCGVGTLLIERNKLVRTRTMYGIDIFGEAIEGGRQNAVLAGVGINYICRNFFDFRHEYLFDEIITEMPRFDKGQADDFYRRFFDKAGEVLKEEGVIILVSEEMGIIKKYLRLNKKFRLINELPFNSK
GUT_GENOME069237_000208-476SICRGEDVRANLICLKQLIADKKEKSAFAYMLAGDFRKLAALLKDKDAKIRKNAALILGELESEDVLPWLYEAYEKEKQLFVRESYLKAMEKLNFSQMLPKLKKRQKTLQEELLYAADEGKKHLQKEAAVLSRMILKYEKPKHHTFCGGKTEDVILLCNRNNRQITARQIQTGEVSMLAGGVRIQGADLQEIAKIRTWQELLFPLHGKVEREYTAEAAAQTVWNTEIMELLQSLHEEKGAFLFRIEYKGSLDERKKSEFVKKTASLLEEKSERMLINSISDYEIEIRIVESKKGGYLPFLRLYTMEDERFVYRKESIAASIAPVNAALIMQIAGDHLKENAQVLDPFCGVGTMLIERKYFRSADPLYGVDIYAEGIQKARDNTAVTKMPIHYIARDFFTFEHDYLFDELITDMPASADEAFYRTFFKKAETLLKRGAVLVLYNRQGTLLEEMCRRQNNLTILKTAVLNE
GUT_GENOME014029_0200214-492IRENLSDLRKEIKEESACEEAKELLSDKKELIEGFLESGDAKTRKNAALLIGDLKWDDSVDKLFDAYKNETTLFVKSAYLTAIANLKADKLLPDLRDKLEELQGMEVAEENMKHHQEELRALNKIIIKYEGITRHTAITSGQDMELLLVTNRNHREVVRRELENDLGIKTAKVHPLGVMVQTDNIKKIFAVRSFRDMLFPIHTGGLLDSNPRKAAEQLWKSDLYKLLTDLHKEPAPFYYRIECRSGMDLDKRSDFSKKLASALDGLSGGMLINSASDYEVELRLIANREGKFFAAVKLYTIADHRFDYRKNAISASIHPSTAALLMEIARPYLKENAQIMDPFCGVGTMLIERDKKQAAREVYGTDIFGDAIEMARENTELAGRKYNYIHRDFFDFKHDYLFDEIVTNMPMRGRKTREEMESFYEEFFDKAEEILTDEGIIIMYTNEIGFVKKHLRLNQDYRLIQETCIQTKGDFYLLI
GUT_GENOME040200_0193075-544RQNLSSLRQEIKDENALAEALKLLAGEDELLVSFMGAEDAKTRKNAALLIGDLHMSQLSDEVFKAYEAEQMRFVKGSYLAALSQLDCKELLPQLMERAKELEHMTVTAENRKHIEEELNEINKILIKYNGIKHHTPVLEGVKAELLLMTNRLHREVVRRQIPVKDTKLHPLGVLVKTDNIPLIMQVRTFRKMYFTIHAASLLPKDAQKAAGLLAESDMYDILRRMHLEGGPFYYRIESTADAAYQSRLAKAIDMHFAGRMINSPNDYDVVIKLIPTKNDNYFVCMRLCSIQDNRFAYRKNVLPTSMHPSQAALIVSLAKPYLKETAQIMDPFCGVGTLLIERAHLVPAREIYATDTYGDAITMGRENAAFAKTRINFIHRDFFDFRHDYKFDELITDMPVRNRQTKAEMELFYERFFDKAAEHLVSGGIIVMYSNEIGFVKKQIRLRVEYRLLQETCILDKNDFYLFVIR
GUT_GENOME035511_0115614-493NLRENLTKLKQLLKTPEYRERFVKITGKNYDFLMKLLVEDDPKVRKNAAGVLGDLHCQDALDLLLDAYEEEETLFIRADYIKAMANLDCREYLDVLESHLEELTAREEIPENEKKHVQEEIRQLQLLILAQGERKPRIFNGYTQLNDVILVTRKAFYDVTQAQIRDAKVARTGLGLRVQTANLDQLLEIRTYKELLFVLHGADRIPSSPQEAAKALADTDLLELLEKNHAQGDFFCFRVTISGPMSLEQRSMFIRKLSMELEELTDRRLINTPSNYEIEIRLIQMKDGSFYPVLKFYTLKDNRFAYRRYTISAGMQPFLAAGMLKLVEEEMTEHAQVLDPFCGTGTMLIERNYLMAARDSYGLDIFGEGIDKARVNTKIARMNINYIHRDFGDFRHDYLFDEILTDMPEQGNMSREELDKIYQMLFDRGMELLKPGGKMFVYASEMGMLKKQIRLHGNAYILKREYCISEKQKKYLYVIQ
GUT_GENOME228107_0011173-518KQEKYARQLNDIMGSERKALRNALLSNQPKLRKNAAVLMGQLKVPSDVKYLKQALSVEDTRFVRPSMLLSLGSIGGDEAKAFLASYTIQPANSEDEKVHFEAETEAYKTAMRSFLTFEKHEFTALPRPCEIELRSPDKLSDPLARELAEHGFRVSAVHRSDVHVHTEDMMGLFACRCFTEALIEISANSNPDPKSMGIKAKSFLEKLLPACHTGKPPFGYRLEIRGEGNIDRLAIARKMIAVMDGETLLNCPNNYEVEIRVEIGGNGGAFMYAKLLTIKDERFSYRVSALPASMHPATAAAILKYAEFFLGGKDARVLDPCCGSGTFLIEREKLYPCAGLTGVDISNKAIDIARSNAEAAGSIARFVHNDCMRFTAERPYDELVANLPFGNRVGSHKSNEKLYAGILENLPKWLRRGGVAILYTMEYTLLKKLIREHPGLRLVTEV
GUT_GENOME082534_0134955-529ALRETLSKIRSEVKEPEKREEARKEVDVAELLTGLLSEEDAKTRKNAALLIGDLQIEEAKEALLAAYREEKTLFVKSAYLTALCELHVEEYADFFRERLEELKALPAVEEEMKHRNEEIRELTKILQKTEGVKKHPFRGFTAPHEMLLLTNREQREVTLSEVKEIGASVQRRVELHPLGVKIFSKEVLPFAKLRTYRELLFQIHVESRLNDRADAAAKLLWESDLYKFLTECHKGDAPFYFRLEVKGRENRTEFVKKLGIALEKVSGWKLMNSTTDYEVEIRLIETKDGGFVPFLKLYTIPVKRFSYRKHAVAASIHPANAAMLVYLAKPYLKEDAQILDPCCGVGTMLIERDICVPAREKYGIDIFGDAIEMARENAAAAGERINFIHKDYMDFKHAYKFDEIITNLPDRGKKSREEMDAFYAALFEKSKEILADEAVLVLYSNEIGFVKKQLRLHREYRLIQEFAIRKKTGDT
GUT_GENOME170651_0101762-494LNARLYDRRLLYQGIKSDDPKMRKNAARLMGALQQERDASVLIEALKAETQRYARASMILSIGAVGGDEAISFLEGYEVAPAADETENKHVSEEREALRVALRHVRPVEQHKFAGFKRDIDVELRSPKMLGAQLAAELEELGITVKRQWGDGALVSTCDPTQLFDARCFHEMLIPLVRGIRADANVIAAHAKHTFEPLICETCEGEKPYRFRVELRGSFTDREKLVRDIVDKLESETLENSPSFYDAELRIEQQKDDRCDLYAKLYMIRDSRFDYRVRSLPASIHPATAAAVLRLAYDELKVGARVLDPFCGSGTMLIERSKLSPCSKLTGVDITPKALEAAKANIIAADADIELVCKDCIKFRAEAPYDEVITNMPFGNRVGSHINNTRLYSDFVDMLPKWLKDGGIAILYTMEYTLLKRTIAMHEELELVA
GUT_GENOME017737_010958-510IAQGNEVRANLIEVRQQLKQSGEKEVLNRILTEKPELLVGLLGNEDPKVRKNAALLIGDLRMEQMLEELYQAYLKEETLFIRRDYLKAMGQMDVRAYQERLEQRLAQMLGEDVSEENQKHYREELLTLQKLVLRNQKRKSHVFVGGDERFDVILTTNRNQREVTAAQIGGKKVSLVPLGVKVQNADIRELLEIRTFRELLFVLDIPSVGPGPVDAAKALAASDLPKVLRRAFGCQRVIQSEEQAGGVAAGKGPEERQPFYFRISIIGSMPLDKRSAFIKKCSFELEKCTGRRLVNTASDYELELRLLERRDGTFLPLMKLCNLPDPRFSYRKESIAASIHPSEAALIAALAKPYLKEKAQVLDPFCGVGTMLVERDRVLPAGTMYGIDIFGEAIQKARTNAVCADREIYYINRDFFDFTHEYLFDEIITNLPVRGKKTKEEQDWLYQNFFDKAEELLRKGGIMVLYSNEQGYVKKQLRLHEKFRLRKECCMNEKEGFYVWIIE
GUT_GENOME138783_0112310-485LNGIEVRKNLIALKQELGEEKQKREFAYMLEGDYSRLLPFLKHEDPKVRKNTALILGMLEDEDLKEVLLSAYEEETQEFVKSSYLKAMEHYDYQEFLPALQNRLKYLEGEVPATESQKHWKEELTILRRMVFRYERPKTHPFCGMKEESEVILLTNRNHREATKEQLPETEDLKMLAGGLKFKTSQLDRILPIRTYSEILFPVQRTALLSPDPREAAAQLAKSDLLPFLERMHDGRPPFYYRIEVKSRMSLEERAQFLKKLTTQLDFFSDGKLRNLPSGYEVEIRMVENKAGRFIPLLKLYTLKDRRFAYRKQTIAASISPVNAALFMQLAKDYLKEGAQVLDPFCGVGTMLLERSFLNKTGDMYGIDLYEEAILKGRENARIAERTIHFINRNFFDFHHEYLFDEIISNLPGITPSKGRDEIHNLYECFFGTAAEVLVDGGVLLLYSTEPDLLLHCLRRTGQFSLLKQWKIQDKT
GUT_GENOME117389_015986-499YEALQKRENLRENLVELRAQIKDEAARAQFVSLIGDGRILLQLLEEEDPKVRKNVAMILGEIEWTGAVDALVAAYEREQTLFVKSAYLKALAYLDITAYQEKFKSRMEELLSYTPAAEEKKHIDEEVRALGRLLEKTDESTGHTFTGFKDPHEMLLLTGYARADVTLKEIGELPADIRRKTAKHPLGVAVYTKDVRAMANLRTYRELLFPIRLKQDTMNTVLNNTMQSEKWAEVLADAIWQSGIGDFLKECHKQSTPFRFRVEIRADMENAKRASFAKKFAIQLERVSARWLINSTGDYEIEIRLIKKKDGGFAPFLKLYTISMRRFSYRKNAVATSIHPSLAAMLVALAKPYLKENAQILDPFCGVGTMLIERDIAEPAREKYGIDIFGPAIDGARENAALAGEKINFIHRDYFDFKHEYLFDEIITNMPVRGKQSRSEMDALYASFFEKSKEILAPGGVMIFYSNEEGMVKKQLRLHTEYRLIQEFIVSEKR
GUT_GENOME018982_0131514-495IRANLIELKMHCKEDPKAAWALRAEDGVPQIFEKLLSHEDAKVRKNAALLLGELGIDESAPALFKAYGAEQTRFVKSAYLQALKELDVTPYLDALRERYQQLLAYEPSEDEKKHVAQERSLLQQLLSENDVLKKHTFSGWRRTNDVIFTTEYALREPLQKQIEMRSREAVRTALHPFGVRVRTTELSQLSRMHTYRELLFTLRIRGTVRGNAAELSRALLDGGLITLLEELHAETAPFYFRIEVRGMEDTVQKTALAKELALELEQGSGGRLFNSTTDYEVELRLMRTKDGSFYPCLKLYTIADDRFAWRKQTISASMHPSLAAALIELARPYLSEQAVVLDALCGVGTFVIERNLRMKAYDNYAVDLYGQAVAGARQNAAAAGVTCNFITKDFTQFTSKHRMTEIFADMPKRGKKTKEELDALYSGCFRQFSELLEPHGHLFLYAGEEGFVKKNLRLNKNLALIHEIRIRPKEDGVFYIIE
GUT_GENOME261521_0125814-489NIRENLIELKKSIKNQDALKEWKEYHATHPVLYAFLSSEDAKIRKNAALILGETNESGAAKALFEAYQRENTRFVKSSYLTAMNGLDIEIYQDAFGKRYKELLAEVPAESEKKHRTEELHALDKLLGGLNQNKNKKHRFTGYEEEVEVLLTTNPAYREITAEQIKKDRPVLVPAGVKVKTTHLRDVIKIRTFREMLLLLSGGHRIAAEPEAVAEAYAKSNLMELLNRLHEGNPPFRFRMEVRGIAPEEKGSFIRKAAATLEDLTGHQLLNTVDGYEIELRLTKNTDGTLYPSCKLFTIPMRRFSYRKEAVAASIHPANAALFMKLAEPYLKKGAQVLDPCCGVGTMLIERDLLVPAGDMYGLDIFGEAVIKARENAKAAGRQINYINRDFFDFTHKYLFDEIISNMPLRGKKTREEQDAFYSQFFDCAGKFLKNGGHMILYSNEGGFVKKQLRRHMEYRLLDEFCIREKEGFYLFI
GUT_GENOME094891_0222411-487GEDVRQSLSIMRGLLRDKKEGPAAGLWLDEKFEEEPEFWLSLLKSEDPKIRKNAALVLGLLGRDECRDALLAAYEKEEQRFVKSAYLLALQGCDCEEQLPFLKERRRILLGEKRTAENDKHLREELEALNGLISEYETAKKRRFTGKNELSDVILTTWKDYRDLTARQITCGEVTMMGSGMRVKDGDLRELLSIRTWRELLFCLDIDEVSRENAAKELAESNLREVLKRYYEGGAPWQFRIECRSKMNLEERSRFTKKLADGLMLETGGTMLNDTSDYEVELRLNEKKSGGFLPLLKFSGLPDRRFAYRKHSVAASMQPALAAVLAELAKPYMKENARVLDPFCGVGTLLFERNYAVHADTLYGTDIFGPAIKAARENAATAGIPAHFINKDMRDFSHEYKFDEIITDMPGEGKNRTAHDVDCLYRALFERAEALLRPGSMIFVYAHDRGYAKRWIRENKGLKLEKEWRISEKEDTW
GUT_GENOME237530_0041014-481EVRANLIALRRELKEAENQRRFAYLLGGDFRVLTRLLQSEDPKVRKNAALILGDMETEDVLPFLFAAYQKEDTLFVRPDYLKAMEKLDYSPYLEKLKHRLEELKKTALTEENRKHVREELVQLQNMILKLEKPKKHVFIGYEPAPEVILVTNRNQKEVTKAQIREGQVTELGQGLRIKNGNLKELMKIRTYTELLFPLPGARALSGSPEQIGAGLAGLGIADFLEDMHKSGGCFYYRLEIKGPMAAEKKGDFIRKVAECLDRRLEGRLWNTAADYEVELRLLEKKDGSFLPMLKLYTLKDSRFSYRKETVASSISPVNAALTAELARNYMKEGAQILDPFCGVGTMLLERNRAVHAEHMYGLDIFGEAIEKARKNTERDGSIVHYINRDFFDFTHDYLFDEIITDMPRSTDSMELGALYHRFFNRAYEFLKEDGVIILYTMNPELTERELKRHAGFRKKEEYLLNEKN
GUT_GENOME025619_0010920-483ALKESLKEASDKDAFLGIIHYDMDFFRTYLNSEDPKVRKNVIQIIGILELSQLTDDLIGAYFEEQTQFLKAEYVKVLGQFDVSPYKKLLRERYEQLMETPVAESSKKHVSAELRELRALISRLGEMKKHQFIGGDTRSVMIFTTLSGKERYLKRVIDEAIGEGKTTLVKGGVKVISSDYETLSKIRCYQEILFVIPKVGLLKGSAAQMAQQLKNAGLMDYIRERVSGSSAIPYRMELRGFLEEKEAQHFRKQMSQELEHMTEYALINDPAQYELEFRFHKSRKEDGCHVYLKFSLLKDERFAYRTHALSSSMQPYVAAMMAEITKDYVEDWGQMLDPFCGVGTLLTECNYNRHAHSIYGVDLFKDAVTIAIQSAKQTHMDIHYVHKDMADFSHDYLFDLVITDLPCVSPKMDKSRIHAIYTTFLDKCDEWLKESGIISVYTKSPAILEEVIAGQHKFRLITHTQ
GUT_GENOME076279_0177116-502IVQNQDVRQNLSKLRQEMKAGRGREAACSCISGEEEKLILLLESEDAKIRKNAALLMGDLGNQEYLEPVLKAYRSEEQLFVKSSYLNAVKNFDYGEHLAFFKKRLDELAQIKLTPETEKHITEEIRELSSMIISREGVLLHPFCGWEETFDIVLLTNRNFQEITREELRKLEPHAKTKVFGAGVMARVENLNWLEEIRTYQELLFAVKGLPSCPMDAEKAAEMIVKSDLFKFLARSHEGNPPYYFRVELKCKMDHSKKSAFTRRLSSKIEKLSGRKLINTTSNYEFEIRLIENKEGSCNVLIKLFTLKDERFSYRKEVIPTSIKAVNAALTVKLAQPYMKEGAQVLDPFCGVGTMLIERHKAVKAGTMYGLDILEEAIEKARENTAAAGHIIHYINRDFFDFKHEYLFDEIITNMPFKIGRKTEEEIENLYEAFFSSAKKVLKDDGIMILYSHNAGHVKKMAPANGFTVSETFEISLKEGTYVFVLN
GUT_GENOME098570_0357613-493QDIRKSLIELRQLIKDNNNKRAFLYELDGDFTTLEVLLKDEDPKVRKNVALIMGELEIEDFKSLLFEAYEKEEKLFVKSDYLKALSHFDCSELSEVLQSRLLTLSSEPVEENNKKHIDEELRLLTQLTLKLEKPQKHKFNGYKVLSDMILLTNREHQEVTLSQIIKGQAKAFGAGVMVRTDDLKEVLRIRTYSDLLFKLPEVSTVTKDPQEAAKTLFDGGLLDFLLTRHTGYPPFYFRIEVKSKMELDKRSTFTKKLASELERISNRQLVNATSSYEFEIRLIENREGTFNVLLKLFTIEDNRFSYRKKALAVSIAPSTAALVVELAKPYLKEEAQVLDPFCGVGTMLIERQKACKSKVLYGVDIYGEAIDCAIENANWASTDIYFITRDFFDFSHRYLFDEIITNMPTALGRKTQGEIIELYALFFQKAYTMMEDNATMILYSRDKDLVERGIRNNTAYKLEKEYKISYKEEAYVFIIKV
GUT_GENOME258960_0108612-489NIEVRQSLSSLRQEIKDSSKRELLLSWIHDGDLDLSVFLENEDAKTRKNAALLIGDLALSSESDAVFHAYQTEDTRFVKEAYLTALKSLNAAPYVDVFRKRYEELSLYEASDDEKKHVEHELHALSELINTIEPFKKHRFLGGRQTFHCIFRTNPLHPEITAALMEESSAASSKMGVRVKTNHLNRLLPIRTYNELLFQIPGMVSCKPDADVAASVIAGSNLMALLENTHEGDFPFYFRIGVKSRMALSERSKFAKKVASKLEELTAHKLRNSTSHYEFEIRMIEGKSGDYYLLVKLNTIVDRRFNYREEFIPTSIKPVNAALLVELAKDYMIPDAQILDPFCGVGTMLIERQKVVKGNTSYGIDHSPEAIEKAIYNTNLADQIVHYINKDCFTFTHDYPFDEIFTEMPYATGQKTEAEIREVYEKFFPFAKRVLGPEGTIIMYTRNREYVKQFAVKSNFRILKEIKITQRPESYLMI
GUT_GENOME000232_0237413-490NIRENLITLKQELKTENGKKQFQKLTDGCCDFIMKLLVEEDPKIRKNAASILGIVHCADAVDVLVDAYEAEETMFVRAEYLRALGELPCQEYLELFKKRLQELSQKSCEIEEKKHIQNEMRELQQLILSVEGVKKHTFQGWQRMNEVILTTIPAFRDLTAAQVTGSQTVKTGAGLRTKTCDLKQILKIRTFKELLFVIHGEKELPRDADEIAKWLYQSDLKQILKENHKEDGPYYFRLGVTGPMSLEERSRFSKKAAAQLEELFERQLINSASHYEIEIRLIQNKEGSFYPCVKLLTIPDTRFAYRRYHISAGMQPFAAAGILALAKNYLTEHGQVLDPFCGVGTMLIERNYLEPARDSYGLDIFGEAVEKARANTRIAGMNINYINRDFSDFRHEYLFDEIITDMPVKGSMSYERLEKLYEMFFTRASELVKKNGTIVMYSGEIGFVKKQIRLRNQFHLEREYCISEKNKTYVFIIT
GUT_GENOME048724_0010338-503LIRMKELVKEDSSAKKLLNAESEDSLWEGLLQHEDAKVRKNTALMIGELGRSELLEALFSAYKKEQQLFVKSSYLKAMQKFDCSSYLEQLKEIYRTLLEEEPPEENKKHIRQELRELEILLSKEEGHKRHSFCGYDIESEVVLTTDKGYSGITAEQVKNARVAVSSTGVKVRTKNLKPVTHIRTFREMLFGIHCKEKIEKEPNSAAGRLLESDLMELLSQHLKGQPPYYFRMNILSPMETEEKNAFLKQVAAGLEEKSGRRLINSANDYEIEIRFIQSKDGTFQPYLKFYTMPMERFRYRKNALPVSIHPAQAALLMRLALPYLSEDACVLDPFCGVGTMLIERKAAYPAKALYGVDTYGDAIQGARENTELAGENIHYVHRDFFDFTHRDRFDELIANMPAKGKKTKEEQDAFYRQFFEKAGTLLKKRGIMVLYTNETGFVKKQLRLHREFGLLKEYCIREKEGY
GUT_GENOME092545_00595287-769VSTGTELRAGLIEMKNLLKEEKNRRELAYQLGGDFKILTRCLSDEDPKVRKNAALVLGAMESDDLVPVLLNAYKKEDTLFVKSAYLKALFDLDYEEELPYLKERLQELDETPVTEENQKHLREEAGILQQLISQKEKHKKHTFDGFDRQVEVILLTNREQREATRNQLKEEKVTMLAGGMRFFTYDLESVLPIRTWRELLFPVKGLKSVSGTPEAVASQLAAPVLEQLKSLHSGGGAFYFRTEFKSPLAPEKKSAWVKMFSAALEKASGRELVNSTSDYEVELRLIEGKNGSFVPLMKLFTLKDTRFSYRKESYAAAMAPVQAALLMELSKPWLVNGAQVLDPFCGVGTLLVERVKAGNADPLYGLDISEEAVLKARVNAEAAGITIHYINRDVRDFRHEYLFDEIVSDLPLTGRSRNLLELTELYSAFFKRVPELLKKGGRLFLYTSEENAFRSALKENRELKLLKNFLIQEKTGSRLYILE
GUT_GENOME000614_0174211-472IKNHQDTRAMVTLLKEAVKENPVELETDFLFNLLTDEDNKIRKNAAVIVRCCGLMDHEQEFIELFLKEENWIVKNEMIEAIKNFSLKDHLSMLQEFYESLNASLDDPNWRHKEEMKGRLMEIFHYYGSIKHPQFIDLAKKTKVILTTLNTNRELVFDKLETKNKKMTSMGILLLADSIRDLEKIRDYNELLLVLKGFKEIDFDIDAIAKQIVASGLVEQLNQMHKGDGIYSFRIETSSLKKDKLKNDEIKRLGLKIQTLSEGKLINNASDYDIELRLRETPKGFCQVFLKLYTYHDPRFEYRKNALATSMQPYVAASIADLILPFVTPKAKVIDLCCGSGTLLIERDYIEPCKFLMGIDVFNDAIKMAKENSELAGVNIHFVQRDLNRFTHSHQFDEVMANLPIQSANKDRKSIETLYMQFMKKAIELTVPEGFIFAHTSDTEIFKKAMRFYKDQIELIDEK
GUT_GENOME131740_0030057-498TEKESRLLISHYAKIHAAKFCALLEDSDPKIRKNMALLLGRLDATQYAPLLLRALAREQTDFVKPSIILALGNAEDVPEVLAALQKLQIPLGEDKNLREQRFAREKALSKLAPRQEIARPVPMKKPVRVLLSCPNPRVSRSELSGLGYSCALFGELDGFLALSHITKFSRLYAARTFYEAGVYFASFPSLPAAIGAVKSGVFLALIKNIYGTLELSYRTGAPGASFSDMGRKGIAQKVADALSSTPLKNSPSAYDFEIRFLQGKRSIIVALYPGSRLGGRFYYRQNAISASIHPAVAASCIHFISSYTRPSADVLDCFCGSGTMLFERAMLPYSSLTGTDISHQALRAAQGNERLAKTGARFFIQDATKPFSRQYDEVISNLPFGLRVSSHRENFALYQAFLSQLPGMLKGDGHAFLFTHEKKLMRELLSESFHLIAHANFS
GUT_GENOME217881_0173118-539RENLSLLRQALRGKDRGEEKEEELRACLRPENLLLWERCLSVEDPKARKNAALLLGDLAECSEKWCRSFGEDAQSEATCSGTSSWRGEAVRALWEAYGREETLFVRPAYLKALKGFPAGELLECSGELRRRLAELDNNGTEAAETELDGAETGEAGEEAGKKTDTAKREPSGAGEHAKHLRQEKRALEELLEKVSPEDGRNIPSVSREALKGEYRLLLTADGAVAEKLAERVAHLAKRVKVTARGVMVITDRLSRVLELPLYQEAWFVARLRKGVRADREHLAEAVASSELSVILERFYPRERPFTFHLRLLGAEAERRRGDFLRKLADGIEAASGGRLRNCRGACHAQLILLEKQDQSFGLFVKITGMKDERFSYLRYRQPTSMAPVAAASMVALCEPYLKRDAQIIDPFCGVGTLLIERNRLVPAGHMYGTDIYGEAVLQGRENAALAGAQIYFVNRDFFDFTSDYLFDEIIGEFPRFSPGERAKAEDFYRKFFAAGKEILKLGGRMFLLSGEEGEIRKH
GUT_GENOME087143_0146114-497DVRANLIALRNALKDEKEIRAFAYLLGGDFSVLGSLLVHEDPKVRKNAAILLGKMESEDMLPLLFETYQKEETRFIRADYLKAMEHMDYASYLGKLQERLQELRSVQAAPEEQKHMSEEIRMLQTMVLKYQKPEVHRFTAMDQAEDVILVTNKEQSAVTAHKFPAGCCRRLKGGVRVNGVPMREILPIRTYTELLFPIDAETLPQSDPELTGRRLGGDSEQANSLTKRMQRMHSGTGPYRFRIELKSRMEASKKGVYIRKLSDALERASKGMWINSATDYEAEVRLLERKDGGFTPFLKLYTIKDKRFSYRKEFVSSSIAPVNAALTAELATKYLKENGQILDPFCGVGTMLIERNKVVKAKEMYGLDIFGEAIEKARINAQQAGCRIQFINRDFFTFEHAYLFDEIISDLPQVTQAKPKQEIHELYLEFFRKAPQHLKENAVLILYVTEPQFVAEAVRRHPEYRVEEMFLLNEKNRTTVYVLR
GUT_GENOME045442_0231914-483RESLIGLKLQVQKEASIQNDTELKDKLIHLLNDEDPKVRKNSAVLLGHYPGTVDILLEAYKHEKTDYVKDAYLKGMSMQDCRLYFRDLQMIQSHLMNLEDTTPKHVQAQLKILNPLILKNQIHKKKIIKLKHTPVDVILTSLPYYQFVLFADVLHLRYKPVSQGVLVRTDSIYDLLPIRTYKDMLIPLQGASGLDMSLETIMNGFERCNIEDIFDRLYDDHSVFYFRVVDAMREKKPQLIKKISEKLFELYPQKLLNATENYDIEIVIKEVQRGKVNAYLRLIHLNNPRFQYRREVSATSMQSYVAATMVQLAKPYMIEDAKVLDPFVGTGTLLIERCFAKSAHFVMGIDVYGQGIESARKNAKLAGQNIYFVHKDSLRFVNNEMFDEIITDMPTFAQMKDQEQLQNLYDRFFERIRRLVKPGGYVFLYTSEISLVRKNLRLQEGYLSLEEHYEVPRGKNMFYFFIIKVK
GUT_GENOME158645_0003016-496NLRENMIQLNQMVKSEAVLDDFLDEYYEHESCFTGLLDNDDAKVRKNAVKLLGRVGDPVLLDILFQHYEAEETQFLKSDYLAAMAGFEYMRYLPGLKERWEILEGQERTKHSAEEIKQLQKLIWKAEPPKRHSFCGDEMENRLLLIVPRGHEETVLKQAESIPETHGRVMTGGCLVTTKNLEKVRQIRTFQAVLFDFYPALIPSLDGETIGKKILKEGLLPYIRERHAEKRPFMFRVDVKGMKDIAKKNQLAKALSKYLEENSGGMLINEPSFYEVEIRVIAAGTKGSRVFLKLTCMPDKRFLYRKYITSTSMHPAKAALIMHYAEPYLKPEANILDPFCGTGTLLIERAAAVSFKSMYGLDISEQAVQAAWENSRRAGMTIHVVHRNFNDFKHEYKFDEILTDMPRTENLRGKDSAAYLYELLFTRGRELLAPQGIMAVYSEDGKLMERKLAENSWLQRVQRIPVAKDHTSWLYILQNKD
GUT_GENOME218105_0003859-491RENIALRLQPSRAPLWEALESPVSKMRKNAARLCGALANPLDAPVLLAVLQKEDTRFVLPSLLLALGAVGGPEAEAALLSYTVDEGDPKHVASEKEALRMARARCTTTQKHAFTGFSKPVLVELRTPKWLSESLVYELKGMGLSPVKVTTAAVTVRTADIPALYQARGFTELLLPLGKCESKPQAIADAAAGMGALLQASHAGEPPFAYRVEAKGEVPDRARFAKDMAMLMDGPLLRNAPSDYEAELRVEIGKSARVYVKLFTLADTRFPYRKETLPASIHPAVAAAVLRYAREYLKSGDRVLDPCCGSGTLLFEREKLNACASLTGVDIAQRAIEIARGNAEAGHSRAKFVHNDCRRFIAGRQYDEVIANLPFGNRVSSHNENETLYAQILDKLPDWLRADGTAVLYTMEFTLLKKLLRARPNLTLVTETKT
GUT_GENOME097725_0111649-470VIADADSRAADFVKANIDGILKILRTGDAKMRMHAAQIIGNTCASQYLDDLIDAIMHEQTMFTLPSYLLAIGRAKNDRAKRFLDSYQLRSDLEKHREEEKAALDKALAGFIIRKKARVRVLPNDIVVLASPNLNVTYSQCKDAGLSPKKFGKYIAVSGLTNFYDIYQLRAYTDAYIYFGCAPVAQLPEFLAQRERAIMQRTGVTGYRLEVRSVSHEVRLDLIKKCVAAMNELINTPSSYSIEILIEMDGDMAQVFLNPLTDDRFSYRKKAISASVNPGVAASVCAYASEFFDPDARVLDNFCGSGTMLYERGFYPHHSLTGADINMTAVEAAKENSRYAHVHPQFHYIDCLKFTAKRYDEIIVNMPFGLRVGNHSRNERLYRSYFTILPEILTDEGLAVLYTHEKNLTENLIKSNGRFNVLK
GUT_GENOME253845_0037372-496ISHYMDLQREKTAGMLKNSDPKVRKSAAQLLGNLKPDEFCSELIEALNTEGTEFVRPSIILAIGNAKRTPAAITALTDYVLPSDCSEKHIREQAAALSKAISSLRGGEDTDVKVKALPEKNEILLRCPSAKVTAEELKALGYSVSLHKALNGFVSVKNVPKLSSLYVARSFYDMNVLYGVFDTMNAAVRAAKSGGFEDLIRTMYGEGELRFRVDVQSLNSQTTQQLRSKVSSEIAAELLPRGLINSPSNYAFSVRLLLTREKAYLGAAPAPKLDGRFQYRTDAISASIHPAVAASCMRFIKPYLKEKADVLDPFCGAGTMLFERAKYSYASLTGTDINSDALRAARQNERNAKTGAHFLIKNAGSPFRDKFDEVICNMPFGLRVGSHDENRSLYAAFLRNLTKMLKKDGVAFLFTHEKKLLAELQ
GUT_GENOME284015_012425-488SLKQLYEHITRGDEVRQNLISVRQMIRDESLRRKFMVLLGGDFSILTGLLKDQDPKIRRNAALILGQAENEDVLAPIMEAWRTEKTLFVREDYLKAVEHLDYRPCLSQLRARLAEIEEGEGQEEEPNQTGQEHSLWDNNKHLAGEASQIRRMIERAGEKKRHTYAKTDPAPELLLVCNRCQVAATAEQIQKGSVRMLKGGVFVRDGSLEELMQIRTWSEMLFPIPGARPIPAQDREAAECLHEMKVYGYLKYLHGGEEGPYRYRIELKGKRSLMERKGSFIRGLASRLDMLEKGGIQNDDTDYEAELRLIERSDGTLAPMLKLFTLKDRRFSYRKASTAQSMAPVNAALVLRLALPYLKDGAQVLDPFCGTGTLLIERELILPSGTCYGIDTFGEAIEKARANSGTIGHINYINRNFFDFTHAYAFDELITELPQEGSRIVEEADGVSFEERFLRKASELLSEDAIVAVVTRRPKALGEAARQA
GUT_GENOME261089_0121645-467DTACVHAALRSDEPKARKNAARLLSAIGTQPDVPALSEALAREETRFVVPSILLALGAIGGRAAEDALSAYAPPLPESPEQEKHCAEIQAAYKKAVGRLAPQPASAIGSLSEPAPLLLVPPSGFADMLIDELRAHGAHGEAALGGVRVVERDLGALYRCRCLLEALIPCGEARQEPAAIAAAAKRGWQCFCPPGEEPALPYRVELRQYAGDRGAFIRSVVRAVGGTDNPSGYAWELRVDCLPDQTARVFVKPCAVNDQRFAYRKGALPASIHPVTAAAIARAAKLRCDFGAHARVRVYDPCCGSGTLLIETGRLFDGAVLMGTDIAPNAVRIARENTAAAHCRATILQKDCLRFEAREPFDLIVANLPFGNRVGSHESNEALYRGLIERLPALLSADGLAVLYTMEGRLLERCLKAQRALGIS
GUT_GENOME191958_0066115-495NVRQNLIELKQLIRDDNQKKALAYELAGDFSDFTVLLKEEDPKVRKNAVLILGEMECDDLADQIWDAYEAEQTLFVKADYIKSLSKCECRQLLPKLKARLKELSETKVTVEEEKHVRAEIFALQALVLKYNRPKHHKFTGFDHELELILMTNRNHREMTAAQLPAETEIKMLSGGIRFCTTHLDDVLKIRTYTEMLFPIPGLRLLEGSPDHMAKQLVHGGMLRFLNENHAGNPPFYFRLEIKSSMAADRRIDLVKKMSLAIERESAREMVNTTSGYELEIRLVANKEGRFIPLLKLSTIPDWKFAYRKEVLPTSVTPANAALFMALAGPFLKKDARVLDPFCGTGTMLIERARFLPCNTLYGIDILEEAIQKAKVNTELANIPIHYINRNYFDFRHEYHFDEIVTNLPGMGKSKDTESLQILYDRFLQKSQKMLENNGIVVAYTTVPEILKKQLHNYPAYIVEKEAIINERENSRLLVLRF
GUT_GENOME192988_0002014-497VRKNLIALRSEIKDPAQKRALAYKLGGDFGVFTRLLEDADPKARKNAALILGEMECDDLLPLLYKAYESETQLFVRAEYLKAMAHCDYTGYVSRLKGCLKRNQETRWEESDIKHVREENAVLRSMILSVEKPEKHRFTGYDNHFDVILMTNRNFGALTLAQAEEDESCGKSGVHGGTVRLETDNIRKLFNIRTYTEMLFPIKGASRIMEKEPEAAARILARSGLLSFLEENHKGDGPFYFRVELKGKMPLEKKGNFIRKFAAALENYTDGSLCNYASDYEIEIRLVEKQAGGFIPLLKLYTYQDRRFEYRQNALAASISPVNAAVMMQLAKPYMKEDAQVLDPFCGVGTMLIERNIVKKANPLYGVDIYPEAVDKGRENAGRAESVINFINRDILSFKHEYLFDEIVSDLPAVTRTKDKGQILDLYSAFLDKAKELLKEGAYLFLYTPESDVLERCLKERREYEIQESWIFNEREGSIFYIIRY
GUT_GENOME079020_019691-474MLETIIDNIKQGQNVRENLIELKKWLKGSIEHQDTFLSFLQYDYTLLKELLCHEDPKVRKNVAGVIGELELEGLLPELLDAYEEEGTLFVKTAYLKAFQKIDCECYVPKFKERLEYLLSEKFTEEQEKHITEEIRELTSLIGQYERTQSHYFIGYDEQGKVVLTTNRYYPELTLRQLYHCNGKKISGGVAVEYEDMRDFMSIRTWQEILFPLPSAKNMPLDPKVMAHTILMGGLMEFLEARHLGGGSYRFRVELRGRISIEEKRVMAKKIAMAMEKESSHRLINSVSDYEVEIRLLEGRNGGYHAYLKLYTIPDNRFAYRKGILATSINPMTAATIMELARPYLKENAQVLDPFCGTGTMLIERNFALPAHPLYGVDLFGKAIDCGRENAKRARTQINFIHRDFKDFSHEYLFDEIVVNMPSTTGKQGEKEIEELYELLLQKGQRHLKKGGIAIIYSQDPSIALSVFKRHLEWK
GUT_GENOME048623_0156425-467LSELCALVKKQADRERLCARLDHRALYASLACGEPKARKNAARLLSAVGTAQDAPALAAALAREGTRFVVPSLLLALGAVGGEEAAAALRAYEPPQPAAPEEEKHCAEIAAAHSRALDRCCGEPPPALHALLAPRPALLRHPAGLQAALLDELQALGVAAQGTPDGVCVRASSLDALYRARCFFEVLLPCGTVEAEPRAIAQAARGGWTALCAPMPGEPAAPSLPYRVELQGYAGDRVAMIRAVATAVGGRNLPGHYAFELRVRCHAGAADVSVIPTAVQDTRFAYRRAALPAAIHPVTAAALARTAAMRLRPRRRLRVYDPCCGGGTLLVEAAQAADCAALLGTDVAPAAVTLARETLRAAGCRGAILQRDCLRFSPGEPLDLILSNLPFGNRVGSHASNGPLYRGLCARLPALLREDGLALLYTMEGRLLESALHGVAGLV
GUT_GENOME236237_008199-467IEKGIDVRRNLLEIKQAGAAGLKSERLYDISLFALLLKHEDPKVRKNAASIIGDMGECRLLDALYEGYVNEQTMFVKSAYLKAMSRMDYSPYIERLKKRRDGLSAMEVSEENLKHVADELKVLNGMLDDENPGGGHTFCNPDRAVTVFLSAEKSAVSYLLNDIPGAKEAAGGVIFKTADVGKISRIRIYRELFFPVNGFRCYERAELPKELASGNLLELLESLHREKKLPYRFRVTAKDMDLSDLGARIAALSKGMLINAPSDYEIELRLVKNKEGRYGCLLKLHTAKDSRFSYRKETVATSLKPQNAALMVRICADYLRPKARILDPFCGVGTLLVERAIYMRPHCVFGTDTFQNAILGARVNTKAAGMQFNYINRNFFDFKSDEPFDEIITEMPDIDKTEEEELYRNFFDKCSTLLAPRGIIIMNSDRKNLVRKHIRLNGELKLIREYVMNEKAGSY
GUT_GENOME238257_0079616-485RSNLIAVKEFLKAPEHVMEFKNAAGYSPELMASFLEDEDPKVRKNAVIIMGLLGEEGFGKIIAEHYFTEQTFFVKSAYLTALKSYDFSQYETQLDERRKFLEQGNFEQSDLKHAAAELKVLQTMTEAEGTSHIKHEFCNPDKPVKVIFTAPREIRGYLKKEIENAGGKGTQAVFCGVMTETADIKKLASIRIYKDLLFPLNNLKSVDKADIPAAVCNGELYEILEKLHKGAEYPFKFRLTAKDVDLSDIAARIQTLSGQKLINSVSDYEAELKLVAGKDRKYGIFLKLYTLSDKRFAYRKNTVATSMSTVNAAFTAYVAQNFMKENAQVIDPFCGVGTLLIERSKRVKTGHAYGTDIFGTAIEYARINTGKAGLNVNYINRNYFDFSSEYFFDEIITEMPKIERDEADDFYKRFWEKSDELLKDDGVIIMVSGEMGLVKKYIRLMKKYRLVLECPLGSKDDSCVYVIKKE
GUT_GENOME175750_008908-484NAVRGIDTRQNLSALRSAVKDDAGMNDLFELVEDDLGTMTGFLDSDDPKVRKNAALLMGELDMSDFVQPLVNGYRKESQLFVRSAYLEALKWYDIEEYLPFFKGRVKELSDTEITPENRKHVDEELKALTELIVSCEGISRHEFTGFKRLEADTEGILITNRLNMDTTRLQLEERGVKVLPFKGGVRFSTDNLTFLRDIRTYSDVLFVIPGCRTCEFEPEKAAAMLAGSGLMKLLYRLHGGENGQPFYFRVELKSTLDMEKKGKFVRRFSGELERQSKRMLINSASDYEIELRLIENKEHKLNILLKLYTIEDERFSYFVKHIAASIKPVNAALLVELAKEYMVSDAQVLDPFCGVGTMLIERQKAVKANTSYGLDISAEAVAGARLNTEAAGQIVHYINRDFFTFEHEYLFDEIFTDMPFAVNDKVTLELDNIYRRFFTCADRHLTMEGRIIMYSRNPQMAVKYSKSAGYRILKRI
GUT_GENOME172934_0243212-481GQDVRQNLIKLKAELKEESQRASLLFYMGGDYSILEELLTHEDAKVRKNTALIIGELKIPAFLDKLYQGYETETQLFVKSAYLTAMKGFDYTNLLPDLKKRWNNLNHMEAEESNKKHIQEEIRCLSDLIIAKEGIVKHKFIGYDVPSKMVLLTNRNYKNITLDQLEGIQAKEFNAGIILKTDQLREILDIRTYSECLFILDDLKTCKMNVEEAAGAIAGSSLIEFLNKRHEGTEPYYFRIELKSKLPLDKKSAFAKKLGTEITRLSNQKLINSASNYEFELRLIENKEGNLNVLVKLYTLEDKRFLYRKNVIAASIQPVNAAFLAELAKEYLKEDAQVLDPFCGVGTMLIERNYLVPANTMYGLDIYAEAVCKGKENAKEAHTIIHYINRDFFDFKHEYLFDEIFTNMPRAIGNKEEDEIFNIYRRFFKKAPEHLKYGGVIIIYTHNRDYVKKLADKNIYRMEAEYEISK
GUT_GENOME237871_011324-416EDAKTRKSAALLIGELGLSNQDEYREKLLSAYESETTLFVRECYLKALAEGKEELTAEEMERLSERLSYIDSHEFPESDMKHIIAERKALVALIGEGEREKAVFREPENLPVLMVPAKGFYDPLKKALADRGITNGISSIGVLLRGNDFERVKDLRIYDHLKYMIPGDFSLAADSLRDEIEHSILLPTIEKLYKGNVSIRVYLHQSGEKNANQIKRFSGELIRLSVGKLLNVAPYDAELHFYRKKSGGYSLFIRPLTEDRRFLYDINRLSTSMQPVKAAVMVSLLDEYLESYSRVVDLFAGNGTLLMERDEALRTKVMFAVDTSEEAVKAGRENSKAKGRTVNFVHRSAFTFESEEPFDEIISELPDLYEKSSTDRREFFEKLGAETKKILRRGGKAFYLTSEGNEIKAMIRK
GUT_GENOME247276_0010967-507NPALLPAMRKALASDASPKLRRNAARLIGLFTKDEADAQLLIARLKCEDTRFVRPSLLFALGAVGGESAQRALDEYTPAPPADETEQKHYLEECEALKQARAAAMKHEKHIFRGLDKVYEIELTAPDRLTEQLKAELEDFDIEAFDVRRNSLKVNTDDYIGLFEARCFSEALIPIDMKVDLTAEAVSSCAKPFMLDFMRKTHEGEPPYRYRIEITGDLPGDINRSELKKAIRDLTDDKTLVNAPADYEIELRIAASVSSARLYLKLFTVRDERFPYRKEMLPASMNPAAAAAVLRFASDYLTVNARVIDPCCGSGTLLFERGMLSPCASLTGVDISHKAIDCARVNAEAAAKTCGVTQAKFICNDIMRFESKRPYDELICNLPFGNRVGNHSSCERLYEGLLDRMGLLVKKGGIAVLYTMEFTLLKNLIRERRNIEILKQE
GUT_GENOME057686_0141215-491RASLIALKKELKDDAKKKVFLTMTGNRLDEIMKCLVAEDPKVRKNAALILGELHCQDALDVLMDAYEEEDKLFVKEAYVQALSMIDCSEYLPQLEERLQELVAYEAPEEEKKHVQAEIRALQELLLQKKGVKKHTFNGWNRSSEVLLLTIPAFRDALAEEVTGKKKVLKSGVRTIVSDFETVMKIRTMQELLFVVHTQTDDNVLSTDPETLAAQLAESDLMQILTETHKGDVPFCFRIGVSGTMPLEERSIFAKRVAAAIEGAFERNLINSTSHYEVELRLLQHREGGFVPLLKLYTLPDHRFDYRRYYVAASMKPTLAAGLIALAKPYLKEYAQILDPFCGVGTLLMERRFAVPARNAYGIDTFGEAIEKARVNSKIAGMQTNYINRDYFDFVHDYKFDEIVTDLPAGNLSKPQLDDLYRRFFEKSDEVLAEDGRMIFFSREMGLVKKQLRLHPQFRLAQEFCIQEKNRSYLFIVE
GUT_GENOME094836_0036511-492QQKDVRAALSGLRAAIKEDENYSRAADLIGDGEVLRPLLMSEDAKVRKNTAALIGDLELTELSDALFLAYQKEQTLFVRGTLLKALEDTNAYPYLEQLRERYEALCEKEAEEQDKKHIREELHVLERILRKEGKEVIHTFTGWDQKYTILLTTNQAYADLTAEKVHADKKAVTSLGVKAVVDNLREICKIRTFRELLFPIALKKEIHYEDGPEAFAKAVAESKLYQILQVSHKEPAPYYFRIDIKGGLSLEERSRYIKRAAAIIEEESGRKLLNAKDEYEFEIRLFFNKDRKIFVFLKMYTIPMERFAYRKESISASIRPSAAALLVELAKPYLKDNAQILDPCCGVGTMLIERNELVKAREIYGIDIFGEAIEKARINADAAGLLVNFIHRDYMDFKHKYLFDEIITNLPVRGKKTKEEQDLFYKRFFDKSKEILAPGGVMVLYSNENGFIKKQLRLHPEYKLYQEYLIIEKEQFYLYIIG