UHGP-MC 26467


Information


Number of sequences (UHGP-50):
141
Average sequence length:
238±24 aa
Average transmembrane regions:
0.03
Low complexity (%):
2.45
Coiled coils (%):
0
Disordered domains (%):
2.72

Pfam dominant architecture:
PF00668
Pfam % dominant architecture:
9078
Pfam overlap:
0.53
Pfam overlap type:
reduced

Downloads

Seeds:
MC26467.fasta
Seeds (0.60 cdhit):
MC26467_cdhit.fasta
MSA:
MC26467_msa.fasta
HMM model:
MC26467.hmm

Sequences list (filtered 60 P.I.)

Protein Range AA
GUT_GENOME142064_032341263-1494IDEETFSSLIQKVDEIYNMDLNEALIIGLVLTLNKLTKKDEIIVELERHGREAINDYIDVSRTVGWFTSMYPAYFSIEHEDIEDNIKSLKEQFRNIPNNGFNYSILKFLNKKLKGQESKYVRFNYLGDFDNIIDKEKLNISDIEFGLCSDDKNSLTALIDITAIIINKKLKVTAEYSKRRFKDEIVEKFIESYIETLKLILDKCSNKGFKEFTPSDFDAVDISQEDLDALFD
GUT_GENOME193594_006011254-1493YLKNGKIECMLSKETTEKFIEEIGKNKTISELYVLADAVLTALEPLLGQDFLIDTEWHGRNTEYQELDLSRTIGWFTSIIPIHITNLDRDEKERLRTIKEEMQKAKQLSVDPVFTGWYAKNRKKIQPIIKLNYLGNYSSVLKRELFCYASYDTGDDVSMNNYMENQLEINAIIIDSKLQITIYYSYSLCSEQKMQEICQKIPNHIEQITTFLREENPYCFTSMDFNTVNISTQELDSIFE
GUT_GENOME115724_03058297-503TVRISLDERYTQRLLYEAGNAYNTQPQDILLAALGRAVEKVSGQKKVSVCLEGHGREMLTTAIDVDRTVGWFTSIYPVILESYEAIEDMVINIKEMLRKVPNHGFGYQFITDIAHCTDANIYFNYLGENNYADSESELFSVGKCIADENCLVGDIQFNGDITNGILSFDVGCRQGTCSEEVISTLAECYKATLCEIIDCCSMQGEKL
GUT_GENOME033600_001261995-2235ETSEGGNLKEIKIELEESDTKAILTKAVKAYHTEINDLLLASLGMSIYKVSGRKKNVIFLEGHGREKIHKEINIDRTVGWFTIVYPVILNCDDDIEKNIISTKEMLRMIPNHGLGYGLCKDSINEIEQNITFNYLGEIDNENREMFKNADFYTGKCMADEDETENIEINGYVKQGRMVFEITYKDNRYDNNVMEKWSQIIKDTLIDISNFCIQQKTTKKTPSDYVFKGLSWESLNKINDIA
GUT_GENOME103007_02251279-514KNSETKVKTIVFDEEYTEKLLVQALNAYHTHINDIMLAALGMTVKEITNKCRIAVCMEGYGRYPIHRDINVDRTVGWFTNIYPVILDCSDDPESSIINTKEMLRAVPNKGLGFGLLYHNEERSITDIYFNYIGEKENSCLADDFNVGKSVADENGILGNIAFDGFICDKKLHFDVVFKSDGSDDNILSEFADIYESKLKYLIDYCSQKKNLVKTASDSSIGDITLEELEEIMGIIE
GUT_GENOME227266_005231210-1480VQREMDYWNKVDQFMLNNIIQYKYTASDKMKCTSLVFGEKMTNALVNKAGKAYNMEVKDIIITALLRAMTQAYGNEKQVISMESHGRQNDYIELDIDRTVGWFTSFYPILFENLGIDIQSDLVNVKEKMSCVPKHGSDYLSLKSNNVLKISEYLHPLVNFNFQGTFVYDSRVSYFARSNEEYAIPMAAENIFGSPISLDGIVINNKFEVVFSYVQSLEKELEICALIENFRKELKGIVDFCTNQKHRIVTPMDLGCPGMNWNDFQTVQEFA
GUT_GENOME145993_01834792-1013AKLRNDFRLGPNSEGSANTLWIGFDERQSHALLRDIPRQHCVAIQHLLLAAYVRSVADVLASGGTHLSVDIESHGRQLFEDELEMNRAMGWFTAVYPLIAEVVDGEPVLLTARRLANLAEKQADAGALYGIRRYLSDAPRKSVKGSELCFNFLGHFGLDSDASLGWSWSNLYPGAARHPDVSRVHLLKLTGRVVANRLTLDLSYSSNVHSRQTITRIGERFI
GUT_GENOME096513_041622293-2510DMEEIEVDLTAEETGILVRRAPAAWGTEMNDLLIAAYLRTLNEWCGHTRFAIDLEGHGRSEFSPDQDLTRTVGWFTSLYPAVYEIPEGNLLSVLKTVRENHRSIPRKGFGYSILKFLTDPERKEGIDFGFAPEICFNYLGEFQDGLKQEGVRSASIPFGDLIDHSMIWPYQMELNAHIVNGQLRCALRFNRRRIARGSMDMLASRFRHHLVLLSEHCS
GUT_GENOME141763_030601249-1474LSVEQTSELDDGRRRFRRSIQTILLAALGRTIAQTVGEGVVAVELEGEGRSVLRPDVDLRRTVGWFTTYYPVPLACATGLGALAQLDAVHNTLKSVPHYGIGYGLLRYVYAPTGRVLGAQRTPDIHFRYAGVIPELPSGDAPVQFDSDMTLPVREPIPGMGHAIELRVYRFGGSLHLDWWYDTRRIPAATAEALERTFPLALSALIQEAIAAEHTEHDDSEIVGEP
GUT_GENOME143709_041342326-2557LPTDYNKQKSLRKHTRTTTINLNEQQTAELLKQANRAYNTEIDELLLSAIGMAFKKWADLNRFIINMEGHGRESIMPDVDINRTVGWFTSEYPVVIDIGDEPNPLDIISKVQKDIRRIPHKGIHYGVLRYLSEHSRNDKMDIKPEVSFNYLGQFDRDSKEDDGDVQFSAFSGGDSMSLDQVRECKIDIECVVLHGRLSLTVHYSDQQYKEETIERLTAFLRESLSEIIQHCM
GUT_GENOME096513_036673303-3556EYAQSHKIMKQLPYWKQVEEADIPALPKDFAVQSNRWSDCGAVTVQLSAEETEALLKKAHRAYNTEMNDLLLAALGYSVKEWGGANQVLIELEGHGREELFRDLNVSRTVGWFTSLYPVVLNMEHSKDLSFQIKTIKEMLRNVPEKGTGYHILKYAAPAERKEQLRFNLCPEISFNYLGQFDQDMATAVFGPSSMPSGASFSPDSDRSYALEVTGMVENGQLILQLMYNQQQYELKTMTALAESYKKHLLLILE
GUT_GENOME222656_017133662-3880DHKKVSIKIDTVLTAKLLNKVNQVYNTSVNDVLLTALARTLKSVHGSEWTPIKLESHGRGEIESNINIQRTVGWFTTIYPQLISTNLLETKSFSRKLEYGIEYGAKNGIHSYELPTVMFNYLGQVGSNETIEWSIVQTDLGKSTDVSFNEALTINGVIFNKVLNMEFSGYIENIDQISHLYKNELESLVTELYETPRTYLTIEDVNQIISQDELNSLQN
GUT_GENOME098210_003711258-1491RYTLNNNINELSEMIKSKINTTLSTVFLAALIRSLSNIMSANEFSVMVEKHGREHISSDIIIDRTIGWFTYKIPIILKVCDDIMKNIVNTKEKMLSIPDEGIGYGLLKYSNQLSDNYGHTDILYNYLGENELYQDNKIFRETDIQSYSSRIESYDSGSYLTFNILINHGKVSFDIEYDTGIYNQHDIQQLCSNIENELSVIGDLCRDNDSRILTASDFELQGIDIDDFDSLINE
GUT_GENOME044254_003244333-4572DKVSVRLNKEDTSDLIYRSNHAYHTEINDLLMTALGMAVNKTTGNQWVGVRIEGHGREIIHKAIDIDRTIGWFTCEYPVVLKMSEDVQENIIEVKETLRGIPGNGMGYGVLKYEKDSEIEDKAVQICFNYLGNFEEGADEEQSEIIRISKYAGGIDSAPENHTISSLNFNGVLSEGELEFSLEYDMAKYTDDFVGALMNAYKEALVSVAHHCTAKKEQVKTPSDYGLSNISMNDLNNMLE
GUT_GENOME231408_019301296-1519YEFLELKKTSKDNYYCNSDSITFSLTSEKTEDLYKKISMYPEFDINSVLLSALSRSVKRTYNKCKIKVMMEGHGRYKNVSNLSIDRTVGWFTTLYPIEISCEHNLIEKHIKDIKVGMEIVPNKGGNFLSLLTSKKLKEDFLNQEIFELSFNYLGNINNNVNKKYSVLSIPDIKNVGDYNQKAALIEINCFTLNNKLSVSVNYSLNIRDKSKVKELVNTFKSEIE
GUT_GENOME148113_00289777-1023KKKLCDKELQTDIRKIETTIFEKKETEIFKNRVPHKYDASINEIMIAAFAYAYLKTVNGDKILFNLELHGRDICNLDLNTSRTVGWFTSEYPQLIMCDKLDGSVYEMIDNIKRQIREVPHKGQTYEMLKYYGGYNFGIETLDINFNYLGEIGFEKNPQFEVFYDKDRMSLGPMGYANVSSDDANYFNLNITPYIFENKLHFSCSYNPYFISQSFVKKIHKHFVDIFTEMVNSSLASGMTSEIEIKME
GUT_GENOME200288_023083374-3649MESERAYWQHIEQLTYEPLPKDFEQGTSKLKDSRLVTVRWTAEETEQLLKQAHRAYHTEMNDLLLAALGLAVQAWSCRERVLVNLEGHGREELLPDVDITRTVGWFTSQFPVVLEPGHVQALGHQVKQVKESLRRIPNKGIGYGILRYLSAPHDGERFALEPEISFNYLGQFDQDYESSGSQPSPFSPGSDSSPNAVMNYVLDINGVVSEGVLELTIRYGETQYQRETVERLGTLLQSSLREVISHCVSKERPELTPSDVLLQDVTVEELERLSEH
GUT_GENOME207955_008601318-1544LSQETTARLIDTANQAYFTRVPDLLLAALTRTLSEQGYGSRHCVMLEGHGREAIDAELDVSRTLGWFTSTYPLALVDAKEWDALVCATKEQLRAVPDKGVGFNALRLHHPRGWALPEALVVFNYLGVSHQGEAAQPWQPLPLAPGAPTSPDNLPRELVSLHGGVFDGRLTLRQVGALNQQGSDALMARLADNLTALVEHCCNLKASRHTPSDFPGLGLSQAALDRVL
GUT_GENOME070087_00718868-1105ENSKKYWADRIDEVRKYRFDYKYDPKTEFSGYDSIKMTLTPKETADFLRSVNNTMETEINDLLLSALAEAVYSVTGQDKITVTIENHGRERLHDEILLDRTIGWFTCMYPVVASRYDDIRSTILSNKQALNDVAENKIAYGIAYAGSVSHISEIEFNYLGQLGGNAASDDSDSNEFDRGIEIAEENEFSGGISVNGSIMGGILGFSFSYDRSKYAADDIQKMVSSFKQALLNITECCL
GUT_GENOME213666_041691231-1458SCDFQINEEISEHILGDANTVYRTKGEELLLIAWKLTLQQFLGFSCLILEAESHGRDAFENMDTSRTVGWFSMAYPVSIEAKQGTTREIIISYKEQIRSAFQRRYEYGICKWIKGYQIPMVHMARFNYLGNCDLPKGTPFKLVSVFPDNALTAPDTNIYPVELDIALIHGKLHCRIICKEHQASWLADVFTEHLTEIVKHCLSQDNYTYTPSDFPEADLTQQELDMIL
GUT_GENOME192042_017791790-1978EMDKILLTLWGQMVGKVIGQDEVILEMEGHGREEIVDNVSIENTVGWFTSQYPIVLKTKGSFFTLYYAAKDIISRIPHHGIGYGLLVNAGKLQYQPIHVAFNYLGDMDERGENEEIVLSDMPLGDMVSKYNFTGNAMEFIAYIEHGKMYFDITFDYGKYDMEIIRNMAKQYMQCLKNALIQRKDEKFSL
GUT_GENOME171377_038024241-4488RKAEACTYRNSRTVAIELEEPVTRKLLTEAHRAYHTEVIDLLLAALALTYEEDTAICLEGHGREEGMVQADVTRTIGWFTSMYPVLLSLPSRELADAVRSVKEMLRGIPAKGAGYAILKYLTPEFRDEVDILREEPDIVFNYLGQFDDSTDEDKPALSPMPMGDLISPLNQMSHLEEWNCVVSEGRFRLTLRYHPQAVDEDTAQRRVTAYASNLLSLIEHCLNREDEQFTPSDFTDADLTFDELDDIS
GUT_GENOME171366_011921268-1505RDSLSFQLSEEETQQLVTAANAASRTRPLELIMTSLLLAIRTYTGRHEIILDLESHGRDELEEGLDVSRTVGWFTNIFPFSIVLEGTDLASAIKQVKERLRNIPANGAGYGILKYLNGEWKDSGERELLFNYLGEFTGARRDDMFELACHPEAIASSRSGSSYGIEINGAILHRRLSLQLSYRSARFARAEMELFMHEFHSRLRQVVQFCSQQTESVFTPSDFDGADLTQSELDSIFS
GUT_GENOME001586_009673773-3978EERRIVLSEVFDVERTTKILAMSNKHNVGVDEMILTALSLGHKKLTGNEVYAVSLEAHGRNALSGVLNVERTVGWFTTIYPFILNASVEDPYMALMNIKEEHRNIPNGGVGFGALSLGEDNVFESLPTDSFNYLGELQKDMGSLMDMYYTGESSSEKNITDGIGWVSYVAAGKLHIDVQFNEALYYRETIRKWIKNGQSELLKIVK
GUT_GENOME141698_003163620-3849KKIEEGKIKDSNIVTVSLDKLKTKELLNKVNKAYDTETKDILISALSIATGKWTNLDKLGITLQGNGRENILQDVDVTRTAGCFTSYYPVVLDLDSEDIGKTIINTRDNLRSIPNKGIGYGMLKYLTRKNKEAINIDVITEISFNYLGKLDNNESNEEITTDALLQEKIIGNKFKLRDGINITSAISNDILDVNMTYDTEAYDEKSIKEFGELFINILNEIINYCRERDE
GUT_GENOME000031_010772284-2549EEQQTAELPYEMPFHENTDCSKRDSVSFSLSEADTDVLLQNVNHAYSTDTQDILLTAASLAICEWTGGSKLRIAMEGHGREHIMPELDISRTVGWFTSMYPALISFENQRDELGTAVKTVKDTLGRIPNKGIGYGMLKYLTHPENKSIAFSKTPEISFNYLGQFNDIERQGSFRPSSLGSGKDITHTWKREQIIEISAMAADKKLHFNLSYPPARFHRNTMDQLINRIEHVLLDIMKHCAGKQKAEKTLSDFSSQSLTAEDLDSIS
GUT_GENOME141703_007771247-1487TLKQKENISRELYLNEEETKKLIDIANKTYNTNLFDLIVVATVKVIEKLQNSRYIIFELEGHGRYDSFSDIDISRTVGWFTIIYPFMIKKIPEQLSELIVEVKEELRKIPNNGFDYGILKYLKHEFSVSDNNLVRLNYLGEIDDLDNNLFSVNYDAVDNVDPRNKMDSLMDINCYLKDNQLHISFSGIRNLEKDKILDFVELYHKSLVNIIDLCGKSEQIHFTPSDFSTIDLSMDELDSLF
GUT_GENOME096202_0386633-267SCTIHLNKEATAQLIHQAGRAYNTEINDLLISALGMAVRKLTGARDVTIGMEGHGREPVHKRIDIDRTVGWFTSIYPVVVPCQEEIAESIITTKEMLRKIPNRGLGYGLFQEQLPVRPLEIMFNYMGQMDAEAKGRKLHFFSSGKGSAEENIVNRKVNINGSLLKEQLHFSIVYDKSKLSAGKIQLFIETFHDCLIATIQFCTSQEEVAITISDTDATDLVADDLQMINELFDLD
GUT_GENOME146003_003981247-1457FECDAALTTLLVDTCHQAYNTSIREVLLAALAPVLQAWHGSADSYLTLEHHGRDINDDRVRLEGTVGWFTALVPLRITAHECQKTTLRAIKDTLRKMPDHGIGYSALKYASASGDEALACLRQHRLPAISFNYLGVFNEQADDALWALIDQTADLENGAMPVSGNVLDLVLYVHDDRLSIVMSGLLPHDELSRLGARYVDSLSALVEHCCR
GUT_GENOME044254_003235357-5579TGAIYFGLDETDTDRLLTQSSTAFNTEINDLLLAAIGMAFRQVSGQERVTVGLEGHGREEIHKKIDIDRTVGWFTCMYPVIIRCLDDAEDSIVETKEMLRKVPNHGIGFGILEDEIGEIKADLYFNYLGIFDAEASKEERIAFSTGMSSAKENKMPGSINFNGSVSYGRLAFEIMYDKSKYTEEFMHKLAMAYEKALKDIIVCCVSKEESSVTISDTYASDLG
GUT_GENOME000011_030752289-2547ETTALPKDGHRNADRLQKTALIASRMLSVKLTNQLLFQTKQAYGTEVNELMLAALGMALADWTKREHFLVSVEGHGREGHVPDIDISRTVGWFTSIYPVLLDMSAPDNSSEKTACRIKNVKDMMRRVPNQGTGYGLLQSSQRIQHREPEICFNYLGQFDGLEGMMRMSPYQPRHNIAEERPREFDLDINAMVVNQRLDIQTVYTDVFSRRTIETFMDGFYDRLCEVIEHCSAKKKREKTLSDFSNKQLTSASFESIANL
GUT_GENOME080363_0266542-257QNVGFSLSKESTKLLQTYVGKKYGAEINDLLLTGLALSLREQFGSRKTKILMEGHGREAVSSNININRTVGWFTSVYPFNLDISNTEQPELVSIKERLRSIPNKGIGYGILNFLDRPFNQISSASVQFNYLGDFNDIEGSGTESKGSLFEYSSENIGSSVSSDNLFTDILLDISGMTVNGIMNISIRYSDQLFKSESIENFCALFEKNIKNILNEN
GUT_GENOME011023_0098484-268SGARTVCCAVPHELTERLVKRQRSCGKRMEDILLYALYRALTRLGLGQEIWVELEGHGRGQAASCHVERTVGWFTALRLLRLFTPEGARREAWPEIQEITDSGGAHTAVRRIRFNYLGELRTRKNSFFEALPSVWPGDIGGRNRVSCRVCVDVLLVQGRLKAHFTFKEDDSLCAGLPAAYLQALD
GUT_GENOME239690_0049026-241RSINIQLSKQDTDNLLRHSNQAYNTEINDLLLAALGSAFRDWTHHTQILIGLEGHGREDIVSDINVSRTIGWFTTMYPVVIDLPPSCSVQDTVQHVKQTLRQIPKKGIGYGLLKYVTALGHKKSMKWKLNPEVCFNYLGEFSQSIDQGVFTLTDMPQGQMLDPQSERSYSMDITAAVMEDELKISFIYNKHEYDEDTLTEVALNFKNHLLEIINHC
GUT_GENOME141734_004063312-3585KEIGYWQEVEKAETASLPKDDEAEDKRMHHTKTAEFSLSKEETEQLMTKVHEAYNTEMNDILLTALGLALKEWTGQEDFIICLEGHGREDIMEGLNISRTVGWFTSQFPALIQLRHSEDIGYQIKQIKEELRHIPNKGIGYGIYRYLTEEGKKAQPIKHDISFNYLGQFAEMADSGLFTRSALPSGDPLSPETEKPNALDIVGYIENGILTMSIAYHSLEYKESTVAAVAASFKTYLLQLIDHCLELDGGELTPSDLGDDELTLEELDKLMEIF
GUT_GENOME096513_036663338-3615MHRELDYWRSLANLKLQPLPKQRLAADRTWENTRTLSIQFSEEETDLFLRKAPSAYNTEPNDLLLAALGITLHAFTGYERAAILLEGHGREELFEELDITRTVGWFTTMYPVVLDMSKVHDLAYHIKNVKETLRSIPNKGIGYGLLNYLTERTKEQDQSLAIAPDISFNYLGQFDGDIDKGQFVQSPLSSGSAISPRNKRECSLDISGLVLDHKLTMNFSYHQEEYSEQQMIELLESCKKALLDIIRHCAEKEGSEATPSDLGHEDLSVEDLLTITDK
GUT_GENOME143709_021547574-7844METERDYWSQMAGREQMPLPKDNDQGGDAVENSATLTLQWSRRETQQLLTEANRAYGTEINDLLLTALGMAIQEWTGSEQVAVLLEGHGRESILPDVDVSRTVGWFTSAYPVMLDMKADLELPRRIKLVKETLRSIPHKGVGYGILRYLTSSNIQAADPEIAFNYLGQFDQDLQNSALQISPLSSGDAASRRQKRAATLDFNGLIAEGTLTLNLSYSAEQYRKKTMQRLAKRLKHHLQEVIRHCAAKESVELTPSDVLQQGMTIEELEALV
GUT_GENOME213384_039863396-3655LPCDNPQGGRQNSLAITVGTHLDQSWTRRLLQEAPAAYRTQINDLLLTALARVITRWTGGDFLAVKLEGHGREDLFDDIDLTRTVGWFTSMYPLVLTPAASLADSVKRIKEQLRAVPDKGIGFGALRYLGDDEARRTLGGLAQPRITFNYLGQFDGSFEGDIAHAFFTPTGEGAGADQSEHASLGNWLDINGQVYGGELNLSWSFSREMFSEATVQRLADAYEQELKALIEHCCLAAHQGVTPSDFPLARLSQVQLDEFA
GUT_GENOME096393_01838799-1084KEAEYWYYLNYYVENVRIKKDYVTINNKQKNIRYVGIELTIEETEKLIKKVNKAYRTEINDILLTALGFALKEWADIDKIVINLEGHGREEILEQMNISRTIGWFTSQYPVVLDMQKSDDLSYQIKLMKENLRKIPNKGIGYEIFKYLTTEYLRPALPFTLKPEVNFNYLGQFDTDVQTELFTRSPYSMGNSLGPDGKNNLSPEGESYFVLNINGFIEEGKLHITFSYSEQQYRKETIQQLSQSYKKHLLAIIEHCVQKEDTELTPSDFSFKELELEEMDDIFELL
GUT_GENOME107385_02340580-766NRKDFAIVLSEEETQQVCLKASEQFVSMEHIMLGCVVMALRKLLHKEHLVVEMEGHGRGDVSENLNITRTVGWFTSVYPVLIDITEDNINSYIEKIAETLSAVPSGGLSYGVLKYLSKDGLGESLDIAADIRFNYLGEFDQNLNSEFMSISNFDFGYTVDPDYPVDFLFDINSIIIENKLRINLNYI
GUT_GENOME188395_038061294-1543LLKELAYWKGINEARANKLPKDQLVSVRRFRDVRSEELILSKEETTDLLKNVHKAYGTQANEILLSILGLAIQEWNGLDNVLINLEGHGREEIISNINTKRTVGWFTSFYPVVLTMSEGKSLGQYVNTIKDSLRNVPNKGIGYGILKYMSELEEEDRKSFRYKPEINFNYLGQFGEDTYNDVLTLSDAPIGDCVDSNDETQYALNIICKVIDKQLSISFRYDCNEFKEGNIAKLKEIFYEKQKEVITHCL
GUT_GENOME096564_024921545-1812IEKSKLRKLPKDKKGQDSKFSSLKNVAMELSKEETENLLKKVNKAYNTEINDILLAALALTIYKWTGEENTLINLEAHGREEIIKNVDITRTVGWFTSQYPVVLKAKKDLAETIKNTKDLLRRIPDKGINYGIIRYLANNETKDGKEFKLKPEISFNYLGQFDRDINNTIFTASTINSGESISLNNEILHLLDFSAILMNNCLNLNIIYSDNDYNEETIKALSKNYKANLIEIINHCISKEIAEKTAADITEENISLDELKPYLKCFN
GUT_GENOME160282_015251902-2149YWKQLPWDKLPPIPYDIPQEQNHNYLYESEAFCSVVSKEDTKRLKHAFQKGYNADIETLILFALQEVICDWSHSEYSFISAMGLGRDAVPKGQNIDLSNTFGFMAIIRKIFTKRICGTLEEKLNYYIEEIEKIPNRAYGYSIIDEYLYNGEFPNQEGISLTNVSLNYRGEMNQKILGSGFYLAENSTAFGMNSENRMLGGSCKVMVNGAIVSEQLILTWVYFKNMHKKETIEKLAQQVQQILCDIANI
GUT_GENOME154900_022803-214KSSNVYGANIDELLLAGLARAIGRLTGQKMLSVKVEGHGRESIHKPIEIDRTVGWFTNVYILNLDCIEDCENSIINAKETKRRVPDGGIGYGFVECKKVPDISFNYLGDFGSDENSSNKYSVGKSYAEENLTDDKIIFNGSVLNGELRFTINSQDSKFDMSFIEALCKEFKISVNELAEYCSSSLNDDKTISDVYDDELDESEVDFLNNLFM
GUT_GENOME096202_045492367-2629QSELSYWSAVSSQIPDSLPYDNKAASDKVEDGRDFEIELDRDSTEKLLKHAQQAYNTEINDLLLTALGLSFQEWSGLSRIAVSIEGHGREEIMKDVDVSRTVGWFTVMYPFVLDMRGQGDLGHQIKLVKDGLRRVPNKGIGYGILRYLTEGIHLEELPFAFEEPQISFNYLGQFDGEGDGGARFGPSSHQTGDEIGGGSERPFAFDINGQVSDGVFRLTFNYNRHQYRDDTVEKLAGGFKKYLLELIRHCCEKETAEQSPTDF
GUT_GENOME207278_018803832-4088EQRFATSVQSRFDRSLTERLLKQAPAAYRTQVNDLLLTALARVVCRWSGASSSLVQLEGHGREELFADIDLSRTVGWFTSLFPVRLSPVADLGESLKAIKEQLRAIPDKGLGYGLLRYLAGEESARVLAGLPQARITFNYLGQFDAQFDEMALLDPAGESAGAEMDPGAPLDNWLSLNGRVFDGELSIDWSFSSQMFGEDQVRRLADDYVAELTALVDFCCDSPRHGATPSDFPLAGLDQARLDALPVALEEVEDIY
GUT_GENOME239690_02217403-669LAHAEVASLPKDQERAAGQHFVVRDSETVSVAWSEQDTRLLLQEAHRAYHTEVNDLLLTALGSALYNWSGQERVLVNLEGHGREAIVPDIDITRTVGWFTSQYPILLDVDGKLEVGQRIKRVKEDLRHVPHKGIGYGMLKYMSSDDATSSWSLQPEISFNYLGQFDQDLENGEITLSSHPGGEPLSDRTVLEQPLNVNGMITAGVLTLEIRYDSHVYQQQNVEKFARLLQESLQEVIKHCASQERSELTPSDVLFKGLTLEQLEQLT
GUT_GENOME078019_017361207-1419MSINSVHSKKITIDRNDSLFLTGSNKHTYRINDIFLAALVKAVNSDTLAIMLESHGRETIEQNIENDSMIGWFTSIYPVIFENNKDLDDLIASVSSSTKMIPNGGLNYLAYKDQKISNYEKVFEPLIIFNYLGDFQENLEYIKLVHLDMGDRNSKSNHLWFPLTITGQNVDESLVFEFKFEQSYFSISEIEKIAEKFKNNLHTIIEHLNSHDC
GUT_GENOME062844_04981781-1034LSQEHLYWEGVESEVYPDIPTDHPVTGKHILDNSISFSLNEESTKLLQTHAGKKYGAKINDVLLTGLALSLQDHFGINKTKVLMEGHGREVMNTGLDISRTVGWFTSLYPFSLDISNNSQPALVSVKEGLRGIPNKGIGYGILNYLGDSFSPKNNPSIQFNYLGDFDDEMGSVFYYSSEPIGNPISEENLETDIILDVSGITISGEMNISVRYSGDLFNEATVQKLMDSYRGYLEKMISEEESEVILTPSDVTY
GUT_GENOME116311_02125240-469SDSVSLELNEKVTYDLFYKSNNAFNTEINDLLLCSLVKAFHSWKQCNEIAVNLESHGREEAVIGLNIDNTVGWFTSSYPIILQYESDIDRNIVSTKEMIRNIPNKGITYGILYADAEPKPEITFNYLGQVGGSGNTMKYTSGEAVSSENKLPGNISFNCYLSDEKFTFIITYNKLMFSAHDMRILAEAYLEALEETIAYCSGDNESRQTISDLSDNELSDDELEMLNDFY
GUT_GENOME096513_041632377-2627HACEAGQRYKEVELKLDGHFTEQLLTTIHQAYRTEVNDILLAALTLAVTRWTGEESMAVSLEGHGREQLMAGVDLSRTVGWFTSLYPVVFRLQSSAIPHVIKAVKETLRSVPHKGAGYGILRYMTSPGSKGHHRFCSQPRILFNYLGQLGAGEEGGFKLSDAPAGSAASDYVEPLYWLEINGMVVNGELRLVIGYDAAKFDLRTMEELRNSFKEHLEQVIQHCAEAEGTELTPSDFSARELSLEELDDIFE
GUT_GENOME160282_023061209-1476NEELLYWRKISKRLGNSKPLFSEVSISALREVQFRLSEEITSKLVKLGQEIISMNPNEVLLTVLIRVLTILFNRKDISILIESHGRPELCEQIDITRTVGWFTALYPVLFEKIGDDYFDDLLIIKERLRSIPNNGVGYDVARMNKLLDSDYHEPLITYNYFGNREAENGIDSYTLRPSNYEIGNNIDIRNHFGSPLSVNAEIIKGKYCVDISYPISMITEKEYLAFKNMFLEQAEKYVIALEARKERVTTPSDYYQNDISLQDWSIIT
GUT_GENOME096513_036704463-4745REKAYWSRTALETVPVLPQDGQAAVRRWGNAETVSAELDEQQTKQLLQEVHTAFKTEMNDVLLAALSLAIGEWSGHGRVPVQLEGHGREEILEGINISRTIGWFTTVYPVVLEVEQQQGLSYHLKQTKETLRRVPSKGIGYGVLKYMTSAEHKEDLQFGAEPAISFNYLGQFDSSGSDRNDALFELSPLPTGQPISPSAPRSQALDVTGAVTGGRLRIHLTYDREEYERQTMEQLAARYKETLLELIAHCLSRESTDVTPSDLGYNQLSLEQLDKVNSLLKLK
GUT_GENOME103435_02289586-794TVKVELDPEVTEKLLYKAGRAYNTEINDLLLSALAEAVCGMTGKKDIAVSLEGHGREDILEGVAVDRTVGWFTSTYPVVLSAKGKTGETIKGVKEELRRVPNHGIGYGMLRWYQGILGKEEPDIVFNYLGRFEEGELEGGFRISQDPKGEEVSGKNLYSMAVSINGLEERGRLSFELTYEKSIHRREDMEELGQRYGEALKEIAIYCAE
GUT_GENOME005775_001351858-2094IKKSLDEKSTAHLISDIHQRYHTKVQDILIAAFVCATKKCSEENTISIQMESYGRENDYSKLNVSETIGWFTSIYPVEIAIAGDDYGKQIVEVKDRMNQVQNHGIGYNAYYYGKKANNRTQDNCFVSFNFLGTFEDDLQNELFKASEYSSGNEVTDCYIRCSLDINAEVKNGCLEVRMDYGNDLYSKDQMQVLIDNFFVEIVEIINHCMSSQESFTTSSDYSVEGLDANSMDDIFAS
GUT_GENOME213434_00355380-567VSFPAEVTPERMAAGRRTHAVHAGELLGIALFRAMQKLLDIRELLFEMEGHGRDRRGMRGTERTVGWFTVLYPVRLQAEPGDLPTQTARLAEQFRQAGARKYEWEALQRLEAKPAYSGAVRMNYLGEFHQNGQAPFLLEEISPRLDTAPCHRFDSLMGVDVLFVNKRLHAAIVLPDGAFSAGGVDCLR
GUT_GENOME105860_003511266-1485KLSMFYHINATSFFSVLIAFMWFQYQGNTEAAFTIEQNGRALLEDYDVSNTVGWFTQIYPLVLSCKSLVSMDERLKVFKENIVKAEKLARDYSALLMNERIAEAKWNQTIWLNFLGDMGHLLKNRQFSLADEMSCFYQTEDNCMTHPIEIIGWYQKDMFIMSIHYQTERFPEYFQEKVEAAWKRTVQELADFTADLNGVMFTPSDFDLAQLTQKEIDTLF
GUT_GENOME239690_0048630-299LWKNDQVQVSTQRNKRNLKFALSKENTKRLLGQVHHAYNTDINDVLITALGIALYDWLKVKQIVINMESHGRTDILPDLNITRTVGWFTSQCPVLLCMYEPDNLSEWIIKTKENLRTIPNKGLDYEVLKYLSPESLRAPLVFSIEPEINFNYLGQLDPISSSPYFQLSSYSSAASFGPEGLGNISPNNELCYALEITGYITADQLHLSIGYDIARISGQEIQHLAEGYEKQLIRIIQHCIDKKEQVRTPSDFTLKDLKEEDLDMIYDVLL
GUT_GENOME000711_028023316-3564RLEQERICELPKDRAAAERKAENTSAVTFELSEAETRLLLTKVHEPYGTDINDILLSALSLTIREWTNGSNICINMEGHGREEIIPGMNISRTAGWFTAQYPVILKSEAAAGLPETIKNVKETLRRIPDKGIGYGILRYLTDRKKAGADFSIKPDISFNYLGQIDREVQTDFFGPSSYDMGRQVSERSEALYALSFSGIISKEKFILSCSFNTEEYDRQTVQGLMDRFKASLTALIDHCTAKEEREFTP
GUT_GENOME143709_031701267-1495ELSKEETHRLLREANAPFRTQTSELLVASLAMALGGFTGKENITIELEGHGREELFDSLDVSRTVGWFTSMYPVNLRLPGTELAGIIKSVKEQLRSIPNKGIDYGIMSYISGLIPEGGRSLLRFNYMGEIDNHLQSPVLEIADKDTGSDSCASNRLTCLADVVAIVIDNKLQVRLAYSTAKFKPQSMEAFMDALMQRLIGVISLCAETGQSYFTPSDFETVKLTEDELN
GUT_GENOME096559_026891263-1494TIYFNTEDTTVILRGHSDMTAAELMQVSLVMAWHDVFQCKELTMWLEGHGREALFADLNISRTVGWFTSLYPVKFELESVRGEHETSRSILRRLKQVPHRGVGYGVLAYSLNQIPLVEPQIVFNYLGEFSQLEEGIFKIVNEASGPDVAPENHFPFLLEMNAIVMNRQLQIELTYSPASLDSDRMEQLVCNYQSKLLNLLQEQDHLSEEEFMLTPDDFDAVDLTQQELDTLF
GUT_GENOME141723_010792268-2539EKAYWEDVENRLVVPSSLAVRQSINQKRWYSKQLVRQLNKEKTDLLLKQAHRAYNTEINEILLAAFALAMKEQFGIEELPLDLEGHGRETFSEDVDVSRTIGWFTSVFPVVLEAGREEKLGTAIKSVKDQLRRIPQKGMGYGLLYCKELHTIEERPSPPVSFNYLGQFELHENEGQSLHLGNSNSLENERTHMLDLNLAVINGVLEINLEYHEREFDRNEIEQLLNVYTETLEKMVDHCIDKEDDGEKTLSDFDDQRLTDEELDHIYELLED
GUT_GENOME239690_045952-192AVHEWTGIERVGIVLEGHGREPVVPELDITRTIGWFTSQYPVALEMGGEMEIGARIKRIKEGLRRIPNKGVGYGILKYLSDVSTGSSFSAEPEITFNYLGQFDQDLAEGTMEVSPYSAGSEVSEEMVQHQALNINGLITEGQLQLSVSYNHQQFHMDSVEKFACILKERLSEVIGHCAGKDRTELTPSDVL
GUT_GENOME000031_033211251-1498ICSENSKVENSIKLKRVLTSQMTQKLLSEANEAYQTQTNDLIITALIRACHYCTGKEVISLELEGHGREPLKEDLDFSRTFGWFTAMYPAVFQAEADLPAQIRSVKENLRNIPNKGIGYGALTYLNKRIADHKKPELRFNYLGEVDRFLADKSEYRMTYLNSGAESSPENDLTVSLDLTAGIYDGQLTLDLICSNLYQTEDMEQLLDEFSVQLEELTEHCCKQESVEYTPSDFETASLSIEELDSLFK
GUT_GENOME032990_014721739-1976LPKDFVTQASYFSNNKKIIVSLTPAETEGIVKLAHKAYNTQINDLLLSSLALGLNRKFGTRNVFISLEGHGREQFDPRYSIERTVGWFTSLYPFYLKTEMSSLERLIIETKESLRAVPSKGLIYSLLKDNLGDQISISTPEISFNFIGDISENRQTNYHIVKELTEYISDPLNEKLAVIEINSYILRGQLFFEIDYSKNAFKEETIADFAQKLIDAVNDVSTHCQSLRRTVKTISDFG
GUT_GENOME139185_013573867-4097FADTRLLEFDINAADVEKLKKEAFRVLDATMEETLLSVTARAVAQTTGRKRLAFAMESTGRFAKDQNVDLTRTVGWFTSVYPLVVNTDQAAAEILAEVKEKRRSVKNLEYTFGICAGLENAAKYVKPQISFNYLGHIQASANGFVQADMPIGPTVGRNITRSYAVDISCKLVDTADGTKLHFTIAYSGKLVGTAFAGLLMENIQNNVFDVTKELSMKEKKLFTPADFDLDI
GUT_GENOME142578_020521206-1396IALSKEETEVIIRFANQKNHGNMEAVLIAVIAHTLSKNNIIPQKSILLLEGHGRPKEKNEFINTMGWFTNIYPMKIELNDDILISAQKIFFNQIKIPNKGIGYQRYFDLNFCHDWSFNFLGDMTFNDYPEFKIVDIFSENDFSPNSQALSNLHVDVILNNQELSIKFKYNKKLYNSSDFKKVEMEFQDSLY
GUT_GENOME213666_048431219-1465QAECAYWRDMLSHDTSFPWGGQEAPTREAQNTVRFLPEQTTRNLCGYVNAILHLSMEELLIAALGVALGKCLPQSKVLLELEHNGRYEDVRDIDISRTVGWFTNVYPLAIPLQVDDLESYIGKIKDVLRSVPNHGRHYSQCVENPWQYIPQAIRFNYLGRFTSKYSCFKAEPLEGIFGVQGREILCVMEINSILYDDQLVVHLRSGYSSQQIPLERITEKWMDQLLAFGQMCKDDFSFSQKVSPSDF
GUT_GENOME234083_011191210-1451DTIVFLKKQNKSEQDTFNFRKDSIFIDNEVISDGMNRAKEIFGADSSAILLAALLDVLTDYLPNKTIINMEGHGRTLSSKNIAVERTVGWFTAIYPLVVEKQFDISSSIMNIKDKLKRASQSQIEFGLMKNKDRFTYRPNLTFNYLGEFTEEINNSNTFSLSKQSTGEVKDKKNYLKSDFVVDIINYGDGFNFNFTVVEKININFEKVEMELKNYFLNLKDKLNCYDQKVITQSDVSDEELE
GUT_GENOME016866_030741052-1315VDQMEVPAVPKDMEADVTTQQDSESLFVRLAPEETELLLKRVHRAFNTEMNDILVTALGIALRKWTGHERVRINLEGHGRESIGTDIDITRTVGWFTTKFPVVLEPETDRDLTYQIKQVKENLRRIPNKGLGYGICRYLSKSEEGLVWGAEPEINFNYLGQFDDDVNQGEIGISSYSSGSPASDRQARSFVLDINGMVLDGALSLDLSYSRKQYRKETMEAFAQRLEQSLRELITHCAGKENTEMTPSDVQFKGLTITELEQIA
GUT_GENOME141703_007611323-1538EEYIKALLEYIPKIYHTQINDILLAALMLAINKWSGKKKILIDLEGHGREDIFRNINISRTIGWFTTIYPVVLNMNKCENLKDYICSIKEALRSIPSHGLKYDIYKYLKDDDVFRKIPDGDILFNYLGRFNARAEGNTLFNILDMPHYGNRDERNKRKYEIEVNLLINDYSMRIDWIYNKRKFAREDIEKLACYYSEALKDIIEHCKNTAPVYITS
GUT_GENOME000711_024753944-4203VPFIPAEKLERDTFEHSATLSIRIGPDVTAKLLRNANKAYNTEINDILLTALIAAVRDITGENKLKVMMEGHGREDILDGVDITRTIGWFTTVYPVFIDLGEEKEISQNIKMVKEALRKIPNKGIGYGVLKYMTEELQKIQTQAPLSFNYFGEMNNDMNRKVFSQSPFSPGESIGGKIVRHCAIEMNAISLNGELTIYTTFNQDQYQTSTIEQLNQSFKENLEKIVDHCVDKEGSDMTPSDYGDVSLGLEELELIKDKYS
GUT_GENOME001364_048622314-2554VELSRQDTELLLTAVNKAYSTETNDILLSALGLALQKWTGNDEFKISMEGHGREAYLDDIDISRTVGWFTSIYPVWLDISEFDQTDKDEWLGRHIKQTKDMLHRIPHKGTGYGVLKYMNNLWGSEQSNPEISFNYLGQFDQDIQSKAFEVSDIKTGNEISPDWERPYVLDISGAVSSGCLNMHIIYNRFQFNEKTIRAFASHFKHALENIIQHCAGKEKREWSAADFTDEDLTLDELSEIM
GUT_GENOME143384_003632984-3241QPVEWPCDRPRGDNREALAESVSLRLDPQRTRQLLQQAPAAYRTQVNDLLLTALARVLCRWSGQPSTLVQLEGHGREALFDDIDLTRSVGWFTSAYPLRLTPAQSPGESIKAIKEQLRAVPHKGLGYGVLRYLADPAVRQAMAALPTAPITFNYLGQFDQSFADALFQPLDQPTGPIHDEQAPLPNELSVDGQVYGGELVLRWTYSRERYDARTVNELAQAYLAELQALIEHCLEDGAGGLTPSDFPLAQLSQAQLDA
GUT_GENOME145936_034703964-4172EVSLSQELTYSLLHTANRPYHTEINELILTALVYALKALSDEDNYGITLEGHGRENISDSIDHSHTVGWFTSKFPVKLQAKKDLAQTIKGIKHLLRSIPHKGIGFGAFVTHPAITTTYDFNDMPAISFNYLGRFDAAEGYWQVVNENSGKSVGADNPLMSIIQLNGMVNQGKLWFGIHSRLSIEQAMLFASAFSQSLEKVVTHCCLQDK
GUT_GENOME146003_004971336-1549ACSRLALSIKVTQALQTEGLRAFNAKVDECLLACADGMLAHWFNGELTHVTVEGHGREEIDAALDVHRCVGWFTSLYPCALSSSPDPAMRLQAAKAQLRGVPHQGLGFGAFARELGLPLPNVCFNYLGQLTSPDEKDWALVMSPVGETRHPDNGLPFGITLFSYIEQGRLQLEVVSGLSGEQNQQLLEAWVAAIDSLIACCERQGRTRYGLGDF
GUT_GENOME257540_01237337-526LGKETSNSLTGKASSLYGTKTHELIVMALMIALSEQLGKDQAMFEIESHGRDVLKNLDVSRTIGWFTKIQSVILSIPPKELSEQLADMKKQLYTNNSLTEPLMKNEIRINYLGDLSETNNCHWILEGLGWDGDVSADNQWSYLMDANIYLLDGNLHIITAFHPEWSGIGESILCSWFDKLKEIVSHYVST
GUT_GENOME095421_01137127-371RTHADMTVFECRLDAENTAKLLDRAGPDLEAALLGAFASGLSRWAGKTWPDAASLRTPLIWVEHHGREILSDDMPDTGDAVGWFTSLVPTAISGSTLEERLRAASASLSLGEANGLGFGVLAAYRENAAAVPEALRAGLCRRGAAEFNYLGRFDLGVGSEDAESALSIEQLGDSADQSPDFPAGAPLSITLLVNEKGELECGFAVDLRGLADPEGTGRYLKSSWKRALEEVIDALGAAGSSEGAS
GUT_GENOME116293_01997295-536HEFESDKFDGYDSIDISLTSNETTTLIKESHNAFNTEITDLLLSALSEAVYRYNEQDRVLVYLEGHGREQLHKNIDIDRTVGWFTSVYPVIIEASDDWDDTIVHNKNMYHSIPNKGIGYGLLKEDEENIDGIEFNYLGEFDESQNGKNSEEVYSIGLSIAETNKRECSISINGSVSNGILSFNISYDKNLYSDSGVKMFAQHFHDTLVEIGEYCVNKDETVITASDFDSSMDEEDFEGIVDL
GUT_GENOME096513_015341293-1561VESTPCVPVPRDFEATGDWTKDEAQVMVTLPPEETERLLKQVHRAYNTEINDILLTALGLTLKEWMNTNRIVIHLEGHGRQSTIKNMNITRTVGWFTSQFPIVLPIEHTEDLGYQIKSVKEHLRQLPNHGIGYGLLKYITPRESLPGMLFTQQPDICFNYLGQINTDLRNRWFGPSPWPVGRTISPEAERKASLFIIGYVQDRRLHIVMEYNRTHHRQDTIAALLDKLQNRLVDLIRHCGEQTAAEATPSDLGYNRLTLQELDQLNNLV
GUT_GENOME096513_036713657-3909IPTDMETEDWRFKHFRAVEGSLNLKQTAALKRIASRSISNVSVQDVLLMSLVKCLQAWTGTKEVVLDMESHGRHSDELNYSRTVGWFTAMYPIRMTLQEDMDPSQQLSDLSQQLKSVPNHGFHYGILNYLTKLLPARDRLLKDVQFNYLGEFTPLAPHDLFSGFQLTANAAPENHFLVKLDVTCWITDDLLNVAFRYNSNCHREETIRSLHERYLNHIQVLVESLAEDASHLQADSLDAADLSSEELEALFDL
GUT_GENOME200288_008595011-5279VERTSVTELPVDHSISSNQAGDHAVQKLTLDTNVTRSLLTDAHHAYSTEIQDLLLAALSRTLKEWAGSGNILIQLEGHGREDVIKGVNVSRTVGWFTSLFPFVLGEKGTDDLAELIKSTKDELRRVPGKGLGYEIMKYLTFHDSTSELSLDYLLKPELVFNYLGQFDQEAARTAVFQVSDISTGPEVGPQVQKGYKLELNAMVSNGQFSMEINYNKYQYHQETIQRLAKRFGEHLEQIVQHCSRKEVKEMTPTDFQYNKLSQKQLDAIS
GUT_GENOME087803_001941260-1491LDAEKTNLLFTKANIAYGTKINDLIVTALFVTIKELTMQRDILIALEGHGREDISKDVDISQTVGWFTSIYPLRIILNEDSLRESIIKVKEKLRNVPNNGLDFGVYKYIKKEIDYDLKDLIRFNFMGDFDTDNSESDIAKLVVQNTGNDSDKNNALGCLMDINCFKTQGKFFINVIFNGNMFSRECVKTFIEKYLQILDEVIIHCVDKNESEFTASDFNESGLSQKDLEALF
GUT_GENOME096508_002472022-2288KKAKEAPVPFVPQKVLHKDTVSVKHWIEGEIGEALCSKANRAYHTETVHLVLAVLAQAVGAWKNRSALLFNLEGHGREAFTDGLDVSRTAGWFTSVFPVLLQVEEEIGATVKAVKECVRSVPRRGFGFGVLRSLTEDLTLEDREALESLQSAISFNYLGVQGEQHPGAVEIEPLPADVTVDGNYETSLVLDIVAAQVGTALCLDIRYPRALFSAEEFRDLLKLIDQAATEVAYHCSEQTVGEKTASDFTVTPLEQEELENILDDLDI
GUT_GENOME095592_02549784-1046LRQQLGYWQAQLQGFAQTLPGEDAQGSQQVGDAQTVQVELDAALTRQLLQQAPAAYRTQVNDLLLTALARVICHWTGEHAALIELEGHGREALFDDVDLTRTVGWFTSLFPVRLPVAGPVGDTIKQVKEHLRAIPDKGIGFGVLRYLGDAATRQALTGLSQPRITFNYLGQFSASAAGEDDALFAPAAESGGRQVSDQAPLGNALTINGQVFEGALKLDFTFSRQRFAAGIIEHLAHAYVSELRTVVEHCLALEHLCLTPSDF
GUT_GENOME239690_0548625-265SVKIELSIEETEKMLKRVNHAYNTEVNDILLTALGLAVREWTGNDKVLISLEGHGREEIIKDIDISRTVGWFTTQYPVVLDVSGQLDISRMIKSVKENIRQIPNKGIGYGILKYLTLPENKADLNFNLNPEISFNYLGQFNQDNVDNLFGVSDIPAGESTGPNYKSEYCIDINGGTVEGGKLELNFSYNQGEYEKSTIEGVAVSFKKHLVNIITHCAEMDYSEATLSDFTVDDLNMDEFED
GUT_GENOME234083_011181247-1452LKLEKSVISELKNNLSKLNDIDMQDVLLLSFLESYFKTYKKEQHLSIDMEGHGRNSEAIDLNVSRTVGWFTSVYPLLFKPDVIPSSFSEKVSYIHKRLKSVPNGGLTYALQTESNSKFRFSSIMFNYLGEFDSKLFESNEFSVMEKSIGEEVSGDLLMQNPLILNMLIHDGDLVASFIYDKKVFSKWKLKKFSKVFKKTLIRSIKQ
GUT_GENOME096513_035422629-2831LLKALAAQRHTTPLGILLTALSQACFDLKKQPDLLLHLMSYQRESFLPGIAIDRTVGFFAGAYPVRIRFGKQELAGHTEALLEHVKRTLQEIPNEGMDYFALRHIVPELHPEAEPLVDDSRMLFHYQSEETAWQADDFYEPLTLPYGNTNAPDNPSAYWLNMTACLKRDRLSLTCYYSTLHYEEQTVAGLVHRFAGYLRQCIK
GUT_GENOME000031_03781166-433YWSHIAAEQVSPLPKDCETEQRIVKDTSSVLCELTEEDTKHLLTDVHQPYGTEINDILLSALGLTMKEWTASAKIGINLEGHGREDIIPNVNISRTVGWFTAQYPVVLDMSEADVSAVIKTVKENLRRIPDKGVGYGILRYFTETSETKGFTPEISFNYLGQFDSEVKTDFFEPSAFDMGRQVSGESEALYALSFSGMIRNGRFVLSCSYNEKEFERATIEERMERFKENLLMLIRHCTEKEEKEFTPSDFSAENLEMDEMGDIFDML
GUT_GENOME118330_006322643-2879TVSREIDGDRIKKLLHDGIEKLGAAPNELFLTALYCTFNQNFGIEKVAVRLEGHGREDIINSEDVSRTVGWFTSAYPVILASNDSDDLVDNVLSIKDSIKRIPNKGIAYGILKYMRSDIKLEKEPVISFNYLGEFDDDIEIGEHRYGEFRDADAERDIYIDFNLMFIKGKFVVSITVNSKLNEDNRISSIADIYVDNINKLIDVCDKQKERIYSPSDFDGCNLSSDELKNIIDKYEN
GUT_GENOME105860_006331294-1518NDRVKDNVSAQVTISHEVTGSLLKIVETTTRERSFTVNDVLLTALTKALTDIFDENEITVMLESHGRESSIVNCDLSRTAGWFTAMYPVRFEFSDISDCVKMLDEVRAELKRVPDNGIGYGVLKYMNRELDDAQKYIRFNYLGNASGGHEDSRIKKVIPDLAVNSGKENINTCLIDVNCIRMDDRITVRFECSGLFMDKHKLEELALYFEADLNELINKYGNSDG
GUT_GENOME172934_01508850-1052KDAKELQYKLLAAWSETIRKWTQQDEINLSYTLNGRRGIQADSDYDLSRTIGWIAFNVPVKINLKGKNTSFEIIHQVKTAVQSIPHGGISFVCLKYINKDEELSKKELPNITFNYYQVNTNTTLFEDSGYMITKSSQNIGLMEDPQKNRKRILDLVVRRTQDNELYVKIHYCSKIHSQKTIQYLADEFKNQVQMFCNQTQNKG
GUT_GENOME000711_033263311-3584IEQHETVALPKEQTHIDKSALKKRNTVSFTFAQQETDAILKDVHNAYNTDTQDLLLASAALAIWQWVPQKGLKIALEGHGRQSEAADHDISRTVGWFTSIYPVLLQLEPNTWDDHHEEYMIRTLKTTKDTLRRIPDKGFGYGVLKYMTPPEKAKIEFGKAPDITFNYLGQFEIGNRAQTETEQLDAFTFSPLGGGEDITGTWKREQSLDISALVAEGKLTVNITYETGRFQKETIERLSQDCQYYLRKLMNHCLGKTETEKTVSDFDDRKLTEE
GUT_GENOME194075_008204300-4493QANHTYLTKSYELVLAGLIKGLSSYTDRPSLAVMLESHGRYDLENNSKFSQSVGWYTSLYPCDFSLENSKNNSCFIREVKEQYRDLPNLGIGYGLLKYNRKLLNEYLEPDIKINFLGDLGKETHKGQFSLENWLKDYDSSPFNRQNFALEITAYFFNGELHLIVKESQTFKNAEFADLVENLSQSLLEVIDCCK
GUT_GENOME239690_014493397-3644RTSIAVRLDKQDTQLLLGTAHQAYNTEINDLLLSALGLAIKQWAGSKIFAINLEGHGREEIMEEIDVTRTIGWFTSMFPIVLDMRDSEDISSVIRRTKEMIRHIPNRGIGYGVLKYLTDVEHTQDLDFNIQPQISFNYLGQFMDGTQEGSFESSPLSPGRSVGEGAESHFTLDINSIVSQDQLLLEFGYIRGEHADHAIEQLSLMYIEHLKMLINHCTSISQKVLTPSDYQDHTLSIPELDQILKQFG
GUT_GENOME196903_00147819-1050EHTKHMLYSISDAFNTEINDILLAALVRAASKKSGNSTIAINLEGHGREPIHKEAYTDRTVGWFTTEYPVVFENVGNESSRDLITVKETLRAIPNKGLGYIILQNTYPDMFTSIDPDITFNYLGEFGQEGNYGFCISEIKKSRDKDITNAFRTPITINSMIINKKYHMEVTYENNIFSGEYIEELIHYFLAELENIIEMCKGINEPIKTPSDYGETGWNFEELQYVTRKYEK
GUT_GENOME044254_003222267-2491KIYFSEQITRQLLYEAGKTYNTEINDLLIAALLMAVEQVTGQQKVTIGLEGHGREELDVPIAVDRTVGWFTAMYPITISCFDKIEDAILNAKDKLHKVPNHGIGYGLIEGAWEQMKADISFNYHGELMNESVIEINEIISTGNCSAAENVLPASINFNGSIEKRCLVFQVLYDRSQYGEEIINKLSQMYQIQLERIVQHCIEAEQKVQTASDFTADDLSMEDFEM
GUT_GENOME001364_048662349-2605EFIPKKHEAANHNYENSKTLRVSLSRTETERLLKETHKAYNTQINDILITALLISIRELTGENKLKILMEGHGREDVLQGVDISRTVGWFTSIYPVFIDLEEETGLAMTIKMVKEALRKIPNNGIGYGILKYLTKDEELLKDERPPILFNYLGEIDHDMTTEQFSSSKLPAGQSIGEKSARDASVEIDSVVVNHQLIISTTFNGYEYAEQRIRDFNQTYKESLQTVINHCINKNETEKTSSDYGYDKISLKDLDELL
GUT_GENOME096393_004871263-1516KDYAVEEAREASAEQVTVVLGEKETHALLQEIAVTHRAHMNEVLLAALVQATADWTGHPVLSVDVEGHGREEILEGVDLSRTVGWFTSIYPVHLDMKGAKTPVEALKAVKEQIRQIPNKGVDYGILRYLHPGIGSIFQSQPKPSISFNYLGQFDQTFSQEAMFTQETGFARVDHAPESKLSHLIEVVGIVTDGKLQLTWIYSREQFAIATIQEVAENMLRKLDLLIHSSATEAAFTVSDFALANLTENEMNKIL
GUT_GENOME148113_002881254-1456TNKLKELSVNEYNGNLSYLLQTVLLSAYKIFNNRDQIQIWIEGHGREEILADVDIYDTIGWFTSLYPIFYRCFGQLELQKEEVLNEMNKIPNKGVGFGIINYLMPNKYKLSFPKIKNKEICFNYLGDMSFENGTLELSPINTGEFLNPDSKRVFMMDIVAKISQGKLIVEFIVNAEYARNELNSDFVKKYRNILINLINETSY
GUT_GENOME177033_02054829-1047SVVHEASVQITDELLGAVPHAFRTGANSVLLTALSVALARWRRNKQTWTLVEMEGHGRETRFVPGPQGREADLSRTVGWFTCLYPMLIDPTRAAVQEASTEVEGSIAPLALALNAVKDQLAAIPGNGVPYQAHTWLHQSAKTPPQAQVLFNYLGRVSAGAQDFAPAGSTGQLGEQRDPDQPLVRELEFNAIAEDTGEGYVLRTTISWARGRISPERIDE
GUT_GENOME207783_005982381-2612EDADRICFTLDTDTTGRLLTRAGPAYRTQINDLLLTALALAVRSSTGLADVIVDLEGHGREECVSARDVSRTVGWFTSMFPVHLTLPAGSDIDAAIKAIKETLRAVPDKGVGYGALRFLSDDPRHAALAAGAKARIGFNYLGRFDALSGRYITLSEESAGTAVSPRNRLIHPIEISAYVSDGALKVTIEYSGRCCASSRAQRLADAFATALGDVVTHCVSGAGGLTPSDCPL
GUT_GENOME001586_009682748-2979ETEEVLKKCAAKINGSVEDILLGTMILAYGKVMGNREDEFCFELESHGREGLPTEVHLDQTIGWFTAIYPCVVKLTKDSLKQVLYVKEQKKNIPGKGVGYSVLSMAGKVENVTAPVRFNYLGDNRDTTEFKMSQMPAGDAVSAENISDQICWDMICIDGMLSYKLVTRKNNFLEQMSGNENVIDLWFEEFEIALNEVIKAIEAEERALYTPHDYGVELSLDEFKEISDNYDF
GUT_GENOME239690_04598997-1252LPKDKAVESLLLQDSEVVTAQWTVEETDQLLRKAHRAYQTETNDLLLTALGMAVSKWSGIGKIAVNLEGHGREPIIPNIDITRTVGWFTSQYPVILDLGNDLELSALIKSVKEELRRIPNKGIGYGLLKTMASQEDADSFSLQPEISFNYLGQFDQDLQGSSLQVSPYPTGSAQSLLGEPAYTLDINGMVTDGALTLTMSYNGKQYKSSTMKQLAGYIEESLRQLLQHCVTQERTVLTPSDVLAKGLSIADLEELS
GUT_GENOME032990_017381931-2138VKNNRKIRFILDKESSQLLVDSQKINLHYSVQNIVLTAVITALNEFSGQNRFRITLGNNGRFVADQLLNISRTVGRFASLFPVILTANEKSNFLEKVLTTAKILDSIPNQGIGYGILKYFKKEFDEDIQPKISFNFMGDFDNVFGSGFDISTRSALYDIDQEAEWPYYLKFVVLKRNDQILVSIEYNSEIIDDSEVKKISLSIQRQLE
GUT_GENOME239690_014501109-1388LAQELEYWAQLEQTQSGALPKDRSYETNMLRDSENGSVSLDNENTSKLLRQVHQAYNTEINDILLTALGLSIKEWTQQSRVLIYLEGHGREEVFQEMNISRTVGWFTTMYPFLLEVDNAHDLSYQIKNIKDSLRRIPNKGFGYGVLNYAQVNESGVTAFQAAPEIAFNYMGRFDTELDTDVFTLSKFSSGEEASPFTERSVALDINCIINESEQLIITCNYNKQEYHQQTIEQLLFLFQKHLTNLIDHCCSKQVTEQSPTDFLYDKLSIDEFQNLSQLLS
GUT_GENOME186025_01609855-1111EYTSLPTDGHLNGTRRLKNQQRVSLDFDREQTDRIKNAAASGYGIQMNELLLTALMSVLKRQFGVEKTSILMEGHGRENFDEAIDISRTVGWFTDFYPVILTGRETTEETLYSIKEQLCQIPSNGFSYVVLKYFTEQEFHILPQIQMNYLGEFDHNAGQEEEFFTSQSIENTYDMDENSPCMTKLAITGSILQNCLSLSVEYDQMEYHQETMERFMQLYGQALSETADFVLEKNEKVFTPSDFGCAGMDRKDFQRIC
GUT_GENOME000500_018292056-2249EHILLTSVLMALKKSYSISNIPILMEGYGREEYLTQTTLARSIGWFTNTYPLVFKVKDDVLQTLIQVKDIINRVPHNGIGYGLLHLVNSNLQDLDPEISFNFLGEISSESSTQFMTLDNIELDCLINKSNSNPYLIEFSVFFLKGELFVDVYFDSTLISQEDVEIMNQNFVQTINELINCSEDISTMRSISDVT
GUT_GENOME096372_049933984-4245PLDKEIEAGLPSLRNSICAVIPAEKTSRLLTGVKDIYSMEAREAILSALGKTFGDWSGLKRFRVELTADGRKLCADKRLDFSRTVGWFDTQYPLVLDTSNSGDLAGYLKQVKETVRQVPDGGIGYGALYDLASSRRQVEETAVRKKPRIGFVHLEPSGVVTPYFDSFNAGERITDDLWAAREYSWYLKSTIESGALTLRLDYDGHAYTEQTASDLLHRLEENLDEVLRHCEAKTERELTPADVGANDLDLDEFEDIKQFYES
GUT_GENOME188391_038232959-3226IEKQNFIPIEKDIRLSETRKLKDCETISVILDELYTDKLLRETNTAYKTEINDILLTVLVKTIKNWTKNGQVLIDLEGHGREDIAENLDITRTIGWFTSKYPVLLHLDDCYDLSDKIKNVKETLRHVPQKGIGYSILRYITLPAWNKEECFCLEPEICFNYLGQFDNDLTSDVFTTSSIDSGDNSSKENTSMYALSVGGMIAKNQLLIKFTYNTKEFYKETIETLSRSYIKLLQEVIEHCSDREEQEYTPSDFDDDSLSFTELDTLNE
GUT_GENOME000714_039171250-1498NREKENSTVESCTTLTFTLNADDTRKLLTTANVPYNTQPMELLLAVLTENLGEYTNQDQIKFELEGHGREELFEHLDISRTVGWFTTMYPIVLQVSHNIANQIKSVKEEVRKRPNKGIGYGILKEKFPSCKDLTQNIIRFNYLGEIDNSLGDGWFTIVDYQSGSEQSKDNHITALLDIVAMVKNKSLEIHVTFSTSHFQKNFLENLFSDYLERLQTYIQSCCQMQTKEFTSSDFETLNLSDEEIDNLFD
GUT_GENOME239690_01254564-801YASCAVVEEHLSPVQTYKLLHEMPQTYRTEPVDLLLAALALSVKEWIGLDDFTYEVEHHGRSVDHIDVSRTVGWFTALYPLRIQMPGTEIGSILAYVKELRRRVPHQGIGYGILKYMLGAIEKHTDIRPIRFNYLGQFDHETQSSDYVYRHTIDNQNVDNANHLTAAIEVNSLIVNDQLVIDMMYSTEMFEEESITEFMHMYVQAIIRIIDFTVCQQNVHFTPSDFETAQISLEDIEL
GUT_GENOME096513_041652079-2323LEALPKDGPGARDCTLQESKLVTMVIPAAESDELLTKTHHAYGTEANDILLSALALALYHGMNVRSIAIQLEGHGREEMLPDIRVTRTVGWFTSMHPLVLCLASSEIGETIKLVKNAVREVPNKGAGYGVWKYFGPGRAAYAQTGFPAPEVSFNYLGAFEAGEVLEGLSQSAAVSIGDSISPLTKMAAPIEINGALLGGELHFTFRYNPYCFKPETMEELSSRFYSSLCEIKDHCLEQKERALTS
GUT_GENOME200288_031501259-1482TKLIMSTVNKNLQTTSKDLIVAAVVLSIWELTKESNISVMLEGHGREMFDHTLDINRTVGWFTSLFPVNFFVESGDLTDQIIAISNELNSIPLNGMGYGIHYYLKDVSGVEERRLVRLNYLGEMDNGLAGTWFTPSDTNTSADVCPYNDLNCLLDINIYVMSGQLYIQINYKEDDFSVNTIQLWMNKIQNKLTEISLIRSVVSDPIVDLTEHKLNQEDLDALFS
GUT_GENOME141725_029591242-1510KDYWNEQVEKETMKIPMDYPIQETTEESIDQVTRTLGIEETQALFHEVLVTHKTRIDEVLLTALGQAIVDCTNQQTVSIHLEGHGREEMIEGIDLSRTVGWFTSIYPVHLNFQGTQTPIEGLKAVKEQLRRIPNRGVDYGILRYLNKGLLPFYQQKPSISFNYLGQFDQVFSRDSLFMQETGFTFLDHAPDSKPSHLIDVIGMVKDEKLHFIWMYSREQFSKLKIQVIADGMLRHLRQLINKPTTESAFTVSDFADAEDLTEESLSKVL
GUT_GENOME126325_009873755-3976KSNNAFGTNQAELLMAAFLRTLGERFGEGDYCIGLEGHGREQLGLNVDTARTVGWFTAQFPMDFQVRISEKWDELLIGVKEKMRSVPKNGIGYGILKYIASQDEPDLSLELKPEIIFNYLGEFSKSNEKDGIEIVDYDYGQTVSENQNINHVLEINMMEADGILQLSIVYDSNEIECVKIKELVDLFENNINLISEYCCAKEQTVLTPADFHNESITVKELS
GUT_GENOME207869_004731310-1554RVADGEEVVVAFDADLTAKLLKDAPAAYRTQVNDLLLAALARSVGRWSGLDDVLVELEGHGREDIFDGVELSRTVGWFTTAFPVRLSGCASDDATLIKSIKEELRAIPSRGLGYGVLRYCGTRAQSAALAESAEPRIVFNYLGQFGASLGPDSAFSIAPESAGPSRSASAPLGRWISINGQVLDRELQLTFGFGRRRFRRATIERLANHYADALRELVAYCTSGVHGITPSDVPLSRLSQPELDL
GUT_GENOME207386_004691927-2169DEVVLDVGAALTAQLLEAAPAAYRTRINDLLLAALARAVAIWCGRDDIAIELEGHGREDIFADADITRTVGWLTSAFPFRLKGGADADAALIKTVKEALRAVPDHGIGYGILRYLGRPEHREALSRAPNPRVIFNYLGRFDQDLGAAAAYRIGAEDPGAMRAADTPLRAWLTINGEVRDGSLRLAFRYGRRRYRRATIERVAELYRAALRGLVDHCVQSAGCLTPSDVPLANLDQVTLDRLSA
GUT_GENOME001364_020803854-4103KYDDSETVSAELGEEETRQLLRETNQAYHTEINDILITGLFVAARELTGQNKLRITMEGHGREQMMDGIDVSRTIGWFTSKYPVFIDLGEEKDISQNIKMVKEHLRKIPNKGIGYGILKYVAKDADIIKDGKSPILFNYLGQLDEDIDNEEFSSSRLSPGESIGRGIIREHPIEINAIVFNGRLTIQTTYNTKAYNEDIVRTFTETYKEALKTVISHCLSKEAAEKTPSDYGDKDITLYQLEEIKQKYKG
GUT_GENOME171377_019861253-1457LDEETTKNLVTSANKPYNTKTVELLLAALALAIPNMGKTEEFIIEVEGHGRDEIYNDFNISRTVGWFTTIFPVAVTSGPYDLAMRIKAIKETYRLSSEKSLEHAIYCMRHNQWDKSTFIRFNFLGEFQTHYEYFELSSDVLNHDNEMTSLIEMDSMIVEKRLKTAVRGRSPLLAEDLNRLLDYYHEMIQSIVHHCMNKGDVDYSP
GUT_GENOME076538_003343654-3862QSSDFQISTDLTVILLNNINKVYNTSVNDILLTSLARALNRIWPSEETYVQLESHGRANIDPDIDIQRTVGWFTSIYPQKISIDLVKTKLYSKEIPDLGIGYGILNGIKTEDLPQVLFNYLGQISVAETEWSIVKTDLGPMSSSEPNESLVINGGIYQKQLFIELSGKIKNLDLISQYLKEEIINLINNLHQITRTYLTLDDINYLIEQ
GUT_GENOME248579_007811240-1433TPELLAKGSGRYQGNTGEFLLASLFRAVQTVFNTKEALFELEGHGRDRPGISPIERTVGWFTVLYPVHLRSACSDPDGWMNEIMGQCRTYGDRRYEWETIHMRDNGTFLRAQPIRFNYLGELRPQTQGVFTLEPFFPTTETAGENRFSCLFSVDSLFLGRRLHAKILFNSHAFSYTIIQRLADEWKKELSALLL
GUT_GENOME063749_024752260-2476FVTNSECANKKLCIDSKTSGILSNQTQKILGIDVKEALLTAITFANKKSKFISVSIESHGRAEIENVNPNRTVGWFTSIYPIVLQKKDNMVDTAVEVKKRLHSIEKYGIAYGIGRYVTKTLPDYVCDICFNYFGGSENSNNSQYFETKYEIGEDIASDNNFCNEMNISGYMSNSGLVLRFVYDTKKYNDEEIENYIKNIKKCLHQMARTCFVKDKKM
GUT_GENOME237425_00834788-1011ITLSKEISAKLINEVNNTYGTRTNEVLLTALGLAAGKIAEGTVGIMVESHGRTELHKPIATERTVGWFTSCYPVVVNNNDNVAEELINTKETMRRIPKNGIEYLLMTEGFHEKADILFNFYKTSLAEESRENQDISFGGTSVFPGKINVNCFAIDGIVTINISVPKSRHKANISEELGLEMRKQIEKIVSVCTETDTIIKTRSDFSDDTLTESELDELKDLFDW
GUT_GENOME034405_007471240-1492KETSEAGHTGKMTEEIPLSVIEGILEKNREKLHVREQEILLAALGMVLKKKYDVKHYSVAVESHGRDAVMECMDVTRTVGWFTCKYPVCFAEKETISDMLIEIKKRLLEVPENGIGYEILRYLNHEESLSTEPEIAFNYLGDTDVNIENEDGNGVSLALSEIKSGETMSSNDRPANPLILDFISSNGNMQMMISYDTSVFEEQEITEFAEAFKETFEEMNREETCKEEVVYVNELSSEILSDADIDIIEGLLF
GUT_GENOME025996_003871227-1445VSGKTFGQRKKINFHLDKVYTEMFTGKANEKYHTKPDELLLIALAFALEEMYQFSDMRVDVERHGRDILGDADISRTVGWFTNIFPVHIRMTEEGYIEKIQSVREQIRQTGSRGYEYGILKYIKKQLEDREKNICLNYLGEYVQRPNKYFTLSRILFDNAVSPDNEVDSCFHINAFVMEKCLNIYIFYIDALEEDYLPHSFSSCYRRHLTDLLEHCVDN