UHGP-MC 1076


Information


Number of sequences (UHGP-50):
171
Average sequence length:
535±30 aa
Average transmembrane regions:
0.14
Low complexity (%):
1.64
Coiled coils (%):
0
Disordered domains (%):
13.72

Pfam dominant architecture:
PF18814
Pfam % dominant architecture:
4211
Pfam overlap:
0.35
Pfam overlap type:
extended

Downloads

Seeds:
MC1076.fasta
Seeds (0.60 cdhit):
MC1076_cdhit.fasta
MSA:
MC1076_msa.fasta
HMM model:
MC1076.hmm

Sequences list (filtered 60 P.I.)

Protein Range AA
GUT_GENOME175199_020301076-1613LKAPVEETKNLIALHNLTEEKLWGDLRLGGFPMPSIAVTRTDVPHTNFGDITLVMDKRSIDPKANRKNTVYSADAWTPTFPQVEYAADPAAERRISGRLRELSGKVDEMFKQDLYRITYDMEDLLNRYGGEDELLRQVMDNYGLKAAYLEDAGHHVSAATVQREADKGYSQDRVEKYRAIVETLGTDDPDEIGHIPLKELRDRYGDELETAFPGMTKSTFRLSGIIRQVQAYLQDQGGEPVYETATDAAATRRAIDDALDRDGYERWVRELYSGVQADSGIYNNKERFTPAGNRRTFKQTHYPVTLENIVKAMKGQNGGNTKNVSGFYGVKSLRAGTAKRFKSIADMHKLEGRLQNLTEEQTKAIHDELDRRLMDITEALAEKSPNGKGSFDYYLADSIGNMLVEIADGEVYTIDAIMGKFNGEYGFHIGNELAAQVRDLLFDVSQMPVNIFEAKPERAVGFDEVLAAVLPDDASGELRDALSGRGVNVLTYKAGDDADRIAKVNGVEGARFSLKEDTVSKNYAAILEENELLREQMK
GUT_GENOME225606_00180785-1264FSYDASSQPTDRLVAVHHIDADNLLKANALGGLAVPSIGITKVNSGYSGFGDITLIGTKGLIDPATGTQVYSADAYTNTFPAFEWGKSVDKKKAEALRQEYRKTERFFRGGMDNTMRSLIDSPDRDDFMWQFRTSVVSQKMFLDEKGIKVEPVYQQPYAGTPLEHLLLPLFQKIDADQSLDVAGSMQAYNEAYVQAVDALGDKATRVQKRNADKIRNGGRLADAYLYTLINNVEKIGKAPTEPQIDSYATEKRIREVFENNSEGFDAWVAKKTEGLFGEPKIKVGGKLVPVTLQNVVKAMTKRVAKNTQDTMTFGPGKVRAAASKKFSSVKSIQANRERVVDSATANEANKSIDAAMSDFRRRAADFYGYKDAFAAMDDAMRALADSAKGKPTTEKVRAALVKNGFKEPKGGFPSELLDSGVKILTDVQKGLTDYFEAKPQRAVGLSEFAGAVVPEGTLPEVLKVLEDAGVEVRTYAKDT
GUT_GENOME100684_001491538-2081VKNVDNTAPTENEDIRFSLSEPVEEKGNLIAVHNIYTDKLVKSLKLGGFPMPSIAVTKADMGHENYGECSFVFDKSTIDPKSDKRNKVYGGDAWTPTYPAIEYKVSEKIADKARRKYYDLYKQYGEKVRDMYRYSVTLEDTLNSDGGEQKMLDKLYDDVSMMQIYRLDHGEDVISNVVKREEKQTLNEQEIALSKQLLSTFGNRIEEIAARNGENPLPIRKAFFNKYHDEVTEAFKKYYRSTGMSIEEAEAKVANTKTFTFVSSLAKALKYKHNKGITVTEDIDYEATDKAIRDSVDQSAYRKWIDGLFKGAEEKSGIWNGKDFYTPSGRTRNFDALHYEETLENVVKVMRSEMNGDTLFGGMGIWGVAQKEYSSIDKLKSDSARLQKMSDDEYSNIKSGFAEQLSEVTKAIKNDYSGNDFIDSDIVAGNIVDAVRKSKTKSGIKSYLKEYYPDITDTAVDDIVNLVHGIMEMPTGYFEAKPQRAIRFDEVKYAVVPETLDENIKKQLSEYGVQVVEYENGNEEDRTAKLNSLDDVKFSVSEQT
GUT_GENOME262714_003821485-2070QDAGENKENNAAVQKTVRFALSAPVEVDGQKELVAVHNLTEQNLREALQLGGMPSPSIAVVKAREGHTKYGPVSLVFGSDTIDPMVDKANRVYGADAWTPTRPGVEYEVHYDAMRDFENRVYEASWEAFDGKFVNSAAVQRAGVDEASSLSREELAQKMQRDTGVQLAYLKDKGITVEPVYRMEQEQFDSIGNDALEAVIRHTGEAQLKEAFEGGDIDQLDKLADAAADALEEKYTHGALEGQNKRWMLRINKLRNENRGRLYQMLEHAYKMVTDTSAGKQTLDVEATRNAIREKAPEQKVEQWVYDKLEGVLGEKGIRNEKEPVTPSGKKRSFAQLHNPYTLENLVKAMNSQNARGQDVWGVSASTLMSTTTAEYKNLDEVRADKGRLQQMPEAEYKKLLEDADSQIEQVIRMLRKETTPHSDNSFEEQEILGEILLRAAQGKHTAAAIGKAFAKEDYVISKDAAQRILALYKDVAKIPTGYFEAKPQRAVGFDEVRAAILPDNASKALVQQLKDAGVPVQLYKAGDDEARTALLNKVPNVRFALAQQADQEAKGSDQRQASRAIADKAAALDTLGQFFDLTRGV
GUT_GENOME101363_000281218-1735IDSEKTFSLDVNMEEGKDLIAVHNLKEDNLNKALELGGFPMPSIAITKKQIGHSGYGDISVLFKKETVDPKNKDNRVFGADAYTARFPRVDYKVNEEELSKLAERIGTSANYIDANAFNGNDLKTAAMKLSEMIEVKERFIRENNLKVEPVLRMPTAEPSFMGRGAVKEYLNNENITVDKLVYDKKVREGLLDTIREKTSLKALGEKWATKINDNLSDMESDKEIYNILKEQYQTYIDIAQGKAKPIEDRYSYTDGVNNTIRDNQDAFELYTLELAEPVIGKAGIYNGKEYYTASGNLRSFEALHDDYNIENIVKAMKSAGVKNEEGSLIPGIGEVRGAVSKDFKSIEDIKKQEGRLLDLTPEQVDEIYSESRELYQKIVSEIVDNNKTISDNSFFDRDIAGRNIIEAVKGGLKPEKIQKSMQKYYKNISSNIANDIVRLGTILKDIPVKYFEGKPQRAVGFDEVAAVVIPDSISEKTKDKLKQAGIKAYEYKTDNEASRKEVVNRAAVEQDTVFSID
GUT_GENOME061558_00075938-1440PGYYQTSIQDPLITVHNISEENLNKSLDLGGFAVPSLGITKQRTPYADFGEISLIGTRDMVDPSRGTPVFSQDAYTARFPQPDWSKSLNTEKATPLVKEIMEAKKEVNENGPSGYYYLQYEGDRDKAIKDLLTHASGMYLFIKHKGIPFTPIKQQAKKPSFAAGIVSDFVDNNGLDVSTVEEGSAAHKKISDGVVEALREKLNEPTGLKGLRMRAYGDNIDAYEKTGFIPFKLIEEVKKYEVERKKAEGGDAPVSSGATQLELEKLIAPMRDEFEKFISDKVNSVYSAPLIKVGNSYRPITLQNVVEAMKSSAVSNKESTLFFGPGKVRAAAARRFDSIEDIQGNRDLIADSQTVNGINAAANDKMSDFRDLAARENRGDEFSTPDRAMEALAKVAGQKSKPTPARMKSALKKYGFEPSDETVKLGVEILNDIKKAMTDYFEAKPQRAVGADEFRGAVVPDGTSEATIKRLEKAGITVAKYDAEVEGARERVIRDLTNRLNEK
GUT_GENOME005885_00430857-1424NYENSGKPLDKQVRFSMKDTVEETRDLIAVHNLKGAELEADIELGGFPMPSIAITKPELGHSEFGDISLIFGKDTIDPKRSGNKVYNADAYTPTFPQIGYKPNEKSIKRLRDLYYKVYEKAGSDIAYPLYAFGVDDAGEAIARYGGSEQAIVKSFADNTDMMQAYLVDQGKQPIETIQREVVTEIGEPEKERYDFILNKLGKDAFEAMGTGKSRATWYEKYGESYREALKQQLIDAGIDSEAAASVVADGFSRGDVLTDALKVRSYMQNGPRRVRTQPDIAATQKAIKEAVNPSEYKAWLENTFSGIVSKKGIRNDKDTFTNSGNRRSFETLYWEFNLENIVKAMKSQDEKGASGLGTKSIFGSSAREYESIADIKADSSRLQTLNEEQYAAIKETYSNRFREIVDTYAGNKDWWDAANTLCEAVALRKTRTGIYNYLNQWPQVYKASYDIVDSLISLTQDIAKMPTEYFEAKPQRAVSFDEVRAVIMPEGKYEDLQTALESRGIPVETYNPDVEGERLSKLNSEFTERFRFSRKDDVSVTPEDAKKLQKQNDKLKQALEVAKQEIKL
GUT_GENOME176045_007661085-1583PAANAEGSLIAYHNTNETALMNALESGGLAVPSIAITNRNFDYDNFGNITLVGTRDLIDPANNTDVYNRDAYTTRTPRTEYKKMKSADVREFIKKWEKEFEKIDERAFQELRYHLENGNMRSAYDTFVYSRALKYYYLTNVREEKIELPHENLLKDDIFKDEQFTEELKEIIQNREDYTDAELHKTITEKATAAVLRLWGDDSTLVEVMTDVYSNPDEHYKLLQELRNDLRNSQKPVDTYAFDRLIDSKYRSIVNSDDYMRWAKKRFEQYAGEPQIKIGSKYYPYNLENLVKAMLKNKGKGKEDTLTYSSSVVAANAARKFKSVDEIRKYADQLAKSEDVEKQIDEINELASEYQRLVAQYYKYGASSFDVFDDSMKALSKSMTVSADSTVPKLRRALESLDFENVPDDVIETGVKVANALKHAATQYFEAKPQRAVRIEEFAGAVVPKGTNPELINRLKEAGLQVEEYDKSVPGSRRAATQKVQNAAQSANGNVFFQS
GUT_GENOME248162_000241475-2051NDGTLKKNIRLQLAAPVEVDTQKDLVAVHNLTEQNLQEALELGGMPSPSIAVVKAQEGHTKYGPISLVFGSDTIDPMANSANRIYGSDAWTPTRPNVEYEVKADKARELNTYLAQLSQQTAEGEFARRNVLSGTLDMEASDKSPQQLAEQLAHNDAAKAAYLADRGENVDVVMKQEQRFTESQISRYEKVMEAVGGKENLREIIETDRANGNHDMANAVLEEVRSAEKNWAMQELGWDAEKAEKKAAKLIAPMLRIRLENAYEYATTEDTGGKMVQDTDAMRQTLQEKAPDADVEKWLLPKIEKILGKKGVYNGKEFFTRNGNRRSFAQLHNPYTLENLVNAMNQEDARGKGAWGLSANTLMSTATAEYETLENVRADKDRLQQIPEEEYKALLEKADSQIEDVVARLRRETTPHTSSSYEEREIIGDVLLRTAQGKRTSTAIAKAFAKEGYSISKDTAQVILNLYNDVAQIPTGYFEAKPQRAVGFDEVRAAILPDNASKSLIAQLKDAGVPVELYKAGDDAGRTKLLNEVPNVRFQLAEQASRDAKRNDQQMASRTIADKAAALDTLSQFFGLTR
GUT_GENOME217315_019312042-2567EGIDLTKYRDEKTLAGVHNISAEKLKKAIKQGGLANPSAAVIDVEKQSHEGYGEISLILPSSLVDKRTGRNAGTWTADAWTPTYPDVERQFSDGGGERFARDMQALPKAMYAETRMGMNDWLDGRGSRQLAYMFLHERGEAPELQIIPARYGDATRKAVENATQGAFDLYNAGPEERQAVLEAYIGEKFGGDRAAYEQALAGTVARMEGLLNHERAMVRRRAAETLAGIRETGFDYKELSDFARSVGDDVRRGGKADAAATNEAAVKQIEDNGLQGEFDKWVDGFEERYGVREVLFDGFTPSGTRRYLPNTLENASKMMKTAGRNGATGMGISFSNFAAGLLDSLGSLKDIRARKGSLTTNHEEVDAFRDRWSQVFFDLGEKCQPDAEGYDDYGLYRLAEAAQKRDPQSYLKKEYGVELSDEDTARLKEMVKAIRTEFPAMYFETKFERPVYLNEMAGAVVPENTDKDILDALDAAGIPYRTYGAAEERPQVIRKFSEELEGVRFSTNGEKARIREQARRDGTYLK
GUT_GENOME010885_007241109-1686NTGADATKIAGNGIRYALSIGNDSVSVDVQEGKDLIAIHNLTEQNLLDTLKLGGFPMPSIAVVKAEQGHTDYGNISIVFGKDTIDPMLSSDNKIYGSDAWTPTSGNARTEYQVDGEKMRTFERRMEALSKKTAGGVFYNASLVSRLGVDDTSSQTPLQLAEKIATTDEAKLAYLVDRGETIEPVYKEKIYNRFGNETLQTYIDAVGAEHLSTIIEKAKIEDTKAFRSEEEGVRSIIRKQYVERRYETLKNNPKLKGRDVLAFVEQQADTYMDHNVTIFTIEDFVKDAWKYYRDGKSVTTEVDRLATSDKIQQAVSDADVIAWLEPQIAEFIGRAGIYNGKDLFDAKGNRRSFAQTHYAYTAENIVRAMKSTAPARGAGAFGVSASTLVATATPSYDSIDAVRADKKRLFVEDQTAYDAAIAEIDESLTKVEKDIMRTTPHHTDNTFDETQIIGSIIVEAATGKRTIAAVQRAFSAEGYAVTAIQAKRILDLYAEAAEIPTGYFEAKPERVVDFSEIKMVELPQSASDTLKKQLSERNIPYEVYGTTDAERTKAVQRLDGVRFALPETDSDGNTLSEQQ
GUT_GENOME126690_015641620-2134SHNDDIRFREVKDKNGEKSLVGLHNISEEKLRKALRQGGFANPSAAVIDISRQSHTGYGSISLVLPSSMIEKRTGKNAGTWSQDAWTPIYPTIERQFSGKGSDAFSKDLQKLPEEMRSTTKSGMDSYMDGRGEDSLAYMYLYEQGKAPELARTKPSYPEKTRTEVEDATNGSFSMSGLSDKQLSRLKDAYMEYKGFSTEGYNEAIKLRRAKLEEAIGKMNPRSILYEKRKTDLERIDKYGFDYSAVESFMKSVRDDISNSDKVDAHGTMRDSWNFIEENGMRGDFNKWLDKLNERYGIKEIIFNGFTPSGIRKYIPNTLENVSKFMKKQGRSASVGIGASFQNFAASLLDAKGSLKDIRKDKGKLTTDHADVDAFRDKWSKVFHELGEKLQPDAKGYDDYGLYRLAEAARSKDPQKYIKEEYGIDFSDEDVKTLNEMVDAIRNEYPAMYFETKFERPVYLEEFAAAVVPDNVDGDIRKAIYDAGLKIFTYKADDEISRNEAVKQASEIDGVRFRF
GUT_GENOME048377_002691415-1930DLVKLKSENPTYYQTAADKDLVAMHNIDLGNLAAAIKLGGLPVPSIAVTKKQTPYTGFGDATLIMKKDTVDPSKTPVFSRDAWTGVFPKVIRLANMKRLTSFVEKAIEPLQKELPREAADYGDIYNSYILRNANNKNGDVQKMLEDSLNSAGYKYYFLKTINKEPKLKWRKKGLSKELIDHPEILQACQGLEKKYGEQGLKELLYEGPFAMLEESTDLKKANDLIVDELSKKQEVNESPFLKRVHERQQARLVNGETLHSLLSDYLERDKKVFEGESFEQQLDKRIKANQNTFNAWKESFRDEMLGESVIRDSGKPADLENIVDAMLGNLKNAQKGFAGFGIGNVIASSAKKIESFGEMHREADRNMDESTNIETSLDKSEQYTKVKDEIIDFTGRMAEAYKWEDSWESRSDASQVLMSMMSGKTFKASARKFGFTYSAALEKQAKEIIKEVKDLPAKYFEAKPQRAVRFSEVVAAVMPKNASKEIKNYLRAQGVTIRLYDPRIEGDRERVTNSAQ
GUT_GENOME009680_001201434-1943VKLKSENPAYYQTAADKDLVVYHNVSAGKLREVIKLGGLPMPSLAITKRDIPFGDFGEITLIGDKDMIDSRKSRANEVFSRDAYTVRKPVVNYEEPTAQDQKAFHEKYSGTIDELKAKGIDVQKINFTSYSGDEALARMENDVAIRYFYIQNVLKKNVPIHRLTIAPTIYYQEIFRDYPGILIALRKRPRTKNSAFLELDKACQPYFKRLEEKIKNKEGLVGLNKRRLERLQKDGHITIDGAQLLSKWANEYEKSALQKPREEVDKQAFEKDIQKTIEEAGMDKFIVFVRSEFDNLYKDKYLWDNGKKYAFNIDNIVKLMKKYRGTNNEGPGGINYGFNSLLAFLSKKFTSIRDIKNHESLLAPNKKELARYKKAEDMYNRLIDEAAELRGSYGMDLDMDLAELMKDTRDGKKNLHGFPEDKKFLQHIKDFLKEADKVTTDYFEAKPARKVTFDEFSGAVIPKGTSEETVNFLESQGIVVREYDQDIEGDREAKAKELGQKLNVYFQNKY
GUT_GENOME017325_00519506-1042EQVKPEGVDRSVREGKTDDKTLYGIHNISEDKLRKAMKLGGLANPSMAVVDKDKGTHEGFGAISLIAPSSLVDKQTGRTAGSWITDAYTQRYPFVERVMSDKGYKKFKSWVNGLDFAEKEKKEIMRQATDAMESDNDLAWELMYLREKGVDVKGYTSEVDYPWEKIVEEYPSVEDIIKAMDHNAELKETVTKLAKVEITVPIYKAVSADVRRRFKEETGENASPLNPIIRKRTHDIYNRDYAPKLLDENGMPKASDVRKVIGQMVEKYNATKKVDFNKSKSKASTYVHDNGLYADYLKWQEGKLEEFGTQGRIFRGYKNDGTRKYVPETLSNVSKAMKEESQGQTNGSEYTSFGSFIAKLAPRVDSKEEMRGSKSRLDADEAKQEFYEKWESTYYELAKELHSDAFSGEARLHDIVLQSDPKRYAKKEYGITLSPSFLKRLEALKKAIREDLKSAYFETKFERPVMLNEFAAAVVPKGIGDDVRKGLEDAGLVLYEYDPAKKGDRERAANEASASDGIRFNRSEKTELTAEERELRD
GUT_GENOME130957_007121319-1868TNSISTPNENVKFSLKDPVEETNDLIAVHNISESNLLKSLDLGGFPMPSIAVMRARQADANADYGDISVVFRKDTIDPEVNRSNKVYGGDAWTPRFPTIEHEIDYDQASNIYSRAHELYKTKAAFKQPVLLHPDNIEDGINRWGFDKYLENLKDDYTIKQLYLLERGEQPVEMQQREERAEVSEADAALYDYLLDAIPDYHYVAPFQWNKTYGAAFDQAYTDYYQNNFGFTEEQAENVLKNMNTAQKKSMMRAAQNYKENGRETVTVEEDVPATESLIDSKIDQSDFENWVDDTFSGIIKRQGIRNDVEPYTSSGNSRSFSELHYEVTLENLVRQMKTQGNGEGTFFSGLGIWGVAAKNYGTIEALKSDSVRLQNLSDEEYSEIKQGFGERLTEISNSLDSRYKSDNPFIEEDNKMTNIIDALRSSKTKSGVLRYLNEYFTNASESTVDDLLDLVVDIGNMPTQYFEAKPQRAVRFNEVAYVVIPDNASQELKSKLNENGIRYEEYAAGNEQSRVDVLNSLDDVKFSMKEEDNTSVFEDNEELAESAEDV
GUT_GENOME239142_018141766-2293ESVLYSARQETKNLVALHNLTEEKLLKSLELGGFPMPSIAITKADIPHTNFGDITVVFGKETIDPKARRENTVYSADAWTPVVPRVEYEADSKVTDRVYNRLGALKNQVAEYFQHDLDTLRYSLEDRLNRDGGEAGILENLRNNYALKAAFLEEQGTHIEQQTKQVETAVNSISSEMEDKLLNVWEALGKPTPDEIGSTSMKEIRDTYGSELEKAYPGMTKSAFRMSTVLGKLRAYLSGEANATKTSVVADGEAMKKAVDAAVDQGAFERWARELFSGIEKSKGVYNGKERFTPSGNLRSFKATHIPATLEGIVQAMRSENGGNTKNVSGFSGVKTLRAATAEEFGSIADMHRAEGRLQNLTEEQVSQIQDKLGDRLYKLMNDIDAANTKSRWSDNSFIRMDSIGENLTEIGESGRYTPENVKRVFAKYGMEVSDHTAQEVVQLLFDISQMPVNIFEAKPKRAVRFEEIRNVLVPDTASQKLLRELDNRGIPYQVYEAGNEAQRQQMVSQMDDVRFSSRAQQDSEYLN
GUT_GENOME246450_00816213-767GDHLRGENLQGAGEKVKFSLKSPVEETKNLIALHNLTEEKLWGALRLGGFPMPSIAVVKATEGHNKYGPISLVFSKSTIDPQADSRNKVYGSDAWTPTHSDARVDYEVDYDTKREFEQNIETLAKDVAGGIFSQSSVLDMVGVSDQTDMSLEETAHRLATRYDSVRAAYLADKGGDVEIAYRTKEFDSFGNDALKSYLDQVGEQEAARLAAKMLTGERLNTAEVEMVKDAIMESWTAKNEWRLKQKPELREKRIAKQREKLSDLRAEDFARNAWDFYEDGGTTTNEVNRMETAENLRHAVDDRAVEAWVLDMLWGLTGDPAIYNGADPFDARGNRRSFGETHWSYTLENIVRAMNNAKARGQGLWGMSGSGLVATATPEYNSVREMHADEGRLRTVDGAEYEQMVKSLDGELDRVAADIMRTTAHHTNNSFEEEEIIGEVIAMAAQGKRTVAGVKQTFRKEGYIISDAQARAVLGVIEHAGSIPTSYFEAKPQRAVGFDEVLAAVIPDDSSQKLRDGLTQAGVRMLEYQSGNEADRLAKVNGVEGARFSLKEDSE
GUT_GENOME139826_000101604-2151RTDLPSEGKRKYALRKSVEQIEDLVAVHNTTADKLKKTLDLGGFPMPSIAVTKTGIVHSNFGEITLVFGRETVDPKADKRNKVYSADAWTPTVPRTEYEANAKAQTRISEKLRALQGQIPADYRGYLAQYTQLDDLLNRYGGQEGIVEKALSDSAMRAAYVADMGGDVSMEKKTVTEGGVSAERADLYRRTLQLFDNDVDKMMHTPINQLSTVPGIGDVWPSVITHNANRLSRAISMTADFARGKLDERTVEVSDAAATNAKIDRQIDDGKYKAWLNELFGGAVGSEGIYNNKELYTPSGNRRSFAATHSPVTVENIVKAMAGQNGGNSKNVGGFHGVKTLRAATAETFKSVDEMHKRSERLQNMTQEEADALTDALNTRLNDIMGDIVGGTESTYNSLMKMSQAGEILVEIAETKYSAQSIRDTLARYQMPVSEALAERIKALLDDTREMPVNLYEAKPQRAVGFDEIKAAIVPNDMDASLMARLSKVTGQILTYPAGDDAERMRLRDSVEGVKFSLKSESANKVDMDQTLRAGDNSVAEQTKTQDQ
GUT_GENOME096510_006501226-1750KFSLKVPVEQAGDLVAVHNLNEDKLEKTLKLGGFPMPSIAVTRADVGHSNFGDISLLFGKDSIDPEADRRNKVFSADAWTPTVPTTEYEADSSANARVQQQLNGLRGRVEEVFRRELDMAADAEMLLNREGGEAGLLKWAEQKDGLKAAFLEDTGKHIQKIEKRIEKEKGYSENRAEKYEQIAALLGVTDPDSIGEMRLSDIRNEYGEQLEEIFPGMTKTSFRMSSILTQTQKYLKNRGGAIEYDTVTDEAAMREAVRSETDMDAYREWLRGLYDGIEGKSGVYNNKEYLTPSGNRRTFEQTHYPATLEGIVRAMSSQNGGKIQNVSSFLGVKTLRAQEAKSFRSMDEIRAEKRRIKNLSQDDFEQIEKDLNAELYDVTKEILTTNPYVKNDIMAADHVGEVLTEIAEKKSITPQGIQKTFAQYHYEITPGMAEDVQRLLYDISQMPTNMFEAKPMRAVGFDEVKAAIVPDSVSANLRVELRNHGIPVIEYPAGDEKARLNVLNNLDDVRFSLPGVEPSSEEARR
GUT_GENOME259892_004571137-1682SGEKSVKPEMRFYMEDTAEEVKDLIAVHNLSADKLESAFGLGGFPMPSIAVTKADMGHEKFGDISLIFDKETINPADRRNKVYSGDAWTPSFPTVEYKISDKKVSNIRRRIDKLVDSDIQREFRLALDPDNMDDMVNRRGGDAVEAYQGNDALRLAFLRDEGIAFEASTNPKTYSHDAEAIELLIEQVPDAVDAMENYAGYDEIMQYEPKVRGVLNSYYQSKIGESIFDKELGFRELDAALRDAVNIRKEGVPEVVDRYDTRDGLNELFEEHPEYEAKYRDWLENLFDGVVEKKGIRNSKDLFTPSGNRRSFEALHDEYTLDNIVKSMKGDAEKGGALFHSASTVAAASTREYGSISEINQDKGRLSRMTEDEQEAVKKGFEDRLSDIVSRMTSEIKGGNIFSHISNATEAITEALAKRKTASGIKKDLKSWGQFTVTDEIVSDILSLSEDMANAPTTYFEAKPQRAVGFEEVKAAVVPDTLNEDLKTQLEAAGIKTVEYKAGDDADRVRALNEAPELRFFLEDISPVDVDLLTEENQALREQVET
GUT_GENOME163674_000891680-2219EKRLPVKARFSLDEPVEQTQDLMAIHNLDGKKMDSMLQLGAIPSPSVAIVKASQGHTQYGDYTLVFPRQSIDPQADRRNKVYGADAWTPTAANAIVEREVNYEAGRAAEQKIAQLANQVAGGIFSRYSVISGRVGEVARMDEAELAKQLARDDAVRAAYLAEQGKDIEPVLKEKVWDSFGNLALQEYTEKIGAQELAQLYVKLETGERLTAAELETARESIMASWIADHEYAISRRPELRETRTARQRDKISDARVEDFIRNAEALYEDGGQTRDGVDRYATQDKLREAVDDADVEAWVRGQIRGVLGEPGIYNGKERFTASGKRRSFRETHGAYTAENIVKAMNRASARGESYWGVGAKGILSVATPRYKSVDAIHADEGRLQNMPEEEYNRLLQELDKRIGGIVADVQKTAGSYDMDEIAGLLMENAGQDAMRIQQAFSRQGYDIDGGLATEIAGMYRQAAEMPTGYFEAKPQRVVTFDEPVCIAPDDCPPERLEKMKAAGLNVIEYEAGNDEQRMEIARSLKGMRFSVDEPQAETGG
GUT_GENOME239775_016886-505NTLFAIHNTSALNLNKTLELGGFPCPSIAIVKGSEGHTGYGDISIIFPMSTVDPVDDDNYVYSHDAWTPVAPTIQCEVNRDGLSRLKTACKSQLDEKCFLRFGPEWTCLDAQFIEEEVDGFDGLETLFEKNPFMQLLYLKSQGVELPTLQCEEKTEVHTATLSPIVKDVMSVLSYAEMEELEALARPENVSLKENVARRGEIYQKVTDLTGHPYRGLQISTAVSSILNAGKVRHYTEWNDAEFLHDLSEKTDCETYRTWLFEQCEDIVNRWGILNDKFPYRDDGSRRSRDYTHDPLTLESMTNVICQRKNKGDLLNASSFLFKYAAANRLYSMEDIIKASEYLEVIDDETDMESIEHGIATMAEDIADSIRKTHSVDPLRDNAMTAIIEAAMLDTKAEMRRCLEENQEFIHLKSDTAERIYDLKEYVNSLPTRYLEAKPHRAVGLDEIGIVMLPDDKLELRERLENLHIDVLEYDHTKPEMRTELLKSVSYLGMTLPEHT
GUT_GENOME192289_005221172-1675VEEPSGKKDLIAYYNISQDNLDKAIQLGGLAVPSIAITKKETDFSNFGNITLLMPQDVVNPKETPVYTRDAWTGVFPGMVRAPKKKKLTDYISKTILPLQKEISRSVMDYGDLYTPSRIENASAAEGEKRVESFLESDGAKYLYLKSIGKEPKIMRKTAGFGTYSEFTIYPRIRKTFDGLSKKYKLDEMEVPPAEGSTWEKDYNHLKEVMLEEMATPKQDNEPRFKVHRRERQKTEIESGEKFRQLVQEYVSGHRQVMDEETFKKQLKRRTETKAGREGFTQWKSKFRKEMLENPVIESTGKEVTLDNIVEAMLGNLKNAQKSWAGYGTGNVIAASGREITSMEDMHKTADEKIDPASSIEDDTDTSAQYKQTKQNISDFIEDMANHHYKWENTFDGYSDASQVLMGMFEGKSFSTLCRKHDFKNIAGMKKRAESIVKEIQALPVKYFEAKPQRSVHFNEVAGAIVPNGTSKSTIDYLKGQGITVRRYNPANEGERHAKTEKLA
GUT_GENOME028957_000881553-2052PVEEARFRENYFQTAVPDLVVTHNTTEGKLEKTIQLGGFPMPSLGISKKGMKNRLEGFGEITLLGNRDMVDPRKSRNNDVFSRDAYTVRKPAVNYEEPSEKDQLAFYKKYKAVKDELGEKGIDAPDIDFSAYDGEQILNRFNRDPIVRYYYIKNVLGKDVPVKKITIHPEVRNKRFFDKYPEVLKALKSDKIREGDYSELDKATKPYFDHLQNLIDEKKGLIGLYTKRLRKMTTDGHINKEGAFMLWKWMDDYEEDVKKKSYDTVDKPAFQKDTDRLITEAGIEKFNDYIQKEFESLYKEKYLWDKGKKYKFTLANIVKLMKKDRGADSEGAPGNYGFNSLLAYLSRKFTSIRDIKKNEGMLQPNEKELAQYKKTEAMYTDLLDEAAQLRGKYDESIDTDLAELIKDVRDGKTGDMHGFPENKEFMDHVQSLLKGIEKVTTDYFEAKPARAVSFSEFSGAVVPENTNPEIIRYLENQGIAVETYDPKVEGEQKVKTEELA
GUT_GENOME191442_007661410-1995KNDVKFSMDVPVEETKDLVAVHNVDERALERSLELGGLPMPSIAVIKAEQGHSKYGPISLVFGKDTIDPQRNSENKVYGGDGWTPTKAPVEYEVNYDAARRLEKTLDAASKKVAGGIFDASTVLRKRGIENTTEMGTREIAEKLAGDDSVRAAYLAEQGAEFEPVMMEKVYNRKYGNVALERFTEKVGEKRLQEIHDALTDGVPVDDALGSDADSIRSIIHDYYAKSQESLLARHAKKMGWTAEEVQQKGEERIARTMKTNVTPFVLEDFAQDAWKRMTDTNNTGTEIDRMATSDKLRAAVDDGQVAAWIEEKLDGVLGKAGIYNGKERYTASGNEKSFAQLHYPVTLENIVKAMKGTQEERGDGAMATATGLQAVSAKDYGSIAELKQDSARLNRVDAESYQEQTKALDEKIDRAIKQIRRETTAHTDNLYEEESIIGAVMLQAAGKKTAAAIRRTFAGEGYDISLETAKRLQTLYRDAAQLPTEYFEAKPRRAVGFDEVKAAILPDNVSDSVRGQLEKLGVPVIEYAANDEDARLRALNSVENVKFSKKKQTAAEQIVGEATEKEQLKKQLTAAETEKRELARK
GUT_GENOME260638_001201625-2174KDRIAQTEGEVKQRFSLSEPVERAGNLIAEHNLTQEKLEKALEIGAFPSPSIAIVQAEQGHTNYGDYSVVFPASTIDPEADSRNRVYGADAWTPTSSNATVEYRVDADAKRAFERSIRDLSGQVADGIFRGDSTLGKAGIEEETTKTSREIAEQIAQYPEVKAAYLADKGENISPVYKDREYDNIGNAALQRYTDNVGVQNLARIIVQMYVGDANSVAQAELQRVRQAIGEEYAERFARILDRKPERKAERVSEYAENKMYSGTRAEDFIRHAWEMVQDGGQNRGEADKMAMQDELDRKAPTQKVAAWAEKRLQDVIGEGGIYNNEDRYTSRGDRRSFEQTHWELNAENLVRAMAQAEERGANIMWYDAGGLLAAATPEYRSISEIHADEGRLQTLEQEAYEGKVMELQQSLDNVVERILQETRHKAYGYQDESQLITEALIKTAQGGDSLQSIREGMAAEEYDIDRATAMQIQELFQQAKEIPTSYFEAKPQRVVGFDEAVALLAPASAPADLMARAEDAGLRVIRYTGQKDRIRVANELPGVKFSVQE
GUT_GENOME093068_011411535-2091TPTINTASTDSIRPGNEKSNTKFSMREPVEQAKDLVAVHNLTEQNLQDALDLGGLPMPSIAVVKSEQGHSMYGPISIVFGRDSIDPQADSRNKIYGGDAYTPTAPAVEYPVNYDRMRAVEKRLAGLSEKIAGGVFRNDSALQRAGIDEESGMSAAELADKLARDDSIRAAYLADQGKTLEPVMQKKEFNRYGNDALAKLVQQIGAQELAGIEASIEAGDYQPVREIEDMVRQIIRDSYAEKHSALLNRKPELKEKRLDRFMENNVTVFTVEDFVRDAWEYYQDQGATTSEIDRWATSDKLHEAASVEDVKVWLLPQLEGVLGEPGIYNGSERFDRSGSRRSFSQLHWKYTLENIVRAMTETQAARGGQTFGASATAMQAVGSEDFQSIDEVKAASGRLGEVDTEQYKADVDAVEKRIEQATRAVMRENKPHSDNQFDEMQIIGDVMMQAAQGKQTEAAIRRAFSQEGYSISESTAREIREVFRAATALPTGYFEAKPQRAVRFDEAKAVIVPDNISRTLKKRIESAGVPIMEYRAGDESQRLKQLNSDETWRFSVRE
GUT_GENOME236871_01037536-1087LRNVGDVNGNAKKQLSIPYKNDTDKKELLAMHNLHSNELIKQLDMGGMPMPSVAVTNPDIVSHDGFGDITLILNKSAIDPNASKLNKVYSGDAYTPTFPSIDYEANSDAANRIAEKVNALYDNIPEYYQRGIRSLRDKDNINDVLNREGGVGGLLNRYSNDYSMKQLYLAEKGENVPVVTKENIKEMTEAQKYNSQLVIDELGKDAVLEILPKTRGPHNIQLQREWLDKYGEKLKEAYIKAFVESGISKEEATDIFNSEGPLYWIGRARDAANYLVNGGKTIEKVEDFKATESLIDSKIDKTSFNNWLNEMFAGIEGASGIRNNKDLFTASGKRRSFSQLHDPVTLDNVVKTMKNNAQQKGQAAFGGNMMGASTMQYGSIPDIKKDSKRLGMLPKEQHEANQKYVSDTINEIAQRYANGKDWWDARNTLIEAVSGNNSRTAIERYLKKYDYVYKYDESILDDLINLRDYIRNLQTPYFEAKPQRAVGFDEVGAYVIPNNADKALKDRLTAAGYRVVEYDPNIQGDRQRVVNELEDLKFKVQNSSLPETDYTQ
GUT_GENOME257073_019181118-1633KENNTKTQSNTRTGDFLTERERTLVATHNISSQNLMNLINDFDGAGLPVPSIAIEKADSVHDNFGDVTLLFNKDTIDPQNNSNNNVYSRDAWTSTFPQTEYKINAEGLKAIAKKLDLSENYLESNIFNTNDLNGIKRKFLNDRYVRKAFVEENNIEVTPVAYEKKPKFSFFANPTVKSFIKNNNCTFDRLVNDKEFRDAFLKVAKESIRLRIALRQVERFENTLDDCAKSKDVYSDSKESIEYCIEYAKGNAKKEVTEYSYEEGIENAINEHKAEFEKYIDNMLSESDVIGDKYIVRDDVNLYNNDGSRKSFEQTHYDYNIDNVVKAMKVGKNVVGNSFLGGMEFTKTISAQNLNSIDEIKSNEHMLQELSPEEIETQKENISNLLAPIIREVADSDKYSNGFIGASENIVDAFKAYNTVDGVYKYLKQYYSNLKKSTVNKLFKARDEIAEMPVRYFEAKPHRVVGFDEVMAVVIPADADEKLKTALKKMDIPMYEYADESQRADATHKAINTEYT
GUT_GENOME011105_000461465-2015VQDEIADTKEKRDLTGDEKIEKTEDFIAVHNLTMDQLMKDINMGGFPSPSIAIIRSAMQHTRYGDVSVIFNRDTIDPERSKANKVYGGDAWTPTFPGIEYDVNEDKYYSVMHSVDDAMKGKVPEYLRAEAKRFNTPGMNSTAEKGGVDAVVEKAKDNYGMKAAYLASKGETIEDQVHQKEVRKYDDEKANRFDKMEEAIAPVEDEFLKDRSVLSARDMLQKYGGQIQQAYEAYAETVPEDQKKRWTGRVKRANEKAFFAHNIIDDITNAIDYYNNGNESHTENERDTAAIEKQIDSKVDPEGYDAWLRETYDGLIRDEGIYNGKDPFTASGTRKSFRQMHYAVTLENIVRAMNEAGAKAVGTFGGISAVREEAVKDFSSVAEMHQNEGMLRTMNQDEYRNMEGEYINQIRQISHRIARSGQNSLIQDDNAADAILDAVRTRKTAAGIEKALKSYGMNVYEGVGNDILNLMHKISEMPTEYFEAKPQRAVGLDEIAMVVAPDTITKEQTRTLNENRIPLQTYEAGNEDARHQVIEDLQDVRFSRPVQDSEGR
GUT_GENOME257336_01454839-1355GQGVKFSLKKPLEYTKDLIAVHNLSESKLLDTVSLGGFAMPSIAVLRADGSHSSYGDISVVFRPDAVNPQADSDNKIYSADAYTPRFPQVEYSLNKRALTALAEKLNTSTSSLEVNEFAGGDRNKIIENLSYNANAKELFAKENGLTVTPVLHEPEYSHSIYKMPAVRDYIASGISFHDLVYDEAARGEFIDKIRNASTLKSMAAKWADRINGELDDAKNNQVIYEALEETFEGDRSIAEKRADKVETASSYAEGLDELVQEHADEYNAFIAELVDSVLDEKWLRNNKPYYTNSGNPRSFEALHEEYTAQNAVQLMKERGSKVVEGGVFGYGVGEIRAALSKTYDSFDSLHKDEARIQDNQDEAIKAQYDKCSEALNEICGEIAGLSKIENRFMAHDIASAAVLDIISKTKSDAYAKAYVNKEYSIDITDEIVSKIRKLAKEVEKLPVKYFEAKPQRVVGFEEIAAVVAPSSVSQSTISKLDELGIDVIVYNKDVPGDRKDAVNSIGNVRFSRKKSF
GUT_GENOME140104_03146306-731AEKTKFSLDVPVEETKDLIAVHNLSEEKLLKDLELGGFPMPSIAITKAELGHTQFGDISLLFRKETIDPANKKNKVFGADAWTPTFPKVEYEVNEEAVRKAREVLQKLPEASLPEEYKRRAESFVESLDYNLDRYGGKEGILEYAKREEALKAAYLADQGGTVETRTKEIRTEMSEAEKEQASTILEALGQDNGFSEKLSGKEAYDRYGYRIKQALLAYYQRNGINEETAQQVVSSMTKFQIANEYRKAKKYRENGGVDVKTEVDYAAMKREIEKRTDPDGYEAWLEELFGQIEADAGLPNGKDPFTPSGNRRSFQSTHLPFTLENVVKAMKAQGTRNVAGFNGIKTIRAEATPALTSIRGIKQESSRLQRLDTESYAQLVQKLDDRLMEVLADVRDGSGRTDDLMAFDEIGDIMVLAAQHPTAKQ
GUT_GENOME234120_006502818-3361ITVAKIAKVFETAKKNGEKFSLKDEKTLAGVHNITEEKLRKALKLGGFANPSLAVIDTNKTGHDNFGEISFIAPSALLDKRTGNTGGTWITDAYTQRYPSIEREMSEKGSQKFKDWVDSLEYPSEAKAEIKRQAEDALSNNNAPAWELMYLKEKGIDIKEYDSRIDYRWKEIISDHPTAEDILNSMKTDPELNEKVTSLAKHAIIHPTWEKVSLEVRRKMYKETGVKASPINPQVRKQTKEIFERDYAPTLLNKDGSPRKKDVKKVVEDIVKEHNDTKKYDFYLSKVKASNYVNKNGLYDDYIRWQENKLDEFGTKNRIFRGYTKDGSRKYVPETLENVSKAMREEADGQTNGSEYTSFGSFIAKLASRVDSTDEMRANKDKLSSNKDKEEFYEKWNGVYYDLAKFLYNDVFYGEQRLHDIVLQSDPKKYAKKEYGITLTPTFMKKLDALKNAVQTELKSAYFETKYNRPLRLNEFAAAVVPNNLGDDVRKGISDAGLPMYDYDPKKDGDRSRAFNEAINSSDNIRFSLKDEKEKIVADAKANG
GUT_GENOME109956_000083256-3783NSGAFSASEPDITFSLSNLAAIHSLDEEKFLAVDKLGGLPLPSIAVTRLDRPYTWGGEGNIYLVGSPALADPARGVEIHDRDAWSGYFPKLRWSRQEEKEREDFYNRAQEAALRYYGGTDISTLRFLKNALDGDHRGELENKLRHNESSLAVFAAERGYSPRPVTAKIPGRLNTGDKIFYAEIRKMLPWKDANTLSPDRQDDFCKAMERAIERYRSQFSQDSRDAQLTVPQTLTRKSNLRAMEKELREARGDGFQSISYLALQDARQAGKKALDSHANYKRFEKYAAQHKKAFNAWVEDKLARWLNPVPRIRETGLPATLENITRHMLESKGMGAERGLVFSTGLLRARQARRFNSLEEIKASRNNLVTTEEEKASQQKAQDLIHQFQMALSRIDGSFSAFDNAVKALSLVRGAPTPQKVLAALSRLYRGTSFQGRIARDSRLPGLGSAALSALHAELRDYYEAVPRRSVQLREFSHAVLPAALRKSKQVRDVLKRHQIRPLYHDGTREGRFQALASLIGSSASFSLA
GUT_GENOME011979_000503093-3616LSQRFNENNADVRYSLKDEKTMFGMHNISLDKLRKAIKQGGFAAPSMGVTDSKNGIYSGYGEITLIPKAEKIAKRTGKNIGTYAADAWTPIYPPVEKKFGGNGGDVAYDDIESVPKEMQSLTRNAINSFMDGRDTNSLAYLYLHEKGKAPELVHVEGKYPKELHDEVKGILGKLDSIYSTTDEQKEKLLDLFIREVYDGNREEFDNDIKNFIKKDEEFIKKRPNSNIAKNKQLDVDWMKEHGYDYGALSRFVDGILRDAETSGKVDENATMKAAQQYIQDNGMKEDFDSWKEKLNDRYNVEEVIFAGYKADGNRKYLPNTVENAVKVMKQDGKNASVGSASFSHFVASILKPMGTLDQIRKKKGNLTDNYEDVEKFQEKWQPVYDELADKMQPDAGPFESYGMDRLEDAATQKNPKKYAKDEYGVDLTDEDIDKLNELIDAVKNEKPSIYFETKFMRPYGLDEFEKAIVPNDTPSDVVDALKMAGIDVSSYERGNAEDRQKVTMDAINSSDNIRFSLAGERGAA
GUT_GENOME256699_010361792-2341LPEKRQFSMSEPVEQTRDLLALHNLTEDKLRSTLRLGGFPMPSIAVIRDQMQHDNYGEITMVFGKDTIDPQADSRNRVYGGDAWTPTAPRVDYPIDYQVQRAFEDRVEQLSANVAGGTFSRGGILGYLGFTGDGTDLDMGEIVRRTTNDKAVQAAYLAEQGRNVEPVMKAKQFDSFGNESLDAYIDRVGEQEVARLAANLMTGERLTNEEIETAKDVIMEHWTARNEWRLNKKPELRETRIAKQREKISDARAEDFIRNAWDFFRDSGATTDEVDFYATRDALEAAVDKQAVADWTLQQLDGLLGEPGIYNGRDPFTSDGRSRSFQQLHYPYTLENLVKAMSTTQEARGEGLWGATAKGLQAVSTPEYRNVEEIHADEGRLQNLSQEEYDALLQDVDDGINQVIEAVRSTNEAHSSNSWEEADIIGSVLIDAARGRHTPAAVQRAFRKEDYTISRELAQQIVDLFNQAANVPTGYFEAKPQRAVPLDEILAAVVPDTMEAELRSELEGAGVPTITYPEGDTQRRMEAVNSVEGAKFSISEEEDAAPERRD
GUT_GENOME111254_027971885-2398ADTRIKGNIGSTRFRINTPVEQRGDLVAIHNISEDKLKEAIGLGGFPMPSIAITKPEVGHSTFGDISLVFGKETINPTDRRNKVYGEDAWTPTFPTVGYKLNEDKKSDIYRRANKAGNLPLFNPVDFHSDNYKSYINGIGSDSLVNHFKDSYGAKQLYLAETGNAVEKFEEHEVEKYSTERIGFLEQMLKEIGIERLKKESYAVLENEIKQILGKYYNVDFDKLQPFRAKIRIDNAIKQAVDYAENGNNKTESDVEATKKKIDERIDQKKFEEWLRNLFDGVVEKKGIRNETDLFTPMGNRRKWESLYDEITLDNVVKAMKKQSAKGGQGLFGGNIFGAAQSEFKNIGEIREAARERIRELSNEEIEGRRNEITDRLSQIDIPMRDKGIGDAFDMIENITDSVRHSHTAKGIYNYLHDIYPAMTMDIANEIADIVKDIQQMSARYFEAKPHRAVGFDEIKFAVVPDNTDSGLLNQLQNMKIPVEIYEKGNNEQRKQILNEAADKYDTRFRMVEA
GUT_GENOME245166_01603991-1531GNERRKTKLGESVRFSLSAPVEETKELVAVHNMKTSELLKTLDLGGLPMPSIAIIKAQSGHSEYGDVSLLFNKETIDPKFMRANKVYGGDAWTPTYPTIEYKVNEKVEKKIRNLYYNLAEKYGYDEVGPLYNYANNLEDTLNRHNGEAGILEELSDNTGMMQLYLLETGKGKVENIEKEIVTELSEAEIEMNEHFINALGEDVVKSIVTPSGVSPVTHRKEWITAHKAEVEEAYVDFLKNVYKFTDEEVENVLNNTRIFDYAKMVRDANNYIKNGARTVRTEIDRAATTAKIKEMAGEGYSKWLKELFSGVEEKTGIRNNADPFTRSGNRRSWEALHWENNLENVVKVMKEQENGTGFFGGSGIWGVSAKEYGSIDEIRADSDRLKAMDEEACSEIKESLGQRFQEIAVSIMNKSERNEFIAIDDAMESIVDAVRTSKTKSGIMNVLEQYQQLNVTEANVDDIVELVKDIAAMPTEYFEAKPRRAVGLDEIAAVIIPDNVDAALLNKLEEGRYNVVTYKAGDEADRVAKLNAQNDIKFSLS
GUT_GENOME114677_018972008-2560FVAFEPTQIKSATDNRGTFDSDNPDITFSVRAVQNLGAVHSISPDKLLEAEKLGGMPVPSVAVTRLDQPYSWGGDDSIYLIGRPGMIDPKRGAEVYSRDAWTGKMPYLVHKAVGWESREQTVADLFKMEDVYQSREDSSWLHGLRYYIALDYAGDKTSREDVERCLRNEDGKALFAWQSGYRPRPKMRNAAMKHAWMTKALRDELAAYESLSEEEYESRAGEVRQKAEEAIDAYVGGLYAELEKKKPAIAAKMRESMRKTFLNSLFLSDKYGLREVLTDVRRMGKRVPDLNANRKMLARYADSHKRAYEKWVREKLEGWFSAARYIEGTGTKATLESLTKWMLSHKGRNKEQQLVFGSGKVRAAQAERLGSMDEVKAMRERLSDSAASGTSKKMTDELLQEFREVVSDVFKGGDVFDYSGIVEIQSAAMEALSKVSGNPTTGKVMTALKKVFPGGARLNKLLMREDVLEKGVDALKSLRAELEDYLEAVPQRAVGLDEWVYAVMPEELKMNREVTALLRRHGIKPLYHDGTAEGRVGVMESLVDDPMVSFSMR
GUT_GENOME036127_019942260-2775NNADIRYSIKKNETDTDKNLVAIHNLSEKKLKENLALGGFPAPSIAVTKADIGHNGYGDISVIFGRDTIDPTKSSDNRLYSADAYTSTFPQVDYKLNEDALNRLSEHLNMSVSQLESNIFDTGDKERIISKMKASEQVREAFVKDNNFKVEPVLRDPKPSKFGTNTNEAFYLLAKHVTYEQLVNNADVRNDYFEAVEEGSKLKGATKRFIEKMNKQLDAAKNKPEIFEREQARFNDDMKILNGQGEKVEDAYSHSDGVNNVIKENQSEFNNYAKNLVDKVLEDKYIRKDIDPFDEYGNGRDFSEMHTEYTLENVVRAMSKGKNQENGIFGAGIGEMRSAVSQEYKTVEDMKKDSGRIQKLSDEEIDNIYSKSEKLLGDITSDLAQLIDTGNNFTDIDVASENVKNALLKGTNKRSIKRYLAEYINGVSDTTIDKIVELADNLKNIPVKYFEAKPHRVIGFDEVKAVILPDNVDPNLNQKLKDMGINTVKYKSGDEADRQRALESVDSNVRFSHTVG
GUT_GENOME005312_02102408-915EVKEKDGGKSLVGLHNISEEKLLKALKLGGLANPSAAVIDIARQSHEGYGEISLILPSSMIDKRTGRNAGTFSTDAWTPVYPQIERQFSGDGSGRVRDAISRLPQEMQPNVRSAWNSYMDGRDEGSAFAYQFLYERGEAPEFRKIEPLFGEHLRNRISAIDALDDYDERNTALLEVYIEENFGGDQTKFKDYIETRKRVLQDKIQAFPEQKQKGLAYRKAVEKLNDIEERGYEYSSVQDFYDRVKTDIRQAGSVDTYHTLQDALDKINGSEALSRQYAEWRESLAGKYGIKEVLFKGYTPDGIRIYLPHTLENVSGLMKQQGLAAATGWGGSFSKFVAALMKPVGTLDGIRRQKGKLITDHSDLEAFRDKWQEVYFDLGIKLNPGGGTFDDTGLYRVEDIALKPDPGSFAKREYGVELTGEDVRQLKEMVAAIQNECPAMYFETKFERPVYLNEFVAAIVPENVSEDVSKSIRESGLQVFVYKPKDESSRNEAVKLASEIKGVRFRLA
GUT_GENOME000100_00582915-1483SDNDVKEKILRRFQLEEADVDETKTLIAVHNLSADKLLKTLEYDGIPMPSIAVTKGDQGWNEFGDISLVFRKDTIDPKASRKNKVYGADAWTATFPQIEYDVDPNVYYKANRQVKNEAEGKIPNYLLSEATQFITTQSGNADRQGINGIVQAAKNNMGMKAAYLASKGIEVEDRVQQVQKPDIDPETEKMYTSYLNHMTNEEHADIRQMMENGAVDEIREKYKDKIADAWADARLERGDDPKVVEKAREKYKNGPIRLLLFGIGKAYDFKKNGIQNTTEMERDTAGINKEINEQVDESGYESWIKDLYKGVVKGKGVYNGKEYYTPSGNRRTFKQTHYDVTAENIVKSMLTQGDGDQVNTSGFYGIKTIRAAASGELKSIDEMHSKEGKIQHINVEEFDAKQKVLNDRLLNVIYSITTETGSTSYTATDNVGTIIQEAAGKKNFSEKTVRKVFSEYPYWKVSDANIKEITDIVNEAKEMPVAMFEAKPQRVVGYDEIAAAVVPNDTDQQILNALEEKGIPVVTYDETQENARKEAVNSLEGIRFQLEDVEQNVETDQETDRILRENQEL
GUT_GENOME096797_011671239-1761KEVILQASDVGTHGTLMAIRNIKEPELRGLISLGGLPSPSIAITDPNKVDHSNFGRISVLFDKETINPGNRKNEVYDRDVWSPRFPSLSYEVDDNKASELYARARSAGSVPLFNATNLAPQNIEDMIDREKGEAGLIERYRDDMGMKNFYLAETSTPVPVQISEKITKISQEEKEQFDFILEKMATGIAEADNMSGKEWNDLYGKKFKEVQREYWKQEIPSIEEHTLATLVDNQKPFKQVMLARKIREYMENGGRKVEQTEDLAATRKAIDEKIDQGEYNTWLANLFKGIEKSVGIYNGKDPYTSSGNRRSFKALHDDFTLENLVKNMTKGRTQGGENGFMGVSAGAISSNIAKKFKSIADIKANESRIKALADEMVTPLKEKLGESINALYPYYKGDYFNVFNSSTEAVFEFSTKKLTSENFKKILNEYNFDVNRIPVEILQEVINDLNSLKDIPTDYFEAKPQRAVGLDEIKAMVIPNDTDATLKQELTDYGFYVIEYDPNIEGDRQNKINQFDDLKFSLS
GUT_GENOME013704_001051295-1840KNSVAEASDVVKFSAKDPVEKKGTLLAIHNLTEEKLQKFLALGGAPMPSIAVTRSDVEHSNFGDISLIFDKSTIDPKASRKNTVYSADAWTPTFPQIEYETNAKVDNAVYSRLTALSRNMDEFYRKDLKRVLYGVDDGLNRYGGEAGFVEHAMDNLGLQAAYLEDNGEHIDRITKQVERDKGYSEDRVERYQKVAEVLGTTDADEIDEMPLSDIRDQHGAELEKAVPGITKSALRLSGVLMQTMNYLRNAQNGTQYDTVTDYDAMRSEVEKRIDKPAFEKWVKELYAGIEAGSGVYNGKGLYTPSGNRRSFAATHYPATLENIAKAMAAQNGGDTKNVSGFNGIKTLRAGMAQRFKSIADMHVLEGRLQNRTQEQEDALNNALSDRMYDLMNRIDATRDKRYASMDNSLMAMDNVGEIMMEIADGGKYSAQHIQDVFNGYGLNIDGKLANEVRTLLFDVQQMPVNLFEAKPERAVYTNEVRMAVMPEGEYPELQQQFRDMGIPVETYDPEVQGDRVRVMNSDVTEPLRFSVKDVEPADAKKLQREN
GUT_GENOME058743_015861037-1560KGYEEDLVAMHNISEDNLKGVLQLGGMPVPSIAVTKKHTPYNKYAAITLIMSKETANPEKEDVFTRDAWTTTFPRVLRKPLQAKLGKLLDEIKETSGETGLKLYTDKYDMANYWDDENAYSKLEDALGTRAAKYYYLMSIGKAPKLKMISAVGEDSYLKDKAVVKKVAPLIKEITDKGGISIASDDAELSERIANVMREIMLEEANRKIQERLATLTPEQIANEPAFVKRRYENLLKKIDEETKYSLIFKFENRIYREYKLLDKKIADIDKLESDIDKALERPMTKKGYDAWVKENMDTMLDTPTIKVGNSKVPLTLDNIVEAMVGNTKNEQEGVLGRTSGNVIAASADTIKDYESMHKLADEKLDTTQDYDIDVDSNTSFVAVKEEINDIVNTVLPWYSYKGAGFEKFTDALEALIDIVGDKQEADVVKSFKKYSYNVPAGQRKALIERINTLKENIANLKTNYFEAKPQRAVRLNEITYAVVPKTTDKELVRELKANGIKVAKYDVEVEGDREKVTNAAQDK
GUT_GENOME215920_00752939-1480NETITGAVEESSRLVALHNLSEGKLLKVLQLGGFPMPSIAITRADIAHDQFGDITVIFGKDTIDPQRSTANRVFSRDGYTPTVPTVDYKVNERVGEKIRKKYYELARKYGYDTLRPMYKYANNLDDVLDNVGGETAMLSEIYDNTDIMQIFLLDTKGQKIEPVYREIRTDMTDEQVAHQENLIKALGRDVVEESYYSPPGPPSAKQAEFLAKHKDKIVDFYANFLFGNLPIEEGRAAVLESFSDRDLVLQVRAARSYLHNGRTTTRTEYDSSATQEAVRNAVSQEEYKAWVDSLFGGVEEKQGIRNDKDPFTPSGSRRSFEATHDDYTLENLVRAMKKAPAKGNGGFVSLTVNELAAKLAKEFKSVSEIRKSAVSLKAFNQEIQDGFVDTARGMINEIESLLLPQSSTERMTDWSLLDGASLIIGEIADKGYTTEKQIADYMAREYANSPYKYTKEIGDKILSLFQYVRQMSDTDYFEAKPRRAVGLNEILRVLLPEGTNARITNALDSKHIPYEFYKEGESRSSIVQKMDDVQFSRKDSEG
GUT_GENOME022845_003591162-1742NVLNENFRKTDVGVKFQLKDPVEESENLIAVHNLDADKLTQMLKYDGIPMPSIAITKSEYGWNDFGDISLVFRKDTIDPANKKNKVYGADAWTPTFPQIEYNVNEGVYYDAGRTISDSMNGKIPDYLISEAKRFNDTMSGNPEKNGLDGVINAAQNNIGMKAAYLASKGEQIRDRVQEVDTPDITPGEETLYQNFLQGIPETEWSKLKDLLENGTIKEVRDTYGSMIVDAFIDARIKDGFDREVAEKSREGYKKSPTRMMSVLMKLKKAFNYQENGVHYTKETIRDEAGINEEINSKVDEAGYKEWLTNLYDGLVGEQGVPNGKEYLTPSGDRRTFKQLHYEVTPENIVKSMMSQGGDTKNVMGFIGIKTLRAAATENMRSIGEIHKNEDRLQNFSVEDYAAREDALNTRLYNVIDSIIKKTNSNGITDVDHVGEIIQEAAGKKNFSEKKVKETFAEYPYWVASIEDVKEITDIINEVKKMPVNMFEAKPERVVGYDEVAAAVVPSNIDENVLQGLEDRGIKTVMYDPDTEGARKDAVNALEGIRFQLEDIDEGPSMSELVKENKKLKEANALLKKQFELT
GUT_GENOME217395_006471502-2038TVSGTVEETNDLVALHNLTEEKLLKVLDLGGFPMPSIAVTKPSLAHDNFGDITVVFGRETIDPQRDSRNKVFSRDAWTPTTPRVEYKLNEKRAEEINRKINSLIRGTDAELFGYLALDESNMTEFLNRNGGDLAQSYRDKDAFKYAYLREKGIEVTLPTVDKEVSDYGNATIKWVADNLGKSTLNSLEQSNYSNEAVQPYVAQVNELVSDYWQREYNMSPNNEIDLYEAKGILREAARYADRGFPKSLDARAAREVINEHLDEDGYNAWIDELFSDVVEKAGLRNNRDMYTSAGRSRSFEALHDDYTLENVVRAMSKADPKGGSWLGLNPSSLAAKLSKEFKSISDMKKAAASLTEVSDDAMSNFTTTAGQMLDEITADMVTREDYSSNFSYWNALDGAREVIGEIADRKLFTEKAIADYMKREYSGFYKYNADIGNKILGLFAYAKQMTQTGYFEAKPRRAVNFSEIKSVLLPDTASDRLKNRLDRLHIPYSVYGSTATERSTAIQRLDGVRFALPETDSTGTSLSQEQREFFKDS
GUT_GENOME148312_007611797-2338NLPGKKFSIFEDGTDKNLVAMHNLSADNLETALNRGGFPMPSIAVTKDNISHNDFGEVSVLFDKDTIDPEINNNHVYGSDVYSPTHPGLEYKVNENKSKEVYDYFKDELKNKDMAFKVNPVNFSSANLSNKINSLKGEDNFIDSLKSNYEMKNFYLSLNNNEVKKVKDLVNEEVEEVDELTADFYNYLYNNMNEEMKEIKDLSNREWFNRYKEQFNSVTQSYIDKWKGSSLASIKEFIANEARKNGIKIAATVKKVLNYKNNNGKIVNTTTVKDYDGAKKEIDKRINQKDYEQWLKDTFNGIVEKTGIRKPNVDAYTSRGDRKSFEQLHYADTLDNIVKAMKDVQNGESFFGGNQLWAIGSKEYSSIKDIKGDTSRLQMLSDEEHSEIKSELGVRFQNIAKELVNNNDLMSLDSAYNNIADAVRHSKNENQMLKYLKEYYPTTATSEIVNEIVNLLDDVSNMPTGYFEAKPQRVISPGEIKAVVIPAGTSANVIELLKKNNIPYYEYSSDENRTVATQQAINDTKIRFSKFSDKIKKEETRS
GUT_GENOME116449_008481084-1639DAAVKKFSMEAPIEQKKNLIALHNLDETKLLKTLKLGGFPMPSIAITKSDIPHTNFGNITVVFGKETVDPKFDRRNTVYSADAWTPLFPRMEYEANEKAAQRIRRKYYELEKKHGHDFVSPLYESANYLDDTLTKYGGVEGLIDKFADDTRMMQIYLADTGRTPVESVKTETITRLTDNQIEVYDALINTLGADVLNDMAAKHNEAPFAARKAWFAKHGDALKAAFEQYYTKDGIDAKTAKSVVDAMKPAELIKEATNARKYLKDGAETRKTEVDIDATNIAIRKAVDSGEYIKWLNDLYGDAVKDSGFYNNKDYYTSSGNRRSFKATHYPNTLDGIVKAMASQGDGNSRNVMGFHGVKSLRAGTAERFKSVEDMHKLEGRLKHLTAEEASQISDALDSRLSELMHDIYNLVPHSGYSNELMELDSIGEVFMEATELKYVSPANVKALFKKYNYPLTDKMASDIVALLFDVNNMPVNIFEAKPERVVGFDEIRKVIIPDTSSDTLRKALKEAGINAVEEYRAGDDAARMKIANDVPDAHFSREPESITELRRQNET
GUT_GENOME110464_00106690-1211QGNPINADERYSIKADKKEDRTLFGMHNISIDNMRRAIKQGGLAAPSMGVVDSKKGLYSGYGEITLIPRAEKIAKRTGRNIGTFFGDAWTPTYPQVERQFDEKGSLRAYKDVEALPEAMRSLTRNAIDSFMDGRDADNLAYMFLQEKGKMPNMVITQRQYPEALHREVEDALGGSSKLYGTSEEQRKQILDIFVREKYGGDKNEYSRDIQNMMQKKENIIAKHPKSYFARDMQMDLDDMREKGYDYSALSRFVDDIARDEKNAGKVDVNATMRKAKDYIRENGLQNDFKQWEESLNDRYNVKEVIFDGFTPSGKRKYVPNTLKNAVAIMKKDGKNAATGTASFSRFAASVLKPMGKLDEIRKEKGKLNTDEEAASMFQEKWQPVYFELAEKMQPDAKGFESYGLDRIEEVALQKNPRKYAKEEYDVDLTEEDIEKLEGLVKAISTEMPTKYFETKFMRPYGLDEFEKAIVPKDTPKDVTDALEQAGIEVHTYNGKEDREQVTLEAVNNNDDILFSIKREVAD
GUT_GENOME122601_006772589-3116EKNKDVRFSLKDEKTLAGVHNISEEKLLKAIKQGGLANPSVAFIDSSRQDHKAYGGISLILPSDKIAKRTGKNAGTWQGDAYTPTYPQVERHISDKGSEQVNKDVLSVPKEMQHEVRNGIYRWLDGDANSGLKYLFLHEKGVAPEPKKIQPKFSDEAYNELKFITAGDFNIYGIGKADAQKVLDMYIEAKFDGDKDLYEEKTKAWLERNKSIVDAGAKGGMRYAIAKENVELYDEYGFNYKGVQTFVRDVEYDHRKSGVDTNATLNEVEDYIKTNNLTDEFNTWLEGKEKEYGIKEVIFDGFTPSGNRRYVPNTLENVSKLMKKQGRNGATGAAVSFQNFAARLMPSYGTLKDIRSKKGLLTSDREKFDKFREKWSNVFFELGMKCQPDATGTFDDYGLARLSEAAMTSDPQAYLKKEYNVDFSDEDTKRLKEMVKAIKEEHPAMYFETKFERPVRFDEFSAAVVPTTTKKEVKEALKNAGVSIFEYDEKSDADRKRAFNEAINSSDNIRFSLKEEKEKIVADAKANG
GUT_GENOME175340_010261334-1919KKWFPNIGDEDTVFAESVKFSLKEPVEETKDLIAVHNMNAEDLKRTLQLGGMPMPSIAIVKGKMGHENFGEISLVFGKDTIDPEGMKENRVYGADAWTPTVPQIDYEVDSKKAKAFENEVGERASRFADGIFSSYSVLRSIGVDESTRKTENEVVKELAGQDAVIAAYLEEQGKTMEPVYDYKEKSYDRHGNDALRAYIQKKGTDELRKLVQDMESGAVDWMEAVQGEMDRYRESIEPELRKRAQGFFKSRPAEVVEKKVTERLKKEMVPNRVREFIEHAWELQSNPVETEGQVDRNATREKMRKMVDAEAVEKWVNEKTKGMLGEAGIYNGKDPYTQSGKQRTFAQMHYAYTLENLVKAMKGNQQEKGEGLWGATARGLQAVTSREYRSIAQIRGDKGRLQQVDEGEYKAELEAIDREIESIVSEIKKTNKAHSDNTYTESDIIGAVMLEAAQGRHTKAAIKKAFANNDYDISDMVAGKMANVFESVMELPTTYFEAKPQRAVMVDEIRAAIVPDDLGADVVEQMEKAGVPLITYEAGNEESRKEALNSVEGVRFSIREEEYEDLVEKNEALQEANEALRRQLEI
GUT_GENOME147207_041803-521KRIALAHALTESDLEFANLIGGLPNPSLGFIPSNQEFFKYGSCVLLLDPKKIDFNNNYASSIDVYSSVLPDVSFELNHSFWDQLNDRLDSALQANKVNVGEKYKTQTVIHPELLKGAISARNLLVKNPAVKLLYLKEKNILNPCKPKEIPKEKKIPFLSNESIEALIDNNVLDLPDDDANKEISKLLLADVQLKMKMLSSTGGSDGNSRKNRIELRKLNAFIENSMYFDDGIPKLYVSYFDNARNDLKEHLRRTSSIDESKYISDLEEYFKNHVPRGTFEDWVSKTIEPGFGKAFFFKTSEHDVDYDKEDISHDEIYGVERELATLDNLSKEMNRKLITCDSLFSTSIVKIAASVKERLHSLGEIENHIDQLKSEEEVNEYFMSLKVELSAIIEELAAYYKFKDPNGNVSSIYNNAATEALVQSRCEVNDELRDSFFVDQLPHELITRIDDLRNSLIVAPYTYFELKMSNPIRLNDFSVAIVPKNVSPSLVGILKENGLKICGYEQGSNFDFVSVLNKQ
GUT_GENOME235052_01129883-1443KRVKYSLKIGNENLTVDGEERKNLVALHNLSEEKLMKVLELGGFPMPSIAITRADLGHEQFGDITVIFGRETIDPKADSRNKVYSRDGYTPTVPKIDYKVNKKVLSKISKKYYELSKKFGYDTTRPLYKYVNDMERVLEDNAGEFATISQVYDDTDIMQLYLLDSGKEKIQPVYKEITESLPKERVEYLNSLINHLGKEAVLEITPKDGESLFTHRHNYAEKYRDKIAEFLVDGKIYKNTSEVFEDFKDIDLLRLVISARNFINNGATTTKSEFDSSATNKAIREATPQKEYHAWVDSLFEGIQEKTGIRNDKDIFTPSGNRRSFEATHDSYTLDNIIKAMKRAPIKGDGGFVGLNVNALAAKLAKEFKSVAEIRKNADSLKGFNEKVQDNFVETAREMISEIGRLFVPQAGDSVSGWDMRDRAETLIGEIADKGLTTERQISDFMAREYSNSSYRYTKEIGNKILTLFDYVRQMVDTDYFESKPRRAVEFKEIRDVLIPENVSEKLIQALDEREIKHITYSDSNTRSDIINKLDNIKFSLKEDSTGKQLSEGQKEFFKDS
GUT_GENOME235337_02300981-1576MQIIHPGETDVKRFSLKKPVEETDKLLALHNKDENSILEALKLGGLPMPSIAVVKARDGHSKYGPISLVFNKDTIDPQLSSANKVYGGDAWTPTAPRVDYPVNSKKALQLEQELHRLAGDTSVAGGIFGNSAALRSMGIDDTSTRNMAELAEKLASTDTVRAAYLADQGKTLEPVKKTKVWDKFGNDTLQKVVDRLGVNTLAEMEAKLEMGKSVETALGDNAEVIRDILRDYYREKGEPMLRRMAVKRGWTDAEINERRQTRIDNSMDSVSIFTLEDIVRHAWDMYQDGGVTKGEIDRMATSDALRNAVDDHAVEDWITGKLDGLLGEAGIYNGKDPFTASGNSRSFSQLHYAYTLENIVKAMKEGQEERGGNTWGASAKTLQSVATPEYRSIQEIKEDSGRLGMTEGTEYEAKLQAIDDQIDSIITKVKQGNSAHSDNPFIESDIIGGILVDTANGKKTVDAIVKAFFKEGYKIGNQTALDIQSVYKAAAEMPTGYFEAKPQRAVGFDEVLAAVIPSNSSQKLRDALEKAGVRTLEYAAGDESDRLTKINSVENARFSLKRDAALEKDFASIREQLEAGKLTEEQADRLRGEAVD
GUT_GENOME286704_018501073-1622RSSVSEDNVAQYKTDVKFSLKKPVEESGNLIALHNLTREKLEKTLKLGGFPMPSIAITKADIPHTNFGDITLVMSKSTIDPKANKKNVVYSADAWTPVFPQIDYEADRKVQSKISKKFYELANKLGYDAARPMYNYVNDMERELQVNNGEKGIIDKLKEDTSMMNLYLADVGKEQVQPIYKEVVTKLKDSEVELYEYLIKGLGKDTLLDLTPKNGESPITRRRAWLEEHEEALKTVYGDYLKKTGIPEETVNEMMSNDKIAKSLMRGAVYARNYLKNGSEKRSSEYDGEATKNAIRKAVDKGAYEEWLNKLFKGIEKSTGVYNNKDLYTPSGNRRTFKQTHYPVTLENIVKSMAGQNNGDTKNVSGFYGIKSLRAGTAERFKSIAAMHDKEGRLQHLSEEQAAAINDSLSNRLDDIINKILRTKPHSQYDNDFIMMDSIGNILVEVGDSGKYTVDNIEKIFSKYGYKINNGLAAEIRDLLFDTSQMPVNIFEAKPERAVGFDEVLAAVVPDTTDNAFIESLKSAGIENIIEYKQDDEQSRLDAVNSVDDA
GUT_GENOME009864_000451403-1901RSGDLKEKFSLKEPVEETKDLIAVHNISEENLMKTIELGGFPMPSIAIEKYNQAHTNFGDISVVFYKDTIDPKQRKENKVYSNDAWTPTFPNVEYSVNEKEISKLAKRLHASEQMLEGNFFNADKEDAVRKLKRNRIAKEAFLEEKGIQAEPVITVPKMNIAENDRVKEFLARNDVTFDKVLNDSTYQKEYLEVLDEEDRDEQKEKLESAKKSHFKYKLRKGSFERNQKVAKEEAEPKVDHDKGIDNAIQEHQKEFDAYVEEIVGPVFGNPGLRNSKDTYTRSGNARTWKQLHYDYNLENIVRAMKEQDSVGAGRTYQVYGNFQGVSSKKYSQLKDIKADSQRIKSLSTEEVKRIKENFHSRFDEMVRALAKDSKNQYDAELSAGNAIVEAVRNAKTKSGLLRELKKYNPSADTLLVDDISNLVQDLQMAPVKYFEAKPERAVDFSEVAAVVMPDNTDSKVKAAIEKSGMSIEEYRAGDQESRKAAVNKLDGVRFSIKD
GUT_GENOME014204_01797298-845RLNEVSKPEKRFSLSSNVEQTKDLIAVHNMTAAELEKSLDLGGLPMPSIAIIKAADGHSEYGDVSLVFGKDTIDPKKSARNKVYGGDAWTPTFPKIDYKANEKVGKRIRDKYYDLSRKYGYDNTRALYNYAQDLSDVLTSEGGEKAMIGKLYGDTGMMQIYLMDTGRERIANVNKETKTALTDGQVRQYENLIDTLGRDEMNGFAKKDGEELSDLVRRRKAFIEENEAGIRKAFEKTFIEDGMSAKDAAEVSAELALSDLRKYVLDTFNYIKNGAETFKTEFDSEATNAAIKKAADGEGYRKWVNDLFGGAEEKSGIRNSKDAYTASGNRRSFEALHWENNLENIVKAMLEQDETGGAFFSGMGIWGVSAKKYNSISEIKNDSSRLRTMNEAEYNALKEQYGARMQEIAQTLIDPKNDNYFIASDDAYSLIVDAVRNSKDESGILRYLKKYNSRATEKTASDIAALVRDIAEMPTGYFEAKPQRAVGFDEVKAVILPDNASGELVSRLESDGINTVTYKAGDENSRLEALNGVENVRFSLARANDEYM
GUT_GENOME117495_00284338-885SIPADGENVKPQFSLKAPVEETKNLLALHNLTEKNLLDAAKLGGLPMPSIAIVKADEGHGEYGDISFVFSKDTIDPQLFRSNKVYGYDAWTPTAPRIEYEVNEKSAKKIHDLFYRMERAKGRSFADPLYSAANTLEDELNRKGGVDKVVGAMRDDPRVMNIYLEDTGRGAVENVIKREVTRMDDNQQEMASFLIRELGDGVVSDFRAKGGESPIAARKLWYKEHGEALNAALQKYYEKLGLPAKDAADVVNAETVAAKTRYMLDTRKYLTGNTETVTEEVDRDATNKAIRDKVNQKEYEQWLEDLFDGVVKSEGIYNGKDYYTSSGNRRSFSATHYEITLENIVKAMKQGDQKGANTFFGGQAIWGVASKDYGSIADIKRDSGRLQKMTEEEYSAIRQKYSERLAELTNEIKDPAARNEFIASDDAASAIVETLRTKRTVAGIDKELRTYPTLQIKPDTAEKVLQLYEDISNMPTGYFEAKPQRAVGFDEVLAAVIPNDASAEVKVALENAGVRMIEYASGDEKARLDAVNSVENARFSLKATAEVER
GUT_GENOME194688_002771274-1831PGTEKSKTRFQLKEPIEETKDLIAAHNVDADKLNKMLDYDGIPMPSIAITKAEYGWDDFGDISLIFKKDTIDPKSNKKNKVYGADAWTPTFPQIEYDINDDVFYKAREKVEEDTRGKVPEYLAEEAKRFISTQSGNPGYAGMDGVVEAAQSNMGMKAAYLASQGIEIKDRSSKVETPNVTPEKQKVYTDFMQNIPEEIQDDVYDMLKNKASIKEIKEKYGDTLIDAYLDAKVKSGLDPETAQKQKIGYSTKAFRFTNLLSTFQKASDFQKEGIQYTLETVRDTDGINQEINSKVDKEGYDKWICNLYDGLVRGSGVSNGKDPFTPRGNRRSFSQTHYEATPEGIVKSMLAQGDGSQATTGFSGIKTLRAAAVENMNSIEEIHAKENKLQKNVSEESKSIMDSLDKRLYNVINSITIQSGHNGVMDTDNVGNILQEAAKTKKWNEKAVKDTFNKYPGWKVSDADIKEIVDIVNELKELPTDMFEAKPERVVGFDEAAAVIMPTTAPQELIEKARDKGLNVIQYDPALENARTEAINNLKDVRFQLDEDSDHDFTERKIM
GUT_GENOME048246_003991344-1902QITVGSEQGKNLVALHNLSEQNLLRVLQLGGFPMPSIAVTKVDLPHENYGNISIVFGRNTIDPEVDSRNVVYDRDAWTPTAPSTDVKLKTEAVDSLISELQGEVGEYSGYRYDVDRFFDGRYKNGAGEYVIEDYNYNKKTVGELATHNAGIMAAYLKEKGADISPIYAERGFTMGWQSFTRSEAQALLQAVGITENITRDNITAEQRAEILEKYINYKAEKNYRLLKRRKPETTFESCYERAKSNYDDGDVSQLLFLSEDFYNKNRPKDVLDDAATEKKLRDSISDMEDFYSWLWNKIENTFEKKGVYNDSDAFDRYGNRRSFEQRYYAYTVGNIVKAMSKGSQEGNAFLGGMTSGALAAKLSVQFDSIESIRAAQDYLKLVSNEEIEAFNSKTYEMYDEIVTEIAGASSDFMSNQTRRDDVGNILGECATVTPLNIENIKRKFAKETKGYDVGYKFNDSIANKVFALFQVLQHIPTTYFEAKPRRAVGLSEIVSVVLPKNSSTELVSKLKSKHIPYEFYDASNGTTRQDVIRKIDSARFALPDTDSTGKKLSTQQREF
GUT_GENOME179674_005891303-1871SAESVSENGGNVKPKTRFSLDEPVEETKTLVAMHNMTEEKLRRTLDIGAWPAPSIAIVKAKDGHTNYGEYSAVFPRETIDPQRSSKNKVYGGDAWTPTRSNARVEYEVDQSKARALEREIDRLSSEFAGGAFRNSSVIGAAGVNEETELSLDDIAERLAKYPAVQAAYLQSKGKSLEPIYKEKKFDSLGNDVLRQYVDRVGMQEVAQLYAEMETGGSLDESALNAAREVIVDDWAKRNARLLERRAENRDKLIAVQKSRLEDWRIEKFIRNAEAYIEQNGTSGDEVDKEATSEKMHSMIAPSGSWGDVEKTVQQWVRPRLDGMLGKPGIYNGKDPYTENGRKTFKETHWDYTAENIVRAMNNASDRGEGMWGLTGGTLAATSAPQYDSVDAIHADEERLRAESDDVHEKRLRDLDIEIDRVVDDLLRSTKAHSDSEYEERHILEDVLAEAAKGEHSPAAIKRSFAKDGYAIKDGNARSIMRLFDIAAKIPAGYFEAKPQRVVGFDEALAVVAPDDAPGDLLSEMRDAGMNVVEYRAGDDADRLDKINSIKNVRFSAENEDGELSEQQEQ
GUT_GENOME186087_013741183-1587RFSLKEPIEETNDLIAQHNLSAENLRKAFRLGGFPMPSIAITKADVGHTNFGEISLIFDKATIDPADSRNKVYGADAWTPTFPAIDYEINDKKASKIYAKANKAGHVPFLNPTDFHPSNMESRISNKGEQGLIEHYKDDYGLKQMYLVDIGETPVDYVQTTVREEMSDRERREGQWYLDNEPELCESFAKVRVDENPMKWRKDNIEKLREAYKNYLVSEGIAQDAADEVANAAKAYEVSKPLLNAEKIKNGKDVIERTENDFTATKEVIDKKINEEDYEKWFSELFSGIEKNSGVWNGKDPYMPSGNRRSFAATHYPVTLDNIVSAMLAQADSVRNADSVFVGTKTIRAAATESFGSIEGIKAASGKIKKIDTDEYNVLKDELDNKLAGVMSDIVSASGPSAAAM
GUT_GENOME019034_00490693-1182FRMEEPDLVALHNLTVENLKKAIDLGGLAVPSVAITKKQTPYNFGSNDGVTLIMDKNVVDPAKTPVFSRDAYTKVFPSIVRTGDRRKITKFIEDAILPAQKEIPREVVDYGNLYTPSYPANHNEKNGELQEDVESFLHSDGAKYLYLKSIGKAPKLKMKKRGIPEELSKEPKLVKALESLEKEYGSLDKLDEKGKERLSRVLKETYAEKLKAKGIRFHRLIESAMKRIESGDRAREVIYEFKKRNDMIPDKDSFISDLNKRAKGKPFESWKESFRNEMLGESIIRDTGKPANLENITAAMIGGLKNAQKNMFSGYGIGNIIAASGKKLKSLDEFHKAEKNLNPEADIEDGMSKSKEFLDLKSEVESFVNAITPAYKWEGSYNAISEAYGVLVDYVKGKKLSSALAAHDFTEPPEAETIAKRIKEKVKDLPAKYFEAKPQRAVYFNEVKGAILPKNTPKEIKDYLRTQGVKVRLYDPKVEGQREEVTNKLQ
GUT_GENOME286979_00142257-795NKLQNVNNNETKFSLDIPIEETKDLVAIHNTTESKLLSALELGGLPSPSIAIMKAQNISANNEFGDISLVFDKKTIDPQESNANKVYSSDAYTPISVKAEHKLNEKKAWDLYSKINNLVKQKLAYKPNASLFQPDNFKDQVDSAGSIAELVNKYKNDYAFKELYLADKSEPVRDIVQREKKTTLTSEDTDVFDFLNDNIKDTLQEIENKPLLPSKLWVERYDSKIKQSIANYYKSLIPGISDENIDNIFNNSDEIKTAFQRKAFVKKAIDYLKNGAEKVELVSDDEATHNLIDDKINQKEYESWLNDLFDGVVEKKGIWKGNDPFTKSGNRKNWESLYWDYNLENIVKAMNNQNAQGGNFLVSNIIGGSAKKYNNLDEVRNDKSRLQNINDEEYNQIRNNLYKRFQEIAQSMTKNDNPFAVADIIVDGVAKTETKSGLANYLKTELKGWANYSDMAVDDIWTLVNDIRALPTSYFEAKPQRAVYFNEVYTAVIPDNASQKLKNALKNAGVSYAEYKANNEQSRLDVVNSLEDVRFSRDV
GUT_GENOME222484_014281031-1618DAINQSVAQEAAENNNRNVTKFSLRKAVEEKKDLIAIHNIKAQDMDRALQIGGFPMPSIAIVKDDAGHSKYGEISAVFGKETIDPAADKRNAVYGGDAWTPTVPRVDYEVNNEKKNAIEDEIETLARGVANGIFARASVLSSMGIDEETSLSKRELAEKLAGDDAVRAAYLAAQGKDIKLEYKEKVYDKYGNEFLQKITDHYGEQELAKLVVKLELGEELDEESISEVKDIWKQDQIKKMEKQFRKKTPEEIAKRAELRAEKTVGSYEAGSFIRHAWEMLQDGGGVSEEIDRFATSDKLREEAGHKAVADWAEGKLDGLLGEAGVYNGKDPYTPSGSRRSFKELHYAYTLENLVKAMSRTQEERGEGLWGASGEGLVSVATPKYKSIEEIRKDKGRLQKMSDEEYRENLKGLDGTIAEIIQKVKDGNEAHSDNPFTEENIIETVLLEAAWKKGKPTAQSVKKHFAAEGYKITDSLAKEIMGLYTEAATIPTGYFEAKPKRAVGMEEIRALIIPDSIDAEMKDRLKEQGIPLLEYADGDEADRLRVLNSVEDVKFSIRKEDYDLLKEQNIQLKEANDALMQQFEITKEE
GUT_GENOME275688_006541266-1796LKDSVEEKGELIAVHNLSEEKLLKTLSLGGLPMPSIAVLKAKEGHSQYGEISLVFDKSTIDPQRSSANKIYSGDAWTPTYPSIEYKPNEEVADRVRDKYYELYRKHGNEVTRPLYAYANDAAAKLKDEGGEAGIMSRLVDDAQMMNLFRLDTTGKKTEPVYTETKTEISQEQRAEYDRIIEALGDETVKAIITPPGESPATHKKEFLKANNEALSRIFPQDMKPFAMAAKVRKAADYLINGPVKVSEELDYAATEKAIREATNESDYMAWLHRLFDGIEEKTGIRNQKDMFDKNGNRRSFEALHWENSLENIVKAMNEDVQKGGTSLFSGVGIWGVSTKEFNNIDEVKADADRLINISDEENAAIKQGFGARLDEIARKIVDNGADNPMIALDNAYANIVDAVRNCDTREDMLAYLKQYAPKATAATVDEIISLVNDISEMPARYFEAKPQRAVEFGEVVKAIVPDNISDELRTALEKAGVAISEYVAGDEQSRLEKLNEDDSVLFSKKDNDPTLSYYGEVIAENRIFRQI
GUT_GENOME285554_003122975-3503LSQRFNEKNKDVRFSLKDEKTMFGMHNISVDKLRKAIKQGGFAAPSMGVVDSKNGIYSDYGEITLIPKAEKLAKRTGKNAGTFTADAWTPTYPQVERIMNKQGEKAFNTDMNVKLGDVDNGIYSNVRESWKGYLSSGDVRDGLYWHYLFDKGMNPETIYQTGKYDNDITNEVMRISDNGNKTDYTDKEVAELIQLMNKATGKDNDVDAQREKLKARIASAEKQGNHLLVALKKKRLEELEGVENFYIAADFVNDVVRNNRKNGKVDVHDTMGAAKKKVEDNKKLSDDFPSWLDKKTEEYGVEEMLYNGTTPSGKPKYIPNTIDNAVKLMKKQGVAGGYTAFGSELGVFIAKNSPEVNTLAAMKNAKDKLIPFGDERHNQIKDKITKEFLELSDEIRVGSNNRYAFDDSGVSRMVELTDHKGNEKEYLKKAYNVEVSDEWMDRYNKLLDTIKKDYKVFYFETKFMRPYGLDEFEKAIVPNDTPSDVVDALKNAGIDVSSYERGNAEDRQKVTMDAINSSDNIRFSLAGER
GUT_GENOME254953_008651726-2249RFREVEEKDGSKSLVGLHNISEDKLRKALRLGGLANPSAAVIDISKQTHEGYGEISLVLPSSMIDKHTGRNAGTWSQDAWTPVYPTIERQFGENGGDRASEDIMSVPKEMQNETRKGINAWMDGRSGRQLSYLYLQEQGKAPELVKVEAKYPEKLHREVKSIMGDNSSLYELDNGKRMALLDLYIGEEFSGDQKAYEASIQEKIQRMEEKIKKGNNPKSLVVKRAQENLDSLLERGYDYRQLSDFVENVLKDAKYAGSVNDSATSLKAEEYITDNGLQKDFDRWLDGLNDRYEVKEVIFDGFTPSGERKYIPNTLENVSRFMKKEGANAATGLSTSFNRFAAGLLNAHGSLKDIRKEKDKLTSEHSDVDAFRDKWSDVFYDLGMKLQPDAQGYDDYGLARLMEAAQSNNPQKYIRDEYGIEFSDADAARLQEMIDAIRNEYPAMYFETKFERPVYLNEFAAAVVPESISQDMKEAMERAGLEVFVYHDSDEASRNGAVKKASDIEGVRFRKGKSPEPFSSASQQ
GUT_GENOME106015_002691705-2286RSTASVSDDSVAEKLPPVKKRFSIDEPVERTKDLIAVHNKDWSVIRDAALSWGGIPSPSVAIVDAAEGHTKYGDTSVVFPRATIDPEADPRNKVYGGDAWTPTKDNALVEREVNYEARRAFDENIKNLSSQFAGGVFQGSGTLGKIGLENETRWEPEEIADKLANHPEVQAAFLQSEGKSLEPVYCDKQFDRFFSNATIQRYLDAVGEQEVARLAVKLMTGERLTAEEMKPAEQAIREVYAEEHANFLNRRPESKEKRIDYYMKNNVFPNRVEDFIRSTQEFYESGGSAGEIDKEATAAKMMEMIAPGGSWNDALRTVKDWVQPQLEGLLGERGIYNGEDAVTDSGRRSFAQTHWDYTAENIVKAMNMAAAKGANMYGVTPETLAATATREYRNVDEMHADEARLRTVSEEEHEKALRDLGIYLDRVVNDLMLTTMHKYDNSFEEEQNLSGIIAEAAKGKKTVAAVKAAFRKEGYAISDGHAKSILALIDRAANIPTGYYEAKAQRVVPFSEAAAIIAPTSAPAEEIAAVKAATGVNIIQYETGNDEQRKALVNGLEGVKFSISEDSNGRQLTEAQQEFFKD
GUT_GENOME072607_013351553-2058EKGNENTVHRFFSLRQSVEETKDLIAVHNLSEKNLLENIKLGGFPMPSIAITKANNSYDNFGDISVVFKKDTINPSMEENKVYSGDAWTPMFPRTEWKLNEKAMKKMADMFNTSTNYIEQYVKDPETAVEKLKAEPKVKAAFVENEKSDIEKKTKMPDYETKIFSNEASRQFIEENNITVDDIMENDEIKRELANKVYPPKEGEKNLYKRIRENLIERLNRYKLEDDYEIFKGNVEPIFDQTSYDEAVDEYVNKHSEQYTKYIEDALENVYEDKYLVKEDVEPLKADGERKSFNELHMPYEINNIVTLMKKQKKGKGGGFFGGAGNLKGAATETFTSIDEIRRNKYKIQNISEDELRKKYHSLNSKIQEIDNYILGEENDGVGARLRKTENINETMVEAFALDKFTKSNVKKKFNDSNIEITDSVYEQMKELRDELKEIPVQYFEAKPERAVSTNEIAYVVVPNSVSEKTKEALRKNGIEYKEYAAGDKEARKKAVNSDPTVLFQN
GUT_GENOME022740_003291227-1747LRNVKNENVKYQLGDNELEETDNKELVAIHNLSEEKLLEDLNLGGFPAPSIAVTKTNVGHDGYGDISVVFRKDTLDPKINKDNHVFGADAYTPRFPQVEYQLNEKAVKSLADKLETSESFLEANLSDKGSSKAMVQALKNMYQVKEAFIKDQNIKLETKSKEMPPRMSVSRHETAAVKAFLKREDVTFDKVLNDSYYREQYLDAVSGTKLKSIAAKRKEKVNMWLDEWKADPEEYVKKKAEFEKDQLVARGKAEPEIEKESYEKSVVRTAGEHSREFDSYAEKMLSPLLGDKFIRRNTDNYRSDGSRKSFSETHRPYTLNNITETMKEAGIKNAEGGFMGLTGGLGELRAAISQEYTSIDAIRKDSGRIQDLSEVEIKQTFQKSEELISKITNALAKGENEFSARDNVLTNLVEAFRDRKTERGIRSKLRTYYDVSDATMSDVMELRRELAGIPVKYFEAKPMRGVTLDEIAAVILPTNASDNLKARLNQEGVNTILYDPNLPDARKKAISELKDVRFQID
GUT_GENOME171445_000381289-1652QERPLISLHNTTAAKIRHIAELGGIAVPSIAITRADVPYNRFGNITLIAGSDFINPKKKENPTFDTDVYSPRYPQVYDYKLEKGAEKELVSFFTPLVKKFSGEDIGAGALADLKKALAGGLDGVLSKSGLDENETAQKAWFATKGLTAKDLPSTEARYKYWIDNHSEFKEWLAPLYRAIPKQARILTGYTASGRQRFKDYTLDNIVKIMRGKARNEEGMYYGVGNIRAAHAKKYKNLTEVQNDRNRITSHEDMETWKEKLNEKWESLTQKAGGAKAEIIADIVLNSPKSKWQNELDKENITGINANALSDFVNDITSAPAEYFEVKKQRAVGLDEFYAAIVPKGTTRETRKILRDAGLRVVTYD
GUT_GENOME167201_007252735-3242NGRVKLTAAEGMKEDRKELVAVHNITKDKLKQAMELGGFPMPSIAITKAGVGHTSFGDISLVFDKESINPTDKRNKVYGEDAWTPVFPGVGYKLNDKKTAEIYRRANNVAREKDLPFYNPAMFHPDNYINKVGANGVTDLVEYFKNSYDAKQMYLAEMGNAVTEYVKREEEKYSPKHIRLYEKMLEEIGLERLQNESLEELRDEIKRLYKEYGDVDIDQKPQRVERAYLDGAKRKAVDYALHGNRNVVDDVFATQREIDGRIDQQQFEEWLGEMFSGIVEKKGVRNDKDAYTPSGNARKWEQLYDEITLDNVVAAMQKQAAKGGEGLFGRNIFGSAQAEYKSIDEIRKAAKERIRTINEEEYQTQRSAITDRLSAIEIPGSGSGFLGSMEMEQHIHDAVSKSHTAKGIHKNLKKFYPGITMETAEEIADIVQDIQHMSARYFEAKPYRAVGFDEVKLAVVPSDMDAALKERLEQMGIEVRTYASGDQAQRKRVVDEATEELRIRFQLV
GUT_GENOME157762_000441091-1614TPENNEKFSLKTTVEETKDLIAMHNLNEEKLSKTLELGGFPMPSIAITKSSIGHNGYGEISVLFNKDTIDPKNSDNKVFGADAYTTRFPQIDYKVNEDELHNLAEKLGMSTSMLDSNAFNGKDVYRAAESLKQLPDVRNKFIKENNIKVEPVLRMPQAHPSFLTKDNIKNFIVGNNITLEQIVNDSNVREQFLKKVQETAGTGRIAAKWCQKIKDELEGMADNDQIYTILEQAWNDNLDIIYGKAKPVEDAYSYSDGVNNAIKEHMDEFDLYALELAQPVIGEAGIYNGKDPYTKSGYLKSFEELHYDYNLENLVKAMKSKGVKNEEDSFVTDVNNVRAAVSKEFESIDDIKKAEERLQDLTPEEIEEKYSNVRELFKNIVNTLSSIQNASSGNSLIDNENAAHNIVDAVKGGLTENKVKKELQKYYKNVSDQTVQDVITLAKELRDIPVKYFEAKPQRAVSLDEVVAVAVPNNVSEDLMKQLNERHIPTYIYEHDNTENRKEIVNNIAEEKNIKFSVNVDTNI
GUT_GENOME117166_009581165-1722FGKLLNVNESKTDNVKFQLKEPVEETKDLIAVHNLNADKLMKMLQYEGIPMPSIAVTKADMGHEDFGDISLIFRKDTIDPKNRKNKVYGADAWTATFPTIEYELDDKRISQIKKEMDGWAQGKIPERYLNQAQTFVGTLASNISKYGGEEGTVEQAIKNTDLQAVYLASKGETIQERIKEVRNEMSAQDKSLSKALIDEFGDTITATNGWALRKIYDSYGDKIRETWVKQLVRDGAKKETAEQYVKKQKGFVVASKFQTAQKYMKNGGVTVRTETDYAGMAKEIRLKINEKEYRSWLKNLFSGTVRDSGVSNNKEPFYADGRKKSFKQTHMPVTAANIVRSMLSQSDDARNVSSFHGIKSIRAVATEDFKSIDQIKKASGRIQQLDTEQYSSLEDNLSTRLTQTISDIVEASGNSSNRFMQMDSVGECILEACSNPTQANIKKTLEKYEWKVTEAQAGELAQIIKEVQEMPVDMFEAKPQRVVGYDEIAAAVVPDTAEDEVVAALEQKGIPVLLYGDGKEGDRKKKINSLEGIKFQLEEVEDEGQLIRNLQKENKHLR
GUT_GENOME180442_002321180-1708PIKNRFSLNEPVEETSDLLALHNLTEDKLLNTLKLGGFPMPSIAVIRDQMPYEEFGDITMVFGRDTIDPQVNSGNRIYGGDAWTPTFPQVEYKVNEDELNRIEREVDDLVGGRDIRRQLDRTHSLDVTNVTDDLNRYNGDLATAYRYSNALKYAYLRQLNADYSLPYRYAPLGARSNMDDEIIIQLADMFGIDAIRQMREGGYRYYDSHKEELEKILDLVNQHYRETYNLDENLYDEIPFNTWDSLTADIYSYLRQGPKISVDTDAVSKFLADNVNQTEYENWVTNLFDSIVAKRGIRNSKDIFTPSGKRRNFEALHYELTLENVVKAMKNAAQKGGGAVLGGKSIWGVATKDYASVDALKQDKSRLQNIPDEKYQEQRQKFTERLHDLASEIANPKANNRLIALDDATETIIEALSKRKTASGINNELRSNKTLNIKNDTAEKALELFKDISNMPVGYFEAKPQRAVPFDEVLAAVVPNDISEELRNGLAKAGIQTIEYEAGDQLDRLAKVESVEGARFSTDEDDDQP