UHGP-MC 78707


Information


Number of sequences (UHGP-50): 98
Average sequence length: 302±45 aa
Average transmembrane regions: 0.18
Low complexity (%): 7.13
Coiled coils (%): 3.2
Disordered domains (%): 17.63

Pfam dominant architecture: PF10123
Pfam % dominant architecture: 81.63
Pfam overlap: 0.88
Pfam overlap type: equivalent

Downloads

Seeds: MC78707.fasta
Seeds (0.60 cdhit): MC78707_cdhit.fasta
MSA: MC78707_msa.fasta
HMM model: MC78707.hmm

Sequence list (filtered at 60% identity)

Protein Range AA
GUT_GENOME031271_0050213-333APDAILWMPKGEHIISASKNGKPYRGKVIVTAAAVPKLQADLEAKLARNVKPVIYYDHKRGAVAYQPARFEWDDDKGIVLCGSWTRGGRNAVEGGDYGYFSPAFLLSKTGEVAGLPRDIEVGSLVNDPAFEEIERIAAAFHDWTDEELARFADIDDFQENDNNESLKAENQPASTSNKNMDIKQLQDIGILTADEVKAANAETIALDKLKKMKSDSEAKSEELEATKKKLTDKEGELTKCQKELDGYKDEIKAARDNTEAQAKAKVEEAVKAGKIGPKDESAQAFWQKALEDDFVTASKQIDAMGANPALSTEKVAGGKTV
GUT_GENOME277288_0059228-346RLLGTGRNELVKDGRDYVLELSPETLRAMAEFQASKEEEIPIDSRHALYLAAKVAGVSEAEAARQVPEGTAALGFGRLESRPDGLWLVGAKWTPLGATLLRQGMARYFSPVVRGLEPGGVPRVTSVALDNVPALTGLEAIAASGETKEPKSGGKTMKKTEEALRGLLEDAALALSEESDAEVAERLAALRERLASLEKELADIQAELKAAEDAKAEAEDGRKTLEGKVTELELAAESARKRSLVSAAVSEGRVTNAQAGALLALGEAALSEFLKASPAGSAVPHGRAESVRRGVPLALGAEELEMARRMGVTEDEMRRA
GUT_GENOME245002_016116-261ILKVGKFTDSKGIEHEFTTEKLDKIVHNFEAVHKDVPICVGHPKTNSPAYGWIDSVKRIGDNLYCTFKDVQDEFKQAVQKGLFKNRSVSLDKDLNLRHLAFLGGACPAVKGLEQFCFEADETALDIELSDFEDLTTANTATPDDIAKIKEELAAEQAKNAELQKKIEMQKEDAKKKDFEDFCDKAITNGNILPKHKECVINILTSAADTGTFNFADGEEDAVQSVKNFIGELKMFDFSEELQDGKDNANFSDFTAE
GUT_GENOME216302_0274738-398LTPEAVDCAMAYFAKKGQKIPLDSRHFVFYLAEKLKVDEAEVRRLMPDQPGTFGFGDLEGRPDGLWITNIEYVPLGRELMREKIFRYFSPGLRGLDPLTMTPHTGEIDPFRVTSCALENEPALNDITSLAAGANDSECRVTLAAVNNSLRRLQMTKLEQALAKLLGRSSVALAATDAAATEEIASAIEKLGKLLAAAREGLRLPADAAPDAVTSALAAALEKLGATDELLGKIKQALGLPADASADAVEGAVAGVKSGSEQTAALSARLRKLEDEKEADAVEKLIARGKAEGKLTEAMVSDWAGKQTSAALASFLKVAPVAVPLETLSERPERPAAALSANAKTIFQNLGLTAEEQEKAKC
GUT_GENOME018611_0313431-354ENGVPIEWTLLRIGENPIIEEGEPGSITLTAADMRSILDYYNAKGELIPVDSNHYLHILANRKRLDESEVLKLLPSGVAAMGFGSLALAGDELRFRVKWTPTAYELMREKIFRYFSPALRGMLKPPLRLTSVAMENEPAINNLDALAASAHKPSDQSDKSAQSDRKETSMSKLNNALARLLGRDSIALGAETKEEEKDAIACGVEEKASLIEQVKTLLGLDAAAGIDEIIAALKAETEKAKTADDKQQQLDELAASAEKREHERLVTRGRAEGKIADSDMEYVNSLDSKALSAHLAHTARKVPLQRIPASPRKSDSVALSAVDR
GUT_GENOME271614_010274-272IEIFKAGSWTSAEGQTIRFTERDLQRTAAAYDPAKFDAPLVLGHPKTDDPAYGWVKRLSYAAGKLFAEIDDLEPAFSDAVSKKRYKKISASFYAPAHPGNPVPGTYYLRHVGFLGAAAPAVKGLKPVSFADAASLTFDFAEEIGNSEEAKAAGILGKIKELIAEKFGDDDAEKAVPAEDLAALEEKPAAAEETSEKTDDPTKKDEDEDEDKEKEKQESAFAEREAALAKREKELKHAENEQFLDTLISAGRLMPGMKNKVLNFMDALAN
GUT_GENOME222179_008221-227MSDATVRIAIARTGRFTDSQGRPQSFSEADLDAIAASYEPGKLEAPLVFGHPAVSDPAYGWVKGLKREGERLFAQIAQIPDEVRKLVQDGRYKYVSMSLMPDRKTLRHVGLLGAAVPAIDGLGPVSLAGDNDAVTINFAQGDGNMSTLEELQSRVTALEAERDDLKGKLEAEKAVAAKTASDFAAYRESVEVSARESRIDALIKEGKVKPAERARCLNFASVLAKSG
GUT_GENOME073793_012655-226IFRAGRHRTITGQEVEFSEADLNAIAAAYDTAVHEAPLVIGHPKTDDPAMGWVSGLKCVGLRLEADFRQMDPAFAEAVEAGRYKHVSAAFYAPDSPCNPKPGGYYLRHVGVLGAMPPAVKGLGPLNFAEDDTLFLVFGEEEAAPPEGMPSADPLNNEQNEDPMDRKTDPAAENAALRKELEEMKTRMAKQDAERRHADNLAFAEGLVSAGKLAPAGRDVVVA
GUT_GENOME251946_027819-307APERIPLIPYGRNTFTRDGKSGEFNFTERDADAIIAEFTERARDLVIDYEHQTLAGSEAPAAGWINELVKTPDGLEAGVKYWTDHAKARLESGEYRYFSPVLIFGSGGGPAALHSVALTNHPALHGVPALVANDFKPAAGMNPVAQTRKDSTMNQTEQALRKLLGESTLALSDDTDAVVAGKLLALAEELPGLRAAAAERDQLKQTQETSRKKLLLDDATAKNKITNGVREALEKLPLEQLSDILDKLPENGSGVPTGKLPEEKKDEEKDKAAALTDEEKDMARKMNLTDEEFLAVKKN
GUT_GENOME153989_0044020-326PPAKIMLIPYGEVKSTKGNFIFDDEAAHELLARYENKKNDLVIDYEHQTLYNVQAPAAAWIKALEYRKGEGLFGVVEWTERGGSYVANKEYRYLSPVIMVRTSDHRAVALHSAALTNDPAIDGMIPLVAKNDLEDEMEDNMEFLKLLAQMLGLPETASETDVAAAVKALLEKESVSINKELRDLLEVDDKADINAIKGKIIAMKNPAGFVKVEKYQELQTQLAEKVKDELVTAALKDGKIAPAQKEWAETYALKDPEGFRGFLAMTAAGAVVPTGEVAGGEGVKPKQDADDQEMLIHKNLGLTADDL
GUT_GENOME243585_0037440-371SIQLTPAGHFKPSDGRKMDVPAWYIDASVAARVIERFNARVNPAVVDYEHQTLHKETNGQPAPAAAWMRALQWREGSGLWATTELTPQAAEQIRTGAYRYVSPVFLYDKTTGEVLAMQMAAFTNDPAIDGMQAMELRAAACFAHDIDPDKETSMNPLLKALLAALGLPESTSEADAIAACSSLATSLGVLRRLQIELGAEGEPAIAACSALKTKATATPDPAKFVPIAALEEVRGQVAALSAERNAEKVTGLVEGGLANGRLLPSMKEWATELGNKDLAALSSYLDKAAPIAALVGTQTGGQKPAAAANEHGLNAAEMAVCSATGITPEDFA
GUT_GENOME026227_0067620-343DGDVSAEVQLIPYGRFSGRDGRRFSMLDADRVIDNTTRRFHPNTAANTLDLVVDYEHQTFLSGKNGQPAPAGGWIKRLINKGEEGLWGVIEWTAKAREQIKNREYRYLSPVFTHDKEGNVLALKCAGLTNYPNLELKAFNKLEDNNFNNQTEDKVDLLQKLCQLLEIPENSSQEAIFAEVEKLKAGKTGAGDTSLNKIAEALGVSAADTESIVTAINKAKTETIGIAEYMALNKEVEDLKSDLIDTAVNKAVSDGVLLPALKEHARDIYKNQGRAAFDDFINKLPTVALNKSEAPAGTPPSSGASLTAEAVAVCRQLGLSEDDY
GUT_GENOME234931_028461-374MKITEFSVMEMSEVERTADGLPKAWPIFRAGENIVTRRDSMPVNLELSQEDLQSIAAYQRAKGTKIPIDCQHVVSNLAGKLGLDEPELLKRLPRYSGVAGFGNLEARDGALYLTDAEWLPIGSEVMKAGQYRYFSPTIRGLDGKSPLRVMSVALTNNPCLQHCADLAASEDEEVTPEEVRAAQENIIKHKEADMPEELKTEAAVAAPAAPEPEKKADEELLAILKEVLGDDVTPENLKVTLAALKTKAEATEELSEKVRALELSEKARREREERHFRKEAEENGFRRGVLTPADLEKPYIKNMSAAELSEYCSSLPDGCKVPVKTLELSETEGPQRGMERAHLTDVQKAIYASTQNAKKHERVTRCLRLSRNTR
GUT_GENOME096244_0003919-329RIQLFPAGWFGAPSGGQRWFLDAALAQRLIDAANSRVNDYQFDYEHQSLNAPQASGPVPASGWFKSLTWVEGEGLFADVKWTARAASLIQADEYRYVSPTFRYDEQGNVRELVNAALTNMPVLDGMRQVAASLMFFDNGEKPMNENLRLVLCAIMGLKQESDEAAIQTALEDLQNNQLREANCSSIGALIDAHKAQLTAKDQAIQEGQTKIAALSAAQTGAPDPTKFVPVAVVDDLRTELASLSSQIQGDKVETLLTAALSDGRVMKGADEDNLRELGKKDYALMEKMIGTRKPIKALSRLQSDGMTFEGD
GUT_GENOME120660_0106518-350PTEIKILPLGRVHSQKGDFNVDDESFELIRKQFKDRKLDLVIDYEHQTLSDVQAPAGGWIKDLYKGEDAIIAKVEWTPKAAEYLKNKEYRYLSPVVLVRKRDQKATAIHSVALTNTPAIDGMFALVNSLDIEDISEGGNIMDLKELAKALGLPETATEEEIKKAVEDAAKAAEKLKEMDGKNPGEGDGKPGDGEPKPEGADMVANSTILSMLGLKADAKTEDVAASIMALKAGAPDTQAELLALKQRMAERDADEEVQKALKAGKITSAQSEWAKSYALKDMEGFKGFVDKAPVVVPQGKLDLKDAPAASNSDEVDVAILKNMGVSMEDVKKY
GUT_GENOME181690_0147521-353ASGAPNWIQLLPPRTFSGRDGRGPYRMDDPEAVIAATREYFGNADIPLDYNHQTEHAAENGKPAPAAGWIKKLEGSSFYGLWGRVEWTEAGARAVAAKEFRYVSPVFYHDKDGRVLMIRSAALTNLPNLDLKALSAAQHGADNNDARLGAKEYYMNFKEAMAQSLGLSLDAGEAEIIAAAQSMRDAVTQTGKTLNASEGLAGLIRAAHEVAAKAARADKPDPARFVPMSMHESVSQELAALKAEQAKAGAESLVRAAQDAGKLTPAMTAWGTNYATENPEGFKAWMETAPDLRPGGGKGSGTSAAPPDRGALSLNETEKAVCAAFGLSEDEFQ
GUT_GENOME143385_026496-318WRIDAASAAAVIDRARARKTPPVLDYEHQTLKKEENGQPAPAAGRFLDFEWREGSGLWGRVEYTARAARMIEDGEYLYFSPVFSYAPDGTVLSILMGAITNDPAIDGMEPLARRAAATFGLYPTQEEISVDELLKAIIAALSLKEGATEAEAIAALTALKPALDAQAASLATLRETLGLAKDASVEQIAAATSQLKKADPSQKPDPAKFVPLEAVTDLQEQIAALTARLNGGELDRLVGAALQDGRLLPSLEQWARDLGGKDIGQLKAYLDKAAPIAALTRLQGRQPEGDTHNLTDAEMEAARLTGISPADYA
GUT_GENOME098682_0035913-329GIDLSGVPEVIRVLPKGHVSSTKGDFEVDDRDIAGIIRQFKARRLDLVIDYEHQTLSDVQAPAAGWIKDLYPGEDALMARVEWTKKGREYIANKEYRYLSPVVLVRKADQHAAVFHSAALTNTPAITGMFAIINSDVLSIEEEEEEPKMELSELIQLLGLEEETTEEDVLKRIKELVQQTGGEGQDGPEGKSGKEDPAKEGTQLVANKTVLDLLGLPENARTEDVTARIMAFKAGDSVLQQRVAELEKQAASQKAEELVGLALKDGKLSPAQKEWAVAYALSDPKGFAAFVEKAPVVVPMGKTAFAVDERKQTGVDW
GUT_GENOME029799_0081020-350GNAPDWIQVFPSGTFSGRDGRGPYTCDPAAVVARTREHNGPVDIPVDYDHQLEHTAMNGRPAIAAGWITELEARADGVWGRVEWTDKGKAHVAAREYRYVSPVYYYQQDTGDIQAIESVALTNVPNLTGLKALAAREPGLNLFTGDITMSFLKTMASVLGITDVEPTEAAVETAARRVMADVGAMKSAMSVMAEAAKTDDKTPGGIAKAVQSIAARAEHPDVSKYVPVETFNTVNAELSRMKAAQSLALVEQGKAEGKISPALEGWAKGAAASAPEAFKKFLEVAPDLRPGSRATQTVTATPPEGTSSGLDATAKAICRAMGISEDDYKKA
GUT_GENOME119010_0408129-326LIRYGKTAFTKGGIRGEFDFREEDADGVIREFASRGRDLVIDFEHQSLSGGKAPAAGWIGELYKNAEGLCAKVKYWTREAEKFLLNGEYRYFSPTLYFSRSGKNVSAIHSVALTNHPAMHMIPALAADDLDSAADDANREPQHTHNNLMKGKNMNGILQKLGLLSLADADQEEQQEAVLRKIDELNAAAEKVSTFLKLYDLKSLEEAESKLNELIASDAKEAVRAAFSDGKLTESMRQWAQDFAMNDLNAFQAWSDAAPRIVPDNKDTEEAPAREKEDAEELTPEENKIVRLLGLTEK
GUT_GENOME139705_0186228-223FEGIDGRRYSLNADLVLKNSQKMSVDLMLDKNHEDDEAMGWFDINSLEKRDDGIFASLTLTPKGEELVKNRSYRYLSPAYVIEAFKENGLMVIERIASVGLVNRPNLLSQALNNDETKMKGENMEKELAELKAQLKALKAENEALQAKVKELEAEKAKQEEEANKIAQNAKIELIDKAIENKELLPKRKEAALALN
GUT_GENOME128367_0166533-328LFRVGRNELTRNGKIHVLTLSAEDIAAMADYAARKGEKIPIDSRHTLFLAARKFGVEESEVRKLLPHGVAALGFAGLQARPDGLWAVDVEFLPLAAEVVAEGMFRYFSPVLRGLEPDGVLRVSSIALDNVPALNQLDVIAAGDCVTITEEVTMTKLQDALRKLLSDDALALGAENEEAVADKVLALADELPELRRKAGKADTLELAAENARRDDLIQRALASGKFCNAQRETLEKLDIAVLSDLVEKTPKDAAVPTSPLPEGKKDEADEPAALTDREKAVAKSMNLSDEEFLAAKK
GUT_GENOME136736_0061521-288PREIKLFPLGLVKSQKGDFLIDTESYNSILNHFKTHHVDIPIDYEHQTLLDVQAPAAGWIKDLTLKQDGVYALVEWTDKAAQYLAAKEYRYLSPVVSVRESDRKALLLHSAALTNTPAIDGMTAIVNSAKAGSPSKLQVDGAEGDTDKLPPESGELGADDPAEFINGLRGMLQLPDDAPFSDIENRIATLLQGQSALKLEVNSLRFEAHKYKAEEAVTTALKAGKLMPYQKDWAFRSAMSQLEAFKDWAENAPQVVPMGEIAYEPLTL
GUT_GENOME056035_0176020-353IKLLPLGHVRSEKGDFEVDQESFDLMRRRFLERGLDLVIDYEHQTLKDVQAPAAGWVKDLILQEDAIAAKVEWTPRAKEYLKNREYRYLSPVVLVRRSDNKAVVMHSGALTNTPAIDHMFAVVNKGSDGEKEGGTGMETLLKKLAALLGLPEEASEEDVLKAFEDRIGGGNGEGNTPDQKKKEGSGVETEDGAAGTNLEEEKVVANKIICSLLGLKAGAPTSEAAAAIMALKSGIDSTAAGRLRELEEKLAKRDADDAVEMALKSGKIAPAQKEWASAYALKDPSGFKDFVEKAPQVVPMEEIQFATDAAVLKGAHPEGLDRAAKAVFAQLGIS
GUT_GENOME103712_0306450-404LLPVGPFKARDGRPFDVASGHWQLDGQIAAALIARAKALGQDILIDYDHQTLKTDQNGQPAPAAGWYNADEIEWREGQGLFIKPRWTERAAALVAAKEYRFLSAVFPYDAQGRPLELRMTAITNDPGVVGMQALAALSALPASSHMSTQPGQLATSSHVAQQEKSMNEHLIALLGKLGIQPGADGQFTAEQGTAALAALDTLQASAASAPKLEAALTATKAQLAVDSASLAALKATVATGQGGQIDLAKYVPVETYNALVTEVATLSAKVETTDAATLIKEARTQGKVVAAEEEYLTAYAAQKGVAALKALLEPRPAIAALAASQTTQVTLPDRAENAVLSADDKYAADQLGISY
GUT_GENOME135620_005082-241KYFEVFKAGNYPQGKFTKEEVQELANNYDPSFCEAPITLDHEQKGPAYGWVDKLKEEDGLLKASFKNLSPELKDFVSKGKYKKISVEIYRELEGKKPYLKAVSFLGASIPQVKGMQAVEFKEGESDIFEFEVQTNDDNIETYSEAEVEELKNTISDLQTKIAQFKEDAKKQETIKSLKSQVKDLTVELAKFKDEAAGKEELAKELKEIKDNLRNKDFNEFIDKQIEAGILTPANKDAVFS
GUT_GENOME018735_0232128-330LLRFGRNVYSKNGETGEFEFTEADADAVLQDFAARSRDLVIDFEHQSLHGGKAPAAGWIGSLEKTSEGILAKIRYWTREAQEFLLQGQYRYFSPTLCFSRSGKRVTAIHSTALTNHPAMHGIPALVADDLPGSASDEASEAAVPGIINETSSKLKGKNMNQFLTRLGLEAFCDADEEERCKAIEQRLCALTDMEKSLQDFLKINGLTDLANASEKLRTLHSMNAEKAVCQAFADGKLSENMRPWALRFAENEPAAFADWCKAAPRIIPANTGIPQTDAAPYNQTAVPGSEEARILRLLGLSEQ
GUT_GENOME208168_0153130-263NGHHSGAFRVDKLDIQRMKENFDSRKIDLVVDYEHQSLWGGEAPAAGWITEVWSAKDDSELWGKVKWTDRALEYIKNEEYRYLSPVFNFSAVDQKTGANIGVRLESVALTNTPFLDELGEVKINKNLNQLIKEQNMAEKNPNMQPEGQGTDNSAQYEAQIAELKNQLNASQTELESLRTQIAETKVEAAIVANKIPISQKEWALGYAKANLAGFEEFLKGAVVPQQKLNLPNDM
GUT_GENOME067288_0132918-350PEVISVLPLGHVVSSKGEFDVDAESLESMKREIAKRGVDLVVDYEHQTLKGTQAPAAGWVKELFLKDGSIRARVEWTPTAAEYLKNKEYRYLSPVITVRKSDGKAMGLHSIALTNTPAIEGMSPIVNSDSYQEGGQNNMNEFLQKLAALLGLGEDATEEDILEALKKNAEEMKTLKENAEAAKKQQGSEPPAKEDETVVANKAVCELLGLKAGAPTDDVTAKIMELKGGTIDGVNVLEKVKQLEGKLADRDAEDAVTMALKGGKITAAQKEWAKSYALSDPKGFASFVEKAPQVVPMGELNMNDDVKSLKSAAASEETLLVCKALGVSKEDIE
GUT_GENOME062831_0256942-328GRPASITGGEVLDWQMSAAIAADLIALFEAGGKLLLYDYEHNSLYGDSRAAGWIDKLVHVPGRGVYAHVEWTPKAAESIAKKEYCFSSPYFHFDPKTGAVRQLISVALTNNPALGDLGAVGLKRAASFSTASTAKEADMADEKQVAALTVERDGLKTEVAALSAERDGLKAQVAALTAERDTLKTKVDAAEQEKTEAALAAEQKQKDDMIAAACSGDKPLMTPAQAEPVRSLSVAELTKFIAALKPVSLTQRQADGREGGHGLTADELAACSRMGVSPEEFKKAKKG
GUT_GENOME060268_0190523-356NTDGLIKIVPKGQFAPVDGRTDTGVAHWTMTASLAQQIIAAFDAGQTDLVVDYEHATLKAAETGQQNPAAGWISKYVWDDDRGLMGEVKWTQRAKDMIDSGEYRYLSPVLEYDTLGNVRGLHSVALTNSPALDGMALAALSRQNSINPKQETSMNKEALIKLLGLAADADDKAIEAALAEAQEKLGGKTLTEALAEHKDEPQGGEGGKGDAGKPEDNPQGGNADDGEVAELKAQVAALSKKVIAMEVGGTSDGLIRAALSDGRLLPHQEASARQLAAKDPEAFKNLMEGSLKLAALSKTQTGGKGAEGGEPALTPEEIEVAKQLGISAEDYQKA
GUT_GENOME177056_0072717-350PTPDGWQQLLPKGEFRSRDGSPTDVAHWFLDEAIASRLIDLVRNLKQDVLVDYEHESWFKAKQGKEAGAVLAAGWFNADEMRWFDDEQRQGLFIKPRWTPKAYEHIKNGEFAFLSAVFPYDEQGIPLEIRMAALTNDPGVTGMQRLAVLSANLPTPTQENGRMDLLKQLLAKLGIEIAEGAEPTDEQTQQAKDALDALINGKKDAEEQVATLSAKATQVDLSQYVPKATYDAMASQVAVLSAQSTETQIEQMVTQARNDGRAMKAEEDYLKAFGKQQGVAALSAMLDSRPKIAALSAQQTTQVAATKAEKGTAVLSAADKEAAKLLGISEQDYA
GUT_GENOME080289_0000918-301PKQFLVLPLGLVHSQKGDFLVDNESYNQILKDFKGRQLQIPVDYEHQTLQDVQAPAAGWIKELVLKRDGIYGVVDWTERASEYLKNREYRYCSPVIQVRKSDRKAVLLHSVALTNKPAIDAMIPIVNKNGTKQPDQTGEPTEEGAAPAGESSETDVGALLEMLTELLQLPASASMEEIFQAVAGLIQNQTSLKLKVDSMEFETYKAKAEDVVELALKTGKLAPYQRDWAFKNAMNDVDDFSLWLKNAPQVVPMGAVDVLDAKPPKPKSRSHELLGLSAEDITKY
GUT_GENOME244346_0106854-241DLFLDKNHEDNEAMGWFNELELKNDGIYANLTLTPKGEELVKNKIYRYLSPVFDAEYKDRKIKVTRILNVGLVNRPNLLKKALNKEGEKMPEENIESLKTQIADLKKALETANSTIEALKKQIKDEADKVKREKVENLVKNGLMLGTRKDTALGLDGAAFDSYIDVCKVEANSILKKKEFDADKQKTD
GUT_GENOME142035_0338161-398GWCQLLPAGRVRARDGRPKKPAAGWLINKTSCDRIKANLAALKQPLLIDYDHHSEIAQEKGVKAVAAGWVKAEDVAWREGQGIFIKPTWTPQAQKHIDDLEYAYLSAKLEYYVDNGEPASIRIASLTNDPAITGMKSVAALSADDLYVTTPSTELTPMNEQLRQLLAALGLTVPDDGELTPELGTAALSALTDIQAKAGKHDELKTQVATLSAELDTAKQASTTVTGDVDLTKFVPVETYNALRTQYVSLSAEHGTASLEQVLDKAETEGRVFKSERTYYEGLGKQIGVAALSAQLDARQPIAALTAKQTDTLPSPKKETVSAALSAEDIQVMKMLGK
GUT_GENOME190552_000774-326EKWIEIARTGTFEDSAGRLRTFTAGDLDAIARSYDPAKRDAPLTFGHPQTDKAPAYGWVEKLKSEGGRLYANFSQVPEQVRDLVAKGHYRHVSMSLMPDLVTLRHVALLGAEQPAIDGLAAVEFTDGGDAITVDFAAAVATKEAADGDSKDAAARGEGDTMTIEELQRQIGQLQGQLEALRAENASLKKQADSHKQEKDKAEAAKTEAEQKAEKASADFAAYRGKVEGERREARVAELVKAGKVKPAEKAGVLDFAAKLAAQGGTVDFAAPDGGTESLSLEERYFRDLEARPADERGAEFSAPPAHAGGQQDNINPAELTAKL
GUT_GENOME110771_0115159-420PYTRDGVRGEFEFSEEDADRIISDFEARGRDLVIDFEHQSLSGAKAPAAGWIGSLEKSAEGLLARIKCWTGEAAEYIRKKEYRYFSPTLYFTEDGEHISSLHSVALTNHPALHNAVPLAAKDSGASVPEGNSSVHPKESSSPGSAVPAHGPAPEEGSRKQKAENNPERDPGNQSPEEEDPPSASSGEEEEGGNCAERLIRHLMRLLNLCSKGEEEKDPSSSVTGKAEEAERIQAIQDAVCALIETRDKLDAFLRTHHFSDLEAASDEIARMLPEASKLALESELAERKAEELVSEAFADGKIPVSGRPWAKRFALRDPEAFRDWARSASPAIPDNSGIEEAEPGTAQSRARAHFTEQELEIF
GUT_GENOME127523_0078295-401IQLFPAGTFAARDGRPGNLRGVNATSWRLTAQDAEAVIAHWQRTATPLVVDYEHQTQLAAQNGRPAPAAGWITSLEWEEGRGLFAGVDWTDKARAHIRAGEYRYISPVFAFDRQNGAVLRLICAALTNHPALDGMDAASATFTPYEEPPMKQILAALGLPETADEAAALAALTTLRQERDSAKAQAEAAPDPQKFVAMATFSAVQKEAAQLRDELTKLRNEAQAAALKDDIEAALKDGRLTAATKGWAESLAKTAPDALKAYLAAQPPVRALAGTQTGGTPPAGDKPGTVSLTAEEQHICERLGLTR
GUT_GENOME140833_010588-353SLSQEINTATPGVIQLFPAGEFRARDGRPTECAAWLMTREIAERLIAAADARETPYVLDYEHQTLRSAKNGLPAPAAAFFKKLEWREGEGLFAVDVEWTTAAADMVAAKAYRFISPVFSYDKTGQVLEILNAALTNTPAVDGMEEVLLAAASMMATHLTTEGNTEMDEEFLNELLSNLRWMLNLPTASTKDEILAELKKIISLLSSGQDATAAASVSLLDILNQNAQSIAELSAKIDTPDPAKWVSVQVMHDAVQQAVAQAGSTSMAALATQEAESLITVALSDGRLLPAQQEWATALAKSDPGSLKAYIEKAPKIAALTTTQTRGQPPAGAPARESEPDGDDAID
GUT_GENOME142014_008854-229YIVCNLPINQNNEIRIAVIGDWQGHHNGAFSLSKDDLEQIKTNFDNAKVDVVIDLDHKTIYEGTGEAYGWIKELFFKEDELWAKVEWLESGLELIKTKKYKYISPVFLPNTIEQVTAQNIGWTLHSAALTNRPFMEELGEIKANNKQNQKKEESMMTPEEQKEMDDLKAKVKELENKLKEQNEDLEKEKEKSVETEVDNAIALNKVSAAQKETLIALGKANPDELK
GUT_GENOME218058_0143813-319KAGTHTDMHGKKLPFTPDDLAACVKAYDPSVHEAPLVIGHPRTEDPAWGWVKALSLSGVDLMAEPAQLDPQFAEMVTDGRFKKVSASFYLPDSPSNPKPGVLYLRHVGFLGAQPPSIKGLKQVSFSEQEEGVVEFADWQAITNASLWGKLRDFLIARFSLDEAEKVLPEWQLNSLREEAYRDTLSQDAAGAQFSETGPGPSSASNEESSMTKEEIETLQEENRRLKQQAADRDARDAQVRQEQLHKDNVAFAEKLVAEGRLAPRASSVVVALLDAVAGGDKPVEFAEGESRTPLATAFRSLLSDGEP
GUT_GENOME122963_0192222-319IPTAWRLFKLGDNPLTLKGKAETLTLGETSLRRIVEYHKAKGCKIPIDSRHFISHLATKLKRDESELLKLLPDGTATMGFGSLEFRPDGLWLSEVEFVPAAAEMLKAGTIKYFSPTIRGLDGSEPLRVTSVAFDNEPCLNHIPELTKVLTLAELSAAIQTLEKEAFMPETSKMTAEDQSAAQKRDAVVREVLNLSENDSPETVRGKLVAIQEQAKLVPGLNERIKALELAEEQRNRETVLNKLLETGRLTNAMAETDYFKNMDSVELSEYGKSVPEGSFVNTKKVTDAEKRPEAPAKK
GUT_GENOME141286_0207912-342SIDLNATSTHLVLVPEGTFNGVDGRPFDAPHWVLTPERGEQIVAALNQRKVDMVIDYEHATLKAQETGEPAPASGWLKAASFSYIKGVGICSTNFKWLDKAKGHIEKEEYKYLSPVLFYTKTGEVVGLHSVALTNTPNLDNLPEARLAALAQDYFTQNSTQDSEMEELLEQLRWMLNLPLSATAEEILAELNKLSAQIKEKTGVAVAANGQHLFDALAAIDQLKLAANSQDQVDMTQFVPMAVYQEAVANAGNAEAAQKAKEIDDLIVAACSDGRLTGQATIEWVKGQAKNNPDFVKAHIESLPKIAALTQRQTEQVNLAANHQQQPVVDE
GUT_GENOME145156_0272823-364IQLFPAGEFSAVDGRPHTDEVESGKWVLTAGLAAQLVAQVAARTTPFVIDYEHQTLRAVNNGKPAPAAGWFSQVEWREGVGLYAIGVEWTENAAAMIAAGEYKFISPVFAYNKRGEVLELLHAALTNTPALDGMDAVMLAAASRLASLSTETETTTVDEELLNDLLSSLRWMLNLPVTSTAEDIKSELQKVVDMISNGQGTAAASVSLLTLLNQKDEQIASLSANAYDPTKHIPLTAYEELQGRYAVLAQQSGEAEAGALIQAALSDGRLLPTLEDWAKDYARRDINGFKTWLDKTTPLAALSSTQTGGKPPKTPSPAPAQIKTGDDVDVAICSMMGTDPED
GUT_GENOME239445_0226020-229VQLCPFGEFPNGKTLQVCDETAFSKLVEAFNAAGAKEVLCDFEHKSEDPSMTSDTAAAAWISNLSVNKERGLVGDLKFTDLGAEAVTNRRLRFLSPVWTLDADGRPDRLRSVALTNKPNIPVACVLNREPAPAVQPVEEEKGPEMDKLKELLGLAPDATEDDVLAAVGQLKEQIAAANKEKEEAEADSFAEEHKAVCNKEALKSAYLMNK
GUT_GENOME033096_0003220-313IQLFPFGRFYSQDGRTEGAGGWYVDDTNGYALAEDINQLKIKLMIDYEHQTLFIEKNGKPNPAAGWMETAEYISGEGIFVDVDWTKKAHQQIQDGEYRYISPLFLTEPDGKVTKVLNAALTNRPACHDLAEAVAFSSQFNQHQHKKDNSMLELLRQLFGTPEATEDEMKQKLTALSAAKGDSPVVLSDVYGKLKEKDGEVVALTAKVGAEPDPSKYVPLSAMKDVQDKLNALSAQVHGDKVNDLIQTALSDGRLLPSQKEWAEKLGKSDITALSDYLTVATPNQALAGGHQAKE
GUT_GENOME237527_0226036-332LLRLGRTEYTRRGEAGSFETTEEDARAIIAEAERRGRDIVVDYDHASVRGRDAPAAGWLSGLRLGADGLRAAVAWTERAAAMLSGREYRYLSPVIHFRNGRPAAIHSVALTNTPAFHNYPALAASDDGAGPTQGDTMNQKIARIARLFGERIAFSDGGELPSDALDATLAAAEKLLADHGCATFAELDGHIHALVPAERLEEAKRELSGIRAERLVAQAFAEGRLVEAQRAWATDYATREPEAFADYLKSIPENTYPTSAVQFSDCAAPKCRKGGRDAYSDQEIAIMRKLGLSDEDI
GUT_GENOME019125_0387613-309VREPPGEILLIPFGKVEYTKDNKVGSFNFDSESAQKILAEFSERGRDLVIDYEHQTLSGEQAPAAGWIEKLELNERGVVGKMKYWTADAENYLRNGEYRYFSPTIRFDENGQPCALHSVALTNHPALHHVDALVASDLKTSGKPDNHKGGIMPEEKKEPVAFTDDQVVSVAREILGLSDATAENVRGKLLALKTQAELVPGLRSRVSELEQAADSEARKQLLLELCETGRLTNAQAESDYFKNRSLADLQSYREATPEGTMVQTKKLTQPERKEPEDGKPSPILKNLGLSEEDMKKM
GUT_GENOME231247_0330113-362LAATVGDDGWCQLLPAGRFRARDGRPFDVADGWYMDAAIAARLIDGVRALGQDVLIDYEHNQLRKSEGLSPDELKAAGWFNADEMQWREGQGLFVKPRWTPAALTYLANEEFGFLSAVFPYDLVTGAPTVLRMVALTNDPGATGMQKLAALAASFDELTNHPHEGRPMNENLRQLLARLGITLAENAAVTDEQATAALSAFDALKGKADKHDELNTRVTALSAELVTAKAESGTPDLTKWVPVETYNAIRQELAEVTGKHSTVSLSAVLDKAETEGRIFKSERAYFEQVGAQVGVVALSAQLDAKQPIAALTALQTATTSIPDTRREGLTALSADEKSAAKALGMSEAEY
GUT_GENOME099050_0136372-409IRLLPDGAFAARDGRPGTFTGGTLNAWSLSPRCAGRLLERWRRRETPLVIDYEHQSLNARHNGQPAPAAGWIESLRYEPGQGLFASVRWTEGAKGFIEQDEYRFISPVFSFDPQNGEVLELKGAALTNMPALDGLGAVAATEDFPPSDNPQPETAMNALNRLKQLLGLPEDAAEETLLAELDRLEALLTPANPAPSDPSGLPNQPAFPHQEGPLPGDARPTLFDFLQACHPQAALTSLVRANTALREQLSVALSVTQGDQVARNVEAAVADGRLSRGLAGWATALGRQNPEALETYLAAVAPIAALSSFQSAGNRPVLSAASASPLSDEERFVCSQLR
GUT_GENOME238249_0038030-339GRNDFTKGADTSSFPFERKDGEAIIRDFLSRGKDLVVDYDHQTMVPGVRAPAAGWVSRLSLTEDGLEADVSWTDEASKALSSRQYRYHSPVLLFGRDGHPAKISSIALTNHPALHHYAPLVAHDTPQETNMDKTLKKLAGILGLELSFSDDMEEKGEKTPTEGGNLAEAVKAIEAKIEELMRLKEKAEAMMKQNEVASLDDLAGKIAGMVPASEKAELQGRLDAILAEKAVEKAFADGKLIEAQRAWAMEYAAKEPKAFAAFVEKSPKVVPLSDTPCGGKKEQSKEKTMGFSEDDMEILRKFGVSPEDID
GUT_GENOME232010_0499052-341PLPEWLPMIPAGTFTGRDGRSWVNDNPQAFIATSFRYPTLPFDAEHSTELLGPKGEEAPAYAWIDAMRVNADGSIDGHIEWTPDGEALVRGKKYRYYSPAFRHFPTGQVSHLSSAGLTNKPNLYLPALNSENTMTVPVQIATALGLAETASVDDAVSAIQTIKNSESLALNRAQNPDLSKFIPQETYQLALNRAQTAEDRLKTLDEKTATALVDDAVTAGKVAPANRDMYLALCRTEDGRQKFEAFVKTAQPLVNQDPSKGKENNGQQTTLNETELAMCRSMGISQEEFL
GUT_GENOME157363_004196-304DWFDVFRCGTHLDHSGKWRTFSEADIDKAIASYQSDSAPIVVGHPTLNAPAFGWIQQFRRQGPTLQARCSRVADEFADLVKRGLYKNRSISFNSDGTFRHVGFLGAAAPAVKGLEDIQFADKGEFITMDTAETVQTEQAAAQEQAEEVQVEPEAEAKTAESEAEMQSSRADAAQKAEPEVSLSDLESQQKQFQETVKKLESHIKSLENALNVEQEKNRKAEFAAYADELIREGRLNPVAKTPLVCTMEGLFASDQANFAAPENSVLNSFKSFLNIALPRNPNLFISFAAPSEDCGHSVE
GUT_GENOME095520_008691-257MTESSKNTWIDIAREGTFDDSYGRSQTLTTSDFDRLIASFEEGSRRIPLVFGHPKTSAPAFGWVEALRCAGNVLQAKFKQVHEDAKKLVEGGHFKNVSIALTSNKEHIAHVGLLGAVQPAISGLREVSFASEEKPIVIEFSAAQLRPPLDSETEQLKMELSAAKDHIRMLKAREAKKLREDRLNSLTELVGKGKVPPSEMGLLMPFAEALLDADTMIQFADQESPVHAVEALAELLDKRPISPCFYDFSMLMPPKHY
GUT_GENOME147522_0115034-368GWAQLLPAGVFKARDGRPHDTADGQWHLDASIAAAFISATRAIAPRILIDYDHQALKARENSGPVPAAAWLTPTEMEWREGQGLFIKPDWTDKARELIANKEYAYLSAVFPYDEQGRPLYLRMAALTNDPAVLGMEPLAALSADFNLSFHTPNSTINLYGTTEDNLVNELLMQLLGKLGIELAEGSEPTKEQATAALSALDTMKTTAAKAGDLEGKVAALSAENQSLTNAYNGVTERVAALSAENETGAVSQAIDKAKSDGRIVEAEVEYLTGFGKQHGVAALSAMLDKRPRIAALSAQQTQTTQPASKQDPDLTDEDLAVLSACGLDKETYLKN
GUT_GENOME254042_001148-367ASLAFELTLTGNTIQLFPAGEFRATDGRPTECAHWLLTESLAQQVIAQLSARKNKIVIDYEHQSLQAEQNGQPAPAAGWWTGSDTVWTAEGMFAQNVQWTKRARQMIADEEYRYISPVFAYDPKTGEVLQVLNAALTNNPALDNMNEVVLAAASRLIVQASTQHSPEKTMNEELLKLLRKLLGLADDASEADVLSALQQAAGNMPKENEGDTSSASLSRLLEMFATQSTLVKTHEDKIAALTIAASASASKGAPDPTKYVPIAVVTDLTTQLAALTARIDGGDLDSLVNDGLSSGKLLPVMEDWARELGKKDIAALRKYLDAAAPIAALTTMQSGGHPPTGETHGLTATELQVASLTGLT
GUT_GENOME049792_0019228-330VPNVIRLMPVGDIAGRDGRRFYLDDPETVIAETAVYKKNLDLLIDYNHQSEYAEQNGRPAPAAGWMTDFFVADGHLCATVQWTAKAAEHIKNREFRYISPVFFHTGGKIRRIVSAALTNTPNFDLKALNQTQKKENSMDKEKALCQALNAPDEENALLRIEELKSAQKELNEIAAALECEAKAEAVIKAVNETKADPAKFVEIDKVADLTRELNEIKAERAEEKAENAVNAAIRDGRLPPVLKENAKEICKKMGETALNEFISKMPKISAPSLAAGEPPKPACDALSEDEKEICEIMGISSET
GUT_GENOME261359_021849-275DVSSVPEEIRILPLGTVESRKGTFEVNEESVRRIIDGFRERRLDLVIDYEHQTLENVQAPAAGWIKDIYKAENALVAKVEWTERAKQYLANKEYRYLSPVVMVNKSNKQAIELHSVALTNTPAIDGMFPIVNSIGADASPSSESKDILKMIKELLGLDAGTDAEDITEKIKELLQAKTATENELNGLKYEAFEKQVDDVIQYALKAGKLSNSMVNGARKMAEKDIEAFKDYIQTAPQVVPIGRFALEKTDNTKSGGSHIDNLLGISE
GUT_GENOME186917_0131335-405IQLFPAGDFAARDGRPGNLKGVTAKAWRLTPEDADALLALWRQRATPVVVDYEHQTHLSRENGQPAPAAGWITALEATPEGLFASVEWTDKARAHIRAGEYRFISPTFSFDRRSGAVLELHSAALTNNPALDGMDPASAKLQALRQRKLRMETAFDYAGNDSKKPLSPSVRLVDSPTPDATHNTQEDKHMDKLLALLRTLLGLPDTADEAQCAEALSRHLPQQNLIALLKSKDDALATAQAELAGAKAAPPDPGKYVALATFQAVQQEAAQLRAKLAEMEGAAAVAALSGEIEAALKDGRLAASAKPWAEGLAKSNPDALREFLKSTPPVEALKGTQTGGKQPDATPGTASLTAEEEYARVQLGLTAEEYT
GUT_GENOME212256_0004034-368PAGQFKPRDNREMKVPAWNIDAALAAAVVQRFAAKKTPPVLDYEHQTLWKEENGQPAPAAGFFRALEWREGQGLFAQVELTARAKQYITDGEYRYFSPVFLFDPVTGDVLDLQMGALTNNPAIDGMQALSERAAATFQLTIDPSNEEPLVNPLLKAVLAALGLAENTTEEQAIAALSAHTTDLASMRKQLGLDDTAACSAMLAACTGLKAKAATAVDPAKHVPVTVVDELKSEIAALTIRLGQRDEKELDAEIATALEDGRLHKSMEKWARELGKENRASLTAYLSAAQPIAALSGSQTRGQPPVPDEKTGLTADELAVCTAMGITIEAFKAAKE
GUT_GENOME237478_01727296-600PDEIKILPLGTVHSQKGDFVVDDESFDLINRHFENRGLDLVIDYEHQTLKDVQAPAGGWIKKLVKTNDAIAAQVEWTAKAKQYLENKEYKYLSPVVICRKSDGKAVALHSVALTNTPAIDGMFALVNSIDISSPDGAEGGNSMELKKIVALLGLPADATEADVEKAIQELKKQEKTEEVVANKTIMDLLELKGDAKTEDVAAKIMELKGTADKTKDEMILELKRRMDERDAEELVTMALKQGKISAAQKAWAKEYALKDAEGFQAFVAKAPAVVPIGKTGSAGYQKEETDTELDPKILKNLGVSM
GUT_GENOME021955_0291920-333SPKPFRLFRIGENPLTRNGRDCKLTLSAEELKAIAEYHRKKGEKIPIDSRHALLLAAEKAGVSETEALKAVPSAVAALGFATLELREDGLYAAQIELTPLAEELFRQGALRYWSPVIRGLDGKSPLRITSIAMDNVPALNNLDILAASAETTTQPTGTSRSTPDGSNRKDTIMTKTEQALAKLLGGETLALSDSTDAAVAAKLEALAAELPELRAAKAKVEALELSAETARKKELIDKAVAENRVTNAERSGLMGLDIAWLAAELPKRPRNAVPAGKLPDPPEHGEEEELSDREKALAEKMGLTAEEYLKSKRE
GUT_GENOME257704_0362423-332EVPEWVEVLPPGPTVTGRDGRQWTYDPHQVIAATTAHQDGADLPFDYWHATELKAPLGEPAPAVGWAKEYRVNERGAVEARVEWTEAARNAIQAREYRYVSPVFMHSKAGRIDRFSSFGLVTKPNLSIKALNAEGAPLQPPEVNAMDLAAIAVILAALGLPDTATAEDAVAAINKLSQDKKDLQTAANSEKAPSLDKYVPRQDYARLEQRALNAEQQLAQQKKDDLEKAINAEIEAALKAGKITPATKDYHLAACREEGGLVRFREFVKAAPSVTDPVTPEGENKGGQKALNAEQQQAARMLGMSDEQFI
GUT_GENOME213065_0281530-323ILPWGESSYDGGTHKLIVNEAAISRIKEAQAASNFDKLALDFNHNSIPGSESYSGEPVKVAAYGVLEIVPNVGIYLANLEWTEDGINAWRGRHYKDISPAVLPDDTGVVCFIHSAALCRNGRLPGLEAFNADFLKLKPKVKMDKYKQMVCKLLGLVDGSDDAAIDAAFEKASLAQPCKKDVDGKVKELEDKLETFSARFKQIEDSIGGIKKSLDDNARESVMASALAAGKIVPEDAKKLDTEALKLFCAGLPSVVPVGRRTPDGIREFSANIPNNPVVSDVCKNLGLTEDEFKK
GUT_GENOME142457_0029214-405LTATLQSSSDGWYQLLPAGEFRARDGRPTDVERGCWYIDADVAARFIASTVDIGQPVLCDYNHATLREQDPTIAPPSEETQWRAEAAGWLTDPATQMQWREGLGLFVRPTWTDDASAAIDAKKWAFLSAVFPYDTVTGEPLFLRMFALTNDPGLTGMQSLAALAAASLIDSPLKPTQDTAVMNDLLRLILVALGIIATDDQTEYTEEQLKELVATATEKITALKTASETAVEVAEVVESQPTPEAVAETVEQAVDDNADAIQEAEQIIAEAELHGVDLSVAVPVTAYMRLQKKLTTAQLGNASLSASQIIAKAQASGAIVASEVPYFTALARQHGVAALNAQLVGRKPLAALSARQTKGLVTPPKSTVSIASLSAQERAVCAATGISEAKYL
GUT_GENOME177056_0179614-343KAKYGRIQLLPYGKFRATDGRPTDVEAWYVTDTNGADVVALANSQKNPLPIDYEHQILHSQQNGKEAPSAGWMEYLYFNPQGIFADVRWTDKAAEYIKNGEYRYISAVFAYDTNGYVRKIFHAALTNNPALDGMDEVMVAASVQLLNQQKEKPEMDKKLLAALCALFALKADASEAEITEKVTALSAAKGDSQVAVLDVYAKLAEKEQSVAALTAQVGSPDPAKFVPVEQVAALQADFNALKNSVETDKKEALIQAALSQGKLSPALKEWAQSLSVEALTGYLDKAAPIAALAGGHQASEDPNKSNVAALTAEQQAAAKMLGISDADYIK
GUT_GENOME165619_014655-384KTTPNKRIAVLSAAMTDSADGWYQLLPAGYFSARDGRPEDVPGGQWFIDAAVAERFIQATAAVGQPVLFDYNHVTLKQDDDPAACTEARAAGWLRDPRNDMQWREGEGLFVRLSLTPAAQAAVDAREWLYLSAVFPYDENGHPLYLRMGALTNDPGLTGMQSMAALSARLNTLISLPPDTKDELMNETLLQLLERLGVTLPENAEELTEEQLQELLTQALAAADTLKTSAQVAVDTQEVIETTQAPEDVADDVTALIDDNSTELAEAEQILEEAALSGMNISKVVPARAYHLLGKHAASLSARVSGSTTDSIIANARRTGRVSAGEVPYLRILANTQGVAALSAVLKGRPAIAALTGRQTTRLKKPAGKIAVLSASDKEA
GUT_GENOME131374_0236613-334GDSPARIVYMPEGVHYINASVGGRQKVIVDRSCLAPLKRDLALKLAKNVRPVCLFDHKMGPASFLPSDFDYLDGVGPILVGEWTLSGKNAKEGKDYGYFSPAFRLDLNTCKPVGLEPDDIEVGSLVNDPAFENIARIAASKAKLENFTVLGPDTPLNSDGEDTDAVHNQTNNTNTTMYELLVKCGVLTKEEAASDKAGKIAEDKINDLKKKSEGGEKSKAELEAAKKEAEDAKKEAATCKAAKAKLDDTEAKLKAAEEELAEVKASKAALIDAEIEAAIKAGKIAPENEEAKEALKTALTANIKAGKALIDTMKPDPAFATV
GUT_GENOME142632_0173621-244AIKGEWKGHNNGRFKVDDEDLKSMIENFNQKKIDLVIDYEHQSLKNEKAPAAGWIKELYLENDALMAKAEFNEEAKKYIANKQYRYLSPVFEFNTKDNKSGELVRAKLHSVALTNTPFIDELGELIANKNNIHQNKGEKMDEKIKELESQIIALKNENSSLAEQNEALKKQNEENAKNLASSLVDNALNTGKIANSQKEWAMSYACKDLEGFKSFLDTKDTQAQ
GUT_GENOME243891_007753-232ILSLNYKNGELIKVSPIGEIEGADGRTFKIDAKKIIENIKKSGVDIVLDENHSFAGAVGWFDKDSFEAKDDGIYAKLELNKRGMELVKDKIYRYLSPVYDLSGRDVLSIESVGLVNKPNVLNNALNSKGDKVEKEKNSELDALNEKVKKLEETITSLNKELEGFKKDDDKANESDKANESQKVNESGKVDEDSKELNSRVLKIENILKDFKKIKNESDKNKEVNSRLESM
GUT_GENOME145040_04914120-433WLINHAAVERMVSRVVALNQPVKIDYNHQTLIEGHPAPAAGFVMASPENFRFSEERGFEVRPKWNPPALEHLRNNEFPWFSPVIGYDESTGEPVELRMLAITGDPGLTGMNPVAALSADDLYNALNPPLKDTSMNEQLRQLLTALGLTVADGDEFTPELGTAALSALTGIKTRADAHDNLKTQVASLSAELETAKGTPAGGSIDLTKYVPVETYNALRTEYVALSAQHGSTTLEQLLDKAESEGRIFKSERGYLEQLGGQIGVAALSAQLDARQPVAALTSLQTDTVTVPDKKTATAALSAEDIAAAKLLGKTE
GUT_GENOME128005_0232114-346PAVERGTDGLPRGWPLLRVGDNYLTIDGRPVNLKLSEDDLGAIAGYHKSKGVKIPLDSVHVLSNLAGKLKIDESELARRLPRLGGVAGFGDLKVSDSALCLCDVEWNELGAEVMRHDQIRYYSPTIRGLDGKSPLRITSVALTNSPRLNNLPSLAASEDEDETPITPDAVREAIETLTNPKEASMPDAPTPPTAATPPAAPPEAGKGNEKEPELLALLREVLGDDVTSANLKVKLAALKSKADATAELSESVKSLQLAEERRNLTAIREKCLAEGSLTPEKLAKPFFQGMTAAELAEYHEAMKGTVVPTGVLELGDPVEKPAAAPKHYNTVAE
GUT_GENOME264508_021219-310WIAIARTGTFQDSEGRDQTFNAADLDAIAANYDPARLEAPLVFGHPKDSAPAYGWVTGLKRDGQKLFARLASVPGEVRELVGKGRYRYVSMSLTPDRKSLRHVGLLGAVPPAIDGLGPVELAGEPGIIINFASEEDAARDGGAPDHPNNASPRTGRAGERHQKGGDMPTPEELQQQIGALQQQVEALKAENAQLKDKLGESDKGKADAEKKAESTAAEFAAYKQTVEVKDREKRVDALIAAGKLEPAKREETLSFAAALAAVTVPVNFSAPDGKAGQVTAQEKFFRDLEARQADARFLDFSA
GUT_GENOME147629_0066524-343TIQLFPAGNFRAGDGRPTDAPHWQMDAATAARLNAALAARKTRLLVDYEHQTLYARTNGRPNPAAGWIEALEWQEGKGLYARVQWTAAAKAAIAAGEYRYISPMFEYDASGAILSLLPAALTNTPALDGMDAVTLAAALSLFATPESDNKPEEKDDMNELEKLYALLGLTYEKDKASLAAALAALTEQLGGQTTLKAALSAPPDAGKYVSVDVMSGLQKQVAELQAQLTARTQADNAALVTAALADGRLLPAQKDWAESLAKSNPQALTDYLATVKPLAALTKTQTDGKAPPEAGKVAALSAEEIAAAKMLGMTADEYQK
GUT_GENOME018106_001839-291LCLNAESGVPGRIQILPAGKEIRGVDGRHWKNENPAALCAKMNSSGLVTVKNGCVIDENHSTDLSAPKGGTSPAFGWFRNFTVEADGSIWADVEWNARGQKAVAEKEYKYISPVFTRDKDGNITEILRAALTNNPNLDNPALNSSQDTAEEKNMEKELCAALGLPETATMADGLEAINRLKTELNGAKNKVVDLASYAPRADLAAAQQRAERAEKKLAEMNAASLKAKAIAAVEQAAKDGKIAPASKAEYLELCASEDGLAKFEKIMAVTPSITGGAQVSDKA
GUT_GENOME218151_0103254-406HLVQYTPAGKFLPADGRAMDVASWYIDAALATEVIARHAARGQPSVIDYEHQTLHKEKNGQPAPAAGWLHGLRWIEGRGLFGEVEWTEDAKAQISAKQYRYFSPVFEYSRPEGHVLAIHMGAVTNHPGLHGLEPLSLLAAATAAFLPASQEHLPMNPLLLAVLAALGLPNTTDEKAAMAALTAVGSIKDLQAKAAAGEAATQVATAACTALSLPTDTKADVVTAALTAAVATGKPDPSKYVPLETVTAMQTQLAVLTAKQLDGEIDAAIQPALADGRLQPTMEPWARELGKTNMAALTSYLATAQPIAALTGTQTQGKQPENTTKGAHGLDASEVAVCTAMGIKPEDFAKSKA
GUT_GENOME232060_0263214-274GQDGVPQWIRIWPKGPVHRTRDGRQFRLDNPSKYVAKLNSSRVDIVVDYEHSTALLKGQKAPAAGWLKDFKENDGEVWGRVEWTEEAAQLLSKKHYRYISPHFNHDEIDGQVLNLWSVGLVNSPALNMEAICDANGQLTKDQVHEVSSILSVAGVATLSGLYAKLENDKNMAILSVVESAREKMIIPASIKDELIEVCSAIGIDKFQRIIKVLSSIQGLTLPNKSQVDGLEKPTNKNSLLTPQEQEICKNSGITEEQFIEE
GUT_GENOME123787_001826-254LCTDLSGDKTPPEWVELLPAGPEIEGRDGRSWTLRDPQSIKQAFSCRGVSLVVDYEHSTEVIAPKGGEAPAAGWINNVAVRNDGSVWGKVEWTPRAMNSITSREYRYLSPAFRHSQDGEILELVSVALTNKPNLKLTALNTQQTLSQIALALNMEEVNSLEEVLTALNKRQAEQHIQVVNEYIEKAVFCPAQRNILLSMCSSVGENAFRKFADMQEKSTSNFLHFAESVSVKGKESPKTLTESQLAVCR
GUT_GENOME238256_0181261-220FDEESCRRILEDFNAKKAAEDFQGVLVDREHFSCDLDKPSDAMAWAVDMRINPDGSIWTKWDFTPKGRDLYESKTLVYRSPVLALEQNGKTFRPFALESIGMTNTPHFKELSPLAAAKAASPINNQQGDQKNMDPEILAALGLSEDAEKEDILNAINALK
GUT_GENOME257048_0007621-315ILPLGQVHSTKGDFVVDAESFEAMKQAMEGKGVDIVVDYEHQTLYGQEAPAAGWIRELLLTDHSIAAKVDWTERASDRIRQREYRYCSPVVMARKSDNKAVLLHSVALTNTPAIEGQFPIACKDEEDNIMDILKKLAELLGLDNAAGEEEIMEALKARLEAPPPAEPVVCKTVAGLLEVPEDADAATVSAKIMALKNPANYVPAADFLALKAQLEQRQSAELVEKALAEGKITPAQKEWAAAYVLKDKEGFEKFVSLAPASVPMGRTDVPPKARGAGDADSLLVCKNLGLTPEEL
GUT_GENOME046029_0172916-351GAPEIVKLLPLGHVSTKKGDFEVDEESFKAMKAQMQQHGVDIVIDYEHQTLKDIQAPAGGWIKELVLQDGAIAAKVEWTETARQYLKNKEYRYLSPVVLVNKDNRATMLHSAALTNTPAIDGMFPIINSVGLEDYEDDDNKEGGNNTMNELLKKIAALLGLGEDATEEEVMQKLGEALNEAKQRGDAAGQKQPPEEEEGKVVANKVVCGLLGLEAGAKTDDVAAAIMALKQPKGFVPETELRALKEKIERKEADDAVLVALKAGKIAAAQKEWATEYALKDPNGFKAFVEKAPQVVPMGELGVEPDGRKAQQQTSEETLKICKMLGVSEEDLKKYG
GUT_GENOME233942_0103828-330IPEIVPLMSVGRNDGRDGRTFFVSNAEEVIKATDAFRGNADLLLDYNHQSEYARENGRPAPASGWMKNFVVQDGFICAVVEWTPKARELIKNKEFRFISPVFIHKNGEVVRIVSAALTNSPNFELKALNQQEKEQMDTEKTALCQALGVSDAQTALPRIAGLQEAERALNEVAQALNCDCSAQAILTAARNIEPDKAKYVSTTEYAKACNELVALKREREDEKAAGIVEKAINEGRLPPALKENAVAMAKTQGETALNEWLTLIKPVSGKDAPTDTPALKTDALTAEDKAVCQSLGLTEEEFK
GUT_GENOME097852_0143026-294LVPPGVFSGNDGRTWNNSNPDAVVAAFTKKRPFDVEHATHIKGPKGEKAPAIGWILALQNIGGEVWGMVDWNSEGREMLEKKEYAFYSPAFTFDDAGTVLSIASVGLTNEPNLDQLPALNREETPMPLPVELTQALGLGADADTASALTAINTLKADHQLAMNRAAAGPDLTKFVPKETYELALNRATTAEAKIQQTEEARLSALVDDAIAAGKVAPANKEMFLGMCRAEGGEEKFNAFVASAPVIADASQVNTTHQQQLGALSADELA
GUT_GENOME073019_0193731-344LSGAPEEIKILPLGEVKSTKGTFLVDDASVDMILKSFKDRRIDLVVDYEHQTLLNVQAPASGWITELRKGADAIIGKVQWTTKAKEYLKNREYRYLSPTIMVRKDRRVSAVSSVALTNSPAIDGMPAMCKDNGLTKGENTMDLKKLIEVLGLAEDATEEDILKAVKKAAEAARASETPPADPPAEVVANSTILGLLDLPDTASTAEVSAKIVGLKNGDQQLALRVHQLEEAAKERETENLVQAALKDGKITAAQKEWARSYALKDAEGFKKFLELAGPAVPMGEIGLKDAPEGEKLDTNTMAILKNLGISAEDA
GUT_GENOME143098_0037935-261GTFSIDAADIEKMKLNFDKRSLDIVIDYEHQTLSGEIAPAAGWIKELFIKDGALYGRVSWTAKAKEFIKNGEYKYLSPVYDFMGVDEKTGAWQGCTLHSAALTNKPFLDELGEVRANKNFTKETSMDDTKNPKGEPQAQAAMQNGTNYEAQIVELKNQLNTSKQEVAALKEQLAQSAVDGAIIANKLQESQKQWALSYAKADLKGFNEFLKGVMLPQQKTSIPSNDM
GUT_GENOME011213_0199614-340ITYDADRLPVEWPLIHAGETVVTRSSAMPVVLALEMADLESIVAYQQEKRAKIPLDCNHVVSNLAGKIGMDESELLKQLPRYAGVAGFGSLKLKGDALYMSDVEYLPIGREVMKAKQFRYFSPTLRGLDGKSPLRISSVALTNNPHLQGVCELSAADTDEDDDEVSAEKVSEAIKKLTEKQEKKMADETTNPAVANAAEEQTKILNLLKEVLGDDVTVENLRAKLAALKTGADSSDELKKKVAALECAEEARNLSAVRQEAMRRGKLTTAMLERDYFKKMSAAEWSDYCSNVADGCAVPLKTLEMGEFHAAPPLPAKPFASVSEAIE
GUT_GENOME111598_0260319-342GPTPGRIQLFPMGAFAARDGRPGTLKGVKVKAWRLDAENAAALIARWRARETPLVVDYEHQTIHAPDNGKPAPAAGWIESLEAGQDGLYATVKWTDTARAFIRADEYRYISPVFSFDPETGAVLEVKSAALTNYPALDGMAAVTARAEDDSPMKKETLEALRYFFGLAADADEAAVLAALKAQGDGQTLTAMLAAKDADTAAAKESAPDPAKFVPAEMLTAAQEKAAELAAKVKELEGDGSLAALTAEIDAALADGRLPKSCEGWAKATAKNNPEAIRGYIASAVPLVALKATQTGGRTPTGAPHTAALTEEETFVCKQLGLSE
GUT_GENOME038109_0008562-270VLVDEAAADAVEELRRKLQEGAQDGRMDEPYIDYDHQDHDAAGWVKRIYWGGDDPKTGGIRAEIEWTPEGKAKVEGKSYRRFSPVFQASDPDENGVCRIIGSGINMGGLVNRAAFQSIAPLLGGKDGNPAAPANQTQTANTTMTDEEIKALQAQLDALKTENEELKKSLTDMQAKAADAAVEKACEEGRISPDLKASWKEMILADPKAE
GUT_GENOME007437_0083613-286DGRPGKNMKWQLDADIAYRLIDRIAARHTDIVIDYDHQSLYTRDNGRPAPAAGWFRSARWDDVKGLIATGVRWTDKATDHIRKNEYRYVSAVFLYDDVGIVREIISVALTNTPALDNLPPLQAALSRYQQEYVMPQDHTETLNKLTEQVAALSAEKAQLSGQIAALTAERDALKNTLADRDAAEAAIKAQAEEQEKTRLIEAALSENRLLPADRDIAQEMSLSMLKKFLENRKAFADLSRQTKDKSANNTPTHGLTEAELAICSRMGCDPEQYV
GUT_GENOME142546_0127722-351PTTIKLMPAGVSRGRDGRPAGCAGWVLDHTNAAALVAAASQRQTRYVIDYEHQTLHAAHNGQPAPASGWFSELEWREDDGLYATGVEWTAKASAMLAAREYRYLSPVFTYDASGRVTALLHVALTNNPALDVLPDLTAALSALIPASPIPSTQETHMDELLEQLRWLLNLPVGATADDVKAQLQKLISQLSGGEGMAAASVDLPALLARQQDRIAALSVSQPDPARFVPVDTMRALQERVAALTAQVSVRNVDELVVAALSDGRLLPAQEGWARELGQNNLAALKGYLDTAPKIAALSATQTQGNPPADSVKPQWDEDTLAACSQLGLSA
GUT_GENOME143726_0143818-359PSGRVRILPAGSFRGLDGRPKECAAWVMNAVCAQRLINQVNNQKTELCFDYEHQTLRAAQNGRPAPAAGWFKTLEWVEGDGLYATDARWTDAASAMICAHEYRYISPWFRYSATTGEVLSLVNVALTNIPALDDLDEVAQAAASRLAALSQTPSEQESSPMDEEQVSNLLANLRWIFNLPETATAEDIKAEIDKVIAAMSGGQGTAAASQGLMPWIDAKDNQLTEQTTAMAALSATAYDPAKYVPIEALNDALQRLAQAERANAGHQVNDVVQAALSDGRLLPSMEDWARTLGQKDMTALQTYLDNATPLAALSQLQTGGTVPSGAATTKQQQDTGVTALDD
GUT_GENOME009735_0003923-315IELIPSEEIFAGFDRRVFKATDKQGIIDRTNSKLKYIVIDENHKTDYTAGTGQSTEAMGWMHDFYIKEDNSVWALVEWTLSGAIKIENKEYKYISPVYEIDKLGNIISILRAAITNNPNLRLAALNNNSNDNKGESMSKEINNALGISENASDSEILTAINNAKKENESLKVELNAERENKKTLTEKNQNLEKALNEVNKELAEFKKIIIEKEALSVVEKAINDGKIAPATKEVYLALCMENGGIEKFNKIMENTPKAKLFEESNIPSKTDNISLNEADENIAKSMGYTKEEM
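
Each row in the sequence list above runs the three columns (Protein, Range, AA) together without separators. The following is a minimal sketch of how such a row might be split back into its fields. It rests on an assumption that is not stated on this page: that every protein identifier has the form `GUT_GENOME` + six digits + `_` + five digits, so that any digits after the fifth belong to the start of the range. The names `ROW` and `split_row` are hypothetical and used only for illustration.

```python
import re

# Assumed row layout: <protein ID><start>-<end><amino-acid sequence>, no separators.
# Assumed ID pattern: "GUT_GENOME" + 6 digits + "_" + 5 digits (not confirmed by the page).
ROW = re.compile(r"^(GUT_GENOME\d{6}_\d{5})(\d+)-(\d+)([A-Z]+)$")


def split_row(row):
    """Split one fused row into (protein_id, start, end, sequence)."""
    match = ROW.match(row.strip())
    if match is None:
        # Fail loudly rather than guess when a row does not fit the assumed layout.
        raise ValueError(f"row does not match the assumed layout: {row[:40]!r}")
    protein_id, start, end, sequence = match.groups()
    return protein_id, int(start), int(end), sequence


# Usage with the first row of the list, sequence shortened for display.
print(split_row("GUT_GENOME031271_0050213-333APDAILWMPKGEH"))
```

If the five-digit assumption does not hold for some identifier, the split between identifier and range start would be wrong, so the result is worth checking against the downloadable FASTA files.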