UHGP-MC 51850


Information


Number of sequences (UHGP-50):
87
Average sequence length:
379±32 aa
Average transmembrane regions:
0.01
Low complexity (%):
1.96
Coiled coils (%):
0
Disordered domains (%):
1.2

Pfam dominant architecture:
PF04107
Pfam % dominant architecture:
7011
Pfam overlap:
0.67
Pfam overlap type:
extended

Downloads

Seeds:
MC51850.fasta
Seeds (0.60 cdhit):
MC51850_cdhit.fasta
MSA:
MC51850_msa.fasta
HMM model:
MC51850.hmm

Sequences list (filtered 60 P.I.)

Protein Range AA
GUT_GENOME072702_020598-364IDAIVSHLKRGFKRSDDFKIGIEIEHFVFDSNDRSAGYEKILDVMKDIMGEDDKPYYLDGHLIGFYNDRYSISLEPASQLEISIAPCASADELVCIYRDFRRILDPVLSRYNLRVECYGYNPYESAQELDIIPKKRYEYMDRYFQKAGTRGRNMMRATASVQVSVDYSDENDAIRKYRLAAILSPVLSLMMDNSPVFEKQPAAMHMVRAYVWDGVDADRCVLMDGCLAPDFSFEKYAEYIMNMPAILVMEGETAVYTDDKKIKDIYAERVMTDEEVMHVLSMAFFDVRLKNYIEIRMADSCNIDKAAWYARIICNIFYNDDNMGKAEEYFAGVGQKDVLEARESLIQHGKDGIIYGK
GUT_GENOME183019_0038724-409VGREAEYTVVDSNGFAVDIEPLLQKMFEQYNDYSIVRKDGVKIGIKTTDHQFLKEVGKGTIEVVSRPCGDLWELRSVHQIAMRRLLSEANKIGYSILGIGLHPRSTPSWDLLTKKPHYRVLYNLIPEVWDWFELMSGDQLHVSIQQNELLTVVNVANAASFIVTALCGNSSIYDSRIAPAVSYREYHLQQILADSSRHGPTPFAPTKKDLIANVSGRRHLFFSDAKQYRVDFDSFQNWTEAFQSYSKEIFHEYLFHSHYSWNSARLRPLTGTIEMRASCQLPPAIDGASSALYLGLIEGTNDLYPLLSKTFKQSKELYLNTCLKGMDSVDSSIIRQVLDVCSQSLRNRGRGEEKLLDPMYAILANKENPGQKMRAIFQSEGLSEVI
GUT_GENOME076977_003257-385YFKQGCKPNLTGSIGLEVEHFLISRNDGRPMPYDGENGIGQLLEFLRKDFPHAYYEQDLLIALESEDVLITLEPGCQLEISLRCMNNLQEMMEIYLRTVSSILAYIHPRGYDLIYSGGLPTIGQEDVVLIDKERYRLMAQYFKNVSTRGLEMMKATAAVHVSVDYTDEKDFIQKYRMANILHPLLAFLSSHTPMYAGKKNEDVLLRDSIWSHTDPARCGILPSLFEENFGFDAYARFLEQVPLIVMNDHGNFIDVKDQTCAQVAETYGYGKTEIAHYLSMLFLDVRLKQFIEIRSADSMPPIYTQAYCALVKGLFYQQENIETYSSLTHSIETILQAKEALRKDRYDTIVYGKPVQNLLKEMIFDAKKGLTEKEKSYLL
GUT_GENOME044254_003271-421MKENNAKRIIMDQLITPYLEKNSKYVGTEIETVFYPVNSLEAPKDVMCEAFEYMVKEHGFSNVITGSDGYVVRIDNGIDSISADYSYVILEFSMGKSKCINEIGERFYKYYKVLTAFFDRKGYSFTGFGCNLFYRPFQDETQYTHDPFYSKIREYVLEHTAHKDISCFYTLMASTQSHLDIRGEKFLKTFNLFNKLEFVRGLLFANSIPNTSFEQKHIKYPDNLLCARDFLWENQGLPNTGVIEKDFESIDELAEHYSKLKVFVRMGENGLSTFEPMTLEEYFEKEDRLTDGLSCFRSFEHVVLNNYHALEVRSDCTQPMADVFSPLAFNLGISEMVDEVMETVNKFLHDNQIELDNAKLRYMAITGQEITEPGKMQAFLGQLYEHARTGLVRRGYGEEIYIECIKERIESGILSPAQNMK
GUT_GENOME220014_014351-419MNARQLVYDTFIAPFYQKDRCCVGVELEFPLLATGDEPVSEDIGLLLLAYLQKHGFSVAETDIYGRGVFITNADGDCLSFDNAYQNFEFAMTKDEDLTAVAARFYTLYDLVQDYLQKIGYTLAGLGTNPHRPLTTSRPVAYPIYMTLRRFLSNFSGGFYHTVTDFPAWLSSVQTHLDVSADRLPQAYTVYAALDFVRALLFSNALPFSNLNGFSKTICFRDYLWEHSGFGSLADNTGPVCGTFQTTDDLINMFMKKSIFLSVSAKGDYQIIPAAPLEEYCNTIGTSAALNGYLSFKNVEITRRGTLEVRSDCAQPVAEAFAPSAFNLGILQNLDVAAQRLDAFFAWLPPELADTPDRNAQLRHRAIYGEGLPVRDDVLRTFLLDLVRLAERGLVQRGLGEEALLKPLFARAETLSCPAL
GUT_GENOME250973_0122334-420VAQKRTGVEFEKLPVKIHDYKAASYYDVAKFLQSYKKNNKQPVYENNSILGLSDKDGLISLEPGSQTELSLIPSDNLCDVKQYLKVYNKETAEIAESFGIFWLGYGIQPVSTFDKINIIPKKRYEYMTKYLPTVAKKPLIMMRETAGIQSSFDYSSQEDAMKKFAFALKLSPIVSAMFANSPVRAGKLTKYKSNRAASWLETDNDRCGLVSEKVFTGEFSFDDYTDILLDVPMIFIERFISGKKTALKTDNITFRQFLKYGWQGFAANKQDWETHLSLYFPDVRFKSYIEIRNQDNQRSSLICSVPALWKGLIYNADAMESAEALLKGFTYSDFEYLRNETPKYALDMEINGRQLKDIAKEIISVSYNSLKTYGKSEERLLEPLMEN
GUT_GENOME085498_0224613-430EKYIVPTKHKTGYMIGIEFEIPILNMSQEKIDFSVIQKVVLALREAFDLNIEKRDDNGDIYCLSNEETGDTISFDYSYCNVEFSMGTVSNLHVAYERFKTYYQFLERELDKYHYRMTGMGVSPYGIYNDNNALPGQRYRMIQHHLESYVEHPRFKAFHSRPNFGSYSCASQVQLDVSEENLIRTINTLNRLEPIKAVLFSNSLLLDPEFNLVSARDRFWDSSMYGYNPHNVGMFEIELTDIEELVEYIKRTSLFNVARDGKYIYFTPIPAEQFFQMESVEGMVFNGDHFIPYVFHPEPSDLEYLRSYKFEDLTFRGTIEYRSACCQPVSECMCVAAFHVGLQEKLNELETLLYQDRSVYKQGYSPSELRKMLIELEFPKELNQQAVQALTLKVLELAKQGLIQRNLNEECYLEPLFER
GUT_GENOME130141_015991-307MEKGLNLCEISSRFYKLLDLVRSFLKEHGHDLCGRGTNPNMEYTSSSPVSFEIYGLIRSFLSQYKGGDYHNYTDFPAYLSSVQTHLDVPIERLPAALTLFARLDFVRAILFANSPAFGEEKELFPKCSCFRDYLWEKSGFGSLADNVGKVDEKYETIDDIIDAIGRRSMFYNGSGLIQPVSVMEYFKNHPASDMQYYLSFKNIEITRRGTLEVRGDCAQPFDKAFAPPAFNLGVLCALDECREITDRFFYNNHIQLSNTELRNNVIYYSLMPASQDETDKFVNSIKEAARNALEKRGLGEEKLLKSV
GUT_GENOME166431_0117520-385QSLVEFFQSGCVTRDNYGIGVEIEHLPVRVGTDEAVGYTDEHGIRDVLHDLAPLYDESREYYEDGHLLGLGRGKIAVSLEPGAQIECSLGVLDSPEELNDVYAQFRRDINPILEKYGIRLINYGYTPRSRAHDLSVIPKRRYDVMTAYLGRLSAYGWNMMRASASTQISLDFSSEADAIAKMRMGTAVGPILAYLFRNTPYFEGAANGLPLLRQHMWEGIGTGRTGVIPGLFDANFDFEKAAINVLATPLMVADTTHTPEAGEGSVYMVSYETAADVYPDRALNVYEMNHIISTQFTDVRLKNFVELRHWDSLPIDRARVLADIVVSLFNRADEMERLKTYLDGLTALDVQAAKYELQTQGRNAKP
GUT_GENOME138351_0110819-352CVRPGVLPIALTIEHFITRLDGTPVDYAQLAQVIYSMQGHDEPLREDNVYLGFRCPSYTITIQAGCQVTISLAPYPTVLQAMAVYEGCFTRLCNALAANNMYARTVGVHPSRRAELLPLVPQSGYMIMDRYFRNTGNQGGVMLRATASTRVTIRYTSEADFVRKFRVACLITPLLALLTDNVPLYQGESNHNYSIHTHIWNNVDPDRCGIYPNVMDSNFGFESYAAHILQQPLVVARHGARIVGVGRKSAFEVYPSFLGHGDIEQILSMFYYDVCLNHNGIELRTADSLAPRYAASYAQLIKTLFSSHAAQEGILRRYAGANSAQIEAAKVGIC
GUT_GENOME170490_0038423-283AVSPEPGAQVGVSAGPVANLANILAAIDSFDAELERVMRLLGRPAHLVVCGYDTVAVRPEDVELVPKERCQIMDSYLPRQGKYAHDMMCCSTSTQASIDYTDEPGAMTLEKVATVLGPVLCFAFDSSPVWRGEPVPRMVRGRIWDEPDPSRCGIIPGSLDRDFTFERCCTWLGTVKPILMTDHEHRTYAVACASFMISPFYNHGLAAGLPFDPFAQTDEGVEAVRHELESKGWDAAVYGVGMAELLERLAGLARDHARSEF
GUT_GENOME066931_00218158-540TDQVGIELEQFVLHGDLTRVSWSEEHGIRSLLQDIAPAWEGKKAYTEDGELIGLSRSVDGFQQNITLEPAGQLEISAGPYSDMNTAKEQLEAFQDEISKTLSDRGQMILPLGYMPVGKAEDMELIPKDRYRFMDRYFRKFGPYGTCMMRGSASTQVSVDFRDEDDCKRKFRLANALGPILALMCCNTMRFEDENCDRIMMRTMMWNGVDRARCNVAPGTFDADWTPRTYAEWLLDVPAIVAPDGNGGWRYDERTFGEIYANTPMTPEDADHAMSMVFPDVRLKGFIEIRVADSMPVEYAAALAAFAKGLFSTDDALAQAENVLDMDHIHSKDITLAKLSMMEKGYDAEIYGGKKTSDIAARLHMIAKFHLNPDERKVLEPLTT
GUT_GENOME090565_003228-399VESMVNFYKSGIKSNSQKLGVELEYTLVYDDNSQVKYFDEFGQKWLMESMLEFYPEKILDEQKNLIGIKNERDSITLEPAGQFELSAGPFEKLEEVFEAFETFQKRVNALAEPHGIKMLAIGYHPKCKAEELVIIPKVRYELMTKYFLKNSPKGIRMMRGTGSTQVSIDYSSEADCIRKLRLAYVLTPLFAILCDNTPMFEGEKRTHHIMRTQVWEDCDPRRCGAMPGIFEDSFSFEKCAELVLNTQAMFEMDGDDGHLTNKTTSEVYGDKEMSNDDCAATCSHLFNDVRLKNFIEIRPADALPTNMELAYVALIKGIFYSEEALSSIEDYFGPQSETSFEEAKRNLEEKGLSGEVYGMPATTACKKLLDVASLGLDKDEKKYLEPLFGVVS
GUT_GENOME018982_019387-374IQKIVTYIMAGANGKQRLGLELEHLIYDGQYHVIPYEQMAACLEQFAQEVGGKPYILDDKLIGVEADGYALSLEPGCQLEISIEPLEDVNQIRHIYEEFRAVADPIFASCGYQLHEGAVLPFVASGEQAATDIELLPKERYRVMDAHFKQSGTCGAYMMRASASTQVSVDFSSEEDALRKARILEKLAPLMMLLTEQRDGMPFSERWQPHLIRSQIWRDVDPVRCGYLQDSLSEDYSLKAYAAHIYHSDCILIRENGELHPTDGKSAAEWYGDRPLEDVDYLLSMYFPHVRIKKYIEYRIADSMPIEDAIDYAALIGAIVYNEEVLGAVEQLFANVHTIQELEAAESAVIADGWNAAVYGRPVLEWLD
GUT_GENOME212188_003577-386IINNLKSFERPHDEFTLGVSFGHLLVNSKDLTAVNYSDENGSKSLLERLIHNGWKGITEDDVLVGAENGAESVVLGLGGQVNWRFKEFSQVQDLERAYLAFIEGLFEELKRRGQILLATGHQPVSSTGDIEIVPTPEYQALAQWAEGQGDLLEALATGAETTIGLQYAHVDNFQKRIQSAALVQPALAAFFDNASWVNGEKNTELLYNLRRLLTADERSYAIPGVLNDPFKYQDYANFVWSAPAVSVRNAQGLVVADDRSVGDVFADREMTDEDLERVYRMIHPSLTVSRHGLTLANIDSVPYPLNMAYVLLVKSLLYNPDHITALEKMIEQYKIDQIEAMQRDILDKGLDASFGDGTVFDLVKDLYFMVSLTVEPLEQH
GUT_GENOME134300_002743-392IGVELEFFICHKSDETFYMPDIISRYLKYNSGFISMQKDSNGICIKAKNELGVTISFEFTYSIIEFSIDPQYDILSLHSIINTAIYDFSLFIDHYELMLISHGIAPFNIQFKYVNNPYYKLIREFHGNDNIAKICSSQSHIDYKDEDLIKIINTYNKCNWINALLFSNSPHIVNSNKTILCYRDYIWKFSSHGRDSNNILISKEFNDIEDLNAFNNSKLIFMVYRESKPILLSQIPFNQYVTLKQVKGLQFDEDCISETWIHPSVDDYKYLRSYQNVAITNRRTIEIRSDCQQPFNRLIYPAVFNFGLKQAVNEVSSYLNNINFNFFQLRDDVVQNGFDTKIVESKKWLTGISINILYIIKEKYRSRGFGEEKYVDVLINQMIEEINPAI
GUT_GENOME080015_0046610-404SIIAYFESGSKDQDRKLGVEVEHFIITEADGVPFTYAPQGDIPGLQDVLEHLLKTYPAIFQNAEGELIGCANEEASVSLEPSAQIEVSIAPFETIAEIGRAYAHFREAIDPYLQAHGAKLVNAGYHPTRKAEELTLIPKERYRLMDAYFAQLGTLGLRMMRASASTQVSIDYVDEADAVRKMRVASALAPILGAIADNVAVYERKVGAYPLVRLAVWRNVDDARCGVVPGVFAAGFGFGAYADWMLRTSPIFVNSSQADGSSVPRAEFDRSAGEIYADEVMTKDEIEHLISMFWPDVRLKRFVEIRPADSLPQEYLEGYAALIKGLFYSEESLRRIEHELGVIKHADGEDAWPLDERDVEAAIKAIREHGYEAVLYEHKPKSLRAWEELLFSLAR
GUT_GENOME244037_008027-374IQKIVDFIKSGEKDEKDFRLGFEIEHVVVNKETLESQSFYGADGIGEILKDLVSLGYEKTDESGMAFSDGNVDISIEPAGQFELALRAKTKVDDLFEVYRTHMDRLIPLFEKRGLLLVTLGYHPKTKIDDIKIIPKARYDFMYKYFEKYGGEYAHNMMKGTCSMQLAVDYKNEEDFKKKYFLANALSPFLYSVFDNALIFEGETYIDRNLRQTIWEYTDRDRTGLYDFAFDSDLSYAKYAEKILDTPMIFMPSDDGEAYVGRKTLDELMTEENADVMVNHGISIVFPDVRVKTYMEIRMPDNIPYPYNMAAAALIKGIFFVEETFEKVYELFQDMTYDEAQKLKNNATRMGIFALYQNVEIYQHVLDI
GUT_GENOME210691_004828-368LIRLFQSGERPSAPMRLGLELEHFALSAVNGRRLYYDDGVRALLEDVAWQFETCTRFLDGGVSGLFGQDFSLTLEPSAQLEVSIRPVSSVAEIRQIYNQFASMLAQPLQKRGVRLVTFGVSPASLASECVLIDKMRYVMMDRYFRSLGKAPVFMMRQTASTQVSIDYQNEQDFVNKLRLASALSPLFYLMTENAPVVEGAPVQAHCPRIWAWRRVDDARTGTLPCVFDDDFGYKRLAQFYAALPPIFAGSPDGHEVYTGSRPAAEVFDPAAPADARHLLSMAFFDVRVKPFIEIRMADSMPVDLACAYAALIKGIFGDEANVHGWLDRLGCQSAQAVEDVKTDVMENGWNASLYGKDMGET
GUT_GENOME018858_0140130-429CDVAGTRKLGLELERFVIDRDANRTVAYVDEPGVHELLSRWVRFFSPAEQVFIDGHLFGYAGTYEVEGEQVGISISLEPGSQLEASVGPSEHVWALLGALETFDAQFVELTRELGVRWELVPSGYNPIVSVPTEVPLIDKERYHLMDAYLSGTGRYARDMMRGTTSAQVSIDLACGHGGRETYQLAVALGPMLSFLCDNAPHWRGLSAADTPRMAHARIWEQVDPARCGIVPGTFDKDFGAATYVDWLCGVRPILFTDADGKTTSTGTATEADILSRRLLTKAELAHMVSMVFPECRLKGFAELRTTDSLPPAQSAALAAFVKGLFYDPAVGDEVSALVLPGISEQDVRDAWSGLKASGWDATVYGRPATELVDNLVTLSRKGLADDRDLALLEPLTSLW
GUT_GENOME096381_0462935-349VGVELEWLVHPHTGTAPASGTAPPHRGVRQDAAVGAAALDAAVAELRGLELSSLISLEPGGQVEFSSRPAESLPACVEAVGADLAAARARLGRHGIRLVGRGLDDRGGTRLLDDPRYRAMERFFDRTGPAGRIMMCSSASVQVCLDAGTPGPGPFGLRRRWFLAHVLGAVLLAAFANSPLREGTWAGWRSGRQAVWAALDGYRNLAPAPVGRGDPRTLWARHALDAPVLCVRSPDGPWTVPADGMTFRQWVRRWSAPPRPGGVRRPPGPGGGRPPRLTDLDYHLTTLFPPVRPRGHLELRMIDAQPGDDGWIVPL
GUT_GENOME158306_009822-428NTKQLVEETFLAPFAKPGREYVGVELEFPLLNLAKAPVDKQVACGLMTHLLEHGFQTSERDMDGNPAFLVNQDGDELSFDNSYNNFEFAMAKGKNLSYIANRFYRLYALVQNFLLPQGHTLCGMGTNPYQPSIERAQVRYPIYLAIGKFLEQYRGGRYHEFPDFPAYLSSVQTHLDVPLDKLPRALTLFARMDFVRALLFSNSLPFPEIKGMEKTICFRDYLWEKSGFGSLLGNVGKVEGIFRTAEDIMAAIMKKSVFLWEDHGAYELIPPVSLEAYFSSGANPEDIRNYLSFQNVEVTQRGTLEIRSDCTQPVQSAFAPPAFHLGVFHKLEEAEAKMEIYFQNQIPDELASRPDCNAVLRNSVIYEGKVPGGNEAAARLLKELVELAEDGLRERGRGEEEYLAPLAERAEMLTCPAKETRRRLQEG
GUT_GENOME096469_0035045-337KTGPPGTVGAELEWLVVDPTHPARAVALPRLQRLLAGPMPGGSLVTFEPAGQVELSSRPAPDVMTCVADLQRDVAHLLAVLASDGLTVVPAATDPHPRRPRQLDRPRYAAMEAYFDALGHDLGRTMMRSTTAVQVSLDAGADEADVAARWRLLHTVGPTLVAAFANSPRLGGRATGAVSTRQRVWRGLDPARTHAPTPNGDPAAAYAEFVLDAPLMFTGEGCRPHPGGTLRDWVSGRLPHLDPPTTADLQRHLTTLFPPVRARGWFEVRYLDALPPQWWPVPVTVLATLLERP
GUT_GENOME065146_008808-365EAIYNRYIVPTRRKRTHLAGLEFELPIVNEKNEPVNFEVIYQVTDRFIDTFSFLNVSRDDNEHIYLAVDDKTGDGLSYDCSYNTLEFSFGKEEDMNVLYKRFCQYYTYIQKELRKEGHMLTGMGINPRYAVNQNVPVVSERYRMLFHHLSSYKKYGNSIPFHSYPNFGLFSYASQIQLDVEEEQVVPMLNTFTKLEPFKALILANSLWGENAEILCSRDNFWRNSLHGLNRHNVDMYNVVFDTTDEIVRYIKSMSLYCVEREGKYINFPPVVLSKYFSSDRIKGEYFDGNRYREITFHPEISDLQSLRSFKFEDLTFRGTVEFRSVCEQPVGEIMASGALHAGLMENIGELSEFLEKD
GUT_GENOME208351_0175910-368KITDYFRAGCKEDYNRRIGVEIEHFVIKEDNTNATYEEIAELLEAVFGNEQCEYEQRSLLGCVTMEYAISLEPAAQLEISIFPRKTVQELEQIYEGFLKRMESYLGLRHMRLETCGYQPYAKAAELPLIPKRRYEYMDAYFKHTETMGIRMMRGSASTQVSIDYLNEQDFVKKFQLAQMLSPVLALLTENTAVIDGIPVKQHIPRTTIWNHVDKKRCGIVPGSMSEEFSFCTYAEYVYHVPLIFAKDHGIIMPTGEWSAAEYYRERLMTEEEIEHMLSMVFPDVRLKNYIEIRMADSLPKEEAFAYVSLIRDIFYGGNITKIRRYLGNPTEEDIAQAKENVISYGEDGMVYGRKAGEIF
GUT_GENOME096381_0427826-402HARAGAARELVGIEVETAALDPHTGAAVPYEGPAGLRALLEHLVRETGAVPVYDRSALTGVTLPEGGTVTLEHGGAVEYSSPPCDGVAELARVADGALRRLAAVARRFGFALVPGGQYPFTAPGDVAWVPHSRMPAMRRHFMSLGPSGADGVQVMSLALSTQTSLDFTGPEDLTRKLRALAAASTPAAALFVNAPLEGGRPCGLLSRRMDHLSRTDPARTGAVPVMLAADVDADRLVDWALDLPMIHRAAGDGGRRPAPPVPFRTLLTRGFGDGTRPGPADWRAHLSQVFTDVRVRETLELRAVDGPPYRALFSVPAFWTGLAYHAPSRDAAWELLGGITPDEHRAAQAHIARRGLAARIAGRPVRELATALLDLSA
GUT_GENOME126402_0125525-417RNVDAIAAFIESGCTADEGALGVELEHIIVREDGSPVSYSGEDGIQSVLEVLRADYPQATYDAEGDLLGVARPGEAVTIEPAAQVELSAGPFHELASAKACFEQFEKKLCGVLAPKHGKLAKAGYHPTARATSLELIPKQRYEYMNRYLGSVSPFGPCMMRGSASTQVSIDYTSADDCVRKYRVASALTPILSLICDNSPIFEGAPRTHRMVRTEIWQHCDPERCGIVPGAVEGSFTLRDYARWVYDTPAILVPDGQGGWREDSRRFSEIYAERPMSRAEIEHALSMVFPDVRLKTYLEIRPADAMPVPYVVAYAALVKGIFYCEESLARVERLFEGIATNDIEQAKNSLQELGYAGEIYGRPAAEIVLFPIGKRRQHRAGRPAAEFADKVIA
GUT_GENOME057818_0014239-431ELYRNGCKPADTFKVGLEYERLPISTVTNKAVDYGADFGVCNFLRAFAKEENFDYITDDYNIIGLKQNHDTITLEPGSQVELSVEPETSIDGIKRKVDYLNSKMAPILDRYNIKLLEYGVSPLSTHKTINLIPKKRYHIMAKYLWGILSDVMMRETAGIQACFDFKDEQDAMRKFLIANKMTPFMTAMFANSPIRGGVETGYKTFRALSWLNTDNDRCGFCGKMDEDFSFEKYVDCVMKSPIIFINREGLPVDIKGKINFEDFIENGWEGFEAGIDDYKLHANLYFPEVRMRNFIEIRNHDCGNSGMAYAVLAIYKGILYNESAMNEIEELLAPFSYRDLAELRYNVPKSALAAHIKGFTAADIAKEILYIAEKALIEDGTGEEKYLEPIKQY
GUT_GENOME119917_003624-392RELLYRRFIAPMESKKNLINRIGMEFAYPLVCLGNKSNKMIVKYLFEQLIKQGFMPLYCDKKELIGVEKKFTARIFLDASYNCLSIALAPIVAIHEADRTLQEVFKLFRTIVEEQALLIDQSIHPLHYSLPANFIADDEWMAKRKFIEKYSPQDFSGRADFFAFITAIKTHLNFMPQEIPYAFTVLSKLDFAELFLFANSPIKTKQGWFRSAKYYYYTRSGLALAGLSGSMDQSFRSIQDVLLDYAQRGLFLRRRENCQFFIPVRLGEYFKRSEYGAIEEDLDCFFSYKNTELEQIGTVKRQVSCVQPFSQTFVPVAFVRGIAAKLGQADELVKRFLRDNQITSSNNQLCEMAARGVLIGSRNSLYEFLGGLWQLAIDSARMRGVGDEA
GUT_GENOME128188_0002724-410AAIVEHLKRGASPASKRIGIELEHICVDADGRALGYSQTGGVRDALEALSERYPEKTVHNGNPLGVGRPGAVITLEPAAQVELSAGPFECLADAQASFEEFERDLDDALKPHGGRALALGYDPKCVAATKELIPKARYAYTNRYLGDISPWGPRMMRGSASTQVSIDYADEADAINKMQVAQMIAPLLALICDNAPYFEGRVRPHTLMRTEIWKHCDPARCGTVPGALDPDFDFTDYAAFLLDFPAIVRLDAQGEAHLDTRTFGEIYAAKPMNDADVAHAMSMTWPDVRLKSYIEIRPADSMPIPYVIAYGALIKGIFYTPENLTKLRCTLGPVGGTAVATAKDELMKHGYSAEVYGHPAAKLCDMLIELGRKGLEPDEQGYIEPLA
GUT_GENOME017995_012432-385NKQKIVEYIRSGAKPSENIGVEIEHFIVNSANESISYYGKDGIGAVLEELARFFDEKIYSEGQLIALSNGRYHLTLEPAAQLEISIEPLQYIDEIEKIYHEFYDLIHPILESRGYRLVTLGYQPHDCVSDLPLIPKKRYELMDRYFSHSGKYGKNMMRGTASAQVSIDYTDERDCILKFRLANALVPILALITDNAPIFEGERYGGRMIRTKIWSDVDNARCGIVPGGTDKDFSLERYADYLLNTPPILIENNGKTIYTDTKTTQEIYREREMTKADIEHILSMFFPDVRLKQYIEIRPADSMPIPYTLAYAAFIKGIFLRLSSVCDELGLDNIHTEDVENAKKKLMADGFHAIIYGRRVTDILNELLNTAEASLPDRDRIYLN
GUT_GENOME033570_01067106-489NINVLTNYIADGSKGCNSNAVGVELEHFVIDKNGDCVPYINGVENIIEQLAQNFPKHVYSEGFLIGLSCDKYNITLEPGAQIEISKKPTENICEIENIYGEFLSVINPILDKYSYRLTTLGYMPKNKAKDISLIPKKRYEYMNKYFKSVGTRGINMMRGTASAQVSIDFADEKDCVQKFKKANIISPILSLICDNAPVFEGKPFLGNTLRTYIWNDVDNDRCGIVPTVLDSDFSFKKYAEYIYNSPAILTVNGDDIEFTADKKICDIYKDTPINESIAEHLLSMFFPDVRLKKYIEIRPADSMPIKYVLAYATLIKGIFMSDFSLDVSLSAITQAKNNIILKGYNAEVYGTDAHTLAMKLCSAAYNALDSSDREYLLPLKQLID
GUT_GENOME244166_0032921-405GYTDRIRWQVGPEIEHFVFDRQSGKRILYPGENGIEGILRAFAQRHTDWTTTWEEGHLLALEKLGSSITLEPGAQLEFSLAPSESVAIIQRRYQAMLDDLYEILDPLGYMLVTIGLDPFNAVEDIPLLPKSRYHMMDAYLCNTGDLARTMMRKSAALQVSVDVGSDRDFCSKYRVLTALSPILYTLFDSAVDNEGKRLTTYNARQEIWRRTDPARTGFPADVFSADFGVDHYADWVLSAPPIFLPKDGKVVATGNRPLHELLDEAKDEEELDRFVRHGMSIVFPDVRAKQVMEIRMMDSIAAPYAFGAMALFKGLLYNMDALRRLEERFTPMQADWVERGKNAGRDNGIQAYYHGDYFAHWGTTLCAMAREGLSEQEAAFLVPLE
GUT_GENOME009245_0054224-408YFACKHPRNFKIGIEYERITIDKNNQSVNYYGDNGIKTLLENFSLKYNWEKILDCNNLIGIEKDKTTITLEPGGQIEISLSPQKSLADIEINLEDLKNKIDIEANNLGISILEYGITPASKIKDIDIIPKRRYLTMSNYLPLNLALVMMKQTSGIQVIFDYESEEDAIKKLALSSKLSPIVTGIFANSPIYEKKDSGYKSYRAYAWCHTDNNRCGIISSKLFNYFSFSDYIDKVFNIPMIFIVRNNRIINIDGKITFKTFYENGYQGFQATKDDFLLQASLFFPDVRLKNYIEIRNHDCQKGKLKYAIPALYKGLFSSYKSLSDASEILKPFNYEDIKIAQETSAKIGLGAKIGNYKISDISKEILKLSYDNLDYIDQKYLEPIL
GUT_GENOME108899_0039617-377ENDYIGVEIEMPIINLNSPYIVNNKVIEGLFSTFLDNEEFKIENYDNEKNIISIKNIKTNDTVSLEYSFNTIELSLGKELLIDRLKEKFDFYYNFIKQYLEKYEYEIYCGGINPNYHYIDKTCLNQDRYKVIERLLTNSSNSDLYNQFCSYCCSIQTHINTSKDSIVNLINMLSKVENNKMEIFSNSYMKETNLKNSRKYLWENSNFGPLNIGTNPNYNNLEDLLNDYSKRNLFFVERNNKYLLLNAKQPLNKYLDKKRVKATDEYGKTCYVVPSYSDLDNFRSYRSVELTKYGTLEIRSDCTQSKENIFKLIAFNVGVCKNHLEILKYLDMNKKITNERLVEFCIKGLKMRNFGEEKYME
GUT_GENOME181939_0086511-401VNYFKSQEKKPEDFKIGVEFEHFIIDKKTLKSINYYDQRGVKDTLEKLEAQGWKGKYEGKNILGLSNETQVITLEPGSQFEFSIKEPKKQIKEIEKEYFKFLNEIIPILEEKDQYLVATGYHPVSKINDFDIIPKERYHYMYQYFQSKGSHAHNMMKGTAALQVALDFSSQEDYIKKYQVVSSLSPVAYALFDNSYYFEGEVWDKHNLRTLIWENMDKDRSGVIKVAFDRDFGYSKYAQYILNTPPILIDNGTKVYYTGDQLAKDVFDPDDYTQEELEHILTMVFPDVRTKKYIEIRMMDSLPYPLNLSVVAFWKGLLYHQQNLDTLYHIIQEVRLENIDRAKKEIIEKGLEGTFKGKPIREIGKELVDLAKKGLDSDEIKYIDPLEEMIK
GUT_GENOME212235_0104320-407NVAAFVAYFQNGCKPKASTLGVEIEHFLVDNTGQPLTYSQPNGVADVLRTLSARYPDLSMHGDDILGVAKPHMNVTIEPAAQLEISAGPFESIADMRFALGEFERDVAAALTPVAGRMVALGYDPMLVAADKELIPKARYDIMNRYLSAISPSGPRMMRGSASTQVSIDYTSEEDAAAKMRVASALAPLLSLMCDNAPVFEGTRAPHRLMRAEIWRYLDPDRCNTVPGCLEAGFSFADYANYILDVPAVVALDAEGEARYDTRTFGDIFADRTMTHADIEHALSMVWPDARLKTYVEIRPADALPIDLACAYEALIKGIFYGPALGALAAAFEGVDAQDVDDAKTAIMERGYEAETFSRNAGGMCDALMEMAYEGLPAGERGFLDPLA
GUT_GENOME189613_016219-397SLLTEYFQNGCKENCSQKLGLEVEHFIVVKETGESVSFYGKHGVAELLERLKPCYSHFYYEGERLLGLYNSEYSLSLEPAAQLEISIAPRREISEIRNIYHSFRRQLEPILETWGYELVMLGYQPVSQIQDLSLIPKKRYDYMDAYFKKTGTMGAHMMRGTASTQISVDFFSERDFVRKMRAAYILMPMIKFLTDCSPIFEGKIQEKRMVRTRIWQRVDPRRCGIVPGLFQTDFGFASYARFLMQMPLIFVVEDERAVSVGEQTAAQLWEAHDLTEKDIEHLLSMAFPDVRLKRYLEIRGADCMPFPYVMGYLALIKGIFYQESALEQILSKKVSEKEILEAERSLMEYGFSGSIYGKRADEYMKDILGIAREFLEYQEKAFLDPLEQM
GUT_GENOME093560_0020917-447LYQRFIEPTKNNENNYIGIEIEIPILNLEKKPVDFQVVHKITEEFKKQYTNFKPQTIDHNNHICSLINKKNNDIISYDCSYNNLEFSFGRETNLFNIYERFCDYYSFFKELFEKHNHTLTGLGINPYRIYNKHIPIPNERYQMLYHHLCSYSRYSKLPKYFHDHPEYGMFSSASQIQLDVNYDNLIKTINIFSKLEPIKAILFSNSVLIGENEDLLCCRDMFWQDSTHGINPHNIGMYETQLKDIDDLMAYIESLNIYCTMRNGLYVNFPTINIMDYFRKDSVEGEYCDKGVYDKITITPKISDIEYLRSFKFLDLTYRGTIEYRSSCTQPIKDVLCVGAFQLGLKNKLYELEEIFENDEVIYHRGYNASELRKLFVKRQLPNFVDEDKLYTLIKQILDLSKEGLIERGYGEEVFLESLYERVKNRTNPAK
GUT_GENOME004983_0113439-438LVRIFAAGGKQHQGIGIELEHLLLHKNTGAPVAYEGPDGVGELLRRLSVYYERGLYDQGNLVGLVRGDQVVSLEPGLQLELSAGPYETVMEVERVYLRFREELDRILEDIGLMTPMLGYTPSALAEEAPHIRNFRYACMQRFLGAQAPEGVEMMHCSSSLQVNIDYTDEADAMRKFRVAQALSPLLALMTDNSPIYRKKPRTGNIVRTCIWGAMNQDRVDTVPGSFAPGFGFDGYADYILSRQAILVPNAPGIRVPHDADVPVDPEGKWVWAGDATFDDIYQRPMTDQEAEHALTMVWPDVRLRGYLEIRPGDAMPFDYVLSFTALIKNLFYNQANLDVLEMLTAGVDEDDINVAKRELERKGYAGTVYGRPMSFWANVLTQLAFSVSRGEERIYFEPLM
GUT_GENOME033115_0032410-392KLIDYFKSGIKEEKDKSMGLELEHFIVKADTMESVDYYEENGIRDILKLLSPYFEKEIYKDGNLIGLVKDKSNITLEPASQLEISIGPFIDTLDILEEYNYFLLKIKPILKKFDYKMIYAGYQPKSKVDDLNLIPKKRYEYMLDYFSKVGLHGKNMMKGSASAQVSVDYFSEEDFKNKFRLANILSPIISFMTDNTNIFEGEHYDKNCVRVEIWNDVDPDRSMIVKGALNKEFGFSEYSEYILNSPAILIEDDNGNSVYTKDKKIKDIYNDRELSKKDIEHIISMFFPDVRLKNYVEIRMADCLPIDYALSYASIIKGIFYSEEVVKLLLDKFEDIKDIDVIKAKEEVVNKGYDAKVYGFDIKDLMLNLIELAKDHLCHEEKD
GUT_GENOME127913_0103121-416NRERFVRYLSAGIKQPGGPTLLGVEVEHFVVFDDGRPVSYERHDSALGIREVLEYLSAFYPERAYGTRGDLIGLSGSDGAVSLEPAAQLEFSCAPCTTALEVQAAYRRFRDAVDPFLQTRGAHIVSRGYHPTRKALDLELIPKQRYRFMDEYFARIGTHGERMMRSSCATQVSVDYYSEQDAVRKIRVASAIAPVLAAIADNSPVFEGAENHTPLRRMQLWREVDNLRCGAIPGVFDAGFGFERYADWIMRTPPIFVTRPAAATPAGPALRAVFEQAADKAYQDAPLEDADIEHVISMFWPDVRLKHFVEVRPADCLPADCAIGYTALVKGIFYSEESLAAIERELGVKGGVWPIHVADLNNAIVSVQAFGMRGKVYGKTLESCEKLLFELAEGAL
GUT_GENOME135504_015266-397LIYNHFFKKFENRKIGNVGCELEFPLVNKSGSDVDKAFARGIFDYFSNRGFKRAEDEGYFFENAHGDVISFDNSYNNFEFSLCYGNNLCEIKKRFDEYFAEAQNYFSRENYELIGRGTNPNKSTISQNRAEFSTYNMVDDFLKTYGKDTKFPDFPAYLSSVQTHLDVDLQTLPTAYTYFAKLDFVRAILFSNSPDFDKKGYRCYRDFLWEQSAFSKCSNITGKVDAEFETVDALVGYFLEKDMFNRKRDGKYEIFAPVNIKEYFENEKYGAKGDDIDCYLSFKNVEITARGTLEVRSDCTQPLNSSFAPCAFNLGILCNLDAAISDVDGFFDENSIAMSNTELRNIVVSGKKLNKICDEDILSDFVYDLVEIAAEALEKRGLGEEKLIEPLY
GUT_GENOME004893_016369-400KTLAGYFAADCEPDGKLTLQLEVEHFLTRSDGQPPAFADVQAALRDLQQQTDAPIITDGEYFGYSGPALTATLGPACQLRISLAPLRDVQDIMDLYNRFYLQLGLALAAHGLRAWTAGSERPGPGGPRCHPTCHAEDLPLVPRTRDEAMDAYLRKKGACSVQMMRATAATQVSIDYQDETDFVRKMRAASLLTPFFALLSDNAPVYQASRNSSYCIRTRLWQDVDRDRCGVTPHLMDADFGYARYAENVLTKPQITALRLGRVRAAGGKIAPELYAGHVSRQETAQILSNFFYDVRLKSRIELRAADSMPPRYIAAYVQLVKSVFGSPAALQNVLRHYAGATTLDITSAKLAVCKDGYNALVYGRPASGELAWLLMQARSRTPSQEERALLD
GUT_GENOME066618_0167125-404QLTAYFEKGSKENQEFNIGLEVEHFILNRETHETMVFDGEHGVGALMEQLSSGYPEKHMEDGAVISLESDDILITLEPGCQLEISTAPFASVERVVGAYHQARREVEEALDVWGYDLVTEGYLPYGKAAEIPLIGKERYRCMDRYFAGTGKYGKNMMRATAACHVSIDYFDEEDFVNKYRLAWLLNPVFALLTENVSRFENEPFSGHLLRDKIWQGVDLKRVEVCPDIFDEDFGFESYARWLMDIPVIVVNNGDTYRYEPEKTIGQCFEAYGGDEALIKHYLSMAFPDIRLKQFIEIRSADSMPENYVKAYCALIKGIFSNGETVTKMLSLLPQEAEAIEAAKQSLRENGYRGMAYGQDVRLMLYWLVNEAAANLEGEER
GUT_GENOME064845_004616-395EQQNLELLEQYFRDGCKWNCLQKLGVELEHFVIDRKNQKNVSYYGSDGIEELLEEWKEYYPGHERENGRLLGLYNNDYALSLEPAAQLEISMAPKESLRCIEKIYRNFSEAVRPSLERRGYELVTRGYRPFGRVDELDLIPKKRYEYMDAYFQESGTTGRNMMRGTAAAQVSIDYCCEQDFISKYRTAYLIMPALKLLSDNTAVLEEKPYPGHLARTWIWDHVDSGRCGILPGIFEEDFGFHTYAEYLWNLELIFLPEDGGYRSTGHQKVRDLWSDRLLTRQDMEHILSMTFLDVRVKHYVEIRGADSMPLPYVLAYTALVKGLTFEKEVREELLARYPIQEEDIRKAERSLAEQGYEGEIYGEPAAAFIGRLLQMAEDHLEPEERKYLQ
GUT_GENOME114249_008681-413MDALNVIYDRFINPIAEKNSRNVGTELEFPLLNMARCPVDKSVAQGFLKRLVSEDFTVDDTDTDGNPAFVVNADGDCISYDNSYNNIEFSMNYGANLTEIRDRFYALYDRARSYFEPFGYVIAGLGCNPYKDYIETSHVSYPVYNMVQEYLTEFDCDKTHNYPDFPAYLSSVQTHLDISAAGAGETADFFAKMDFAAAFLFANSLAFDGSDYLCFRDYLWENSAFGLLNGNTGCHDTLMRNADGVAGSFLQRSMFNRIRNGTYEVFPPVSLENYFDRAGAKADDIRQFLSFRNIEITARGTLEIRSTCTQPVSEAFCAPAFYLGLMNNFECAVGTVDGFFGEYGITEKNSVLRKAAIRGKLPEGVDKNGMIRLLTSLVRIAEKGLENRKLGEEFMLKPLTARAQKLESPALAA
GUT_GENOME096531_0093815-427RIVAYFESGCATKGDRYYDDDGFLDRPGYLGVEVEHFLMRTSDQEPLFYEPHDGIPGVRDVLEHLAAYYPQVTRNAAGDILGLANESGNITIEPAAQLEISIAPLARLSDIAKAYNDFRGYVDAFLAPYGCSVACQGYRPKTKALDMPLIPKQRYAFMDRYFHELGTHGERMMRGSASTQVSIDYFSESDAVRKFRVGTALAPILAFICDNTPVFEGEKNEEPLARMALWRDVDPHRCGVIPGTFSRGFGFESVANYLLHNAPIFVTRAKAAHPDDARVEEVRWVGDERAMDAYGDAPLATEDIEHLISMFWTDIRLKRFVEIRPADCMPDVGVLGYAALVKGIFYSPANLTAIEDALGVQGDFWPLSDSSVEMAIAKIRQHGKNAAVYGHYVYEWVEFLFNMANASLEEEEA
GUT_GENOME199425_0167818-433TKSKSRRFIGIEIEMPVVNLNKAPVEEKVIFEMSKAFCSYFGFDIIGYDADGNANSMQDSVTGDNLSFDCCYSNLELSLGKGENLHQIKKRFDAYYTFINEFFSRHNYTLTGMGINPYYNINHNKPIQNERYRMLYHHLHSYKKYENNGFAFHKFPAFGTFTSASQVQIDIFYDDLIPVINTFGKLEPFKALLFSNSLMSEEPDLLCVRNMLWERSMQGYNPHNIGMFEYELKNIDDLIEYIKSTSIYCTMRDGKYINFEPVPINEYLKSKKISGEYFDGKQYKEIEFEPCLEDLEHLRTFKFEDLTYRGTIEFRSTCCQPISESMSVAAFHIGLLERVQELHGLLNADNVIYSHGYSASELQRMLSKNMLPDFVDKDNLIKVLFRILELSESGLKEREMNEEIFLQPLYERAKNF
GUT_GENOME066077_0114123-395LLKFFENGRSQRGTGGFGVEIEHLPVHNSDDTAVSYYEPNGIEALLKRLAPYYDEEKEYWENGHLVGLGRSGVAVSLEPGGQVETSIGILKKPSDLNTLYSKFRRELDPILDDLDFRLVNYGYQPKSSFADVPVNPKDRYDAMTDYLGRVGQFGPCMMRCSASTQVSIDYVDERDSIEKLRLGTVIGPILAYFFRNTPYFEGETNPWPLLRQRMWDYLDFQRTNVLPGLFDDRYGWEDYAVDVLSTPLMFADLTHTPEAVASGASPKELHRPAFRENAGEVYPDRELNPYEINHIISTHFNDVRLKNFIELRHWDSLPIERAERLTEIVSSLFYVPEHRERLESYFEGISEEEVFEAKANIQAHGREASPYGQ
GUT_GENOME281901_009994-363ITEFIRSGEKKHQTGGIGLEVEHFVLHKKTGLPMPYATIEELFKLLEPDYPQAVYEAGHCVALENEETLITLEPGCQLELSFLYSADLKRIQTAYQKAMKPIEEFVDEKGYEIVYAGGLPTVDVDSFPRIQKARYALMEAYFQNHGTRGKEMMKGTAAVHVSVDYADEKDYVKKMRMANLLHPVFAFLCFDSWYYAGKKNTDLLLRDSIWQGTDPNRCRIVDDLFADDFGYQSYANYVLNAPLILMHTKEDAYLSVQDKTCREVAKEYGWSDQAIAHYLSMVFPNIRTKNFIEIRSADSMPLNETMAYCAFLKGLFYRGETVEKYAGLAHSIDAIHATIASIRKDGWQADVYGYRCDVFC
GUT_GENOME027327_003935-368QFIINYIESGEKPTTEQNLGVECEHLILKEDGRAVGYFDKGGTQDIFNDLVNNQGFEPHIEDGKILSCKKGEFSISTEPGGQIEFSSKQKRCVKTVEAGILDFYDLIIPEINKYGNGLYTVPFQPVSRIEDYEILPKERYYAMFDYFKTHGRLSHLMMKGTTSLQCSIDFHNEKDYSRKYGMATRLTNIFYTLFDNVAFKENRPLDIYAYRAKIWEHTDPQRTGVLERAFRADFSYADYAKKLLDTDAIFTLEDGNFKRFDGTIGEAIGDDFDVKKLEHLLTMVFFDVRSKRFIEIRMIDSLPWPYNIGAVALINNIFYNDDNLDRMWNLLGDMTYKEHLRSREEIYKKGPQAKLMGKKISRWG
GUT_GENOME103710_0377211-397LIKYFENGSKRNCIEKLGVELEHLIVKSETGESVTYQEEKGIAYILERMSGYFPWRYEPEGFLIGIYNEDYSISLEPAGQLEISINPRENITLIYRIYRMFLEQIHPILRECGYEMLTLGYHPKSKVRDMPLIPKTRYEFMDRYFMETGTCGINMMRGTAATQVSIDYCSEQDFVQKIRAAYILMPLLKLLTDNTPIMEGEIYEGYMARSYIWDHVDPARTGIPRGLFDEDFGFARYADFLLDMPPVFVEGQDAPMYTGHKSTAEVWGGERLSRHDIEHILSMNFQDVRLKHYLEIRYADSMPIKCVLAYTVLLKGIFYNKELVKEVNEKFTASEQEILKAQHNLMQDGFDGDVYGYEAAKLLSYLIEAAKNQLMESEQLSLLPLEE
GUT_GENOME110215_003027-399NKQALVAYFIEGAKGDKPLGALGVEVEHFVVTAEGEHSVSYAGSEGEFGVRDVLAHLADAYPQHTYGLEGDLIGLASDEASITLEPAAQLEISIAPYLSIGQILRVYSEFRSKVDPFLAEHGCKLVTSGYHPTARALELTLIPKQRYRFMNNYFMEIGTHGERMMRASASTQVSVDYRDEADAIRKMRIAQALAPIFASVTDNTAVFEAEPAPRLARFALWRNVDNDRCGSVPGLFSEGYGFADYADWILRTCPIFVTRPSADDPAGPNLRAVYGQSAYEAYGDAPMTRADIEHVLSMFWPDVRLKNFVEIRPADSLPAPLMAGYAALVKGVFYSERSLCAIEQAFGAAEGVWLLNDSSANQALEAIQRHGAEAVIYGKTLAEWQELIFEVAD
GUT_GENOME163235_0105915-427LAKRYLASLKDNPEQYIGVELEFPIVNTMGNKTDVLVTKALFKHLQEVEEFIAIKEDEDGNPVQLQHKINKDQIVFELSYNTIEFAFEKTKTIQEVESRFQHYLGIIQPFLRERQHQIEGKGIHPYWRINENKPVKIDRYKMLIQFLQLSGQYPNHRFHHYPDYGTFICGNQVQLDVSRTNYLEIINAMNKVEPAKAYLFANSIFDGEDWDTKISRDIFWEDSMHGFYQENVGVNPKVFINEQDFLEYLAKTALFYVYREDEILYFEPIRVEDYLAKEKIVAYRLDGTRQEIKPQEEDLKNHRSYHFQDLTTRGTIEYRSVCTQPLNKTFAPIAFHVGLHHNIKELDEYLAQNIIFDFNKDNPKELRRQYSKKQLTESELVKIKQYSLDLLKISKRGLIDRGNREEVYLEEII
GUT_GENOME174867_015307-393QLVSVFKSGVKEKHGFSLGVETEHFVTDKKKNRRVYYDDGVENLLEEASAFFEDVSFFGDRVASLGNKNYVVTLEPSAQFEVSIRPCENISEIENIYNDFLSVFTPLLKKHGFSLEMYGYSRNSLADDCKIIPKERYRLMNGYFKSLGNTPQYMMRQTCATQVSIDYFSEKDFVDKLRLASVLSPILYLVSENSPVTECKKTDNHCPRIDAWRKVDDTRTGIIRNIFDDDFSFEKLAEFYSNISPIFIFENGREVFTGKKTAREIFENKTVTDENVNHLLSMAFFDSRVKNFIEIRSADSMPVEFTLAYCSLIKGIFSSENTVKSLLDFANCKTTKEIENAKTQVALFGYEAEVYGKNAKQFAEFVIENAKNNLNESEKKYILLLEN
GUT_GENOME096284_0122130-401GDKDPAGKRIGFELERILIDDQGRTVPFAGERGVGALIAELARSRSESELVYIDGHLLGLDYTFHGELETVGVTVSLEPAAQLEVSAGPAHSVKALYDAICAFDAEVERACAAIGLDASLVPVGYNPVVSSPLDLELIPKERYRDMDAYLSRRGRYARDMMRCTASTQVSLDYEDERDAARIYRMATLLGPLFAFLFDNAPIFRGEPSPGMARSRIWHHVDVDRCGIVPGAIEGLSFEDYILWVSGVKPILFTDAEHVTTSTGDCYTRDIMSNRPLEPAELMHLLSMVFPNVRLKGFVELREMDSLPPRLAAACTSFTGALFYDKCLEQKLAEHLRDYLPHGFAGMDENDAVVARLHLEEQGWDAQVYGMPV
GUT_GENOME137475_008074-381DKIIDYIRSGEKSEKDYAIGFEIENFIVDDDHRAVTYWDEGGIKDILEDLINIGYEAIYLDGVLLGAKKGKTEISLEPGSQFELSIKKAWRIKDLESEYFARMEEIKDVVEKRGYHILQRGYHVKTKIDEIKILPKQRYDLMYKYFSETGTMGHNMMKGTAASQSSIDYKDEEDFRRKMRVYNALSPVFYAIFENANIFEGKPYNENGLRMTIWKNTDKDRCSIIKEAFDDDFSYASYADYIVNRPCIFIHKDGKDIYTGHTKIKDIEGIDEADKDLVEHVLSMFFPDTRAKKYIEIRMMDAVQIEYLLALVALIKATSYQNLDEVYEMIKDVTYEDALRALGSFKEHGIRGKLKEKSFLDIAKKLISLAKSSLGDEA
GUT_GENOME095591_002162-442NFKAANRRKLVEFFQSGCKTAQGFGIEIEHIVLHKNTHLPASYEEENGIAALLERMKPFYEEAHYDGSHLVALSRGNEHITLEPAAQLEISAGPFENLLDAKFAYACFRKTLDPLLDELGLYTPMLGYVPTVPAKTLKLIPKFRYDAMNAFLGAEAYEGVCMMRASSSLQISIDFRDERDAMRKFRVSEKIAPILALMADNSPFYEGKNRHGHMARFALWTRMQQDRCGVVPGSLGADFGFDDYADYILTRRAILVPYAADVAAILEGDNGERVAAGAAGAASAAGTGEEHTTSPVPAYIPEHDDHWLYAGAHTFDELYAHRAMTDDECMHALSMVFPDTRLKNFVEIRPADALPLSHALGYMALIKSIFYSDDILDVLDRSLAPYGEQDVLDAKQSLIEQGYQGRAYGHGAAFWADKLVALALDTMTREERIMFEPLRSL
GUT_GENOME078989_0100113-400DLDLLTEIFRSGCKTDEKTGIEFEKIGVYSDSLKAVRYPDIARFLQTFKNGNWSGVYEGNNIIGLKNETGTISLEPGSQLEISLNPLDTIAEIENKITEYNKITAEIAEKMGIIWLGYGIQPLSTNKNIKIIPKKRYEYMTRYLPTVAKKPLVMMRETSGIQTGIDYKSEQDAMKKLSLALKLSPIISAAFANSPVRSGKLTGYKSYRAFSWLHTDNDRCGLVSSKLFERDFEFSFEDYAKILLNVPMIFIERKGIGAIPVKNLTFSQFLKDGYEGFTAQIEDWKLHLSLYFPDVRLKSYLEIRNHDNQRPELIPAVPAFWKGLLYSENASNEALELLKPFSYIDFEYIRRKTPKCGLSMNIKGYNLAQIAAEALKISHNALKKAGIG
GUT_GENOME284994_0000614-401EHIAGYIASGCKQSQCLGVEFENILVHKADGSPVAYDGPQGVEQILERLSATYPEKTCQDGHVLGLQGGNATITLEPAACLEYSAGPFTTIDQVKDSLEQFRATLDPILDEFGLEVADLGYHPSACAQELSLIPKKRYEAMNRYLGEIGPYGACMMRGSASLQVSLDYTSEEDAIKKLRLVDTLAPLLSLLCDNSPIFEKKRAHQHMVRTLIWKNYDPARCMVIPHTFDDSFGFRDYAEFAYNQAAIIAPHSDGSWSYAGSTTFAQLYRDRLMEDADIEHALSMIFPDGRLKRFLEIRPADALPKPYALAYVALLKGMFYNEDNLDELSAAFARAHVNEERIAKAKESLMSKGYDGCAYDRRPAEWLEMVFEAAQRGLSEEEAVYLAP
GUT_GENOME186025_0161210-424ICNEIIKPAMEKETDYVGVEVELPVIPFDGHADLKETGCRLLQKLIEEEGFHEVLTGTDGCLVRVENEDRDAVSFDYHYGMIEFSMGKSLSLTKIAERFYRLLYFAEDFYGQHGFGLTGMGSNLTNRYPMYAYTNDPFYRMIRAYVSEQSTYHDPGRYFTNMHSTQTHIDLKKDHFFTCYNLFNALDFVRGMLFSNSVPVEGTLPEGMEFPEGILCARDYIWEHCELPNTGTVDQTFGGMEEMAEYMEGLKLFVEIKDGKLTYREPVTLREYFQEEGHTAEALNCYRSFQNVVLNGYHVLEVRGDCTQPLTETFAPCAFNLGIAVCCEEAEQILKKFKEKNQITAANSTLRRMAVCQQEIAEGKEMQALLCSLIDTAEKGLKNRGRGEEQYLAGLKERAHSLASPGKRIRELQKE
GUT_GENOME096502_0022917-397KTKDDLMLGPEFEHIIVDRKTYKSVSYYGDRGVEYIFKRLVDAGFVPEYEGDHILGLDLDDLNIATEPGGQVELSIDKKATVMDIEKSYKRFFSILLPILDEEDYDILAIAYHPVTKIDEIKLLPKSRYDSMFEYFKNHGTMSHNMMKGTAGLQLSIDYVSEDDFKKKFFLANVLTNILYSIFDNGYFFEGKPTHHNIRALIWENTDPARSGIAKNVFENNSYEGYAEYLLKTPAIFAYVDGKLQAVGDRLIGEFLDDKSTKEEIEHLMTMVFPDVRQKGYIEIRIMDAVPYPYNMAVFALIKGIFYNQMNLDALYQLFADVNYEDMERVRKDMYTHGNETIFMERSLKAWSLALIDMAKDGLSDEEGKYLDPLKDFIMTE
GUT_GENOME193126_013768-431LRERYFSNLKQSTELFVGIELEFPIVNLSGTAVDFQVVFSLFSELVEKLPLSIEKSDNNGQAIQLVSDENEDRILFEVGYNTLEFAFDKALTISEVDERFTLYMAVIQPFLSSHNHLLTGMGVNPFWDKNDNRPVASQRYEMLMAYLKLGDSGETQTDKYYQFGAFIQGSQIQLDVTAENMITTINAFNAIEPVKAWLFANSYLWHAQLDTLISRDVFWESSMHGIFPENVGVFPKPFSDQEDFLDYLNKTVLFTTAVSDETYYFEPIQTHDYFNHDEIPAFDLLGGDVVLTPSPHDFKAHRSYQYQDLTTRGTVEFRSSCAQPMADTFSVAAFHLGLMCELPALTDLLSDHIFYEDYGRDYQRLRRRFSTRELDPDALADMLAFSGELLSLASRGLEKRGFGEARYLAPLYQRIKTRENPAQK
GUT_GENOME113174_0101127-401NKRIGLEYERISIDSKTFKQTKFESLFEIIKEFSKINGFELIYDNETIIGAKNNEGTSISLEPGGQFELSLAPKEKLYNVHFAAQNYISQIDYIAEKYGIKFFAIGNQPKTTFNEMEILNKRRYKIMTGYLPTIGRLAPVMMRETAGVQINLDYKNSVDAKRKIKAAILISPFLTGFFANSPFRKNKLTKYKSIRALAWKYTGPDRCNLFYKGIIDYTYKNIYELYANYILDVPMIFISRENNYLELNGKITFREFMKEGYKGLYATMEDYLVHQSLCFPDVRLKNCIEIRNHDSQNLDVAVGIGAIYKGLLYNDDTIDKILDYFAPLTSSDLEDYGFLAAKFGVNFNVEKLNTKAFEIVKKLLYLAQFNLEADE
GUT_GENOME013922_0140916-417VEEMVYEKVIQPFEGREYGYVGAELEYLILPADNETPLADMGSDFLHYLVEQHGFEVSGIGSDGRYMRVTKEGDDVSFDYSYLLLEFSMAKQRSLMTIWEHFLPMFRTARDYYEERGCKIGCFGRNPFHTHRGEYTQDAFYSMIRKYLMDYSQDKDLNKYFTNMASVQTHIDVPMEHLLKTYNLFNELDFVGALLFSNSGKLEDPESEICCMRDEYWEKCGIPDIGVYAHDFASLEELAVAIAQEEIFIRPVENGIEGMQPVSLEEYFGRQGNPQEYFQYFRSFRHVVLNSYHVLEVRSDCTQPLEDCLLPGAFHVGIAMNYEKAAALVKEFKNGQQITEDNAVLRKKAAVGVAIAEEKALYEFLGQLYELAKEGLVKRGYGEEKLLTGLKDRIDRRENPAG
GUT_GENOME047473_007042-407KERIIQKYIEPLKRKKTGNIGIELELPIVHLKQQPVDFSIVHAITNQFVQTFHPILVHTDDNGDIANVEMKNGDIISFDCSYNTLEFSFAPESNMYTIKKRFETYYTWFQDQFKKQDHALTGMGINPHRALNHNVPIPTERYRMLYAYLQENNFPYPNYGFFACAAQVQLDADMDSFLISLQIFHKIEPYLSVLFANSILDDYVLARDWFWKVSMHGINPKNVGDFDVVPTTVDEMVDYIASQSIYCVMRDGIYLHFDPIPFREYVQQKQIKTCDSKGKEVIIQPTLDDIAYLCTFKLDDLTYRGTIEHRSLCMQPISEVFGALAFLVGMETNKEALMDWQPSLSFSRAQVNKKGGVRSLNQDLLIKDCKHLYHVAYEGLCKRGYQEEKLLERIAFRFDTATSPAL
GUT_GENOME244165_0097020-393KKPEEAALGLELEHFVVNRSDGRRIFYDEPDGVAELLKEIAAQEGLTPIWINGHIAGAKNEEMALSIEPGAQFELSLHYQRSVQALEAAYKRAMALVLPVLQRHDYGLLSLGVDPVNGVDQVPIIPKERYRIMNDWLGAHGAHSRKMMRLTCALQVSIDYFSEQDFVKKYRVLTAMVPMLYTLFDSAWTLSGQALPKYNVRQEIWRKTDPARTGLIPGVFDPDFSFARYAEWLLDQALLFRMEEGHEVELGGLTLDEALDQAQPDEVGGLIDQAMGIVFPDIRVKQFAEIRPMDAVPLPYALGAAALFKGLLYQDEALDRLYRRLCGIDLSIVERGKDSGRDNGIQGYYLSDYFAHWGVLLVDLARSGLSAEEG
GUT_GENOME058370_0133821-391KLGVCFGHFLVNRDDLKAVPFEGEQGIESLLEYLTRNGWEGLYEEEALIGVRKDNATVLLQAGGQIEVWIAKTASLKVIDKTYLDFLQSLGSELEARNQLLLSIGYQPVSKAEDIELMPMAKAKLLAEQLQDDPAALKALKAAAKTVVVLDYAHSDDFEKKYRVVATLAPVFAALLDNIPVLDGEEYNQFTAGIALDDDKKTALAKVENVITGHSFKYAQYSNFVANAPAVAVEENQQIVASAKTNEEVYEDQNMTEEQVAGVLDMVMPEVKATTRGLELHFVDALPYPLNMAYVALIKGLLYNADNMNALYEFVLNLNKELLDSLRQTVIEQGMEAKFNEGTVQEVAKDLYFMSTPQLPEEEQHYTQPLD
GUT_GENOME012599_0122311-348FAAGANPGRRIGVELEHFVLKADGSVCAYDEGSRRILEALAPDFEYKNVQDGHLIGVANDDYCISLEPGAQLEVSISPKENIRGICASFESFYKRLAPELAMCGVRLAAFGALDRRRVDKIPLIPKKRYEYMDRYFAETGSCGRYMMRASASCQFSVDYADEADFVKKFRAAYLLTPVFALLAANGGEGGFLKRIEIWDNVDPARTWVPEDLFSPGFGFGSYAEKIMDTPAIFMPQAGYTGDAKIGELAEKYGTDDEALEHFLSMVFPDVRLKKYIEIRVGDAMEYKAAAAYAAFGAAVLYRGADAVLRLLKDAEVGDIAAAKHAVARGGFDAEIYGM
GUT_GENOME251367_0118913-408LIFNELFSNFSNEERGTIGVEIELPIVSKKNIEMKNIQKLFRYLIKEKEFIKDCEDNDKNIISIINKENHDKISLEYSVNTLEFSLKEDCNIYNIKKRLDQYIIIIQNFLDKIDYQLVGRGINPNYKNIDRHCLKESRYLTIEKLLTENNGNNMLFNEFCSYICSTQTHLTPSINELPDVINAISSIEWVKSYLYANSYMEELKCNISRDYLWEISNFEKSNIGTNKHYETIKDIINDYKKRKMHFIQRNGKYYLINSMTIEDYFNKDKVLGKDYNNNLEYFSPLETDIHDFRSYKNVEITRRGTIEIRSDCTQELDNLFETVAFNVGILKNYKLINNLVVKYGFESDYIERRKRINRCKNVNDKELNFIYDVMNLIYLGLKERGYKEEKLLKRLN
GUT_GENOME140361_0073331-333AAVAQYIARGCLADAPLGRVGLEVEAHCHDPQDPCRRPGWDEIAAVLDALPALPGGSRVTVEPGGAVELSGPPADGVLAAIDAMRRDQAVLRPAFAGAGLGLVFLGADPLRPPQRINPGARYRAMERFFATSRSGEAGAAMMTSTASIQVNVDAGPREGWAARVRLAHALGPTMIAMTANSPMLDGDFTGWVSSRQRVWGRMDSARCGPVLGASGDDPGTDWARYALKAPVMLVHTPDAEAVTRWVPFADWVDGRVLLGGRRPTVADLEYHLTTLFPPVRPRQWLEIRYLDSVPDTLWPAVVF
GUT_GENOME208176_007688-399DKIVAYIKSGESQKENFKMGLEMEHFVIDKKELFSYDYFGKKGVGESLKELNHMGFEITNEEEGYILGLRQDDIAVNLEPAGQFELAIDAKKNIKDLDESYKKIMTEIIPIFEEKNQYLETLGYHPKSKIMDLDIIPKDRYRYMQKYFKEFAGKYALNMMRGTASVQSAIDYHDEEDFKKKFFVANALSAFMYTLYDNAYIFEGEVYEKRNLRQEIWEYCDKNRTGVYDFAFDEDMSYEKYAEKILNTDIIFINEDGKDIYKGDTKFEKIMDKDSSDEMIFHALSIVFPDVRAKKYLEIRMPDAVPYPYNISFPALLKGLFYNEENLNKLREDFKDMTYEACQNLKAETKKLGLQAKYEGKTICEWIKYFISLAKDGLDENEKNYLSSIEKL
GUT_GENOME120900_0105241-432NIQALVEFIESGIKPLGGPEAVGIELEHTVVRDDGSPVGYADDHGIAWLLERLRAEYPQATFDEEGDILGVAREGAAVTIEPAAQVELSAGPFTRLADAQRAFEGFEELLARELSPHGMKALTLGYHPTARVDELTLIPKRRYRFMDLYFKERGPYGRRMMRGSASTQVSIDYHSVEDCLRKLRLAGIAGPLLALICDNSPIFEGSPRPHKLVRTKIWKECDPDRCGIVPGSLEPGFTLERYAAYLLDAPAILVPCKKEGCCYTERTFGQIYAETPMTRAQVEHMTSMFFTDVRVKTYVEIRPADALPIPYAISYAALVKGLLANDNGLDALDGLFEGIGEADVAAAKDDLMARGYAAEAYGRPVAALCDRLVEIAAEGLAEDERAFLAPLA
GUT_GENOME176930_0290618-402LVEVIASGEKPRAQWRIGTEHEKFGFRLDDLRPPTFEGERGIEALLNGLVRFGWEPVQENGCTIALLRDGASVTLEPAGQLELSGAALETIHQTCVETGTHLNEVAQVAGELQLGFLGMGFQPKWKRDEMPWMPKGRYKIMRAYMPKVGSLGLDMMTRTCTVQVNLDYATEADMVKKFRVSLALQPIATALFADSPFTEGKPNGYLSYRSHIWTDTDADRTGMLDFVFEDGFGYERYVDYLLDVPMYFSYRDGTYHDASGQSFRHFMQGKLPVLPGALPTLRDWSDHMTTAFPEVRLKKYLEMRGADGGPWSRLCALPAFWVGLLYDDTALDAAWDLVRDFTLAERHALRDGVPKHAMNLPFRNGTVRDLAREAVRISVEGLKRR
GUT_GENOME048050_0032328-394IGIECEHFITERGGKAVPFFGDTGVENILKELSSHYGNRVCSSGHLIGLDDGSVYITLEPAAQIETSIVPCAEIEDIKNLYMSFNERINNILLKYSYTLVTSGYQPFSKAEELTLIPKERYYLMDEYFKSVGTNAMWMMRGSASVQVNIDYFDETDFSEKYRLANLLSPLFYLVTDNADVFEGKKYNGFSARSMIWQNVDGKRCELSAEAFDKGFGFKEYAEWVCSVPPIFIMNGDSCIKTGKKTAEQIFGGREINEGEIKHILSMVFPDIRAKQFIEIRCGDSMPPEFMLSYAALIKGLFYNCDALGYFNNLFKNVKYRDICRVKKDIQKNGFKSDYFGCDIKKLTKKAFSYAEEGLESSEKDFLR
GUT_GENOME243900_008467-400LEAVINYIKSGEKLKEEFTVGMEMEHFIIDKDTLKTVSYFGEDGVGETLRELHEGGCVAYKEGDYILGLENDDLAVSTEPGSQFEVALSSKWNIDELKDIYIKNMRKLVEVFDKKNQAMVTLGYHPVTKIDEIKILPKKRYDFMYKYFNERGDMAHNMMKGTCSLQCSIDYSSEDDFVKKYKICNCLSPVFYTLFDNAYIFEGNPAKSRNMREIIWENTDTDRSGMYPFSFDDDLSYKKYAENILNTPLIFSIDENHEQYYVGGKTFKEVFEDVRDTDMIFHALSIVFPDVRAKRYIEIRMMDEIKYPLNFSAVALIKGLFYDDTNLDKLSELFKNMTYESCMKAKLDAREKGLDATFMNVNMLEFCKMLVDMAKAGLPANEIGYLIPLEEMLE
GUT_GENOME139825_000797-402AEEIIYKRFIRPFENKKAHHIGVEAEFPVVNTKKGDTDVSFVSSVSDYLEKEGFSCVLYGTGGEKLFMENDAGDCLSFDNSYNNFEFSLNHGENLCDLYKRFNRYFDLVQEYLLKKDHMLCGSGTNPNFKNLSVNHAPFKTYNMVQKYLHTFKSQHNYPDFPAFMSSVQTHLDVPLEKLPQAYTFFARTDFLRGILFHNSPDFENKGYRIFRDYLWEKSAFGACPGITGAVDEAFEKTDDIIKFFLEKGMFNRLRGGEYEIFEPVKIKEYFENAAFGAMESDIECYLSFRNVEITARGTLEIRSDCTQPIGKTFSPPAFNTGLLYNMEKAENVLGGFFERNGITMKNSHLRGIVASGGNLTKIAPESEIKKLCGDMLCVSRAGLLSRGFGEEKLIE