UHGP-MC 43431


Information


Number of sequences (UHGP-50):
128
Average sequence length:
286±24 aa
Average transmembrane regions:
0.04
Low complexity (%):
2.43
Coiled coils (%):
0
Disordered domains (%):
3.91

Pfam dominant architecture:
PF00188
Pfam % dominant architecture:
4219
Pfam overlap:
0.41
Pfam overlap type:
extended

Downloads

Seeds:
MC43431.fasta
Seeds (0.60 cdhit):
MC43431_cdhit.fasta
MSA:
MC43431_msa.fasta
HMM model:
MC43431.hmm

Sequences list (filtered 60 P.I.)

Protein Range AA
GUT_GENOME096189_0224540-351DRTKQDILNKWQQFKPMDTGTSYMGPERIYMESPNVAVPYKAGTIKPEYIEDGLRAVNFVRFLSGLPDDVTANPSLAGQQQAAALVNALHQKLSHYPTMPAGMDDSLYTSAKEGARTSNLYGGSPTFYDNVLGYMADSGATNIDRVGHRRWIINPEMKQTMFGMVHNANNVAYASMYSMDKGRPASEVQYDYIAWPSAGYFPEEVFKTNDPWSVSLNPQKYDRTRTDQIQVKLTRVRDGKEWSFDKSDNDKSGKYFNVQTSYYGVPFAVIFRPDGIGDFAPDDAFTVQITGLYSASGSAAQVEFTTTFFKMM
GUT_GENOME154172_00030316-567ASDPVLSAPYAAGKVSDESLGQALAMLNQIRYIAGLDAGVTLNDTYTQMAQTGSMLNAANNQLSHYPSRPSGMSDELYELGYSGTSSSNIAMTSWRSSLLKSLMMWMDDSDSSNIDRLGHRRWILNPTMQQTGFGYAVSNSGAAYSSVYVFDYDYHNTDVTGVAWPATEMPKRYFESSSAWSYSYGSSLNGNVSVKLQCLTTGRERTWNFSSAASDGYFNVDNGGYGQKGCVIFKPDSISLQAGEDYEVTIT
GUT_GENOME121431_0217214-299YETLPDLQQFLPGALTEEALEQALNNVKFIRYLAYLPYDLALSEEATARSQAAALLLAAANELSHTPSQPEGMPLALYETGYAAASSSNIASFNWFTDDVLLTGLEHFMLDEADYNLPTLGHRRWILSPRLQYTGFGLANSASGISYVVMHVMDFSGEDADYGHVAWPSAGAFPAEYMSAGMPWSVSLQPEAYNLEASSPTVTLREQNSGAVFRFALPSSEIEAQYFAISREAYGEGACIIFRPDLAAAGLAGYEQNQVWQVTIEGLVAADGATASLEYTVEVISL
GUT_GENOME135531_00384311-573EEKWDYSTLPLKEGKYSEDYLQLVTEYLNIARIGIGLNQLKLNLDMSDAAQKKAALVMYNNINNLGSGHFPEKPSGVTDEYYKKAQSYMNENLYSGDPQTSIINALNDAYGDPVTCGHRYNLLNPYYLEWGVGSVGSGISMGWQGAHKFSGMGSSNVDLVAWPSNGIFPIDFAYNGIGNWTAYFFNGYRSTSDTYVVIKRLNDNSEYIINKDNLSSTKVLSLHGSLVTFRDDAITYTDGDVFEINIYNLNDGSGNLVNYAYRS
GUT_GENOME238248_0050245-362SIADIQAKYATVTTAETRFDTEPSVTSPYALGELSPSFLDSGLTYFNYIRYLAGLPAVELSEEKNTAAQHGAVVMAANNELAHFPTQPEDMDDAFYQAGYAATSTSNLSMRRTSSAGNLQLDVLQSAIAGQMSDSGSNNLSTLGHRRWLLSPRLLYTGFGCADAEFDGTFYRTYVDIPVFDRSGGAVDYNFVAWPASGYMPVQEFAIGDPWSISLNPTKYATPALEDLTVTLTRVVDGKSCVFTSDTCPETPAEDGSYFTIDTRGYGEGAAIIFRPASDDFGSTQYGLGAYTVTVTGLQDRSGNAETLSYRVVFFDIE
GUT_GENOME165701_007391202-1453YSDAEQARVTQLQQGFAALPKPVLNNQIFATVPSYLPFNTGDTTAAFKQQTYAVFNFFRSIWGLPTTTIEPTISNIDQTGSSFMAGYMAGLQHYVDGSENTVKYDKPADMSDADWQKVNDSLKHSNIASLGTITPLHQTIIDFLADNHNLFGNNAGHRLNMLRPSLVQSGVGVGLTANAKATGATGTSNLTLDVWTADASEDTYTQDGGVMFYPGTSVFPIELLPAGTPISAVSYGNTDLRKVTKVTVKNLT
GUT_GENOME103842_00282243-520TFGNPGQPYAQAPSVSAPYSPGVLNDAFLQDGLNSINYIRAVAGLNPVSMSQEKNQNAQVGAMLIAAGEFSHYPSQPAGMPDDLYQQGYAATSSSNIGKGHYSLSDFNLSCADDSDPSNIAILGHRRWLLDPTLTTVGMGYAERCSLTQVIDDTYVRWNGPDYVTWPSEGVFPMELFHGDLAWSCSLSPDKYMLNRESLSVTVTDASGHSWTGRVSEQTDLSADFVLTSGSGYGSAGAVIFRPAGLEYRIGQPLTVTISGLQLVGGGTDTITYTVKLF
GUT_GENOME139799_0254122-321VTCHAETGINVDYHSQQEIRDYLKQKNVDIYAETTYSTAASDRSPYKAGEVSKSSLNSALNTMNAIRYIAGISPVELEDSYTKKEQAAALVNSANGVLSHYPSKPSGMSSSLYQLGASGASSGNLSWTSWKTGLGYHLVKSWMYDGDSSNIDRMGHRRWILNPSMEKTGFGWVYGSHGTYAAMYAFDNWSADTSYYGVAWPAQNMPVEYFGNNYPWSISMGTTVDASKVKVTLVRQSDQKKWTFSKKSADGYFNVENSNYGKTGCIIFRPENMNYQPGDTFTVKITGLSETVSYQVSFFS
GUT_GENOME116665_00617306-550KYNNAKQEYKESVNTFNQNGSSYYETEPTYKTTPLVAGKIKENKIKGAVGYLNSIRVAAGLSKLEYDSSLTESAQYKATLTSYMTTNGISTSNPHYPTKPSEVDQSFYNIAQRNMSFENLYYGNMITSITHAINDGEGDTIACGHRYNLLRPGAVKIGIGCTNGQGAHKLSGYAPYTVNAVAWPSIGITPLEAYTGGYWTCKFYEDYTVTSDTTVNVKCLNNNREWNFTEKSVTGNNRFYIGNNQ
GUT_GENOME097727_00644347-630EIKRFYQSHPFSLRGDIRYEVQPGFSPYIAGRLSREDVEDGLNALNFVRYVAGLDSNVTIDDTYEIYAQTASVLLARNKQLSHFPEKPSDMPDDFYQIGYLGAGASNLSWTSKSDKKLSYGIINQYMDDGDKDNISIVGHRRWCLAPGLKNTGFGSYQNYRALFVFGTENNNNEVPGYIPWPARVMPYDFFYGPWSVQFDGYTYTLNDDVKVSLRTKSGKVMEFSKEHSDGVFFIDSEGSYGFGPAVIFKPTATIKSDDTVTVTITGVKRNGEPVDFQYTVEFF
GUT_GENOME071328_00685150-440RPNTAPPYSLGQVSAAHRQKALDSVNLMRMIGGLPRVYIHDFYNELTQYGAVVLAANDSLTHTPARPAGMDDTFYENGYAATSQSNIAYGWVSGMDSYFNMPRFTLGYMDDSGANNVPVVGHRRWILNPPMLYTGFGFALNGRNAAYSAMYAFNRNGTTPDYDFISWPASGNFPSEFGNMEMPWHVTLNPAKFDISRMNTGNVEVTVTAPDGRKQTINASNTAGNVGDKNQAYFNIDKEGYGIANAIIFRPGTGVFGDDDLRGIYNVTVSGLYTRNGAPAKVSYNIDFFSA
GUT_GENOME105568_009657-317LVAIVAVVECIAFTAQPVQAAKYSKSEAKQVKYFQRKYKNLDKAQYNRNSIYQQAPNFANPFSPGVLNPAYISTTMDYVNYYRDLVGLPSEANPDDANRSAQIGAASLAAVNASASLQAHGLINYLRPNYISENDWGIAENATLGNINFLDDAHSASAGEIVTDLMREDNNIAGAGNIGHRALILSARATRMGVGAAYGRNSNVLYSVQNGWFADDILRQPVVNTMVYPARRVFPYELVGKKTPWSFSTTKRIMDTPKIYITDITAKRRHRATQVRNFGKAFYGDGYATTITYQPGKTKLINTHKYKVKIG
GUT_GENOME157414_0057751-322VANHTQEEIINHIKDSGALITDDTVYKTDYSAEQPYSAGVLDESTLNSGIAMLNNIRYIAGLNYDVALDDGFNEECQAGALINKINGSISHYPQQPSDMSDEMYQLAYKGCNESNIAWGYSTLNKCIVFGWMSDDDDGNISRVGHRAWCLNPTMGKTGFGVVDNYYNMHSFDGSHTSSVKNISWPAQNMPVEYFEKNMPWSIFTGSSETASDVKVTLTRQSDGKVWNFSQDSADGYFNCTNYIQNGTIVFRPGDITGYNDGDVFTVSVTGVK
GUT_GENOME093124_0147243-321EASSLYADTPSVQGTYAAGALAENTQTSALMYVNFLRALAYLPPVSLEGVYSLRAQHGAVLLAANDRLEHDAPRPQDMPEDFYECAHTGTLSSNIAALNWATPDVLLNAMEYFCRDDGEENLPVMGHRRYLLSPLLGKTGLGLAQSQSGISYAVLYVGDTSADSGDWRQVAWPSEGAFPADLMGPLIPWSITLNGEYYDLEASEIRVFLTEKTAGQAKLDYFALNTDAYGAGPCLIFRPDLTAMGLENYQQNQVWTVRVEGLKTPAGAAESLEYQVEMV
GUT_GENOME171681_0067146-341REEIVARYQALELSLGDEAVAFSEAPSLSAPYRAGQVSPGELGESLALLNFIRYVAGVPDDVSIKEEYASLAQHAAALMAANGLLSHTQTRRWDMPQDYFALGARGAMECNLGRGYGDIGASLLNYLQDTDHVNIFHMGHRRWLLDPAMQATGFGFADGYSATYAFDQAREGDLDFDHIAWPAANTPVELCTPRRGNYAFTVVLGQAYAPAELERITVSLSCGEKVWTLSAADCSLEAFQGQGRYLGVDTQGYGPANCVIFSVGSFTGGERVDIAIRGLVKDGKETELSYSVDFFS
GUT_GENOME044179_0021350-339EHSRDVIRQYINGHPFDMTRKTEYAKAPDGYMPFANFGKLSDTDMIEGRNALNVMRFIAGVGVQDVTLCPYATDYAQAAALVLNHKGYLTHTIYNDMPGIDMWTLRCAQVGAKNSNLASGFNNPAHSVVMGYMYDTNEKNISELGHRRWCLNPPMYLTAFGQVGNFGTMWATDSQLTQASNSGICWPAPNTPAEYFPADTAWSFSMCETVPADNIKVTLTRKSDMHVWTFSSEQADGYFSVNNDTSVPLYGCVIFRPDYVGGYSHGDVFNVKIEGLGKEVSYNVNFFALD
GUT_GENOME121384_0147940-281EGGLFAEEPSVSTPYAAGEVRAEALADALAYLNFLRNLAGLAPVELDPALTNIAQQGAVLSAANGFVSHDPPAPADMDAAFAGAARYAASSCNIARLNWTSEDVLRQGILYFARDDGEANLSTLGHRRWLLNPNMAYTGFGLAMDEAGMSYITMYAHDLQADPGDWQYIAWPSAGAFPAELMSKELAWSIILDPDLYDTDGAWVRLTDLNSGEVYEFPGDGGYFAVDAGGYGAGPCLIFRPE
GUT_GENOME143709_0332241-350TKQEISSRWLQYKPMGVYNEYMKQKDIYEVMPKASVPYAPGKLKPEYIADGVNATNVARYLAGLPDDIQPDWELQPQQQAAALVNAANNMLSHYPVQPPGMEETLYKLGEKGTRASNISAGRSTFYESVIEGYMSDSGTSNIDRVGHRRWILNPAMSKTMFGIAYTSEGYPYSAMYALDKGRTQQVKYEYISWPASGYFPEEIFAPNDPWSISLNVEQYDNSRTNEIEVTLIRERDGKRWVFDQRDTDKEGKYFHVDTNYYGIPYNITFRPDGIERFQDDDRFHVKINGIYDKSGQAAVIEYDTIFFDMV
GUT_GENOME155242_00395153-449EIVAKWRELKINLWRQDAYAKEPNKTDGGRLSDGTLKNALKMINFIRYTAGVAADVKLEDEYVYQAQAASYVNAKNNRLDHYPNRPSGISDALWEAGYTGAGMSDIAMGYAGVMAAIQGWMCDSDSGNIASVGHRCHIIKPDLAAVGFGSTQLSGAPYHALRIDYDRKNGKFTDNYICWPAKNMPVELYDSGYNQNYAFSVGLGTSYDKPDLNKLKVTVTSKKQGKTWKLTKKSRNPNGLYLNVSSNTYSWKISNWIVFNTTTFEEGDRVSVQIEGLTKNGESCDPITYDVNFFSVI
GUT_GENOME130695_0158545-359SARTRKDIVEKWLAYDKADAVKIFDENPSVNTPYRTGKLNAGFLNQAELYFNYLRYAAGLPDVALSDTLTDKAQHGAVLLAANGELTHTPSKPADMSDAFYQLGASSTATSNISMRYIFDSRKILTSSLQGCIDDSNSKENMLVVGHRQWMLNPKLLYTGFGYARNAEGYDFAVTQITDKSAADSDYSFISWPAQGEFPNNIITSGTPWSVTLNPQKFQNPELEKITVKVTRLSDGMVWNLTNQDHTDRPDRQEPYLNVTQRGLGAGNCIVFHLGTESFQTDTYSDNFTVEITGLKTKNGSDASLSYATHIFDFQ
GUT_GENOME235038_01242692-974EVRAYYKAHPFNLKRSVTYDETPGTNPYKAGSLSSTSRADGLNALNLVRYIAGLSEVGINAEYEELAMKAAIINAKNGRLSHYPSQPGDMPDDFYVAAKEGAGSSNLAYASFNTNLAYSVINQWMNDGDSSNIKDVGHRRWCLDPNMEETGFGFYKNYSAMYSFDGSGNTANDVNYVAWPAGNTPKSVMTGPWSVSLGRLFARPDEGNIKITVTNGSQKAVISKGQGTMYISTKGYGDGDCIIFQPGVSYSSGDTVQVTISGLRDEKGDELEISYSVNFFDMT
GUT_GENOME100560_0088644-358TFEQINEYFATHPFDTNMKDEYDIEPDIANKAINDQIKAGAIDIKTDKNKREQLIGKLSDKTINNALNAINCIRYVAGAVEMKIDEEKMMCAQAGAALSDYLKVTTHYLPKDKTLEAGIAEPVFNIAYKGSSMSNIVQGGGVAGKIVNSFMPDPGNDGTGLGHRKLILTTANNTVGFGASRNGQNPNVFQYSMGNFNLNHSGVMWPARKQPVDIFNTNSSQNPWSFVVRPNKIKVKEAELKVTLECNGEIEEILPGQMVGNLRRLFYKNEMIVFRPAKQYKPGDKVKVTIEGIYKNSNEDIKAPTTYDVHFFEIG
GUT_GENOME049996_0057987-391MEELRQLYNSLPTYDKLYKTEPVVEGTDYHIAVLSDEAYDTAKGLINYYRRVAGLGDITLSDDVNESAAYGALALAMNNSGLTHYPSQPKDMSSIDYNKAYAATTSSNLSSASGYSEKRIISVAVSGQIGDSDSSNIDTLGHRRWLLNPGVKTLGIGSANNNYNYYTDIRVFGDGIQSESVNDYDFIAWPSSGANLTDTFPKDTPWSVTLNPKVYYTPTDVSVEVKSLRYGTTWYFDNNTGYGTSSIDNYFNIEKSGYGVANCIIFRPAYQYIDMFKGDYIVTVSNVRRKDTGEYTNIQYKVTFD
GUT_GENOME036523_0109251-334AENAAGLNDTLTFAENPVVSGDYSAGKLSDKTLNSAINMFNQVRYIAGISYDVQLDDTYNSLTQTAALVNYVNGELSHYPSKPADMDEDLYNLGAKGAGESNIAWASWKNAGMNQSLVNGWLEDGDPSNVDRLGHRRWVLNPKMKYTGFGAVTGTKGTYSAMYSFDMKNTKASEYGVAWPAQNMPVEYFGTVFPWSLSMGESVDIESVKVKLTRQNDGKVWNFSNEAADGYFNVNNGGYGQKGCIIFRPKTDRKYAAGDVFDVEITGLGDGKDVSYTVNFFELE
GUT_GENOME011023_00077304-548YDQLPLRAGTVDPRILEASVGYINAIRAGAGLPKLALSEYYSNASQHKAVLTVYLSSQHISNPSPHFPPQPEGVPDEFYQLAQAGNGENLYFASYFGADDIIGSLQKALQEAYGDVIACGHRYNLLDPNWENIGLGVCAGQGVHKLNGYQESDVVLVAWPSKGIMLTDLNYSSFRWTARFYNGRYRVTDSTTVEVTNLNSETTWSFTDETAEEYELYRLLGDNQVTFYNAGITYSQGDVFQITLH
GUT_GENOME246481_0064422-307RAMYAGYVAYYSASPYKSAPSVSAPYDAGALTDSALDGALLYVNFLRRLAYLDDGVSLDPLYTLRAQHAAVLLAANDELAHDAACPEDMPRDFYETAHTGTMSSNIACINWMEDSILLTAVEFFARDDGEANLRNLGHRRWLFNPRMSATGFGLANSASGLTYAVMYAHDDSRAVTGWGSVAWPSAGAFPAELMSPDLAWSIVIDPEMYDVDASDISVRISEKDCGEAALDYLRVDTSGYGAGPCVIFVPDLDGMGIEDYQQNQQWSVRVEGLVDINGRSAEISYT
GUT_GENOME100879_00950175-426GKVSKATLKNGIKALNFFRYVAGIPSNVKIKASYQDLAQAAAYLNYNDKSAWISHFPKKPKGMKSAMYKNGYKGASSSNLAAGNYSIKDAICSWMSDSDSSNIDRIGHRRWCLNPTMKYTGFGIDSDIYAMYSFNQSGKAECYGVHWPADNTPVNMFSAYDAWNISMGLPVESRGLKVKLTRLNDNKVWEFDSSPKNVNDNYLNVDNDGYGQTGCIIFRPKSIDEYKAGDTFKVDITCSEFTLSYNVNFFSL
GUT_GENOME096525_0023441-336QRDEVKRMWNHLLELGENMKPYDVEPSIKAPYTTGKVSQAMLEQALLAANLARSLAGLPADLQLDAGLTDLAQHGAVLLAANNTLTHYPAKPADMADAFYQKGSESTSSSNISSGRTNLSQNVLNGYMPDTGSNADRVGHRRWILNPGLQKVGFGLAGSYGTMQVFDRSRTEAVDYSSIAWPAAEFPQQLFQGNYPWSVTLNPKKYKQPSLQDIQVQVTQVQTQKKWSLNAKNNNLNKDLYLNVELSGYGVPNAIIFRLDQVQNYEGLYQVSITGLHTVDGKDANLTYEVEFYDLS
GUT_GENOME243777_00437364-569VNLSVQNVYTQSADFNQFTHPASLSQPVLEDGLAAVNFYRALYGLSPLTLNMEGCESLAYGAVLSFWASEDGDIPEKPDAMPDSFYHTAQLACQSSTVVRQSALSPYPIAQAIHTLMEKSPPTRETLLSPHLTSVSFGAATAEDGSTCVLLSAQTDDTDDEGVTAYPSGGVFPQELADSLWSLSFDSSLLCGVRGMPTVTIQDLDN
GUT_GENOME281910_01207320-581FYNKNGSHYEIEPQYKNLPLTAGKWSDMALQGSTDYINMARVGMGLTPLKLNEEIADCAQHKAALVMYMNSNGMSGGHFPTQPDGVSDEFYNKAQSYMNENLYHGDVQASIVNALNDAYGDPVSCGHRYNLLDPTYTEWGVGAVGSGISYGWQGVHKFASKGTYNSVELVAWPSNGIFPMDMAYNGIGNWTAQFYKNYKVSDKTEVTIKCLNNGKTYEITNENKNNSDKFLKAVNSGLLAFRDDTISYESGDVFEITLHNVT
GUT_GENOME089647_0132450-348TADQIKQKYKDLGLDQKLPADTWDAEPVLSAPYSPGYLSLASQNKGVAVLNLARFIAGIPSNVTIDSSYAEYAQAAALVNCVNNELSHYPAKPDGMSDEVYNKGYAGAQKCNLASGWSSPADTVELYMNDSDDSNIALVGHRRWCLDPLMGQTGFGRVGNYGAMYSMDQSNADGYGYDMVPWPAATMPLEFFGADQAWSVSLNTADLGDDIKVELIRENDGKVWSFSSAGSDGYFNINKGQYYTDWQTGEEKYAGYGYGDAIIFRPDDLGREYQEGDCFTVKITGSTGTKAYAVQFFGM
GUT_GENOME096390_0077281-350YEERPSITAPYTAGKLKDTYILEGIKAVNLARYLAGLPDDVQPDYTLSKQQQAAALLNAVNGQLSHYPSRPGGMNEDLYNLGANAAKNSNIAYGADTFREAVFDLYMADSGESNIDRVGHRRWILNPRMKKTMFGAVVGQSGTMEVPYSNMYSFDASRAAGEVVYDQIAWPSAGQFPLEVWHDKDPWSVSLNPDIYDNQRTSGIKVKMTRKADGKVWSLDSSDQNKGGKYFKVDTGYYAIPFCIIFRPDSSIKYNPEDQFHIEITGLYSK
GUT_GENOME180544_0043343-333SVYAQAPSVRAPYAAGALDEKLLDDALAYLNYLRAVAGLAPVTRSKLYDARCQHGAALLAALDYADHNAPRPDDMDADFYDTAHTATSSSNLAKFNWMRPSILREGLEYFVRDDGDANLVVLGHRQWLLNPKMAETGFGLANSESGMSYVLMYAHDMGNAGAQWETVFWPAEGAFPVELMHANLAWSVTLNAGEYDLVHSDVRVTISEETRGLSFQFNCTAGTGDGFCAVSDAGYGAGPCVIFRPDFTGTDFTDYEQNQRWTVRVEGLRRQSGETASLVYTTEMVSVTPQK
GUT_GENOME236871_01471139-460SAKHTKEEIAQRYTRLQNAIVSEADFMKTAPSMPAREVTTNCVFAGGQMNDASHQFLTELVNYYRYLIGVPEFSQNSVHSDSLQAAAVVRFYNPTINPAFSGVHSQMPSSFFELGTRDSINVFSQILNAGSYSSMAEVGIRYGLINVGIATDSSTKQNYVSNIQTRQELLSYKVQGMNFGYVSNSLVGLATGKNGSATDYAYMFPSAGYFPNEAIDPTTTSWSFELNPNVFGSQNTSNLKVTVQNITKGTDAYECTFANKKLIAGGGDAYAFEAPTDYVLKKYRDDYQVTITGLQNKSGEAVTIKYTVRFFDVNECYQTSVK
GUT_GENOME270331_0021861-370MAEDADSAMNVEYRSKQDIVNYYNEHPFVRDMETTYSEEPDVFAPYNLGAVSQECIDNVTNVVNLARYAAGLSGMVESDSELNKKAQAGALCDAAIGDLTHTPYRPRDMSDELYNLGKEGASSSNIAHASYNLTLPATIRMYLSDSDSGNRKKVGHRRWILYPALYKTGFGQVENYSATHVIGGQRNYSAHEYGVAWPAQNTPVELLSYSGDGRGVSGYPWSVSFGEKPGDNINVTLIDLKTGNSWHFSNTYGDDGEFYVNNSGYGQRGCVIFFPENLSLDRGDQFEVRINNIGIAGYQDSLTYRVNIFS
GUT_GENOME121397_0096039-337DEIVTYAQEHPPGETYFDISGNQLKDYKISYEQEPSLSQPYAEGALALEEQIAALNTVRLIRYIAGISDSLYLFDPYVEMAQSAALINYANGKASHSPKRPAGMDIGLSAMAMEGAQNSNIDHTAWQNNSLKSSILYGWMYDSDSANIKTLGHRRWILNPSLVMTGFGSVTGEKGTFNTMYIRNEGKNLESVRGICWPALNTPISYFDPNSAWSISLGEEVDESSVVVSLIRIRDYKIWTFSSASADGNFYVNNQNYGQKGCIIFKPKNIGEFLPGDRFSVYVSYNDKNIGYEVNFFDL
GUT_GENOME103729_0161845-322SEIKAMMKKLNPRKLATSYTNPPVNTAPYSLGQVSSATLQNGLNTLNMVRYIAGIPYNVQIKTDYQELAQAAALINYVQGYLSHDPAQLPGMSDAMYEKASQGARSSNIGWGYKTLYDEIVFGWMSDKQSNNIDRVGHRRWCLNPTMQYTGFGIDNDYYAMYAFDGSGQPTAYKGVAWPAQNMPLEFFDQAMPWSISMGTPVANASVTLKRLNDGRTWSFSKSSSNGDFYIENSNYGQKGCIIFRPAGIGNYNSGDLFQVTITGTNLSVQYQVNFFSL
GUT_GENOME080367_02191162-395DLNACQAGRPSEAERQLFINTLNEIRALHHLPKVRYDQAYEDQMMQASMLLAVNEKTSHYPDGTWRCFSDIGYQGASTSNLNFVQAYQPLASYGADTHLINWLIEKNSSSIGHRRHILSPFLSKTVYGEVSNTEAKSVSTILGAAMKVVYPYASQTLSTTMPKGVIAYPYHVYPKKFFSKAEPLSLSILVNPENAYANNTVDFSQAKLVVTRRDNQHIQTISNIQYDNISYGLV
GUT_GENOME046609_0040332-331INGNELETFKGRTKEQLFALWQAAEVKEYDSIYETGKEASFSAPYSGGVIKQEVLDNVQSNLNYYRALTGSPQITERFVNQPELQNASVLQAINLRDRENGVGLTHTLWNYPKPDDMDEDFYDSAVHADHNIISTYGYHSSISGFFSESTFIYTAGHRTALLSPYIGSVQMGLGDTTYGRCRETTSASESFTEKFAAYPAPGYFPKQDHAVLSDWDIYLNLDYFKKADDNVTASITDLDTGEVFEYSYDNGNIRTGSVIFLEAPRKDENRYYAHNYRVEIFGLESISGENVSIEYTVNFY
GUT_GENOME282992_02223326-607EETYNADGTLSQQSQEAALGLINFFRALCGLGEVTISQEDNKYAQAGSDYLADTNFESGNPHNAADGSGNEDAIKGLHSSNISMGSHGYSIYDLLLMQFGDDDDQNGTSLGHRRWLLDPDLKTVGFGAAKGSSWKYFVLTYTAVDNPDRENVAFPNNGFFPIEALISSSTMYGAPFNVSLGTDYKVTSPSGMKVTITKDDGTTTEFNYVSSSSNPDFNDNWWTVNTGGYGSGSAIIINPSYSWLGDYQALAGHSFTVTITGLTDKDGNPATITYTINFFSLQ
GUT_GENOME141220_0115730-347PVYAATTVFSQKDLAQIAAIKKTYSQLDQTVYPYDDLYAEPPRFDYPFSAGKIKPQYVDASVDWMNYFREIAGLTAVPATDYLNKKSQIAAASMAAAQVNPNLDQHGLNNASKPYYVPASTWSAAKDTTSSSNLYFHYGQDSPSDTISALIADNFNLNGSNDTGHRAWMLSSRLSAFGIGVAIGSNGWRFANQSIINPSDYTRTPTKNIVTYPGNGVFPIEELTQRATYNNPIPWSAYFASGESIPTTGLSVTIKNDTTGKVGSGAQVGNYNQGFFGGYDAVITFIPKNVTLTAGNQYTVTIKGLGSQYPNGYTYSFK
GUT_GENOME220434_0126347-305DALFAETPSIEPPYAAGALSDEQITSALDTLNALRALAGLPEVARDAALNNIAQHGAALSAARGVITHAPEQSVHMPDDFYALGAQAAARCNLALFNWHEPRLAARAMVFFMQDDTAGNLADLGHRRWMLSPTMGRAGIGLALDEPGRSYIALYVTDDSADFDYDYIAWPCAGAFPAELMNAATPWSISLNPEKYDLAASKPHVTLIEETSGARWEMASGDSIDAGGAYFTLSFGRFGDGPACIFRPNLSEYPDLENGY
GUT_GENOME091126_0131033-320VTKHTKKEIAEYIKETQADIKSDKIGYKEIPKSEPLNYTNQGSLNDTMINSALLKLNQYRYIAGLNEVVEDKTYSEYAQAVAYVDAYASDKSPSGIGEPLYSYYSEGIKYSNYATGCNGVVPAIEAWMKNSDTRQQMMKSSMSETGFGFIYDATVSSGNSYYATMYTGQGDTTVKNVAWPAQITPIAFFDSDDKWTFATGQEENVDKITVTVTRTRDNKKWTFNKSNGLKVSNSASGLKGYLVFQPNLFDGLPSNEQFQVRITGLVQGNVSYSVEFFDADKYSESTGD
GUT_GENOME170766_01377118-405IYMFTPNVTVPYSSGAASAEHYKYALASVNLMRQMGGLPNVTFKDDYNTYAQYGAVLLAASGQFSHTPSCPAGMDSEFYLKGVTGTSRGNISMGSYPSYTMPKFTTGYMQDNSGSNVSTVGHRRWILNPAMGYTGFGYADNKKGMAYSVMFAFDNSKSGVDYDFIAWPSSGNFPNTIMSAREPWSVTLNPAKFKTDASNLNPSKISVTITAPNGVTKTFTNADHNGSILNNTKKSYYNIDTAGYGVNNCIIFRPGTDLFGSNALSGTYTVVVSGLKETMGTPATLCYT
GUT_GENOME246258_0200651-356YKESGRESVKKLSMEELKQLYKAIPEYEKLYKTEPVLTGSGYRPSVLSDEAYAAAQDWINYYRTVAGLGKITLDSPVNESASYGALVMAMNNGLNHFPRQPENMSDEDYQKARQATASSNISLTYSNTDTDILQHVISGQIEDNGMGNIKTLGHRRWLLYPNAKTFGIGSANNESSYYTDVRVFGWGTQREDVDDYEFIAWPSSGANLSDTFSVSTPWSVTLNPDLYYLPVIDGNGRYKNVSVEVQSLRYGTTWTFNEDTAATGWEDEDFFESSFGDYGVDNCIIFRPAYQNLDTFKGDYIVTIRN
GUT_GENOME260243_00019326-591TDYQFKNLPLAAGKLKNGVLKGTLGYLNAVRAGAGLGELTLSEELCDKAQHKAVLTAYMAYNGISNPNPHYPSKPDGVDDEFYNKASCAGGENLYNNSLLGGDFVIGSINNALQEKYGDTIACGHRYNLLDPYWTTVGFGTCNGQGVHEFSGNDSQRNDEVVCWPSKGIMLADAVHESYRWTINFYGGGYTVNDNTSVEIKNLASGKVWKLDKSSEDFRIVGSSELSFFDNTISYTLGDVVEVTVKNLDKNGQNTSYSYRTVFASA
GUT_GENOME016866_0453641-352RSQQEIVQKWNQWMNGNDTSPLYTKTPSTSAPYSAGVVSDAYLEQGLNAANFYRFITGLQGDLVLDPALNRQAQHGAVVVSTGGYLSHSPKQPGDMPKDFFDTGSKSASSSNLYVTSSAKNNVLTKSIQAYMNDSDASNIDRLGHRRWILSPQLQRIGFGLATRSEAGKTYDQYYSAMQVFDKSRTGGTSFNYSLFPNQGAFPIEAFGSTQAWSVQLNTDVFAKPSLSEVQVEMTRTSDQRTWNFNAKNQNSGFPTGYYNVDPADKQWFNQAYFNVETAGYGYGYAIIFRPDDVQLLKNGDTFNIRITGLQK
GUT_GENOME159675_0151940-345ASLPKLSKEEITQLLRENPNTMPDQVFDVTPSVTAPYSIGQVSSQALQRAVNRLNALRRIAGLPSVTLDASLCENAQYGAVLTAANGGLSHSPSKPADMDESFYKQGYSAASSSNLYAGVTLAFAPDGFMDDSDSSNISRVGHRRWQLNPTLGKVGFGYAQSSSGYGRYVTEKVFDRSGSGCDYDFIAWPASGNFPTNTGGFETNTAWSVTLNPQKYAIPSLSQLTITLVRESDGKSWVLSGNESYNAGAGRYLNVNTGNYGVANCIIFRPDNIDRYEGVYTVTIDGLKSKSGSPAAFSYQVDFFD
GUT_GENOME167367_0013917-331SATTSTVTISEALAATFTKDEIQEVHRIQNQYSRLPKQTFNSGNLYASSPHLTAPFSPGSVTNSYINSQLDYINFYRGLFDLPSISTNKTDNDNAQITASVMAAIKANPFTNQHGLPSETRPNYISDTYWTIAKNVSASSNLNFNVSNQSAGDVITDLLTDTYNLDGSDTGHRAWLLSSRLTTTGIGAAYGENSYRYSVQQVAYSSDGYKAAAKSAVAYPNSGVFPIELLQGNNIAWSLYLSDKTTSGIPKITVTDLDTGEVSQATNVNNFSNKAYGYFKTIITYFPGDIKLISGHEYNVNIDNVYQYSFKLFNQ
GUT_GENOME185313_02014272-557MDAGRYQGESPYTQAPNLTPASFAPGALEEGFVQDGLATINYVRAVAGLPADVRCTDELNANAQEGALLLAASEFSHYPHRPDGMEKGLYDAGYAATSSSNIGYGHRTLSGFVLSCTDDSDAGNLDRVGHRRWLLNPDLKLVGLGYVENMTTTKVFDHSRTPSLTPEMVCWPAQGVFPVELADAGLGWSCTPKPGMYDLGASKGLTVTLTREGDGAVFELDLSMDDPQAGAYCHVDTVGYGYGPAIIFRPEVEGYAHGDVYHVKIDNVKRAGGGTTSIEYTVKFFE
GUT_GENOME046592_02426281-585ADWKPVAYTEREKELLSNSRKYYTESVNSVNSMKSYYEIKPSYDSIKNIEGGKLSKEVAKGAVDYLNAIRVGAGLNPLEYSEQLSEDAQCKSTYTVYLAKNNIKNSSPHNPPKVEGLSDEYYSKCQSGNGENLYSCGIISTSIIDSISNALDDSQGAGQYYNRGHRYNLLNPEWKYIGVGNTLQQGCHKLSGTQSYDVDVVAWPSKGITISESGFSPSGMWTCQFYNNLKPTDDTTITIECLNSNKKWKIDPNNLLENQDYNYKRSENLISYSDNSIAFKIGGVYKITYDHLTDINGNETSYSYR
GUT_GENOME073323_00434419-691FSAKSDTYEERPSLSAERAGKLSEASRENALNALNFYRFVAGIPCDVEYDEQLEYYAQAGTTLLTGVGEMTHDPDRPSGVSQEFYEDGYKGTSSSNLGKGYSNIVDALNRGWMNDGDANNIAGVGHRAWCLNVPMGKTGFGHSGAYTAMWAFDESSEDPANPAYVMWPAQTMPVEYFYGPWSVFLENFDYYEYAEEESVKVTMHSRKKGETYELTPADKDVYGEYLNVTAGSIIFQPGVSFSSGDEVEVTISGLKNIYGEDETISYTVDFFSM
GUT_GENOME218245_00762914-1204TPTKIFDQQPSVTAPYATGKLSQSLLNSGLTLLNYFRYVTKLPSVQLDETLTSNAQHGAVLLAAIDELTHYPSKPADMEDAFYKIGAASTSSSNISWNMGYSAADSLYVSLMGCLHDSGTTNISTLGHRRWLLNPTLLNVGFGYAQAVNGSNYSTTQVFDRSGRAVDYDFISFPASGNFPKNLFKNTVPWSVTLNPQRFEAPNVNQLTVTITRQSDGKVWTLDSSTGAPVNNTQACLGVNTQGYGVSNCIIFTPGANSLTSCDGIFTVEITGLKTRDGKDARLLYEVNFFD
GUT_GENOME108279_0061385-396INYRIGEDTTALLDAKGKADRDALAGSLSQKTLNNALNATNFMRVSAGLQKLYIHTNGRIGGYQWRAQAGAALLAELGAITHNVDGNKAAQAGVSTGVFGWAKAGPGGSNCVAGYGIANKMVNSFMPDIGNDRTGLSHRSYILNPGLSGTGFGAANLSGKNKAGDNRGSAVTMFVTYYGSDPDGLTAVIWPAAKQPIETFRVTSSSGDNPYGSNPWSFFINTNDAVVNASDLKITLTCEGKQTDVLDFKQIPAAGTYGSRLLVMPSNPLKQVIIFRPHVAYTSGDKVTVTIEGIEDKSGLPVSVTYDVHFFQ
GUT_GENOME024713_0181895-356AVTYKKAYKTKSPYYEGALSDATLKKALATLNNVRYIVGLNYNVTLDSTYTAQCQAGAYVDYLIDDLTHDPRKPSGMSDALYKKGYAGCSSSNIAMGSSTLNSCILYSWMEDSDSSNIRYVGHRGWCLNPTMGKTGFGVAGIYMSMYAIDKSNNSSIKNVAWPAQTMPVEYFDKDYAWSLFTGQNETESRVKVTLKRKSDGKTWSFSKGKANGDFYVSNDYQKGTIVFRPKGIAGYRAGDSFTVNVTGVNKPVTYTVTFFSL
GUT_GENOME006956_00625244-499EVFKASSMSDAPDIKNCYAGKLGAKFRNEFLSEINTVRTLHGLPSITYDYAHEDEMMQTALILAVNNILTHYPEANTNCYSDIGAVGAKTSSLEMGVRQNKYDYSPAESITNMVHDKLNLFAGDVGHRLWMLNPFLQKSAYGSVNAPSFMNTRFPYVVGTSYKVVYAFNSTTTAPLGVIAYPYHNYPAKYYMAGSILSISVLTDQKNFFANRNVDYAKATVVVTERSSGAKQKISNIRYENIGVPNHIQFNFDDLK
GUT_GENOME125939_0017567-357NYYREHNINLNQGLSFSTVPNATSPYAVGAMSKTSLEQSLATLNFVRFVAGLDANVTLDDEYTQLAQAAALVDAANGQLTHTPARPEGMTSDMYEKGLAGAQASNLYASSGTGSINAAILSWLDDSDEHNISVLGHRRWVLNPYMGKTGFGAVADSGGKGGFRGWGAMYSIDNSNTRASQKNVAWPAENTPMELFASDSAWSLSTGADLDASQVKVTVTRNSDMKTWNLSQGSADGDFYVNNDNYGQKGCIIFRPKGLSVGVGSGENDFYQVSVTGIPNPVSYKVSFFGLS
GUT_GENOME268865_0106566-357AKAINSGKTQNTSYYSVEPSLESPYEAGMLTDDTLTSMGEMTEFYRWLIGVNPLKTSVVTSDELQAGALIRNFQFAHFPSDSSKPSDMSTELWEKGSNCIHNILAIGYTPQGAITGWMNEGYSTYRGSWGTLGHRYSLISYNVGKLGFGYSGSVAIGDVMDESNPKNDIPFYAFPAPGYMPNSVVYPDSSAWSVQPNRNYVKYDSLGDITVTVTNLRTGESYECTLDNEKLFYDSWSGEFAFVQPDDEGDAYYYTDKYSVEATGLKDSEGNPAIIRYTVNFVDVTPKTGDVN
GUT_GENOME230919_0123051-324VYTQQNIYASKPSLKKKFKVGSLKSSYINEQVAYINYYRDLFGLTAVKTSSQGNKNAQTTAAVMAAINANPFVNQHGLPNEKRPSYISKKNWLLAQDVSNSANLNFNASPQTAGDVMTDLLTDRYNLSGSDTGHRAWLLSTRLSKISVGAAYGINGYRYSVNQVLNVGDSARTASREMVAYPNAGVFPLELLNGQNIAWSLYFSNKVVTSTPKITVTDDDTGKTVTASQVANYSEYGFGNFQTVITYYPSKLQLTAGHKYTVRAGNLATYSFKL
GUT_GENOME156121_0217442-352QRSEEDIKAYLSSHPFDVTKPDTWKITPDYKNNIAGELSDASRQNALNTLNVMRYVAGLNEASYDYDTEKYAQASSTLMIGIGGITHNPERKAGVSDELYTYGAYGAKHSNLGAGYDTISEAILYGWMGDGDPSNIDRVGHRRWAISPGLEKTSFGISVGPGRNNEYTSMYCIYTGTDLWNSFFGSSSNGSDVDFLTWPASNMPAEYMYGPWSISLNSDYFSANENDVKVTLTNLKTNKSYYFDKSDKSTTGEFFNINTGGYGSLHQAVIFDAKIKYNAGDRYLVKVEGLKDNHGNAYDPIEYTVDFFSLY
GUT_GENOME034307_0010550-345INVAYHTQDEIRTYIANNGATINDALKFSENPATTTPYSLGKLSDKTLHSALAMLNQVRYIAGISDQVQLDNSYNQLAQAASLANYLNDTLSHEPEKPDGMSDDMYNMALKGASSSNISYASWSGRSLNDVLISGFMCDSDKYNISRVGHRRWVLNPSMKSTGFGAVSGKNGTYSALYAFDRNNSTAGEYGVAWPAQNMPVEYFGADYAWSVSMGYKVDASKIKVTLTRKNDGKKWEFSQGKSDGVFYVDNDYYGQIGCIIFRPSSVKKYNADDAFQVDITGLEQGDVSYTVHFFQ
GUT_GENOME207662_01442240-524YESKATDTIDYYGLNLINPESKEAFASMPNADLNKFNLGAVSNTAQEDALHIVNTARFASGIKNELTVGKKQAQFAQAASMVNRLNLQVDHFPGLPAGANSDLVKYDNATIGAANSNLSSGFNLLDSVLEYLKDDLGDENQAEVGHRRWVLNPQVSQVGLGQADEFNAMFVNNDNYAGENANTVYAYPGETAISEFYSEGSSLSLMFGENFDLTNAQVEVKDLETGEVSKESHVDESFKGNVKAITFGYGMNYQAGTKLQVKVTGVTKNGQDYPVEYTINYMSIR
GUT_GENOME149621_00515467-740HSMDDIKNKYQKTSPTFNYTNSIYVKNPSWKNPYSAGSLKNGVITDTLNRLNYYRWLVGVNEITVNTAKLDRNQKGAVISKANNEISHYPSQPSDMSNEFYKEAYAGCNAGYTQGDIYSGNVSYGEKIPYQAIEGFVDDLNNITIGSKTGHRQSMLDPKATSISFGQCDIYTTASVYYDPNKDISKEKYYAYPSAGYFPKNEMKVGQYWSIYFVGNVSGTVTITFTYKGKKYSAVGLTFESGSNALSFAMPSDLQNLLGGSNKKMPESEISVEI
GUT_GENOME243568_0026535-400RLWRRAGVTLAAVGLAATGLIAGGPGANFVPGQPAVAQAAPGNSESAARAEFEALWDSTDQVTPFSSGTIISSCTPGTAQPEAVQKMLSTWNYLRSLNGLTPVGLPTNYAPQGPAQAAALTAAAGPVASPTPDSVPGATCLNEDVKLASRSGVIARLDGIVTPATEILRYITEASTTNMNDNLGHRLQMFSPIQANAAIGAVSIGTSGPTATSIQLFDTTYQAAGRPAHPSFWNSANPQPSSIAWPAAGYFPSRILPTGAGENVSRWSYSGQCADLRAAEVHVTGPAGEIPLQVIRRSQPGVDPNLTPWEYGGYDTILFKVPLDQLTIPDFYNVSRYTVSISNIRTAPNCQPVPTSTSYQINLFNS
GUT_GENOME001308_0189798-412EEPNSDVAFSTPSDTTSSDTAYTAGVLSDDALAYSQAYINFMRQQAGLDTITLDSTLNANAAQGALILAKLNKGLDHFPSTPDKITDAQAKAGKYAVSTSSLSESPSYGGHYNFPIKAIGSFMDDSDSSNRKMIGHRRWLLNPGVKQLGIGAAGVPIGEWDCWYTAVRVMDNPWNEELYDSSYGTTSTTTRNDYDFIAWPASGDMLSSAFEPGTPWSITLNPAEYSTPSASSVKVTVTRKSDGTVWNFSSGSTPTGGFFNVDTNWYAVPNAIIFDPGSSNVGSSYEGTYHVDVSGITSKSGSAVTLSYDVNYSSP
GUT_GENOME205584_0065256-342KNAYISSLPSYDTSLGEYKEEPSSTVPFKAGALKDEVVNDTIKQLNYFRWQFGLNSINVNTSYLYRNQCGAVAMAANGTISHAPARPAGMDDSFYQSALEGAGAGIGYSGNCASGFSMVSSIKGYIDDINVTNLGHRYSLLNPRAYSTSFGSCGIFSTLSMYVNDGSMKLSEPFYAWPSAGYFPLEAISAISDWSLTISGEYDITDNFKITLTADNGNSYTLTKDNVYICEGYSSVSFALPTALQQYLSGSTNNFLSGKSVSVKVTGLYNTNLGESAEFNYHTKFFP
GUT_GENOME140412_010883-332KYTKITMLTSVALLTLTLATNNHSVYAADFSQEESQQVVDYQKQYQAIDKSTYSNQNMYDSTPTDTDSGKLNSQYIKTSLDYINYYRKLAGLNAETTTDSANNDAQLAAWALAKVNYPVSKDAHGILNTQKPEGMTDSDWDKAQLASFGNITFTHNYDGSSNPETDVNSPFLDNNNIDGTALTGHRDLLLSARASRIGFGAAIGTNKIKYTVQNGVFADDLLKSNIWADVSFPSKELYPIELLQHENTPWSIYLGNTEVVGTPVITITDKDTGKTYTATNVKNLGRFNWAYGYKTTIDYMPTGVELVLGHEYLVKVGNVYSYSFKLFSQL
GUT_GENOME158523_01324356-645SGSTELEKLSQAEIEELLEQAPLRYEGDVFVQTPSVRAPFSSGTVEKDALQAAADRLNALRRIAGLPAVKLDMELSRSAQYGAVIQAAQGGLNHYPDQLPGMSDDFYKEARSASSTSNLSAGRTLVGSVDGWMDDSDASNIDGVGHRRWQLNPVLGKVGFGFALGEGGYGAYAAEKVMDKSGSGCAYDFVSRPASGNFPEELMDGRTAWSVTLNPELYADANKALVTVTLTREEDGRTWTFQSGSSDGFFSVSNAGYGTGCCIIFRPDGVETYDGTYTVQIQGLRARNGQ
GUT_GENOME244140_00831103-378VTDGILPIMEAWNFMRGLNGLNAVTLDTSGAIAPYTQAAAMVSARNGKLSHYPAVEGFACATDDAVRGARHSNLAQSISQTSAETALWYYIDYSNQSNPTNDQLGHRLFMMDPQLSLSSIGAVAGYTAISVRTGEPYPGLPAEAMHNPQAPSPEWMSWPSAGFFPKQLLTSVGETNSGMDLERWSFSVLNADLSTAKATIIDPTGHPVPLTTIHPGERGITFTPRAIADYSTLLMKFPTIERKPVGQENLVYKVKIEGVKNAPKSTYEYQVALFDP
GUT_GENOME143282_0250745-346LAKSQPQKYVTPPTSTTVSKTENTAALNSVNLVRYLTDLPQVKENKNFSNLAQNAAFLMNKNNQLAHQIRVPQGMSSTSKTYKKGAFAGLASNIGVGYYNIQQSILNGYMIDSGGNKTAIGHRKWLLNPSMRSIGFGKVGSYTDTYVFDNKAAVHQQLPGYYNWSKKDLQEPGGWGLAEKNYSNAKIAWPAQRMPIQLMTQDTPFSVSLGKNYTISDRVKITMKTSKKSVTIDSKNLKNGQHYSVSSSGYGYMNAIVWHAKLGDFKFKDNEVYTIKISGLLKNNKATTISYKVRFFDLEDSF
GUT_GENOME103071_00495304-567EMKEMYQRSVDEWNLASSYYQTEPSYTSLPITAGVVDANVLQSSVEYLNLIRYGAGIDPLVLDQTLCDGAQAKATYTMYLSANGISNPTPHYPPKVDGISDEFYSLCQTGRGENLYNGDVLTSITHALDDTSGDPINCGHRYNLLDPTYTAVGLGSTGSGLFPQGVQKFSGYQENTVDAVCWPSDGITPVEAIYVDQFQWTAKLYRYSVTTDTDVSVTCLNTGDEWQFSTQAGNLHRSVSENFLSWDDDNLSVSAGNVYQVTIS
GUT_GENOME009144_0013450-332EFSALKDIEYTKDYSTKKPYDMGDISFDDRIQALNSVNFCRYLAGLPADITLNDSYNETTQAASLVNASNGILTHYPSQPSEMSDELYKLGSNGARSSNIASGFSNITSSVIDGYVADTDASNIDRVGHRRWVLNPAMKQTGFGFVENYTAMYAFDRTRSDTFTGDYVAWPPKNMPNEIYTQSSYGYAFSVSLNSSYEYPSLENITVDLSSKLLNKSWHLDKTSTDMKTNYLTVNNDGYGMNRCIIFNVGQFPENDTVSVKINGLKRNGVPTSISYTVNFFNL
GUT_GENOME046367_01766298-564DIYVESVELYNSGVSNYYVEQPSYDTLPITAGKINENVLKGAVLFLNCIRVGAGLSELQMDEELCQGAQAKATYTAYLSRNGISNPSPHNPPKVEGISDEFYELCQLGYGENLFWGEALSSIYKALDDSAGDPINAGHRYNLLDPSYTNIGIGASTSNSMSSQGVHKFSGNQKSDVDLISWPSKGVTPVGAFYRDTFNWTTVFYSDYSLTDSSSVNVKHLNTGREWNFSDSDENTNSHYFYRTGSQMSFYDSGLSVSEGDVYLVTLK
GUT_GENOME123961_00237281-570TAEEIKSFYAAHPFSTSYRDAWTVAPNAKNGVAGELTEGTVENALNALNFIRYIAGINADVTVDPEYAEKAQAGTTLLTEVGNLEHTPKKPSSVSQEFYDLGYQGTSSSNLGWGYSNLADAVIRGWMNDGDSSNIDRVGHRRWCINPTMTATGFGHSGSYTAMYSFDRKNTTDVSYVTWPAKNMPTEYFCGPWSVSLNRSELKVPDKTAVKVALTKKDGSSVVLDSSCSSKSGKYLNYNAEGFGMGPAIVFQPSVKYSASDVVTVKIEGIQDKYGNDVPLEYTVTFFSMN
GUT_GENOME096270_02306504-810QPTFSGNPFLEAPKVSAPYKPGKVHPGLIEDGLKMTNFVRYLTGIPENIVLDGQLNELNQYGSVLLARIGYLDHFPKKPADMDEEFYRLGYESTSTSNLNGGVKLTNSYVTPQSPGTMAWSVMSFMEDGGDNNVSRVGHRASILTPELQKIGFGFAISDSNIIFNTMNVIKGLNWKVKHNIPYTTWPAQGNFPVEFINERVNGGHHTPWSIHMERYVANRDEVKIELIRNADQKKWTFDKNDNDYNGEYFNVSGKTIVFKPDFEKVPVFKTGDSYTVRVTGIRYKQGSNEKGPKTSLEYEVNFFSLN
GUT_GENOME269580_0182060-320YKGKTYLNGDSSSWYSVASSTKSPYNAGVLTADTHSAMTGMADFYRWLTGANPLKKESSHSASMQAQALDRNFEFNHFISNSSKPSDMSQEMWDEGYDCTHNILAWGYTPQGAIVGWMNEGYDISSGTWGTIGHRMTLINETTSKLTFGYSGTIAVGSSDEFNNDYQGVMTAFPAPGYMPSELISSGSAAWSVSFNPNVLQVADESKVQVIIRDLDNGEIWTRSAAEDTLQSWSDGMVFAQPKISDGRYKDSYSVEITGLT
GUT_GENOME178361_0050929-337VVNAAEDEKPAMNVEYHSKADIINYYNQHPCDFNMKTQYDEEPSVTQPYNPGKISSATAENANEVLNLIRYTAGLSGMTHVNYDYNEYAQAVALCNKINNMLSHYPSRPEGMSDELYNMAREGGSTSNLAYASWNFTLPDSLKMYMDDGSESNVWCVGHRRWLIYPKLSGVGFGQVYNYSATKVINVAEWDYRAEDKEYGVAWPAQNTPIELLKSYDGYPWSISFGKNVPDTVKVRLTDIGTGKVWNFSKDYSDGEFYTDNGGYGQTGCVIFVPDGLKLSKGSNFKVEISDIGIDGYSDNLSYNVSIFS
GUT_GENOME127497_0108352-335PWTQGPWRIGQVEPEYLDAGLQYVNFVRALAGLEPVTLSEHLSIQAQYGAVLLAANDELTHTPAKPAAMGDSFYRMGRQAAERSNLSLRYGYPWETLLQSAVQGHLDEKGEENRRTLGHRRWLLDPRLGAVGFGLASSASGKQYIVLPVSDRSGTGKAPAAVTWPGSGDFPNQVFSPETPFSVSLDPGQLTLPAEAELTATLTRWRDGAVFSVTGGTLPETLEGNAPYLLVNDPGYGLGTCVSFFFGTEAAGERWLGDYTVSLSGLRTRAGEPYELEYTVRFFD
GUT_GENOME080160_00564123-397LNYFRGLNGAEAVKMDTTSALQDYTQQAALNMAANGRLSHTIDKSSKCFSFKAQQGAAISNLSDSAGQTPAEQILWYFIDPSSLGASTMTKVGSNDRLGHRIALMDPTLSTTSYGNVNGYNAIAVTGNQLVSTPDFNNKQIANDAAYKPEVMTWPSAGYFPYQLLTTNRDDQEDVERWSVSFRGESKSRRPDLSGARVRVTGPNGAEVPTTVLNNTLNGKGPGTWNYYNTLLIKMPKIKDLPAGTDGSRDYKVTVTGVKGATRSSYAYTVKLFDP
GUT_GENOME100768_0019846-344RTKEEVTEKYDIAKKDEYYNRGNNDYYEIIPSLVAPYDGGKLKTEVHQAMTDLTNFYRWLAGVNPYENISNHDQNLQNFAVIETLYFNATGSLNHYPGSSNLWSKPNDMSDEFWQSAFAPNNIIAYGSSPQAAIEQWFEEGYNQRQNAFNTTGHRDMLLSYQTTGMTFAYTDRMAIGRQLSGGTMNLPCTAYPAPGPYPNISLNPEETAWSIELNDQQLSYDNINDITIKVTNLTTNESYECTAKNNKLTTVSYGYGFAFAQPKVNTDTYVDSYKIEILGLKDLNKNDKIVTYQTDLFD
GUT_GENOME000021_0121831-317FSTTEIKQVHHFQQEYADLNKTNYDSSTLYSKTPHLSRKFNAGQITDKYADSQLQYINYYRSLFGLPPVSENKTAQKNAQKTAAVMAAINANPFVNQHGLPTETRPAYVSKSMWKVAQDTSETSNLNFNVSNQSAGDVITDLLTDHYNLSGSDTGHRAWILSTRLSSTGVGAAYGTNGYRYSVQKVLNVDDLFRESSQPVVAYPSTGVFPIELTKGKNIAWSLYLSDKGIKNDPKITITDKDLDQTYTATNVKNYSKSGYGNFKSVITYSPGKTPIVAGHEYEVKIG
GUT_GENOME089306_0173224-310EARDELRSAYRAIESLNDDSLFDSEPSIRAPYAPGTLTDEARRNALDTVNFLRVVANLNPVGENALYDLRCAHGAVLLAANDFVAHDPPQPADMPDEFYASACAGTSESNLVGLNWMRPSILTEGIRYFARDDGETNLSVLGHRRWLLNPEMGETGFGLANSASGKSYAVMYALDQSANCDWSAVFWPAAGAFPVEMMHKELAWSIMLNPEKYDLARSSPIVVLTEENSGMTFTFDCRNETGDGYCRVSEESFGAGACLIFRPDFSNTDFTDYCQNQRWTVRVAGLC
GUT_GENOME164212_00371346-594GQLKQSKKVGITNYINAIRVAAGLPKLKISEDAFLVAQHISTLISYRLTELKLPIKHVPEQPDGLSDEYYKIAIGDGKGYTENLGYSATLSSYSTMMYYINLFLDDSTETPQNFSHRAKILDPEYTYWGFGISPYTFSNEFYGYKASNIFLEAWPAEGVTFLETLTNRQFFWTAQFLDKYKITDTTTVNVKCLNTGETWSFNTEEDTSNRKYKRYINAIQSINNKVVFYDNTTDPKPGYVYQVTIGNIQ
GUT_GENOME031048_01794164-451DFFHSRPFSRTQPDTFDEEPDLESETAGKLSGESVENALNTLNFIRYIAGISADVETSDYYENMAMAGAALMTKTGMEHKPKKPAGVSQEFYQLAYEGTSSSNLGQGYRNLSTAITDGWIDDGDTSNIDRIGHRRWCLDPRMQATGFGHAGSYTAMYSFDGTDNGYEDVPEMVLWPALNMPVEYFTGPWSISFDSSQYPLRSSDQSRIKITMTSEKTGKQYTISGKDTNRAGTYMNVETSNYGYGPALIFTPNVRFSAGDNVTVKITGLRNDSGYDGLQYTVHFFSLS
GUT_GENOME122716_01968104-404SKEEIVQFMQAHPTGDIYFDADGKSTGKTHETAYLTVPETSAPYAAGRLAESEEAAALNTIKTIRYIAGISSNLYISEEYSQLAQASALVNYVNGKLTHTPVKPAGMDQTLAQQGYDGSLHSNISWTAWAENSLKWSILSGWMDDSDASNLSLLGHRRWILNPAMSMTGFGSVTGIKGTYQAMYTYDKDNRDSDYTGVCWPAHNMPTSYFSPSSAWSISTGEELDLSGIVVSMVRFSDGKSWTFSSFSADGDFYVNNAGYGQKGCIIFRPASIEEYKDGDRFFVYIAGLEEPISYEVSFFD
GUT_GENOME150924_0032053-352VFSDRTMQEIADQYSSALYAEQMYDNDDSNTWYADKPSVQYPYDPGMISDAAHSSMIAMTNFYRWMSGLDPVEAARYSESELPLQTGALIRNFCWQHIVIDSFKPNDMDDTLWQKGANCNHNILAQDYTPVGAVTAWISEGYDQSIESWKTVGHRAILMDYKLSKMSYGFCDRIAIGTAEMSKKGMDMPFTAYPAPGYMPANIINPINCAWTVTLNNDRFDFDDIETLEVIITNTRTGQQWVRTYDDETMIYSYGLIAFAQPDDYVDMMYTDSYKVDITGINSIEDDMPVQFTYQTDFFD
GUT_GENOME096545_0116669-339ELTSKSVSYSAQPNVKTPPYAAGSLSNETLNDALKVLNFVRYIAGVPSNVTLNSDYNEWAQAASLVSRLNNKLSHDPSQPAGLPNDLYDLGHTGASSSNLGSGYANPASSIVNGYLRDEDASNIDRVGHRRWVLNPPMGQVGFGYVGSYTAMYAFDRTGSSSCTNVAWPAQNMPVELFQSDDPWSLNVGSEVSESSTTVTLTRQRDGKVWNFSSTSKDGYFHVDNGWYGMVGAIIFRPNGISYNSGDSFTVRITGATSSPIEYQVNFFSLE
GUT_GENOME057180_0023335-317VKYRTADEIAEYMRNHTFNYDASEFDALPSYTQTPYEPGKLSDATLENALNALNTVRFIAGLDEVSLSDNYNDLAQAGTLVNAINDELTHSPSQPSGMSDELYEKCLEGTSSSNIGWGYGSLAENIIYGWMSDADSSNISRVGHRRWCLNPSMKQTGFGNTGKYYAMYAFDGVWNETEYYGVSWPAQIMPAELFGNNDPWSISMGYQVDPSNVNVNLVRKNDGKRWSFSQTAADGYFNVENNNYGKKGCIIFRPDNISYSAGDSFEVTITGLDETISYTVEFC
GUT_GENOME078848_0106148-347EIASLLEECSLELPQKLYEEEPSLTAPYQAGKVTTEALQVATDRLNAMRRIAGLPAVQLDLSLSEEAQYGAVLLATSEFSHTPSKPADMDDGFYQKGYAATSSSNIAMGSDLAWSVDLFMYDSDSSNVAALGHRRWQLNPTMGKVGFGVADVKGGYFSVPFVTEKVFDRSGSGCDYDFIAWPSSGYFPVEENNDRGSTAFFSPLNAWSVTVDTSKYGVPQLDSVTVKLTRESDGKVWNFSAATSDINGNFFNVDNQGYGVSNCIVFRPDGIEKYQGLYTVEISGLPGGTLAYQVNFFDVA
GUT_GENOME103741_0101643-355FKGRTKQEVAAKYQNVKIDEYYDYGTNEDYYKEKPSLVAPYAGGELKEEVHQAMSNMVNYYRWLAGVDKYEQRSVHNDRLQAGAVIQNLYCKAEGKLTHNLATDWSKPDNMDQYFWNIGAYANHNIIAYNYTPQGAIEGWFNEGYNIENKSFDTIGHRKALLSYTTTGADFGYAGDIAYGLIKSQGTTDLPCVAYPAPGAYPSNNIDPKVTAWSVELNLDKLSYNSLNDVTIRVTNLTANKSYDCTKANNKLIESDDQEGQLVFVQPTISTSNYEDSYKVEILGLKDDSGNTSVVEYQIDFFDIDTLVPSTVD
GUT_GENOME082964_0218563-350DYVKAHPAGKNDKLTYKKNPSFSGTYNIGEISAATQSSALNMLKQIRYIAGISDEITVSDEYTKLTQATAYINYANDELSHYPDKPKNIPDNLYDLGVQGACQSNIAWASWNGCSLNYTLIDSWMEDADSYNIDRVGHRRWLLNPKMSATGFGAVDGTKGTYSAVYAFDEANTSATEYGVCWPAQNMPTEYFESSYPWSVSTGDTEDISKIQVVLKNVKTGKTWNFSSMKADGSFYVNNDNYGDRGCIIFRPKNIATFKAGDCYSVNITGTARGTVSYKVRFFDLYQK
GUT_GENOME249955_0167856-344NSGNDTYDPEKKSTYYSKPASIENPYDPGVLTNDTLAAMEGMTNFYRYLAGVESLQEKCTQNESLQYQALDRNFYFDHYISNSAKPEDMSDELWEKGYKCDHNIIAKGYTPSGAIYGWMNEGYNLKTKSWDTLGHRYALIAPQYSNIQFGYCGHVTIGKNCESKNPRQTEPFSAYPQAGYMPSNLVEAQKCAWSVQLNAQKVKISDISNVVVKVTNISTNKSYECTLKDGTARLASSFPSSSSVLQFVQPSDATNGRYKDNYKVEITGLTDVATNEEASISYEIKFFDA
GUT_GENOME176043_0037438-325TPAEIKKYFEENPFELDSRNNFVVQPVFTTPYSIGSLDDASLQNGLNALNFVRFTAGLSDVGLNSGYNELAQAAAFANSINGTLTHYPAKPAGMTDDLYKKAAEGASSSNLSYRLPAKNLAHIITLGYMDDSDAGNISALGHRRWCLNPSMGETGFGSAGYYSAMYAFDNSADNAGITSVCWPARNMPLEYFRSSGNSSPAWSVSVGQRLSADDIKVTVTRKYDSRQWVFSSTSADGDFYVDNGYYGEPGCIIFRPDGISFESGDSFAVKISGTPEPIEYTVSFFSLD
GUT_GENOME251997_00903125-417RTLKEVADEYAKTRYAGATYSNSDSSTWYQEPCSTSAPYAAGVLTQDTHTTMTAMTNFYRWLSGLNSLKNSSTHSDSLQVQALVRNFEFSHWVSDSSKPADMSDEMWNAGAPCRHNILARGYTPQGAITGWMNEGYSLRSQSWDTTGHRYALIEASLSDVQYGFSGGIAIGADVASGNTSDLPFSAFPAPGYMPSRLVSPSSSAWSVRINKNTLKIADSTKVTAVITNLNTGNSYECTKENGKLRVSGVEIDMVQPSDYSGSRYTDSYRVQITGLQDAATGSEAQISYTTRFA
GUT_GENOME237487_02321382-634SSENIYSSIADLESFTTPGSVENLVMEDGLKTINFYRKLYGLSEVQLNDELTSNAQCGAVVSLVTEDLSKPSKPNSMSDEFYHKAMLAFDKNCVKINENLTEIPIVNAVHYLFNCNYYFREKVLDGNITKIGFGVCSDVNGKTAVLVKFEDREEYLNEYNFTPYLCEGLYPISLVQANPELTVTLGNSLFAGDRGNPNITVVNEKTGEEIILDPENAFEFIDNDNKTIIIKNIGFKIEKDTSFKIRIGNVYNK
GUT_GENOME152981_0039748-322YYERIEANSSFDVTYEQEPVTTGPNYNAGKLSQSTLQGAVDMLNFMRYIAGIPYDVQLNDSYNQMAQAGALVNAVNDEVTHYPAKPQNMEDSLYQLGYKGCSSSNLFFRSLDFRDVVEGWVSDEYNASGSNPGHRRWCLNPTMTATGFGEVDFYSAMYAFDNHWAPTAYTGVAWPAQQMPTQFFNTVTPWSISFGRDVDPSTTSVTLTRASDGRQWNFDRSLSEEEFLVDNGGYGQAGCVVFQPSGIADYGDGDVYHVVVKEGASIIADYDVNFF
GUT_GENOME024845_0126043-336EQIKSFLNSHPVDNNAFNDSYVFSYQKTPLLASSYEPGALSKKSLTSPLHLIENIRYIAGIPADLKLSDEYTSMAQSASLVSYANDSISHTPKFPTRMSNKIAKSGIRACGESNLAWDSWQNVSIEYSILNIWMKDDSASNIKALGHRRWILSPEMKRTGFGAVSGSRGTYNTMYVFDFGRKVKTNYQIAWPAQHTPTSYFPEETPWSLSLGKILNEQKIKVTLTRLSDGKVWNFSKKNSDGKFYVNNDAYGQKGCIIFQPSNLHACHSGDVFRVEVTGAGKTIRYNVNFFNEK
GUT_GENOME114602_00141290-543LAQVREIYAKSVEDYNNAGDYFEVHPQDETLPLVPGVINEKKLEGAVGYINAIRVGGGLPALTHSPALSEGCQYKAILTSYLAKNNISNPSPHFPPQVAGISDEYYQKAQMGGAENLYHGHIISSITNALNDAYGDPITCGHRYNLLDPNLQYIGLGSTEVENQLSIGIQGVHKLSGRQASDAEIVGWPSKGIMLNEAGAGTNTMYTAKFVRNYSVTQNTGVIFRCLNTGDTYTFEAGQENTRNHEYHINANGE
GUT_GENOME000021_0024011-333MLVALVSCTFFLAPTQVNAANYSKSEAKQVRYFQNRYKKLSKKVYDENNLYTVEPNFAEPFNPGQLKAAYITTSMGYVNYYRKLCGLPSESNNDKVNRDAQIGASALAAINAKTSLTAHGLLGYTRPYYISEDEWNTAEDATLGNINFLESDTGASAGEIVTDLIKDDNNLDGSGNTGHRALILSARATRMGIGAAYGSSNKKFYSVQNGVFADDILRPAVKDMVAFPSSGVFPYELLDSDTPWSIYFAKRRLTRQPKIYVTDLTTNKKKRAIHVRNYGTDYFGAGYTSAISFDPNISLINTHKYKVEIRGVTSYSFRLFRQK
GUT_GENOME044231_0123549-337AGSNMAQDPSTQAPYALGQVSDAALNTALGRFNALRRLAGLGSVSLDSELDRQSQYGAVLLAATGQLSHHPDQPADMDDEFYGWAHSATSRCNIYQGSGATLPQAVEAFFDDSDAANVPMLGHRRWMLNPTMAKTGLGFAPSRYTYGATPYHFATLWAFDRSAPAQDYDFIAWPASGNFPASLFAGDQAWSVTVNPDKYAVPDYEALTVTLSGGGRSWSFDHTGVYAPSDTGAYFGLNTGNYGVSNAIIFRPDMEGGSYAGTYTVTIQGLKDQNGGDVPLTYQVDFFDP
GUT_GENOME185591_0067795-399LNTLRAFNNIGATKLSPNETIHREAQEGSIAMTTASGISHGLKPETQKGNSRFRPGPNGTWPCSTAGGVRSTLSGLLSWTTWQSATMAYHADNLAADSGVPSLGHRMYMFSPNLVYTAVGASRSSSARSSSVNQVVLSYAHGSRYADMSYETGSITQPDSKVKRPATIEWPAAGYFPYSLVSKDGEWSLSVLSSEAKLLEGAQVEITAPNGTVTRPTARTSGISESSGYQSVAFAAPSLVAKPAANTVATYTVKVTAKGKSWTYPVRLFSSLSGDALSSSADKTAPVFSMPATSKVAYGSKFNPL
GUT_GENOME000271_02663246-532TAPSSVYASVPSVEAMEQGSLTEEFLQDGLNAVNYVRAIAGLSPVSLSQEKNDLAQAGALLLAAGEFSHTPSQPAGMPDDLYKKGYQATSTSNIGQGYQDLWEFTLSCANDNGVSSLGHRRWLLAPGLTTIGMGYVERSATTVVVDGFAGTNGPDLVTWPSEGVFPAGLVDRYTVWSCTPNPDKYDVSGSSLTITVTDNHGGRCVLGESTSASSRLFPGVGLIWLPPMDQDTSGVTVEAGNYGGGPAILFTPSQVSLEPFTVLTVEIAGLKLVGGGTDTITYTVKLF
GUT_GENOME096269_0344254-355WMKPGSTELPIFVDTPSTSLPYAAGSLHGDYVQQGLNAANFYRFISGLDGNLVLDPSLNKQAQHGAVLIAISGQLTHEPKQPAGMPNDFYELGYQSAGTSNLYYAYGQTGNFLVNSIVAYMNDSDKFNIDRVGHRRWILSPQLQKVGFGLATIHNSAKNQTEYYSVMQIFDKSKPHPADYNYSLFPNKGSFPIEAFGADQAWSIQLNPQHFQKPQLSDVQVQVTRLADQKTWQLNHTTQQNGQAYFNVDTNQFGYGYAVIFRPDGIEQLADGDQFQVTVTGLKKSDGTSAEISYQTNFFNVG