UHGP-MC 61606


Information


Number of sequences (UHGP-50):
116
Average sequence length:
334±38 aa
Average transmembrane regions:
0.03
Low complexity (%):
4.49
Coiled coils (%):
0
Disordered domains (%):
2.84

Pfam dominant architecture:
PF01661
Pfam % dominant architecture:
7931
Pfam overlap:
0.32
Pfam overlap type:
extended

Downloads

Seeds:
MC61606.fasta
Seeds (0.60 cdhit):
MC61606_cdhit.fasta
MSA:
MC61606_msa.fasta
HMM model:
MC61606.hmm

Sequences list (filtered 60 P.I.)

Protein Range AA
GUT_GENOME237530_0008220-248RDEDGEALRRCYQTMFSTDSLEHRIHIFLPAWKDQNHALEIAMDETKKFLMDHDDMVIVHVPVPSAISLSKDLTDLLAYVDAQPYVRAVKLDTVFGSVPDSSAFIGDARSLDAFVQERQRTFQQRLFELIDASGMDDVTVYKRANVDRKVFSRIRCNIDYQPKKKTAVAFAIALKLDMETTQDLLSRAGYTLSPSSTFDRIITYCIENGIYDMFEINAALYRYNQPQLG
GUT_GENOME068621_0074628-356MPLQIVRNDITKMQVDAIVNAANETLLGGGGVDGCIHRAAGPELLAECRKLGGCKVGEGKMTGAYRLPCQYVIHTVGPVWNGGSHGEREQLASCYRASLALAKEHECETVAFPLISSGIFGYPKDQALRVAVDTIGEFLLHNDMTVYLVIFDRNAYQISGKLFAEIAEYIDDHYADAHTDSLREDRRRKSVLQSMKPVFYGSVQAPTDTGALSDLLSHLDAGFSETLLQLIDRSGKKDAEIYKKANVDRKLFSKIRNSPGYRPSKSTAIAFAIALELNLDETRDLIARAGYALSPSSKFDVIIEYFIRQKKYDIFEINEALFAFDQSLL
GUT_GENOME085627_001632-325EFHLINADITTLNVDAIVNAANNDLLPGGGVCGVIYSKAGYDELYKSCLEIGHCNTGESVITPGFNLPSKYIIHTVGPVYRDGNHNESKLLESCYLNSLDLAKQNDIHSIAFPLISSGIYGYPKYPKKEAYEIAYKAISNYLTKEDYDLKVYLCILDNSFLISIEERSKLKEYISNNLIGVGHRDRCFFSIKERNLAPRLAAFPTNESFKKEIQDNLEKSFSDYLLKLIDERNLKDSDVYKKAGIDRKLFSKIRRTDYQPSKDTALKLCIALNLNTDDTKDLLGKAGYALSQSIERDLIIQYCITNKIFDLMKINNYLFSFNQK
GUT_GENOME265263_016225-377IVRNDITKMRCDAIVNTANTKPIIGSGCDYAVYKAAGKRRLSEYREKNIGEVPEGDVFLTPGFRLPAHYIIHAVSPHYIDGVHGEEEKLRDCYRKSLSLAWEQQCRSIAFPVISTGSFGYPKEEGIRIAADEIQAFLQKHEMLIYLVVFDAQSARYGKRFDKDLQAYIDENYVGAKHHEEYSIQTMALRRDLDDAICQPGETKQDDAPCQPGEAELDDESFSPYEDDLYKDNLYEDDLYEDDLYEEDSDEDSEEENEFTELHESKLEERMKHLSDSFSEYLLYLIEEKGMTNADVYKRALVDKKTFSKIKNHADYHPQKMTAMCLCMGAKLNLDETRDLLTRAGYALSPCDKTDIIFSYFIENQIYDMIELDI
GUT_GENOME017737_018733-356IQIMRNDITKVTCDAIVNAANTSLLGGGGVDGAIHRAAGKGLLLECIKLGGCRVGQAKITGAYRLPCKYIIHTVGPRWHGGQRGEKELLESCYRESLKLARENHCESVAFPLISSGAYGYPKAQALRVAMDVISEFLADNDMQVYIVVFDREAFQISSKLFDDIQEYIDHTYVEQHTNTNLERMRSARVGVCEDDIWEEEEQILFPDVRKEDTAYVPLSKPTEPAMPYAAAAPSLSEALKEIDESFAQMLLRKIDEKGLTDAQCYKKANVDRKLFSKIRSNTQYKPSKATAIAFAIALELSLEETKDMLMKAGYALSHSNKFDIIIEYFIQNRNYNIFQINEVLFAFDQSLLGV
GUT_GENOME262835_010701-417MPFTIERNDLASVSADAIVVAANEDLQITGGVGEAVAKAAGFTNVQEACNAIGHCPTGSAVATPAFDLPAKVIIHAVGPIWQGGSQNEVALLRDAYDAALSCAAENNAHSIALPLLSAGIYGFPADISLSVAQNAIHDYLDSHDTEIRLVLFDRTALQAGLSSYDRIAEYIDDVYVDKQLDARKKARRTENFSDAFFTAAGTAYGALSAPPSAVSPSAASPSAAILSSLFSSHKETEKEASEDLENACAEDSTEAVCKEALSSNESVNRAEDEREAVYTKNTVDAAFTESLVCETPGKSSSLSELLNSLDASFSTTLLALIDTKGMTDAQVYKRANMSRQLFSKIRSDAFYKPTKKTVLALAIALELDLGTTEDLLRRAGFALSHSSKADIIIEYFIKNNQFDIFEINATLYAFDQP
GUT_GENOME283379_000411-335MPFAMVRDRLTELAVDAVVNPATPSLRRGGGVCAELFEAAGSAKLAAACAAIGCCETGGAVVTKGYGLPAKYIIHTVGPVWRGGSEGEKELLASCYRSCLELAFRRGFSDVALPLVSVGGYGFPEELALEVAVGVIRDFLNGHELRVLLSLPDGAPGPARPGDRFRELDGYLELNMGREPDPSGIGFAAGAPEPSAAPRQYGSSAPLGAFAVAGKGRRSLADLLMNMDESFSRMLLRLIDEKGFTDTEVYKRANMDRKHFSKLRKDGYVPGKPTVLALAVALRLNLDETRDFLGRAGYALSHGSRSDLIIEYYISEGVFDIFEINEALFHFGEKP
GUT_GENOME282422_010771-361MPFSIVRSDIACLDVDAVVNAANEGLIAGGGVCGAIFSAAGHEQLQKACRRIAPCPTGGAMVTRGFDLPARWIIHAVGPRWIDGHHGEEGLLRRAYRSALDQAARLQVASVAFPLISAGIYGYPPAAALDVACEEIEYFLSDRAGTAAEDMHVVLAIYDRRAFAASLDCYDEVALCIQDAQGEESFFACDSAVCAETAEKSCDLGFGPGFDLDLDLHALAAPAAAPRRQARACAPRAIDETEIAELLERLDASFSQTVLSLIDARGLTDVEVYKRANLSRQLFSKIRRDQGYKPTKATAVALAIALELSLDECNDLLERAGFSLSTSSRFDVIVRYFIERECYDIYKLNAMLFAFDQPLLG
GUT_GENOME138794_010201-343MPFTIIRNDIAKVKADAIVNTANPHPQIGAGTDKAVYEAAGAERLLAERKKIGDIPVGKAAVTSAFDLDAKYIIHTVGPFWTDGNLGEQAALRSCYHESLKLAQDLGCESIAFPLISTGVYGFPKDLAIKTATSVIYDFLMENDMMVYLVVFDRKAYDLSGKLFQDIHAYIDENYVSAREDAEYRDSDRRLSNRAEAPQAIRHRRSREKLKETEALPALSSSHNIGEHLKQPDISFREYLLQLINEQDLKNSEVYHGANISRQHFSKIMSNQNYHPSKNTACALAISLHLDLPTTNALLEKAGMVLSGSSRFDLAVQYFILHGMYNIIDDNIILYENGLELLG
GUT_GENOME116298_027323-341LQIIHRDITTMDCDAIVNPTDNCFSGSGGTDYEIHTAAGLGLRYECDRLEPLSFGEVAATGGYKLPCRYIIHTMGPVWTGGQRCESVLLRSCYMNALFKASELGAKSIAFPLISSGTFGFPKDKVLRIAIDAISDYIKSSDKEIDVTVCVHDREAFVLSKEIALKEYIDENTNPRFLYAEHTLRMQPECSTVAHAKEEAFDAYESCCLSCIPDAAPSTEDLSAWIKKQDDTFAVMLLKLIDKKGMTDVQCYKKANVSKGTFWKINNDPKYKPSKATVLAFAIALELSLDETELLLKTVGLSLSHSNVFDLIIEFYITNGNYDIYEINAALFKYDQVCLG
GUT_GENOME053971_012281-301MPFYFKRGDLVATECDAIINASNVNLKMVEGVGRAIFHKAGDKELTNACKQIGHCDVGAAVMTPSFNLTNAKAIIHAVGPIYINGKHGEEKNLRKVYRSVFKIALENNFKTLAFPLLSGEFNYPLRECYEIAEDEIKNFLKDHKDFKIYMILFKNFPENVSEDEQTRLSKFILNNFKTNANVGLKTIDKSNESFVNVIREIQKEKNISDEDLAYKSNFTPEDLEKILTLQNILPTKNICFALSVGLEANKEEFIRIVNSIGFDSIKDDMTGLVTAYYLDKKDYDVFKINRVLFNYDVKPLG
GUT_GENOME226215_031283-378FQIVRNDITKMHVDAIVNTANPMPGYGAGIDSAVYEAAGKEKLLAKRHEIGAIDRGTSVITDGYNLPAKFIIHTVGTAWQGGKAGEEDIIRGCYRSVFDVAVNNDIASLAIPLLASGSYGFPTGIALRIALSEIEAFMSESDMEVYLVVFDEKSVSLSSELYGDIDEYINDNYVEEKNQEEYPDSYGRNERVRRGTEFVAGGFASAASYVPGLFKANKAEKKKRKEAELYEERAEDLEELSSIDMCLSAPQFDEERSLEDVVKNLDKTFMELVFSFADAKGLTDVEVQKKANLDRKAFSKLKCGTTKNPSKATALALAVALELDLDDTKDLLSRAGLALSPCSKQDVIVQYFIEKEAYDIYEINVALFEHGEQLLG
GUT_GENOME040390_011061-298MALAIRLDDITHMHCDAIVTATDTALSGTCGVDQDIHRAAGPELDAACRALAPCPVASAVCTDGYRLSRKIIHAVGPTWHNDPADLQTLAACYRAVFACAREARCASLAMPLISAGRLGFPPDDVLRIAVQEARAFLERCSIDIFIVTHRPSTFNLGVRMFAGSLERIHDKKMRFAAEHGTLSLDEVLGQQHESFRDMLLREIAARGMTNAACYQKAHVSKSVFSNIVKNGTIPKKPNVAALVFALELPLEDALTLFGKAGYYLTNTDPFDTILEYFIEQGNYDLFALNEALYQRGQP
GUT_GENOME041218_0420014-247LIIKFGDLTSAVTDVIVSSDDAYLSMGGGVSASILRAGGDVIARDARKNVPCQMGDVVVTSAGKLEAKYVFHAITIDWSQKDEFTVEKSINSIIKKSLNVLSVLGLKSIAFPAIGTGAARYSLEDVAHFMSMAISEFLSNSDEELEIYIYLMDRYGRRTAIDYIVFFEQFYRRMFTSGVDIAEVNPAETEAHAKQWDMKSMRLNHLNSYLIKLEEQRMNFEMKLIEAIEKNDNE
GUT_GENOME219702_010271-317MPFCITRNDITKVTADAIVNTANPQARIGGGTDSAIYQAAGADALLAERRKLGDIPVGQAAATSAFGLRAKYIIHTVGPVWQGGIHGEEAALRSCYEQSLYLARKLGCESVAFPLISTGVYNFPRDKAMRIATAAIYDFLMEQEMLVELVVFDRSAYELSGRLFPNVQSYIDEQYAADQAAQEYRRRVREERFSRAGSAPTLSGALHCTQSTFQEYLLELIRQRNLKNSDVYHGANLSKQHFSKMLSNRNYKPTKNTVCALALALHLDHAGADALLAKAGLLLADNIRFDLAVRYFLDNQMYNIVEDNIALYENGLE
GUT_GENOME228323_018817-379VQGDITKITDVEAIVKAANNSLLGGGGVDGAIHRAAGSELLAECRTLHGCKTGEAKITKSYNLPCDYVIHTVGPIWNGGRSQEEELLARCYYNSMKIALEHGIRKIAFPSISTGVYAFPMELAAKVAVNTVARFLSEYPDDFDMVEWVLFDSHTESVYENEVDKLYRRKNGRIMQQNAFYRKLEDIRKKINLYMAQKISDNVVEKCARSDEIENFDECEMLSAKEMPPEEDILAAEPRQTEPDESVSIKIPEFLCKDKRSAKLDKFLENKDESFSEMLLRLIDEKGLKDSEVYKQANIDRRLFSKIRGDEEYVPSKKTVISFCLALQLGIDESNRLLATAGYTLSSSSRFDLIIMYLLENKEYNIQFANIVLD
GUT_GENOME222413_009581-311MPIQIIKDPVATLDVDVVVNAANRNLRQGTGLSDALFKAAGAAEMRAACEKLGSCAPGHAVITPGFRLPAQYVVHAVGPIWVNGRQHEGELLAACYRESLSLAAAQKARSIAFPLLSTGPYGYPREKAFRIAMTELAAFLEKREMTVYLSVYDPRTTRPDAETWMALSHYIAHPESRAPEESVGDELPSLDEMLFEPIAPEPVVKPVPKAPFIAQVMALARKKGLSESALSRRANMTLSYLSALRNRPSELPDRDAAMALCVALSLTEEETRALLISAGYFLLPSDRRDAIISFFIGRGADIYALDAALYA
GUT_GENOME236536_006903-371FQIIRNDITKMKVDAIVNAACPTLLGGGGVDGAIHQAAGPELLAECRTLHGCQTGDAKITKGYHLPCRWVIHTVGPVWHGGSHGERELLASCYRKSLALAVEYRLESIAFPLISSGAYGYPKEEALEVAVDTIRAFLKDHDLDVYLVIFQKSDYQLSKGLVEDISSFLNEAWKGKKDAHAREEERLDSVSAELESMYLEDRYPEWGLPPCPWEESRRLKKPGIQGRIERVELQIERVNYNETHLVDFFVDSKREEPFFSELEESFSHMVLRKIDEKGMTDPECYKKANLDRKLFSKIRSHEDYQPSRATALALGIALELPLSELQELLGKAGYTLSHARKGDLIVEYFVKKGIYDIYQINQVLFDFDEP
GUT_GENOME222285_011831-394MPLQIVRNDITKMSVDAIVNAANSSLLGGGGVDGCIHRAAGPELLAECRTLGGCETGSAKITKGYRLPCRYVIHAVGPIWRDGKHQEKALLESCYRMSLQLAKEYACRSVAFPLISSGVYGYPKDQALKVAVDVISSFLLENEMMVYIVIFDRAAYQISGKLFADIAAYIDDRYVNEHTDSREEQRRRRGSVAELYPGKAMTLEERVASWRGEASEDGESMVAEPGPDACIREALEAPEPPMVAEPESDACICESPAEPESPMMAEASLKSCRSLSLEEALGEIDESFSEMLLRKIDESGMTDAQCYKKANIDRKLFSKIRSDKFYKPSKPTVLAFALALKPPFDELQEMLAKAGFTLSHSSKFDIIVEYFVERGNYNVYEINEALFAFDQSLI
GUT_GENOME225144_006731-255MSFRVVLGDITKIKTTAIVNAANCSLLGGGGVDGAIHRAAGPQLLEECRKLHGCRTGEAKLTKGYRLPAKYIIHTPGPIWRGGNDGEAALLRSCYENCLKTAMKNGIRDIAFPSISTGIYAYPLEEAAKIAVNTIYKFPELEVQMVCFDERTKAVYERANAVYRKKYDCKEKNEIVYIEYQDLPMKRAFENEELRLGMMQIAKETEQKHERILHLLQAETTFRQYVHLKGREGLSDRICEKIYAEMEKENERNPV
GUT_GENOME177855_031731-367MPFQIIRNDITKVKADAIVNTANPEPKYAAGTDGAIYKAAGADKLLAERKKIGRIKFGDVAVTPAYNLKAKYIIHTVAPAWNGGNKGEFEVLKSCYSRSLTKAAELGCESIAFPLIATGVYMFPKAEALRIAMDEISNFLMREDVDMTVKLVVFDDKAFRLSRNLFFQVESFINDDEVIAAHKEEYGLPDREFERERARFRREREEVDYYASFSLCENSIPMEPVELETPHGKTRKITGSKPFNESTFDKNLYMSDGKEELSFQQHLLKLIIEKDLDNTTVYKSSNVTKGAFSKILCGDTKKPQKKTVLGLCIGLRLTLEEAEELLASADMAFNPYNKRDKLVIQCITHGQYDIDEVNAMLFVCEQP
GUT_GENOME025825_007761-369MPLTLVRQDITRMRVDAIVDPTDEGLHADGGVSAAIFRAAGALCMEAACREAGHVECGGAVATPGFDLPSRWVIHTAGPRWPEGAVKAGHLGYLRRLLARCYKSSLDLAASLGATSVAFPLISSGAFGCPRDLAIEDAVSAIRSWLDAAPDRHADMDVYLALFDRESTREARVAYPGLEEYIDQRYVDEAERRDHRRRPKRENRWHADATSGGASFSAGDLPSEVFDEPVAASAPAELGAACPGDAAAGDELDDLLASLDASFSETLLSMIDARGLTDAQVYGRANLTRQYFSKLRSGRLNPSKRVVLALAVALELDLEQTQMLLARAGYALAHNSKFDVIVEYFISRRVYDVDRINYELFCHDQQLLG
GUT_GENOME117586_010441-359MPLKIVRNNIINMEADAIVNTANYLPIVGSGVDSTIYEAAGFRQLLDERQKIGEIPYGEVAVTPAFNLKAKYILHAITPFWRGGGNGEIELLRSCYENIMRLALQFNCESVAILLLVTGNNAYPKGLSLRMATEVITSYLDKYDIMVYLVVYDKASFTVAGKLFADIESMLDESDFEADYFDEECFENFIANDYKAKRCLLYECNDKICYSLTCDECVVEKKSLLSTALKDASKKKMSGAELDRELARLLANKEAKTFSEKLFQLIAEKEMEDVDVYKNANLDRKLFSKLRHKNYKPSKNTVLALAFSLELTLEETEELLKYAGLALAPNKRFDLIVKFCLKNNVHNIYDVNAALFKYT
GUT_GENOME138227_025774-326MSFKIIRDDITKVKADAIVNTANPKPRIGTGTDHAVNLSAGPQLMEARNKIGVIQPGEVAVTPAFNLNAKYVIHAVGPVWQGGTNGEEAVLQRAYVNALEAAVARHCESVAFPLMATGTNGFPKEKGLSIAIMTLTSFCLSHDIQVYLVVFDDAVFDLAKRIFDNVQSFVDKDYVDTLYEREYSIQGCSASYHDLSERVSSLPEWNADAEKGTRVKRILNQPDISFHTMLFQLIEATGKKDSDVYSRAEFTRQQFSQKIRSNPSYHPSKNAAIRLAIGLQLDYENTQSFLKKVGYFLSEELPADKIVADCIRRGEYRLPVINV
GUT_GENOME262211_010141-398MPLYFMHRDITTIPADAIVNAANPTLLGGGGVDGAIHRAAGPSLLAECRPLSPCPTGEARITGAGNLPCRYVIHTVGPIWTGGKNGEALLLRRAYESSLTLAHKHGCRTVAFPLLSSGAYGYPKAEALAIAVAAIRNFLLASHDDLTVYLVLFDQDSLQASHENLSELAVYLRTADALSADQEGNESGKNLPPLPANQKYRHGDSCTPKDSAAAFSFSQRTTERAENSLPEQTVPDKDDFDFYDEEADADTDYDEAEYYDKDEGYCEEYCASPTCTEETLRAVLANREETFSEMLLRKIDESGMTDTECYKRANIDRKLFSKIRSRTDYTPSKPTILAFAIALCLPLEETQRLLRTAGFTLSGSNKADLIITYFLSQGIYDIFTINEALFAFGQPQLG
GUT_GENOME158633_011534-306FLVKGDITTMKVDAIVNAANRSLLGGGGVDGAIHRAAGPGLLMECRTLGGCATGEAKITKGYRLPANYVIHTVGPVWQGGNCNERELLIQAYRSSLELAAENGCRTVAFPLISSGVYGYPKDEALRVATDTVEDFLRSCDMTVFLVIFQASDYDIGRALFDDISAEVACCKTERPAPEHGLDGGCTGPCPSGDAFCRMLLQRLAASGMSHDLFCQKANLTKELFASFSCAPCPRPSKPVAMACAVALELPEGDIAALLASAGYGFSDTPVFDRIVRYFISRGVYRIVEINKALFAFGEPQLGG
GUT_GENOME155726_012174-361QIIRNDITKVKADAIVNTANPNVVVGSGTDSAIYHAAGEKELLAERKKIGIMMPGEVAYTPAFNLDAKYIIHTIGPSWIDGNHNEREILHSCYEKSLNLAAELECKSIAFPLIATGVYGFPKDEALQIALAEINKFLLSHDMRVILVVFDRKAFELSGKLVDDIEEYIDEHSADEIWDAEYGGRYGNTRRERLERLESLHSAIMPDTYDEDEPEEDESESRIIYAEAAAPSFPDVSGMSLDEVLDSSEDTFQQRLFKLIDESGMDDVTVYKKANIDRKVFSRIRCKEDYKPKKKTAVAFAIALQLDMPTMLDLLSRAEIAFSPSNKFDLIITYFITNKVYDIYEINAALFKYGQPILG
GUT_GENOME236890_014451-382MAFKILRKKITRSGSEAIVNGVLDFPLFKTEVDYEVFKKAGRKKLFSAFKNLHSFKYGKAVCTSSYKLKEKGIKYIFHTRLPYYFENEKEAVSLLKKCYRSSLNLAKEKGCKSVAIPLLGSKFSDFPEEIDIQIAKDEISRFVSKNEIDVNLVISQKTTSLISAKLFNDVQKFIRKNLKSQKKHFPCDPWFLFRKKDSSIDKLKKKRNSKKDSGDYFTILPETLEIVEKGGKENRVSPVKYGILSSENMSTVGASTPFCKRGNSSGIEVFLNQALENLNFQDTLQKFIASRNLDNSSVYKKALLDRRFFSKIISKKNYIPKKRTVMALGLSLELDLNEYEELLASAGYSFMPSSKFDMIVKYCVINRIYNLIEINLILDSYG
GUT_GENOME128889_007115-334IVRNDITKMETDAIVNAANSRLLMGGGVCGAIFNAAGAEKMQSACDKIGFCGLGEAVITKGFNLKAKYVIHTVGPIYKAGNTVQEQQLYNAYASSLKLAKSKRIKSIAFPLISSGIFGYPKSEAITIARRAIRDFLKDNDMDIYLVVYDRDAFEISKGLFDRIKSYIAETLVLPQSYDRRIDSRSIPIERSVYESEMMFPQMTANNASRRLNDLLKNLDETFSQMLLRLIDERNLKDSEVYKKANIDRKLFSKIRNDKNYRPKKSTVLSFAIALGLSLDETKDILGKAGFALSPSSRADVIVEYFIENENYDIFEINEALFAFDEPLLGV
GUT_GENOME237425_009981-356MPLKIIRQDITKIECDAIVNPSNRHLYPGGGADLSIHEAAGPELLEYCEKLGGCEVGKAKITPAFKLPCKYVIHTAGPNWHRLDNSEEMLVSCYKECLKLAKENNCETVAFPLIASGAYGFPRQSVLKIATNVISDFLFENEMLIYLVVYDKKAFEISEKLFCDVASYIDDTYVEQNAEHQIVEGCYSNYPIGTLRRDELAERRKRVLEERAEKRMVCESRTLAIDMEECYCKESTLEDMLKQMDKGFADTLFYYIDKKGVTDVEAYKLSNVNKKTFSKIKCDPNYKPSKLTAISFAIGLKLDLDETTHLLKTAGYSLSNCNKFDVIIQYFIETGNYKTIFDVNEVLYQFDQQLLG
GUT_GENOME269013_017471-382MPFKVVRNDIIKMKVDAIVNPTNRYLKGSDSIDGIIHRAGGEKLEEECRALNGCKVGEAKITNAYNLLCKYIIHTVGPVWRDGKHQEREKLFDCYKNSIICAKQHDISSIAFPLISSGNFAFPKDIALEVACEALNKLSNDETIIYLVIYDTQALEISKSFLDVEHLIDEKTIIDITSIKPENFENEEKTYSETISDCKIKIEELYEKIYSTSMRLFMLNSIYRDLGTNVDSKRFLDNLYSIEKMCQDSLQKKIENFKTIISEIESINTTQQLICVNKTEISFSDLLFTFIRTKGIKPSHVYKKANISRSSFSKVRGDYHPKKDVAISYSLALELNIEETQKLLSTAGYVLSNSIKEDLIIMECINKGIYDIAKVNIILLKY
GUT_GENOME025538_004203-353FQIIRNDITKVSADAIVNTANPEPVIGGGTDYAIHKAAGPELLEARKQIGNIYTGQSVATPAYQLSAKYVLHTVSPAWIDGKHHEEEDLRKAYDAALILAYELGCKSIAFPLMAAGTYGFPHDLALSVAIRAFTDFLLDHEMQICLVLFNSKAFGIAGSIFDDLKSYIDDNYVSEKNEEEYFIDEYPTVNASSIRPKRAELQRRRRMERERRERESFVGSAPQAPLAAPRPDERYSLADILKQHEKTFSEYLLDLLKERDGKDSEVYKRAEVSKQLFSKILNNPDYQPTKSTVIQLALGLELDLVQTQKLLGKAGYALTRSSKTDLVVQYYIERKIYNVTFINEALYDCGL
GUT_GENOME244224_004284-308YTVKKDITKIKADIIVNAANTELREGGGVCGAIFAAAGAEELRAACDALAPIRTGKAVVTPGFSTKAKYIIHTAGPVYSGKEEDEKLLYSCYIESLKLAEEKGVKNIAFPLISSGIYGYPVKEAYEVAKRAILDFLKNSDLTVYMTLYGTTLPPPSAELRSFLETYPADCAFMGSVILEDASSLSMTEMISRLDEPFSKTLFRMIDERNMTDPEVYKAANIDRRHFSKIRNVKNYTPKKKTILALAVALKLDINETELLLERAGYTLSRSRLFDVILRYFITHKQYDIWKINEALYHYDQVELGG
GUT_GENOME235609_014421-330MPFQTIREDITRVKVDAIVNPTDDAYTHLGGTDKRIHDVAGEELYTECKRHGSLNVGDVCITSSFNMKNCKYIIHTFGPIYVDGKHNEEELLIGCYENVLRCAVDNNIESIAIPLISTGSFNYPKKDALRVATNVITDFIYEHELDVYLLVYDKEAFDISNSLYRDIHDYLEDNGVYDEVRSHRSRVPSKAFGFMSMPVGSSLSEHEVLDEEFSFVDFVEDESFNDCLRRLIKEKDLIEFDVYKGANLTKQAFNGIYNEGKVPKKNNVLALCIGLRLNIDEADDFIEKAGYSFSKGNKHDYIVRYFIEHEIYDVFKINEILIHEDLPYLG
GUT_GENOME026182_013836-330IKKDIVKMKCDAVVNAVDNTMTKGGYTHERIRKAAGRKLDEQCRAVGYCATGDVKITSGFGLPCKFLIHTAVPRRSDKKEDEEKLLTSCYENALTIAAANGCESIAFPLLSEDSSGCPIKSEPDIAAKAIRTFLLSNEMDVYLVFSRKNNRKINLDLYGDVLSYINKNAAFASPDVVFLGDLCAEDYMMQSSRFSNKKLFIKKPSSLEDMLKNADDCFTVTLLKLIDACDMSDVECYKKANVSKQTWYKIMNEKNYKPSKNTAVSFAVALGLTLPETERLLATAGYALSDSSKFDIIIKYFILNEIYDVFTINQTLFDFDQPLLG
GUT_GENOME185631_011381-362MPFQIVHNDITRMRTDAIVNAANGQLMMGGGVCGAIFRAAGAEKLQRECRKLAPCPVGGAVITGGYGLAARYVIHAVGPVWKGGGCGERELLARAYAGALRLAKENGCRSVAFPLISAGIYGYPVGEALQTAEEAIRAFLETEDMEVYLVVFDREAVTLGEALHADIIHYIDRYFDDSGQRLRLETELGYGAERVEEAPGAFKSGAFKPGLSSGAAREAGQKPKEIEIPQFLRPRARPLPDDRLKELLERPAETFSQMLLRLIDEKGYTDVEVYKRANMDRKLFSKIRGNPAYSPKKQTVLALAVALRLSADEAGKLLEKAGYTFSNARKQDIIIRYFLEQGQYDIYAINEALFCFDQPLLG
GUT_GENOME087912_019041-358MPLEIIRADIVTVCADAIVNSANPLPVVGRGVDSRLHAAAGPRLFAARERVGALAVGEAAATEAGVLPARYVIHTVGPLWEGGSGEAEALAACYRNSLNLALSLGCGSVAFPLISTGTYGFPKDLALRVAIRTIGDFLAEHDLDVILVVYDPESYQLSEELFGSVQSFIDEHFEDVEQLPAGDTFAYASPPSATYEGSLSAIPAFEEAPPSFASFAPSSPSSTDHPPRTPGFLNRFRRRQLEDVVNQVDETFSEALLRMIDERGVTDPEVYKRANIDRKLFSKIRSNPAYQPSKATAVALAVALQLSLDETRDLIGRAGYALTHASKGDLIVEYFIDQGVFDIFLINETLFAFDQALI
GUT_GENOME015170_0067127-231CKDETVLRKFYRNCLSMDVNSLAIPLVPFGDSEFSREDGLMIALDELNNNLDVYILCENLFEYDDLDAYLYGFKIEDSRIDYRIETPKELEDRLAHLKDTFSEYLMYLIGQNGLTNTEVYKRAMLSKKVFSKIKNDPMYHPNRNTALRLCVGAKLNMEQTKELLARAGYALSPCDKTDIVFSYFIENKKYDMIELDIQLEKYGEN
GUT_GENOME012869_012584-337HLVHADLTQMAVDAIVNAANSQLLAGGGVCGAIFRAVGSRTRDLQEECLEKAPCPTGKAVLTGGYGLKAKYIIHAVGPIWQGGKCHEKELLASCYQSSLELALENGCHSVAFPLISSGIFGVPKDLALEVARKAITGFLAEQEEELIVYLALFDKNAVELGERLDRELTHYIGTYYKQSPRRKRRVQRVREEAVCYSMEMDATAPMQTFQAPSVGAPPDSLENALGHVQKSFSQQLLALIDAKGLKDTDVYKRANLDRKLFSKIRSNKEYHPTKRTALALAVALQLSLSETEKLLQTAGYSLSPSIKDDVIVTFFLERGKPDLYEINEALFRYG
GUT_GENOME268098_015034-326ANTSLLSGGRVNGAIHRAAGPKLLDECRKLGGCEVGKAKLTKGYKLLARHIIHTVGPVWSGGKNNEPKLLRSCYKECLSIAAENKFDSIAFPLISSGASGYPKDQALKIAVDSCSEFLKTHDMDITIVIFDRESFDIGRKKLKDIQAFIDDNYFDESRKWGYSAAVGTAFSTNEPFRLFRKKASNSKGKARSAHSSFPAIGGTCIEGIEAKLKMIDESFTEMLLRKIDEKGMTDSDCYKKANIDRKLFSKIRSNKNYHPKKVTAVAFAIALELDIDETKELLLKAGFGLSNSSKFDIIVRYFIESGNYNIYEINEALFRFDQA
GUT_GENOME051358_012021-294MPFYIIKGDLVKMNVDAIVNAANYTLKMVEGVGRAIFHAAGDIELTNACKAIGKCAPGNAVSTPSFNLTTTKLIIHAVAPIYQNGKHDEELLLRRTYQNAFKIVKENNFKSVAFPLLGGEFNWPLRDCFNIACEEIKNYIKNNDLDLTVYLVMYKNYPSTVGDVVQEALTKFITTHYKVNEKERVLAKYKIEFPYTWKRFYNGMSDEELMYRANISSVTLNLIKNDPKFKPSKNLTLALAFGLGLKGNDMKAFLRDFNITLNYARLLDLILLFFIENDIYDVYEINNAMFIYDY
GUT_GENOME286453_0207413-367KIVRNDITKVKADAIVNTANPNPICASGTDLAIYEAVGKEQVRAERQKIGKIVRGDVAVIGAYQLQAKYIIHTVDPIWEDGKKHEFEILERCYRKSLQKAVELKCESIAFPLISNGVYGFPEDKALQIAVSVFSEFLTENEMQIILVVFDKKSFQLSGQIIGEIDSYIDANYVRVSHKKEYPIERRGRARSRHIPEEELYEQMLRSEDTEESYTLDEESDADVRLTEPCMLSSDMSLEDQLANMGASFHEKLFELIVAAGIDNKDVWKNANLDRKHFSKIQCDEHYHPKKKTVMALCIALHLDLEQSKDLMARVDWEFSSSSKFDLIVQKAIIDKQYDIMQLNVTLFKYTNEILG
GUT_GENOME062269_0127529-335DRKIEQSAGLTQRLKQTDAPLLIFVWEQSDPCNQGETLAQSYKQALMEAAEENCASVVIPFPELELSSEAQAIILRRLIGTVNEFLMNHEMSVTLAVDRRQTWELSASLLTSIQSYIDERYISDLSIFPTGRVFAKMKNFIAPLEEEQEEETVQDGWELNETILSDMEPETNEQDFQRIGETIKPRSLSQLLAQREASFSQSLLRLIDEKGKTDPQVYKGANIDRRLFSKIRTEKDYRPSKTTALALAIALELNLDEIKDLIGRAGYALTHASKLDIIVEYFVVQEIYDITQINEVLFALDQPLLGG
GUT_GENOME235886_011024-371KIERNDITKVSADAIVNTANPMPTVGAGTDTAIYKAAGYEELIEERRKIGVIPRGESRATPSFKLEKNGVKFIIHTIGVFWQGGSKGETDILRSCYKTSLKTAVDLGCKSIAIPFLATGSYCFPKELALQIAVEEISKFLLENEIDVKLIVYDRKSFKISEKLFSDVEDYLQKNLTDEDESDFFSDALKSCKIDSSEDFEEERGRIGRKKQAQRIKKRESSPIIGAALTSAEIPVIHNLESAREATIAPIDVDAFIQKGQTKNFQDTLQSLIAERNLPNADIWKKADMDRKFFSKIISTKDYVPKKKPVMALGLALELPLEDFERFLAAAGYAFMPSDRFDLIIKYCVMNKIYNIVKVDMILDDHGQE
GUT_GENOME204380_014421-313MPFLMVRGDIADMRTDAVVIPAAGTGQLRKASLVPCRAGKAVMRKRIGSSAPSIIHVVIPAWIDDRYKREQLLKKAYRSALRLAVKNDLRSIAFPLLSSGNTRPAQEDAFRIARSVITAVVQQEELSVYLVLEERDLLAPSGERQAAVAAYLEKHEAVQADEQMDSSGAPQRVKWLMMAEEQAHAPEPAGQKETLDALLAQKRETFSQRLLSLIDEKGYTDAEVYHRANLDRRHFSKIRNDIQYTPKKATVLSLCIALRLSIDECEDLMNRAGYAFSPCSSFDLIIRYFIEHGEYDIFAVNEMLFHYGLPLLG
GUT_GENOME198841_006971-343MPLQIVRNDITKMKTDAIVNAANPTLLGGGGVDGAIHRAAGAKLLQKCRTLGGCKTGEAKLTKGYRLPCKYVIHTVGPIWQGGKQGESDLLYACYQNALQLAVQYHCESIAFPLISSGAYGYPLQQAIAVAISAVRAFLETHDMYIYLVVFQKDAVSMASSLVADVTSYIDDHYVETHYDGNTRRILSEEQVQWQTKALNPLSPAVPYASTFPKKENLTDLETMLMQTDKSFAAYLLDLMDEKGMTEVETYKRANIDRRLFSKIKKSAFKNNAAYQPRKQTVLAFAIALRLNLEETQRLLERAGYTLSNSLKFDLIVKYFIIHKQYDIFIINETLFLFDQMLI
GUT_GENOME282985_005871-344MPLQIIRQDITKMRVDAIVNTTNEEMVGYSGVDLAVHEGAGPLLDEECAKLAPLGLGTAKITKGYNLDAKYIIHTSGPVWQGGLVGESIILKSCYIESLKLAVANGCSSVAFPLISSGTYGYPKDQVLKFAIQVITEFLFDHEMMVYICVLDRTSYEFSRKLFSEISEFIHDNYVDEYKECSFADLRAFEVSTPSEEIVQNDSVDAASKMMAPCKAKASSIAEKSLHEYMKTMDKSFAYKLFDLIDKRGMTDVECYKKANVDKKTFSKIKCKPQTYKPSKQTAVAFAIALKLNLDETQDLLASAGLTLSRSFTFDKIIRYFIQKGVYDIFEINEALFEFDQVLL
GUT_GENOME253970_018175-333LVRNDIVNMRVDAIVNAANSSLAPGGGVCGAIFAAAGRTALKKACRAIGHCDVGSAVITPGFNLPARYVIHAVGPIWQGGTRGEAALLQSCYTRALHLAQENGCESIAFPLISSGIFGYPKPQALRVAINAIEEFLLKYEMQVYLVIFDRAALLISERLYENIQRYIDDRYVELRAFPRQAAEALPHARQRAEQADLPLAAPCATPGKRSLDDLLGHLDESFSRMLLRLIDEKGMTDVEVYKRANLDRKLFSKIRKEGYNPSKQTALALAIALRLNLDETKDLLGRAGYALSHSNKFDIIIEYFIEEGVYDIFEINEALFAFEQRLLGA
GUT_GENOME098616_000111-308MSFNIIEGDITKLDVDAIVNAANSSLLGGGGVDGAIHKAAGEKLLEECRGLGGCKTGDAKLTKGYNLLAKYVIHTVGPIWRSGEYGEEKLLRNCYKNSLKIAKDKNLKSIAFPLISSGVYGYPMAGAIDVAISEIKKFLNKNDMKIILVIFNRESFVIPNTSLRKEISNILNKENKESVNKASIDKASIDKDAINNKPTNNFYEDLIDLTEEKSISLKELYKKANIDEAELEKIESNNEYYPSKNILLSFVIAFKFSLKEMETLLERYEYKISYDKRYDLIIKYFVENERYDIFEINTVLFAYKENLL
GUT_GENOME276716_016031-349MPFFISRDDITTLKTDAIVNAGNRELRMGGGVCGAIFKAAGAEEMERALQGLGPISTGDAVATPGFALPARYVIHTAGPIYHGRKEERDLLASCYRNSLLLAEKMHLSSIAFPLISSGIYGYPVEEAEQVALDAILSFLETHDMTVYLIILKERARKAMVYSRLSYFLQSHEDMEPAFGISIPSKAPAVERDSLCFDVLADEAPAQRTRRRKKGPRKAPVHDLPLPASGPIPDEFLRTEESFSHALLRMIDEKGLTDPEVYKGANLDRRLFSKIRSNENYKPRKNTILALAVSMDLSVEETENLLKKAGYTLSRSLTGDLIVLFYLQRGKPDIYEINEALFHFHQPQLG
GUT_GENOME197694_009391-308MPVKIIRQDITKLKVDAIVNTTNPNLDATGGLDHYIHQLAGKKLDVECRRIGKLKVGQACLTSGYKLCKYIIHTASPVWNIQNKNNEALLKSCYLSSLMLANEYKLKSIAFPLISSGTNQFPKELALQVAMNSIVSFLTDHEMMVYLVVYDRNSYKISSELFDSIEAYIDDYYEDEHMLVCNSIIAPRFSLIDALNQMDESFSTMLLRKIDENGMSDVECYKKANIDRKLFSKIRSNPNYKPSKQTVIAFALALELSLDEMEDMLNKAGFSLSHSNKFDIIVEYFVSQGNYNIFEINEALFAFDQSLI
GUT_GENOME071520_007005-339IIRNDILKMEVDAIVNPTDVDLSGSGSIDLKIHNLGKEKLKLELSSIGKCSPGDAILTNAYNLPCQYIIHTVGPIWNGDAKAFKVLESCYLNCLNLAIDFDMESIALPLIATGSFGCPTDKAIQIAINTINSFLIDYDLDVYLVVYNSEAFGLSKKLVEDVKSYIETDDLEDERIKLIQELLSYYEFNEEQVDDEPIGEIKVSKSIINICESDDAIMYETSCKCIAPDGLDDLIPELDKIEITNEDTFAYKLFKIIDENNFKDSDVYNAAGVSKMVFSNLRKGVIPKKKTIFQLCLALPININEATELLASCGYTFILSDKFEKTIKKIIEAKNT
GUT_GENOME097453_002761-377MPFTIERNDLASVSADAIVVAANEDLHITGGVGEAVAQVAGFTNIQEACNAIGHCPTGSAVATPAFNFPARIIIHAVGPIWQGGTHNEVALLRSAYDDALSLAAENNASSIALPLLSAGIYGFPADISLSVAQNAIHDYLESHDAEIRLVLFNRNALQAGLKVYDYIKEYIDDVYVGKHTDTRQRTRSTEAFNDTFFTAAGAAYGALSASPSATMMPSLSSPHKEAEEETSEHAAYDKNGTDDEALYAEALAAKTLGQPSCLSELLDSLDASFSTTLLALIDAKGMTDVQVYKRANMSRQLFSKIRSDALYKPTKKTVLALAIALELDLDATEDLLRRAGYALSHSSKADVIVEYFIMNNQFDIFEINAALYAFDQP
GUT_GENOME014402_0024144-311LTDDKKKEKFLLQYTYKACLNLAAKSKYENVAILFNCIDTDNFTESESLKIVVQIIVDFLRTHDINIFLIIHINKKFIDFDSKIFIQISKRIEAAVIASGRTEGCQPIPWFNYPVRECRREIVFPKTSEGLDEKCLYESEFCSENIEAVEEMVNGIEDNFVVSLLKLIDSKKMTDVECYKKANVSKQTWYKILNKKDYRPSKNTIIAFAVSLGLNYSETQHLLSTAGFALSNSSKFDIIIQYFINNKIYDIFKINEMLFQFGEPCLGL
GUT_GENOME064691_005661-306MPFRIIRNDITKVTAEIIVNITNSEPEVRCGTDFAIYRAVGKEGEEKLLRERKKIGKISKGDIKVTKAFNLNAKLKYIIHTVGPIWIDGKHEEFRNLEKCYDKSLKEAEKLKCKSIAFPLIATGMSGFPKDQALEIAVSVFRKFLFEHDMEIILVIYDDKSFQLSNQILNKINKYIESNHRINGFDFRKNQNFFKENLNHVTLSFQKKLLELIHISGMNDKDICRMANIDSKIFNRIKQRKTYYPKKKMVMSLCIGLKLNLEQSKDLLMSAKYEFNSSSKFDLIVQNAIENKEYDILKLNVELQYY
GUT_GENOME272664_014321-300MPFQFIKNDIAKVRADALVNAVNPLLKGSADGCAPSEAKLIEGIEGYALPCRYVIHAAAPQWRGGDHGEHKLLASCYRLALMLAEEHGCESAVFSLIFFRGHGSPKEEALEIAVREIGAFLRKSDMMVYLAVSDQTVIQIREPIFAEIEEALESRPIFGMRECLLPSEEARESTAPAKFSKRAIEEALAVGGETFSELLLRKIDERGMTDVECYKKANIDRKHFSKIRSDRSYRPSKNTVLAFAIALELTPEETDEFLARAGFAFSSASRFDIIVEYFINRGIYDIYEINEALFAFGEKQ
GUT_GENOME155261_019531-388MPFQIIRNDITKVKADAIVNTANPNVAVDDGVDSAIYKAAGKKKLLEARKKIGLLMPGEVAITDAFDLDAKYIIHASGPWWTDGSKGEEECLRSCYAKALQLAKDYGCNSIAFPLLATGTYGFPKELGIQIAVDTFTAFLEDNDIEITLAVFGSEDVTISGKLVEDVACFVDDGYVETALAEEYRNDNYPRETGKRPESRVGATLQSFHMPSFLRRENRDKEEDSFDALPLEEEAKEDESPDADMMQSYSAMEAPSVMMPEYLKPVSFEKKSESLEEALKEIYTDSFEKHLQQLINKKELKNSEVYATANISKQYFSKLLKGQVKPSKEKMLALAVGLRLNLDETIDFLRIAGYALSPISQTDKIVEYFIEHEDYNVLKIDIVLFDYG
GUT_GENOME251869_013411-325MPFQIVRNDITKMKVDAVINTANPNAMYGRGTDQAIYEAAGRDKLLAARQKIGTLQVGEAVVTPGFRLPAKYIIHNAGPVWEGGDRGEEEKLELCYENCLRLAAKKKFKSIAFPLISTGTYGFPKKIGLQTAIKVLSRFLLDSDMMIYLVVFDDESVQLSEMIFPDIPDIPDFIQQDRDCSHIPIEYPTQALFRSESLPMVCEETAYRTLDDIVGNRGKNLPETLMQFLIEKDLKNADVYHNANISRKVFSKIITNEKYHPKKKTVLALAIGMRLNLDETKDLLSRAGYALSPGNKFDIIVKYCIEKGEYNIVKINILLYEYGEE
GUT_GENOME070121_008151-341MSFSVTYGDALEYKGDAVLNSLGTNGAVYGRLCKNIIKGINRKDVKTLIDAQKDMPFGKIIETVGGDLQSAHVLHIVTPFKKNDDANCTQLRKAYKSVVDYAINHGYKTIGLPIIGTGANGYSDKEAYEAVLDVLSEISDKEVETEEDIINATVIAYLNPKPLKEQLYEDERRMLLERAGNVQGVYYDSFDLVDKVAPQNSKLKEAYADKHSFLNIIKFVADINPEDMFFPDLWPNPNPYKYPYDFIDDYCSQKNVNYKKALKEYDCRKRQKVRYNQTLSKIDVFRFSVLLNLSKTEILQFMSLCGYGFNPGSRLDMFFMDFINGKYGKFGKQRKLYEMDM
GUT_GENOME246834_013641-359MPFEIVRNDITNMQVDAIVNAANPKAAVGYGVDAGIHKKAGRRLFAARKKLGSIAVGEAAITPGYNLNAKYVIHAVGPVWRGGGHGEEQLLRRCYKSALSLAKQYECESVAFPLLCAGNCGFPKGVALHIAVSEFSEFLTENEMKIYLVVFGNGTLALSEKLFSSVASYIDENYVREKTLEEYGLSDIQSAPQASSMLRRRFQRQAELRGDAKENVDVAGSCAPFAAMPQQTPTVPKPAQSLESVLNNLDAGFSETLLRLIDDTGKKDSDIYKKANIDRKLFSKIRNNADYKPSKSTALAFAIALELDLDETRDLLERAGFALSHSSKFDIIVEYFILNKIYNVFELNEVLFAFDQPLI
GUT_GENOME045470_012241-382MPFSIVRDDISRVHADVLVNAANVRLAPGGGVCGALFSAAGFDEMRAACEAIGGCATGDAVATPAFNLPARWCVHAVGPIWRGGRAHEEELLHRCYRSAFARAVELGARSVAFPLISAGIYGFPVERALAIVREEVAAFLEYHDEIELTLVVFERAVVQMGNALVEQVQEYIDDEYVDSSSFMRRDTGELERELQWTEDASAPHSVEMAEPVALPKYLQEDDAPTVMPSASRPFVASSIRMPGGAMPDAPMPGAPSRAGTTLDAEIAQLVATLDAPFSTTLLALIDVRGMTDAEVYHRANISRQLFSKIRGNESYRPSKQTVVALAIALELDMSATQDLLARAGFTLSKSSKFDVIVRFFIERGIYDLFQLNEVLFAYDLPL
GUT_GENOME126289_007571-363MPFKIIRNDITKMNTEAIVNTANDHAAVGTGCDSAVYEAAGYQELLEYRKKHIGFVEEGGAFITPGFHLQAKYIIHAVSPYYIDGKHGEEEKLRSCYRKSLQLAKENGVKSIAFPLISTGGFGYPKEEGIRIAADEINTFLFGNEMLIYLVVFDRKATALGEKLYPNLESYIDHNYVESVREKEYGKSSEQSIRLWREQRPSSIDRRPNKNNGMQEKQPKLFLGGVDSQPCGFGEDEASFLDFEGLHEGKLEERMSHLSDTFSEYLMYLIQERKMENAEVWKRAIVDKKIFSKIKNNVNYHPNKLTALCLCVGAKLNLDEARDLLARAGYALSPCDKTDIIFSYFIENEIYDMIELDIQLEEH
GUT_GENOME105088_013384-342LMIRNDITKVAADAIVNPANRNLLQGSGTSRAIYQAAGEQELTAACEAIGRCDLGRAVCTPAFGLPAKYIFHAVCPAWHGGGFGEAEQLAGAYHSALELAAEYHCESVAFPLLSSGNYGYPKEQAFRIAVDTITQYVMEHDLTVYLVLYDRGSLAVSRKLFASVEEYIDDHYVAQNDESYGFGRRRRELSERRRLLEEDAAVPMLGAVPASAAAPRTARSLESLMDNLGESFTTRLLRLIDERGLKDSTVYKQSNISRQHFSKIQCNRDYNPKKKTVLAFAVGLHLSEDETIDLLKSAGYAFSDGSKRDWIVRYCLEHKIYNINQVNTLLFEYDQEQLG
GUT_GENOME112054_013601-312MPLHIVSGNITKIKCDAVVNPTNIFLDPTGGVDWDIQIAAGDELYAERAKIGTLSIGHAAITNAFNMNCKKIIHVCSPIWKDGLHGEDILLASCYTESLKLADENHLRYVAFPLIAGGVNGFPDKKAFSIAKVAIQNYINANSDIVVYLIIYDKSDIDINEMLKLDVAKYVASNLLHDKKSARPLDKETVCLSKKMVSFPSDNDLEKGFSDKLIELLNEKEMSNVECYKRANMDKKLFSKIICGSLPKKRNVLALCIALKLNLDEAKTLLNCAGYSLSHCFKLDLVVEYFIINNKYDIFELNEVLFDNDLPL
GUT_GENOME220009_009011-339MPFSIVRNDISLMSAGVIVSVANEHPLARGGVRGAIFKAAGPAKLIAACEAVAPCPAGRTVSTPAFGLSARRIVHAVGPRLIDGAHDEEAPLRSTDASTLAESARLGAQSVSLPLISAGIFGYPSAAALAVACEEVRRFLASDAATVRGTEMRIYLVVFDRAALAASLDRFNEVAAYIDDEFVDARPPRDAGLDLLASAPLPQMPVCRSAAASPHVDKDEVAALLRGIDASFLQALLALIDARGLTDAEVYKRANISRQLFSKIRRDNGYRPTKQTAVALAMALRLELDETESLLARAGLALSPSAKFDVIVTYFIERGCYDIYELNAMFFAFDQPLLG
GUT_GENOME194594_008551-373MPFKIIKNDITKVKADAIVNTVNPNVIIGDGVEYAIYNAAGKDELLKAREKLGYMPPGEVGITPAFKLDAKYIIHVSSPVWAGGWYGECPPESVFTKETILLGDCYMKALYMAAENGCKSIAFPLLATGTYKFPKEIGMAVAVRAFTEFLKKYDMEIFLVVFGEDSSNISGSLFRKVKRFVDYEEICACAESAPRDNLDWINGSTIDDDEETESYQDISEDEKLTESYWDISEDDNDELYYKSSQRSHKYSESRKHPESLEESLKKIHKYSFAGYLQQLINKKGMKNSEVYATANITKQYFSKLINGKVNPSKEKVIALAIGLHLNMDETKDILKVAGYAFSPYSQTDIVVKYFINNKDYNVIKLDILLYDFG
GUT_GENOME030967_01147295-587LRRGGGVCGAIFEKAGREKMEEACRKLSPIQTGEAVATPAFDLPARYVIHTAGPVWKGKEGEKELLYRCYEESLFLAEKLKAKSIAFPLISSGIYGCPKAVAEAVAKKAIFDFLAKRDMKVTLVLFGQKKRTRITEAIRSFLEWNCGAGMKEEASLMLADMAPEPMGAPPVRGPAKGFALQLEDPFQKVLFHYIDAEGKTDAEVYKEANIDRRLFSKIRTRVDYTPKKKTILALAVALHLDVEETEELLESAGYSLSRSREMDMIVRWYMEHGIYDIYEINETLLDFGENQLG
GUT_GENOME197827_016791-340MPLKIVRNDITNMHVEVIVNSANWRPEYGRGTDAAVYKSAGEKELLAERKEIGKMAYGECKETKAYDLEKKGVKYIFHVVPPRLEMDDCVSENEKQLHDCYRNALLLAAEKRITSIAFPVLGSGMYDFPKEKALSIALQEITDFLLNNEDICIYLVVYDLSMFRLACRIDENAYSYIDEAYVKLKNWLSYGKWAEKNVGEQLKKHLKSGKRDNHLWYEQEEEKRYMKKGERQPISAIESDSWELLNELVKKDDFYSVFQKFEGDLKGSIVYGRSHISKQLYAKINPSNKNYIPDKKTIILLCMGLNLNIEDTKILLESAGYCLRKNYVWDQIIEYYFRDE
GUT_GENOME070496_021211-342MPFTIVRQDITKMELDAIVNAANTELKMGGGVCGAIFKAAGAEQLQAACRELAPIKTGEAVITPGFRLPARFIIHTAGPVYSRLQPGKAEKLLRSAYTQSLELALENGCESIAFPLISSGTYGFPKDKALKAATDAIKDFLTGHDMDVYLAVFDKAAFAVSKKLMGAVESYIDDKFVDAYQSRRRLLSVERKALNMPEELDLSESAPLPNLPCAAAPQMASKSLDELIGNLDEPFNTTLLRLIDAKGRTDAEVYKRANIDRKLFSKIRTGKGYMPGKRTILALAIALELSLEETDDLLQRAGYALSHSQKFDVIVEFFIVSRRYDIFEINQVLFQYDQPLLG
GUT_GENOME284018_0201123-270DLPDSAEHVISASVPFYDSGSPQESECRLESAYLETLEQAEESQCRRIAIPMSSALRCGYPAEVSFEAAESAARRFLEDHDLMIYLVRERPAAEKRDAQLEAFLSLNLEPEMMSACALPEGRILLEEDDVDDAVRHLDESFQSALFRIIDAKGLTDAEVYRKANLDRRLFSKIRSQKDYRPGKRTILALCIALKLSREETEDLLEKGGYSLSMSQVSDVIIAYFITHGNYDIYEINEVLFRYDQQLLG
GUT_GENOME199712_0069586-412DITTLKCDAIVNAANSSLLGGGGVDGCIHRAAGPKLLEECRTLGGCQTGDAKITNAYDLPCNYVIHAVGPIWRGGQFHERELLTSCYEKSLALAQENHCETIAFPLISSGIYGYPKAQALKVAIDAISAFLMENDMTVYIVIFDKAAYRISSKLFSDIASYIDDHYVETHINKTSEAMRRQKQFREMDLCESISLPEPSNVPLSKAATLEDALQQMDESFSEMLLRKIDESGMTDAQCYKKANIDRKLFSKIRSDKFYKPSKPTVLAFALALELPLAQMQEMLGKAGFTLSRSSKFDIIVEYFVERRNYNVYEINEALFAFDQSLIG
GUT_GENOME165621_014045-326LIRQDITKIPVDAIVNAANPRLAMGGGVSGAIFRAAGADQLQAAANKVAPVETGHAAITPGFSLPAKYVIHAVGPIYRQANPDQSRRLLRSAYTESLRLAVAKHCLSIAFPLISSGIYGYPKEQALAVATSAITDFLTDHELDVYLTIFDQASFVVSQDLLGPIASYIDQHYVNAQPDTRRRARPDHELAAYPAQTPLSQEPLDQLVNTLDDSFATALFHFIDARHLTDPEVYKRANIDRRLFSKIRSNPDYVPSKRTAIAFAIALQLSLTEANDLLQRAGYTLSTSQKLDVIIMYFIGSKSYNIYRINEVLFSYDLPLLGS
GUT_GENOME018196_013591-394MPFLIVRDDIARISADAIVNAANTRLQAGGGVCGALFAAAGAADMQAACDAVGGCPTGGAVSTPAFALDATWVIHAVGPIWRGGLSGEREALRSCYRSVFAEAERLGVRSVAYPLISAGIYGFPVDEALAIAREETEAFLRVHEDVAVTLVMFDRATMRAAGELFDEIDEYIDDEYVEASPHMRRRAERLAVEMRGGVSELFADEDDEGAGAPGLAPGLEAARGASLAPDLAPELSAPVAASEPAPSAHPAGYSTPVAFGGEAVCAEFSAAAPRELDDLLSNLDASFSETLLALIDERGMTDAEVYHRANLSRQLFSKIRSNRAYRPTKPTAVALAVALELDPRQTQDLLGRAGLALSRSSKFDVIVRFFLERGVHDVFRINEALFAYDQPLLG
GUT_GENOME100963_007035-354MIRNDITKVQADAIVNPANTDLLEGSGTSRAIYEAAGEEKLIKACRKIGHCGFGEAVITPGFGLPAKYIIHAAGPVWMGGRHGEEEVLYQSYWNSMKLASEYHLESIAFPLLSSGNYGYPKDKALKTAVHAISDFLMEEEMLVYLVLFDREALAVGKKLSASVKAYIDDHYVEVKNEAYRDADAFFEYQVRRSVSYVAEAPVNDGARAPAQAMASPFPMGQAAPQVSKRHLDALMHRKNETFSQMLLRLIDERGFKDAYVYKKANVDRRHFSKIRNDTEYAPNKKTVLAFAIALELSMDETKDLLMRAGFAFSCCSKFDVIICYFIENGMYDIFEINEMLFAYGQPVLGE
GUT_GENOME253619_003141-329MPLKIVRDDIVKVNADAIVNTSSPSPVIGRGVDLRINNAAGPELIRYREKFGQLEYGDAVMTPSGKLGAKAVIHAVTPVWRGGGFGERHILADCYRKALAIAEDAGMNSIAFPLLSAGNQGFPNYIALETANSVFEEFLADSEMLICLVLFSDEAMNASGELRDRIEEYIDSNMADEIMMEESACCSVSDIPVEYNRNEKKRSFLCLKRKKAKLEDIDSMLEGSFTEELIRLAGIKNMTDPQVYKAANIDRKAYNKIINAKDYHPGKNTAIALGFALRLNLDQMKDFIGRAGYALTRASRFDIIVEFCILNDIYNIIRVNEILFEYTEK
GUT_GENOME124455_008604-351YVVKRDIVSMKTDAIVNAANPRLLEGGGVCGAIFAAAGADTMRTACAALAPIGTGDAVLTPGFQLPAKWVIHTAGPVYEGGASGEAALLRRSYKECLKLARSHHMKSIAFPLISAEIYGYPREEAFQIAEETINDFLKKYDMEVYLCLYPEAPGDISIPAQLDDFLRTQGILPESPIPPAGANACFELSYRPEETDSAALDAQKESVRYSLQKPETTCEERICADISEDLRHIIKELDAPFSEKLLRLIDAKGMTDPEVYNAAYIDRRLFNKMKSPNYRPRKDTVLALVFGLRLTLEEANELLCAAGYALTPASKTDMICTYFLTHHMYDVAIVNSYLLDYDQKPLGY
GUT_GENOME085152_002501-335MPFILMRNDITKVEVDAVVNAANTSLLGGGGVDGAIHKVAGPKLLEECKRIGGCPVGEARVTHGYNMPCKYIIHTVGPIWRGGGKEKEEQLYSCYKSSLKLAEIKGCKSIAFPLISSGAYGCPKEKALTIAQKAIRDHLKNHDMDVFLVLFDKSAVNVSQKLKFDIQNYIDDKYSEEHYPSEAEEQRARFNISGSILFALRNRSARYAGVAPAANIAAILKYINKPDFSFSQKLFHLIDERGMTDPEVYMRANVSKQVFSKIRRSDYHPKKNTILALAVALKLSLEETNDLLSYAGYTLSRCDKGDLIVSYYIENKIFDVDKINIMLLEFDQTLL
GUT_GENOME158557_011691-347MPFEIVRQNLCEVTADAVVRFVSTGTGVRVLHRGSLPPGFMAKELEFAFTPKAVILIEVSLPQGGDGEFNPALLLTQAYRQALTAAVREHCGSVAILWPKFSLPGISEDDLLSILVQGERELLTEQELWVTLVVEDVRRLRLSDSLINAVSRYIKDHYIQALPREKTGHVLAKVKNFIAPVEAELEENEELTDIPAEETCLFKEQEEAQAPSDKFYAEPKMRSVDPRLFEQLIAQHEESFSESLLRMIDERGMSDPQVYKRANLNRKLFSKIRIQKDYRPSKTTALALAIALELNLDELKVLIGRAGYALTHASKLDIIVEYFIGQGIYDLFQINEVLFAFDQPLLG
GUT_GENOME103454_016886-338VENDITKMDTECIVISADVSLKPTGGMSTEFYNAVKEPKNLIKECERIGFCGGCECVTTHSYGLPCKYVIHAVAPIFADSKSKAEEYLLTCYKNAIYMARRKMFESLAIPLLGTGKKGYSKEMSLRLALQAITEYLSKYDMSIYLVVHNKSSFKPDVKLISSVESYLSKNYTGNGLQGFESVSTYANSFIYDVCTDSAVNYIGAECKTMLNDKEQNIKSESADIVNFPAILKNVLKKYSVKFGKTYRSANIEKSAFERLKSNEDKIPDKEVVLALAVALQMSINTAEQFCAESGYSLTHNKQDTIVRYFLENKIYNIHMVNQMLFYFDKGQLG
GUT_GENOME112544_018721-343MPLFIIHGDITKLKVDAIVNAANSSLLGGGGVDGVIHRAAGNKLIDACRRLNGCEVGDAKITKGFRLPAKYVIHTVGPRWQGGHYGEKELLESCYNKSLSLALRHKCHKIAFSLISSGIYGYPTEDALKVAENTIQDWLEDHDKMTVYLVLYSMEYGFCDEERTEDIKRFILKEQLTESSTYHVTGPHTTEKPFGMLVSPAVGNPSLSKRVLNRLKKNGHFSDKKIYDNLSRKLTGAHITFYDTLCRLIDKKGLNEVECYKKANVSRKVFSRLRQNGQPSKRTVLAFVGTMGLTRDEAEKLMESAGFAFATGDKTDMIILYFIDHQITDLDEINYALADFEEP
GUT_GENOME211809_008025-331LCEGDITKLGFDVDIIVNAANRMLLPGGGVCGAVFSSAGAGLEAECGRLAPCETGNAVITGGYGLCKNIIHAVGPVWQGGEKGEELLLASAYRKALVLAREAGARSIAFPLISSGIYGYPFAAAYDTAKKVILDFVGNEDMTVYMVLMNAPRLLPAIKNADIERFLSRHRMPKENRLRKQGVFSVANIGEFKERQFGVSSARMLSRDKQGEKSLEELISTPAETFSQYLIRLIDRSGMTDVQVYKRANIDRKLFSKIRSDDNYRPSKKTVLSFAVALKLSLDDTRDLLGKAGYALSDSVKSDIVVEYFISRGIYDIWEINEALFDFG
GUT_GENOME139854_014046-335VHSDITQMRVDAIVNPTDSRFSGSGGVDLAVHQAAGPGLAQECRQLKALRPSEIEVTGGYNLPCKYVLHTVGPVWKGGWANEQALLRSCYLNALFRAAELGAESVAFPLISAGTFGFPKDRVLSVAAEAIRNFLSLRDEDMRVFLCVYDRNAYRVSRRGELERFLNSARMEPPGKAMPMPESRSQRIAMPKRVESAACEEAMSCEAPAPQSLEDWLKHQDDTFAVTLLKLIDKKHITEVQCYKRANVSKKTFWKINNDPKYKPSKPTVLAFAIALELSLPETEDLLRTVGFSLSHSSAFDMIIEFYIRKGIYDIYEINAALYQFDQVCLG
GUT_GENOME130027_012431-306MPYKIIRNDITKMHVECIVNTANPYPVIGDGCDSAIYHAAGSSELLAYRNTLGEFEECDAFLTPGFALDCQYILHVVSPYYEGEQTNALIHLCYQNAFQIIEENNIKSVAFPLIATGSYRCPKEIGFRIALEEIQMFTSQHDVMVYLVVFDQESTKISKQIAAHLEEYITDYYVEDALLEEYCYAMPATVKGSLADSLEERLLHTSDAFNEYLLYLISLKGLSYPEVYKNALVDKKLFSKIKNGSHPSKIIAQCLCIGAHLNLDEAKDLLSRAGYAFSPSDKTDIIFQYFIENEIYDMIEIDIQLE
GUT_GENOME213311_010851-404MPLTFLRADITTLSCDVVVNAANTKLLPGGGVCGAIFKAAGFLPLALACRKIGSCAVGDAVMTKGFKLKAKAIIHTVGPVYGKDPANQDRLLGDCYRNSLRLAAENGFESIAFPLISSGIYGYPKDKALNIATNAIKDFLQASETDMQVFLVVYDKASFEISRTLYEDIRSYIDEREVRPPAASRMPEDGAANMPFGAPAPDDFGVDAAPAPDDFPVGAPAPNTFPGNTGKHDYAADENLQHDKTSHSQAKSLPRTKPLSDRQTNASSEAVCEQASMSLLDAPRDRSLADLLDRKSETFSEMLLRLIDEKGMTDVEVYKRANIDRKLFSKIRKKDYVPKKATVFALIIGLRLNMDEARDLLSRAGFAFSESLRFDIIIEYFIEHNRYDIYEINETLFAFDEPLL
GUT_GENOME254219_000271-329MPIYIIGEEILNKKTECVVCPIKVSRLYLYEDICGKIYKKAGQIELSYLFDEVNPMAFAVPVITDGLALSDKIIHIIVSDFTLCQTFKMDMYQSYNDIVKLIVENGFKTIIMPPLCFSYKRLGNKNSYRTCAAFMRYFLDLYHVDCNVFIMVDKRTLNDHITNYVSTYISTSFPISKRHKPPVYPLTTIEEVNAFVKENKVIFHKKILQAVNYNYELKNPKFENLYSLIKNKYESDAHFCFSANISKDEYRKLHEEEGYIPPKTTLLGICIALNLDIDQTNNILQDLLNEKLNNEDPGDSIILTYLEEKNYDIVSINEKLFYADQPQIG
GUT_GENOME248894_017185-367LVRQDITKMKVDAIVNAANTQLAMGGGVCGAIFRAAGVSRMAAACDRFAPIHTGEAVITPGFNLPSRYVIHTAGPIWRDGKHDEERLLRSCYRNSMELAARQGCASIAFPLISSGIYGYPKAEALHVALDEIRRFLGDAADSGHDLDVYLCVFDKTAFEISSSIDKRLRAYIDDYYVAEHEDTLGSRTYELLALQRFSDNKPKADRAVPSGVLDMPDFLGDMGETPVYSAPSAAPSPKSAGHVDDAVLSNLDEPFNIILLRLIDAKGYSDVEVYKRANINRKLFSKIRCGDGYMPSKKTVLALAIALRLNIDETQDILACAGYALSHSVKFDVIVEFFIVHEMFDVFTINEMLFRYDQPLLGQ
GUT_GENOME236868_013853-367FRIVRNDISKVRADAIVMSANPKPICGGSCDLSIYNAAGFDDMLKARRVIGLLETGEVAVTSAFALPANYVFHTVGPVWKGGESGELLALRKCYEKNLEKAKELECQSIAFPLISSGVYLFPKDKALSVAVETIRDFLQTNEMDVALVVYDKRSFELSQNLQNDVRSYIDENYIALKDSYSEKWNLNSRSKLSAFETDCGENSVLQSDDDECSSFSLASFENLEVRRECAFRNICTKPSIDPKEEPLEKVINHAAETFQQKLFSLIQNKKFDDVDVYKKANLDRKHFSKIRSDVHYSPKKKTAIALSIALQLNLDETKDLLSRAGYALSPSNKGDLIVSYFISHQKYDIWEINTVLFKYGQTTLG
GUT_GENOME088421_025051-358MPFEIIRQDITDMKTDAIVNPTNNKLQPSSSGVCSSIFKKAGFQKLQNTCQKIGSIKIGDAVITKGYNLGCKYIIHTVGPVWRDGKNNESTLLYNTYINCLKLAKYKKCKSIAFPLISSGNFGYPKDKALEIATKAIKDFLSENDMLIYLVVFDKKSFKISKNLFDSIKQYIDDNYVDAHKDKRYKQNIDILDDVDSSTVLYSMTKPKSKSKKSKLFSKKSSPIEINEEASPLPFELDDEESLLHALSNTEDTFSESLLKLIDESGLTDAQVYKKANIDRRLFSKIRKNKDYTPSKSTVLSLAIALELDINKTKDLLRKAGFALSHSNKFDIIIEYFISNNTYDIYTINEVLFAFDQN
GUT_GENOME213893_001411-348MPFEIVRNDIVKMTVDAIVNTANPFPEIGTGVDTAIHKAAGHRLLQAREEIGNIVRGDAVITPAFNLNAKYVIHTVGPVWNGGANGEADCLRSCYRKSLELAKRNDCKSIAFPLLATGNYGFPKDQALQIAVSTISDFLLEYDMMVYLVVFSEKAFSLSEKLFKAVESYIDEHYVEERLEEEYIRNYSGEAGRELRFIREHCRLAELPDDVIEKNGSMPSCPSSVAGTALSMEDMLREEERTFSEALLDWLIKKDLNDPDVYKKANMDRKLFSKIRSNADYKPKKSTAIALALALELDLEETKEFIGRAGYALTHSSKFDIIIEYFIRQKNYNVFEINEVLFTYNYPL
GUT_GENOME229268_0068514-315TIIFGNILDSQKEVLVSSDDCYITMGGGISGCILKTGGRAIYEDAQKQIPAKLGNAIVTTAGSLKQKYIFHAITIDNEYAHKRHEEDSQQKLEIQEFIVVHSVKYCLRLLSNLGLSSIAFPAIGAGVARIPYKKVAQSMASAISEFLINTNKHFEIEIYLYDRYGKMENWDFLPFFESFSSAEACSEQVHKFELHKQAVEENNEKLDFSDVEISKTAYTKHGIHKVFISYSRKDEDKTKGICSLLNDMGIPYWIDVDGTYSGENFKEVIVNAIKQSQLVLFVSTKNSNASLNVAKEISLEDK