UHGP-MC 35999


Information


Number of sequences (UHGP-50): 104
Average sequence length: 239±23 aa
Average transmembrane regions: 0.01
Low complexity (%): 6.75
Coiled coils (%): 0
Disordered domains (%): 1.44

Pfam dominant architecture: PF01555
Pfam % dominant architecture: 192
Pfam overlap: 0.52
Pfam overlap type: shifted

Downloads

Seeds: MC35999.fasta
Seeds (0.60 cdhit): MC35999_cdhit.fasta
MSA: MC35999_msa.fasta
HMM model: MC35999.hmm

Sequences list (filtered at 60% identity)

Protein Range AA
GUT_GENOME247680_00038 151-399 EEYKEFVDKFKPSTIKTSDDCYTPDNIYNAIRDHVVEKYRLQGREIVRPFYPGGDYEKEEYKEGCVVIDNPPFSMLSEILSFYIKNNIDFFLFGPHLTIFNYDTCNIVITGIKIVYENGAVVNTAFLTNQGGCFIEADADLSAKIKAANDKNREVYPGELSRHVYPDNLLTSTMVEQMAKSGVPFEIKREDCYHIRVLDEQKKEDKGIFGSGYLLSDKAAAMKKEAQKKAEEKAQEKVYKWKLSERELE
GUT_GENOME119777_00128 152-365 NEDYEAFTSKFEPKLTTDDCYTPDYVYKAVKEWACDAYGIDQDKCVRPFFPGGDFENFDYPEGCTVLDNPPFSIMARIIDFYEERGIKYFLFAPQLTLFSRNRPCNYICAGIVIEYENGAKVNTGFVTSYGDWKVDTAPDLHELVEDAQRKAKNEQKGEPMNSWQLPDNVITAAMMAKLGKRGISLRIPGASCSFVRKLDNQKPGAALYGGGVP
GUT_GENOME034178_03259 27-284 LFDDYEGFVDKFEPKKTTDDCYTPSEIFDCVLHFLATKFDLNNKEVIRPFFPGGDYESVDYPHGCVVVDNPPFSIISKIARFYIERNIDFFLFAPHMTLFSANIDCTAIVCGADIVYENGAKVKTSFLSNMLGGCRVLGAPDLYKELITINENKKVGLPKYKYPENVLTVSMVQKYVELGIHIEIDKGDVYHCGGLDSQKKHKKGLFGSGFLLSEQATAEKLAAEKLAAEKLAAEKLAAEKDNTITWELSDREKIIIK
GUT_GENOME131345_01190 5-200 KIERPFYPGGDYENYKYTKESVVIDNPPFSVITKIAKFYNEKNIDYFLFAPGLTLFNILMPNDKTNGIVVSNTITFENGAKVKVGFITNLGEYKVRTAPKLCEWLKAANRWANNDKKRYTYPPNVTSAALLQKIADIPLGIKKDEIVFTRYIDDKNQQIYGGGGYISSRIAAKLKAAELKAAEQVQTFHLSAREKE
GUT_GENOME058623_01023 27-260 FEDYEGFVEKFKPKKTTDDCYTPEPVYEAIKGWVSANLMPLDGVEVVRPFWPGGDYEGHGYPEGCLVLDNPPFSMLAKIRRFYSARGIRYFLFGPTLTFANCARELDDTYIICNARITYANGAVIPTSFITNIPCDLRIWVAGDLSAKIDEADSRNKARAGKNALPVYQYPMEVVSPAILQKIARRGVDFKVRKSECVSISKLDCHRATGKTIFGGGWLISERAAAKRAAAERA
GUT_GENOME095596_01119 4-222 RKPNDEFYTPEPVYNTVLEWVVKEYGIDKSKVLRPFYPGNDYKSFDYTDKIVVDNPPFSQNQPILEYYLEHNIKFFLFAQFQTLFTVNLPVCYIVTKPKIKYENGQKIATSFITNLEPPCIRTAFDLQKELQGYEVKTKQSVFLDKHILTASRIVRSNKAIKLSLDKMQHVRAVNGYKLFGAAFLVDDETANKFPSSCNTEADKFIELSDREKAIVEKL
GUT_GENOME238364_00414 43-263 VFHDYEGFLKKFTDNPKTTDECWTPKDVYEAVVRYVDEELCPLEGREVLRPFYPGGDYEHADYPEDGVVIDNPPFSIFSKICGFYVARKIPFFLFGPGLTIMSACKHGATAVFVPNQLTFSNGAAVKCNFATNMAGDLLAMTAPRLGKLLRACPSQNVKVKLPKYAYPANLLSSSDMQTIAKRNEPFAVPRTEAEIVRRLDMMPKGKSLFGDHYLLGDGLA
GUT_GENOME018717_01442 25-287 MDDTEEYGEFLDKFEAKKTTDDCYTPAGVYDAVLSWARAEYGLGDRRVVRPFYPGGDYRAESYPEGCVVVDNPPFSILSEIVGFYLERGVDFFLFSPYLTNMGVGAGDPRVCHVLTDATVTYANGAKVNTAFLTDLEPGCVVRSAPALQRAVCEADDVNARGGEPPVERVEYAYPDEVVTSTMVGYLSKYGVEWHVGPGSVSFTRALDSQAEAGKGIFGSGFLLSEKAAAEKAAAEKAAAEKARNKKDGSPRVVEWVLSDRER
GUT_GENOME225806_01394 4-216 DEYFEWIKKFERKHTTDECLTPPKVYDRVKNYVVEFFSLENCTIERPFYPNGNYKEAAEKYAVHTVVIDNPPFSKMAEIIKFYNDRHIRYFLFAHAKTALGLVKHGASVWFAPANIIFDNGARVSVSFVTNMEACQCIRTVPQLLFLQKTQQKTNNNYPPDLFIFSHFETICRHGLEIIVPCDASMVRTEYNDPQAGRKKIYGAGIQLPTNKN
GUT_GENOME244185_00831 22-276 KEKLEDYDSFVKKFDLERQKTTDDCYTPPAIYEAVLDWLKSKVDLSGKEIMRPFYPGGDYKKELYHSNCVVVDNPPFSILSEITRFYIDMGVKFFLFAPSLTLTSARIAGRKDVDVTYIVCGVTVTYANGAKVNTSFVSNLFGDVRLWCCHELYRAVKDVNDQLLQEKKVNLPKYVYPNNVTSAALLNKIAKGADLKIMKDDCYHISQLQSQKRLGKSIFGNGFLLSDKAAADKAAAEKAAGESIVWELSENEKR
GUT_GENOME235878_01639 13-230 SPEYKAFVERFRPKKTTDDCYTPENVYSVVRDWAVARYAIPPDTEIVRPFWPGGDYERFDYPAGCVVIDNPPFSILSKIVRFYDANGVRFFLFAPYLTNLSIGSGCAGVNHVIAPCTVRYANGANVATSFVTNLGDDFVFSAPDLMDAIEKANDANLKAVKKSLPKYVYPDCVVTSASFGYLCRHHTPFSLKRSECAFIRNLDAHGDKSFFGGGLLLS
GUT_GENOME022516_00945 14-265 VHEDYDGFVEKFKPKHTTDDCITPPPVYDAVLEWLRLQGAIAADTPIVRPFWPGKDYTEEEYPEGCCVVDNPPFSILSSILNFFESRSIPFFVFAPGLTLLGGSKRKVTAVAVNACVTYENGAKVSTSFLTNLPIFAPYSVITAPSLYTAIKTAQEMAKSTKSLPKYAYPHNVVSISALHLIAHHVEIQIPKQESVFIRRMDAQIPLKKCVYGAGFLCSDRVAAELKAAELKAAELKAAIVFELSERERKIV
GUT_GENOME013821_01708 25-249 KVGKDDYGEFTEKIKAKLTTDDCYTPVEVYEAVLGWLREKVDIEGCNIVRPFWPGGDYEAYDYKEDDVVVDNPPFSILAGILRFYQGRGIRFFLFGPQLTLFSSSSSSSLTYIPCACAVEYANGAKVSTGFISNLFGDVLAMSAPDLRKRIKEAQEKARGEASASMPKYEYPPEVLTSSMLGYLSTHGVEFKVMRDEVSPAPLSALASQKAVNKAIFGKGWLISE
GUT_GENOME022311_01403 11-282 YDGFVEKFKPKKTTDDCYTPPCVYDAVLDWVKRNADIKDRPVVRPFWPGADYTKTDYPDGCVVIDNPPFSILAQILRYYEGRNIHYFLFSPHLTLFSNAKDGRTFVVAGAQITYDNGAIISTGFTTNLPDFSDYCVIGCPELQQAIKEAQSKNREGKKKEGELPVYSYPDHVVTASKVKNLVDVGLQLLIKRESAVEITGLDAQKPLHKHIFGKAILVSDATAEKLKEKASKIGGLKVEKLKAEKLKAERLKEEKMKKAIRFELSEAEREIV
GUT_GENOME007568_01478 26-297 SRAEILEDYEGFVNKFKPKKTADDCYTPPEVYDTVRDWVDANICPLAGRRIVRPFYPGGDYQNEEYRLGDIVLDNPPFSILAKIIDFYTANHIDFFLFAPHLTLFTAPRDNVTYIVAYGEIIYENGAIVKTSFVTNLEKKHRILVAGSLCTAIKECCKKKEKEKGKNNLRKLSYPDHLVTPALLGKIANQGVDLRIPMKECAYIRKFDNMGICIFGGGYLISERMMTEKMAAERLAAEKKNKKAAERLESDITKTIQVNLSEREMEIIRNLS
GUT_GENOME089499_01194 14-267 IREDYSDFVEKFKPKKTTDDCYTPDEVYSVVLAYVMERCPEVRSLRVMRPFKPGGDYEAEDYTGAVVIDNPPFSILSKVKQFYIERGIPFFLFAPSLTLFGGDDRCSRIVTHADVVYANGAKVKTSFVSNLFGTRSVIVDGCLRTRIEQAQEGQRAEKSITLPKYKYPAHVTSSALLGTLCVDGVYLEIDESECERVCVLDSQRPLGKRIFGSGFLLSDSVVERIKVERIKVERTKVERECVEWELSEREREII
GUT_GENOME135737_00719 4-277 ETYEQFVEKFKPKKTTDDCYTPPLIYDGVADWVCTEYGINRDAFCRPFYPGGDYETFDYTGKIVVDNPPFSILSKILRFYIERNIKFFLFAPALTLFSGATEHCTAIPVGVAVTYENGAVVSTSFVTNLDDSDIRVRTAPRLYKILKSCNDASRKEKTKTMPKYEYPKSVATAAEINRLSKAGIDFEIRKSESLRVRALDAQLLQKKAIFGSGYLISDYAAMRLERAERERAERERAERERAERERAERERAERERAERWQLSEREKAIITELN
GUT_GENOME093325_00427 7-246 RKNTYKEYVDKFKAKKTTDDTFTPDLIYNTILDYVTNKYGVDKDKVVRPFWPGGDYENFDYSDGEVVIDNPPFSLMSQIVHFYITNNIRFFLFAPGLTLLNVDKNDFNRFCHVVTDSVIEYENGAKVRTCFVTNLDDNALVVDTKLSDLIKEANQQANPKRQVPVYKYPDNVLRMTDFFHVSKLGKSLSIPQNQALWFEKTESQKKAKTKIFGSGMLVTDKIADLVGKYRRQPSRPNREW
GUT_GENOME105515_00018 208-432 YQAFVDKFKPKKTTDDCYTPAPVYEAVKTWAIDKYGLTDRPIVRPFYPGGDYQKMTYPDGAVVIDNPPFSIESEIVDFYHAHNIDFFLFGPSLTGMAQMIGRDWLRSVIAHAGVTYENGAVVSTSFVTNLGTAKVTVSPELHDSVETASRAALVERNAPQKMPKYDYPDEVATAARLGYLASHGVSLEIMPDSCTPIRSLESQRAAGKAIYGGGLLLSREGCRRE
GUT_GENOME286259_01094 19-274 YEGFIGKFKPKKTTDDCYTPPYIYEALIAWLQAKGYITPDTPVVRPFWPEGDFTDLEQYPPGAVVVDNPPFSIMAHILRFYEQHNITFFLFAPSLNFFNYLSDAYDLTAIIIGLPVVYENGANVCTSFISNLHCFDGVKAMTAPDLYQMLDNAQKENRKKVALPVYIYPDNVVTAARLHKIAHGVPFLLLKKEVAYARRLDAQQPLKKGIFGGGLFVSDDKAAELKAAELKAAELKAAELKTAELKNVFRLSERER
GUT_GENOME094490_00875 146-361 EESLAFEEKFKPKKTTDDCYTPPEIYEVIKDWACSEYNIDPAKVVRPFYPGGDYEKFDYSGGKVVVDNPPFSILTQICEFYVENGVPFFLFCPTLTGFGSRRLAMKTCHLICGCKIIYENGASVNTSFVTSYGGDVVAKTVPDLREIVDAKVDEIRDRETTDLPNYEYPDEVITAAMMGRWSKYGVEFEVRRSDCLPISKLDSQEESGAAIYGGGV
GUT_GENOME246380_00717 12-260 EDYEGFIEKFKPKKTTDDCYTPQPVYDALIKWIDDNIMPLSGIEIVRPFYPGGDYEHFDYKPGAVVIDNPPFSILQKICRFYNTHGIKYFLFAPTLSLFSSPLPSETCIVAYASIHYENGAKVNTSFRTNMVGDNTRVMLRGDLKKIIEAENPVTTNKVRKIIYPDNVISAATLGRLVSRGIIWNIPASDTYFIRRLDNSGGVYGGGFLLSERAALERAALERAALELAERVERVERVDRAILSDREWQ
GUT_GENOME101984_01538 13-270 KEIHEDYDGFVEKFKPKKTTDDCMTPPEVYDEVVRWVDERIAPLEGLHVVRPFWPGADYTAVDYGPDDVVVDNPPFSILSRIIDFYLAHGVRFFLFAPALTLFSAPRPGVTYVAAHCEIVYANGAKVRTGFVTNMQGEFRIMVAGDLYERVRSVMDMVLAVERKSKRVMSYPPEVVSSATLGRIAVRGLCLNVPFSECTFIKKLDCCGKTIFGGAYLLSDRAAADRAAADRAAADRAAADRAAAERLLLSDRERELVK
GUT_GENOME240191_02013 365-616 QKGLFNDYEGFVEKFKPKKTTDDCYTPPAVYDYVLQYVADHCNIDGMTVVRPFYPGGDYESLVYPDNCVVIDNPPFSIIAQIVRFYLKRGIKFFLFAPHLTLFSADLDCTRIVCGAAIVYENGAKVNTSFLSNMFGEAGVIGDPVLYEGIDAICSAPKAELPKYKYPDCVLTVSDVAYIVKNKGEIKIDKREMVHHSVLDIQKKHGKKIYGQGFLISHTAAERVTAERAAVKKEAIVWELSEREMRIVEKLS
GUT_GENOME000931_03235 13-251 YEEFVEKFKSKKTTDDCYTPPEIYEIVKRYAIERYGLYGRKIVRPFWPGGDYESYDYQDGCIVIDNPPFSIISKICRDYTERGIDFFLFAPHLTNFSIRNGNHIICGNSITYENGAKIATSFMTSFGDCRAETAPKLYGELKEASRAGKNTILPHYQYPNELLTVNDLEKLCKAEIDYGVHEAEYVRILDAQKNIKKSIYGGGLLISHKMADIKRAKLNTEKLKQSPYVFMLSEREQRI
GUT_GENOME283843_01393 21-285 EYKEFEDKFKPKKTTDDCYTPDNIYETVADYVATRFKVDRNKFVRPFYPGGDYEKYNYMSDSIVVDNPPFSILAQIVKWYQSQGIKFFLFAPGLTIIGLTRHANIICVGYGVTYENGARVNTSFVTNMTDNLIESSSKLYKRLENADKENSQKIKKQLPKYTYPDNILTACRMNTLSQYGVDFAIKRENGYFMRDLDSQRKFKKGIFGYGYLISGKKTAELKAAELKAAELKAAELKAAELKAAELKAAELKAAELKAAEHVWEL
GUT_GENOME013044_00050 133-342 SEAQLEFEDKFKVKHTTDDCFTPPAVYDAVKGWVLDHYKISKDREIVRPFYPGGDYQSYDYPKGCLVLDNPPFSIASEIVNFYTSKGIDFFLFGQGTTLMQTLDRANMVVVGISIVYENGADIATGFITSLGSSKVTISASLHDAIERAQPSETMAVSAYDWPHNVISGALLGKLPKFGTEMEIRKAYATKKVGINEVGIYGGGGLVSDM
GUT_GENOME026093_00985 124-322 RDILFCLMDAGDYKAFPYPDGAVAIDNPPFSILASICTFYLEHGIPFFLFAPSLTCFGGRKVFKQMNHIIADCQITYENGARVKTSFVTSYSGGIVAQTAPDLTKAINRENDRLIKANTKELQKYDYPMNVITAAMMQRYARYGVDLKIRADDYIQVGSLDAQKQAGKSIFGSGLLLCERAAAVRWELSDREKAIIQQL
GUT_GENOME176258_01618 5-268 ETYTEFVEKFKRKLTTDDCYTPPKVYEAVRKFVDKKVYPLKGHEIIRPFYPGGDYENYNYPADCVVLDNPPFSILSKIIDFYIKHNIKFWLFAPSLTLFEAIRKSDTITPVIANSKIVFENGANINISFVTNIWPGNPAFIVEPCLHRYLEMVQKKNKPSKKRPKIAYPTHVVSSAILGKLATHGVGLICPRNEAVFIRKTDSGKAIYGGGLLISERMAAERMAAERMAAERMAAEWMAAERMAAEWMAAEQICLSERERAIVD
GUT_GENOME044077_00758 136-376 KDRDYEEFLDKFKPKHTTDDCFTPSKVYDVVLGYVKDTYGIDAKREIVRPFYPGGDYRSYDYPEGCVVVDNPPYSINSEICRFYLDKGIDFFLFADARTVMNGKERRLKIVLCGIPIVYENGAEVPTCFLTNMGSDDITVSGTLYSKIEAVNPKETKDLNAYTYPDNLITSMTISKLAKYGIDMEIPRAEPVRCIGDYNVFGGGRLISDRMAEKVKAEKVKAEKVKAEKVKIPIRLTEKEM
GUT_GENOME260574_02223 147-402 DEEYTEFVEKFKPKKTTDDCYTPDNIYNVVANYVSDEYGKDKNTFVRPFYPGGDYENYPYKENDVVVDNPPFSIISKICSFYKKRKIPFFLFAPTLTVMGIRDAQKIICGIALEYANGAQVNTSFVTNIDSVEFKSCPLLYKQLKAEDDINRKNKHTQLPKYEYPIEVVTGTMIAKYSKLGINFSVKSESCYFIQGLESQKKNGASIFGSGYLISERAATERAATERAAAETAAERAATEKWQLSEAEKEIIKNLL
GUT_GENOME219077_00497 189-460 RSDEYEAFTDKFKAKKTTDDCFTPEIVYDTVKDWAISHYKLGDAQILRPFYPGGDYEHEDYPEGCVVIDNPPFSILSQICRFFDEHGIRYFLFAPALTLFSINAGKSNYVPVSASVTYENGARVNTSFVTNLGGWRVEISGELFSLIDEADKRNRGESRIEPPGYIYPRNVLCVQDFDLAKHGQSLCFSDEDLQFTRALDAQKEKGKAIFGAGFLLSEAAAAKKSKAEEAALEVMSARFAAIFESQQNSRMSTDGKIIWPLSDREKALIKSL
GUT_GENOME199415_00648 1-249 MEKFKPKKTTDDCYTPPMVYEAIKEWACEHYGIDRQKIVRPFYPGGNYETYEYKDGEIVLDNPPFSILSKITDFYLRQNIHFFLFAPELTLFNLMKREKVNAVVADAKIVYENGAVVKTGFVTTLGDYAIEGNAELNRKIKEAMKEIKSNEVRLPKYKYPKEVLMVNDVKKLVNKGISIKIERKGISFTRGLDSQKIKGKSMYGAGFLLSKEEAAKVEAAKVEAAKVEAAKVETTKAEEYTWELSEREK
GUT_GENOME101869_01677 23-270 KEKFLDYDGFVEKFKSRKTTDDCYTPPELYEIVKVWVDKNIIPLNGLRVLRPFYPGGDYQNEVYLPGDIVIDNPPFSILAQIRRYYSNRGIRYFLFAPSLTLFSSLQSERECFIVSHADITYENKASIKTSFITNCVNDGTRIWVAGDLSRMLTKKNKELQKAAATELPVYSYPDEVLSAAIEGKIAARGISYKVHRDECRPIERLDSQAASGKSIFGKGFLLSEQATRRKKVIKWELSEREKHIIQR
GUT_GENOME121215_00141 179-392 EEYDQFVAKFAPKLTTDDCYTPPAVYEAVKAWAVKEYGLEEMAVTRPFYPGGDYQNEEYPEGCVVIDNPPFSILSEICRYYRAAGVKFFLFAPALTLFSVGAGEFNYLPLGVDVIYENGANVATSFVTNLGPYRIDTAPDLYRMATQAVAESRNMGPGNPGYIYPDEVVTASTIMKYCAGGADIKIKNVHFVRALDAQRAERKAIYGGGFLISK
GUT_GENOME064275_02002 15-275 NNYEGFVEKFKPKKTTDDCYTPSYIYDEVIGWLIDNGHIDNTQKIVRPFWPGADYQAADYPDGCVVVDNPPFSILANIKRWFQDKGIRFFLFAPHLTLFEAYSPEHTYIVTNANIMYENGATVSTDFVTNIPSFSGCGIMCASSLRERIIQVQNKQTKKLKNPKYAYPDNLITTSVIASLLKGGKDIVIPHAELSYTRRLDDQIHTKKSIYGSGFLCSNRIAQLIKTEKEYAYITDIENKKAECIEKEKQTIRFSLSEREK
GUT_GENOME285291_00218 31-247 VFHDYESYVAKFQNTEKTTDDTYTPQDIYEVVLDYVRSIYPMEGKEILRPFYPGGDYEHAEYPEDGVVIDNPPFSMFTKICKFYSENGIPFFIFGPGLTIFSCLKYCSAVVVAPQIEFSNGAKVKCNFATNLIGDTLVTISHELSEAIAACPSQKQKVNPPKFRYPQELLSVSDLQTMAKGNLPFSVNRNEAVIVRNLDNHPKKGGLYGDHLMISEA
GUT_GENOME220965_00910 152-359 EEYDEFCEKFVPKKTADDCYTPEPVYEVVKEWAVNRYNLSGKKIVRPFYPGGDYENENYDGAVVIDNPPFSIISEICGFYTQNGIDFFLFAPGLTLFSTYAGRAKYVTGTNITYENGAGILTGFVTNLGENKIEIAPELIEKIEEANKIETKQTKKKDYPKELITPAIYKLAKRGQALKIKPEEAYFVRKLENQSETAIYGGGFLVSE
GUT_GENOME243728_00095 11-262 YDGFVEKFKPKLTTDDCYTPLQIYEIIKTWAVEKYGLQGRTILRPFKPGGDYEAEEYPEGCVVIDNPPFSILRKIKDFYIANHVDFVLFAPTLTLFSSTDNVVNYVQVNAPIRYHNGAVVNTSLVTNLGDNLIETAPDLRRRIVETQKKSPIKANAASRKYEYPEHVISSGLLDKYVKHVQPFVLKRSEAYFVKTLDSQRATKKGIYGGGYLISSNAVKRLRECLQSPLSKGCTSTERPTKWELSGREKQII
GUT_GENOME037508_01886 156-393 KFKPKRTTDDCYTPPEMYDVIADWAVSEYGIDRKKIVRPFYPGGDYRSFDYSNGKVVLDNPPFSIMSEIVSFYLDEGVPFFLFAPGLTGLSSKKTVMRCCHIFPDAAITYENGATVRTSFITSYDEGTVARTAPELTESIKEVAARLAKEGKTELPGYVYPDNVVTAAMLQKYSKYGVELRIGAHDCMHISELDEQKEQGAVIFGGGLLLSERAAAERAAAKEWSLSEREVALVESLG
GUT_GENOME135530_00152 162-419 EDYQAFLEKFEAKKTTDDCYTPDNIYDAVRDWVAEKYEIGNAAIVRPFYPGGDYKSEKYPSGCVVIDNPPFSIISEICEWYTSKRINFFLFAPTLTLLGIMRGSANYVACGCGVVYENGASVNTSFVTNMGGNKIVAAADLREILDDENKKNLKKLHRELPKYSYPDEVLTATMLCYMAAHGVSLEISERDAHFIRALDAQKASGKGLFGSGFLLSEKAAAEKAAAEKAAAEKAAAEKVKTDIWELSEREWAIVRGLG
GUT_GENOME236868_00419 27-248 RDPDEYLDFLKKFDAKHTTDDCYTPVAVYNVVADYVSGRWGIPRKNFVRPFFPGGDFENFHYTAGSVVVDNPPFSIISKIVKLYNSKGVNFFLFAPSLTLFSNGFDRRTCIICNAKVIYDNGAIVNTSFVTNLPCENVIEVLPGLKCEIERAQKIEKKRMRKLNFPQFVTNSARLGKYACNGIEMKFRREDVRSIRKFGGVQIFGGGAMISPQCAAAMEMAN
GUT_GENOME180786_00076 34-261 DYEGFMAKFEPKRTTDDCYTPPRVYGAVLGWAEREYGLAGRDIVRPFFPGGDYERHPYPDGCVVVDNPPFSILAKIVRFYMGRGIDFLLFAPALTVFGLLRTEGVCAVCAGATVEYENGAQVNTSFVTNMDRWRARSAPELNEAVARAVAETRRERRRSLPRYEYPDEVLTAARLNYYSNHGTDFRVGAGDFERISAMDAQRHYGKSIFGGGVAAERALRGREGSGRE
GUT_GENOME115250_00411 22-274 AIHADYEGFVEKFKPKLTTDDCFTPPAVYESVSEWLRRNAPVGGRPIVRPFWPGGDYERFDYPPDCVVVDNPPFSIISGIARFYQRRGISFFLFAPHLTLFSTQGIDWTCLPTGAPVTYENGAVVNTSFISNLFGNLRIWGPSELRAAILAADKQATKTAIPKYEYPNNVITVGRVGKIIKRGIDVRIDGRSLRKIAALDAQRRCGKGIYGGGFLCSDAVADDLAAKDLAAKELATKREAIAWKLSEREMKII
GUT_GENOME008801_00703 144-395 KFKPKLTTDDCYTPEDVYNAVKEWAIRKYNLLGMNVIRPFYPGGDYRNEEYKENDVVIDNPPFSILSEICNFYMERNIAFFLFAPNLTLFSTASGKCNYVIANAGIIYENGACVSTSFITNLGDEKIITAPDLRENVLDANKKETDEKPNYDYPANVLTAARLGKIVTAGAHISIKNAKFIRQLQVQKREGKSIYGAGFLISDIAAEKEAAEKEAAEKEAAEKKAAEKKAAEKEVIIWELSEHEREIIQELN
GUT_GENOME273110_01292 164-372 GENYNEFVENFKPKKTTDDCYTPPEVYDVVLKWVRDEYSIDENRPIVRPFVPGGDYINYEYPQDCVVVDNPPFSIMAQIVDFYLERKIDFFLFANGLTLFGYGNREVNLVCVDCSVIYENGAKVNTGFVTNMGDYKIITAPKLKEAIEFVNKPKTKEITTYQMPDEVITAARINFANVEFKLTKEQCHFVRTIGKEKTQLYGAGFLISK
GUT_GENOME212983_00422 5-243 ESYEEFVQKFEVKRTTDDCYTPPAIYEALLAYIDRRVMPLAGVRVLRPFYPGGDYEREDYGPGDVVIDNPPFSILSKILRHYQARGVRYWLFAPHLTCANDAASRGTVVVCCADIKYHNNAGIATSFVTNMWPGNPAAVVAGDLAAELTAAQAVARPDKRKARMAWPDVMTSPALLGKLCDRGIVMEIPRSESAYVGKCGGRRIFGCGWLVSHRVAEKLRRAEEDARPRAEALSLTREE
GUT_GENOME028719_00139 19-274 RREDVINDYDGFVEKFKPKKTTDDCYTPVPVYSALVKYIDENVQCLNGFNIVRPFYPGKDYKAYDYKQNDIVIDNPPFSILSKILDFYIAHDIKFWLFAPHLTLFQYSARICTLVICNTQITYENNANVNTSFVTNLYPKEQLVNVNGELHSIIAEANRQRPKRNLSRYRYPENVVTSAELGTYIAAYGKSFIIKRDEAARINKLDCQQGNKTIFGSGLLISDEAAKAKLAVEAEKVVDSRQKVFELSPREKQIIA
GUT_GENOME027533_01216 8-227 FNDYDDFVEKFKPKKTTDDCYTPPEIFAVVLDYVRERWGVEEADVVRPFWPGGDYEAFEYPEGCVVVDNPPFSILSKIQAFYLERGIGFFLFAPSLTCLSGKRCMEVDHIVCDCSITYENGAVVRTGFVTNLDGENVLEAEPELGRRLNEANDARLKATRKAVAKYEYPDDVITAARAQWLASHGTPYRVRRRDACRISALDEQRERGKGIFGGGLLLSR
GUT_GENOME281351_00397 8-254 YDAFVKKFQPKKTTDDCYTPEAVYGVVLDWVERNAVVEFGDVVRPFYPGGDYEHAYYPDGCAVIDNPPFSVLEKILAFYVTKGIRFFLFAPALTLFSGKTRRRLTLVSAFASVVYENGAVVRTSFVTNMLDPDIALMTAPDLKKGVEAACKKETKKKPKMAWPYNALGVQKVHKIASVPFVIRRNECEIVSSLKDSKIFGGGLLVSDRVAAELRAAELHAAELHAAELRQEEEIIKVELSPENRRII
GUT_GENOME074256_00821 153-401 EYQEFVDKFKPKSTTDDCFTPEPVYDAVRDWACAEYGIEPGSIVRPFYPGGDYERFEYPDGCTVLDNPPFSILAQIIDFYLERGVRFLLFAPFLTLFGCGNRDVTFIPANCQVTYDNGAVVNTGFITNLDADNRIVSSKALRDAVRDADGAQRRADVFQRPHFELPANAVNAARFRHGSDIRIPKDQCSFTRALDSLKEQGKALFGGGFLISDAAAEAKAAAEAKAAAEAKAAAYMEFELSEREQAIVA
GUT_GENOME261477_00468 45-285 YKLFVDKFKPKKTTDDCYTPPEVYEVIKQWVLEEVPQIRELRIVRPFYPGGDYKAEDYTGAVGIDNPPFSILAEIKRFYLDRGVPFFLFAPALTLLSGASERLSYIVANCAITYHNGAVVSTSYVSNLFGDTALILAPELARRVKEAQQAAKAPNAQPRYEYPPNVVTGATLNKYITRGVSLRVPREEVHFTRALDSQRKVGKGLYGSGVIVSDRVAKILQSAPKEPPKEPPTCWELSDRE
GUT_GENOME129638_01596 149-363 YQSFVEKFMPKKTTDDCYTPENIYAVVKDWAVDRYGLSGAEVLRPFYPGGDYKAVTYDENAVVIDNPPFSIISEICDWYMQNGVRFFLFAPALTLFGVGRGQLNYIGCGAGVTFDNGANVSISFVTNMGNCAAMSAPDLRKKIDAANEQNLRSVRRSLPKYSYPLHVLTSAMLQYFSAHDVSFCVKVGELAFIRSLDSQASAGKTVFGSGFLISE
GUT_GENOME187657_00870 11-270 NYEDFVENFVEKFKSKKTTDDCYTPPLVYEAISDWVAKEYNLDKSTFCRPFYPGGDYENYDYSGKIVVDNPPFSLLAKILDFYTRNKIKCFLFAPTLTIFSNKRSYNYTTILCGISITYENGAVVNTSFITNLDDPALQIRTAPTLYKVVKLADNKTLAAIKKQLPKYSYPDNVITSAKLYPFAKYGIDIKIKKSQCHFIRALESQRAKKKAIFGAAFLISDSVKAELQKAELQKAELQKAEATETNNWELSSEELLIIK
GUT_GENOME180931_00227 173-438 DEEYQAFLAKFQDAKTTDDCYTPPNIYEAVVSFVVKTYGVKEKDFVRPFYPNGDYQNENYPHDCVVVDNPPFSILAEIISFFEKNKIKYFLFAPTLTLFSSSSSSSALPIAASVRYENGANVNTSFLTNLEPRNIRARSCPELYALMKKANEDNLRKQQKELPRYEYDKHVVTSTMVAQFSRYGIDFVVPRDESERIGGLDSQKKFGKGIFGSGFLISDRVKAEREKAEREKAEREKAEREKAEREKAEREKAERWELSQKELEII
GUT_GENOME242000_01123 21-276 KNGAKRTNDECYTPPSIYEIVRDWACREYGIDPARIVRPFYPGGDYQAFDYSGGAVVVDNPPFSILAGICDWYLARGVPFFLFAPSLTVFSTKRRLGSMNCVIVDCDIRYENGTLVRTSFLTSFGDGIVAQTAPELTEAINREIRRLEGKETAPRPKYAQPDEIVTAAMLQKYSANGIEWKIRREDCVRIRGLEEQGKKKCPDGRHKREIYGSGLLLSEQKTAEHTAIKRAVAARTAPPVWPLSDREKALVKSLGR
GUT_GENOME235595_01536 170-430 EEYQAFCAKFELVKTTDDCYTPAIIYDALCAWLECEYGLDRAKFVRPFYPGGDFERYDYPEGCVVVDNPPFSILSRILKFYTDRGIPYFLFGPTLTLFSSSSSSSCKIIAGVIVTYANGAKVNTSFYSGLPQDRHLAAKSSPSLYKAIKEANDANLKKMHKELPKYSYPLSVVTAAMLLPYARLGIPFEMPKDECVFIRKLDAQTGSGIYGSGLLISERLRAEREKAEREKAEREKAEREKAEREKVTVWELSDREREIIK
GUT_GENOME073435_00175 152-401 YLAFVDKFKLKKTTDDCYTPELVYEAVADWVSEEYGVDCNSMVRPFYPGGDYQQFKYPDGCCVVDNPPFSILAEIADWYIERDIPFFLFSPSLTAISSKNNRCTLCVGAQITYENGADVSTSFVTNLDDCLVRSAPTLYEAVKNANAENLKDKKKELPKYSYPMEVLKATDVNQLSKYHIDLRIFKEDAAYTNALDSQKEAGKVIYGGGLLLSEKAAAEKAAAEKAAAEKAAVQVWELSEREIEIIRSLG
GUT_GENOME221595_01634 7-273 NTYDEFIDKFRVKKTTDDCYTPDNIYEVVADYVADTYKLDKANFVRPFYPGGDYENYDYQDGDIVVDNPPFSIMSKIIRFYTEKNIPFFLFANALTIFSTISSSACMICLGVTLEYENTAKVSTSFITNLESGGVRSDPKLYAALQEANKINASRLHRSLPKYTYPSNVLTANDVIKMSKYGVDFKISADDYICISQLDSQRQKKKTIFGKGLLLSNAAANAAANAAANAAANAAANAAANAAANAAANDITWSLSDRELAIIDALD
GUT_GENOME193172_00553 7-224 YEELVESAKPKLTTDDCYTPENIYNVVRDYVAERYGLDPETFVRPFYPGGDYQAEDYTGKVVVDNPPFSILRKIMDFYNENGIKWFLFTPGMSTVIGTNYREKMHICLGAAVTYENGATVRTSFATNLEPAGIRTDPELLNAINAANEENRQKAKKIKNMTSWNYPPQLLTAAKMNYLANYGIDISFSLDDLHRINKLDNQPANKSVYGGGYLLNRSA
GUT_GENOME088630_00490 12-265 ELFNDYDGFVAKFAPKLTTDDCYTPPQVYNAVVEWACHEYSFDRSAIVRPFYPGGDYENFYYPDGCVVVDNPPFSILSNIVKFYLAKDIRFFLFCQGTTALQAKGRVCAICVGAQLTYDNGAKVNTSFLTNLEQGIRARTAPKLLKMIVEANKLSKTSNPMPKYEYPNEVVTASALSLLSKRGINYSITSNDCLDIDKLDHQKDYGKAIFGDGLLLSERAAAERAAAERAAAERAAAQHWQLSAREIELQQSLG
GUT_GENOME248157_00872 75-310 KKRWPSSRRYTSQPVYDAVLGWLRENVDIEGREIVRPFWPGGDYEHYDYPDGCVVVDNPPFSIFTKVCRFFQSRNISFFLFAPHLTLLSPVGMNWTGIVCDARVTYENGACVDTSFASNLFGDIRIMTAPDLLARIKNAAKTKRRLMDLPKYIYPDNVVSAALLGKIAPYVKFEVRASECRRVCKLDNQKGGGVYGGGYLLSDKATARKIEAYEQVAKRAAVIVLELSEREKQIIA
GUT_GENOME233021_01967 6-220 KLNDYDGFVEKFKPKRTTDDCYTPPAVYEAVLGWVMEHFNIDPSRPIVRPFYPGGDYEHYRYPAGCVVIDNPPFSILSEIKRFYLERGISFFLFAPSLTLLSGKIEAMTHILTDSSITYENGAQVNTAFVTNLPSPNVLMTAPDLAQRIKEAEAQRAKAERPVLDFPENVITAARLCKIASYVSFEVPRAEARFVRTIGKERKALFGGGLLISSA
GUT_GENOME206071_02511 10-253 NRNYDEFVEKFKPKKTSDDCYTPPHIYDVVLQYARERCGIPEDARIIRPFYPGRDYQTDDYSGNCFVVDNPPFLILSEIIRWYNDRGIRYFLFAPHLTAFSSKVEACRIIVHEDVVFENGARINVAFITNLLPGIAVIGSPDLKNRIKNAMSNREKKTLPKYDYPDEVLTVSRVASTLSCGKEFIVYSDEIHHISHLESMGKRSIFGGGYLLSERKALERKALERKAAIEGIKFNLSPEEQNIV
GUT_GENOME025300_00267 14-266 EVLEDYDGFVEKFKPKKTTDDCYTPQEVYDVIVEWLTEKGAIDANTPIVRPFWPGGDYEHEKYDEGCVVVDNPPFSILSEITKFYSERGIGFFLFAPALTAASSNICEAYTIIVIGGSIKYENEAKVATAFVTNLPFCAQYAMMTAPELGERIKCVQKSGGVKMPSYKYPDNVISTAILNGFANVDLKIRRKEALRIKALDAQRVKKKTVYGGGYLVSDYAAARLKAARLKAARESVVWELSERERTLISRLN
GUT_GENOME031810_00288 3-248 KSNTYEEFVDKFKPKKTTDDCYTPDAIYEAVKDWAIKEMDWGGRTIVRPFWPGGDFENFDYPADCVVIDNPPFSIISKTVRFYEEHNIDYFLFAPHLTCLEIRAAHSHICVGVSVTYDNGAKVNTSFVASKGPLIRSAPDLYRILDEANAANIKGKKAPPRPVYTYPDNVLTSSAVALFSKYGIEYREDIGVFVRAMDVQREVGKGIFGSGYIVPEEAARKAQEAARKAQEAARKAQEAARKAQKE
GUT_GENOME130287_03027 5-253 ETYEEFVDKFKPKKTTDDCYTPPEVYDLIVDHVNRHVMPLEGHRVLRPFYPGGDYERAEYLPGDVVIDNPPFSIFIKIVRFYQRRGVPFFLFAPALTCGSCFSVPGVTAICTHVSVIYENGAKVNTSFVTNMMGGTHKMIVDGPLCRKLEELQKKDKPDKRKSSLSYDPHLITPAIAGKLAKRGIYLELSSDQCIPVGHVGECQYKVFGSGLLLSDRLATERMAAERMAAERMAAERMAAERQATRERI
GUT_GENOME259222_01126 15-268 RRHEDYGAFVKKFELKKTTDDCYTPAAVYDAVLEWVAEVVDISGREIVRPFWPGGDFENFDYPEGCIVIDNPPFSIISKIARFYMAHGIRFFLFAPHLTLFSAINIEWTCIVCDCTIEYENKAKINTGFISNLFGACRIMTAPVLCRKIKEAQAKIKEERAKYRNIPKYDYPANVVSSALLGKIASHVEFRVMADECVQIKKLDAQQDKAIYGGAFLLNEAKAAEVKAAEVKRSQKAIVWELSDREKKLIKSLG
GUT_GENOME237056_01307 6-217 YKAWCAHFDDRPKTTDDCYTPRAVIDFVEDYFCDYYNIDRKKIIRPFYPGGDYEKEDYTDKVVLDNPPFSLLTKIVQFYQKHGVKFVLFCNGTTGTTKMYGLLTYHIGGTVIYENGARVNTAFVTNMNETEAIVSDRNFSDKLEKLTNKATRQKCKYEIDEIMLATVKKCDWVLLKSEIIEDIHKTRKDAFGIGYKITKEAGIRRNEAEREA
GUT_GENOME017893_00311 6-250 QNYDEFIEKFKPKKTTDDCYTPPPVYEAVLDWARKHLDIGDRPVVRPFYPGGDFEHFDYPDNCVVIDNPPFSIFSKICNWYVERGIPFLLFAPAMSSIRQNVTYIGVSCTITYENGANVNTAFVTNMMGDIICTTAPDLHESVKKANDDNLKQSKKAIRKLSFPACVLRATTLHTMSRAGVDFCIKREQGCVVGQACESKNGEFGNSILLSDTATAKKLAAEKLAAEKLAAEKLAAEKLAAEKLA
GUT_GENOME206936_00062 6-219 TYEEFTAKFEPKRTTDDCYTPPEVYDCVLRWAHREYGFDLAKVSRPFYPGGDYEREEYPQGCTVVDNPPFSILSKIVKHYQERGVGFFLFAPTLTCMGIRNCCKVVAGVGVTYANGASVNTSFVTNLDPAQARSAPDLRSELDAAIERLRREKAKALPKYEYPDEVLTAPMLARYSKYGIDFRVWPQECSFTRALDAQRVQNKAIYGSGYLISE
GUT_GENOME262013_00629 12-253 EKNKEYQEFVDKFKPKKTTDDCYPPPPVVYEAIKDWAVQDRHWEGRPIVRPFWPGGDYESYDYPDGCVVIDNPPFSCISKIVHWYEDKGVDFFLFAPHLTCLGIRARSTIGINADIIYDNGACVSTSFVCSSGPRLRTAPDLHNIIEAAAKKARKEQKDVKHPPRYKYPPEVITASHLAQLSKLGIDYQEDRVQFVRHLDSQHEKKKAIYGSGYLVPSQPIRQVMELERRTEWALSDREQAI
GUT_GENOME015185_02652 7-221 EYDAFVAKFKARNTSDDTFTPENIYNCVRDWAVRKYRLEGAAILRPFWPGGDYRAADYPEGSVVIDNPPFSILSQIVRFYNDRGVRYFLFCNGLTALNLVRDRRAGVVAAGVSIRYDNGASVATNFVTSLSGDVLLEAAPDLHDALKRVNDANLKATRRRVRRLAHPYGTVTSAGLNYLAIHHVPFKVARRDALFVHRIDCGVTLFGGAFLLSPR
GUT_GENOME096194_00159 8-264 HEDYDGFIEKFKPKKTTDDCYTPPEIYETVKNWACEYYGINPDNIVRPFYPGGNYEKYEYPKGAVVLDNPPFSILSKILDFYIEKGIAFFLFAPTMTMMSSMRNRDVNAIIVDCAIEYENGAKVVTSFLTSFGDFAIEANPALNDAINNTMKKIKKESKKQLSKYHYPSEVVMVNDLKYLAKNGQNFKVKRKDIYFIRQLDSQKEKKKSLFGAGFLLSKKAAAEKTAAEKAAAEKAAEKAYIWELSEREKKIIENLG
GUT_GENOME097426_01339 51-277 SKTYEEFVEKFKPKKTTDDCYTPSEIYEVIKDWVCKRYNIDPENVIRPFWPGGDYEKDEYPPGCVVVDNPPFSILKNICEFYLERGIPFFLFAPSLTALSGKTTWDRMNHIVCDCTIEYENGATVKTSFITSFEPETVAETSPELTKLVNDTTEKLRQEKTRKLSKYDYPDHIVTAAMMQKMARYGVHFRVRREECQHVRSLDAQRAMKKTIYGAGLLLSDQAAARK
GUT_GENOME261541_02090 196-457 FTDKFKRKLTTDDCYTPENIYAAIRDWAVEHYGLGDAQILRPFFPGGDYEHADYPDGCVVIDNPPFSILSKICRFYMEHNIQFFLFAPALTLFSIASGACNYLPIGADVTYENGAGINTAFVTNLGGLKIETSPELYAKIKAINDKNRHEAVEELPGYIYPQNVITPATIIKTGIREPLRVCQKDAVFIRALDAQKAMNRAIYGGGFLLSEKAAAEKAAAEKAAAEKAAAEKAAAEKAAAEKAAAHIWELSDREKALVKSLG
GUT_GENOME239682_01246 9-224 DDYGAFVDKFKPKKTTDDCYTPPAVYETIKDWACREFGIDPSKVVRPFYPGGDYERFDYSGGAVVVDNPPFSILSKICTFYRTEQIPFFLFAPYLTIFSSTSRNGAHMIVTNSTIEYANGAQVNTSFVTSFGDDLIRTAPDLANAIDETVKRVRKEQRRHPPKYAYPRELLTVSRLGKIGRQVEFCVKASDVAFTRALDSQKAVKKAIYGGGYLLS
GUT_GENOME020495_00199 15-281 NPEYDDFVAKFEPKKTTDDCYTPPLVYDAIRDWACEQYGIDPGSIVRPFYQGGDYEAFDYPDGCVVLDNPPFSILSQICDFYLNRGIRFFLFAPSLTAFSGKSVAMRMNHIVCDADIIYENGACVKTAFVTSYGGDIVAQTAPDLGAKIAEAVKKIKAMTTATKPKYIYPDHIVTAAMLQRYSKYGVNFRVRRGDCVLIAALDDQRRAGKTIFGGGLLLSEKAAAEKAAAEKAAAETAAAEKAAAETAAAETWRLSPREKAIIATLG
GUT_GENOME093639_01866 19-261 TGETDEKYEAFVEKFKPKKTTDDCFTPPEVYEIIKNWVMKQYNIPQTAKIVRPFYPGGDYKNYDYPKDCVVIDNPPFSLISQIVKEYNRRGIKYFLFAPTLTLFSTAAGKTSYIVAGAKITYQNGANVNTSFVTNLEKCKVRIVGEINELIKNTKISKKKPKYEYPKNIISAAILQKMCRYDVDIKIPEKETFFIRKMEKQKSGIYGSAFLISDQAAEKIKEEMTRIENIKQWELSEKEKDII
GUT_GENOME092891_01654 10-226 SEEYKVFVDKFKPKRTTDDCYTPPNIYETVKEWVFEHYNLDKSTKVIRPFYPGGDYEHAEYPENSIVIDNPPFSILSKIEKFYLAHGIRFFLFAPGTSCFKPYNGLHFVCVGCSVTYQNSANVNTSFVTNLGGHLVETAPDLYRRIKAADTKNVKAQKKQLSKLKFPPQAATSAQLNQLSAKGQYFTLDEKEVFFTRTLDNAKKSVFGNCFLLSEKA
GUT_GENOME175198_01099 7-259 YEQFVEKFKSKKTTDDCMTPSEIYEVVCDYVCNRWDISPTRVVRPFWPGENYETYGYSKDSVVIDNPPFSILSRIIKFYLKENIPFFLFCPSLTALSDGTDCNHIICDLAITYDNGAVVRTSFVTSFGSPNVMESCPELTRLVNNKMDGIMKAKRQAIPTYDYPDELVTAAMVQRYAKYGIHFSIARDECLLVRSLDAQKAVGKAVYGAGLLLSKGKAREKAAAERAVAKQAAAKQAAATTWELSEREKMLVA
GUT_GENOME236889_01125 18-204 KKNNLFGEYDDFVEKFETKLTTDDCYTPDEVYSAILDWIGENYDLSGKNICRPFYPGGDYQAEVYKPNDVVIDNPPFSILADIVDYYDAHKIKFFLFAPELTAFMRNENVCNIYAGQNITYHNGAKVATSYISNFWDDLGIWVCAELGRRIKEAQEANKKYQALPKYSYPKNVVTSAMLGTLASKGG
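
Each row of the list above carries three fields: a protein identifier, a residue range, and the amino-acid sequence. The following is a minimal Python sketch for splitting a row into those fields. It assumes the identifier always has the form `GUT_GENOME` + six digits + `_` + five digits, which is inferred from the rows in this list rather than taken from a documented format. The names `ROW_RE`, `SeqRow`, and `parse_row` are hypothetical helpers introduced here for illustration. Whitespace between fields is treated as optional, so the pattern handles rows whether or not the fields are separated.

```python
import re
from typing import NamedTuple

# Assumed row layout (inferred from this list, not from a format specification):
#   identifier: "GUT_GENOME" + 6 digits + "_" + 5 digits
#   range:      start-end (integers)
#   sequence:   uppercase one-letter amino-acid codes
# Whitespace between the three fields is optional.
ROW_RE = re.compile(r"^(GUT_GENOME\d{6}_\d{5})\s*(\d+)-(\d+)\s*([A-Z]+)$")


class SeqRow(NamedTuple):
    protein: str
    start: int
    end: int
    sequence: str


def parse_row(line: str) -> SeqRow:
    """Split one row of the sequence list into its three fields."""
    match = ROW_RE.match(line.strip())
    if match is None:
        raise ValueError(f"row does not match the assumed layout: {line[:40]!r}")
    protein, start, end, sequence = match.groups()
    return SeqRow(protein, int(start), int(end), sequence)


# Example row; the sequence is truncated here for brevity.
example = "GUT_GENOME131345_011905-200KIERPFYPGG"
print(parse_row(example))
```

Fixing the identifier's trailing number at five digits is what makes the split unambiguous when the identifier and the range start run together; if other entries use a different identifier width, the pattern would need adjusting.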