UHGP-MC 226


Information


Number of sequences (UHGP-50):
66
Average sequence length:
342±26 aa
Average transmembrane regions:
0.03
Low complexity (%):
2.98
Coiled coils (%):
0
Disordered domains (%):
2.51

Pfam dominant architecture:
PF13809
Pfam % dominant architecture:
606
Pfam overlap:
0.6
Pfam overlap type:
extended

Downloads

Seeds:
MC226.fasta
Seeds (0.60 cdhit):
MC226_cdhit.fasta
MSA:
MC226_msa.fasta
HMM model:
MC226.hmm

Sequences list (filtered 60 P.I.)

Protein Range AA
GUT_GENOME137702_003685-360FFVLGIGGTGMRCIESLIHLCAMGMFDDTEIHLLALDTDKNNGNFSRLKEVKEAYCNAKGSKEADRVALSNTFFSANLKYYEFSPNYEQKSTFRAVFGYDDTHYHNREEADLADLVFSNNVQNFNLRHGYRAQTHLGSMMMYHSIIEAAQSARDNDLKRFLSRLTTAAQNGQPRVFILGSVFGGTGASSIPIIPQAISRAAGIMSNGVANVLSNAYFGSTLLTAYFTFKAPSGSELSNQKVIATSDKFALNSQVAMMFYNDDKTVASTYQKFYMLGTSGLDWDPMQKQSDRITETITGGEQQKNDSHYIELLAACAALSFYRMDESQLRDNKTQNQTDYMYRAISDTGKLDFQDFV
GUT_GENOME139829_014971-359MAKLYIFGIGGTGSRVLRSFTFLLGTGVKVAGFNEVVPIVVDADSTSGDKTDTVQLMENYQKLYQATDHTTAADPKGFFSTLMTSCCKNGFTLSIPGGTAQKFKDYMHFQSLNNTNQNMVKMLFSENNLEADMSVGFKGNPNIGSVVLNQFVGSEDFAEFANSFAEGDAIFIISSIFGGTGASGFPILLKNLRANDTAAVQNASLIAQAPIGAITVLPYFILNNIDPNRNDQIDSGTFIQKTRAALKYYERNLSELDHMYYLGDDMRPSYENHDGGALQKNKPNFIEFVAALSIFDFANQTIGAEPHNANSHTVYHEYGIKNDVESFNFDNLSDQTVSLVRKPMTKMLLAHNYFTQALD
GUT_GENOME152761_0328838-391VMSKIFVFGIGGTGSRILRSLAMMLASGVKFGANEIVPIIIDPDVANADLTRTVSLLNNYTAIREKLQFSNDNRSRFFHTEIERILPNYTLRINDTDDKSFQQFIEYASMSKPNKAMTKMLFSDKNLESSMEVGFKGNPNIGSVVLNQIAHSADFNDFANSFSDGDRIFIISSIFGGTGASGFPLLLKTLREGKHFPNYDLINKATIGAVTILPYFKLKPNDESEIDSSTFISKTKSALAYYENNISKNGSIDALYYLADDVVSTYDNHEGGSAQQNAAHLIEFLGATAIVDFSKSKFDTPANMELGIKETSGAVSFDSFYRKMYEMLYYPLVQFTMTANALTQKLDVYRSSGF
GUT_GENOME158514_010651-320MRKVKIVNFGGSPLRGSICLAMDCASGLLSESDMDLDLYLFDKDVNSKTYTGSRLFGNLYQQLHEDAPGAFPVSLNRKEEAFEEILKRAGISGKDECILSMLSSGNKGMLNPAEQNILDICFTRDEQKKELEGGYYGKANIGSVTDEVLRCYNLYDETGVVTDIQNELDHGNQVDVVIVCSSFGGMGASLGINFGKYLSLRFQSSRSQLKLHAVHIQPYFSFPEPEEDDRWQIRCNEFYAKSADVIGAYAMEEDLICDTEQTEQMYVFDRYYYLGQAVLDQTTDINAPKDRQDNRLHLIDMLVSLAALHAVAGETEPGRQ
GUT_GENOME274452_007321-367MSKLYVFAIGGTGSRVLKALTMLMASGVNVPVDKIVPIIIDPDAAGGNKTDTVNIMDLYRKFHGNIDYNDPNATFFQTSLEKLNFPSGDYVLPLGGQTNQMFNNYMMFSSFSEENQALVNMLFSDENLNSDMTVGFTGNPNIGSVVLNQFAQSPQFTTFANDFNEGDNIFIISSIFGGTGASGFPLLLKNLRHIDPNLHIPSAQAIMDSKIGAISVLPYFGLTPGKKTDPDSDTFMAKTKSALAYYDNNLNNIDALYYIGDPAHQSYKNVAGGANQKNKAHFIELASALALIDFAKNASSLEGNTIYKEYGIKNDTRGITLTDLEGVDYLAIARPMAEFTLFCQYIQNKLASSDDQKWIIQGEPNYK
GUT_GENOME117445_011664-338NLFIIAIGGTGMRCLESFIHLCALGMFDDKEINILTLDTDQANGNKQRVENLINVYNEVRDASNASKDTFFSAKLDLKHFVTPYNSNDGDTYKALRSKGIAPEYKDDNSDLAELFYNPEKIQTFNLEEGYRAQTHLGSMLMYIGIIDAATKVSRDPKKYSQYLELNAFLDKMRTATTTCRVFVLGSVFGGTGASSIPIIPRAILEAANIIFSVGLEKKVLFGSTLLTSYFKFGTADKETKSRQKVIADASFFALNSQAALNFYNNDKTVLNNYKRFYQIGWPFDPEQYGRNDLSIGGPTQENDSHFIELLAAAAAYDFFTVAEDDLNSTESVKYL
GUT_GENOME090364_013452-333KQYLLAVGGTGNKILEGVVWAAAAGVLDPGGELRMLSVDVDASCGNTTRAMQSCAHYEAVRELLDSLPYTHRGFHTPLRLRQWNMDLSRRSQSVRAQTESLRQGRLLARTLFTSAEASLAYSEGFRGHPDLGVLFFSDLVARLEEDAAQGKADELLALLREIGAALEAGEKVKVLLCGSIFGGTGAAGIPVLSRALRERFAARRDLLEMGAVLMLPYYHVPPAETEDGEEITVSSGQFLVKARTALSYYGMEGLIRQGLSDEKGLYDAVFLLGLPETHFVTTGNYSTGSQSQENDAHLLEWLAARCVGRFLATDYREREGANINCYYYQMAS
GUT_GENOME013558_000925-366YFCYCIGGTGARVAEVAAHLCAMKMIKKVDDLEPIEFIIVDKDDSCGGTTQAKETIGNISTLSGLTKTQKLDNEETIGFCNHKLNIASWNFSNALSKVCTGKENPTTGKENPTLDQVLGSDADDKLVMRAFYNKEARETDTEKGFYGKPSLGTSIFEYMLSEADQNNQDNQDNQDDQDNQDNILKPVENFLGRSTNNRAKVFIIGSIFGGTGAAVFSNLAAYMRKHFEKRSRDLLVSGVLLLPYFSFGTRDGDGDGTPLINSGDFGTKSYLALSQYAQNKHLMRRAVKDKDKDKDKDKDKDNGSFDSLYICGQDPLHVVGEYANGGQGQKNHFDLVDLIAADAMVDFFNKNFYDNAGAVTDL
GUT_GENOME110985_007342-346NLYIFAIGGSGSRVLRSLTMLLASGVETSSNIIPIIIDPDMSNGDLDRTVNTLRKYEEIRAELAFDSSYQNKFFSTYISSLNNDSNYLLPLIGTSGISFDKYLGLNAMSQENQALAKMLFSNANLSSTMDVGFKGNPNIGSIVLNQFTQSKDFLNFENKFVQGDKIFIISSIFGGTGASGFPLILKTLRTSKNTAVANAPIGAVSLLPYFNLKANPQSSIQADSFITKAKAALNYYEKNVTGNATLNDMYYLGDELSSNGYKNCDGGDEQNNDAHFIELLGALAIIDFDAKQFNSPRGNTNFHEYGLNTGQISNSILFSDISTKTNNIIKRPLSMMALLNSYLNN
GUT_GENOME131907_003884-340YVFFVGGSGSKMLEATLHAAAAGVLRHPGAAIRAVIVDLDQTGGNLKRAESLTALYGDMRACVSAAQAEAEEETDIGFSADFALYRLTAVEEQAMGGILRSTGMGNDREIARALYTDYELEMSYKKGFYGHPNVGAHFFASELPRLPAEHSFSEVIEEIRRELVKGGQPRILLAGSIFGGTGASAIPSIARYIRNLTDENPNPDARRAVGGGAIIGAMLLLPYFATTKGDGVIEASQFEPKAKAALEYYAREGVGYGENKVFNAIYMVGSQDKVVYDYCTGGQDQNNAAHLVDWLGAQAVSHFLTTNPGAYNPQEQDGHYLFNLDLTAPNAEGKTFL
GUT_GENOME259596_015401-345MGYYVLAVGGTGNKILESLVYLAAVDGLYTQDESGKTAPLPDVTMLSVDVDTACGNTTRAKRAAESYGELQRAFRQYPAEHPGFHTALQLERWSMNLSRRAMSVDKMVENHGRDQLLSRTIFSRTESALEYSEGFRGHPDLGVLFFHDLLNSLDDLRREGQPDEMNRLLDQIHAELDRDEPVKLILCGSIFGGTGASGIPAVARFLRKRFAARSDRFEMGALLMLPYYRVPASLADEDTEIVVKSDDFLDKARTALQYYGMEGMIRDGEQDENGVFDALYLLGLPPEGFVTTCRYSTGSQSQENDAHMLEWLASRCIAKFFRTGFRGGEQHHMDCYYYQLHTPSF
GUT_GENOME233810_02080150-486LCIGGTGERVMKSITMLMASGMDTCGYTVVPIIIDPHLDLDEKKNLQTLIDNYIAIRDASTVDGNGPDGFFGTDMEWIGTLDSQTNNTNKEAGEDRSYAEFLNVGNLNATSINNYFIKTLYSTENLNSKLSVGFRGNPNVGTVVLQEMLTGASWFDLLKTNCGPDDRVFIISSIFGGTGASGYPLLERKIRSTVDSDILRDVFMGAVTVLPYYGLTDPKKNNSKIDSGNFMTKAKAALTYYEHTVKSDLLYYVGETTLQDNHENNEQTQPDEANFIELVAATALFGFLQRSKPKERQYLTRAIENDVSSLNLATAGKGYASLVKCLADFRILKMLLD
GUT_GENOME092065_001657-345FLILLGGTGAKCGEIFLHMCANGYCNEEDLTILYIDSDSHNGNARNFKQLHDCYKECRAAYRIKESTVPNFFRWHVELLEGNPVDKNVEKFRDLAAASGGESDRSAIDLMKALYSEEEMNMKISEGFFAHPNVGAAAFAANMEQVMKDLLERVYAVKTMEGKIKVFILGSIFGGTGAASLPTIARYLRKKLFDESDNKLVREQMKVGGCMVLPYFLFTKKDEDGKLIGAGELSIEADKFATKTRSALEYYKDEEKMNENGIFDELYILGHDGGDIRGKYSTKGSDQRNLPHITELYAAMSAVRFFKNDLQERGHYFAVIPDQKIGWADVCRQNGAGFLN
GUT_GENOME200258_013824-358YFISIGGSGAKVMESLTHLCVAGMFPTDENFYVMAIDPDTGNGNLARSSAAIHCYNIFQKLEVGNDMPIFKNKVELASPFVWNPTEHDKCLDDVMSYQAYKGTPIGDLYEVLYTKKERSILLNEGFRGHPSIGAAVMAKKVSTDSDGNNEEPWKRFETLVNQDAKTGSVAKIFLAGSVFGGTGAAGMPTIAKLLQRMFKDLYEEKKVLIGGALILPYFSFSPDKSCISSNEIYASSENFLTSTQAALKYYAAKNKEENIYHSMYFIGDNVLAPVQKFSVGSSSQENDAHIVDFYGALAAIDFYQSKQLKKCSYITHADDNIIQWSDFPDIIEIDTNSKDIVVNIKERFAQFTRFI
GUT_GENOME150331_001511-329MNNLYVFAVGGSGERVMRSLILTLASGVKAEVNQIIPVFVDNDEKSNALLKCLDLIKYYNSNPKAGGKMGANTVYQNVSNASDNWPSFFKTSISEPIILNKTGSAIGNLQTIIGSVDTDRSIFSDIAEERDLLFTQDDLQMPLNVGFVGNPNIGSVVLNSLSLGDPSFGTISDSITADDGVIVIGSLFGGTGAAGFPLIINTFNAIDAARKPLLGGVAILPYFSIESKDKSTGIIDTTKWDVNSDSFDVKTRAALMYYDEYMRDMDYLYYVGDGSSKDVYEHYVGGEKQENPAHIVELMSALSIIDFSNVKERPTSVVYKSPVWGFNEA
GUT_GENOME250293_016361-360MSKLYVFGIGGTGSRVIKSLTMLLASGVRMKSDAIVPIIIDPDVAAADLTRTVDLMHTYNNIREKLTFTTDAQCGFFRTEVQELLPGFRMDLENTQNQLFKDYIQLNTMDRANYALAKMLFSDDNLNADMEVGFKGNPNIGSVVLNRFAQSPQFLKFASDFQKNDKIFIISSIFGGTGASGFPVLLKNLRTLQTQSTLQTQGGVFPNAKDIQDSVIARVTGAITLLPYFGIEPNGESAIDEATFISKTKAALSYYNQNMSAIDALYYLGDNILSKYENNEGGEFQKNDAHLLELIAALAIVNFDEDFDERSLKTIYREFGVKQSGENLIFPDLCDATYDIIVRPMTRFTLLCKFFNEQLS
GUT_GENOME038687_004811-391MNVFIFAIGGTGARVLRSLTFCLASGIEKIPDGTNIIPLIIDYDKDNGDKKRTIDLLETYTAIRNKAYEGVELNAGDRNFFHPVIRHLSDVATLAGKADVGVKPSYEFTFGLDDQTKAGTFADYIDYTNMMGNTYLTKDLLASLYNDEPQDFPEGSHPFTELNLELSMGFKGNPNIGSIVFENLKEDAEFKRFCNTFDATRDRVFIISSIFGGTGSSGFPRIVDAIHYSGIAGFDNAMVGGCIVMPYFKVNTPQGGAINSNIFNSKQKAALSYYAQPDQNGNSIYDKLTTAYFIGDDETTNLPYSEGRDTQKNDAHIVEFLSALSVLDFICKDRATLEASKYREYGLPKNINAGEKFNFTNFDADDYEDYLQFLFTMGLTFKYYKDYVAND
GUT_GENOME276136_017301-330MNTIYLLCVGGSGLRVLRAFTMLLGAGYDIPGYQIKPFIIDPHLQSDDLNLTTGIISKYLDIHNPASKGFFKVKMSVDDLNSLNIISGDNKPNRSFSDFIGYTKLNSGNPEKDIVDVLYTYNNLNKSMDLGFKGSPNVGCIVFQESINSDWLRKNISNLAPEDKVILVGSLFGGTGASGLPAIAQAIKKMSAGIDLALIALSPYFQLKRPDPNAENKDIDSDTFDMKSLAALNFYQDNHSFIDSFYLVGDNIKQSNNYEYDEIKQGNKAHFVELIAATAIKDFAIGKKEKWNMFYTDTLNPVMTYPDCRKCMDEVLKCLSNFYIMSKFLF
GUT_GENOME015839_014571-342MNNLYVFAIGGSGERVMKSLIMMLATGMPLGAKKLIPVFVDNDVNSNALTSCLDLINYYRANPETDKTDEVGLHNICSKTSGDLGSVPSFAHVDVEKPIILNVAGDHIGNLDKIIGNLDTKKKYEDSINEEKNLLFTEDDLKMPLTVGFVGNPNIGSVVLNSLSLQGDEFTTIYGDAGSSDGVFVIGSLFGGTGAAGFPLIVNKFMSDDNAHNPLLGGVAILPYFNLQSGDGTKGLIDTERFDVNSETFTSKTRAALMYYDEYMRNMHYMYYVGDSSKRSMYPHFVGGVKQENPYNIIEVMAALSVINFSQKDTNSKPTDVVYQMPIWDFDNETLSNVSSIR
GUT_GENOME105652_0015911-313YVMAEGGTGVRALLAAHMYLSSKAYAAGENGRTENWKFIYETMDAGAEEIEQLQKLVRLDSESEFCDPHYSFHFCRLAEKVKEKLARDNTMSLKKIAPAWYRNGLLLTEEQLERDLLGGYYRDLTLGSVVSSAAMQCALGTEEDRNAGFRAIADSVVASNNTYETRVVMVGSGIGGEGRTNLCTHPAMLRKLCVERVMEDLRMEQEQAKTYVEQNLKIAVIMTGSAFRFPAMNGLDQDVAGLVAGTLRNFPEDSAEAVNLFYLLEHDQCPVQATKASDSREQYKHAHAIELVAVAAMEDFFSL
GUT_GENOME205163_014872-343KRVFVFCIGGTGLRVMKSITMLLAAGMDAQGYTVVPVFIDPHIDLEEKRNLQNLIQDYENIYEAITVSDRKHLNPIHGFFGTEIANLAKINGQQNDISEPIVEKRSFGQYLNVGNLNSEDVNNYLVQTMFSQGNLDNNLSVGFKGSPNVGTVVLGEMIKGADWFQAFCNACQKEDRIFIISSIFGGTGASGYPLLEKRIRQSSAYPNVKDAMMGAVSVLPYFSLEDPSITNSDIDYTTFFTKTKSALAYYENSVLSDYLYYVGEQKIRTTYANDEKKQDDKAHFIELVAATALFDFLSKIDKPDKPQALSRAIKEDVSSLSASSLGDAYNDIIKVVADMMLL
GUT_GENOME117139_021027-335MNNLYVFAIGGSGERVMRSLIMLLAAGVKINANSVTPVFVDNDKNSAALTRCKNLISFYNKGLSGGTAGIGTLCQLIPAEQRGSFFQTKINDPILLDIAGNTIGNLEQIIGNLKKEDELQRLVLEERNLLFSQDDLEMPLTVGFVGNPNIGSVVLNTLSFRDPQFATILQNATTGDGVVVVGSLFGGTGAAGFPLIVNSFMSGAKNRPTVGGVAILPYFDFESEDAHDAKERVINTEKYDVNSDSFSTKTRAALMYYDDYMSRQMDYLYYVGDDNRAHYPHYVGGAKQDNPVHIVELLGAMSLIDFAKGTNQSTIVYKEPVWGLNDDQA
GUT_GENOME256509_0133226-389KLYIFGIGGTGSRVIKSLVMLASSGVKINADSIIPIIIDPDFANADVTRTIEQIKTYVSIRERLAFNEATHNNFFGISIQNVAHDYRLTFKDTPKKKFKEFIEFSTLSKENQALASILFSEANLEADMEVGFKGNPNIGSVTLNQFEESQDFINFSANFKPGDRIFIVSSIFGGTGASGFPLLLKNLRGLKPTFPNSDAIQHAPIGAITVLPYFAVKPDEESSINSSTFIGKTKAALQYYEKNVTGDKSSVNVLYYIGDERTQQYENKEGGVGQRNNAHIVELAAAMSIIDFASIPDEDSSLMCEEDDNAKIYATDPYFKEFGIENDAQEVLFSNLTQKSRNILCTPMTQFTLFSKYIQEHLQY
GUT_GENOME110132_003743-323LYVISIGGTGHRVTTALLNLAAAGAVPAGKIKLICVDSDTANGNLNMLTESIKAYDDASCNGNTFPTSLETAQMGNKDIHNPCWSPTVVDNQTLENAMKKQAMDINGQRIFDFLYTNEEQTVSLDRGFYGHTSIGSLLMAEQIRPDGSNFCKEWEMFFNGFDSNNDRIMLIGSTFGGTGASGLPVISTILRQTYPDAKIAALLVMPYFKFNNAAADAGDNGININWAYFIPKTRADLNYYEKQNFDKIFNEIFIIGEDPDNFMNVKYSIGSTQQSNKPHVIELYAASAAIDFLRSACDKYTVKVMGRFENAEDAREMSSLS
GUT_GENOME218838_015064-371FVFAIGGTGARVLRSLTMLLAAGVKGTSTKNEIIPIIIDYDVDNGDTNRTQDILECYQRIHRDAYKVEDEVEGRFFCTPVRKLKEIVKTTEEFNPNSKFEVYLGQENTNITFADHIRMSSLGGALQPTNDLLKALYDNSPEESKTAELNLNLEKGFKGCPNIGCVVTRALENSLEIQQFLNNVTPFDRIFIIGSIFGGTGASGIPMLLDLIREKEDLNTVPVGILAVSPYFKVSRDVKSAIDSDTFCAKTKAALDAYDLGKSVNTLADAIYYVGDEKVEESFPNHEGSMEQKNEAHLVEMVGAMCVIHFMNEDKNEFHMGEQDAAFYEFGMTSTDSPIGYLSFKNETRYQFIDPMVRFVMFNKFCRDY
GUT_GENOME234430_005381-313MNDNYIIAASGTGAMCARAFIYMAAAGCAQDHGVYHVLLIDKDKESDAVTACEDLLRDYDAMRTQLGEKPDTSTFPKIELHHWNFTEEIVDEYCRQTGNTADSLKNLTLNKLLNPRNDPRLAQILRSTYTEDELSADLDKGFYGHPNIGAPLFDYIQDRFLAKTVVYQDRGDVVNTFMSSLHSSLNKGKTHVYLMGSLFGGTGATVIPNVVRALRSMHDPNNPAIDYGKTNLILGGSVIMPYFRLPTCPANSVEALEKVSPIDAKFAGQTQEALSYYFDSGLLDDMMNLTLLGSSQMDITSEIFARGGQQSQH
GUT_GENOME180609_015691-363MPKLFIFAIGGTGERVLRSLTMVLASGAPTFDNYEVYPIIVDYDVDNADKVKTVELLQNYADVHNAAFTHHTVAGGLQGQSGQFFAAKLWNLFDQKNPYIFPFGPGGENTDIVQNEIFADHIGYSDLYGSTLATKRLLTSLYDESGNADTELNLDMVVGFKGNPNIGSVVFHNIGNTDKFRTFISTYDPTNGDKVVIIGSLFGGTGASGIPEIVKAIGTQKPGAKPAAILVLPYFAPMIKQGGAIQASRFNSKTKAALSYYDDSQLKNKIDKVYYVGDLQSSVISYSEGGSTQQNNANMVELIAAMMIEHYVAGRGASQTEFKFAVNADLELKKRLFIDDFDATSKNQVLNHLVELAIGLKFY
GUT_GENOME088644_001661-393MKRLFIFGIGGTGCRVLRSLNFMLAAGVDGFDSETQVFPIIIDYDKENADKDRTLACIENYSKIHDTAFQAHQDQKDQFFMARMRQMKDALAEEGKGGTSTYELNYAPKQDEKQYKDSIGYEQLTGDLYKTKFLLESLYDTSIDPDKAELEIDMTVGFKGNPNIGSVVFHELKKTPEYKDFVQLFNPATDSIMIIGSLFGGTGSSGIPELITAIRNSNEMKLHQARLGAIMVLPYFSLIPKPGSPINSSLFNSKTKAALSYYEDSGLNENVNAIYYVGDSNDTKLDHNIGGKEQLNRANFVEFVAAMSIVHFVTDDPNNVKKGNAKTEYFKYAVEGELYGANGKSCIDLSDLLKDKTVSVIRPLCSLTLAMKYFHDCIEGDKKSLSKVQYFSQ
GUT_GENOME034198_004681-345MNQYLIGAGGTGAMCIRAYLCTLAAMQAFEKDNINYKVYIRMVDMDDQSDAALKCKELYEAYRNLREQSNALPEVIFESWDFTQAVKNAAKEHGADIRNDQSVTLTKLFTPENGASTHTSLLMNTLYTDVELTTTLEKGFYGHPNIGAAVFNCVRDSFLDVHNSVFMEALVQDLNALPQGEHVRLYLFGSLFGGTGASVTPNLVDVLRSLKDPVTNAELGVPEKLSIGAGMMMPYFKTPVDPAKGAKGTLRPSSAKFMQQTKEALLYYDKFGLVDKVTSLLLLGTHELAVTSEIYARGEKQYQHFHLVLLVAAIGAYRFLNGTLLDPVTNKELHGALVWKIAPAG
GUT_GENOME236359_031271-344MAKTFIFAIGGTGSRVLRSLTMLMASGVKGMTEDICPIIIDYDLHNGDKDRAITCMKKYSEIQNLVANGEGAKLEDSNNGYRPYFSTKIGQLNGITNWSWQFNLRKEIESLKYSDYIDYDKVNLYAPISQDFVNCLYDSNINSVYAELNLNMKEGFQGNPNIGSVVFNDLKGCPEFNAFSNSFRAGDRIVVIGSIFGGTGSSGIPKIVSAIRNHSKADIKKASLSVVLVLPYFAVNSSNDKEISIKSNIFNSKTKAALNYYERSGLNAQINSIYYVGDKVQTKVEPHLGNKEQKNNAHIVELIGAMSVAHFITSAQNGNQRLKYKFNAANDIVDGIGIHELIGE
GUT_GENOME014551_012211-230MKRKVFLFGGSGGRMLEALLFMAFAGITPEETLDLVLCDPDGDGMHGAVQLRQELADYQRIWASRTYIGTENPAFQTEINLRTWCDPLPMNAKTLADWTQDEKQDALLCQALFPADTASLDLRQGFHDHPELARVAFAAMLSECENDPDDAMRRTLDEIQSALNAGEEVRIVLAGSLCGGTGAAGIESVAALLQNRFGQNDNFRMGAVLMLPCADGESPARAQAALERIS
GUT_GENOME116153_011763-349TKYFCYCIGGTGARIAEVAAHLYAMNMINAAATDEIEFIIVDKDQACGGTQQARRVIGSVRTLSPGGAFNRSGRLNGNNAREFCKNHINVNADWDFTRALARLRAGNGGGQNAGKISLKDAVSRTNNNDDKVLMNAFYTQQEQTIDTDKGFYGDPSVGSLIFKYMVKNDGNQNDIANPVTGWLEQNPNAGDEARVFIIGSIFGGTGAAIFSNLAAHLRDSANGGKLFISGVLLLPYFSFGDGGQKAKVNSAEFFKKSKVALDEYDRDLNLIRTVENPNGSFDSLYVCGQDECHMTSENYADGGKNQNNHFDLVDLVAARAMVDFFNKDFPVNNDGQIVGLGKIYEYR
GUT_GENOME196003_009544-318YLIGIGGTGARCMEAFIYLNGAGILKDHQDVKLVYVDADVSCGNLVRTQNAMHLYQKVKNLGFGPDTVFTNNIIDAGFWSPVAEGCDTLDDVFQHGALVNKTENQALGSVYEALFSEQERTTDLDKGFRGHPAIGAAVMSEKMDPKSEPWKSLLPQINADKDAKIFLFASVFGGTGAAGFPTIAKLLKEVLHKDNEGNCVARIGGALVLPYFQFPPAGDENAREMQAKVEEFMLNTKSALEYYDKNDILGPVFTSIYMVGDNEYSQVETFSLGSNSQENESNIVELYAALAAVDFFNKEEYDRKTTPMIARAREN
GUT_GENOME135011_009111-379MSKLFLFAVGGTGARVLRSFTMLIASGLQGFDSSTEIVPIIIDHDRQCGDKKRATDTLDTYQNINRALYPNGNTVSYVDHFFMTRVTPLSEVGAAHIPNHVGDSKWCLNFGPDGSVKFAEFIGIPSMVAMPGLTETLGLLHALYDSSDDTAPAAELELNLNVGFKGKPNIGTVVFHELKDTVEFQQFLACCAPNDKVFIVGSLFGGTGSSGIPEIVKAIRNSGAACANTQIGTALMLPYFGLLPNQHPNNDGLNTGAIDSAMFNAKTKAALGSYSIGGGDALNNMVNSIYYIGDEVNATSNEYNEGAVSQKNPAHVAEFIAAQAIVDFWVSGNAGPHEFGTRDDGQTKAFDLQAFHDVTHEQILTSLSVFTLAVRYYRE
GUT_GENOME020358_013244-347LYIFGIGGTGSRVLRSLTMLLAAGVDTNGYDIVPVIIDPDTSNADLTRNVFLLNKYMSIHSRLSFTGQGQNDFFKTSILQSVQNFTILIPNTNNKTFEQFIDLANMGVADQAMAEMLFSNKNLSSFMQVGFKGNPNMGSVVLNQIASSPQFEQLANTFVEGDKIFIISSIFGGTGASGFPLLLNTLRQNKNIPNAGVINHAEIGAISVLPYFSLEQDSNSEIDSTTFFSKARAALKYYQDNIQNINALYYIGDFPKKPYENVEGGSEQKNDAHLIEVLAATAVIDFCKNSFVNQGTSYRELGLNVEGDTLSLRAFPSSGMRKQIAYPLMKLALFAKSLVDDFDY
GUT_GENOME136920_011643-345KTYVFFVGGTGARVLRSLTMLLASGCRMKDGGAIVPVVIDYDVKNGDLKEAKDLLDCYCNISKTAKYEEQEEGFFRQKLDIKEGSYAAIQYENSNKTFGDFLSYDSMANRDGLAPMKKMLESLFDTSDNQVTAELKLDMTVGFKGNPNIGSVVFDDYFRNKDWHYDDIVSSVQPGDRIFVVGSIFGGTGSSGIPQLIKNFEKKGSSSQGNYQGVQNAIKGACLVLPYFGVKNSDESAVDSRIFNSKAKAALSYYDKEINSSLDEIYYVGCKDIGQYENHQGGDEQKNDAHLVELIAALSVLEFANRKFDYDHKDNGTTAYEFGINHGYNEDNMKLHDTSYLDI
GUT_GENOME241027_010172-369RIFVFAIGGTGSRVLTSLIMQLAAGIRPTDQNGKIIKDLSIVPIIVDPHEDNAGLQQATELLDNYRSIHNSIYGKQELNAEGFFSVKIETLKEINAANVSADKFFFKMQRVSDNRFDKFIGLEDMSPENRLFAQLLFSKEELETRMHEGFYGSPNIGCIALNEFKKSKDFDAFRSAYSEGDKLFFIGSIFGGTGAAGLPLFITSIRDLEHIDNEDTGKTLCAKAPIGALVVMPYFSIAKDDESVINDTEFTIKTRSALRYYETNLNNYIHNIYYIADPSGTQDFENDPGNKNNQKGNKSHIVEFAGSLALFDFIESDNTDVTEDHAGRNIAVHTEYKAYGLNDDKAFVSFVQLAKTTNTLCMEPMMKF
GUT_GENOME110891_000464-345KCFCYCIGGTGARIAEVAAHLCAMNLVNDEREITFIIVDKDQNCGGTAQAKEVIGRIEKISQIEGSNTGACLCKSRLTKPDANPWDFTNALANIAGNNDGNASLEDALGQNGTDDGILFDAFYSDSEQRSKTSKGFYGHPTIGALMFKYMMKNANGNDIAQPVINWLSRGANTAKVFIIGSVFGGTGAAVFSNLAAHIRSKARALANADSNANDQQQAENIADKQNQLIISGVLLLPYFSFSQNRNNNIIDETEFYAKSKTALKQYGNDPHLIRRRVDSDNSFDTLYICGQNPLHKVGEYSIGGESQKNHFDLVDLVSAYAMVSFFNEDFMRAGTNQIDAGK
GUT_GENOME236224_023172-287KGCVMAMGGAGRQVLRALGCCALAGTAELQEIDLLLADTDADASALTAWQEDYALLRSLWPEVRQIPAFRTGLRLRVWPGPLPEETASLGALATTAEDRLLLDTLFPADTAEASLQDGFHGRTDAAAVLLAGLTGCGNAGPSGALAEQLQELEDAARDGETVRVALVGSAMGGMGAAGLTALAALVRERLPRAEVAAVVLLPYFRGEEHQLASAKAMLRQWDEDGLCATVYALGLPQSAYLTAEEGHQQARMPEWLAALCALDFLAAGRKGHYGWRVPFDHFGWDA
GUT_GENOME229078_027461-331MKKLFLFAIGGTGERVLRSTTMLLAAGVPAFNGYDIYPIIIDYDADNKDKERTRELIKLYRTINRIAYANHSLSSMPLRDTNNGQFFGCAMKEMNGLNDFVLDFNPGDENLKFRDKIGFDQMDVNLLPTQALIESLYDTSDDPKTELNLDLKVGFKGNPNIGSVIFHKLGDYAEYDAFVSAYNADQGDKVVIIGSLFGGTGASGIPVLAQHIKNDIPNVDMAALMIQPYFAPEKVRDGAINAKLFDSKTKAALNFYERSGIKNSMSAIYHIGDPYPTIIPYCEGGKGQQNNANPVEFISALAIAHYCGGNVPANTFGNEYMYGLSRDIVSG
GUT_GENOME238259_011964-355NLFIISIGGTGMRCLESFVHLCAMGMFDGKEINILTIDTDSSNGNKDRAENLISLYNDIKGSAGKEGGAPNRNTFFSSKLNLYKFSTDYTGNRSSFAKLSKIGDEDVDNRDLADLFLDNDTVQKFDLAHGYRAQTHLGSMLMYNGIVEAARSAYIKGDDAMPQEKQLREFVIKMQDAAAGARVFIFGSVFGGTGASSIPVVPRALNDAVKIFSEGANVLNPEQVKFGASLLTYYFKFPNPDATKKAEQKVIASSSNFSINSQAALEFYRKDPTVKTFYRRLYHVGWPSSMSLDLGGKLLTGGNNQKNPCHVVELMCAMAAFDFFNQEELNSDSNQENMPCYYRTVEADDSGT
GUT_GENOME073832_0165513-372LFVIAIGGTGMRCLEAFVHLCAIGMFDNEEINILTLDTDQSNGNKGRVERLIELYNKVKTDNPKSPGGSPNSNTFFSSKLSLYRFYTDYSKANRSTYSKLSATQGIGQETEEDDHDLADLFLEHDTVQSFNLEHGYRAQTHLGSMLMYHGIVEAACHYVENKAKATEPEQQLADFLTELQKSGTGARVFVFGSVFGGTGASSIPVVPEALKEAGNIISQNTIDFNKVKFGATLLTEYFSFQSPSDAQRKDEKLIASSDYFAINSQAALQFYQNDKTVKERYKRLYHIGWPAKDLNVSDSNNGNTITGGAEQKNACHVVELMSACAAYDFFTLDEQNIQNTTAQYLYRSVELDEAKNLTFR
GUT_GENOME245203_002013-329NHFLVFIGGTGAKCAEAFVQLAACGAMGGPENTYHILTLDVDATNGNRMLAIEALARYRLLREQFPATGQFAGSTLLGPNLVHYDWHVQLPDNFPNAAGVNCLHAMRAQNNAEEETLMRLLYTDEEMGFDFSREGFHAIPALGAPVLQHILEQGLALGGCQAFVDALRAETGVNSRLILVGSIFGGTGACGIPAVTRYLTDKTRDIAMAGNLLTCGVLLLPYYHFAEPGEGDALGVHARKFYNNARGALAFYRDMEQELGYERLYLIGSPLDFNMGAYRPGMESQKNPPTPVEWESALAIAHCIASEPDETRGRQFYKCVAGEGDGE
GUT_GENOME110036_004466-339IIAIGGTGMRCAECVVHLCAMGMFDDTEINLLALDTDYKNGNFKRLKNLIGNYNRITSDRDGVASNTLFSAKINYYTFSPDYSAGTNYDIVAHYSDAQASTHDDDGVKWQESDLVDLFLTPEMRKMDLQHGYRAQTQMGSMLMYHAIKEEAYKNKGKGSDLVRYVENLMTTPGSIVFLFGSVFGGTGASSIPVLPHALQEAAKILSKDGQGDLLSKNMFGTVMLTNYFSFDAPNTTDSVFATSDKFALNSQAALSFYNEDETVRSTYKRLYMIGREKPRNVSRRDAESVTGGDAQRNPEDYIELAAASAAYDFYQNAKAHMDKPVFGDEAEIFY
GUT_GENOME051217_012433-342TYIFCIGGTGARVMKALAMLLTTGVEINTDEIVPIFIDPDASAADLTRTISFLREYKYMNVQLHFSDDMKNRFFKTKISELIPNYRLELSNTRNDRFRKYMGFDSLDIANQALAYMLFSKKNLDSEMEVGFKGNPNIGSVVLNQFAQSDAFNDFTNSFRPGDRIFIVSSIFGGTGASGFPLLLKNLRGINMSLPNAGNIKDAAIGAITVLPYFNLAQSQDGTNDIDSSTFISKAKAALEYYNKNISGNNSLNAFYYLADDIYDMYEYSEGGTTQKNKAHIIELLAALAIVDFCSMDNDALQSSKGKALNPIYKEFGAASGDEDMIFSSFYDDTRDIIQRP
GUT_GENOME165335_044361-330MRKVFVAMFGGTPSRGEICLYLNCAAGIYQDCEIRCYPIDKDTRSANYVDSRTFCEMYNDFRKVMEGGEMAATAVSRVEDLFEDMLEQAGIRAQDESIRYIMAMEKELNDKDRELLDICLTKEEQNRNLEGGYYGKANLGAVTSDVLIYKKVYKEISMVKDVENELAAGNEIDFIILCTSFGGTGASLGVNFGEFLAENFQGKRDNLRIHCIHVQPYFSFPDPESDDQNQINYHKFYAKSATVTIVLGDEKKLIKSGKEQDPVFDSFYYLGQEVLDRVSDVNAAKDRQKNKLHIIDMLVALAVNDILKKSNENNYNLFGYQYSHEGTEFI
GUT_GENOME126258_003811-373MSRLFLFAIGGTGARVVRSLTMMLASGVDGLDSSTEIVPIIIDYDLSNGDKTRATKALEKYSEIHQSLYADTADGKLCADHFFMTTIKPLSQTGVALAPGAAQLKNFEFNFGPAGTSQKFSEYLNESALNAVPGAELTEQLLYALYDNSESNCPNAELELDMAKGFKGNPNIGSVVFHDLRESAEFRQFSATFNDAQDKVFIVSSIFGGTGASGFPEIVNAIHASQLPDMQSPIIGSALVLPYFDLQQHNPEKGDVGAIDAASFNAKTRAALSFYGVPNGINSKVNALYYIGDENHDAYDYNEGEDRQKNAAHVVEFVAASAIIDFLRRPVTFADHNAYEFSIKDEKKGQSIQLPDFENSTHTLFLDNLSAFV
GUT_GENOME008718_005074-369KIFLIGIGGTGMRCLESFIHTCAMGMYDETEVEMLALDTDSGNGNFKRLRNLKNCYCSINGGNYKRNTLFSAKINYYEFSPGYTDNDSFYSLSNYQTAAAHREDSPVADLADLFLDRNVMNMNLKHGYRAQTQMGSMLMYYAILKAAYKASPAGGNSTDKSAFALKSFINNLNAFPKAPVFVFGSVFGGTGASSIPVIPLAFDRASEIMGNKSTNIVGDHPFGTIVLTNYFRFKVDKMTETEVVAKSDNFAINSQAALMFYDGDETVKKTYKRLYLLGRQNSENRDVTEKDEKGTKKTSAGVTGGEKQENPADYIELLSAFAAYHFFNEAEKGDKAFSEDSDNRFFCISHNYGNSKLDFTLFGQDH
GUT_GENOME088579_002483-318CYIVAVGGTGSRVLRSVVHLAAAGVFEDVKGLQNINVMVVDSDSLNGDKINTNDTLSDYSVLSQSNVFRPVIKQYAWNPIDVMNKAIDAPQLMNVAAMSDKIEDLYGFLYTSEERSKVLKGGFYSHTSIGSFYMNQDITDGKGIYINEWKNFFENNNIQPEDKIIIIGSVFGGTGASGLPTLARKIKTAPKTQNCDIAGILVGPYFAAPTPSEGSIIETDNFPIKSKIAIDFYDKQKMSDVFKTMYFIGEDSDKLMMTTHSTDGQTQKNKANVVELYSATAIVDFLNNTAVPTDTENKCVKLAWRGTETSDDVFGY
GUT_GENOME081297_001071-279MKLYLIAVGGMGHRVLESLVWSAATDVLTFPEGIKLLSADVDASSAENKRARQHVADYEAVRTLLDSLPYTHRGFQTPLRLHEWTMDAPSSVEAQTNASPQGRLLARALFTPEEQSQTAEDGFWGNAALGALCFAQRIARMDAEAAQGQPDALAALLADMKHALETGEKVRVLLCGAADGGTGAAGVTLLAQLLRERVAAAEIGALLTLPDETMRNAKAVLREYSRTDFLRQGEAAQTGVLNALYLLGTLAENDTAVQVGTNLSRWLAARCAGHFFTEP
GUT_GENOME139612_016425-345VFVVGIGGTGMRCIESFVHLCAIGMFDDTEVHLLAMDTDKDNGNFRRLALLVDNYMKINGGNTKKETLFSAKINYYQFSPDYDKNDHSSMSRILDNTENDKEVVGSGAKVKVKAKELAGLLIRPEVENMSLAHGYRAQTQMGSLLMYYALLEEAYKSKNSSSGLREFLSNLSNGVAGKQIFVFGSVFGGTGASSLPIIPSAFNAAAKIMFGDNIDVVKSNYYGAVVLTNYFTFDIQSSEKVYATADKFALNSQAALNFYRTDNTVKNVYKRLYLLGREGNSLRKLPSGGTGGADQCNPSDFIELMSAFAAHDFFKLCDSPDANFGKSDNNFVCRGIEDGQT
GUT_GENOME090547_0095724-372MAKLFVFGIGGSGARVLRALTMFLASGVKVGVDKIVPIIIDPDEANADLTRTVTLMNRYRDINNAVKYGSPKDKSRADEREFFFQTNMEVSGSGNYKLQIEDTSNKTFEEFIGFSSMSSANQAITRMLFSDKNLEASMQVGFKGNPNIGSVVLNQLAYSEGFNSFASEFAQGDKIFIVSSIFGGTGASGFPLLLKTLREGSLCPNYALLKSAPIGAVTILPYFKIEQDVESAIDSTTFLSKARSALSYYQRNLGGERGVDAHYFLADEARKTYENHEGGSAQRNNAHLIEMLAATAIVDFSNYQPDEFQQGTKYYELGVKDNPQSSTIRFDSFYDDLHEMIYEPLVRFR
GUT_GENOME220910_013242-353SKIYVFGIGGTGARVIRSLTMLLASGVKLGENIDTVIPVIIDQDRSNGDLTRTIALLKTYKSLHDQLKFGRNSKSEFFKTNIDDLNTSFRMQVADVADKDFQSYIKYLNLDTRNKALVSLLFSKKNLEARMDVGFKGNPNMGSVVLNNFSTEADNDLGSILESFQSGDKIFIISSIFGGTGAAGFPLLLKTLRQAQSSQLPSAALVANAPIGAITVLPYFGVQHDEDSEINMDSFMSKAKAALSYYRDNLNTDVLYYISDKLSKNYDNHEGDSAQRNNAHFVEMVAALSIIDFCKNNVQHDGSKSFKEFGFNEDKPVVSIRTLADSTRALVGKPLISMFIFEKFYKEHLKNT
GUT_GENOME233111_003951-350MSKLYVFGIGGTGSRVLKSLTMLLASGVMCKADTIVPIIIDPDDSGADKTRTIDLMNQYMAIHDKLENNTEGKNKFFTTAIKPVEGMTNFIFPLQNTRNCKFKDFIQMSQLDKESLALVNMLFSEKNLNSDMKFGFKGNPNIGSVVLNQFAGSQEYKAFANGFEAGDRIFIISSIFGGTGASGFPLLLKTIRTDKSSQAWKIISEAKVGAVTVLPYFNVEEDEKSGVDSATFISKTKSALAYYERNISQNGSIDALYYIGDNIKATYENHDGGSAQKNKAHLIELASALAVLNFASTDDAQLVGDTIHYEYGLDTTKTPKEQVLFKDLGKESRDAIQKPLTQFILMNKYL
GUT_GENOME243777_011983-327FIISIGGSGSKCMESLIYLMAAKFLPSTEPYRVFVVDKDANCGNTNRCRNTIGSYHTLRKAVEGQPYAEPSLQECTWCFEDSLNEIVRMKGRTVQNSLCQGGHDQQLLDMIFSRSEQTEDLVNGFYGHPAMGAAVFEAVSLTGSYQNSELFRAIQEAVSSGTQHINIFLMASIFGGTGASMFPNVARSIRDRFCQGGANLDKISIGGALLMPYFRFPNQQEGEQQRINSITYEDFMEKTAVALRYYDGTRGLVKNASVPGDYIFDSIYVMGCAPFTQTCAVNTAGGEKQKHKFHVVDLFAALSAAHFFQNPNGTGEKLQIFVPQL
GUT_GENOME184495_022024-339KVFVIGIGGTGMRCLESFVHLCAMGMFDNHDVRMLALDTDRLNGNFARLQELLAFYNRINKDKVSSDTLFSARINYGTFTPDYDKNNVTFNSISDYSSASSLLIGENVKYRESDLCDLFLDSKVRAMDLAHGYRAQTQLGSMLMYHAIIEEAYKTKRSDYNSELRSFIKELNESSGNQVFIFGSVFGGTGASSIPIIPRAFQKAAEIMFGEQAQVLEKNFYGSVIMTNYFSFDIQKQDAIVATSDKFALNSQAALAFYASDKTVNRTYRRMYILGRENMRNISNEKAKTDTGGASQENPVDYMELIAAFAAYDFFKTCDGGESAFSSEKGKRSNFY
GUT_GENOME113022_005033-352RLFIFAIGGTGARVLRSFTMMLAAGIDGLDSNTTIVPIIIDCDATNGDKTRTIEALKNYRQIHNSLYNDAQVYDHHFFMTKMASLGELCHDNTICPDFQINLGVPTNSGMTFADYINLGGIIANPNARIADPFIQALYDDSPENDKNAELNLPLEMGFKGNPNIGSIVFNKLNDTAEFNAFLHQYNPGHDKVLVIASIFGGTGASGLPKIINALPTNSVSSALVVMPYFDLGTPQDPDDTGAINHQIFKAKSAAALGYYQDTINNKANAIYYVADRKSDSIEYHEGKTKQKNTAHPAEFVAASAIVHFLKSGIDPIGHQAYEYCVTNDREGAEMRIYDFSNPSRTKFIDN
GUT_GENOME233269_006363-371LYIFAIGGTGSRVLKSLVMLSAAGVKPLDENGRPLSGFEIVPIIIDPHKANEDLKRTEKLLSDYRTLRKRLYGDAVDANGFFANKITLLSDILTDSSIQVNRSPVCNLAAVEQSKFREYINYNTMNEPNQALTSLLFADYQLENRMRIGFVGSPNIGSVALNQIKDSDEFKAFANVFNAGDRIFFVSSIFGGTGAAGFPILVKNIRHAENLDISNKDVLRRAPIGALTVLPYFNIEHNEDSRINKADFVVKTQSALHYYNKTLTNDSRYAINALFYVGDQVTSKPYIYDPGENGQRNRAHLVEFVGALSPLRFAGIANDRLCDLNGYPMPTLAYEYGLDKDVSDLDFSAFGTETRRMVYKPLVKFHLLF
GUT_GENOME129344_014781-354MARLFIFGIGGTGARVLRSLTMLMASGVKLGVDEVVPILIDPDAGNADLTRTTSLLNDYANIKQCLTQPNENKFFGTGITNVTTNYYMELKDTSSMKFADYIGLKTMDKASRSMTEMLFSGENLDSDMKVGFKGNPNVGSVVLNQLVRTDAFQAFASLFSAGDKIFIINSIFGGTGASGFPLLLKILRGSTPSDYAKASVIASSVIGAVTVLPYFTIKSAEESKSKINSSTFLSKAKSALAYYEENIYDNKQIDQLYFIGDDMMSEPYENNEGGPDQRNAAHLVEMMAATAVVDFSNVDGGTRPDRPSSKELGLKVNDDGDDPEVITFDNFYEGLRDMLFGPMVEFAMMANVFH
GUT_GENOME096549_005303-402RLFLFLVGGTGSRVMRPLIMQFAAGIHPVDDNGNPISLEVIPIIVDPHKANEDLKRTDNLLRWYKQLRKSLYGESADVAKGFFSVKISSLSDILPNGSNLSDTFLFNMGAVESKKFQDFIAYSTLDTGNQALCSMMFSDDQLNTKMDIGFVGSPNIGSVALNEFKDSEEFRQFSNVFQKNDRIFVVSSIFGGTGAAGYPIIVKNIRNAGNNAQINNRGNLRDARIGALTVLPYFNVQQDENSPISHADFISKTKSALFYYHDNLTGLKRNGVDIPMSKINACYYLGDEIPSNPYFNDPGGNGQRNDAHVVEFVGALAVLDFLSMPDEQLQTIDGNAINPVFKEYGLASDKTTLSLKDFGVSTRALINKQFIKFHLAYMYITNQLKADIGRGYTEDKPEIK