UHGP-MC 50923


Information


Number of sequences (UHGP-50):
75
Average sequence length:
267±26 aa
Average transmembrane regions:
0
Low complexity (%):
6.9
Coiled coils (%):
0
Disordered domains (%):
0.43

Pfam dominant architecture:
PF04389
Pfam % dominant architecture:
4400
Pfam overlap:
0.47
Pfam overlap type:
extended

Downloads

Seeds:
MC50923.fasta
Seeds (0.60 cdhit):
MC50923_cdhit.fasta
MSA:
MC50923_msa.fasta
HMM model:
MC50923.hmm

Sequences list (filtered 60 P.I.)

Protein Range AA
GUT_GENOME096130_011665-298SQKILSDFQIRKSWGQKRAFRAWLRRELEGAGWTVTEERGRFSGTNVIAGDPEKAEVLFTAHYDTQAVLPFPNFITPRNMGVYLLYQLLIVVVMFFVVAVVSFCSALIGYMAELPTLAAMLPGWLVCAFCIWWVFFGRANRHTANDNTSGVVTLLESALTLPPELRDKACFVFFDNEERGMLGSAAFARKHKRVKAEKLVLNFDCVSDGSSIQFYPTSALKKDETALVRIEGAFLPQSGKTVEVVRGFGFYPSDNATFRRAAGVCALKKGKRLGWYMDRIHTDRDTVFQAENIS
GUT_GENOME006551_000338-303EIYCAVIGKRCRPGQLQRFLSRVQAQAQTQNQPYDCVEMKEKGVLSRCLFLNDPLKKDCILMAFYDTPMTCLVPHHPYYPYNGMKNQKTEQLNLLLQLAVTLVCVLALMILTFQKAAWERQGYGILFWILLLLLVGAASFISRGFANRNNYSWDAALTILDEISQRHPECGVMLLDHGVDQASLALLLKHYPALIDKEIILIEPLGSGDTVQFHAASHWKQQVQSLPQFPGWNLNLNKLSEVNSIHQRFKNMLCVGSGKIDSDGDLTVVGCRTSQDREVDFNRLTWLTDALCSLLE
GUT_GENOME159128_0035011-283MNEFAARIERDFPLRRKEKEKVEFRTWLVHTFKELGYTPKLESGESALAAGGSVTNVVVGDIDKAKIVLAAHYDTGVRELLPPLICPTRPLTFLLYQALYPLLLVAGSFLLSFAVTFPLNLPSMMLPLFLLLLAVALFYPKYGPDEKNNRNANTSGVVTLLEVAKTLTPRYRGEVCFLFLDGGTQTSKGAKRLRRAHPCLKEKPVFVVDCVGEGDELLVLPSKATRWDGDLLDAINENFVNGEKKTCFLKTDGQIIFPSDNAAFPHSTVVCAC
GUT_GENOME231234_0331118-272YIKRFRRKEKEEFRVFLAEELSRSGYKVRNDISKKRQFKSVNVVTDNIEDAEYIVAAHYDTPGAPGVILNAQHFLEKFIGGKQALILISLFILCLLPIMYREVAIYYFVLVLGLIEFMMFLFIPNESNMVDNTSGVIGVLALARIFNGNTKVGFVFFDNEEKLMLGSKKFSKKMCKEFPDFKNKVIINLDCISSSSEESLWNINSKINIDKHNNLLETISQKLVEDRVLLNKVTINSDQLSFKKNSTIAIGKYKE
GUT_GENOME007500_004727-256EDLTFYQKRKTLAEKRAFRDYIQKQARDLGLRSNIYPENKSCKNVIVGDLKNAKYVVCAHYDTAPKLLPLIERNYTYCHKFLEVLSLILGVGLLVLFYFLNILWLGICCIAISVFLFLRISGVLATSRKFSYNDNTSGIITLLNLMDTNKDFNQRVAYVFLDNKEKGLIGSKALSKMMQNQKMIVQKEDKKFIFIDCVGLGKDFTISWYRETKFVDKLKDLFLEKCDDKYTITLKEDSKVDVNDYMSFKR
GUT_GENOME000017_0174916-303QLKNEYGIRFRRKQKRKFLEYAQTKFKDMGYMTHLQEKSRINKHVVGNQNLVIGDIENANVIMCAHYDTPPRAIHKIEIIHGIRKSIWSKIKKYILILVFILILTSIFILILSIMLYLLNCLSLIDFIPITAYIVACLMTLGVSNKKNLNDNTSGVMTIFKIADELKELNNSKVAFVLFDNEEWGLIGSKIFANQNENINNKLFINFDCVGVGDNILILCDNYSIEEAKKIGENKQCSKEKNILVQEADLSKADSDERRFKNRIRFAAFHRNEDGELYIENIHTKKDN
GUT_GENOME244165_010547-227RDWYFRYSKVFRTRLSQKSKARFLSALLLDLHQLGAAARILQYGSGKTPIQNVYVGDVTQADVVVATYYDTAPYFWGPYFYFDRKKQARQTTRTLLGLSILWLFLGGAITGLLMYFHFFAHWQFLSWKSLAALAIYGPFFALLASFTRGSRFQKNTIRNTSSLLCLLSWIEKNRWGKGVAFAFYDQGAFGDQGLRRVQEEIGPATKLLVLEAIGARAPLFC
GUT_GENOME026761_010706-291RTIFAHFEVRKTKKQKQAFRDWAKDYAAACGYEMAVERGSFGARNLVFGDPPAAGVIYTAHYDTCARLPFPNCITPKCIPLFFLYQLAICLLLVGLPALAELLLLRAGAPFAAAHVVFLLLLTLLTALMLVGPANPHTANDNTSGVITVLEIMRALPAGQRGDVAFILFDLEEAGLLGSSSYASKHKRALRGLLVNFDCVSDGSHFLFIARRGSRADLPALARAFLPDESHTVEFASRGVIYPSDQASFRRGIGVAALRRTRRGLLYLSRIHTPRDIVFESGNIDF
GUT_GENOME233279_017119-283LDRWQVRKTKQQKTDFIGFMKKNFPDMKIEEGGRLIKSRNLVIGDVQKADFILTAHYDTCAALPFPNVLSPTNMAFTLFYNILICLPFFAVMAIICRLLSFMIDDFLIRYWISFAVMLVLLFAVFMGGKPNKHTANDNTSGVITLLELMNRLSPGQKERIAFVFFDNEENGMLGSAFFAKKYKGCLADKLVINFDCVSDGDNILVAMNNEAKTRFDPIVRDAFVPENGKAVYVSSNAFYPSDQMMFPVSIAVAALKRKKLIGLYLDRIHTKRDTV
GUT_GENOME096022_019179-304KYGLTIGRRFTRKEKNFFCNEIGKDFQALGYSVRGAIGKKKRTKGMNLMIGNVGKAKTIFVAHYDTLNHDFGNPIRYFPLDGNASFSSSFLPMNTPVILSMVLGLILLLGLGRKINFQDNLAMSVLILGILIALVVVSFMMTFRIGNKVNLNRNTSGVVTAYLIAQQLPEKLRDKVAFVLTDGGNGTHVGDYMLRDALPNTIKDRNVIILDCVGKGPRLGIGYFDASKANAEKLEAIVKAKDPEAKLHTSLVDEDHVKYTSLSFYEKGMIVCRGKNLNGSLIVENTATNHDDEIER
GUT_GENOME264363_0071713-286VRKSKQQKEAFRRDVALWLQNLGYEVHTEKGSFGSRNLVIGNPQSSKFLVTAHYDTCARLPFPNLITPCNFWLFLLYQIFVCAVLLFPAAAIGAAAGWLLHSSGMGHYLFSISLLAELLLMLIGPANPSNANDNTSGVVTLLTLAGSLPPENRKDVCFVLFDLEEAGLIGSSSYQSKHKKETARQLVLNLDCVGEGDDLCFFPTAKVKKDKGQLAQLQRLEGTYGAKALTVPAAGFACYPSDQMNFPRGVGICALRRSKAGLYLSRIHTPRDTI
GUT_GENOME255430_0041811-291DYPVRRKAQEKENMRTYLMGQLRALGYDAKLNDCGKAVNVIAGDPERASILYAAHYDTPLREPLPAILCPTRPVTYMLYQALTPVLALVLCFAVSLGVTFALSLPNLTLPLFLVLLIGALAYLKYGPSEKNNVNANTSGVATLLHTAEQLTPRYRNDVCFLFLDGGSDNMRGAKGFRKRYPSAKEKPVLCLDCVGSGDELLILPGKGARWNGELLDAINSSFENSERKTCYDKVDGLVHFPGDQRAFKQGVAVCAVRRVPGFGRFICPTGKDNRIDDENLE
GUT_GENOME083450_004596-283EEVGIHFSRRRSNRDKLRFMNRLVSQLKEGKLPLQMMQEKKSRGGMSNHLVVGEPKKSRWIVMASYDTGSKMLNPRYEFTPLNSKHNFREEMKNIAGYSVLTLIVAALAVFWTRNFPAYSMLWKILTVLADVLVLGLVWRWMKKPDNKMNMNRNSGSVAVLYDCAEKSRTGCFVFADKSVMSNEGFYELASKFGRQPVLILESIASGEELFLAFRKGQEKKADEIVRALECRVHLLPLDEEAYKNTPLEGFEKGFMLTRGSLRKGNVCVKNVRNGKDF
GUT_GENOME127701_005161-274MEIRDAVYTQYLVRNTYAEKRSFRFYFSKVLQSYGYEPIVQSARTMGVESNNLIGGSLRHCKTVFLLSYDTPKTRFVPQFRLPGRPGLQFFLNLLPMLLLAGIAALLVVFLRLPVLAAELLFLGSAFALYYLVNNKNNYNASSPLLVLNACLERMPQNPGIAVVLLDNSDLFSLGRRAFFKEYGSLLAEKNMIYLDFSGRGDTVFVEYTKKAAPMAQLLESESQEPAPVFWERKRPADLALLRFSCVGKSPIGYSIGAYRSESDTYADSGAIEA
GUT_GENOME005514_0131414-292RFNRRQKEKFLKHLDEIFEEQGYPSLRNEKRDFRGLTRNAIYAFEKSTKVYIAVPYDTPEHLFWYKSEYFPLDGGRSMNKNMIATYIPALIVYAIILIVVMFVAPLITNLAMQMFINLGVLVSTLFLLFLMFRGIGNKYNANRNSSSVIAAVDFMKKLDRDQKRRIGFIFTDRNQKNCQGAKLLSSYFAQQNKNPEIIWLNCVGEGDTIGIGYRMHGKRLASSLNAGKKSKASLLCEMNGDKCMQTVMAYFDKGVMICAGRRDDKGSIVVNGTQSAKDR
GUT_GENOME079933_010037-289EEVLSRFPVRKTRAQKEAFRAFASERFASRGWRVAEETGGIARSVNLIVGDVARAKIVFAAHYDTCAALPVPNFIAPSNWAATLLYQVALGVCLCVPAALVSAAVALLLRLPLVAGLAGCLCCAALVALMLCGPANRSNANDNTSGVCVLAETILSLSPDTPVAFVFFDHEEWGLLGSSAFFKAHRAQMQDKPLLNFDCVSDGSHFLIARKKGWRRNAALDARLQEALRGLPEGKTARTAAAWTVIYPSDQAHFPCGAAVVALRRHPLLGHYLSRIHTRRDTR
GUT_GENOME033667_0211750-306KYTKEYGVRFSRRQKRKASLVLFEDMKDAGYEGTMISGRKVFSKAENYLFGNIKTMKTVIVIPFDTPQHCFWRRVSYYPLNGNKSANKSVLPMFAPVILLYLMVFVMLWIADHFVTSPQLAFGISLFIYFLIFFLLYLMTIGIANRNNYNRYSLSVATAIELAQKLGKDEKRKVGFLFTDKNKTRYYGAQLAVDKFIQENKNPDFICIDCIGEGSVIQIGYNPNMRKMAVKLKQKENIDLVKLSETMRMQNAMGVFS
GUT_GENOME237438_00207196-446IVFCAHHDAAPLYGIKKDDRKKLFLSLHLPLIHYSIIGLVTIAGLIADIFMGELFSVNLPSPVVIVLLVIQLLTLFVYFWLWKLISDKYSPGIGDNLVSSCVLTQLARYFSWKKRSGEGFASTRLIFASFDGEECGLKGSARWFSHHREQLVNPVVINLDSLCSSSDLTFLTKDVNGLVQLSAPLAAELSRIADRMGYPSKVGAMPLFSGATDAASAAREGFEAVTVTAVPLDGASSVIHTEDDTLEHVER
GUT_GENOME091497_0021414-240RLTGEQKNTFLAFARGIFSSFGLSTALYTSRSRGIRLTGAVAGSADSAEYVIAAYYDTPRRRLVATPRIISRRVSTGLISAMPAILLFFISLLFWRAGAPFAICECAFLVCLFAIYYLAPNTTNANFTSSGLLALFSLARSVQRGVSYALLDGADLPADGISSLYKSYASRLDGKTVILLGNVGVGRRICVGYPKGDEALRTLAEDIAGADGFVYVHKKTYLRVDCA
GUT_GENOME121384_010839-301VRDHQARKSRAQKAAFRELVRAEMRKEGVFFREERTKGLLENVNLVYGNIATAEYVFGAHYDTCAELPFPNLAAPLNIPLSILMQLLLMLMLALPAALGAFLTAWLGAPVDVAAAVGVLLGVLASALVIAGKPNRHTMNDNTSGVVALLTLLSRLPKERRGRVAVVLFDNEEVGLIGSTLFRKRHARAMADRPLVNLDCVGDGEMLLLAPNKGYRADEELCARTQAAFPAGSLAPVFARRAFYPSDQWGFPKALAVATLKRRRFFGAVIDRIHTRRDTILKEENIEYIVEGLL
GUT_GENOME261708_0100011-294NFQTRKTKVQKAQFIKFLVSELDKNGINANIEESGSLIKSRNIVVGNVSKAKVIFTAHYDTAPKLPFPNFITPKNLLFYIFYQILILLPLFLMCFLVSFISMILFENVLIAYFICIFLAFGFTYLIMFGKENKSTANDNTSGIITLCEIMLNMTEDELAITAFVFFDNEELGLIGSSAFAKKHREEIKNKIVINFDCVSDGETMMFAVNGAARKNYADIIASAYTTYDDYKVKPFITKALTTFYPSDQMNFPVNIGVAALKRKRFLGYYLDRIHTKHDTIFREE
GUT_GENOME081675_0108625-292FGTRFSRKQKNKFKDEIAKEFSELGYEMKEIEGRKWLNKAKDYFFGNIKNAKTVIVVPYDTPERKIWNKVFYFPFDGEKTASKTMAATFVPIVALYAIILIFVFFGGKMTTNPIMTAMISLIMFLLLLFLVYFMTHGIRNTKNFTRNSASIIAALDIAEALSKDERKKVGFLFTDKNKTRYLGADIAGKEFKDAGKNPNIICLDCIGKGSTLQIGYRPQNRKMAQEVVKCDPNKKNIEVVKIDEGMQFQSAMAHFDKAIVISCGEVDN
GUT_GENOME096448_0280212-275FGKRFTRKQKDRFIGFVTKIMKELGYKVRTVTEKRKFGGNSVHMLIGNVEKADVVFVSSYDTASRILFPNYRYYPLDRQKNFKNEKRNSLLQYGIAGTILLICFLIAFFSGGVLNGQTHLWRFAALAAAVFGAFRVASGIPNKFNFNRNTSSLLLVSKLASTVKNRKKAAFVLADFSCNYYEGYRELQEFFGKELQSKKVVVLDCVGTGAPIYFAERKGRPSNDMERLKQIPTGLDVRFTELTEEQADDSVLYFFPDGVYVFSA
GUT_GENOME032999_0047912-283FAKRNTAKRKERFLAWLAHSAQDLGFEVHIDEKKKGKLASRNLLVGNFTKCENIVVTGYDTPSKVMMPGYTYYPFNFKKDHKQEMIALAIQMASSIFVIFLFYLLVRGFNDYNIWMKVAAVIVGVAMIIASFFLGKGMANKYNFNKNTSALCVAITLMQEAAAAKDNRTAFIFCDNVSASFYGYAQIAENMEKKLKGKNMIIMDCIASDGDVFIAHKKADKQLAEGLQKYLPKSRLLERPTDVPGCMQWFNNLMYVTSGTADEQHEVKIKNT
GUT_GENOME096500_0196113-292AKRYTAKQKSIFLSQVYQYFLKLGYKISYQNNSNKVNPVTNMVIGELDTASVVVVCAYDTPSRVVIPNYLYYPFNTKKNLREENINLFFQFILMGICFISIYFLLIFFKTYSLLFKIISSLICGILVIIAYKLMKGSANRVNFNRCSSSVALIAKLAQELKENKNISFVLLDQNVNSYEGLKLLKKELKNNKKLILYLDSLAFGTNLVCAHNEKMNDIADLLVNHLKDLNIINKTYKQERCRETMLHFFDNMLVLTSGEIIRGELAVKNSRSGKDYQLDI
GUT_GENOME103173_009525-291EEIDAQFPIRKTTAQKRAFEKWVMAKMREMGYRPYVDGMNRKDKLRHRNIVAGDPQNAELLISAHYDTAATIGVPDLRIPRNFPVYILAQGAVLLGMLLISLLIGTAVGLATKSGDWLILTFFFAYLALMLLMMFGAANKHNVNDNTSGIAALLETMQRLSPEAREKTAFILFDHQETGSRGAKSYGAQHVEVQTMKLLVDLNCVGDGDTFVISAPKMAKDKTEYAAVRESLEANAMASGVSTQFFGRAGVQGAGDYRRFVCGVGVSAYHHSAGVGLITGRIHTSRD
GUT_GENOME129765_0185223-242TKKSKQRFLSALVSDIYSMRTDVTVTAYDTLAYRSKNIYVGDIEKAEKVICTYYDTPVHALGSYFMFDWKDQRKKTIYSILLSFILLFSLGWWGMMIYNKNPHHVFDLLSVQTGITVLAFGSYFFLLGKAARGWPSRQTFIRNTSSILTMLEMIRTIDDPSVAYAFVDEGCYGEKGLDSVRSGMKKEAILFYLDSVGADTPLQFSGNYFSNKEQWLKQVD
GUT_GENOME236864_005217-297ITELFPVRKSTAQKKAFRQWVMAEISGMGYAVRVEENDRGRQQNVIVGDPEHAEVTFTAHYDTPSTILLADLQIPRNYAVYLLWQIILLGGMLLLALLVGAALGLLTQNGDVMTLAFFGTFVGLMVLQLNGFANRHNVNDNTAGVAALLETMARIPEENRAKAAFILFDNMEKGRRGSKAYAREHLEMQHTHFVVNADSIGVGDVFVAAAPPLATQLPQYALLEKLLSDAPEREVRFCSSVTTRFNSDFRSFKCGVGIMACRRISGIGLYLGKLHTSRDVEADQGNIEHLA
GUT_GENOME243755_0119810-312TQTYGCRFYSWQKKKFHAALDEDFKALGMEGSVLMKKRFSVFPRYDYVYGNLKQAKNILVIPYDTPERCLWYKNTYYVNNGTKNQNAGLVQTFVPIIVFYLLFVVLLYVLPPYFPTAGAQAAISLISLLLMALILVALTKGFPNKKNFNRNTSGILAALEIADAIGVDGRRKTAFVFTDANKGVFFGDMILKDELSFMNRNPNVIQLNCVGDGEELSIGYTQGNRKSANELVKCCKGDVKHEMCELKEEERQNSSMAHYKKGLLVTSGTMNEKGYLVTMKTASGKDCVLDDALIDTVTEMLKR
GUT_GENOME142215_014777-316RYTEDQKSKFINFIISKFREEGINLKEVSQNGNRNVVSDNIDSAKIILTAHYDTPRRFSIGDQIIHKNMNMFRIIFIVALILIADILIVPLINLILKMLEKNIIVPIIAIIFISIGIDYLFKYLVSLYYKRNKRKDKLNPNQQKYAYKNCKKANENNYNDNTSGILCLIEVAKKLNIINEEKKIDSVGFVFFDNEEKGLKGSRSFVKSYHDLESEIIINIDCIATEGDYDTISYIIGNNKNKEKTKVLVDIIKSFKNKSDEIKVKRVIKIFLKNYIVRVCYLFLCNSVTSSDHLSFTNYNALSISRKKFN
GUT_GENOME172969_0159015-273RYTQKQKTSFMNYIGTRAQELGDKAVADGLDQSGASACRNIYLGSLNHADFILTAPYDTPARTLFSGVGYYPLSPRKNMTWDSLSLVLSALLGTALVVIYVLFVLRRFLSIGGTPMWLALAGMLAVSAAAFKLAAGTANRRNFSRNSASVVAALEYRKRYGKKVAVALLDRSCCSYAGYRQLAERLGGLAQSKKIVALDCIASGPEVHVHCGRLVGELLEKAPWDDMNQLHILDGEPLERTILKLFPKGLLITGGSWKT
GUT_GENOME235928_0300415-304RLTFKQRSKFINYIGGELLKNNIECSADKYSNRIDKEMSLVVGNLKTAKKIIVAGYDTPRKILFSKFIYHPLDWEKNRKQEKKNIIIVLIISIVATIICGLGIYYLKYVTKNIRLILYALIVIVMILVVKLNKGYANFYNFNKNTGGITVLFKLAQYVNVSSRKTAYVFLDNTAEGYKGYERLKELLDEFNNKNPQLIILDCIAEGENIYVAFNEKKELNNSTLIKSLSNYFDLKILKLNDEEYNKSPLSLYSNGIMIFRGKESNGEYIVENTRTKEDSQCDIDKLNNIY
GUT_GENOME104705_0009012-279DLGCRFYAKQKAAVRESLVEEFEELGYSSQVQSKRNLILKAHNLIFGNLKSTRYVIMVPYDTPGRIFWPKLHYYPLDGTKSSAKATIPQYGPVIIFYFLFLGLLYVLPALALNAYAYLTLSGVSVVIFIMIFILLFRGVPNKHNANRNTSGMLAAIELARSLTKEQRRQVAFVFTDRNEGRHFGSKLVADALKEANKNPMIILLNCIGKGEKLVLGATNGNKKTAAEIMKKYKGEQKPSVSVLTSDMQVQSPMGYFTKGISIARGDID
GUT_GENOME056098_010383-253EISQEIFRRFQVRKSRAQKAEFIAFMREQFPELRVEEGGRLPRCRNLIIGDVDRAEILLSAHYDTCARLPFPNFICPQNMPLYILYSLLICIPFFLISALINFGLFFISDSYLLHMAVSLVVIFGLIYLVFFGGKANPNTANDNTSGVLTLCELYAALSPEQRAKTAFIFFDQEETGMFGSALFKKLHPQAARSKLLINFDCVSDGEHMLIVKSRAAKKLYGEAFAVAFTDGMGKTAHHGGSGTFYPSDQM
GUT_GENOME095707_0087317-291VRKTAYQKQEFRTWAMGELKRSGWKAREETYGKFNGSVNVISGDPEKAAVFLCAHYDTASRMLFPNFVSPTNVAAHICYHLAAALLLVGAALVISFAVSYPLQQPGLMLPLFLVLVVGALWVSAYGPANKSNANGNSSGVLALLAAAKMISHDKRICLVLLDNNEKNMLGASAFKKRHPNAADTALFLNLDCVGDGEHLLVMPSKLSRWDGGLLSALEAAFQDEVGVHPHVLAKGLQYYPSDHRKFRFHAAFCTCRHLAGLGYYIPHLRTSRDTV
GUT_GENOME237061_0135213-292MRRTKEQKEGFQKHVIESLQAKGVTAKIEKTRDGKNQNIVVGDPLTAKAVFTAHYDTPGRALFPNIMIPRNQALFWTYQFVPILALLAMMFVVGLIVCVAWILITGDFDERVTFIAMLLGYYVGFYLMFFAFSNKNNYNDNTSGVATLLSIIDKLSEEQLRETAFIFFDNEEKGKKGSKGYFNDHKEEMKDKLVINFDCVGNGKNIVFIAKPEAEKLASYSALRQSFGSENGFETAFYPKKGSQSNSDYLNFPCGVGCMACKKSKKGLLYTPYIHTKKDI
GUT_GENOME021812_0083412-257LSQRYTLKQKECFLEEVIKKCKDNGYKTSFISKHGSYYQLANLVIGDLDKAKTIVCASYDTPRKSLIPVNYYPFAAKNNVSMGKKNVIVDAVIMIIMFLLSYVILLNFNEASTSIKSLKIIMLAVLIIFTFFVLKGTANKVNFSRNSAAVAMVMELMENNQNPSVAYVLLDRNTVTYEGLKALKEKLQDTSKEIILLDALSYGKDTCIAYNKNVDISDFEHDTYIFKEYQNPTNALQYFKNMIQIS
GUT_GENOME001314_021303-277QELFEKYTKTYGKRFTRKQKHKFAKALHEDMTQAGYEKTMIKGSKLIIMKAEDYFYGNRKRMKTVIVVPYDTPERKFWHKVFYFPLDGTKTMNKTLIATYVPIILLYVFILAGLYIGGNFLSAPTMANLMSFLMFVLVIFLIYLMVHGIGNRHNVNRNSSAVAAAVALAKSLDKDERQRIAFLFTDKNKGSFLGARCADEDFVKDGKNPNVICLDCIGKGDTTMIGYNPSNRKLAMEIAKCYPEKGKQIETIKLDESMRVQTAMSYFRKAVVIAS
GUT_GENOME095719_017856-263KNWILEYGIIARKRFYRKEKIRFIKRIEQDFHKMGYKTSILSPQGSKGHAIDLLIGDVKTAKNVIISYYDTPVKTCGIHAYKPFDLRHRNIWYICIVAIPLMIITIICFLFSTTLLKIEWLKGIFHLKDLWAILIYITGIIFIIHYAKGIPNGRNLNRNSSGVITLLKIANDCDEKQRSTTAFVLSDFGCINHLGIQMIKAYIETYEHTNFILLDCVGKPYTNAVNYTSSFSTNVLKLDKHIVRIPISEQNNAAQIFP
GUT_GENOME277285_0126711-281KMNTNRWMQEVNARFPVRKSKAQKERFRQYVLQKAQEMGYAARVEENKAICTNRNIVVGDVDKAKVLVTAHYDTPATVGLPNVMLPMNRPMFYLVQALIALVMVVFIFVPTGIVKKLTGSIFCTEATLIGLYCLMMYLLLAGVPNPHNVNDNTSGMCGVLALMESFAAEKPEEIAFVLFDNEEKGLLGASGLAKAHKQVAKKTLVLNMDCIGVGEAMLMLVPKAAREKYPALGETARKSSGIPVVLGDMEKCNFSSDQKHFKLGVGICACR
GUT_GENOME020145_0147020-280RFRKKEKIRYINNIEKEFKSFQYETSILSPVGGKGKSIDLLIGDVKKAKTLIIANYDTPIKSFGLYKYKPFNVNFQRNTYFVTVFITMLILCLVCFAYTLQFVRIQWLEGIFKASNILHITIYLLFIYGIYKCSKGIPNRTNVNRNSSGVIALLVLAKHFKDISKKDVAFILTDYGCINHNGDMMVKEYLKHIDKKNVILLDSIGKGESLAVNYSPSMQQIVTQLPSHIEKYLRDMKEYSMANIYKNCLIFSRGIKEKDFF
GUT_GENOME246218_0164519-305RHPIRRGAKRKEAFLREAAEIFGALGFPAGISADRFIINNKNLAAGDIEKAQYLFAAHYDTPFRMFVPINVIYPKSIVLSLLYQLFMLALLFAAAVALAIGINALIPLGKYLFMLPLWIWVVSVALSCFCFPNKHNVNDNTSGVIAIMRLAHALPASIRERAAFILFDNEELGLFGSAAFANAHGKKIRGKTVINMDCVGVGDDIVFLANREAAKDEALMGALNTPPAGNRVKRVIIGKGKGWFYPSDQANFKKSIAVAAFKRGPFGLYLNDIHMDCDRYLDSNNIE
GUT_GENOME067683_009471-193MLTQPMDVLEQFPVRKSKKRKQEFRQEVSRLLESYGYTCHEERGSMGAVNLVAGDPERAKYLITAHYDTCARMLVPNFITPCNAAIYLLYQLVLVLGIVGGSALLGIGAGILTGSPGVGELVYLVCLFGILYLMMAGPANPTNANDNTSGVVTLLEIAERCRRTGGSRCASCCLIWRSWGLWARLPIKRRIRR
GUT_GENOME024450_007132-293DLKDTMMLYGVTLGSRKTQRKKNLCIETIKDTCEKLEVPFSIVQSKAGMFTVSSVVMGNLSEADYVVMAGLDTPSQYFTGLNQYPFHPEKSSRAENIKNVLRLLSVIVLACMAYFPLESIVSQGVKLWSILTLVVLLAGMVMALFPKANPYNFSKSSSVAVLVKLMEECKDNAHIAYVLCDHTSSGYIGYKAVKDKISAHSKVILLGPMASGSKTVVAYKEIDNQLGSFLSDCISSSVLQRKYTEEESERNCLSLFAHCIYIGCGDIENKEFVVSNTACKKDVEVDMERLQR
GUT_GENOME048012_000987-293DVLTAFPVRKSKQQKQEFRDAAQSYLSQFGYSASVEKGSFGCRNVVIGDPETAQYLITAHYDTCARLPIPNLITPCNFWAFLGYQLVVMLLMLLVPAIPGALAGLLVGSFDVGYYVWFLCVWAVIALVMIGPANKHNANDNTSGVVTLLEIARSLPESQRGKVCFVLFDLEEAGLIGSAAYRKAHKKASDSQLVLNLDCVGDGDHIRFFPTKKLKKDRKRLTSLYKACGYFGKKDVLVHEKGFSANPSDQANFPYGVGICALRKKGKTLYLSRIHTPKDTILEETNV
GUT_GENOME043642_0122915-254RYTRKQKEVFLAEVCRQCRQRGWKTEFQTRHSRILHVCNLVIGDLAHAKTVIACAYDTPSHAHLPIRYYPLNPRKNTQAEGRNLALEWGLGAICFLMAAGAVYALRSLSPAMKVAAVLAASLAALFGVSLLHSRGNRVNFNRNSASVALILRLIQEWDPKSSTALVLVDQCCNSYEGLKLLKEACPSRAEVILLDCLAQGEKVVCAHRKGVDVADLLEEDWINKEYEDPDNGLRFFENGM
GUT_GENOME079795_006594-237QDNMKERLLVLSQRMARRKSPKQKKRTLYQLSEMFSALGYPVTLMGKKGIHGQQLLGGDPTHAKVIVAADFNLREARLIPARQQLLNQSANRRNQLDNLTVSLVLTVLLASMGVLLCIAGTKQEGRWYLFVLAAGVFLLAFWLGRRQTAGNAGKNAAMFALLECARQYRKEPAAFCFLEETGTELGLRQLRENFPQARLVYIHQLGREKALCAMLKGKQTRLREALAENWPSLM
GUT_GENOME009672_0127912-263FGKRYTVRQKVRFINYIQNREQKKGIQVQVKEGRGCGNRVCRNVYVGNMKGADTVIAVPYDTPPQRILSVWNYYPARADKNLRTDTAVMILHILAAIILTIVYFLILRNLRQMGTLGMIVLLFPMACLDVKLIFGWGNKNNVSRNSASVVTALEFLDRYPKKAAVALLDQSCCGFWGYKCMKEDLQEKAAVKNVIVLDCISSGDSLFLHYNEGERLQIQDREITLKPIMQDKKDQSVLGLFERGVLLTGGED
GUT_GENOME243819_0094612-297FAKRNTPVRKKNFLKWMMENCKQMKVKHKVDEKTEKRMTSTNLICGDLKHADVVFVAGYDTPSKVYIPGYQYYPFDEKKDHKMEMLSLVIQLLMSFIVLIALFFLIRGWSGYNIWLKIGCVIGSAVLLVAAYLFGKGAANKYNFNKNNAALSVAMTLLTQSVDKKYENVAFVFTDFSSASFMGYRQLSQMREISNKNIVILDCIASEGELFAAYNNGGKECAMSLKQIDGDIQTLDYTDAAGVFGLFRKLTYVTSGKKVDDRQVAIAHTRSRKDSKADFDRIEKVY
GUT_GENOME173064_005796-238QMNDWIVRYGFLLKKRNTDKQKDKFIQTFLSDVLNIRDDINVIEIEEKKRKYHNIYIGDVSKADKIITTYFDTPIVSFGDYSFTDTEKNKRNTLTRIAFESVASLCIGLGIFFFLMRVFEGTLLTTVLTVFALAFFYVFNGIVKGRPSSKTQVRNTSSIIEVLSLLEKYKKNKRVAFAVVDGGCTNGIGFVALRNSVKAQSKIYELDSVGANTTLTVTEDSNQKSFKIHPDLD
GUT_GENOME210470_003583-294LQDLMMLYGVTLGKRRTAKQKYLFAQQLNESLPPLGWPVRVQQREGRFSKIENLIAGDLANAKVVIAVPFDTPAKALRPQKYYPFHPDKNVQQRGRDLALQCVLELLCFGAACLMFWSGRGTGAMLPRTLAALLLAGVGVWLLLPHASPCNFSRCSGAVATAVKLAEDLSGEENIAFAFCDHAVDNYDGFRLLAQEMPTAATVLLLDNLTSGPAVALAHGEYRKEAAEQLCSLLPEDKVYDRNYPEEQRQRNLLALFPRGMMLSCGRVEKGEFVVENTCCAQDHVLDLPRME
GUT_GENOME184197_019353-250QTEAFRDWLFRYRYVYRARSTDKSKQRFLKALIADIIPFRKDLQVIEYDHTKKNASRNLYVGDLTKAKRIICTYYDTPPEHFGDYHFFDRQEQGRKTNQFILTASAVMILLGLLGTWLYIHFASGRFPLLSWQTALFALGVGVYFLLLNRVSHGAGFQQNLTRNTSSILALLSLISQNSQTATAFAFLDEGSYGERGLEVLRDSVRPNAKIYYLDSIGADAPIRAIGKQFNEGQLQQLAIEHSSEAMG
GUT_GENOME244005_0191815-247GKRYTRRQKLKFLLGFTQDLEELGWRTEAKETTEKNLKNVNLYIGDLGKARVIAEAYYDTPPVSLTPGAYRFFSVSCRRNERMYAVIVPMLLILAAGALFFWKASASLFDEAGRLSRAGVLNVAVLFVLLALMYKCRKGIPGKRNVIRNTSSLLALYRLAEQERRNKRLAFVLTDDGCGAGLGARVLKGRKRRDQIVVHLSCVGAREQLYLLYDSGTQNEAGLPGLLKLCAEN
GUT_GENOME260354_015253-218GRRCMRAEKHTFLSLVEKRTGGVRERAQAFGLLSENLVIGDLVSAETVVMAAYDTPRLRAMGCAYYPASRGLMAVQWLVRTAFSAGLAVICCAALRLPVWAAETAALAVLFLTCMAFPNRRNMNCNSSGVAALLDLWNRHQDEKTAFVLLDNDDLLHLGRRAFLKAHRRELAHKTVIELYCVGRGDVFVVSCGNDSPPIDSSEHSVVTVRRGRQPF
GUT_GENOME103851_0176915-276RKSYNDKSKFIEYLKTIFNKFEIKFNIDNHGKIIKSRNITVGDISDADIILTAHYDTCAVLPFPNFITPKNIFIYILYSLFLAFLMVVTASACAFISNLLFHKAFISRFVYMFVVFLDIYLLIFGKSNKHTANDNTSGVITLLSIMHGIKTEDRNKVAFVFFDNEEVGLVGSSQFKKKYRDLDDKLIINFDCVSDGDNIMFVCSKKAKFDKIFEVLKENLNIEKEHYKKNVLIESTKNTFYPSDQMLFKKSIGVAALKKKRI
GUT_GENOME227828_007734-288VSKTILTNWQLRHTRKQKKAFAEYAVKIFSDLGYYTRVETKKGFTPSNNIIIGDLSNAKTIVTAHYDTPSRRFIPFFVTLDSTFQTFLTQFSTMIAVFAVCMYLAKYTNFFIGFVLFDLLLWLLVFGPAKKNNANNNTSGLMALYEVASRLPRDRKEEVAFVLFDNKELAHAGSNAFVKQHEVEFFYNKLVINLDCVGVGDAIIFLASKHSIPFAKKAATFAPKDCGKEFSARVNRNFVYISDSNSFPVAFNVSAFTMDGGPILKDLYNEKDTVLDESNIDALAS
GUT_GENOME201177_0176719-314MNEINRKVLKHYQIRKTRKQKEEFRKFLVDELKQYGYCPIIQNKGINNNIIIGNMDTAKIFCTAHYDTQAVLPFPNLIVPNNLFGLILSQLVIIICVLLLIFFMNIVLIKLFVWTGLFKNIELNIASTISWLIIMIWMLFGKANKHTANDNTSGVITIIESAIKLPKDLRDEVCFVLFDNEELGLLGSTAFAKEYKEWIKNKLVINFDCVSDGNDIFFFPTKQIKNDYSKHESIINAFASSNYKRVHINKGFGFYPSDNFNFKYAYGVCSLKKGWFYYLDRIHTSRDIIFNEDNIE
GUT_GENOME026761_0020919-273RRSYEEKTEFLNGLAMDFGELGMPVSLLEDKPSRTRDLIAGDIEKAGVCVISHYDTPQNLMSRFFHLFPFDRRQTALYARRAQSARLYFAAIAGFLPVAGGIVCYLRALPVWSVVLCGLLAAVGLAYLTLAIRGGGNRCNFNENDMAAVVALRAAAAMDEATRSRMAFVFLDNHYSGFKGEKLLIGQYRTPLAGKHGVVLEAVACGQKLFVAGSVPQREKMASLSCCPDKTEIYERPVENIGPALASCVKLSAGR
GUT_GENOME256557_021195-294QDITARFPVRKRDEEKEAFRKWAVAQGKTLGYSARVEELARGRHCNVVFGSPEHAQVIYTAHYDTPARLPIPNLMTPRNIPMFALYQLGIILVLLLAAAIAFVGAQLLMRNASLSLIIALVVYYALLLLMIAGPANPNNVNDNTSGVAAVMETMARMPKEQREKAAFILFDNEEKGRLGSRAFAAANPKIKKQTLLINMDCVGVGEHILVIGKNYARAKTEYAMLEQSFTPRDGLQPHCYGVTGSVCNSDHQAFRCGVVIVACRRKKGMGFYTTDIHTRRDTQADQKNLD
GUT_GENOME078823_032036-239VKDVILRYGVLLGKRNTERQKTDFLRATQKQLEQAGFPVVITCVSASLMRRESVNMYNLYAGDFKKADVVFITYYDTPLRQFFPKEQKAFDANWSRGNFLLHTLLFLCCMIATAAFLYLTVIPSLQKHGFVSLWGAVLVLVCLFGFYGIRHMRGGIAAGNTMVRNSSSLIALFALACDLSEQEKQHVAFAMIDEGTRSEYGLRMLQEYIGKKRIQRVYLDSIGNVGKLQGFADR
GUT_GENOME097742_00373140-380RRRIIFGGHADAAYEMRYFLPGFRRLLWPLTAGATLGMLALILLQSFLLARRLGAALPSSSGLWSGLAVSLFAPFFAGILLFINWGVVTDGANDNLSGCFTAMAVIKELSERHTRFAHTEIGCLITGAEEAGLRGAQAFARAHRSELAGIETVFIALDTLKDPAQLMVYPRGINGLQANSPEAVGLLQRAAAACGYPLPKAPPYPGATDAEAFSRLGLAACSLCGVDHHPQPDYHTRYDTS
GUT_GENOME158602_002005-301LEYLTKHLPVRFSKRQKQNAAAWISDEMEKQGYLCETIRHKKYFTSVRNIVAGNLKQAKTIIVVPYDTPSRVFWHKFLYYPLDGNLSARKSLFPMYVPIFLLYFLMLVLMYGIPYFMPTPEAVSAAYIAGIVLLVLIVALLAKGIPTRKNAIYHAGVAAALELAASLEKEQKRNVAFVFTDQNSGKFYGAYYVRLRLDELKKSAQVITLNGVGAGGELVIGYTKGQKKNAQELIKAAGKHHRVASKSMDSEMLVQSAMQYFPKGIVVASGVYDEKHNLYIDRLRRNHDDQVDEKQLE
GUT_GENOME218103_014352-274SALHPISREILASWQVRKSRAQKEVFRAFVTEELAKAGYQAKTEEARMVVKTRNLVIGAPDTARVIFTAHYDTCAVLPVPNFITPLCLPVYLLYQLFLTALILGLAALVAWVPMLLGAPPVAYALTLLVCCYALLGLILAGPACRHTANDNTSGVIAVLETALALPEELRASTAFVLFDLEEAGLFGSAAFARAHPGVKKNTLVVNLDCVSDGSELLLVLPKGLPAEAEEAIRASYVDQKDKTFLFVPGKKALYPSDQANFRRGVACAAMRKT
GUT_GENOME237835_002837-287EILDKYQVRNNKKQKTAFAGYVERLATEWGYSFKTEKGMFGARNLIVGDPSKAKVVYTAHYDTAPVLPFPNFITPRSIPIYLLYNLGIVAVFVAVALLIGGAIGFISAALDFGASFAFYGGYIAYFALLFCMLAGPANKHTANDNTSGVTLLIDIMRELDVDSRDDVAFVFFDLEEMGLFGSSSFAIKHKAEMKDRLLLNFDCVSDGENVLFALCKGAVDYSTKIEEAFAEKDGFKVQIASKGVIYPSDQANFKCGVGVATLRSTKGGLLYMNRIHTKRDV
GUT_GENOME113641_0131814-291VRLRKRQKKAFEQLITAECKRRNIPCHVEQSPIVKSRNIVAGNLEKAKVIFTAHYDTQAELPFPNLIFVGSMWKFLAAQLFMGLMMCIAIFLPAAVLSAAAVRITDSTFAGLMVFEFLIWFTLFMMFFGKANKNTANDNTSGVCVLSELLFSKEFDGEKAAVVFFDNEELGLLGSSYFKKLHGKSIRDKLIVNFDCVSDGDFITLVLGKRVREDGETVRIIDKSFTADLGKTIVKGGSNKYIYPSDQLHFKKSVAVAAMKKGPLGLYLNRIHTRRDTV
GUT_GENOME118218_019769-307LARWQVRKTKKQKRAFEAFLLRALREAGYAVEALAPVRHMAGNADARAEECGALLKNRNLIVGNPDTAKVIFTAHYDTCAVLPVPNYITPTNLLVWIFYQLLLVLGMFLCATVLAALIWLLPLSEAALFGASTLMFVAVLCFMCVWMIAGKANKHTANDNTSGVVALLEAALAMPEERRKEVAFVWFDNEESGLFGSSAFAAKHREAARNTLLVNFDCVSDGDPFLVVLPHRMKEEPLADILRASFMPRGVKQALFPTTRKAFYPSDQLHFKRGVGVAALKRGKLGLYLDRIHTREDTM
GUT_GENOME001204_010868-298RYTKEYGCRYTRRQKNKFLEVLHKEMAECGYESTDIQGKRLFSRANNVLYGNVKQMKQVIVVPYDTPEKRFWPTVRYYPLNGSKTISKSMVPLYVPAIVLFVVLFVGVYLVQPKISDAMAGLLVSSSMFLIAILLIYNLLHGFSNRKNYNRNSAAIACAVEVAHSLSKEERAKTGFLFTDKNKQYFLGAEASANYFHEQHKQPELIVLDCIGKGSCMQIGFNPQNRKLAAELAKHDPSKTTIECVKLNETMRSQHMMAYFKKAVMVACGELEGEDLCVLGTHTGKDTELDE
GUT_GENOME142060_0239711-290VTKDFGIRHNYNEKTNFLEFIDKELKFLGYETEIVQGKNVKQCRNLCTVEGNADIIFTAHYDTPGTMPKFLGFIFKLFGHARQIIGSIIYIVIILLLIRFIKNALNISISYMVVFILIIFPMLIKNKKNYNDNSSGVITLLNIAYELKNNENLKGKADKIKIVFLDNEESGLLGSNLLSKYWQEKDEYFKEKKIINFDCVGVGDIPIVYYSKELDYELADFLQNILSCCKKDSKKFMCRYYPLSDDYSFKKNPAISIIFSNKSIIPGGYYIPNVHCSKDN
GUT_GENOME255826_0042615-262RKTRRQKDAFIDYLSQKVKELGFEMIVEEGGYFKSRNLVAGDVDKAKVILGAHYDTCAQLPFPNFLTPKNIPVYLLFNLILLAVIFLLEFILSFILYSLTHDALIVSLGSLILCIALIYMMLAGKPNKHTVNDNTSGVITLLEIMGELDEKMREQVAFVFFDHEEIGLFGSMAFVKKHKKALEDKLLINFDCVSDGDHLMFILNKKAQAYASFFQEAFKCENKEIIVTKASNTLYPSDQLNFKCSAGV
GUT_GENOME236864_0052210-296KFPIRKSGKQKEAFRDWFIAWARAQGYSAQATTPRGIFHSTNLVVGDPETAKVVFTAHYDTPAVMPLPNFITPCNVPVYFLYQLLLVPIMLLPAALIGPLVGHVARVLTDNFELARQMGALTALVAVYASLGVMMFGPANKHNVNDNTSGTAAVMELMTRLPEAERAKAAFLLFDNEEKGMLGSSAYAKSHPQIKQDSLIINMDCVGDGENMLFFANKRTRALDSFPLLEEAMQGVQGRTYVMNRMEKCVYPSDQRSFRHGIAVCACNRAKGVGYYCDKIHTKRDTV
GUT_GENOME142577_0129216-243KRRTKKEKIRFLRSLSSEARHLGYAENEVSTKVAKINHKENYNLYIGNIEEADTVLTTYFDTPPRSVLGLHYKVGDARNNYKWAMISQILPFVVYTIFELLVIALLVFPMLRINNVWQQTTAAILAVLLIGVSLKLKNGLSQSSNLSNSTASIILLLSIMRQMTTAQRKRVAFVFTDNGMVNNLGLKVAANFISYHAKKEPQIIYFDSIGDARNIRIFHDDKETFSGK
GUT_GENOME261082_0127719-296YGIRNSNKDKQRCYSYIESKFAKEFGLDVKFDKLSVGISKIGLCIAGNLDKADKILIAPLDTPRKSYYKQYSYYPFNEKKSDRNHMLAALINGSIGVLLTAAVCWLVYLLFKQFWLSCICAFVSFLLYLYSLRNIFNFSSSAPLALFTYIAAHRRKDEQIAYVFLDHSAQNYLSLKLFLVKYKKLCERADAVIYFNNLAHGQHLVSAYKELDKNKSQFIEACEAEGLKIEDEGVLHCFESSSKLMIMTAAERDQSNEYYVKDIRSRRDKTVDVKRLKR
GUT_GENOME058076_003352-293LKDRMIAFAGIFGKRYSKKQKIRFLRYIQSSAQEKGVKMYLDEGEASGKSGCNVYLGPTQRAKTILAVPYDTPSRLWWPGFHYYPMDPERNARQESLVMALNFALAAVLLAAYWFAVFRRCFSAGGPIGGLALFGLLLVALLAVNLVLGSANAKNYSRNSASLALALEIFDGLQDGSVAVALLDNACAGLDGYARLAEYLGGRARQTNVVILDCVMSGSELHGHCTKERIEALAARNGDIQWHEAASEGNAVLSLFPRGMVITGGSWIGDATVVKGTRSGRDDELNMERMER
GUT_GENOME150015_01842102-301AWICEYILYMRLTDFCYKKKEGENVLAVRKASEETHQRIIICTHIDAAYEMPFFLYMKAWMIYLLIALADGGLIFFLISSALNAFGLIGELATIELCIAMIFTACSLIIFLFFVNWKMVSPGANDNLTGCFVALSLLKELSENDKRMKYTDVCCLITDGEESGLRGSFAYAESHKQELLETNSIVIAADTFNDKKELMVY