UHGP-MC 16556


Information


Number of sequences (UHGP-50):
84
Average sequence length:
287±41 aa
Average transmembrane regions:
0.04
Low complexity (%):
1.4
Coiled coils (%):
0
Disordered domains (%):
3.8

Pfam dominant architecture:
PF01807
Pfam % dominant architecture:
238
Pfam overlap:
0.02
Pfam overlap type:
shifted

Downloads

Seeds:
MC16556.fasta
Seeds (0.60 cdhit):
MC16556_cdhit.fasta
MSA:
MC16556_msa.fasta
HMM model:
MC16556.hmm

Sequences list (filtered 60 P.I.)

Protein Range AA
GUT_GENOME281802_0066815-327SSRHQCPSCGHKGEFALYVDTETGQPLHPSVGRCNRDQKCGYHYTPAEYFKDNPDKKPKDDWKPTYVPVQQKPASKIEYLPVRYIPTLKQIEKNNLFRFFCSKFGGSRSREVFEKYKVGTSKHWRNADGLSAVFPQIDANGNLRQMKVIAYNPETGKRLHKEHPAEKMISKGYVLDVQQDKVWFAGKSLLCNFDANLVQCFFGEHLLKTGNDVSIVESEKTAMIASLYLPNSIWLATGGKNGCRWTSSDVCKVLTGKNVFLYPDLKCLQEWTEKAEILKSHGIKVHVSRYLEEHATDEDKGKGLDIGDYLLQE
GUT_GENOME234951_0169322-309CPVCGRKRKFSLYIYTETDEPIDPDHRSGLCSVCGYHYPPREYFADHGRTTEKFTPTQTASASRQWVEVYDTISRKKVEASHRHDGTLYHYLISIFKKEAVDAAWDRYNMGITKDDRTIYWQVDRMGEVRTGEFIKYLPNGHRDHATAERWAHNAYKDFTLEQCLFGLHFTKNNSTPILLVEAPKTALICSICYPSHIWVAVGSRDQFKVSKLWSIRFHRVLAFPDVDALQKWQDEAQSLNNKGYHLEVLDFRELYEATEDEVKELGSNGDLADLLIMRSKPPYHIPQ
GUT_GENOME095416_0024816-303GGKIICPNCGRKSFVPYIFNDTSDIIDPTCGRCDHEQSCGYHLTPAEFFKDTPDVKHTTNHEAYSKPQPSIINFCIFDEWKVRQTADEAMQCDFAQGLLKFFEADKVAQAINRYHVQSIGFNRNTAFPCISVNGSVTDVMCLGYGSDLHRNPDVCYHYYGDKNQKAQLDVLYPNKYTYNPCYFGEHLLSESTEKNIGLVESQKSAIICSMMFPQMTWLATCGCGNFNVTKSHTLKNRRIFVYPDKGSSDKWGKIVETMKRYSYNVHLRPIMEKLQSYGANSDIADVIL
GUT_GENOME038551_00167150-474NYSLQKYKGTATRHTCPNCGDRHSFAYYVDDSGVPLHPSVGRCNHESSCGYHYTPKEYFHDRPECRTAKSFSFDKNKTERKTGQASRQTVTGYISPQYVERSQSVHSNFFRFISSLLGSYYGSKAKEVLRRLLEEYRLGATRDGAVIFWQIDRENKVHTGKVMQYNPENGHRIKGGEALAVDWIHSKLKRQRVLPEEWQLSQCLFGEHLLNVYPDKVAVLVESEKSAVIGSAIFPDYVWLAAGGKSQMREEKLRVLAGRTVLLFPDADAYAEWKQRAECMSFCKVIVSDLIEKNATPEQKAAHIDIADWIVFQIRESKIMCTASH
GUT_GENOME038390_0267837-361MSTHRFILEPYKGISTRHTCPNCHRQRCFSKYIDTEKQIQFPKYVGRCDHEQKCGYHFTPRDYFEQNPSEKDKLAENSFRNYAPIKEIQPIATSYIDLDIVNQSLRGYPTNKLFQFLSAQFGETETLKLMKKYKVGTSKYWDGATVFWQTDNQNKVRTGKIMLYNSETGKRIKEPYNHVTWVHSVLHKGDYNLKQCFFGEHLLSEDKSRPVALVESEKTALVASYYLPQFLWIASGGKNGCFNANSLSVLAGRTVVLFPDLGATDYWQSKIGLMKRYGIEVQMFDYLEAHANEKERKEGYDIADYLLKVKPDEAILQQMIKRNPV
GUT_GENOME218838_0163817-315RHCCPACHKKRCFSRYIDTERQISFPEDVGRCDHEQSCGYHLTPKEYFERNPQAKPLHGDFATPSAWRAKPTEQRKPTSFIAAETVSQTLHGYEKNNLYQFLRSKFGPEDALRLMKDYRVGTSKHWPGACVFWQTDVNGDTRTGKVMLYDAESGKRVKHPFNHVTWVHSLLKLPDYNLRQCFFGEHLLPMNRGKPVAIVESEKTAIVASYYLPEYVWLATGGKHGCFNADTLSVLKGHQVILYPDIGATDQWRQKLGLLRSLGIEASIFNFLEDVATDDERTAGLDIADYLLQIEPDQA
GUT_GENOME175862_0162314-293KYICPACERKTFVLYIDNYTGSPLHSAVGKCDRADNCGYHYPPKEYFTDNQIPFDSKKRSVSRPKPPPKPQPSFIDRELLRKSLWSYEQNQFAQWLVGIVGEKQASEAIGRYFVGTLLNGGTCFWQIDLQGRIRTGKIIMYDKDGHRRKDVTPPVQWIHSALKPNFTLSQCLFGEHLLHDATKKVAIVESEKTAIIASIYLPDMIWLACGGCEGLNADRCSVLKGRNVVLFPDSGQYEKWSAKAKELSRICTASVSSLIEQQATEQERKAGFDIADYLVR
GUT_GENOME149574_0056218-318RHTCPQCGRPHCFTRYVDPGGNMLDETVGRCDHESSCGYHRTPRQYFRDHPELSRTRQHRPTTAPRPSQHRHTAHSPLCTIPQDILRRSVRKDIPSTFTAFLRTIFPEDTVAALTDTYRLGVTRSRDVIYFQIDTAGRIRTGKIMKYDPATGRRIKDPGTPFRIGWVHSLMKRTGQLPPGWQLTQCLFGEHLLSEPDSRDRTIALVESEKTAVIASAIMPKYTWLATGGKSQLSPEKLSVLRGRKVIAFPDVDSYRQWTEKLTAIDGLTITVSDVLEKNATDRDRRDHIDIADWLIRWRQD
GUT_GENOME049445_0036013-310SSRYTCPKCGRSHCFTRYIDTWTGQYLADDVGMCDHKNSCGYHKPPREYFKEHGNGYHFEYRQSNPLNRSTESPKEKETDFIPMRIIEDFEKLDKKKNTLKEFLSTLVSPFDLQRGFSAYHVGTTKNGETVFPQIDMQGRCRTAKIMAYDENGHRIKDKMDRIDWLHARIMKKKRLKPSEWNLKQCLFGEHLLSLRPDDVVCLVESEKTAIICALVYPECLWLACGGKQNLKPEMCQALEGRNVLLFPDADAVADWEERSKLLSFCRKTKMIRWYKHEAEDSKRDIADAILEQHPKEA
GUT_GENOME110568_0011019-190SCLVAYLISTVGVKATDRLIPLYHIGACKTDGAAALWRLDADGKPLNGSAVLFDVGTGQVITAESITHRLMPPYEWWFTAPSFFGGHLVHDASTIAIVQDEITALLGAVAEPSFTWLAVGYGQNVTGDMLVQLAGHDVVMFADSLNAEAWHGLRLPNKRMAVSDAFVCADVN
GUT_GENOME276270_019764-344RYTLEKYRGMASRYECPACHRKGVFVRYIDTETGRYLSTDVGRCNREDHCGYHYTPKQYFKDHPEQSPRYRHQRERSNYSERSNRVTPRENAVLADTTHKEGIDTIPKKYLIHSMGYHSNFVRFLLSLFEPNVMDSPTLARLMSEYYLGCTGNGSVIYWQIDFQNRIRTGKIMQYDPDTGKRLKNESGAVDWVHAKLKHSGVLSEGWFLSQCLFGEHLLRKRPNDTVCLVEGEKSAVIGSGLMPEYIWLATGGKGNLKAELCACLRGRRVILFPDLGAFDVWKAKGEAIASQVGFTLCMFDTLERIASTEEIAAGWDVADYFIAQIQAKLSTEAQQPERLI
GUT_GENOME159493_026108-290GRLTGVCPQCRRHTFKYYIDTVTGAKAGDYLGRCNREVKCGYHRRPTGNRPLTAAEATASLFDRPADDALSTIPREVYERLPPVNTDDNLCHYLSRRYGIGMVQQACRYFGVSHARFAGGSSAFWLIDRMGYIRSAKVMVYDRATGHRVKSTNRMPVSYAHALMRLPDFNYRACYFGEAAACDRLHADAKLILVESEKTALVVNLELMRCGRSADYVCVATGGASMLRVDPAMMESQYYRGTFLRGRRLVVLPDADMVATWRLYADSLIPYVAELHFIDVREP
GUT_GENOME051052_0118834-196SSLTDYLDSVLGDSREMQNRIDNLVMRYRLGAVDKTKNEHFGTILPRINKKGMIVGGSVIYYNTDGGTIIEKDSLVAHLYSWYAFDYYVDPDVFFGEHQYRGGQVVIVQEEKTALLGSLAFPDIDWLAVGVGNNLTGGMMKKLIGRQVVLYPDDFSCDFWREH
GUT_GENOME017731_0179917-305GNRKKSICPNCGHRTFVRYIDVETGKPLGEDIGRCDQEINCGYHKKPRDYFSEHIHTSQRRYWFPKKEYRELSDTSIFNTIAPAKVEQTLMLGAFNNFSIWLREHFGFTEAMQQMIRYRIGTSNHWPNATIFWQIDQQQKAHTGKIMLYDYHTGHRVKDPYNHIAWVHKSENAMNFHLKQCLFGLHLLRPDTKIVAIVEAEKTAVVASIFFPDVLFLATGGLQNLNTERCAPLKGHRVILFPDLGAEDKWREKAAKIPALKGCIISTWLANHASAVEREKGLDLADYLE
GUT_GENOME199241_0080332-333PRIKVTCPACGRQKKLVRYVDTQTGRFLADHVGKCDRIFKCGYHYTPSDYFRDHPWLREASSWQSSTSLHQRAPQRVEPIRKPSFIDTGLMNETVKHCQSSTLFRFLSSLWGEDETMRLFRLYNVGASSRMNDAAVFWQVDIKGRIRTGKIMRYGNDGHRIKEDGLVNFMWAHRMKLMVDNPDDFNLVQCFFGEHLLSACPDSKVMIVESEKSAIIASHYYPHYLWLASGGCCGCLNQTASQVLKGREVWLVPDLDAEERWRAKLTMLRTITPTVGIVTAISNMATPEQRAAKLDIADFLLD
GUT_GENOME243598_0078323-197EISKSDIGLRHSSLTDYLDKVVGYDANMLREIDVLISRFHLGAVKKSINNHLGTVLPRIDQAGRIVGGNVMYFDSNTGVILKLEPITSHLYQWYCFDYFINREVFFGEHLLSGKPIAVVMEEKTALLGLLAEPSVDWLAVGVDYLLTDKMMNRLSGKRVILFPDGLGYNYWNEHF
GUT_GENOME100343_0195616-314CPACGRRTFKPYVNNLTGQPLAPDRAGRCNRQYKCGYHLSPREYFINHPMQLPPGWSPASASASATALPRPRQPRNGAPDYISPEVARRSMRRYDINPLWQFLCRHYPAEIVERAAAIGLGTARQWGGAAVFWQIDRMGRVRTGKVMGYDAASGRRVKEPRPQVTWAHTLLGLRGFNAGQCFFGEVALGRPGVRTVLLVESEKTALVLTCELLLRGKEHVAVLATGGADGALCDASRLSDPTYKYSALRGKRLVLLPDADMWQRWRALRLGAVTGSRRVVPPGWLSLAGSQDIADVLIA
GUT_GENOME237057_002536-319LERYRGRGSRYACPQCGRKYSFTRYVDTEKNAYVADNVGKCNRLDKCGYHYTPRQYFEDHPWLSERPKSLYNTPPHPRPQPRPQPRPQPRSEPSQDSANVAFGVMPRMLVERSLALDCGYKRWLRSVAGDEVAKRLISDYKIGGCINETTGLDAAVFWQEDINGDFRTGKIMNFDPTTGKRLKGEAGVVNWAHALLRKEGALPEGWELRQCLYGEALLARRPRDVVALAEGPKTAHIGSLLMPDMVWLATDSMMGLSHERLQPLAGRRVVLFPDEGRGFREWSERIVPIAHSVGFDYIVSDFMERIAPNTGGDI
GUT_GENOME153546_020166-332FHLQKYRPGSKTACPECGRKSCFTRYIDEAGEISFPDNVGICDHINSCGYHYTPKEYFRDNPAVKERLNEQEKNFRTPTAARPAAIPVPEQKPQISFLPSDWVEQSMRRYDINPLHRYFIRVMGKKDTDRLFRLYRVGTSRMWNGAAVFWQTDRDGNVRAGKIMGYDAETGHRIKEPFNQVSWVHSVRKVPDFRMKQCLFGEHLLSDTSAAMSAKPVAIVESEKTALVAALFIPDFIWLATGGMHGCFNSETMQVLGGREVVLFPDLKATEKWRQRLPMLESFCRRATCSDMLERIATGAQREAGLDIADFLLMEDTPQMILQQMTA
GUT_GENOME237421_0128795-349TITPNKNIMDRIRINNTVFLDPIVVEKSLSTANSFCCSLVTNNIFTWEQMLHASETYRLGATNSGGTIFWQINREEQICDGKVMHYGSDCHRDKDKSPIWVSWIMKHRQHKLSLDARTEKCLFGLHLLSVDADADDVEIAIVESEKTAVICSELFTLKSMKEKYSISNIDIKPNILWMATGGISNISTKLLMPLKGYKVTLFPDTDTDGSAYRHWTDMAKKISREMDQKINVSNILEKYASAEQKENKIDIADWI
GUT_GENOME067546_0050548-354CLDCGRKSFNLYVDSLGEVAFPEDIGWCDHSGCNSQHDHSPARYFQEHPEKRPKGYEDNRTFTIQSLPLKPIEPKPLIFMPQELMMHSLKGCESNALFNYLCMKVGEEKVRNAFQQYKVDTACLWNGSTIFWQIDIDGNVRTGKIIKYGPNGHRVKTSERSLINWTHSLWREKPENFKLAQCFFGEHLLPRYPDAHIMLVESEKTALVGFMYFPDFLWLASGGNNGCLNLEASKVLSGRKVILVPDLNMEKNWNDKAEMLTKIGVDVSVFDMGQLNPSELDRQAGLDIADFILRARPSDKESMYREF
GUT_GENOME116058_0158425-274IISPKESLPGAAQAKKRHNTMKNNHKPIARFIHRDRLRNPRLGDTHYYERTTLFTFLSRYFATDAICRVFSRYGLLADLEYEQDGLYGIAFPFQDGKTRQLLSVQYDPATGKQTGEQPIGQRLMKGYDLQPAPALFGMHLLQEYPDDPVGILRDSRDALIMTLADNKRLWLATGNSNRNGYSLTDPAIMETLRGRSVTLYPASGLYDASLPIAHRMQEQDIDTTVSNAAEQLTTDMPSDARLTIADFVLR
GUT_GENOME281273_0177264-388CPNCGKPLCFVKYVDAEGEIIFPDNVGKCDHENSCGYHYTPKEYFKHHSVRLQSLSSRHCSCKHGTALDLAVYFKDNPDVLTRDQSVGESFQAALYKFIEKKPVLTVPSYIPLSYVDRSLSHYEINPLYIYLCNVFGEEETVRLFQLYRVGTSAKWGGSAVFWQIDINGSVRTGKIMCYNPENGHLIKEPQAFVSWAHSELHLPDFHLKQCLFGEHLLKGASSSPVMLVESEKTAIIMSHFISDYIWLATGGKNGCFNREAILALQGRDVTLIPDLSASEQWKAKSSLLLGICKKVSVSDILERIATKEQRNQGLDIADFFLFSP
GUT_GENOME128934_0078815-318SRWTCPQCGRKHCFAPYVDKDKRPAGEEYGRCDHESSCGYVKYPPSETDWRESYAEYRNRTTRKAKPIQAVPKICASQSVEPVDTLPRDIVLKTVRTKPVSDFLLFLSTLFETDTILHLVWEYLLGVTRDGDVIFYQVDIKGRIRGGKIMKYNRETGHRIKDPAAKNPINWVHIPMLRKGLLPEGWTMTQCLFGEHLLAQYPDKVVCLVEAEKTAVICAAMMPEFIWLATGGKSGFNDRVEVLDGRKVIAFPDVDAYDQWCEKAAERPYLDITVSDYLQQNATEEELSSGADIADVLIRWQKEN
GUT_GENOME072686_0427416-319KNRYTCPCCGHKREFTRYIDLMGTFIFPEYVGKCNRANNCAYHYPPAIFFRDNPEALKDLMNTGNGSGNHPVPLVPRLMQPIQASCIEKRIMQISCANKWYNQNNLFLFLRNKIGKETITSLFREYNVGTSKKWQGAATVFWQVTLNGEIRTAKIMLYNKTNGHRVKEGYSRIVWAHTELGYTEFHLKQCFFGEHLLNLYPNKTVGIVESEKSAIVSAAYMQDFVWIACGGKNGCLNSRYHILRERRICLFPDSKAYDTWYLFYMKLKAENYSISISNLLERKTSDEQWNAGIDIADILLQQSL
GUT_GENOME112321_0261314-314MASRFTCPACGAAHRFVRYIDTLTGHFLNEKVGRCERTESCGYHYTPRQYFDDLRRLGITPPPVSAVPSPPEDKPVTFSTIPSGLLTGSALASQPEQTALLRFLTGRLHLSADQLRQLIATYRLQSATYGRIVFPQIDAAGNCRTAKIMAYNPATGHRIKERPGSFNWLHRILISQGKLSRDFRLRQCLFGEHLLPAHPLKPVGIVESEKTALICSLTYPKLLWLATGSKYNFTPDRFLPLSGRIINAIPDAGSYRFWTLKAKAVIRLVRCQFHVSDCIERAATLDEHQRGIDIADLILDN
GUT_GENOME028442_0015916-314SRHTCPACGGKRCFTLYVDESGEPLSEIVGRCDHESSCGYHYKPKDYFRDHPEAGGEDWREVSPELRRHLEKRLVRCKPLCVIPFEMVTRSVRPAIASSFTRFLLKLFDAEMVAGLVRDYHLGVTKGGDVIFFQIDVHGLCRTGKIMKYDAETGHRIKDDDVKGKIGWVHSVMKYRKALHANWELSQCLFGEHLLARFPDKVVALVESEKTAVICAALMPDYVWLATGGKSQFNERLRVLSGRQVVAFPDVDGYDEWVKKAAGVPYLDIKVSTILRDNATAEERDAHIDIADWLIRWKL
GUT_GENOME011447_0151947-303KFICPQCGKRTFVRYWDNLLGTYLADERAGKCDRSIKCAYHLPPRALGYRNGNLPSDRLPHLQSNPNPQAPHTLEMDTAPFFRHHRENDFYRFLLTLPVDRERLEEVCRLYDLGSTRSGGVIYWQKDLDGKVRTGKIMHYDPLSGKRDKSRPPQWVHRKVPTPASWTLRQCLFGLQLLKEDKERKVAIVESEKTAILFSLFDRSRLWTACGGKMNLREEMIADIARNEIELYPDLDAVAFWQSRLPRLQRLTGKILR
GUT_GENOME109296_0190613-325RYTCPHCGRKRCFTLYVDEAGDPLHESVGRCDHESSCGYHYTPKQYFTDYPDLKPGKDWRDGHPIFVTQPKPKPLCFIPDHVVNRSVRFNHDSDPITFLKTILDPMVVEGLIEQYHIGVTRGRDTIFFQIDKLGRCRTGKVMKYDPDTGHRIKDENTPGRITWVHSLMKQSGQLPRDWELTQCLFGEHLLLENPGMPVALVESEKTAVICAGLMPRYLWLATGGKSQLSRNRLSVLAGRKVIAFPDIDAFADWSRKLAALAEPDPSYPARPGISITVSPILQQNATAEDLENHIDIADWLIRYRHSRPDRESP
GUT_GENOME149442_0156895-359HFHVTDKIPVKTERVQQEPLKCPSKPYYLNLIKVLGERERDSSDFCQFLHTIFSDEQVKEVVSRYHLGVTWDGHVIFWEIDRRGFIRAGKIMRYDTATGHRLRNGEKDLPDWVHSRLRLSWQRSHAEDIAPIFCQIGRKNFIQPDKWKLSQVLFGEHLLHSAPKDIPVGIVESEKTAVIMSLVKPSMVWLSCGGYSSLRTCLCGSHDILLDRRVVLFADKSKGIDFCAGWKKVVSEFGDLCLSVTDMMEHISLPAGSDIADIFIA
GUT_GENOME183349_0169445-244NNNLNYYLLDLFDWWTVKYLQDIFFIGSFSDGAITYPYIDDNGKLRSLKRQQYDRETGKRSKTENSYLLHKALSDNDNFNYKGCLFGMHQIRAKYSKGKTVCIVEGEKSAVIASGQYPQYIWMATGARDWFNEHSLEPIKDRNIILFPDTSKDGKTLEKWQQIADKYKAKGYNISVSTMLEDKCTQEQKEQGYDIGDLLH
GUT_GENOME155832_0308016-314RYSLQKYTGRNSRHTCPACGRPHCFTLYVDAMGNPLAGDVGRCEHVNSCGYEKTPKMYFEENPQERGRHNTVPSYTQPKKEAQPDYIPLSLIRKSEGTASNLVGYLAKYFKAGDLKTAVAQYHLGCTKKGETIFPQIDRFGMCRTGKVMQYGADGHRVKGNFDAVDWLHARYMKKHGKAAHEFHLKQCLFGEHLLPKRPDDIVCITESEKAAVIASMVFPEQVWVSCGGKHGLNPERCKPLAGRRVLVFPDADAVEEWSEKIKALGFCRSVRLSDWAKDEPEGSKRDIADLILEEKARS
GUT_GENOME056009_0096612-282VCPGCGRRDFKPYIDSATGLPVDEAACGRCNREVKCGYHLSPREYFSATGRSLELLKRDAMMANRRRAPTQAVDFVAPELARRTMRFYTINPLHRFMLTKFEAGIVDEAFSIYSVGTAARFGGSTIFWYRDLNGRYRSGKVMAYDPLTGHRRKDIQGFNWVHALLRMRDVEFHYAGCLFGTHLVRGYRRVMVVESEKTALALWMAMREAGVHGILPVATGGKSNIKPDLDRPHSRWKPFKGKELILAPDADSFASWSAAGLESVGSGVFYI
GUT_GENOME163648_0229218-337SNRFVCPQCGRKKCFTRYVDADTGEYLDENCGKCDHTASCGYHYPPREFFRDHPDRMHGKAYQPEYINGKPLVGMGRRWQEDKKWQRDASKCQTAERPMASKPPQTEFFPLSWAEEGTDRSSTFRNWFERLPFVPELIHQVLLEYFVGGTSYDIVVRGINYGKAVIFWQIDEQLQVHDAKLMAYQKDGHRVSGWGNSMRSICEKARVGPQLSETDKVLFGLHLLPYYPQKTVCIVESEKTALVCACQYPQHLWMATGGCGNLQASKLQPLKGRKLMVFPDSGEYGKWACCMKESGIDNYQIVDFMEQYETNTDIADVILG
GUT_GENOME088730_0158713-330KQSLLVCPNCGRREFSPYIDTETGEILDETCGRCNRESNCSYHLTPSQYFQQHPEARPKDEDWRTAPDWLSKRQTTQKPILSKPRPSGPICELPRETVERTIRMEPPSNLVKFLDTILDPLITEGVVFLYGLGVTKAGETIFYQKDRQGRYRGGKVIQYDPQTGHRVKHADFPVYWIHQSFQRKGLIPQDWKMTQCIFGEHFLDQYPDQIVCLVEAEKTAVICAGFMPEYTGLATGGKTQLNERLDVLRGREVIAFPDIDATDAWTEYFNSRTDLNVTVSMLLEENATEQDREDQIDIADLLIRWYGKHPEAVPPVPS
GUT_GENOME134550_0297921-336VCPKCGKKRYTRFVDNQTGQYLPYEFGRCERINSCGYMESPKNNINQINNNKMTKEIEQKEVVSEIASERIESKSSMLSSVNDFMNGALLFFLYLIFGEKAVEVFNLYEVMVAKHYYKDGKFGTAFLQMDREGKIRQVKVMAYNPKTGKRLKGNDEFLIYNRRTHHYEKNKPETPASMYIGKMLMYDKEFINKQCLFGVHLLNKFPDKPVALVESEKTALICAIQMPEYVWLATGGQFGCKWTSPEVYNDLRSREVILFPDLKATEDWKIRAEDLAMDGINISVYEGLEEQATAEERDRGLDIADYILMAEKEVKP
GUT_GENOME285291_0019785-348VAWLGKKYGIEVEGGERFQPKPSLPRKPAVQLPMLTLDLNMVKARLDTSGDRLCDWIRSLPWNDEQRARIERVLRSYGVGHARKGRGYTIFWQIDDMGNVRTGKFMLYRPDGHRDKDTPYNFTWAHSVLEKAGKIDLSKADMHTTLFGMHLLNFHPTAAVNIVESEKTALLASIMWGHPEKWIWMASGGLSMLNASKLKPVIDQGRQIYIYPDKDGVTAWKQQAMVIGYKGLHVDTRYLDGYWRDKDGQKADIGDIIVSSLSEP
GUT_GENOME213254_0066160-388CPACHDKHSFTWYLNGDTGEVIHPSVGRCNHESGCGYHYTPKQYYQDNPQLSEFSNTIKKGEVTARVKQNHPKFTQRVEEKEPGRIPKQYIIQSLGYNSNFVAFLCSIFDRYTLESPTIRRVMADYYLGYTKNGSVIYWQIDERNRIRTGKVMQYDPVTGKRVKNANGAIDWVHAKLKRDKVLPDDFNLVQCLFGEHLLKRYPDKVVALVESEKSALIAAGVYPEYLWVATGGRSQLSIDKLRVLMGRTVIMFPDTDTDGKTYSLWADKAKELATIGCAVTISNLLETVATDEDRINGLDIADYLIRQLKASIPEPNNIPATESAKAPI
GUT_GENOME264035_0275415-324QRNRYTCPSCGKHREFTRYIDTQGAISFPGYVGKCNRINNCGYHYTPAMYFDEHPECKEYNKSFPVVKDKKPVVCHPKLPPVHRHTDTSFIPDEIMQQTMKCYEQNNLFLYLANHLGNESALRLMKTYHVGTARKWKNATVFWQTDISGKIRTGKIMQYNAETGKRIKEPYAHVSWVHTELDIPEFHLQQCYFGEHLLYNSRKPVAIVESEKTAIIASFYIPEYIWLATGGKNGCFNEQNFDVLKGRNVVLFPDIGMTQEWKEKCMQMRKRNIRTEISEYLEENANDVERVNGYDIADYLIKVKSGEAVL
GUT_GENOME022480_0159516-298VCPACHRRTFKPYIDLQSGQPLDAKSCGRCNREVNCRYHLTPAEYLANTPNRPTLFLSAAAPAAIDPPSFLSERPPLQDGTELIRDTLFRWMFFLFGNDPGVVRVWKAYRAYHDRGLGGSAGFPLIDRFGNFRSTKLMRYDPNGHRCKNALNDAKNVAWKHTGLPGYCFRACFFGEHLVRAFPKATVNIVESEKTALLCAVEYGYGDRQIWVASGGCSGLRGSAEDLRDPHFRLSFLRGRNVRLIPDNDSVERWKECTGELRKFCRSVTVEPVAGLTGSEDLG
GUT_GENOME286021_0017525-313CPQCHRKTFVPYLDGNGNIIDKTVGKCDRADKCTYHLPPRVFFKERGDDIVELVKRKRPTYQPTPAPAPTFFNDKDFITTHGDIFKRSVEATQNHANNLITFLNGVFGVELVSKMVADYYIGTSKHWENSTVFWQIDRFGRVHGGKIMQYNPNNGKRVKKPSNRITWVHSAMKLSGFNLSQCLFGEHMLKRHPNMAVAIVESEKTATIASGIFADCITLACGGCGNLTAKICQPLRGRDVVLFPDNGKFTEWRDKGQQMRNVFASLRIADIMEREATSEGDDIGDLILE
GUT_GENOME096499_0133021-344CPRCNHARTFTFYIDPETGKPIHRTVGRCDREIKCGYHYTPREYYRDNLPLCGDLFDGNQSKGNGEQISGNGKQISRNGKFSDKTVVTGDLFGVNSNQSGMNNNQSETIDYIKILDEKIDCISHRYVICSKSSDSNFVSFLYDHFPGERIEEAIDRYALGATRNRRVIFWQIDNDDNVRTGKIMQYNPLTGKRVRNERGGINWVHNILKKRSYVFQNYNLCQCYFGEHLLKLYPNKPVAIVEAEKTAVIGSIHTPDFIWLAAGNLNGLNVPKSRVLRDRNVMLFPDAGCYEKWAEKIKRISREVNCTMEVSDVVEKHATEEQLI
GUT_GENOME273590_02206331-632NRYACPQCGRKRCFARYINEQGHITFPDNVGRCDHEQCCGYHYSPSDYFKDYPDANSNDDWRYKTPIKECRKKKPLPTFIDSKLVEQTLHGYSVNPLYRYISTVFGKEETERLFALYKVGTSKKWGGSTIFWQIDVNGNVRTGKIMKYDDKTGHRIKNPHSLMTWVHPELKLPDFTLRQCFFGEHLLTDKTTTKTVAIVESKKTAIIATHFMSDFVWLATGGMNGCFNKDAVEVLSGREVVLVPDLGATDKWKSKLPLLQSICKQVLVSNILEDNATNEQKTNGLDIADFLLMTETPQMALQ
GUT_GENOME026268_0128311-316AGPASRLTCPACGRKHCFSPYVDSNNQIIGEKYGRCNHESSCGYVKYPPSEREWRQSWSEYQSRRKPPQKRVITCPQPKPKPEGSICTIPMDIVLKTVRTNPLSDFLYFLCKIFDVETIMRLIEEYAIGVTRSGYTVFYQIDMQGRCRTGKVMKYDRETGHRIKDSNVPGAITWVHTLLKKQGVLPQEWTLTQCLFGCHLLKKYPDKPVCLVEAEKTAIICAGMMPQFVWVATGGKTQLGDKVEVLAGRKIVAFPDVDGYDAWIEKIRERPYLNIQVFDYLVHKATDEDRAMGADIADIIIRWALN
GUT_GENOME000224_0115014-335GKAVCSYCGKKSFVNYIYSDGAPVADGVCGKCDRADNCGVHYPPREYFKDNPTFKPARFSQFRPQRKRPVITPPSYIDSSLMLDTMRGYEMNSFAKFLHSTFDEVAGADIVQANIERYAVGTSSRYGGSAIFWQVDQFGRIRSGQIIGYDATSGKRNHKQQNWVHSVMQENYPDYKLEQCYFGSHLINSADKVVAEIHQEWDAIPNMQKCEVEPIIYLFESPKAAVIMSIALMWGGCRMTEVPMATCSCGNLNPSLDSRKNPYNKIQVLKNRKVVLFPDNGKFEDWKAKGEQLKGFCKEVWISTAMERNLHPHAIDCEIEDG
GUT_GENOME156757_0074475-389LEKYSGRSSRHECPLCGDPHSFAYYLDGNTGEPIAKTVGRCNHESSCGYHYTPKQFFIDNPVEKELFVAPVRQKPIQKPQQEVGYIPFQYVERSASYNSSFIRFLCGLFDRYTLESPTIERLMNDYALGATKDGCIIYWQIDINGKVRTGKIMKYNQETGHRVKDAGGINWVHSVLKKKKLLPDNFNLVQCLFGEHLLKMYPEKPVALVESEKSALIASGVYPEYIWLATGGKSQLSIEKMKVLHGRTVLAFPDVDGFDYWKDKAKELESIELNIQVSDILEKNATDQDRTNKIDLADWLIRDLMTDCRPAPNTL
GUT_GENOME157435_0089316-322TCPACERKRCFTFYVNEDGESLNPAVGRCDHESACGYHYTPAQYFADHPEASDNWRQNFMSASIKIPKRKPAPKPLCYIPIDLVTRSVNPNYHSDFTRFLVSILGASTATRLITEYRLGVTKARDVIFFQIDIHGRCRTGKVMKYDAVTGHRIKDENIGGRVNWIHSIMKRKGGLPQNWELTQCLFGEHLLPNCPGRTIALVESEKTAVICAALMPEYVWLATGGKSQLGDKLNVLHQRTIVAFPDVDGYELWKQKAGELSSIRITVSDYLESTATPEEREAHIDIADRLIAQLRDGTLVPPADCPT
GUT_GENOME231073_0214123-308KLTCPQCGKDKCFTPYVDVTTGQIVGEQFGVCDHKNRCGYFKYPTGNELKDNDLFVDSNKVLRRYRPPVDPDIANCIPVSKMFDTLNPFETSDLQDYLSNIFGSYHTNKAFNLYKVGMMRFGDWGKCCVFWQLDKNWTIRTGKIMDYGPDGKRVKIPMDHVCWVHVMNGQDYLLRQCLFGEFLVNFYPKDAPVYIVESEKTAVICNIVYPDRLFMACGGIHMLKREMIETLGRRRIVLYPDKGSAFNEWKKKVDRDMRGMNIEISDFLESKPNINEGMDIADYFII
GUT_GENOME153310_0056813-332SDRLTCPSCGHRREFSPYIDTETGEILHPSVGRCNREKSCGYHYKPSEYFKDHPEREGLHGILRTSSSLFERRPEPLADYLPLSLIGGEDTRRKDNNLYRFFVSRFGSSVSDRIFDLYRVRTSKHFRNAGGLSAAFPQIDRRGNLRQVKIMAYNPETGKRLHKQDTAEKWSEKRRGYFTDTEQDKIYFAGKYLSGQESPNLQQCFFGEHLIRPGSLVGIVESEKTAMIAAVYQPGMCWIATGGKNGVRLTETGITSALSGIKAATLFPDLGCFEEWAKKKEIIASCGITCTVSDLLERRATREESEAGLDIADFLLRETP
GUT_GENOME147942_0015025-324KLTCPACGKSRCLTPYIDVATGQVVGNEFGRCDHERTCGYDKRPTGKDVGDKDLWISGNKCIRAYRPPVNPDVVNYIPFSEFERTVVPDDRNTVFRFLSSLWGKERVSDVFRRYHVGTMDLWGWKGCCIFWQIDKDFVCRTGKIMDFYIKTDSQGNEIDVKRVKEKDGDNERPHVMFYHSLHARDFLFRQCLFGEHLLSQYPDKVVNLVESEKTAIICAVNKPDELFVATGGLQNLRPEVIDVLKDRKTVAFPDKGQAFDTWSKKIDGMMMKSRIKVSDYLQSVENVGDGDDVADLIINN
GUT_GENOME239207_020364-265YRFTLQKYKRGSKISCPQCGKKQCFVRYVDTKGEVSFPDYVGRCDHEQSCKYHYTPSDYFKDNPTLIEKGSNYSFEHSKPQSRSLPPISFIEKELMERTLTNYTMNPLYIYLADILGKDETKRLFYLYRIGTSKKWGGSTVYWQIDRQGNVRTGKIMLYDSTTGHRTKEPRSYVSWVHTELNLADYNLKQCLFGEHLLSGNPAKPIAIVESEKSALIAIHYMPEFIWLATGGMHGCFKADAVGVLKGRMVMLCPDYEQKLFT
GUT_GENOME129163_0054539-345LQPYKGRTTRYECPSCHRSRCFTRYIDTEGKISFPDTVGRCDHENNCGYHYTPKQYFHDNPEAKRLLFDNGQNAKVPIVPHIEKPKADPYFFDPYLMTRTERDYSHNHLALFLAKLFGMNRIADQMKLFHAGTTKSGAVAYWQVDIEGRIRDCKVMVYDAETGHRSKDPQQHVNWLHSLMKIDKACIQQCFFGEHLISMVENKDKPIAIVESEKTAIIASMFMPQFIWIATGGKDGMFSRADYNVLKGHKVILFPDLGMLDNWHQKSIALIRHGIDANVYDYLERNATDEDKVAGLDIADFLLRQMP
GUT_GENOME180609_00709196-432PSSIPPDEVLRTLGGYEGNRLAVWLRDTFAPVLPPSEVERVLLDYAVGTVQAWGGSPVYWQIDAAGLVRTGKAMAYGTDGKRVKEPRPLMQWAHSRHEREAEARGEEFIRQQCWYGAHRLQEPGQTAWLCEGEKGAIITALALLACDEGLYRQIVPVACGGFNPTPDRLGDPWHGLQAAKGRSVAIFPDSGKQGDWAAKAEALNGYAADVRVSRWCEPGAVPYEVHEGNGFDDVILR
GUT_GENOME108285_0147675-325YTIATFYEMVKGAGVDIKRNGTRHTFKRQPKITSKPRKNEQACYLPFSLVERSRSNANGLFGYLAKFFSSGELEIIADKYLIGSTKDGKTIYWQVDGEGHCHTGKIMAYDSNGHRIKSECGDRVDWVHSRLQRSGRLPKDWQLSQCLFGEHLLRLRPSDPVCVVEGEKTAVIMAAFFPKQVWVATGGKGLFSPARCSATLAGREVYVFADVDATEEWTKVASALRKTCKSVDVSDWYQLEGVQPKWDIADW
GUT_GENOME153913_0185118-312CPQCGHRTLKLYEDRATGWALDAAVGRCNREVKCKYHVTPAQYFAAGGHAPAVPAGWVAPPLPPPDEFVTIGQRRSPLDTLRRNALFRYMASMFGEDLVKRVWDEYQVMNSGWRGGAVGFSYVDHLGRCRSIKLMRYLPDGHRCKLGGMGFNVTWAHSLALPGRANFRFRACLFGAHLLLGGHHGATVYIVESEKSALMLACYLSKQFFEGAVVCLASGGASGIATASEPISDRFSRCFPLAGRRVVLLPDADMVERWTAYAGALTDHVRSIAVADVRRAPFFLTGSDDIADFIE
GUT_GENOME098685_0251013-327RSSDRHICPSCGHKGEFSFYIDRETGQPLHPSVGKCNREQKCGYHYRPSEFFKDNPDARPKDDWKPDYVMPMPQQEAKITYLPSSFLIVDNQRSRNNLFRFIAGKFGIDRATSVFDAYNVGTSKHWRNNDGVATSFPQIDHKGRLCQIKVMAYNPTTGKRLKKQDYAEYWNFARKEYVGDNRPQDKIWFAGKTLLNNYDAHLRQTFFGCHLIPNASRIGVVESEKTSLICSILMPEITWIATGGCQGCKWTEPAVFQPLIGKRVVLYPDSGMLAKWEDKASILRNGGVNVSVSRICEGLADNTDVADVLLEKKMP
GUT_GENOME226379_0262822-271RRRGKRWTLPARINLESHSRKDKLVFYMNKSGSITVTEQGGDSVNLFDFLVSYLPGCSSASDAFRILSSPDGCRMSLKDFYEREYDSGRQESRFVDMKYVDRLSDAGHWKGNNLYEYLSGIFGVDSVNDVFSRYKVGCLGRESAVFWYSDKDGNVCHDNRIRYGVNGHRKKETHAFRKFTTGEGFTYRGYFKPFLGEYCSDAITCMVESEKTALIASMTFGNGFVWIACGGMNQLGNKLPKNVILFPDFD
GUT_GENOME028668_020028-324LEKYDRSRRNRYTCPHCKRPREFTRYVDTLGLIKFPEYVGRCNRTNHCGYHYPPSNYFRDNPDMLKLLFDNDGASLPTQSIVRTANITDSEQKPSYIDPDIMLKSCSVTWYPYNNLYGYLGLILGWNVTMRVFLEYHVGTSRKWNGSSTVFWQVDIEGNVRSGKIMLYDRTSGHRVKDGFSRISWAHTELNIPDFHLSQCFFGEHLLRIYPDRTVAIVESEKTAIVASAYMGDLLWLASGGKHGCLQARLPILKNRRIILFPDSKAFNEWNLLCIRMREQDFDISISSLLEERATDIQWNDGIDIADILLMKTLPEA
GUT_GENOME278116_0011025-231YVVSSWSMENDFFRFLREKCRVSTAELERLLCRYLIGSTRDGGIIYWQLDFNGNVRTGKVMYYDANSGHRIKTGMAVDWIHSRLKRKGLLPEDFAITQCLFGEHLLHSDDIGKAVALVESEKTALLGSLVFPDYVWVAVGGKANFKPERMVALCNRTVIIFPDVDAYDEWKEKSRRFFLPKRVIVCDLLERQSSATERGAKIDTPVR
GUT_GENOME047929_013519-330SRLSCPSCGRRHCFAPYVDDNDNIYGEEYGRCDYESSCGYVKYPPSDFKEEWKPEFNGRKRVEAHFNRRYARSSARPQLEVSLHEDICTIPMEIVTKTVRLSPASDFLAFLSTLFDKDTVRNVVSEYFLGVTKSRDAIFYQIDIEGRCRTGKVMEYDRLTGHRIKSAEAKTHITWVHSLLKQRSVLPASWELTQCLFGEHLLKKHPDRQVILVEAEKTAVIGYACLPQFVWVAVGGKSQLGDKVEVLHGQTVIAFPDVDSYGNWTEKVRERPHLNIQVSDYVNRYAEANGLDSGADVADVLIHWLRNGGSPMKLQQQPSPAE
GUT_GENOME149102_0227412-325RRGANFIHICPNCGKREFKRYIDNTTMEYIAEDVGKCNRLIKCGFHKPPKVHFQEHPEEKTKRNELYRKKRQPKSPININEKDYDIIDEKYVKISMWNKRYINTFTIWLFKLIGNNPHYGIGVIKDVVQKYDLGGSHRINGAVVFWQRDYNNQIRTGKIMLYNQRTGKRIKDENRPNMINWVHYYLKKEHRLKADFKLKQCFYGEHLLKKYPNAIVAVFESEKTAIVASIIFPDLVCIASGGLGGLNLQKCKVLAHRHVIFFPDLGCYQQWKAKVEDISKHIFFARYSINDVLENNATEEERAGGLDLCDYIIN
GUT_GENOME207127_024257-334LRKYAGKSSRLTCPNCNRPWCFTPYVDDEDQILDPTVGRCDHESSCAYHKTPADWFREHPEARPREEDWRQAPDWLKREQSRGVPGTRHAIRPDEAGTRSQTSICELPAEIVAKTLRKVPKSALQEFLETIFTADVIERLRTDYRLGVTKDRSVVFYQIDIQGSIRTGKIMQYNPADGHRVKNVGVPVDWAHARLKKSGVLPESWNVSQCLFGEHLLPQRPDAIVCLVESEKTALIGSGFCPQYIWVATGGKTQLGPKLSVLQGRKVLVFPDIDATKEWREKLSSIPGLNFTFSTILEDEATAEDRAAQIDIADWLLRFYCHPERSEG
GUT_GENOME117057_0197819-304CPWCGQRKLTYYVDDTGEYTQRFHDANVGRCDRENNCEYHYTPAEFFKNHPADLTRRCTPIRKVAPPKQPPSYISSDIMTATLSGYEHDNLFRFLARQFGQDEAQRLYTLYNVGHSDQWPGATVFWQIDSAGRVRAGKVMHYNEDGHRIKDPQRAHVSWVHSLLKLPDFHLCQCFFGEHLLLTNPKAKVVIVESEKTAIICAHFFPSFVWIATGGKAGCFNVEASQVLRGRDVMLMPDLGAEDEWNKKAEMLRPICKSVKVLTTLTDRATDEQRARGLDIADFLLE
GUT_GENOME226469_0180726-345GNGRKYHTCPHCGAEKFTLYIDVDTGLPLADYVGRCERINSCRYHYPPREYFKNLTIKNKKIMTEKQALKETTSVVKEERVNRRYSKIVDYVKNALFCFLVTIFGKQKVLAAFQKYDVRISMLFRKDNKYGAAFIMKAKDGLIRQVKEMAYDPNTGKRVKEEQSVFKYDYKTNSYQTDNDGLRILYAGKSLMKDYDFENKLCFFGEHLLAGEVDKPIAIVESEKTAIICDICMPEYIWLATGGKNGCKWTTDEVYQVLKDANQPIILFPDLNATEDWTEKAEILLAAGLDVSIYDLEEQEGITEEDKAKGLDIGDYLIRF
GUT_GENOME239000_00859102-348YLKDAREGEMLIAKARNELVEKPPAYVDISGQYVDNSLDKHQESDFILFLKGLMNDVDKVEKAITAYRIGVTKLHHTIFWYIDVEGKVCYGKIMAYKPDGHRDHEVVPQSIPKMLEMQGVLSQNYQIKQVLFGEHLLNEPRYKDSLVGIVESEKTAVVCSMCVDNVLWLATGSLYNLQEERLQSVKERHVVLYPDTDKKSNPFTRWRRAAESLNAKGWHIQVSEYLEKVTTQEQKQHKVDLADLLIA
GUT_GENOME045831_0144210-187SSFFEYLCGFVWFDQERLETLMKRYPIGATEQGEPIFWHINSEHKITNGRIITMDRETGKVYDVSSYYQDGRPTCLFGEHLLNSFPTQTVALVTDEMTAAIMSSFPTPYVWLATGKEKATPTDLLPLVGKSVVVFPDKGDYSKWQETLQAVPNLQFYISDVMEKVQGDCHTIAQMVLS
GUT_GENOME218838_00445112-337RPLQPPQLPRVYVCPDDIAKAASKAPLSALFNFLCRVFRPDEVSRIFNLYRVGATREFGCNPGMMGTAFPYIDQSGRCVDVKLMAYDTNGHRRKNGYSANWVLAKKKLNDYRATWPLFGEHLLSLNPSAPVAVVESEKTALIASTALPGYVWVATGSKQNLNAERCRALKGRAVYLFPDTDGVEEWQRRGVEMAKAGFRVHNCADVVTENAKNPGDDIADIILQTV
GUT_GENOME180609_0222253-362RYSLDRRTAGHSPRKYRCPQCGQRTFTPYIDRETGQPLDAAVCGKCDRLLNCCYHLPPREYFKLHPDAASRRPLGDLYAAEPPRHSCVSRTLLSSTLGGYEGNTLAQWLHCVFDTYIGAEGVDRVLRDYLVGTTPLFGGSPVFWQVTPAGEVRTGKVIGYRSNGRRVRCPKPLMHWMHQGLKDFRLRQCWFGSHRVQHPSQMLLVMESEKGALMTAMALLTLGEECYRSAIPLATGGCGGLAPTLQRLTDPDDAHAVLAGHPVVLLPDEGKYREWLEKSQPLRRVCPSVIVSDMMEKPGELAYEPNPGDG
GUT_GENOME088730_0079618-312RFRCPNCGRPHCFTPYVDENDVPVDVERYGRCDHEQSCGYNLYPPYEPNRQGGKVSALPKSGKRPVRRRKEPRQGLCLLPMEIVRKTLLFTPKNGFIAFLSDVFGEETAKSLVEKYRIGTTKDGYAVFYQFDIRDRCRAGKIIPYDPRTGHRIKDGSVPAAMWVHSRLKALHQLPEDWTLSQCLFGEHLLPLCPELPIALVEAEKTAVICSAVFPEFLWLATGGLGQFNDRVKVLLGRQVIAFPDLGACDRWRKKAKDYPLLDITVSDYLEKNATPQQREMGADLADLVVEERLL