UHGP-MC 33168


Information


Number of sequences (UHGP-50):
97
Average sequence length:
310±26 aa
Average transmembrane regions:
0.04
Low complexity (%):
5.11
Coiled coils (%):
0
Disordered domains (%):
4.16

Pfam dominant architecture:
PF02733
Pfam % dominant architecture:
7629
Pfam overlap:
0.93
Pfam overlap type:
equivalent

Downloads

Seeds:
MC33168.fasta
Seeds (0.60 cdhit):
MC33168_cdhit.fasta
MSA:
MC33168_msa.fasta
HMM model:
MC33168.hmm

Sequences list (filtered 60 P.I.)

Protein Range AA
GUT_GENOME104679_0133099-343PTADAIYEAAKAVDTGAGVLFMVKNYTGDRGQFGMACEMCEMDDINAKIVIADDDVAVKDSTFTAGRRGVAGAILGYKIVCAAAEEGKSLDELVELADRVNANVRSMGFGLTSCTTPAKGAPTFEIGDDEMEIGVGLHGEPGRQRTKLMPADEIVGLMTQNCLDDLPFKAGDEVAVMVNGLGGTPLMEQYIAYNKVAQMLKDAGIKVFKSYVNEYCTSIDMAGCSVSLLRLDDEMKKYLDYPVEI
GUT_GENOME122014_0012444-369QNAVDEMVDGFLELYPNEYKRIFIDRGNCNGIMRQKCRNPVSVVVGGGSGNEPWCIGYVGEGLADAAVLGPIYTAPSCRAVQAVTRNVPNKNGVVYICTNHAGDVLNFELASELAELEGITTRTVVVTDDVASAPREEKSQRRGTAGVLLVVKAAGGAASLGMNLEDVARIAAKANDNTYTFCACTSSAYDPGTGKPLLELKEGMVEYGVGFSGESGIFRKPFIDANETADTVFQYLLKESQPLPMDEVVVLVNGFGHTGHMELSLIAKRILQNLRNSEIKVHHILAEKVYTPQDSSGFSVSLMSVDEELKRCYDYPARSPWIRGF
GUT_GENOME189556_005853-324QIINQPENVVEDMLSVLPDIYPKLKYKKETGIVYRTNLTKDCVPLLSGGGSGHEPAHFGYVGQGMLSCAVEGPIFVPPSAEHILEGIKTVYNAKTGLLVIIKNFEEDVTNFGSAIRQAREMGMNVEYVISHDDISIESSSFTRRQRGVAGTVLMHKILGAAAQAKVDLDGLVYMGNQISECIATLGVASRAATLPGKTEPMFELDDHDISYGIGIHGEQGYRTIGFESSEKLAVELVNKLNLKFHWHTHDPYIVLVNNLGGCTREEELIFTHDILQLLDLEGLNIIDVKIGTYMTSLNMAGLSVTLFKLHDPSWLAYYQAPT
GUT_GENOME095963_031272-327KKFINDVSSIVDDTIEGFISIYGDIVKKVPDSRVIARKNIKQQRKTGIVIGNGAGHEPACIGFVGEGMLDGNAYGDIFSAPGPMHLLSAIKEADTGSGVLVLISNHAGDVMNSEMAVDMAKEDGIAVDSVLLYDDILSAPASQFRQRRGTAGTVFTYKIAGACAEEGKGLLEVKEIAERTRDRTRTITAALSPGISPLSKEPMFQIGEDEIQMGIGVHGEAAACVMKYESAQKTAVFMTEKILEDMEWSEGERMAVFVNGCGRTTYMELMIFYNEVRKFLKSKGIKLLRPLTGNFVTTQEMAGIALSVCRMDPEMETYWNRKTDTA
GUT_GENOME069045_0390511-317NLVNDVIEGTIIASPWNNLARLESDPAIRVVVRRDLDKNNVAVISGGGSGHEPAHAGFVGKGMLTAAVCGDLFASPSVDAVLTAIQAVTGDAGCLLIVKNYTGDRLNFGLAAEKARRMGYNVEMLIVGDDISLPDNKHPRGIAGTILVHKVAGYFAERGHNLATVLREAQYAAGHTFSLGLALASCHLPQDAETAPRHHADQAELGMGIHGEPGASVIATQNSAEIVTLMAEKLSAALPETGRLAVMINNLGGVSIAEMAILTRELAHTPLQQRIDWLIGPASLVTALDMKGFSLTAIVLEESIEKA
GUT_GENOME142395_0090522-347KKIINNVDYAVEDMLEGMIKAYPNKIRKLDAGNIIVRKDAPIKDKVALVSGGGSGHEPAHGGYVGKGMLDAAVAGAVFTSPTPDQVFEAVKAVDGGRGVLLVIKNYSGDVMNFEMAKDMAEMEGIQVEAVVVNDDVAVEDSTYTTGRRGIAGTIFIHKIAGAKAEKGASLEEVKSVAEKVISNVKSMGVALTPCTVPAAAKPSFTLEENEMEIGIGIHGEPGTHREELKSAGEITEHLVNKILDDIKIDKGEEVAIMVNGLGSTPLMELFIVNKKVHQMLEDKGIKVYETFVGEYMTSLEMAGCSVTLLKLDEELKELLDANADTP
GUT_GENOME177165_014451-327MTYLVNNEEEFAKEAVEGFAAAYESYVQPVHGGVVRATTSPDKEVSIVVGGGSGHYPAFAGWVGPGMAHGSVCGNIFSSPSASQAKSVCKAASNGGGTLILFGNYAGDVLHFGNAAQQLSAEGDDVRIFAITDDIASGKPENHLERRGIAGDLFIVKIAGAAAAAGMNLDQVEEVARHANDRTRTLGVAFTGCTFPGAKEPLFEVEPGTFGLGLGIHGEQGISSHEMMSCDEIAEMLVSKLLEEEPERSEDYKGRVGVLVNGLGATKYEEMFVFYRAVKKLLEGHNLEIIAPVVDEQVTSLDMAGLSLTLCFLDEKLEQLWLAPADT
GUT_GENOME096544_012453-329RLVNDPRRFAADALHGFVAAHADRVMATPGGVVRASAGDQGQVAIVLGGGSGHYPAFAGWVGEGMAHGAVCGNVFASPSASQVYSVVRAADNGGGVLLGFGNYAGDVLHFGQAAERLRAEGIDVRVLPVTDDVASAPADMTGLRRGIAGDLLVFKVAGAAAADGLDLDAVEQAARRANERTRSMGAAFSGCTLPGATAPLFTVPSDVTALGLGIHGEPGIRDVPLGTADATADLLVDAVLAEEPPRAEGGYHGRVALLLNGLGATKHEELFVVFGRVVERLAQAGLTVVAPEVGEHVTSLDMAGLSLSLMFLDDELERWWTAPADTP
GUT_GENOME231031_0124520-306EMAYPERWKKLIGEKNSAALIRRRPAFGRPVVIISGGAANGPLFPGYVGEGLADAAVVGGAYSAPNAYAIYETGKYIDSGHGVLLLYNNFAGDYLNNDMAAELLAMDGIPVESVWTTDDIASALGEEKSARSGRTGIALMIKLAGSCLSEGYTLHQAAELLRKANERTATLSMRVDFSDRKVYYGEGFSGEPGYMTAPCTDMSSLARQATEMICDDLKPKRGERLTLLVNRMRMTSYSDSFMMTRRVLEVLSREYNVIKTRTANYSNIIDIYGFDITVMKTDAEMEH
GUT_GENOME117419_008543-331KIMNTAQSFVYDMCHGLAAAHPELEFVEKYKVVKKKDINEAKVSLISGGGSGHEPAHAGFVGKGMLDAAVCGDVFASPSQVQVYNAIKQCATDKGVLLIVKNYSGDCMNFNNAMSDAQEDGIKVDAVYVNDDIAVKDSLYTVGRRGVAGTVLVHKCAGAAAEQGKSLEEVKAVANKVIDNVRSFGFAFTSCTVPAAGHPTFEIGDDEMEFGVGIHGEPGRFREKIDYSTGTFCDDLSRRILTDLEEDLKLKKGEEIVLLINGFGGTPLQELYILNNSVTKALNQDGITIHKTIVGNFMTSIDMAGASISVLRVDSELKSLVDYPVNTPA
GUT_GENOME220819_006566-328FLNKPENIADEVLEGLAGAYCDKIALAGNRIVVRAVAKPTDKVAVITMGGGGHEPALSGFVGEGMLDASVVGDVFAAPNAVSVLTALRKFKDYAGILLVVLNHAGDVMSANMALKMAAREGINVKSILVSDDISAGLDTPKADRRGLAGAIAVIKVAGAAAEAGLGLDEVLAVAEKFNSRIATLAVAMTGCTHPQTGQPISELADGEMEIGMGQHGEGGGGRGKILSADETAAIMFGRLAEAVGLESGERVFIIVNGVGASTPMELAIVFRAAKKCAEERGAQVVGGIVDELLTVQEMAGFQMILCVADGETLKYLKARANTP
GUT_GENOME000010_013776-330NHPDNMIKEMLEGYLSIYPGLFDKVDAPNTNGLMVHDHGDKVSIVVGGGAGNEPWIMGYVGRGLATGASLGNVYTAPPSRTILNVTKAVPHDRGVVYICTNHAGDVLNFELVSELAELEGITAKCVFVTDDITSEPIERKDERRGVAGIALVIKVAGAACDAGLPLDEVVRIATKAKQETYTFSVTTSPGYMPGSGRAMFEMPEGEIEYGMGFNGEPGVLHTKLTCADEIAETMMKYLCDDMKLTERDEIAVMINGFGFTSPLELCILGRRVRSLLAEKNARVHDIFIDELFRPQGTGGLSISIMRLDDELKKYYDMPAYSPFFK
GUT_GENOME096558_034751-324MRKIINDADHVVPEMMEGFVGAYGRYFRKHPEVNAILSRQRRKDKVALVIGGGSGHEPMFGGFVGKGLADAAACGNIFASPDPNTIYEAAKAVDNGNGVLFVYGCYAGDNLNFDMGEEFLRDDGIRTAHVRVQDDVASAPKERMEDRRGIAGDVFVVKIAGAACDAGLSLEEATRVTEKARDNTRTVGVATAPAQLPGVDKPIFELGEDEIEYGMGLHGERGVLRTCWQPADVLAEKMYAQIMEDMDLQAGDEICVLVNGLGSTTITELAIVYRKINELLDKDGIRVYDADLNNYCTSQEMGGFSVTFFQLDEELKSYYDSPCY
GUT_GENOME176891_001143-331KIINSPENFVDEVIDGILLAHGDKLRAADPDRRAIVRTDAGTGGKVGIVTGGGSGHLPLFLGYVGQGLASGVAVGNVFSSPSPEQIHAATVASDDGAGVLYLYGNYGGDVYTFDLAADMCEGDGIRTRTVLGTDDLLSAPPERAETRRGVAGLFFAYKTAGAAAERGDNLDQVAEIAQRTCDRTRTMGVGLSPTILPAAGEPTFTLDEGEMEIGIGIHGEPGQHRGPLETADEITDRFTTELLRELPVGEGDELAVLVNGLGATPLEELYVIYRRLHATITERGARLTHVYIGEYATSLEMAGASVSLCHLDDELSELLSAPADSPFFE
GUT_GENOME096553_0090225-354VEEAVGGLVTAHPYADWHNDGFIGLKRDTDAEAQVAVISGGGSGHEPMHAGFIGRGMLDAACPGLLFTSPNAVQITAATNWADRGLGVVHVVKNYTGDVMNFTVAAQSTDAQVRQVLVADDVATDLGNDHSPGRRGTGATVIVEKVAGAAAARGDELGCVAEVAQRAADRSRSMAVALQPGHSPTHDRATFDLQDGQIEIGVGIHGEPGMERRDLDSAGQPSARALVAELLDGILGSLDATGATGATGDSTDGAEVLLFVNGLGATSELELDLVFGEAVQQLTERGLRVRRAMRGSLVTAMNMAGISLTVTVLDEELLSLLDAPTDAPAW
GUT_GENOME098297_0086026-346NKPEDFARETVEGIVLAHPESLKMLNDSYHCLVRADAKKPGKVAIATGGGSGHLPVFLGYVGYGLADGVTVGNVFASPSAEAMYEVDKAIDNGAGVLHLYGRYGGDIMNFGMAADLAAMDDIEIAEVLVTDDVASAPKEKADKRRGVAGLFFAYKTAGAAAEKMFSLAEVTRIAQKTVDRMRTMGVALTPCIVPEAGKATFSIGEDEMEIGMGIHGEPGIHRTKLQSVDCVVEDILKKILEDLPYQSGDEVAVLVNGLGATPKEELYLAYRKTHQILMEQGINIHRKYIGEYATSLEMAGMSITLLKLDEELKELIDAPAY
GUT_GENOME135133_009789-311RNIVQDALQGFVKLHADLLFLEKENLVINKQLANAERVTLVAFGACGNELGFSRYVGEGLVDVAVVGDLLSAPGPQSCLDALKMADKGYGVVLLNLNQLGDNLSAEIAAKEAEKLGIKVSRLIVCDEVVLGEGPDVRRGLGGCLPVCKIIGAASLNGKSLEEITRLAQSFLDNMATVTIAFNDKEMKLNSGIRGENDGEIIASTTSDALAIEVTSMLIQDLQLQAGEVVYTMINTSGNINHIEAAIFGRDVLNALEQHDFVLASIVSGALFEMPETSVQVTLARMEQEVKSYLAAPCNSVAYK
GUT_GENOME036824_02306463-785DNKKTLVQDMLEGYVKAFPRRVRLLGNHILLRAKEKDLSKTAVIIGNGSGHEPAMIDLVGEGLFDANVCGKIFAAPSPMEMMDALKELSKNGHKEILILVSSHAGDILNAKMTLMLAKAEGIKADMVVLWDDISSAPKGMEQERRGTAGLFFAYKVVCSAAEDGMKMHDLIVLAEKTRDNTRTLSVAIKSGTHPETGLPTFELPEDEIEVGMGVHGEAGTGRMKLPTAKDLTEYMMEQILADKPFISGDKVGVLVNNAGSMTRMELMIIYRHVVRIMEEKGIEIVRNWVGTYVTTQEMAGFSIAVCKMDDEILKYYDYKVDSP
GUT_GENOME141728_000474-317IMNDVQHIVQDMLHGFYFEHNDKVNYDETNNIIYVKDIEKLKQNVAIISGGGSGHEPADIGYVGKGMLTAAVNGSIFTPPTVEQIVAATRLMPKDKSILFIIKNFKDDVENFLAAEQIAKEEGRKIDHIIVNDDVSIEDDASFNKRRRGVAGTVFIQKILGASALEGHSLEELTTIGRSVTENLHTLGVALSPANDPVQGKASFTLNDDEVFYGVGIHGEKGYRKEALSSSEILAIELMNKLKSIYRWRKGDNFAILINGLGATPLMEQYIFANDIRRLCELEGLQITFVKVGTLLTSLDMKGVSLSLLKVEDL
GUT_GENOME207949_012804-317KLINKKETFLTDMLEGLLIAHPELDLIANTVIVKKAKKEHGVAIVSGGGSGHEPAHAGFVAEGMLDAAVCGEVFTSPTPDKILEAIKAVDTGDGVLLVVKNYAGDVMNFEMAQEQLAEMEGINVQTVIVRDDIAVTNEVQRRGVAGTVFVHKLAGYLAEKGYSLTEIKSRVEALLPEIKSIGMAIEPPLVPTTGKYGFDIEDDKMEIGIGIHGEKGIHREEVKDIDHIVGTLLDELYKEVTANDVILMVNGMGGTPLSELNIVTKYIQQNLAARTVNVAKWFVGDYMTSLDMQGFSITIVPNKPEYLEAFLAPT
GUT_GENOME231654_026613-330KILNKPENYVDETLAGLCLAHNDIYRQPQPRLITRANSSDKAKVGIVTGGGSGHLPVFTGYVGEGLLDAAAVGDVFASPSADLMADAIRAADNGQGVLLLYGNYGGDIMNFDMASETVDFEDNIRCTTVLATDDIASATPEEAHKRRGVAGMIYGFKMAGAKAQEGATLDEVTRIAQKTMENTRTIGCALTSCTLPAVGHPTFEIADDEMEMGMGIHGEPGIWRDKLRSADDIAEEMFQRLQTELSLKKGDKVSVLVNSLGATPLEELYILYNKVVQLIDNTGVTIIHPLVGRYATSMEMTGASLTFCKLDDELEALLNAPAHCAFWR
GUT_GENOME091123_0002110-306SALDEALNGIMRACPGKYTRLPSEYGYGLYRNNLPENQVRIIVNGGGGYGPMWSGFAEEGLADAIVHGNFDSAPNAYVLYEMAKFIDCGKGVLFLTNHYMGDYLNNDMAVELLAHDGIKASMCCISDDILSCEGEPAENRGGLHGIGQVCKICAGAARDGYDLEQLSTLAQKANSRLRSIALNVRDNRVYLGEGFSGEPAVKEEPFESVDQMFSDAVEILMSELNQWKESPVYLSVNKHCDVGFTESMVLLEAAARQLEKREIRICGCGAGTYFDVFPGKGCMLSLISCDEEMEKYI
GUT_GENOME237048_0105724-324HLKKVEGIYGYVDSSFDDSYIPVISGGGAGHEPSDWGFVGGNMLTASITGKLFEPPMPEEIIKVAKAVTKKKRAFFIIKNFEKDVRVFKQAIEYLRNIGWSIEYGIVNDDISVDASSLRKYRRGLAGTILIEKIIGTLAKKGKSLVDLKMVCEQLLPETKTIGLAFSDSVLYKKSCHTLSLGKDDIYYGIGIHGEAGYRKEHWQKSELMAREIVNKMRLNFRWRKKENVALLVNNIGKISPMEELTFTADITELLELEGLKIPFIKTGSFMTSHDMSGISVSILRLQDKWLEYLKQPSTCY
GUT_GENOME188425_0165230-295ETRYGYALHRRNLEPRVQVILDGGGGFGRMWPAMVCDGLADAVVHGELETAPNAYTLYEVAKKIDQDKGVLFLCNHFMGDYLNNDMAVELLNYEGYQAAACYVKDDILSALKEKQEERGGLIGILFLAKLLTGAAKSGYALNKLAELAEKGKRHIGSVSICLNESTREIQYGKGFSGEPPKACKPFVNVENMVSEALDLLLAELPGRGNHVHVILNRLRLMSYTESAVVLNALGRQLRLRGYTCGKFAMGNYFDVFSQNGCIISLM
GUT_GENOME096547_0038819-338QALAGFVASHPDAVWNPAGYISRRTPVVDVNGRPAVAVISGGGSGHEPMHAGFLGAGMLAAAVPGLMFTSPNAVQVTEATRAADAGRGVVHVVKNYTGDVINFRVARQGLPEVETREVIVADDVATTSDSGPGRRGTGATILIEKIAGASAYRGDDVDEVTRLARVAAENSRSMAAALTPGHLPTSGRYTFDLGEGEMEVGVGIHGEAGVERTGVTSAHETVARLAGAVVDDLGLGEGDRVLCLVNGLGATTNLELDLVFDEVLRHLADRRISVARSIVGSYVTSVNMAGVSVTLTRLDDADGDTLVELMDAPTAAPAWP
GUT_GENOME064922_0257611-333VVDEMLAGYIASYPDRFCKLDGYHVLLNKNEKDKVSIVIGAGGGNEPWPIGYVGEGLADACSLGNVFAAPTAKSILNAIRYVPNEKGVLCIATNHAGDVLNFELVAGLAEMEGIHTRQIYVSDDITSAEKTQREERRGIAGVSLIVKIAAAASEAGLSLEEVYEIASMANKNIYTVSVTTSPAYILETGQPAYELPDGEMEYGMGFNGEKGIERTALSTADEVMERMVQMLWEDMNLEPGEEIAVFLNPYKATTVLESYTLMRKCLELLEEKGIKVYDSYVDSLFPTQGAGGFSLTFLRMDEAYRRYYDQPADSPLFKKGKVV
GUT_GENOME079006_0254611-324VVEESLQGLLYANGDLYEKVPDACGIISKERNGKTAIVSGGGSGHEPMLSGLVGKGMLTGAAVGNVFASPDPYTIKETIKAADQGKGVLCLIWNYAGDTMNFDVAEDLASMEGVKVSHVIVNDDIASAPKEDKKGRRGVAGILFVAKAAGAAAELGYSLEEVKEAAERANEHVRTIGIAITPCSIPGNPPNFELGEEEYEYGMGIHGEPGISRESLVSADELTEKMIGDLKEEFGDLKGREAVLLVNGMAATTDLELYIINKKACELLKEAGVILYDNIVGKYCTSLEMAGASVSLFILDEELKRLYDQPACTR
GUT_GENOME258560_005363-336MKKFINQPERFIDEMLYGLYAAHPSYYTCVNEDAHCLVSRHKAAGKVGIATGGGSGHLPLFLGYVGKGMLDGCSVGDVFQSPSSEQMLGVTKAIDSGAGVLYIFGNYNGDIFNFQMAAEMAEMEEDIKVHLSIAGEDVASGPRAKEQEKNIRRGVAGIFFVYKCAGAAAAEMLPLDEVYRIAEKTKLLVATMGVALSPCTVPRIGHPSFTIAGDEMEIGMGIHGEAGLRRGKICSADDIVKEIVPPVMQDLQLRHDDEVCVLINGLGATPLEEQYLITGLVHRLLKDAGIRVHRTYVGEYATSLEMTGLSISLLKVDEELKRLMDAAADTPMFK
GUT_GENOME059413_012904-329ILNSKETLIEDMLAGFIAAEGKRVARSSENSQVLFRKEKKAGKVGIVSGGGSGHEPLFFGLLGKNMVDAVAIGNFFAAPTPGTVLEAIKQADQGAGVICLFGNYAGDVLNFDMGVELAEMEDIKAYNLPIADDVASAGKEEQRERRGIAGDLFVIKEVAAAAAQGCSLEECLRVGESANQRIFSIGVGISPGTNPVNGQKNFELADDEIEFGLGIHGEQGIERMKMLDAKSLVQRMVDKLLIELMLFEEKEVIVCINGLGATTHLELYTVNHEVDKLLRGKNFNVYDTVVGSICTTQEMAGFSITVQVLTEELKEMYQMDAYSPCY
GUT_GENOME141220_021994-322LINAPEQVVEQMLAGLLLAHPTLHQIGRQLAIANGNLKRKVWLVSGGGSGHEPLDVGYVGDGLLDAAILGPVFTPPSVESIVATMKVLGPKRPYLFIVKNFASDMRHFTLAREQLVAEGYQIGLVSVADDQSVDPDTLTARHRGVAGTVLMIKLLGAAADAGMALADLQTLATEVNHQLVTLGVALSSTEAPGQKKPAFDLGAAEIAYGIGIHGEPGYRSEPMVSSELLARELVNKLTQINDWPEKAPMAVMINGLGGLPLMDQFVFTHDAVELIRLHGFDMQFVKTGTLLTSYSMQGVSLTLLGLTEQWRHWLGKPVG
GUT_GENOME134618_0040335-315DTTSGILLAKDVPPAQFAVLVSGGAANGPLFAGYCGSALADAVVAGGAFAAPNAYALYEAGTYLAGGNRGILLLYNNFAGDYLNNDLAQELLELDQIPVESVRATDDLASAVGEARSSRNGRCGIALLIRLAAVCRNMGMGLHETAALLRRTNERLNTLSVHLDCTTGTADFGAGFSGEPGLFTRAGLSREELAAQAMQQLLEDLAPVPGEQLLLLVNRLRLTAYPDGYAMAGAAETWLRRSYPVFRLCVGAYSNIADRYGWDFSLLRVPADQIPMWQQPV
GUT_GENOME186351_0011915-300HPALAFISGGGGGHEPLDSGFVAPGMLHACVTGPRFTSPSAPDITQLMEKVATELGVNTIVALVKNYTGDVLNFELAAELAAASGLECIMIRVCDDCATREVSQGARGIAGTVLYEKILGAAAWLGQTSEELASFAKQISESIVSYGVAFHAPSDEQGIPVYDIPAGQAELGVGIHGERGVDRSTFSQRREVIEHCVDEVFSQLAPGDDPLIVLVNDLGQADPNDIESALLSVNERAENLGMSIARTKVGRFVTSADMDGLSITLMRANPDILSLYDAPTLAPSWS
GUT_GENOME140237_0005011-331LIADELQGFAKIYPRQVKLADSVTAVVRAVPKSAGKVKLVMGNGGGHEPAVIGWVGEGMFDCDVVGDLFSAPSGEKMFQAIEEIDDKSPVLLCIQNHAGDVINGNIAWKKAVKHGIDIRKVLFYDDIASAPREENEERRGIAGMLFYAKIVGAMAEAGASIEECIEMFERVRDNTRTYAAAYSSCTHPGTGLKMFEYLENNDLMEMGMGVHGEGGGDNRIPMPKAGRLAELMADALIQDKPYRPGESLLILVNGTGAATAMELNILYNELEKSFRSKGFEIAGCRVGNFLTTQELAGVSVSICSVDDTMKELWERPCDCPL
GUT_GENOME090048_0089018-327GLALANGGAYGMEAGSQSVSIKQKKKDRVAIIVSACAGYEAMLGAVLGENLADALVLGPDLDPYSIMRTAISVNYGKGVLFICSNEDQERESFLSAGGLLDEAGIDCEVVYAWDNISPGGTNDVSLRKSSAGVFFCTKIAGAAAASGLSLKEIYRTVREARNYTNSLSVCVGKGRTLATANSLFELPDEAELDIGMAGNAIERNLNLRSAEEVVNRMAYHLILASGIRIGDEVCTYVSGFGGTSLSDLAQINFYLKKKLEEKGIKIHDMVINPASSKFNTLGATISIMQVDDRIKKYYDMPCDSLYFKKN
GUT_GENOME194757_0000125-327VRIAEPSQHLRILVRADWDREHHGRDQVALLSGGGGGHEPAHAGFIGEGMLTGAVVGGLFASPSVDAVLAAIREVCGAAGCLLIVKNYTGDRLNFGLAAEQARQEGLAVSMVIVGDDIALPDSPQPRGIAGTLIVHKLAGWLAAQQMPLETLSERVDAALPRIASLGLALSHAARPGEAKAPCAPELGLGIHNESGVRRVDPESASEAVETLLAPLTEALAARGYAPPYVALLNNLGGAAEQEMQVLAASLLVNRRGLALSGMIGPATLMTSLDMQGFSITLVAADDDLSQALDAPTDAPAWP
GUT_GENOME056044_03681133-407LESSASDNAPSYMHNVGKTVNTAGDIFAAPNGKLVFDAMKLADKGHGVLLLTLNYAGDQLAGKQAMKLAQKAGLNVRQVVTGEEIQFDPNGEDNKRGLAGAVALYHIAAAAAREGKSLDEVAEIAERYAKNMASITVKSTDATHPQNGMSFGDLGETDLMEIGAGQHGEGGGVRVPMKSSKETVATVAEALCRNLELKAGDRAFVMINGCGATTMMEMLVLFKDTVEFLKERGVEVAASMVGEILTVQEAGGFQMNIAKWDDEFIRLWNTPCHTP
GUT_GENOME075922_021311-325MKRLINEEEKIVDDMISGYIKICRGAVKKVPGVNAVIKTHAKDKVSVIVGGGSGLDPWPIGYVGDGLADGAAVGNIFTAPPARSILEATRCLPHEKGVIYVVTNHAGDVLNFELVSELAGLEGIKTRQIYIADDIASASRSEREERRGIGGVAILTKIAGAVTEAGNTLDETERILRKANENIGTFSVTTAPSCSPVTGKECFTLDEGQMEFGMGFNGETGISREMITGANGIAEKLMKELLNDLGVKAQDKVAVWINGYSLTSQIELSIIAGTCCDILEERKIELHDIVVERLYVTPGAGGLSISIIKLDEEMTRYYNRRAEAP
GUT_GENOME176717_0129114-324SYISATVSSHPELEKHPTLPLVYQKNRNPHQVPILAGGGSGHEPAHIGYVGEGMLTAAIYGQLFTPPTRTEILESIRFLNNGHGVFIIVKNFEADIKEFSSAINTARKEGIKVGYSLAHDDISIEPHNRFQIRGRGLAGTILLHKILGFAAQNGANIAQLTDLGHKLAPEIATIGFARKSASLPQTTLPLFSLEEGNISYGIGIHGEEGYRIVPFQSSEILANEIISKLRLHYHWKNGDQFILLVNNLGTSTNLEMGIFINDLLQLLEIEGVTITFIKSGTFMTSLDMAGISVTLCPVKNKQWLEALNAPT
GUT_GENOME168820_026914-326IFNRPSDFAKEMVAGFVSAHASLVRQVPGGVVRNTQSKAGSVAIVVGGGSGHYPAFAGLVGQGLAHGAAMGNLFASPSAQQICSVARAANNGGGVLLMFGNYAGDVLHFGLACERLRAEGIPCETIAITDDISSAPLAEKEKRRGVAGDLVVFKAAAAAAERGDSLAEVLAISRRANEQTRSLGLAFSGCTFPGAEHPLFTIPEGKMGFGMGIHGEPGISELDIVPSAELARMLTTTLLKERPQTIAQTKGARLGVIINGLGSVKYEELFVFWHDVQQLLQAADVVVADVQIGEFVTSFNMAGMSMTFVWFDDELEKLWLAPA
GUT_GENOME283042_00534235-557IINEPADVVKEEAEGFLAAFGDQFAAVPGVNGLVKKEIPAGKTALVIGGGSGHEPMFGFFVGDNLADAAANGNVFASPDPVTITKTAQAAERGAGVLFVYGNYAGDNLNFDKAAENLESLGIPVRTVRIWDDVASAPKERITDRRGIGGDVLALKIAGAACAELNLDEAYRVTAKARDNLWSIGVGLEGATIPGQKEPIFTLPPDEMEYGLGIHGEPGIRRIRMESADEIAENLVAALLNESGIQEGDTVCTYVNGLGSTTLMELMILNRRLRQLLDEKGIHVHDMDVNSLVTTMEMAGASISMMKMDEELLHYYDQPCSSPY
GUT_GENOME065146_0095814-334IVEEMVEGYVGAHKQYVKMCDLDEAQGRVVLANDAGTKDKVGVIIGGGSGHEPLFIGYVGEDFADAVVIGNINTSPSPDPCYAAAKACDNGKGCIYLYGNYAGDVMNFDMGAEKADEEDDIRVETVLVTDDVVSSENIPDRRGIAGDFFVFKVAGAKAATGADLDEVVAAAQKANDNTRSMGVAMSSATLPAKGGTIFDMEDGDMEIGMGIHGEPGIRRGKIDTADNVIDEIMEPILKDLPYVEGDEVYVLVNSLGATPLIDLHVCYRRVAQILEEKGIKVYKALVGPFACSMDMAGMSVTLMKLDDELKELMDAPCDTPY
GUT_GENOME000962_0006911-329IMQETVEGYVLANKNRLKLTPGTHNITRKDPKEPGKVKVVIGNGGGHEPGTMGQVGYGAYDLVSLGEVFAAQSGKKFFEAIEDIDDGSPILLTIANHAGDVMNGNMAYRLAKEKGMDIEKVIHYDDVSSAPKGYEEERRGTSGFFFPVKAAGAVAEEGGTLEECVKAFHKTRDNTRTISIALSGCTHPQTGMQMMSIPDNEVSIGAGAHGEGGSYSGNFTNSYEMLKIAADYLIEDLPYKEGDEVLLLVNGMGSTTMMELSIVYREDLCNYLGTHGISVYSGVAANMVTTQESSGVSISFCKVDDDIKKWWDAPCSTPV
GUT_GENOME177329_0158640-321NPAPERTVALVSGGGAGHEPMHAGFVGRGGLDAAVPGEVFTSPHSRQIFEASRAVAKSGGVLHIVKNYTGDCINFNIAAERLSAEGIESAQVLVNDDLGTDSGAVGRRGIAGTTLVEKVLGAAADQGLGLAELKELGDALVAATRSLSVARRAHTSPGADHLAFDVEKDELEYGVGIHGESAKETIAQPTLEELTQRMISELRDALTKKFEASAGALLFVNGLGGLAPLELLHIQTAAHEALADAGVNVRVAFSGNYTTALDMAGFSITLTALEEAWLEYVC
GUT_GENOME030091_012906-330NAPDQLVEECLKGYVAAHKEIIELEGDHLVVRKRKKGSGRVKFVLGNGAGHEPAVIGWVGPGMLDANLVGEVFTAPSADKLTEALAYLNDGSPILLAVQNHAGDVLNANLAYAKARKMGIDVHKVLFYDDIASAPKGMEEERRGMAGMLFYVKIVGAFLEEGGTVHEACELFERVRDNTRTYSVAFTQCTHPVTGMDIVALPDNEIELGMGVHGEGSGANRIPMPTSAELAKKVCDVLLEDKPYQHGDEVLVFLNGLGSTTTMELSVFYHDLLNYLGTKGIKVYDGFCDSCLTTQELGGVSVSLCSVDGQMKKLWDRPCECGIWC
GUT_GENOME067317_0301712-329VTDMIDGYVGANSALLERLDGTKSILVRNAGERVMVLAGGGAGCEPLYIGCAGIKMADAVVSGNIFAAPAATALLKTIKQMYHEKGILMVTGNYVGDVLNYELAAELCSYEGIEARTIFVRDDILHMPKERAMDRRGIGGILPVIKTAAGAAAEGLDLDEVERIARKAERSLGTISVTFGPGYRPETGESMYEMQSGYIEFGMGFNGEPGIRRVKMPSADQLAEIILKDIIEDMSLREGMEVALMVNGKGATSNMELYILTRSLTDCLENKKIRTFNTETGNFFTAPGMQGVSVTVMRLDEELKHYYHQDSYTPMYAY
GUT_GENOME194786_00769122-433NITAELLEGYALAYPNQVKLAAENIVVRANPKGNDKVAIVTLGGSGHEPALSGFVGEGMLDCSVVGDVFAAPGAQRLFQALQLMDREAGILLVVLNHSGDVMSANMACQLAARKGIKVKKLLTHDDISAGIGANVDDRRGLAGCVPLYKILGAAAEEGKSLDELIEIGERYNKNVATLAVAMRSCTHPQNGGTITNLPDGIMEIGMGQHGEGGGGQKPLVSADDTAAEMVDLLCQQLQPKAGDKMLLIINGVGATTHMELNIIFRKAYKELEARGLQVVVSRIQEILTVQEQAGFQMIMAILDEDHIDYLNN
GUT_GENOME143421_010893-322QLINSRGQIRQQLLAGLTYTYNDTLTWQEKTGIVTKKTISKDKVVLISGGGCGHEPAHVGYIGENMLDCAVMGAIFEPPASSEILQAIEQTYNGQGTLLIIKNFEKDLASFLEAERLAKAKGLTVSHVIVDDDCSIESGTFEKRRRGVAGTVFVHKILGAAAAEGNSLDELKTLGEQVIPLIKTLGVAFSPASPIGVIPQQYELAENEMYFGIGIHGEPGYRIETMQSSERIAVELVNKLKQQYERKELRKAAVLVNGLGGIPLLELSVFMNDVQQLLDIEDIDVVYKRMGNFLTAYNTNGLSLTLLTIKEDKWLDYLRI
GUT_GENOME142490_022024-329LINDPRYVVEEMVEGYVKAHPNHIKQLPENDRALVTARETREGKVGVLIGGGSGHEPGFMGYVGDGMADGVAVGNIFASPSPDPILEVTKAIDKGAGVVYLYGNYAGDVMNFGMAAEMADLEEDIQVGTALASDDVASAPKEEKEKRRGIAGEFFIYKAAGAAADFGYNLDDVVRVAKKANENTRSMGVGLSPCSLPQTGEPSFEIGDDEMEIGLGHHGEPGIEKGPIESADKVADRLVHDILADIEINSGDKVAVLVNGLGSTPRMELYIVYRRVEQILREKGIEIYRSYVGDYITSLEMGGCSVTLMKLDEELERAIDHPVDCP
GUT_GENOME170490_007886-328NKPEDVERQVVDGYVKSWPALIRKVGDDVVVRTRPKQGKVALVSGGGMGHEPAHLGYVGEGMLDAAVGGAIYTSPAVDRISAGIEAVAPGASGVLLVVKNYTGDVINFKLAEQAAREEGVSCDHVVVNDDVAVGSPTEREQRRGVAGTVFVHKCAGAAAEAGKPLAEVKRVAEKVVDNVRTMGVAIAPCTVPAAGMPGFTLADDEMEMGVGIHGEPGIRREKMEPVDAMVDEILNHVLADIDYAGHEVAVLVNGAGGTPDYELALVANHVHDALAARTIPVWHTYMGNYLTSLEMRGFSVSLLRLDDELRSLLAAPANTPAWQ
GUT_GENOME246025_010186-332NDPNMVVEDMLKGFAKCHRDIIHVEEENPRVIISNSFNKQKKVGIVTGGGSGHKPAFIGYCGEHMVDAVAVGEIFSSPTAKSFYGAIRAVDQGMGVAVLYGNYAGDNMNVKMAMELAEDDDIEVRTVVANDDVASAPKEDIAKRRGVAGEIFLWKIGGAKAAQGADLDAVIHAAQKAIDNTRSVGVGLGPCTIPANGKPNFQIEPGTMEFGIGHHGEPGVRVERLGTAEQMAQEMVDMVIRDAPFVSGDEVAVLVSGLGATPVMEQYIFYDEVEKLLRVAGASSSSVSLHGIRVYRSYVGNYFTSLEMNGVTLTMMKLDEELKECLD
GUT_GENOME201192_004133-242MKKFMNAPETVTDEELVGLGLAYPDILNVDGHLVISKDLAAADRVTIVTYGGSGHEPAQAGFVGNGMLDVQAVGDIFAAPNGQLVFDAMKLADKGHGVLLLTLNYAGDQLAGKQAMKLAKKAGLNVRQVVTGEEIQYDPNGEDNKRGLAGAVALYHVAAAAAREGKTLDEAAALVTQQVQSLISSFKITLRTQDGRSWDITGDSLNMQYNVADQLDQLWAIGHTGSSSVRYEQVKALEEE
GUT_GENOME203915_003939-328NDPEKVVEDELNGYLLVHGDQVRRSERNARVLVNRQRNPHRKVALISGGGSGHEPAFLGYLGKGYLDAVAVGEVFASPPADAFLDAALEANIDGEVIALIGNYAGDVMNMKMAAAKAGRQNVRIHILTSKDDISSAPKEEIDRRYGMAAGFFTWRFAGYLADQGLDAEEIVARVENLSNSIRSIVVGLSSLEIPSAGKENFVVPEGKMEFGIGHHGEAAIDTPELMSANMIGKKMTEALLNDYALERSTSFVAMISGLGATPPMEQYIMADAVNQAFKTLNHSLSKVYVGNFITSLNMNGISLTLVPAESQLVSSLETPY
GUT_GENOME185251_0237319-351KLINAVDNVVTDALRGMAAAHPHELDIDLDQHIVYRRRPKEQGKVAIISGGGSGHEPLHGGFVGTGMLDAACAGEIFTSPVPDQVVAATSAVDRGAGVLHIVKNYTGDVMNFEMAAEIVEAESGIQVASVVTNDDVAVEDSLYTAGRRGVGVTVMVEKIAGAAAEQGRPLDEVAGIAARVNAAGRSMGMALTSCTVPANGKPSFDLPEGEMEIGIGIHGEPGRHREKIAPAAEIAERLVTPILAELTALTELESAGADEGVIAMVNGMGATPLLELYLMYGEIARLLQGAGITVARNLVGNYITSLDMAGCSLTLVRADAELLSLWDAPVNTP
GUT_GENOME153026_0166914-331IILEEMEGVVLENPSLYEYLPQYRCIAVKERKQDKVVVLANNGGGAEPMFAGLAGEGMADAVCTGHLFSAPSAYHIYESAKYIYAGKGVLLLTGNFTGDFLNNDLAAELLEIEGYQARCVYVRDDIGAAGKDHKERRGGIGGMLWVLKMASAAASMGLDLDEVENIARIAADHIYTLPIVFDTGYLPVTGTPMFQMPVRDHIEFGMGFNGEPGFLSMKMPKAGELAEKILTLLLDEFEAGEQDHVCLMVNSLGSLGFLDLHVLEGKIIKGLRSRDISVYDAISGWYSPIQGMGGITVTLLHMVPELKPYYDYPAYTPF
GUT_GENOME252794_0187630-291TQSVHAEPCKRDKVSVVICGRVGYEAALGFMLGENLADAIMIEPELDPYAVMRTAISVNYGDRGVLLICGSSEEEKSVVMAAGGLLEESGFNNEVVYVEDCIAPGGTEQMNAGYFHCVKIAGAAAASGMTLKETYYTVRKARNLVKSISVRVALTSGGRGRDDVGTLFGEGTMEKIAFQLRQASRARTGDTVCTYISGAGGVSLSELGRVNYQLYRSLTERGIAVHDALINPVPVGAANADIAISMMLLPDELKKYYDMPCS
GUT_GENOME195585_008634-331FINDPENLTSELLEGLALANKDIIHLEDGNLVVNNKLKDADRVTIVTLGGTGHEPAISGFVGEGMVDISVAGNVFAAPGPQACIEAIKMADKGHGVLFVVLNHAGDMLTGNLTMKQVKKLGLNVIKVVTQEDIANAPRSNADDRRGLVGCVPLYKIAGAAAAAGKSLEEVAAVAQKFADNMATIAVAAKGATHPATDMVIAQIEEGKMEIGMGQHGEGGGGLTEMKTADETAAIMIDALLRDLDIKKGEKLLVIVNGVGATTLMEQLIVFRKCCHYLAEKGIEVVANIVGEVLTVQEMAGFQLFIARMDDELLKYWNAPCRTPYYKNK
GUT_GENOME146046_0346929-367GLAKAHPSLTLHQDPVYVTRADAPVAGKVALLSGGGSGHEPMHCGYIGQGMLSGACPGEIFTSPTPDKIFECAMQVDGGEGVLLIIKNYTGDILNFETATELLHDSGVKVTTVVIDDDVAVKDSLYTAGRRGVANTVLIEKLVGAAAERGDSLDACAELGRKLNNQGHSIGIALGACTVPAAGKPSFTLADNEMEFGVGIHGEPGIDRRSFSSLDQTVDEMFDTLLGNGSYHRTLRFWDYQQGSWQEEQQTKQPLQSGDRVIALVNNLGATPLSELYGVYNRLTTRCQQAGLTIERNLIGAYCTSLDMTGFSITLLKVDDETLALWDAPVHTPALNWGK
GUT_GENOME096507_0010014-332IVDETLDGLVAISNGLLTRDPGSRVIRSTKKEQGKVGLLVGGGSGHEPMYGAFVGPGLANASVSGDIFAAPAPQHVQEAIEAADAGAGVLIVYGNYAGDVMNFDMGAEMAEDDSDIESKTVLVRDDVATDDYEGRRGIGGAFYVVKAAGAACAKGLSLEEAAAVTERVQYNTRTIGVAVRAGVLPHTGKRTFELGDDEIEIGLGMHGEVGVERSKMMSADELTEKMVNMITEDLPYESGDRVAVLINNLGATTQMELLVVYRKVAELLEQKGITVGRTDIGAHFTSQDMAGLSLTPLKLTDEIEELIAAPCESVPYTQG
GUT_GENOME156682_0123125-324HLSQLKQLPVIYHNQHDSSVVPIISGGGSGHEPAHFGYVGDGMLAAAISGPIFVPPCAEHILETIRFLHQDKGVFVIIKNFDADIKEFSQAIYQARKEGIPVKYIISHDDISVEKSNFQIRHRGVAGTILLHKILGQAAKEGANLDELEKLALSLSTSIATLGVATKSATLPSKQLPIFDLPQGQISYGIGIHGEPGYRMVPFESSELLAVELVNKIKMKFKWQEGQEFILLINNLGGTSKLEELVFTDDILQLLDIEGLKLPFVKTGHLITSLDMAGLSVTLCRVSDSFWLRALETPTN
GUT_GENOME171522_0240512-333DNIRSEIMEGLVYAGRGRIHALPEYCAVYRTMTQTGQTVIVSGGGSGHEPTFAGFVGEGGIDACALGEVFTSPSPDQIIEASRAVHQGNGILFLYGNYSGDGMNFDIAAEILAEEGIECRTVRATDDIASAPPEKMSDRRGVGGLAFLYKLAGAAAQFERYTLPALEALAQKANHNTRTIGVALSGCALPQSDAFNFTLADNEIELGIGIHGEPGLCRMPMPHSDELVEMMIARLCDDLPFGHGDNVCVSVNNLGALSNTELMVITRKVGMALAERGIHTHDVQVGHFCTSLEMSGFSLSLLRLDDEMQRLYDHPINTLGWR
GUT_GENOME014919_0122310-322REDMLKGYVAAYPDYVVEVPGGVARATKMPEGKITVINGGGSGHYPAFCGIIGDGFMDATVVGNVFTSPSTEDVLGVAHSVDNGNGIFIVGGNYAGDKMNFNMARDRLIEEGVDCRTFYITDDVAAAKPEEKEKRRGNVGTFMVFKAAGAAAAAGVSFDELVRITKKANERTRTMSMGFRGCTLPGEAAPLFHVPEGKMEVGQGIHGEPGVGEDDLKPASEVAKILVDRVLAEAPADDSKRIAVILDGLGSTKYEELFVVWGTVRELLEEKGYTLVEPLVGEYVTSLDMEGIALSVEYLDDELEGYWSAPCDT
GUT_GENOME120040_010515-327KIINAQENIVRETISGYIRSQQDRLHQVPGTQVVVRNELDQGKVGIVMGYGAGHEPDAIGYMGKNYMDAQAIGGLFAAPGPDPIYEALKEANTGAGSLILISNCAGDILNAKLALEMCEEDDIPAKGILLGGTNVADPYTGDRKDRRMGVALFETKMISGYAGLGHTIDEVIEFGNEVIDNVRAITIGIRPGTSPSTGDVMYELPDDELTLGAGGHGEAGADNIPMCSSRELASQVLEMILNDKPFVPGDELSFIVNGTGGTTMMELLIFYNDVWNILEEKGYKVFKPIVGTLSTIQESSGIVLDVCKMANEDMKKTWTMPTD
GUT_GENOME127701_0031064-380NDPAAFVDESLRGIIAAHGDQLKFAGDDTRAVVRKEAPVEGKVAIVTGGGYGHLPTFLGFVGQGFCDGVAVGNVFTSPSSDAILNAARAVDGGKGILFLFGNYMGDTMNFEMASEMLQFEDIPCQIIKGSDDVASGPRSDWENRRGVAGIFFAYKIAGAMADTGASLEEVCEVTRRACERIATMGVAFSSCQLPGATAPIFEIGDDEMEIGMGIHGEPGVERGKMRTSAEIAQVLVEKVTEDLSLGAGSEVALLVNGLGATSREELYILYNDVKKIMDKKGIAIKRVFVEEFATSMEMQGASLTAFALDEQLAELLN
GUT_GENOME169130_001302-316QIINEPQNALNQAIQGIQRAYPNLQWVPGTLGCYDRNFRDDQVPLIAGGGSGHDPAHWGYVGTGMLSAAVMGQVFQPPTPQEIIKVTKQVTKNHEAFFIIKNFPADVAAFTTAEAQLTAEGWQVGHCIVADDISVDNASLKQRRRGVAGTVLIHKILGAAVARGTSIAELTDLSASLNDNIHTIGVAASGAQIPGQTATSFSLQPDEIYYGVGIHGEPGYRREPFQSSERLAQELVSKLRLNFRWHAGDHYAVMVNSLGGTTPLELMVFNNDVHELLDLDAVTVDFNKVGTFLTSNGMHGLSLTLLKLAHPSWLP