UHGP-MC 30302


Information


Number of sequences (UHGP-50):
202
Average sequence length:
109±20 aa
Average transmembrane regions:
0.03
Low complexity (%):
2.09
Coiled coils (%):
0
Disordered domains (%):
3.08

Pfam dominant architecture:
PF03932
Pfam % dominant architecture:
8267
Pfam overlap:
0.45
Pfam overlap type:
reduced

Downloads

Seeds:
MC30302.fasta
Seeds (0.60 cdhit):
MC30302_cdhit.fasta
MSA:
MC30302_msa.fasta
HMM model:
MC30302.hmm

Sequences list (filtered 60 P.I.)

Protein Range AA
GUT_GENOME008216_00804109-210ERTLQMSELIHSFGKEAVFHRAFDVTKDPFQAMEVLLSCKIDRLLTSGQRAKAMQGSELIAQLQDRFGDRIEILAGSGVNAQNAGELLAHTGIRQVHSSCKG
GUT_GENOME000576_0286196-209DGRVDEGFLGRIIKHKGDLSLTFHRAIDSSRDILEAAEVLADFPEVDRILTSGGHATALEGQAVIAQLIEQNPDLIVLPGSGITVERAEELLKATQATELHVGSAVLVDGQVDA
GUT_GENOME021812_0128197-208FLNPNRTIDVDNTKRMVELIHSYQKEAVFHKAFDEVLDYDKAIEDLISCNVDRVLTSGTAPTVLEGLDVITHLEEKYGNQIEILPGCGIKEDNIVEILEKSKVSQFHMTAKS
GUT_GENOME142587_03062151-238VTFHMAFDQIPYDLQPKAIDWLCAHHVDRILTHGGKGGTKIGDNVDNLKRLLDYAGERIIIMPGGGVTSANVEELHQKLGFKEIHGTK
GUT_GENOME236260_00581109-210RRMVAAAGNMSVTFHRAFDVCRNPRQALEQVIGLGCDRLLTSGQQPKALAGAELLANLVELAGGRIIVMPGSGINPGNIAEIERITKASEFHSTARASVPDP
GUT_GENOME142688_01225108-202KRLIDRAGDMKITFHMAFDSIEDKKSALDRLVELKIDRVLTKGGLQSAMDNLDTLKELVDYSNDRIAILAGGGVTKENYMDIVNKTGVKEVHGTK
GUT_GENOME082318_0264197-246LDPEGDLDMERMMILREMAGDGTMTLHRAFDLCRDPHQALREAESIGIDTILTSGQKNHCMDGRNLLKELIRESADRVHILVGSGVTPEVINCLADEIGARYFHMSGKKIIDSGMKYRKEQVNMGMPGMSEYEIYRTDEEQVRRARKIMN
GUT_GENOME000319_0229797-209DGAIDERTLGEVIKHKGEMTLTFHRAFDASRDVFEAMDVLNGYPEVDILLTSGGAKTVLEGIETLIALKEQAEMTILPGAGITVETLPILRERLDVAMVHIGNGVRTNGQLDE
GUT_GENOME184190_02036116-214KELLNQAENLLVTFHMAFDELSVENQSKAIDWLVDHGVERILTHGGSAQNTIIENIPHLKELISYADGRIIILPGAGIRSDNVDQVFEQLNLSEAHGTR
GUT_GENOME238784_01812100-237LTPEGRLDRRSMERLVNAAGGRGLTLHRAFDVSRDAAEALADAAGLGIDTVLTSGQAADCWTGRECLGKLLASGTPLEMMAGGGVNAGVIRKLRALYPLRTFHMSGKVTLPSAMTFRREGVPMGLPGLDEFSIWQTSK
GUT_GENOME007050_00255141-244EVTFHRAFDLCANRAETMEMLIAMGVKRVLSSGGKADALAGAQNLKELNDQAQGRISLIAGAGISRFNIAQIAKITGIKEYHFSAKVLEQSMMSYKNSDVFMGL
GUT_GENOME066255_00982125-208FHRAFDCVTDINEAMQTLIEIGVDRVLTSGLKSTAIAGAETIQYLQKEYGEQIEILAGSGLNAQNVQDFIRKTDVKQIHSSCKG
GUT_GENOME232330_00307120-209LEVIMHMAFDHIAQESQQNVMQWLSDHHVKRILTHGGMLETPITDLLSELQTLVNTSPPDLTILPGGGVTVANAQRIANTLGVTEVHGSK
GUT_GENOME102034_02108111-227EAINIAHSMGKTFTFHRAFDIAADPFKILEQLISLGCDCILTSGQAPTAEKGIPLLRELVARAAGHLEIMPGAGVRPANASDIVRQTGCRYIHSSSRRPGATSSDAETVNTIVNSIK
GUT_GENOME243572_00111106-206MKDFIQLVHEYHRQAVCHRAFDLTRAGEKSLESLIRLSCDRVLSSGRAKTAEEGIKNLARWQQRYGEAIEILAGGGISASNAAKILKETGLHQIHASCKSA
GUT_GENOME277318_0062299-238DGDVDIDVMRRFMAVADGRPVTFHRAFDLCRDPRKALDDLIALGCRRVLTSGQAASALEGAALIRRLVEDADGKIIILAGGGVTAENCRDIVRLTGVEELHASAKHTVGSAMRFRRADVSMGTPDTDEYSRATTDASRVA
GUT_GENOME235119_03308122-225VERMEELIKLAGNKKKTLHRAFDVCVNPMKALEEAMEIGFDTILTSGQKETAWEGRTILKELQEKSKGRIEILAASGISAGAIKKLIPYTGIMSYHMSGKVVLN
GUT_GENOME257383_00020113-246LGGAALKYTLHRAFDLSRDPFEALDTALALGFDTLLSSGQKAKATEGIELLRAMVKYVGNRMAVMAGSGVSAKNMEELARAGIRNFHFSAKQDKSCPMIFRRPGVPMGLPLADEYTRSYADRETVAEAKAVLKS
GUT_GENOME244376_01037152-271TRRLLERSQGVSFTFHRAFDQSQNPLKALETIQSLGCARILTSGTRETAESGIPIIRQLVERAAGAITILPGGGVTASNIKRILNETGAQEIHGSASIRLPDGRQETDADIVRDFLKAIH
GUT_GENOME096431_0419086-211GAAGIVFGALTEKQTVDMEILTMVAERAKGLSITFHRAIDQADSVEVYRTLCQSPLKLDRILTSGGQSTVSEGMEPLKKMMRESRKSAIHPIIMPGSGLSATNIESIHDQLEASEYHFGSAVRMDG
GUT_GENOME284015_0084297-247LTPEGDVDLKRMKRLREAAGPMDMTFHRAFDMARDAGQALRDIESLGGISTILTSGQAPDVEKGIGNLKAIIAGADGRLDIMPGGGVNSGNLEHLEEELHAPVYHMSGKMAVSSRMIYRNPAVSMGEDSSSEYRIWRTDEEEVRKARAILG
GUT_GENOME148034_00851118-207MQVVYHMAFDDIKPECQQQALRWLANYGVVRVLTHGGDLAQPITETVSHLKDTIAQAPAGLTILPGGGVTFENASWLAEELHVSELHGSK
GUT_GENOME207298_00708178-273LRLLIDTADGAPVTFNKAIDATRDLVEAYGDLGGLGVDYVLTSGGSPTALEGAEVLRGLVSTPGGPRVIAAGHVRPANTAEVIERTGVREIHMRCS
GUT_GENOME232072_00336112-228FLKACKGKPFTFHRAFDHCKNSMRALDELIALGCTRVLTSGQMPSAEAGIPNLKAYVQHVGNRLIILPGAGVNPSNAHFILTETGATEIHGSGAITLEDGRIETQTETIKGILNDIN
GUT_GENOME282598_02028109-215KEAVARAQGMTVTFHRAFDMCADLNAAAKLLHSYHVDYILTSGGEASAHKGMAKLRELNAEPNHPIYIAAAGVNKTNIAEIANTTKIEQFHFSAKDVVKSQMVFTNP
GUT_GENOME262121_01174125-220NKALIEACGELPVTLHRAFDRTKSLEESLQVAIALGFDRILTSGGETSAPKGVEHLAQLVQQAGERIIIMAGAGITPDNATDLIQRCKVTEIHGTF
GUT_GENOME096435_02703106-229QIIQAAPNIDITFHRAFDEVEDQVNAYYTLTQYKQHVKRILTSGGKENCQAGQNNLESLVNLANQLQGPEIMPGGGLTPSNLEPIHKQVQAGQYHFGRSLRIKGDFKHSFDPLNFKKVVNVLNS
GUT_GENOME005495_00708105-204EELEELSVLAGDMDMALHRAFDVCRDPFKALEEAVALGLKTVLTSGQKNSAWEGRELLKQLVEKSGGRIEILHGAGITPEILEKLALYTEGCAFHLSGKI
GUT_GENOME208935_00698289-385QKIKKINPNAEIVFHRAIDVVPDVFKALDDLINLGVTRVLTSGQRESAIKGAKAVKKMIEYASDRIDILPGGGINTQNVGEFINETGTHSVHASARS
GUT_GENOME037499_00563142-228MAFDELAPSDALAAIDALADLGVDRVLTHGGPAGTSIEDNFDALRACFDRAADRLLVLPGGGITYENAPAVAEALGAPELHGTRVVP
GUT_GENOME244335_00643290-380APLTFHRGFDLVPDQFEGLEALVRLGYDRFLTTGGSPSIADPARLAALVDAAGERIGILASGGLRSANVSDIVEASGAKEVHMRAPGPDGG
GUT_GENOME238206_0012411-100AAPMQVTFHRAFDECTEPFAALEQIIDCGCHRILTSGCAISAEIGIPLLRQLVKKAADRIIILAGAGITPQNTAQIITQTNVTEIHGSCK
GUT_GENOME182579_01753106-213IQELVEHCGRMDITFHRAIDELADAVEGIKILSGYPQIKTVLTSGGKGNITENIPVIKEMIKNSGHLNVMAGGGMNFENIRQMIEKTGASQYHFGTAVRYNKSFSGDI
GUT_GENOME147454_02605121-209ITFHRAFDQCRNAEQALEDIIYLGCERILTSGLAPSAPAGESVLKSLVEQAQGRIAIMAGAGVNADNVRDLVKNTNVQEVHLSGKTTRP
GUT_GENOME212436_02151106-202KMLIDLAKPMEVVFHKAIDLTKDPVKEVKKLAELGIDRILTSGGENTALEGIEVLKSMIEESKGTSLKIVVAGKVSNDNIDFLSEHLNTDEFHGSTI
GUT_GENOME242246_01389117-207GLQVTFHMAFDEIPKDKQLAAVTWLAEQGVTRILTHGGPLDQPVETFYPALKELLAAADPRMTFIIGGGITAQNKNQVLSTLGCVELHGTK
GUT_GENOME000812_0073997-216LQEDGWLDEESLELFLETAEGLQVTFHMAFDAIPKARQKEAIDWLAAHGVDRILTHGGPSDSSLEENLPYLKELVDYAAGRIIILPGGGVTAENVEEVITTLGVKEATAQRSFHFKKKRS
GUT_GENOME253165_02068102-236GDIDVPAMEQMIAAAEGMDITFHRAFDLCRDPMDALETIIGLGCHRLLTSGQAPSAEAGTALLKQLVAKAGERLTIMPGGGINAGNIAGIAASTGAREFHTSASSVFQSRMEFHRNNVTMGKANTDEYARKETDP
GUT_GENOME009103_00778114-228VREIVEMAHASGLSVTFHRAFDHVVDFSNALDVLVSTGVDKVLTAGGRDGAEKGADTLFKLISRAGDRIIVMPGGGVRSSNISYLKKTTGAGEYHSSCRCTSSGGHNGASVEEIR
GUT_GENOME102095_01752118-252NNGAINVTFHRAFDLSRNPATSLEDVIRLGCNCLLTSGMAQSAVKGIPMLKQLSTQANGRITIMAGAGVNPANAADIIAATGVTAVHSTARKPIPSKMRFRRLSVSMGTPGTNEYAPLSTSPDVVAALLKLVVSH
GUT_GENOME188382_01275392-496VTRDGQIDVNAAARLRDIAEDAPLTFHRGFDQVVDQDRGLDVLMELGYDRVLTTGGDPAVAQPDALARLVERGGDDIIILVSGGLRAHNVAEVVAASGAREVHMR
GUT_GENOME208090_01495120-209EVTMHMAFDQIPKADQPSAIQWLKNHHGVTRLLTRAGTPETALDLRLKRYSELVQLADCQLDILAGGGISVANRDQFLAIPGLEQVHGTR
GUT_GENOME014975_01703241-342DNSIDYTTLKELVELAKPMSITFHKAIDYVENPLDDIPKLINLGIDRILTSGKKNTAIEGSELLNNMITLGKNNIKIVVAGKVTKDNIKDIQKLIPNSEYHG
GUT_GENOME279180_0095796-245DGTLDMDLLDEQVVRARRLRPGCSLTLHRCFDLTADPLKALEQARLLGFDRVLTSGCAPTAEKGLEMLARLRERGRELGIAIMAGGGVTPDNAARIAACADELHASARKPLESRMTFRRPDVSMGAPGADEYAYPETDPGTVHSIKQAIS
GUT_GENOME206261_0067799-245FLREDGTPDSARIRVFTEMTRSYGRESVFHRAVDVVPDWRKALDVLMEQGVTRVLTSGQAPTALAGAACIRHMREYAAGRIEILPGAGIRLHNLQEVLRLTGCRQVHMSAHSSFEDCSAQNTRGIAFSGNVLPPESQVKRVDTALVR
GUT_GENOME095596_0050395-204FLTADYHVDIARTAEMITRIHQAGLKAIFHRAFDNTIDPKQAIETLIQLRADRLLTSGQAGTAIEGADLLAKLQAEYGQEIELVAGAGVKAKNALQLLQKTKVNYIHSSC
GUT_GENOME199241_00835115-208AGGLDVTFHRAFDMCRDPESALEALIKLGCHNVLTSGFKPNALDGIENLRQLVIQADGRINIIAGCGVTPRNAEEIIGRTGVGAIHATCSKVVE
GUT_GENOME083008_00304154-304LRADGHVDMDAMNALMKAAEGMEVTFHRAFDMCCNPELALEDIIKLGCRRILTSGQAPKATMATRKLRALVEQAAGRIVVMPGCGVNEENARKILEETGATEIHASASSIHESKMNYRNPNVCMGNDGTDEFSIYESDERKISNILKALND
GUT_GENOME103369_00215121-214EAVFHRAFDECPNKEQAIQELIACGIDRILTAGGQGNADEHLDDLKRWQDTYGEQIQLQVCGSIRSHNAMSIMKQIGFDQVHSACRSFHQDASD
GUT_GENOME096559_04534112-207ADGLPVTFHRAIDEVRDQQEALRTLSKYPQVTHVLTSGGKPSALLAQDTLRELNQLAAELGTLTIIAGAGIGLEALPQFLQSTGLSEIHIGSGVRM
GUT_GENOME255618_01931128-231EAVCHRAIDVTPDPIAAAELIVSLGFNRILTSGGRQRCFEGIDTINEMWSRFGNQIEILACGSIRPNNLAEIISKTNVNQFHMALMKTICDPSMTGQTIHFNGT
GUT_GENOME255826_00113122-213EAVFHKAFDECEDLDKALQTLISCGVDRILSGGGKCTIEEGSQIIGQLYQKYKGKIELLVGGGITPSNIKDIAANAQSGQLHMTAKQTYYDE
GUT_GENOME008426_02184100-237DGKVDKKRCSELLDIWGEGKAVFHRALDVSRNLFESVEDIISIGGFERILTSGGEANVMSGIINLKEMIKRYNDKITVMPGGGINIDNIKYIKETTQAKEYHMAANKNFDSSMEYRNENVFMGGALRLPEFSVKITDE
GUT_GENOME113641_01598120-200FHKAFDCSKDELHSAHAALKKAGIDRILTSGRAKTALEGVSVLKELSSMGGIEIMAGGSVRSHNIKELFEKSGCNSFHTSA
GUT_GENOME098535_00264132-231RMLSDIIHEAGATAVFHRAFDLTPQPFDAIEALIEIVHADRVLTSGQRPTALAGAPLLAQFQKNYAAHIEILPGSGIKPENVAQLIRETGVTQVHASCKG
GUT_GENOME077415_00487101-204MDVCRRLVEAAAGMSVTFHRAFDLCRDSFKALDDIMALGCDRLLTSGCASSALAGSGLIADLRIRAAGRLTLLPGGGVNPGNAAEIMRLTGCRELHASARSSLA
GUT_GENOME096541_0197099-232LQADGSIDARLCHALVSAAGRLGVAFHRAFDATADLAQALEEVIALRCVRVLTSGGHADAWAGRDVLAGLVAQAGERIAVMPGAGITAVNAVALAAATGAAQLHASCKRARSSAMHHLNPTLAGLAPDWYETDE
GUT_GENOME044630_00982101-247PDGSFDLEFMEECMKAAGNCKVTCHRAFDMCRDPFEALEALINLGVDTLLTSGQEPSALAGANCLEALAAKAEGRIHILAGAGVSDAVIKPLYEKGIRHFHLSAKIELESAMKYRNTRVSMGLPGMSEYMIWQTDDKKVRSARMVLD
GUT_GENOME175193_01137107-207RMERLIAAAHGLPVTLHRAFDVCRDPFAALRAAKALGVSSILTSGQAAAAPQGTDLLAELVRQAGPANILVGGGVNSGNLAALAAATGAKHFHMSGKRPLE
GUT_GENOME243649_00837102-203IEKMKELVYLAFPKKVVLHRAFDYSTDGEEKIEELIKMGVNRILTSGKRPKATQGLDLIKDLQEKYGDKIEIMAGCGVNHENIKEIYEKTHIENFHLSARNV
GUT_GENOME095246_02194115-210LSMLAKHCEGLGMTLHRSFDFVADQAASLEIAIDLGFERILTSGGALTAPEGAEQIAVLVRQAGDRIAILAGAGVRASNVAELVRKTGVREVHGSF
GUT_GENOME158533_02368105-201YERCARFVELARGKQTVFHRAIDITRDPFNAVERLARLGVTRILTSGFSAGSYDGRAQIGALQKQFGDRIEIMAGGGITEQNIAALIKESGVRHVHF
GUT_GENOME134988_01381124-242LSTTFHRAIDCADGIYDALEDIASLGYDRVLTSGGCPTAWEGRETIARMQAMVANHTGSSRKPLIIMPGSGINPTNIRDLALQTGVSELHLSATKRHKSGMRVLSGIAQEETVVHSDEE
GUT_GENOME155739_00586102-198FEKMKKIAECRGDLKVTFHKAIDETPDYAEAVRSLSSQGLIDRVLTSGQQETALAGLERIRELQDISNLEVVAAGRITKENLAEVAGQLKVTSFHGK
GUT_GENOME171657_03705107-203LERLMRNCDGLGVTLHRAFDLVPDLGEAVETAVGLGFERILTSGRMPTSAQGIGDLETTFAFAAGRIRIMPGSGVNIGNADALLQRIPFREIHSSCA
GUT_GENOME284516_00107105-229DKTAQMIELVHSYHKEAVFHRAFDVCPDAYEAIETLIRTNEELGLYDEFSLSFNPKTKKDYLLKIIQFKKFLEGKDLLKDLQEKYGDKIELLPGSGVNATNAIELMEYTGINQVHSSCKSYLSDP
GUT_GENOME096032_00240108-199LIEAAKPMKITFHKAIDEMENPLEAIPKLIELGCNRILTSGKQERALEGVPLLNKMIEKANGKIIIVAAGKVTSENIKECSEKIHTDEFHGK
GUT_GENOME160593_01278126-217EAIFHRAFDQCVDPVKAMQVCIDCGIDRVLTSGQQAKAIDGLACLKNLQAQFGQAIQILVGSGVNAKNAEYIMQETGITNLHSSCKGVKSDP
GUT_GENOME199317_00589134-286PDGTLDLTRMKQLIQAAGDMCVTLHRAFDLCADPFAAMEQAKELGIHTILTSGQQNNCCQGKELLRQLVSHEEGRISIMAGGGINPQTLKELLPYIGAHAWHMSGKALFQSPMTYRKKEISMGAASLDEFQILRTSASQIRQVKEILTHHKEA
GUT_GENOME142392_01721106-200KKAKELAGILNITYHMAFDEISDKKTAIDELVELGIDRILTKGGNNSALENLDNLKELIVYAGDRITIIPGAGINSKNRDMVISYTGAKEVHGTK
GUT_GENOME039390_01073100-222DGDIDVPLMRRLIEAAKPLSVTCHRAFDVCRDPFTALEQLIELGCDRILTSGQQSDAVKGIPLIAELVKRADGRIIIMPGCGVNAGNIRKIAEETGTSEFHFSGRSSVDSGMIYRNSKVSMGG
GUT_GENOME194897_01415121-211DLVLHMAFDELSEDQQLELIPWLVEHDVTRILTHGGPLGTDTDILDNSARLKKLISATDDKIEILPGGGITVENYERIATKLGVTQVHGTK
GUT_GENOME096365_00482106-206RNKELVEIAHQYGMSTTFHRAFDRCSDLNKSLEDIIELGCDRILTSGGMKTAPEGKDILRQLIEQANNRIIIMPGGGITENNIADLAKATGLKEFHGSFRS
GUT_GENOME138800_00634106-215ELTSELARLAHEDAPDAPGHVDVTFHMAFDALPADEQLAAIDWLADQGVERILTHGGAAGTPIADNLGRLREFVERAAGRITILPGGGITWENAESVALELGVSEVHGTK
GUT_GENOME153918_0125595-227LNPDGTVNEQQCRRFREAAGNLELEFHRAFDMTADPEAALETIIALGFDRVLTSGCAATAQKGIPMLRRLQSLAAGRIKIMAGSGVNAANAREILDETGVDALHASASTIIPSNMIFRNDSAKMGDPGADEYS
GUT_GENOME096202_03848114-216GLEVVFHRAFDAVPDQIAAFRQLSGYKQITRILTSGGKPSALDAVDRMKELVELSRDSHITILAGSGLNAESLESFVKDTGVTEVHFGGAVRGPKGVEDDIAA
GUT_GENOME278672_0096494-243LTPDGEVDETRTAQLVREAEGMEVTFHRAFDMTRDPRQALEAVIRTGCRRVLTSGGRNTAQEGIETLRALAAQAAGRIEVMAGSGVGPSNARLLAATGVDALHFSARRERESGMRFRNPQVSMGGCAGVPEYTLPDADESVVRQILAELG
GUT_GENOME018737_00158103-207ITDDGQIDVEACKALKDAAGDAPLTFHRAFDETRNYEESLDTLMSLGFARVLTTGGRDATAQAPVLKHLASRAGNDLTIVASGGLRSHNVAAFLKVAHVRAVHMR
GUT_GENOME234138_00116108-207RLVARARGKGSERRLSVTFHRAFDECAEPFKALEQIIGLGCDRLLTSGQKPSAYEGMELIAELVKRADGRIIIMPGAGITPANLDIIENETGAVEFHGTR
GUT_GENOME256503_00707122-205FHRAFDAAAGDPLEMAEELAELGVTRLLSSGRAADALTGAPLLRQVAERFAGRVQLLPGGGVRESNAAEIVRLTGCTQLHFSCH
GUT_GENOME121415_00530113-225KLLEAAGGLPCTFHRAFDVAPRRAALLSSIIEAGFVRLLTSGGAATALEGVEELHELVLLAKDRIKIQCGSGVMADNILSLAQKTGARCFHGSFRRPLTDGSLATSVEEVARA
GUT_GENOME143709_00698125-211GLPMTFHRAVDETRDIYEALDVLLGYPQITSVLTSGGQPSALQATDVIAEMVRRAEGSSLAILAGSGLTVESVGGFIQETGVRHVHF
GUT_GENOME102247_0127776-248MVSEMEALKAAGTSGFVIGCLTPEGELDEIAMAPLLEAAKGFGLTLHRCIDVSRNLGETYQKAAGLGFDTVLTSGGAGKCLDGMAEIGKLLTLQEETGGLQVLIGAGVNARVIETFREAFPQAGAFHMSGKKEVESGMVFRRKGVPMGAPGLDEWHIPVTDQDEIRQAKEALY
GUT_GENOME239375_00756136-242EVTFHRAFDRCRNPMEAIEQIAAMGCVRILTSGQQPTAEEGIPLLRQLVEKAKELSVKYGHPFNILCGSGVTEVSAPIILQATGATEIHGSLRTGRSTDVEKVKKVM
GUT_GENOME029692_00738189-294QTRAMIELAHRYGREFVFHRAFDCVADFDTAISQLIDWKADRVLTSGGEAKAEHGIETLAHLQRKYGGHIEILAGSGVNASNAQLLLEQTGVHQLHSSCKDWLTDP
GUT_GENOME032999_00900100-196LNYIQQVIANKGHLQMTLNRSFDRTADLEEALLAVETLPIDRMLTSGHEQSVITGASTYRMIQQRMAGKMIVMAGAGLNLKNVDAFVKENHPAEIHF
GUT_GENOME238419_00249124-217MKFTFHRAIDACGNPLDAMKTLIDLGFDKVLTSGCKPTAYDGVDKIREMQTLFGERINIMAGGGIDEKNVERIITATGVKNVHASLTSYTADSH
GUT_GENOME036517_00488110-209RRLINAAGGLSVTFHRAFDMTSDPVRALEEIIALGCDRILTSGCAPTAEEGTEMLRRLNETAAGRIIILGGCGVSPSNAATIIASTGLTELHASARSSVE
GUT_GENOME096533_01865114-209GLPVTFHRAFDEAVDLAAAYEVLAKYPQITDLLTSGGAAIAPEGAELLAGLIRRSASGGPAVLCGSGLNADNLPDFLACTGAERIHLGSAVREGGD
GUT_GENOME200288_04791116-210AQGLGVTFHRAIDVSTSLTEALKAINTLGNVERVLTSGGKQTAPEAILELKQLQQLGQTLNIAVMAGSGITIERIQKLVSQTGITEIHMGSGVRY
GUT_GENOME244968_00665101-250LTSDGDIDEYACAMILETIHKVSESGRKVTTTFHRAFDLCREPFDALEKIISLGFDRILTSGQAASAEEGKELLKNLHARADDRIIIMPGAGVTPQNILGIIKDTGVSEIHASAKQHVSSRMRFRREGIGMGTGDEDEYSRYTTSSSIVK
GUT_GENOME082500_0097599-248LSPDGSLCTEQMKRFREHARDMSVTLHRAFDMCRDPFAALEEAISLDIQTILTSGQAPDCLHGVDLLNKLHQAADGRIHLLAGAGVSAKTVPALLEKTPLTQFHMSGKTIRNSEMVYRNPEVFMGIPGMSEYEIWQTDPDAVAAVRTMLD
GUT_GENOME128204_0085097-284LQADGSLNEAQMRGMIDATEGRCGITLHRAFDVCVDPIACYEKAADLGIDTILSSGQEGDAPAGSALLRKLVQMSEENRYASGEARYSETSSALEDAAAKHSPKTDRILNGTPTILVGGGVKADNIAEIAGMTGAHAFHMSGKKILSSGMRYRNEHVHMGLDGISEFELYRTDEEEIRRAVEILKSLR
GUT_GENOME051100_01548235-388DGEVDMEANRRLVELAKGGDELKPMSVTFHRAFDRAANPQKALEDIISLGCDRILTSGQQPKAVEGVALLSELKQAAAGRIILLAGCGVNEDNIRTIFDATGIHEYHFSARVNVPSKMKQLNTNVYMGAEGADESNSPVTSAERVKQTIANLLG
GUT_GENOME000020_00637111-207LKQLADSVPVVCHMAFDEIHQHGQIKALHQLINLKFTRLLTHGGPSNTDIFDNLNHLGKLVVNSGGNIEIMPGGGLNKDNLHELLEAFPFQEVHGTK
GUT_GENOME096560_00567144-240QQIEAISTDIRITFHRAIDYSNDLQTSYQTLATYPNLVERVLTSGGAPDCIQGKDQILRLIQLADQLDGPVIMPGSGLQLSNLLSFHDTVQAKEYHI
GUT_GENOME191312_00061122-210GKPVTLHRCFDLCKDPFDALRTAEELGIARILTSGQANTAAAGREQLAALQREAKTVRLMAGAGVSAENIPTLYRETGILSYHMSGKET
GUT_GENOME190924_0190987-245GADAVVIGALKPDGTLDVEAMKCMIDAAGDMSITLHRAFDVCRDPFEALETAKELGVNTILTSGQRKSAKEGVEILKQLALKADGKVDIMAGAGIDADAIRYLAPKTGITTFHMSGKMTLDSRMVYRKEGINMGLPSMSEFEIWRTDEKKVEEAREVLA
GUT_GENOME225818_01435107-223RLKELIDLAGDMHLTLHRAFDVCRNPFNALEDAINLGFHTILTSGQKATAYEGRTLLKELIQKADGRIDIMPGAGVSSKNLELILQETGAKSVHLSAKTLRPSESLYKNPEVYMGLP
GUT_GENOME278443_00237116-209NKKAVFHKAIDVCENYTLSIEKLIECHVDRVLTSAQKRTVLEAKEEIKYIIDTYGEKIEILPGGSINIDNVSEIVNYTHTNQIHSSCKDFHQDH
GUT_GENOME096462_03595123-217FHRAFDCVKDPYQSMEVLIDLGVKRVLTSGLKNTAMEGIDLLKKLQDQYGDKIEILAGGGVSATNSSILMNKTGIMQYHSSCKSWRKDATTISNV
GUT_GENOME238303_00054104-206IDTVKRLIAAGKGLRITFHRAFDCCNAPFNALEEIIALGCDRILTSGQKHNVTEGKEMLRQLVEKANGRIIIMPGSGVNIDNIAELEAYTKAKEFHSTAQNSE
GUT_GENOME279790_00610112-212MDACRRMATRAKGMNLTFHRAFDLCADADKALMQLMDLGCNRVLTSGQSPTALAGVEKLRHLNELAGEKITILAGSGVNPGNAKEILDKSGVHELHASARS
GUT_GENOME085398_00737101-201FEFSKRFCEKAGDMDVVFHRAFEVVKDKMSAARKLSEIGVRRILTKGGNSLVDGQDVIRELLTLEKPEIISGGVRENTIELIKDLGLKFVHVSSGREVVDT
GUT_GENOME001951_0056896-234LTSEGDVDVYAMKRLVKAAGSMKITLHRAFDMCRDPFDALNKAIGTGCSSILTSGQASAALKGVSLLAELNKAAQGKTEIMAGGGINSGNIRKIHSLTGITVFHASCSRIEKSLMDHINPNVNMGLPQFSEYEKILTDE
GUT_GENOME218279_01548112-248TRAAGGCGITLHRAFDVCKDPFAALQKAAELGVDTILTSGQEAGCMQGRELIRQLVRAIQDSRSDGRETAALPGGVTILVGAGVVSSNIKEIVMTTGAHAFHMSGKKLMDSGMRYRNERVHMGIEGLSEFELYRTDE
GUT_GENOME244375_00473117-215RRLIRLSRTLRLSVTFHRAIDVAKDPLQAFRDILPLRVDRVLSSGGAASAYEGRKTLARMQEITRAEHGPVLMPGAGVTAANVGDILRATGASEIHGSR
GUT_GENOME183568_02213177-261FHRAFDVVTDWKAAMNVLCRLGVSRILTSGRHATALQGTATLKEMRQFAAGRIQILPGAGIHAMNAEKIIRSTGCNQIHAGLRTV
GUT_GENOME128581_00978118-217RQLVESAGKYNLKVTFHRAADRCISAPEQIVEDIIETGAYRILTSGQKPTAEEGKDNIRRMAKAAAGRIKIMCGSGVTEKNAALLKECGVDAEHATAKSF
GUT_GENOME062839_00578113-202AKPLPCTFHRAFDRAKDLEKSLEKVIDCGFTTILTSGQKPNVSEGKENLKKLVDLSSGRIEILVGGGLRSSNIEEIRELTKAGYFHSSAI
GUT_GENOME181220_01245565-670TEAVGRLMAEAQGLRVTFHRAFDECAEPFRALEDIISLGCDRLLTAGHASNVNDGERTLKELNIKAGDRIIILAGSGVRPGNIARLEASTGCKEFHSSSHGPDGRT
GUT_GENOME096555_01098116-218SAGKEAVFHRAFDVTPNPKAALEMLVDCGVDRILTSGQKPSAHAGATLIAQLQKQAAGRIQILAGAGINAQNVCELINKTGISQVHASCRDYEKDPTTARNEV
GUT_GENOME231374_0115298-214DGRVPEGKLRQLVEAAGPLGVTFHRAIDLSSDWRLDLETIVAAGCERILSSGHAPTALAGLETLQAMHQALAGRASLMPGAGVNADNVRTIIEFTGVNEVHMSGMGWRGGMVPHGVN
GUT_GENOME236865_01059102-240LTPEAMVDKKRTRLLVEAAEGMQVCFHRAIDMTQNIMQSMEDIIACGCHRVLTSGGFATAAEGIEVIKQMQQSFGKSIDIMVGSAVSSSNVKRFRDIGIRSFHLSGKADCESRMSFRKAGISMGATDPAREYVLSQTDP
GUT_GENOME166775_00904108-209MKKLITTAHAGNLEVVMHMAFDELTASKQKEALEWLSKNKVARILTHGGSLDRSITDCLENLKKLNKQAKGKIEILPGGGITDKNVNSVIEKVGVTQAHGTK
GUT_GENOME094927_00615116-202GMKVTLHRAFDVCCSPMEALKQCIDLGVDTILTSGQKASALEGKELLARLAEEGEGRIEIMAGAGIGPGAIRELASSTKVRAYHMSG
GUT_GENOME041453_00276135-249PVTLALHRAFDVCRDPFAALETACQLGFDTILTSGQAASAAAGASLIAALIHRAAGRIEILAGAGITPDNLPALVKATGAPAYHLSGKKELDSRMQFRREGVPMGLPGFSEFVVW
GUT_GENOME118662_00303120-208EVVFHMAFDELAHTDQLEAIEWLSDQGVARILTRAGSPGDPLEQLFAHYHRLLDAAKGKIQILPGGGVTLDNRQLFLEQLGVCQLHGTR
GUT_GENOME237444_01611110-210MAEKARSLGLETTFHRAFDSCREPLEALETIISLGYDRILTSGCRPTAMEGAPLLAELVLRAAGRIIIMPGSGVSPENIDRLRELTGATEFHGTRLCVSKD
GUT_GENOME237421_0123195-204DSNLDVDFAKTKEIVNLAEGMEVTFHRAFDICNRPLQNLEKVIECGCTRILTSGCKATAHQGMDMLKNLVAQANGRIKIMAGSGINAGNAIEIINTTGVNEVHASCKHVV
GUT_GENOME243458_0053598-235LSPSGTLDRLALESLCEAAKDAELTFHRGIDVLPEPWQYVPQIREMGFKRLLTSGGASSAVEGTEVLSLMQQASGDTLTIVAAGGVRAEHIPFIRTKGGLQAFHGGFRGVRDRAESSSARRAPDGSYLDAWWTESYLD
GUT_GENOME096443_00600117-207GLDITFHMAFDHLKPEYQNPAITWLANHGVKRILTHGGPSGTSIIENLPRLKELIQFASNKLIILPGGGITADNLELLVRELPLTEVHGTK
GUT_GENOME046070_01890103-203MEACLPMMEAAKGLPVTFHRAFDMVADKRQGLEDLISLGCARVLTSAGAPVVPQGMEGLKSLIDQAADRIIVMPGGGVTAKNAADIVRTLHNREMHGTFRA
GUT_GENOME159608_01297118-206LSITFHRAFDDCKDQSEALEHLIALGYDRILTSGGADNVCDGLDSLAALVRQADGRIIILPGGGVNASNAAMVTNRCGAIEIHGSCRKD
GUT_GENOME208002_0087499-206AGALTPDGRIDEAAARVFREAAGECPLTFHRAFDEVGDLMAAARWLAGAGWDRVLTTGGDPARADVAQLAALQREVGDRLVILASGGLRSHNVAEVVRATGVTEVHMR
GUT_GENOME135384_00611141-241KRTKEMIELIHEYGREAVFHRAFDCIDNQDSAAEKLIRLGADRILTSGGAVNVWDGRKQLKHLQNQYGKDITILAGSGVKDTNVRALIEYTGITQVHSSCG
GUT_GENOME185250_01346108-207EKMIELCDKFGAESVFHRAFDCSKEPDYNIQRLIGLGCTRILTSGLGENAIRGSKLLKIFQEKYGKHIEILAGAGIDSKNVVRLLDKTSVNQIHGTFKEY
GUT_GENOME275846_0160296-214LTPDNRIDADICGTLIERAKEAGLSVTFHRAFDCVADPLEALETVISMGADRLLTSGLARTAMEGADMLHRLNLHARGRISLIAAAGVSSANAAQVLRLSSCHEIHASARKPVRSLAEC
GUT_GENOME020248_00816364-503LTPDGELDLAQMRRMMACAGQMEVTLHRAFDMTRDPFRALEDAVSLGCRTILSSGQAANAALGAPLLAKLNGQAAGRIDLMAGCGVKRTNIAGIAAQTGITTFHTTGRKGSVDSGMRYRKEGVSMGLPSLSEYELWLTDE
GUT_GENOME006551_03268107-213ERTRQMVQAAHGKEAIFHKAYDSTKDLEASLKTLIRCGITRVLTSGGAVYPNILEGCKELGRLQDLYGDQIQILPGGGVREYNARQVLKLAHTGQIHLTAKNTLIDE
GUT_GENOME096287_0122898-207LTPAGAIDTAALERLVEAADGVEVTFHRAFDVLDDVAAGVGVLAAFGVTRVLTSGGAQRAGDGLARLRETVEAADGRVQVMAGGGVRVEDLVRILGTGVDAVHLSASRVV
GUT_GENOME183871_00370119-211AGERPLTFHRAFDTVQNMDRSLDQLICAGFDRVLTTGGDASVVNVENIRHLTDRAAGEMIILVSGGLRSHNVASVIHRTGATEVHMRAPAQQP
GUT_GENOME151010_02391122-212NEGRKATFHRAFDCTRDPYEAIETLIDLGFDRLLTSGQEATAEEGIHLLADLQKKYGNQIEIIAGCGVNENNAQKIIDKTGIKQLHSTCRG
GUT_GENOME284343_0017997-224LTPDGDIDTRQLRRWVRAAGGRDVTFHRAFDLCRCPEEALEAIIDSGCSRLLTSGQAATAEAGIPQLRRLVEQSVGRLSVMPGCGVGAGNAARILAATGAREIHASARRPVGSLMKFRHSGVSMGVPT
GUT_GENOME000556_0123798-203LKEDGTIDIELCKKFMDIIGEKEAVFHRAFDVVKDPFEALDTLVDLGVKRILTKGQKNTIEDGAELLRDLIRYSKEKIEILPGGVRPHNVKWMIDDMGFNQLHVAS
GUT_GENOME155724_0400087-245GADGAVIGILRPDGSLDQERMRLLMEAAAGMKVTLHRAFDMCRDPFAALETAVELGIDTVLTSGQKNSCMEGEELLAELVKKSRDRICILAAGGVDEAAVAELSAKAGITRFHMSGKVIRNSGMLYRTDGVHMGLPGLSEYEVLLTDARKVRAAKQALM
GUT_GENOME242960_0020294-214INEDGSLDIELLREVVDLAKPYPIALHRAFDYSKDGAQVIDQLIYMGIIRILTSGKMPTASEGIELLSHIQEKYGDKIEIVAGSGVDASNIREIYENTKILNFHMSGRVKKENKITYKSDL
GUT_GENOME070980_01130117-214RRFIDAAQGRNATFHRAFDVCRDPFKALEDVISLGFDRILTSGQSSDALRGGDMIRRLHDKAAGRIRIMAGAGVKPENAAEILALSRCDDLHASARSL
GUT_GENOME097991_00847165-290LTPEGDVDREAMQKLLKVSEGLSVTFHRAFDYVRNPESVLEELVEMGVDRVLTSGQQPTALQGADLLRKLVKRAANRIIVMPGCGVNETNIAELATRTGAEEFHFSARENRQSRMLLRNPALSMGG
GUT_GENOME196218_00359106-208FEAMENLAQAAFGMQLELHMAFDSLDFEEQKKAIDWAVSSGFDRILTHGGPLEVPIGETVPHLKELISYADHRIEILPGGGITFENCEQIAEELGVKSVHGTK
GUT_GENOME096279_02523105-200RMKKIMAAAGPLAVTFHRAFDLCADPRQAWKTLGELGVKRILTSGQQSSAEKGISLITELIAAGDTPIIMAGAGVRAANLPLFLQAGVKEVHSSAG
GUT_GENOME157365_00886118-272LTPEGNVDMERSRKLIEAVKPLPVTFHRAFDMTCDPYKALDDLIALGVDRILTSGQEATVVEGLDLIEELIAKADDRIIIMPGCGISERNFEKIKGRLKAKEYHVFLPCEEQSRMSFHPGHIYMGGLLRQSEFMVSHTSCDRVSNIMGMVGMAKV
GUT_GENOME237422_00028105-210MDACRAMVAVAKHHGMSVTFHRAIDRCCNILAALEDVISVGADRVLSSGGKNTSFEGKEILAAMNEAAQGRISIMPGGGVNAGNIKEILSVCGAREIHFSGSETVQ
GUT_GENOME183349_01921108-203ELISLAREYDLSVTFHRAFDCCCNLQKALEDVIGLGCDRILTSGGCSTAYQGMENIKQLVLLSDERIKIMAGAGITSNNIEKLIRKTGIKEIHGTF
GUT_GENOME147522_02456114-220ASGMGITFHRAIDHCSAPFDALEFLMTNKVERVLTSGLAKRAEDGINTLKEMVAFTQGRLSIMPGAGVSPHNAQSIVSATGVSEIHLSGKTTRNSLMGFRNENATMG
GUT_GENOME022512_01456105-202ERCAELVELARPLACTFHRAFDMARDQAKALEDIISLGFDRILTSGGEPSALEGAENLQKIMEQAGDRIIIMPGAGIKDTNIERINRLLHAKEYHMSR