UHGP-MC 1454


Information


Number of sequences (UHGP-50): 154
Average sequence length: 123±9 aa
Average transmembrane regions: 0.02
Low complexity (%): 9.14
Coiled coils (%): 0
Disordered domains (%): 2.34

Pfam dominant architecture: PF03928
Pfam % dominant architecture: 9351
Pfam overlap: 0.92
Pfam overlap type: equivalent

Downloads

Seeds: MC1454.fasta
Seeds (0.60 cdhit): MC1454_cdhit.fasta
MSA: MC1454_msa.fasta
HMM model: MC1454.hmm

Sequence list (filtered at 60% identity)

Protein Range AA
GUT_GENOME151500_01688 71-185 AIEEAIAQQVNEAISIMDSHGNAVCYYRMEGAMLIASELAPTKAYTAVAMKRETSALQGAIQPGRPLYQLGTMVQCPVTAIGGGFPIVKNGVFLGGLGISGGTTAQDMAVGHGAL
GUT_GENOME177161_01990 26-154 ASLTLTDAALLAHCAIDAANNANVPIVFSLVDTHGLQRYFVSQDNALLISHTLATQKAWTAVALRMPTHRLAADVLPGASLYGLSHTQGLCCFGGGLPCWSAGVLLGGIGISGGSVEEDIAIATQTLAR
GUT_GENOME123231_00627 20-128 LAAEAEGDGQAPLCIAVCDAVGDLLHYTRMAGAKRRGTPIALAKAYTAAVLETPTSALHARLVREHLSLADFCDGPQHRLTSLAGGVPLCLEGRTVGGVGVSGRKPAED
GUT_GENOME193693_03213 9-157 TTLVERCFEEEEKFQFTRFSQTDAKKLGDLLHKCSLRYDKPVAIEIRMNHLVVYRFFPDGTNKNNELWLAAKANTVDMLEISSLHLFAEIQAGGDKLAERRMDEEKFTIYGGGFPLTIRNCGVVGSICVSGLEHTKDHQVIIDALNEYW
GUT_GENOME229325_00254 169-296 ETLNLEVANQVIEMVIKKAKEKGIKVVVALVKEGGRPISMQAMDEAFVISYELAIKKAFSAVALQMGTHEIAELVKQGADFEGLVSMLDEKIITLGGGYPIKVNGKVIGAIGVSGGSASQDIELAQYG
GUT_GENOME142592_02321 14-139 QSLALKVLEAVTNKADELGIRINAAIVDDGGNLKAFIRMDEAALLSSGIAQNKAYTAAAFGKSTEEWYPMIKDEPSLLTGIVHTEKLVVFGGGIPLIFNGKIVGGVGVSGGTADEDVQCATAGVQV
GUT_GENOME221386_00603 34-167 TLDISLDFAEKLMKHVEEEAKRMGMRVVTAVSDSHANPVAVRCMDEAYVGSFDVALNKTYTAVAFKMSTEELGTLSQPGGPLYGIQHTNQGRIVIFGGGIPLRIGNRLIGAFGVSGGSAQQDTYLAHYAADAFL
GUT_GENOME019450_01791 101-229 LSDDLCDRLCAAAREKSQELGVEISFAIADPSGLPRVFRRFSDALVLSTTLVPGKAYTSAVTQCTTKDVAKYAAEGQPLMAIQTNDPRITLVSGGYPLFVNGKIVGAIGVGGGTEAQDCEIAEYVVSIF
GUT_GENOME096541_00261 37-134 RQGWAIAVAVVDAAGELVAFEKHDAAIGITPAVAIAKARTAALLQAPSKAFEDFVNGGRPSFLSTPGATPLEGGVPLVDAGRVVGAVGISGAHGGNDS
GUT_GENOME243195_00203 28-154 EKNVSMAMAIAIIQGTIEQCTKDGYKVSVTVVDKAGNVAAQVRGDGTNPHTMEFSRLKAYTSRTRGQTSLEFMKLTEKPETAYLRQIPGVVGVGGAAPIKVGNEVIGAVGASGAPGGEKDEVCVLAG
GUT_GENOME218038_01770 214-344 EHPFTLDTLRAMALRAQSHAEKIGVPCVFAAVDEGGNLVMLQRMPDSLLISIKVAQAKAYSALSCKMTTEELGRLSQPGQPLYAIDSTEPGKVVLFGGGFPFVVGGKIVGGIGVSGGTPEQDMEIARYAMA
GUT_GENOME014890_00239 199-327 MEQRVLSLETAKKIIEKVEEESKNRGKKAVICVCNEQGNPIAVHVMDGAFLISFDVAVKKAYTAVALKMPTLKLNDLVKSGQTFYGLQNLDKVMTIGGGVPLYRNGILVGGLGVSGGTGEEDDSLARFG
GUT_GENOME096273_01572 511-636 SFAQPTVTYDLAARAVALTIEEGRKAGVRTCATVVDPALQMVAYGRTDGMTPHSVETSRRKAQTAASTRRPSALIVPELAQKLEHGTGGLLTSIAGGVPIAFGGVVVGGLGVAGGKPAEDAVIADA
GUT_GENOME143124_03518 8-138 ECREFMRAAQEKALEMGVPVSIAVVSAEGHLIALERMDDAGFITPDTARAKAYTVAAFRSMSPRFQDGLLIQQWFKERNPQMLINASVFTGGQIVVSGGCAPVFKGDEMVGAYGISGATSDQDEVIGRYAR
GUT_GENOME096283_00250 194-325 LRLSLELACEIARAGEEKAIALGVPVVISIVDPAGHAILLHRMADSLLASLELSLNKAYTSVALKMASHELVPVIQPGTELYGIQMTNQKIVTFGGGFPLYNDEQIIGGIGVSGGTVEEDMLIAQAAIEVFR
GUT_GENOME134080_01431 186-308 LNLKSADALIDKVIEKAKETGVLAVCAVCDAGGNLIALKRDDDAFIASVKIAQDKAYTAVSLKMPTYKLENLTKPGESLYGIQHQDNRIVVFGGGVPLYQNGRIVGGFGVSGGTLEQDTFLGD
GUT_GENOME143092_03908 28-149 ILSADDAQRIIARAETQAQSLKAQVCIAVLDRAGQLLAFKRMDNAPPGCIDSSMQKGRAAALYRTSTDKYMARANGTEPAIATLPNMVPLGGGATVTVNDEVTGAIGVSGTANPAEIAIADA
GUT_GENOME147130_01209 33-161 PVLTLSVAQQVLTAAQDKAQTSGWPCVIAVVDSAGSPIALVRMDNAAVPAGVELAPGKARTAALFRRPSGVLEDAVNGSRPAAITAQGFVLMRGGLPIVFNGHTVGAIGVSADTPQHDEEIAQAGLAAF
GUT_GENOME096541_00251 42-172 WQTAQALAAEAVRVCAARGFSVTATVVDTSGEQQAVIKGDQAPLQSLSVSYRKAYTAYSYGMAFDKDSTSALVAAKLTGPADGSLATVPQVIFIAGGVTLRTADRTVLGGIGVSGASGGDKDEACAQAAVD
GUT_GENOME141318_01766 20-164 FITTLSAKTPEPIQQYNLSLKLAEHLADHTITACHAQQKNIAVAVLDRGGNILVLKRHESVGPHNTLAAQRKAYTALSTKTRTSVLNRNAQNNPEASNLNTLNELLLLGGGIPVTYKNEVIGAVGVAGGGGAAQDEACAEQGVQK
GUT_GENOME213497_00219 50-181 TENITLDMANKLIERVIKRAAEIGVSAVAAVVNSGARPVSVQCSDGSYIASFDIALGKAYTSVSLKMRTSQLKELAQPSAPLYGIQHTNGGKIVIFGGGIPLKFNDKIIGGFGVSGGSEEQDTYLGEYAAEV
GUT_GENOME141426_02611 33-132 AAVAVVDATGNLRAFERTDDAPFLTVDVAIDKAWSSASFGYPTHVWNDYVSNDPKVAPLAYRPRMVAVGGGYPILEDGKLIGGIGISGGNYQQDQDACVE
GUT_GENOME224712_02184 21-148 GINLDLAEQITKAACEKAKEVGVPISAAVVDAGGNLIVHLRMDGALLISVDAAFSKAYTSASLESPTGSLHERTVPEGELYGLHTMFGGRYCVFGGGLPLFQAGRLVGAVGISGGTVEEDTLIAEAAV
GUT_GENOME080367_02398 11-134 IAQQLLDIGISKAKELQSPSNIAIADAYGYLLAHVRMDQAQLPSIEHSINKAYTSALFQKPTQELKEPSEPNGELYGLNNTLNQRVIVFAGGVPLFLDEHVVGAVGVSGGTAEQDQSIAQAIAD
GUT_GENOME232291_00154 15-120 LLAQEQHQAVCISIVDNAGLLRSFLRMDGAVAGAIDVSIKKARTAALFGSDSASLGQEARPGGKIYSLESTNGGLISFGGGVVLRDDQGNILGAVGVAGATVDADQ
GUT_GENOME222837_00980 193-321 LNLETAKKIAEAAEKKGKEIGVPIVVSVLDLGGNLLLLHRMEDSLLASIDISINKAYTALALKMSTNEVSNIVREDSDLYGIQWTNKGRIVPFGGGYPIKIDGKIVGALGISGGTVEEDIKIASYALRI
GUT_GENOME075946_01111 203-327 QEIFRACEAQAVKMSLPVSMALVEKNGKLVSFYQMPGALLVSESMAQKKAYSAVAMKMHTHELAKLVQPDGALYQLETLTDGQVVTFGGGFVIKDVSGNVIGGLGISGGAVEEDMAIGQKGLEKL
GUT_GENOME095752_05367 36-149 RFLQETESDARTRGLAVAVVGPEGELIAFGAHARCPPLPRQLAQRKAWSALRFRRPTATLAQEVSAGALRLAVFNDPRLLAMPGGEPVIVDGLAIGGVGISGLPPELDAGLAAR
GUT_GENOME096069_01082 187-309 QRARELINEGRRKAQEIGVPMAMAVCDSYGFLIAFERMEGVLQVSLGLAPKKASTAIKLKMSTEALSRLVQPGKDLYGLQNDPELVVFGGGLLLKDGAEIVGAVGVSGGSVEQDVSVAEAMVR
GUT_GENOME207678_03297 202-323 RAKKIVAAAEQKAITLGIPMVIAVVDCGGNLVLQERMDDSLLASISLALDKAYTALSLKMPTDEVAKAVQPGMPIYGLEGNLNGRFVVFGGGFPLVEQGVVVGAIGVSGGTVEEDMAVAKAG
GUT_GENOME202893_00044 33-162 ITQAQADTVIKGALAKAKEQGVPMNIAVVDAGGNLKAFTRMDGAFLGSIDISIGKARTARLFNMPTSALGAASQPGKELYGIEVTNNGLVIFGGGELLKNKDGVIVGAVGVSGGSVAEDTNVAKAGVAAF
GUT_GENOME103718_02110 7-133 LAESKTVLDAAEEKAEELGVPLNLAVTNSEGNLLGFRRMDGAKLVASNIARNKAYTAAAVKKPTHELKEGAEPGGDVYGLHTTDEGRVVVFGGGFPVEHDGVVVGAVGASGGDVSEDMEASQAGLEA
GUT_GENOME037207_00086 23-143 DALRWAQIVVARVQADQLKPVALRVVLDDQIILQYLMPGRKDAGWVARKVYTVRQTHHSSLYTFLHREEAPYRDWQTDDRYAICGGGFPLVIDGELRGAFAISGLVHDQDHALLVASLQQL
GUT_GENOME100276_02798 8-133 ETAKVLIEAAEAEAESMGLRMVITVANPEGNLVAQHRMDDAWLASVNISRNKAYTAAALQMPTHELAEGSQPGESLWGLQTTDENRLVVFGGGYPLEVDGELVGTVGVSGGEVSEDMAVASAAVER
GUT_GENOME081289_00885 186-308 ALAKEIALAVERRAEELGKRVVIAVLDSGANLMLLHSMDDAYIASLQIAQDKAYTAVSLKMPTHVALSESRGGTLDGLSATDNNRLMLLGGGEPLIINGGVAGGLGVSGGTAEEDIAFARFGA
GUT_GENOME231320_01687 39-159 LVTEALSICHGMNRQGVAAVVDRGGNLVALMRDDNVGPHNTQAAWRKAFTALSTKTPTKQLAVKAQSSPDSVNLNTVNELLLLGGGVPVKYRGQVIGAVGMAGTGGAESDDQCGADAIKKV
GUT_GENOME152359_00328 197-327 KKKKLTLEEAKKIVLAGEKKAKELNLDFVLTVVNNEGNLILEEKMDNALLASVEIAKKKAYTAAALKIETSVLAELVQPGGSLYGLQADSKYVVFGGGCLLKRDGEIVGAVGVSGGTVEEDMTVAKACVEA
GUT_GENOME212392_01576 20-146 TKLSLKTAKEIIERAESKAEELNIAVTITILDDGGNLIAQHRMDNSLLISIDASFSKAYTSVAMKIPTEEVYNIVQPGSEFYGLENISPGRICVFGGGLPIINNGKIIGAIGVSGGTSQQDVLIAKS
GUT_GENOME146009_03251 36-156 HLLLDKAEQIITGHKIGGVVVVVDAFGQTIAFDRLDGATLANSELAPKKAHAAAAFGAPTASFQKKIAEGNVGVLGNPVVVPLPGGQPVKVNGVVIGAIGVSTPDGNVDEEAATGALAALN
GUT_GENOME011229_00479 195-321 LTCRRATAICEAVLQRAREQGLRLIAAVCDTGGNLVSLQRDDDAIFASIDIAVNKAFTSASLKMTTEEVGRLAQPGAPLYGIQQTNGGRIVIFGGGVPLLKDGRVVGALGVSGGTLEQDTAMGNFGA
GUT_GENOME096857_00085 42-171 IESVVDQIVKACETEANRIGVPVVVAIVDASGSLVYQRRMEQSLGVSIDLAPNKAYTAIAFRCSTAKLGERIQLGSSLYGIETMVQRPIVLFGGGEPLMCGDYLIGGLGISGGTVDEDMLIVKAGLRAFQ
GUT_GENOME155719_00871 47-174 MTLGTAKMLIEQGEKEAEAIGVPMVISVVDEGGNLTALHRMDGSLLASLCVSQSKAFTALALRAPTGEAAKTILPGQPLYGLQNTHPGQFCLFGGGFPLMSCGCCIGAVGVSGGTTEQDTAVGERIVQ
GUT_GENOME145822_05702 64-177 KKATEINVAVVFSVVDRGGNTLLIQRMDEAFVSSCDISLNKAWSACSLKQGTHEITPAVQPGQSLYGLQLTNQQRIIIFGGGLPVIFNEQVIGAVGVSGGTVEQDQLLAQCALD
GUT_GENOME061749_00950 8-129 SQQMASAIIAAGQEEAQKNNWSVSIAVADDGGHLLALSRMDDCAPIAAYISQEKARTAALGRRETKGYEEMVNNGRTAFVTAPLLTSLEGGVPVVVDGQIIGAVGVSGLTGAQDAQVAKAAG
GUT_GENOME095287_03138 9-135 AKTIVDTALATARGHALKPLAVVVYDARGSLKCVQAEDGTSLRRVEIASGKANGALALGLGSRAIAARAEAQPQFVAAVSHLVGPAALVPVAGGVLIRDGDRLVGAVGVSGDTSDNDEICAMAGIQA
GUT_GENOME141054_02106 12-133 KQMIEYLVNRALEDGRDPLTIAIYDDARQLVSLTMMDGSDKVNKDLACKKAYTSSLMRKTTLKWQEFIQKQGVGLDVFADPQMTYIPGGAPIINSDGDTLGAVGISGRTMKEDQELADAAVE
GUT_GENOME009001_00951 297-426 KDVTLQQAKLLSEGVRRFAAEKGMKIVVAVANSQGRPVSVEVMDGAFLVSYEVAAKKAYTSVAVKMSTAKLQEEVEKGNSLYGLQTVDQLIYLGGGEPLVLNGQIVGGVGVSGGTAAEDTDLGAFAARLF
GUT_GENOME179523_01248 7-132 LQHAQRCIAAAIAEAQRQRLAVCVAVVDAHANLLAFVRMDDSVPGAIDLAQRKARSAALFRLPSGELGRLAGPGQPLWSIEQSNGGLACFAGGMPLGDAGGSCLGGVGVSGASAAQDQSIAEAAVA
GUT_GENOME104639_00341 180-306 MSLPNIKTFMQALEDEAARRGLQLVIAVCNKEGRPIAVHVMDDAFVASYDIAVNKAFTAVSLKMSTKELAKLCVPGGPLYGLQHTNQGKIVIFGGGVTLRDREGNILGGLGVSGSTAEIDTMMGDVG
GUT_GENOME095246_04218 56-162 RKNGWTMVITVLEPNGQPVLSEKMDGTQYGSTEVALGKAQTAANFRKPSSYFQDAVKAGTLNSIFTGAMALEGGELLLVDSKIVGAIGVSGGTAVQDGQVARIGVAA
GUT_GENOME001531_00571 19-126 LVKNEYGDNPFSVAVVDKDGFTVLYQKEDGAKLLTIALTPAKAYTAVRMGQPTADFLARLQREHLEICYFADEKFVAMPGGVPIKNPAGKVIGAVGIGGLKEDGEVAM
GUT_GENOME243308_01235 6-137 KLELKEARHMVAAAIRKAEEIGVLESICIVDDGGYPLLLERMDGARITGPQIAWNKAFTAAGHKRSTHLFNTPPNGPALPGNEAFGIQWSFDGKFAVFVGGYPIVVNGEVIGGVGLSGGNGEQDTACGVAAL
GUT_GENOME122590_02533 309-424 LNGLEAEARGGAPVCLAIVNCGGGLAALLTMDGTPERAVSIAQGKAYTALRMESSTKDFHERLLRERITIADFCDPAFTTLEGGIPLFDGNGKCVGGLGISGRKPAEDGELAERLG
GUT_GENOME103718_01955 6-132 LDVAKELIAAAEGKAEEVGVPMCIAVMDDGANLVGFHRMDDALIASIDISQNKAYSAVSLKLDTETIQEVSQPGESLYGLGNTNDGRIITFGGGFPLEDDDGNVIGGIGVSGGSAKEDMEVAQAGVD
GUT_GENOME237519_00485 18-129 NDLASDGAPVCLALYGNDGALMALLRMPDVPARIEHMAIGKAYTCAKMGCSTAELHERLCREHITLADFMDPRMTSMQGGVPLKDKEGNTLLGIGVSGRTAQEDEKLALRIC
GUT_GENOME064423_02308 234-349 EDAFLLGCNLVEKVKKENLKNIRIRIVLNQDIVFQYLMNGKKGDQWLNRKQNTVELFKLPTYQIWQENERSHCYQQYTNDERYVICGGAYPIIVGKGMIGSVIVSGLAHNEDHQII
GUT_GENOME056415_01317 23-144 GLKFGLELIDYIQSHQLKSVGVRIVYKGLVIFQYLMDGKSEDIWLKRKERTVIESGHSSYYTFLHQDQYSTWIDNDNYTLGGGGFPITENQQVVGAICISGLKHDEDHQLIVEILRKLLKED
GUT_GENOME143607_00233 7-132 LTLADAEFLMEEAQKYAIEHNFKVSIAIVDETSSLLLMKRLDGASPLTTHLCLEKAKCSSMSGRPSKFYEELLQNGRLGFLSMPSAQGMLEGGKAIIYEGQILGAIGVSGVQSFEDAEIAQHAIDV
GUT_GENOME145483_05603 35-154 AMSTAAHDEARKLGVDISFSVVDEKGWPVFFQRYDNAMLLTTVLVPKKAYTSAITRMPSGELEKLTAPGAPLYGINTADPKLILIAGGYPLTYKGKLVGALGVGGGSWEQDEKMGQAALK
GUT_GENOME096466_02300 196-328 LPEDRDLDRAKRIAVAAEAEATRLGVPVVVSVVDAAGHPVLLHRMAGSLLASIDIAQNKAWSSAAFRKTTAELGELAADGGPLPGLADGNAGRVVLFGGGVPLFDGDELVGGLGVSGGTVEEDCIIVQQALQR
GUT_GENOME071336_02638 13-133 KIIHYLMERVEKANGKPLAIAISDGAGVLVSLTLMEGMAVRGGGFATHKAYTAAKFQKSTWEMVKHLDKQGVPMSAFCDPNLSPLKGGAPIKNEEGDVIGAVGISGWTGEEDQMLADEAAA
GUT_GENOME243108_03310 64-171 ETQKRNWNAFCIAIVNPSGELVYFEKQDNCQYASIGVSQHKARTTTRYRRPTLTFENLIGKGAYFTYLTTLDDVIASRGGNLLIVDGKIVGAIGVSGGTGSQDNVISL
GUT_GENOME231519_03027 16-133 ALDLADSCYAKRPLSVAVCDASGELLSFARMDNAKLLTIELTQRKAHTSSRLGCTTQAFLQRLQKEQLDISYFADPHFTALPGGVPLLYDGKCLGAVAVGGLSAQEDHDAALAVAENL
GUT_GENOME143413_03998 21-164 AAGLNTERNISLALASDLASRTLAVCQADGYNVSVTVVDRAGIIKTVLRGDNAGPHTVKASEQKAFTALSTKTPSGQVMENSQKTPAANNLKDIPGFLLLGGGVPVKAGDEVVGAVGVAGAPGGHLDAQCATKALEQISAQLKA
GUT_GENOME188428_00568 197-325 DLKALKKIADACIKQAEKLGIAVTICLVDGEGHLIFSYRMPNSLLVSIDLAKRKAYTAVAMKTATNELGAATNPGGDLYQLESVTNGEIVTFGGGFPIYDRENHLVGGIGVSGGSVEEDQLIAQAGLAC
GUT_GENOME216096_00748 14-130 DFLLGEAAKQNNRPLAIVIVDPTGNPISMTLMDGYHSRGCRFASNKAYTCAMMGMRGEDFHNLVANSGNSLSAFCDPRMTTMIGSAPLRDASGELIGAIAVSGWKSEEDQELADAAA
GUT_GENOME243044_04026 4-126 LTLDVARKISDAALAKCRELKLKPMAIAILDARGALKTFVAEDNTSLLRGEVAHGKAYGALALGMGSRAIFKRANEQPYFVDAINTMARGALVPVPGGVLINDKDGNLLGAIGISGDTSDNDE
GUT_GENOME066397_00206 60-181 LNLAKEISYAIEQMAEQMGLAVVVSVVDAGANLLLLHTMDGAYIASSCAAQEKAYTAAALKMPTHQALAMSRGGDLDGLTNGNGILLLAGGYPLLAGEKQIGAVGVSGGSIAQDQMLASFGA
GUT_GENOME231806_02658 33-138 GWPMVIAVCDPGGHLVCLQRMDDAALGSVEVAQRKASTAALFKRSTRGFEETLASGGATLRLLSMHNAILMDGGVPLLHEGRVIGAIGVSGMHPSQDGEVAEAGAR
GUT_GENOME096221_03417 11-131 DVTKILDAAEKEARAHQWNVTIAVVDDGGHLMGLRRMDGCATISAYIAPEKARTAALGRRESKIYEDIINNGRISFLSAPHLQGMLEGGVPVMVEGACAGAVGVSGVKSTEDVVIAKAGIA
GUT_GENOME225418_01624 23-152 FTRQNMQRVFDVAEAKANELKVGVTMCVADEAGNVRMLYHMPNANMVSRTLAPKKAWSAIAMKEPTMQIGPDIQPGGPLYEMSGNLDGRLVSFSGGIPLVWHDKIVGAVGVSGGLVEEDQLICETAVNAF
GUT_GENOME004026_00468 226-346 KLSQTALDYAQSIGVPIVVSIVDAKGVLMYFHRMSDALLISNDIAQAKAYTAVALKAATHEVHQSAQPDGDLFNIESMVNRKICTFGGGYPIIIDGEIVGGIGISGGTVAEDMDIASNALQ
GUT_GENOME124453_01690 765-880 ELHQLIRQAIEHARQLQVPVVVSIVDAHGTETVTWRMPDALLVSSELAPKKAWTAVAMKTATHELATTVQPGAALYGLESHLQGKVVTFGGGYPLWRDGQLIAGIGISGGSVEQDM
GUT_GENOME178919_00006 10-119 ILNQILDEAKQDGGAPVSVAVCDKSGELAGFLRMDGAALRTGHLARRKAYTAARMQARTADFLARIKREHIDINYFCDPGLTAFPGAAPLVSKDGEVIGAVGISGRPSEQ
GUT_GENOME096384_02747 23-128 LPPVSAVVLDAGGHLTAFARMDGTFLATIDIAMQKARTAVLFQANSGDVGANLHPNGPAYSLENSNGGLVGIDGGVPLRNAQGVVIGALGISGATKEQDGQIAALT
GUT_GENOME175139_01691 261-388 SLNLDLAERLADIAVKVANSIDLDVVITVVDASSNPILFKRMDNSLLCSIEISQAKAKTAVEFKADTLYLSNNESLKTLNNFSNGSTNYCFLGGGVPIKSLCGKIIGGLGISGGSVEQDCLVAEKTLK
GUT_GENOME018676_00172 43-158 LAQKAVAACTEAGYVVSVSVVDNAGVLLSFIRADGAGPHTVKASQAKAYTSASSRNPTSGIAKAVQSNPDAAGMTDIPDFLVLAGGVPIKAGNATVGGIGVAGAPGGHLDEACAQK
GUT_GENOME141054_02109 18-133 LVNRAGEPGKDPLVIAICDGAKELVSLTMMDGTYKMNEHLASNKAYTAALLHQTTKSWKEFMEKKGTNLAVYGDPRMTSLPGGTPVITEDDVLLGAVGVSGWSMEEDQMLASEAVE
GUT_GENOME167281_00983 81-209 MTLKLANALIEKVKAYAQKMGVNVVIAVSDQAGRPVAVQCMDDAYIASFDIALNKTFTSASLKMSTEELSHLSQPGQSLYGIQFTNQGKIVIFGGGEPLKADGKIIGALGVSGGSAEEDTAIAAYGKEI
GUT_GENOME183774_00881 208-331 AEYIAGKCIEKSYDIGVPMVICIVDFDGNVILLERMDNSLLISLKVAFKKAYTAAALKSPTGELYKALLPGGEFYGLNNDENIITFAGGFPLKIDGHIVGAIGVSGGTVDEDTSVSKYGVEVFE
GUT_GENOME176528_03243 25-152 LSEKNLSLDLADKLAQSAIQACSAKNYNVAVTLVDRAGTPLVIKKMDNAGPHTIEASRMKAFTALSTKNPTENVMKNAEANPGAANLRDIPGFLLLAGGVPVKSGDLVIGAIGIGGAPGGDLDQQCAL
GUT_GENOME113272_00966 39-170 EKMTLSLARQIVERVKIKAKETGVNAVVAIADAGANVITVDCMDDAFIASYDIAVNKAFTSVALKMSTYDLGKSAQPGGTLYGIQNTNNGRIVIFGGGEPLEVDGKVIGGLGVSGGTAEQDTFLGSYGRQVF
GUT_GENOME142616_00510 211-332 LCRAVLDKAKEMHLPVVTAACDAGGNPTALMRADGAYIASVDIAQSKAFTSVSVQMSTEKLGALCQPNGPLYGIQNTNQGRIVIFGGGIPLYRNGVLIGGFGVSGGSAEQDTGLAHYAEEIY
GUT_GENOME163309_01879 61-186 LDMAKKTAYAAELAAKAVNVKAVISIVNEGANLIYFSAMDDSYIASAKISQDKAYTAAALKMPTYKALEESRGGALDGLTNGNGIMLLAGGEPLFANGRLVGAIGVSGGTKDEDALIAKTAAEVFS
GUT_GENOME207970_01947 21-142 ARQILEHAQTQAAARARAVSIAVVDIHGALIAFARDDDASGVTINTAIEKARTAALLRESSKVFESFINGGLPSFLSTPDVTPLQGGVPVVVDGLIVGAVGVSGASGDEDAELATSAAMLFS
GUT_GENOME161807_00471 277-409 GRITLDSAKRLIEKIEQEALRRNKPSVIAVCTPDGNPVAVHVMDGSFLVSFDMAVKKAYTSVAVKMSTMELSRLTQPGQTFYGLGKMSDNIVIFGGGVPLKVGDTIIGGLGISGGTGEEDNSLAEYGLQVLKE
GUT_GENOME095287_00328 31-156 SLSPDVALDLARASLEACRAAGYQVAVSVTDRFGSPQVILRDRFAGPHTLTTASAKAWTATSFRTNTSELVALSQPGQPQAGIRHLPNVVVLGGGVLIESAGTTVGAVGVSGAPGGPEDEGCAKAG
GUT_GENOME062281_02902 37-154 QKALRENKPIVISITKNRKQIFYAALEGTSKNNEDWVRRKENTVYDFEKSSYEMKLSMDLKQDDLWNRYGLEKGNYAQAGGSIPVFVSGTGMIGTVTVSGMAQSEDHAFVAEALKTLK
GUT_GENOME219940_00239 37-161 PVLTGEQAQAMVEVVMHEVRESGHAVTVTVVDRSGQILAVLRDHHAGVHTLNASYKKAYTAASQKRETVAIARGIRDGSIPSDIRYLDPNFSLMEGGIPVILENVVVGGIGVGGAHGSEDGRLAR
GUT_GENOME131541_01799 33-158 LTQNGALKLAEQAKVEAKKMGKNVSVAVVNSSGATILLLKGDNVGVHNTEASRRKAYTSVSTKSASWDLMKKVLSDSSSGNLESMPELLLLGGGVPIWKDGILIGAIGVSGAGGGENDHKIAKEAV
GUT_GENOME000337_00948 33-163 MSLEVAKLMASNVMLACQKQAKPAAVVILDQGGNLLASQRHESVGIHNLIAAERKAFTAYSTKTSTLDFMRNAQKNPDAQNLNSLPELLLLGGGVPVIYQNQIVGAVGVAGAGGSVQDHQCANSGIQTTLK
GUT_GENOME207500_02325 7-136 LNLKIATHVIEAVQQIANNYGTPVVIAIANEWGTPIAIHFMDEALPASFDIALNKAYTSATVRLSTEEIRELSRDGGELFGINNSNNNKIVSFPGGFPLKINEKVVGGVGVSGGTAKYDNELAFLAKEIY
GUT_GENOME257704_04000 26-139 RAEELGLRLSIALVDASGLPVLTAHMDGAAQPCREIALKKAGTAAAFDKPTGDWQAYLEQSAPTVRQGLPLQSGLVLFGGGEPLRLGEAVIGAVGVSGASEAQDMLCARAAVER
GUT_GENOME207893_01832 185-311 IKMNREIAEYLMEKLLKKSEEIKVPMVICIVDESGNPIMFQRMDGALLASINISIGKAYSAVAFRMSTDKLKELALPQGELYGINNMKKIITFGGGEPLIVNGTIIGGIGVSGGTVEEDMMVAQFGK
GUT_GENOME141004_01231 198-322 LSFSFCEKLMHQVCIVSEEIGVPVTLAIVDAHGNARFNYRMEHALLVSAELATKKAYSAVAMKTSTEKLTEAVQPGAPLYQLETLTNGDIVTFGGGVPIYGKDGAIIGGMGISGGSVEEDIHIAK
GUT_GENOME070336_01335 10-129 ALLIGLEAVRKASSMNVNVSVSVMNKYGSQIFFCKMDNALPISEKMAFKKAHTSVLLQMPTADIKKSIDEAWHGLDTVMAGEIVMFGGGIPIFRNNEFVGAVGVSGGNNEEDVLIAKTAA
GUT_GENOME096140_03762 336-471 VNENDEKMFYLGIYAVISAGSKKQLENDVTAFCSMAEGEGFSFEPAIWEQIEAINTALPTGARFCSVMQPVFTQPLCALTPFVVQELYQPGGLFYGINQVSKNVLIGDRKLLKNGNGFILGITGGGKSVETKMEII
GUT_GENOME193403_00024 51-176 YQDAWELVTRARRKAEELGLAVVITVVDPAAQVVMTYRMENALLVSNDMAYKKAYTAVGMRMQSKDLAPLTQPGQWLYQLETMTDNKVVSLAGGIPVYCQSEMIGGIGVSGGSSEEDQSIAEYAVG
GUT_GENOME143124_01725 11-136 LTLEAVRTALDAAVRKASAQGIRINVAVVDASGVPLGFLRMPGAFLHSADIAVDKAYTAAGFRLPTGAWNDVLKGLSPAVRQGLPARPRFAGFGGGFPLIAEGELVGGIGVSGGSEEQDEACARAA
GUT_GENOME151856_01386 63-186 TVGILCHAAKEKSVQMGLDISFAICDADGLPRLFCRFGDALVLSTILVPAKAYTAAVTHTPTEALGEFVADGGNLMGIHTTGDKITLVPGGIPLFRDGKIVGAIGVGGGTKEQDLEIANSIVAK
GUT_GENOME095736_01718 46-166 SCEHSGYAVSAVVVEASGRIRFEAVGDHATVHTPTSAYRKAYTVVTMGPIFHLDTSSAFAAAVAKNPSGPALSSLPDVAPLPGGVAIKVGDEIVAALGVGGAPGGEKDEVCAQAGVASIQD
GUT_GENOME095592_02152 24-135 QAAPLTVAVLDAGGHLLALQREDGASLIRPEVATGKAWGAIALGKGSRLLALDAQQRPAFFAALNGLGERPVVPAPGGVLVRDQDGRVLGAVGISGDTSDIDEQCAISAIEA
GUT_GENOME257704_04243 6-134 FERAAQLAAATLRHARRLGLRPLAAVVLDAAGHPLAVLRDEQASFLRPQIASGKARGCLGMGFGGRELARRAQAMPAFFDAINSLTGGEVIPVPGGILLRDAAGNLLGAIGVSGDTSDNDERCALLAIE
GUT_GENOME106866_01630 24-141 LDFGLRVIEIAKKEKLKPLRIRVSIDGDIVFQYLMDGKTSDLWLNRKENTVLKTKQSSMNVFNNQDFYKNIVDDENYAICGGGYPLIVANEFKGVFCISGLEHYEDHALIVRVLKEMK
GUT_GENOME098156_00670 6-128 EKALQIMTRARLASEKMGVLMSFAIFNDKQDLILFMRMDGSSGESVRLAQLKALTCLKLCENTDNLAYLVNKENGVLRGIRYDNAISLIGGGKLICDNGTIIGAIGVSGGSETQDIEVAAAAL
GUT_GENOME000519_01728 232-352 QAMAAAAQSRGAELGVPIVFAGVDAGGHLMLLHRMEDSLLGSLDLASNKAFTAAAFKQPTADLSEASLPGAELHGIQNSNDGRVVVFGGGLPVFVDGVLCGGIGVSGGTVDQDVTIASFAM
GUT_GENOME095246_03842 16-119 SRAAQLGVPVNIAVLDAAAHLLAFGRMDEAVLGSIDVALGKARTSALFGITSEAVWDYCKPGAPAPGLERCNGGLMTFPGGAPVRDPSGRLAGAVGVSGGSPDQ
GUT_GENOME171657_02337 5-126 SLSKANTIIKAALAKGTEAGMRPLSVAVLDAGGHLKAFQKQDGASLLRFEIAFGKAFGGLTIGTGSRTVEKFAKERPHFVEGLVAASGGRVIPVAGGVLIKNAKGELLGAVGITGDTSDNDE
GUT_GENOME096381_04686 92-197 KGGQKVSVAVVDRNGNTVVTLRGDGAGPQSYESAERKGFTAVSWNAPTSQLVKNLEKTPNLKDIPGTLFLAGGVPVRAGDAPVAGIGVAGAPSGALDEEFARAGAK
GUT_GENOME147366_03234 36-165 KAPILTYEMAERAADSAFQAALKEAKSVSVSVVDRSGQLMASLRHHNAGVHTIQASYKKAYTANSTKQSTAVIANNIKEGKSPSDLRYLDDNILFLSGGAPIIIDGIVVGGIGVGGAQGHEDARYAQIGA
GUT_GENOME143709_01026 4-132 LTLDMAKLLIAAAERKSRELGLAEVIAIVDEGTNLIALHRMDNARIAAIDIALNKAWTSAAMKMPTSNLSEAALPGGPSFGINTTNQGKIVILGGGIPLVKKGSIVGGIGVSGGTSAQDIEVANAAVQA
GUT_GENOME143124_04680 1-139 MPGLTLEQANAIIAGALAHAAGKGYKPMAVVVLDAAGHLKSAQRQDGASMFRVDVATGKAWAAVGMDASSRTLAQRAKDNPNFFVTLAATAQGRFLPQTGAILIRDAAGAILGAAGASGGTGDEDEEICIAGVTAAGLQ