UHGP-MC 51725


Information


Number of sequences (UHGP-50):
250
Average sequence length:
80±10 aa
Average transmembrane regions:
0.17
Low complexity (%):
1.4
Coiled coils (%):
0
Disordered domains (%):
17.37

Pfam dominant architecture:
PF00801
Pfam % dominant architecture:
5600
Pfam overlap:
0.71
Pfam overlap type:
extended

Downloads

Seeds:
MC51725.fasta
Seeds (0.60 cdhit):
MC51725_cdhit.fasta
MSA:
MC51725_msa.fasta
HMM model:
MC51725.hmm

Sequences list (filtered 60 P.I.)

Protein Range AA
GUT_GENOME000102_022292446-2524PYAYISANSRGELGVEMTFSAASSYDDTGIVSYKWDFGDGSTADTMEAVHTYETAGTYEVTLTVADAAENTATVQMNVT
GUT_GENOME018415_007011104-1192NKAPLAIIDVLETMSLEGNAIGLDGSRSYDPDGSIAAYEWKDAQGKVVSTTKSTNVSGDGLTTYTLTVTDNKGATASTSQEVLIAKPVI
GUT_GENOME018415_01119360-464APVAAFTAVEKNGTVTFTNTSSDANGDALTYNWDFGDGTTATKNDNKSFTHVYNQVGNFTVKLVANDGTLDSQVATKTVNVTCVGASCEVQNTAPVAMFEVSVSG
GUT_GENOME103758_01682753-829EAPTIKLDFPKEANTDEKINFSSEGSKDDGKIVSYNWDFGDNTTSSEKNSFHIYKDPGTYIVKLTVTDDKGLKTEKA
GUT_GENOME140277_02939188-273TKIMRPSVDFTFSVDQNNWKRIIFKSEAKYAYSSSWTFGEGDAASSLANPSHTYTDKGSYQVTYKAVGITGEEVVVTKTIPVLNIG
GUT_GENOME137702_00704213-271GEPSPFSSFARSYSPITSYEWTFEHGTPERSSEAEPSVLWSEAGDYQVTLTVGNKNGKQ
GUT_GENOME214946_01011365-423RIQFRNLSSDEEAGFLWNFGDGTTSTEKNPFHVFASSGDYTVKLTASNEYGEDVMQLKV
GUT_GENOME237520_020461521-1608APTASISNGSALQSTVNYGIRFDASASTDNFGIRSYEWDFGDGSTGWGSTAIHEYSAAGDYTVTLTVTDYSGNTATATIPVTILSPDD
GUT_GENOME090432_017621037-1115INALPTANAGVDFNATVNKTVVFDGSRSYDADGQIKSYLWEFGDGSSGWGIAPTHIYTASGTYIVTLTVTDDNGTANAL
GUT_GENOME283332_01470495-564ADFSFTDTLTICGTFTGTNAETVTWYWGDGASDVGNKVNHTYEEPGTYEIRLVASNDLGDSEVTITVTVG
GUT_GENOME014082_019541406-1476NVEYVYDASTTVSDLGVQSYAFDFGDGKTASGTAKKQVHKFTSVGTYEVTLTVINDAGTKSQVTKKITVEE
GUT_GENOME051201_01281538-612IAGEEIAFDGSACTDNVRIKRFVWDMGDGNTIFGVRNSYIYDEAGEYTVTLTVEDATKNSSTSQIKVIVYEPSEY
GUT_GENOME057047_0004922-93VDFTYSPDAPRAGQSVSFSNQASDGEKWAWSFGDGSTSTSKSPVHTYRQPGTYTVILKVDDKSSLTRTRDIT
GUT_GENOME039663_00131122-207ANVLPVAAFSYSPLSVKAGDSVTFTDESKDEDGEVISRKWLFPDGTTAEGAQASFTFASAGIFKVTLTVVDDRGAESSLTKALFVR
GUT_GENOME271085_011762972-3063DKENNEKPVARINGNEVMEKGVEEYFDGYSSTDDDSIVSYHWDFGDGTYSDDIQPIKSYRLAGTYEVKLTVTDSFGEQSTATFTVTVKERTA
GUT_GENOME206559_00763201-280ISEGQSVTFADLSDGATEWHWTFAGGTPAESNEKNPTVTYAVGGDYDVTLTVGDGSGATATAERKAYVHVRKEQAKALIG
GUT_GENOME029189_0181223-95PHNPDEPSMDRLTPNYDYAVYKNGVTFTNKSKGASRYLWNFGDGNTSTLKHPEHIYSQPGSYKVELTVKNGKE
GUT_GENOME237421_01054261-335MPVPDFTFAQDSNTFTMQNRSLNATSYEWDFGDGTTSEEENPTHTYTEPGTYNIILTAKRHCMTVAMQTTVEVEG
GUT_GENOME261128_0184518-110CQDNDDVNIKPVAAFKVGTTTIEVGQSIYFTDLSFDEDGSIVKWQWDFGNGASSEEENPSVVYNEAGEYKVVLTVWDNNSIQNENAFDKTIIV
GUT_GENOME263661_0075143-116GESVTFTDATVPNEGTKIVAYLWEFGDADKSTSTEQSPTFVFKKDGIYLVKLTVTDSNGLKVSSQKEITVINPT
GUT_GENOME207745_0219159-124SVTFTDATTNNPIGVTWAFEGGNPTTSDKSVVNISYSTKGKYDVTLNATNAIGTSTLSIPDFVQVG
GUT_GENOME096287_002011089-1168TVTPPNQAPTAAFSFTTSGLRVDVDGSASSDPDGTVASYAWAFEGGGSGSGATESHTFASAGTYTVRLTVTDDDGATAQL
GUT_GENOME023274_00189702-777DKTLVTIGEKVTLMNQSNLASVKYKWEIDGASPATSTEKNPQVSFDKAGSYSVKLTAINEKGQEDSVTQTELITVI
GUT_GENOME249369_00126821-906TQPPEAVLTAPSTGFAADPISFSAAGSSDNREIRSYHWDFGDGETAEGKTVTHHYTAGGSFTVTLTVTDADGNETKETQTINVTAE
GUT_GENOME000259_004131319-1409VVNDTEAPQAMIALLNPSGIPGMALTFDGSGSTDNDAIASYYWEFGDHTVSNQAVSAHTFQKAGTYQVKLTVTDAAGNEGTGTVEVKVQQP
GUT_GENOME201997_0042628-99SAEPPYASFSSEVYKLGVKFDNHTRNASRYEWNFGDGTTSTEKNPEHIYAKTGTYWVTLTASNGIGERTSSQ
GUT_GENOME103718_02099758-846IDELDPIARAVPSATRVDMGESVDFEAEDISGDENSVDSLTWEFGDGTTATGWSATHSYDVPGEYTVTLNATAESGCTASYTVTVTVDV
GUT_GENOME018788_020822745-2833EPPVAIVTPSIMGIAEMEVAFDGTASTDNVEVTEYNWDMGDGTVYKDAPHPIHVYQEPGIYTAVLEVKDAAGNSDSATITVEILENNGN
GUT_GENOME136870_007212264-2350APVPVLDCQSTVVVGSEYMFDATGSTDDTEIVSYSFDFGDGTAVVKNERGKTLHIFDKTGKYKVTVSVTDSDGNTAKLTKEITVTSR
GUT_GENOME018857_02045137-213LNEKLSFKDKSVPAQGATLTKWEWHFGEGKGSVSEEQNPEWTYTTSGSFTVSLKVTDSKGNSSSTSKDIIVIDPSDL
GUT_GENOME017010_01246920-997FQAMNRKGEAPLTITFEDLSQYDPTSWTWTFEGGTPATSDEENPTVVYETPGSYDVTLTVSNAYGTSTVTKKDYIVVG
GUT_GENOME054668_00991941-1028HERVNKAPKAVINGFAVMEIGVEEFFDAGYSTDDTGIVSYIWDFGDGTTSNEIKTVKKYMKPGQYTVTLSVVDDDGVKTTETMLVTVN
GUT_GENOME210816_027801777-1859VEPVADAGIDLYGLAGMSIRFDGSKSTDNHYIASYKWDFGDGYTANSAKATHTYSDSGIYTVTLTVKDSAGNTSSSEISVTVH
GUT_GENOME096202_04339660-739ARFKTDKTLVAPGEQVTLIDQSTEVTEEWLWTIDGASPSTSTDKQPVVTFAEEGAYSVTLTAKNSAGQDSVTKQALITVR
GUT_GENOME142465_01500721-802PKAAFTVNRTLAAPGNTIKFTNASSKNATSYKWEFDGATKTTSTAKNPTVTYRKAGTYNVTLTAKNKDGQRHVTMKKLITIT
GUT_GENOME234140_00832268-338ANFTYEVNDNLVQFNSPNSPGISYEWDFGDGITSTRGVEVHKYLATGTYAVTLVATNHCEADTLTQTIVVS
GUT_GENOME162747_00974666-736YTAQNEEVRFSAKFSNGNNDNINYKWDFGDGTVADGKDVTHTFERTGQYNIKLTATNAYGTSSTDSLTLQV
GUT_GENOME155212_0031155-118KVQFIFSGSDASGITWDFGDGTTSNEWNPLHEYAATGVYHGSQVVTNSYNGGSSTTLNFTIEIM
GUT_GENOME256155_0075535-106GEKPKAAFVADTDFMKVSFENKSTNGESYYWEFGDGTTSTEEAPEHEYAIAGNYTVTLKVNSAAGYSDVYEG
GUT_GENOME180664_00266222-303NEPRIYELVKFELNPAYYVAGEVVSVAWDFADGQNAVGENVIHCFTKSGALNVKCEVTYADGSKEAATMNVNVGAFAWASTN
GUT_GENOME000128_025902621-2707DEDTEAPAAIITENICTLTGFEISLDGGASTDNVRITDYTWSMGNGDVLKGVMPVYTYNKEGVYKVKLTVKDAAGNTSSAETTVKVL
GUT_GENOME256097_0218844-110VPVGVPVRFHDKTEGYPTEWEWTFEGAEPATSDVQNPEVTYPEAGNYFLMLKAGSDGKYDRCVFEKG
GUT_GENOME141207_01689617-698PNAVITANNEGKVGESITFSSENSTDTDGQIVSVLWDFGDGTTSTQTQPTHQYGSEGQYTVSLTVTDNDGLTATATHNVTVS
GUT_GENOME266641_01446671-756FRIVREDASASESLHIFEGESIGFKSITQGEPEGLTWSFPGAEVETSTEANPVVTYNKAGTYDVTLTVTRGTETAKAERKGFVVVS
GUT_GENOME143497_020252416-2496VQAVLTCELNQEAGVEYTYDASGSWGDLDIKSIAIDFGDGTAETGNHFVHKYTDTGTYTVTLTVTDTEGNTDSLARTIEVT
GUT_GENOME237421_00995790-861PVADFITPSMGCAPDTIAFENVGRGTSFYWDFGDSTYSAEQNPTHIYTTPGIYTVTLVANMANGCSTGDTLQ
GUT_GENOME208825_007842250-2333PVAVLPQNMTGLTGVASHLDGTGSTDNVRIKSYTWDFGDESAKQTGSKPVHTYKKAGTYTVTLTVADQAGNKNSTYGTIEVKDA
GUT_GENOME051954_02333292-361ADFTSHVIDSKNRVVAFTDESVGKVLEYYWDFGDGMTSTEKSPVHQYRKADKYHITLIIRGEGGESRRIK
GUT_GENOME072652_008871175-1256APTSVPGEGVFGIAGKAVSFDGTASYDNHYVEKYYWDFGDGTVSDEAKVKHTYNSEGDYNVSLTVYDSAGNFDTAQMKVKVY
GUT_GENOME141210_00853599-709PQPNRKPTAAAGADQSVTGPASVVLDGSQSKDSDGTIASYAWEQVSGTAVVLAGANTAKASFDAAEVTVEEQLTFKLTVTDNEGATASDLVVVTVKPAGVVDPVNNAPVAQ
GUT_GENOME215851_00801205-274ADNTKWTQVTFTNTSSNALSWSWDFGDGTTSNEYSPVHTYSKGGIYNTTLTTYGAGGDALSSTQKVVIIK
GUT_GENOME238305_00005701-773VADFETFVSETGEVSFGNRSENADTYKWSFGDGQTSTLAQPTHIFGKPGEYNVSLIASNPTAASAMVKSVTVA
GUT_GENOME096372_02933161-247GIYDNGANNSNPVAKFTTHKEKYRLGETVKYIDLCYDPDAEGIAKYEWTGKQDAFFKPGSYPVSLKVYDRHGHVSKVFTKNVIVEDI
GUT_GENOME265506_00359956-1032PLADAGLDVMGIANEKISFSGAGSWDNHYIASYEWDFGDGAFDTGINASHSYKTDGTYTVTLTVKDSAGNSDSHSIK
GUT_GENOME100286_01429209-296TSRMILAGQTVTFEDMTMGRPENWNWTFEGADTRTSTERNPVVRYSTAGRYKVTMTAYNEVNSSTIEKDEYIMVLPTTGLSMWFPFNG
GUT_GENOME190320_0182527-108PDEPSTVLPKADFYTNQYKSRVRFGNQSKDASRYEWDFGDGSAISKEKSPLHIYEKAGTYKVLLTAMNGLGKDTLSHQVTIE
GUT_GENOME239690_025931756-1837PVAIIQANDSAKKNKNVQFNAKASRDADGEIVSYSWNFGDGDTAVGSKVKHKFTTAGTYTVELTVTDNDGANGKAVHTITIR
GUT_GENOME143718_00766713-793DNKVPRADFTASRTLAAPGSAITFTNTSSLNTETVTWEFEGGDITTSHEHSPVVTYHKEGVYKVKLTASNQAGETPAEKNG
GUT_GENOME042797_01471872-944GKRVSFTNATTPSEGCSYEWSMPGSTQETATTINAATSYEKPGTYTVTLKATNAAGTSTYSADIIVSSQIPTP
GUT_GENOME237837_00008766-835AEFDLPDFVCHPDSVHFENLSRADNYLWLFGDGTSSTEENPTHFYTQSGSYDVTLIAYKDNACKSSDTLV
GUT_GENOME207399_0001133-121FQCSRHVDCEEANFYVFADNYAVKEVIEFGDKTKNAKSWKWDFGDGSPADFRQRTFHTYEKTGEYVVTLTINGSCTSQKVLTITSLNQQ
GUT_GENOME142057_01293756-840ANVAPVARFELNVEGLNVTAQNTSSDSDGSIVSYLWDFGNGQTSAEVAPTWSYTNAGSYSVTLTVTDDKGDSDTHQQIIKVEKPN
GUT_GENOME204333_006121238-1321SAPEPAISCDNTMEVGVEYRISAADSSDNSKVVSYHFDFGDGTESTQINPVHVYQETGTYTITLTVTDDDGNESSIEKEVTVKE
GUT_GENOME207903_02640797-880LPVAKINGPYTGTINNPISFSSEGSNDSDGKIVSYLWDFGDGTTSNAPNPDHLYTKAGSYNVKLTVTDDKGAYNSESTSVNVID
GUT_GENOME060301_0063441-121PSATFSIAPERGQIEGEIQFTNASYGGSGNFTYVWDFGDGTTSTEENPKHVYNEKGIFVVSLTITDSSGRSNLYRKTIEIT
GUT_GENOME100343_01910267-329VESVAVTTGERIRFIADTDADAESWQWSFPGGVPATSTEASPEVYYTADGEYDVTLTVTRGDQ
GUT_GENOME009676_002532634-2725PRPVDSTKPNVVLNADTSVVEGYEMVLSGMDSTDNDRIRSYQWNFGDGTPDASGPYARHIYKKAGEYIVTLTVEDTSGNTAYQTARITVLPK
GUT_GENOME221971_00373578-658IEEGQTVSFQNQSTNATNYVWIFDGGTPATSEDENPTVLYSKAGQYDVTLKAISASGETVKTKEKYITVKKAPVPAPVANF
GUT_GENOME233284_01274376-449FTYNVSGLTVNFEDKSTDVDGDLLAVEWDFGDGTTSKKAKPTHTYAKEGTYKVILTVDDGSEKAQFVKEISLYS
GUT_GENOME096499_0046426-107DSTTPSIAPKASFDFTADELTVTFNNTSENAVSYIWLFGDGNKSTDQHPTHTYTEEGTYEVELFATNSSDETKSTKKSVTVK
GUT_GENOME171348_01142784-868NNPPVAEANGPYIAMKGEEISFSSMGSKDSDGTIISYLWDFGDGSTSVEANPNYTYSTAGTFTATLTVVDDKGERNTDEAQVIIN
GUT_GENOME110782_00775889-977DITVYPAQLPVADFTMAEATKNAGEQFSFVNRSTGANASFKWTFAGARNETLNTTNATAVYDVPGTYTVTLTATNSTGTSSQSKEVTVK
GUT_GENOME190811_011941853-1935MKPEAVLECETELEVGVEYLFDGSGSAGSCPIISYEIDFGDGSVSEFPESIHKYEKTGKYTVKLSVTDENGKTAAVKKEIEVC
GUT_GENOME096508_02152665-742FTANQTVVAPGEEVQFFDQSSEVTQEVVWHFPGGEPTTSTEKKPVVRYKEEGVYPVTLTAKNAVGEDVITKEAFITVT
GUT_GENOME058864_03399249-314YPEQPISFINQSKYATDYQWNTFGNPATTTEESPVVTYSESGDYKPLLTVTNSKSTDSFSDIVNVG
GUT_GENOME140888_0080415-104GGDTPSASFAASKTSVSVGETVQFDASGSKAPSGTILSYVWNFGDNTRVTTSGPLIDHTYSIPGAIIVGLTITTDKGTYATSDVVRITVN
GUT_GENOME096269_040681869-1957NRPPIADFDWEPRPAYEGDHVTLTNRSRDLDNDPLTVIWTIRPPGEPAFTTTDEQPELRFVQPGAYSVTLRVEDPYGASHQVTKNIPVT
GUT_GENOME253277_0121825-106PDQHIFPNPDVDFTYNVDGDEYTLDFYVVSTIQFNNTSAKSGNFVWDFGDGTTSSEVNPTHKFEKAGVYDVTLTLDGVGKKT
GUT_GENOME103718_02642669-753SEPEAVIDAPDQVGADEEFTLDGSESTDRYGEVVSYDWEVGDESQTGETVTAALEEPGTVDIELTVTNDAGETSSTTVPITVEES
GUT_GENOME063562_000231883-1966APTAVISAPSNVEVGVENYFDASLSEDNVGIVSYLWDFGDGTTSTKEKAVHKYDNVGEYTVRLTVTDGDNNSSSKELKVNVYEK
GUT_GENOME235146_00592461-543VANAGGDKTGVEGVNITFDGSKSTDNGEIVSYAWDFGDGSTGVGKSATHSYSKAGTYTVKLVVTDEEGLIGSATISVKINEKG
GUT_GENOME000224_02590628-692VASIVTGEQIDFVDLSRGYPVAWEWSFPGAETETSVEQNPTVVYSNAGSYDVTLTVTNEKGEKNT
GUT_GENOME236863_00968140-212ANFDYDFYDNFEIDFYNSSSNATSYSWSFGDGQSSTEEEPSHRYSRPGTYTVTLTARNSKGSHQATAQITITK
GUT_GENOME242947_00827202-278TSSVTINEGEKVHFQNTSEGNIQSYKWTFTGGTPSESTDKNPTVRYDQAGTYPVTLTVSDGKSTNTYTRKEYIHVIT
GUT_GENOME232794_05287444-517SKFSFDKLQGRVKFTNLSSSKAVSYLWDFGDGSTSTEKSPLHIYKESGSYTVKLTAKGEVDSDVAEMTVLINDK
GUT_GENOME040519_03589132-212VTNFSYSPAICKVNEKVQFTDKSVDEDGEIVSYEWDFGNGQTSQEKDPEITFTTTGFVTVKLTATDDRGATNTKQATLYVR
GUT_GENOME216741_00736223-295PVATFTTQLDGSSFDTFVFTNSSLYWDFFSGEEPFSWDFGDGSAISHEINPKHQYGAVGDYNVTLTVRGITGE
GUT_GENOME176051_0234222-115DEGVNIKPVAAFKTPVTQLEEGQSLTFTDLSFDEDGEIVKWEWDFGDGQKSTERNPVITFNSIGEYNVLLSVYDNQGAMNANDFSKVIVVKEKT
GUT_GENOME096272_03440778-862NPPTANIDGPNVCKVGETITLSSKNSTDSDGKIVEYFWDFGDGNTSNEVNPTHVYKKEGTFNIVLKVKDDKGALGEKVLTIKVEK
GUT_GENOME024791_00945424-500GDLQAKFSVETYMNRAQLTNLSSENAKTFEWNFGDGSTSTERNPMHIYAEAGTYTVRLTARGIVKTVTAEQQIIINP
GUT_GENOME096499_0195125-99MPMALFDYQIDGITVQFTNYSTDATTYMWDFGDGNTSTEENPLHVYAESGNYIITFTATGEGGSKTIKEMLKIQK
GUT_GENOME180609_02170438-515TTDNSDDAKVYITPGSSVTFTDMTEGEPTSWQWTFAGAEPATSNEQSPTVTYPNTGTFDVSLTVSDQAGNISTKTRTG
GUT_GENOME022218_01738546-625DASADAKINIFEGDEVHFKDLSQGHPTSWEWTIEGVTPSVYNEKNPVVKFEKAGKYNVKLVVAKGEERAETTKQEYVVVN
GUT_GENOME268965_008963-69VDFNYTTVGLEVTFRNRSKDIPNNSKLLWNFGDGDESSEENPIHNYKLSGIFSVSLSVKNQDDIIIG
GUT_GENOME037779_01951228-298IYQGETVQFYDRSEGSVKSWQWEFEGGTPSASSERDPAVRYDRAGTYSVKLTVSDGTTQSEAVREGYVVVG
GUT_GENOME023473_00290474-539DMAFVPAGVPVQFTDKSEGAPTSWEWKFTSDSETLTSTEQNPTVTFTKEGRYDVTLTTTNAIGSNK
GUT_GENOME233284_01275419-498PKADFTLSVNKLNIKATNNSSDEDNDKLSYSWILSDGFTSNSKNLDHTFAKDGTYKVTLVVSDGKTQSTKDDSVTVASDS
GUT_GENOME047071_003052947-3038SPTNEDNENPIAKVRDNFSVREGAEVILDAGSSSDNVKIQKYEWDLGNGDKKSGKKIKYTYTEKGTYNGKLLVTDTSGNTATKKFTVKVRSR
GUT_GENOME217115_02083117-190SPEKVVAGEPVQFTDRSTDEDGEVVAWEWKFGTTTSDKQNPEFTFAAQGATEVSLTVTDNKKGRNTKTVTIDVG
GUT_GENOME039230_02066269-356DSEGSSIDIAQGDSVSFADASQGNATSWLWEFEGGVPATSTERNPTVTYPEAGSYSVSLTVGDGTSTSTVTRQGYVNVKAEQPSALIG
GUT_GENOME015459_0014631-117PRAAFTMSEAPEGYEINKAITFTDASTPEANTNIVSWLWSFGDEANTTSTEKSPTFTYTKEGVYNVTLTVTDNHNLKATLTKSITVL
GUT_GENOME153033_00680126-201SKGCVKVDQLVNFNLECTGNPTSFEWTFPGGTPATSTEQSPRVKWSKKGDVEVTLVVKRSTDQAEKTLKKTIHVGP
GUT_GENOME262276_01036129-208DFSYSPMMVNVGDEVTFTDKSTDSDGEIVSREWVFPDGSTSTETNPSYTFTQKGMFQVKLTVADDRGGESSATKSINVRD
GUT_GENOME140888_010091438-1520PLAIPDFEADVTSGNAPLTVQFTDKSLGAASVQWDFGDGTTSTAWDPSHTYSAPGTYTVTLTTINAGGGSTQTTMTVTVIAPP
GUT_GENOME282631_00334447-534PSGLASQTMNTIKVNEKLSADFEWNPLSPATHRIVFKNRSTGAVKYNWDFGDGATSSEFSPTHSFETKETKTYHVSLTAVSADGQESV
GUT_GENOME207399_00076111-192ELSALFSFTQQGSDIAPLTLTLSNQSEGATTYQWSFEGGTPAYSSEKNPTVTFEKGGIFTLRLEASNHSQRRVMEKTVVVRA
GUT_GENOME194878_003321262-1355EDNTLFNDKRLPLASFTVSDGPYYVGSAVQIEDTSKGGIVSWKWDVTGAEEKTLETENPVLLFTEAGQQTIALTVTDAMGRVRTTQKSITVQEM
GUT_GENOME182387_04105216-294GGYIHFLDKTLGMVEDWKWTFEGGNPSVSTDQNPVVQYVNPGKYKVTLTAKNSVNSSVKEKEGYVYVISAEKLVLYLPF
GUT_GENOME213599_012992397-2503QVLPLPEAAFTWNADYAVTGLPDSLRLERRPNGGIRFGNYSSYDAPEGMDDRLTYSWNFGDSTATTMEKNPVHLYTDNGTYEVVLKVKNAQGCADSISDIIYISVLK
GUT_GENOME207452_00699709-790PTASFKVSNTLIAPGQSVTFTNTSSEVTEEIQWKFPGAKVESSIEQNPTVTYEKEGVYPVTLTAKNSSGENVETKTELITVT
GUT_GENOME195475_00611444-516RFSYDKLLGRVQFLNQSSDKAVRYEWDFGDGTTSSAANPMHTYAADGYYDVSLTAYGEYGQDVARATILINDR
GUT_GENOME231250_041831745-1828NVPPVAVIDAVERAAKNKQIDFDGSASSDADGRIVRYSWSFSDGGTELGPKVNHRFSATGGYTVTLTVTDDDGAVNSVVHKIVI
GUT_GENOME252911_01762230-316GYMGDFYIDAFTVSGLKPVESIEAMTGEMIGLVDISSGDITEWSWSMPGAVPATSSERNPQIYYTRDGNYDITLTVKDAAGNTSSKT
GUT_GENOME207494_01205653-735ADFDVSSTFVAPGQEVKFTNLSSMASSEFEWTFTGANIETSTEENPTVTYEKEGVYSVTLKAKNALGKDEIVKKGIITVSNAA
GUT_GENOME207914_00078769-851APIAFIEGPLKGKVNDIIKFSADKSFDEDGQINEYRWDFGDGTTSNEKNPMHIYSEQGAYTVKLTVIDDKGGEGIAKFSITID
GUT_GENOME017152_01577217-293GEVASVKINVGEEVHFRSLAEGATSWEWTFEGGNPASSTDENPVVSYDAPGKYTVKLVVGNGTETAEVIRDAFVEVS
GUT_GENOME103750_01699777-859NSAPNAKIEATNYGKVGETINFIGEKSNDVDGKIVEYAWDFGDGSTSSEKNPSHIYENEGTYLVKLSVKDDKGAIGKGSTTIN
GUT_GENOME157353_00778986-1056GSTIKMTDLSTDADTYKWEVPGAQPETSEEQNPSFSFPQSGKYDVTLTVTNVKGSSTMSKTIQVQKVEGEL
GUT_GENOME013468_00267601-679QSALNNAPQVSGTAVVNKSVLTFKSTSTDAEGDELSCLWNFGDGSSSSDCEGTHTYAKDGTYSVQLTVNDGKNAAVTKE
GUT_GENOME155337_014261297-1378APTASLQGDSISIEGYSLSFDGSLSKDNFQIKSYKWDFGDGSKAEGKSATHAFANEGEYTVTLIVEDESGNTASDKMTVTVY
GUT_GENOME247230_0036836-105DVDYGSFYSYTLQFVFDGSNAETIEWDFGDGSPVSTEWNPSHTYAEKGVYYVTQTTTNSYNGGSTTVEVY
GUT_GENOME283847_02058621-702PNVVITSSQTVAKVNENISYSAKNSTDDKGIKSYYWDFGDGTKSHEINVTHKYADEGTYTVSLSVTDKGNNVSKTTCKITIK
GUT_GENOME232544_00440373-473LDVEIGAQPTSNFTFTTVCQGTPTQFNSTSTTNPSGYSIDSYLWNFGEPSSSNNTSTLQNPTHTYANSGTYTVTLTTKIQTEEGFCEDTETKQVTVYSNPN
GUT_GENOME238304_0047126-105SNPPAYATFYYQADGLTVNFTANVPSDVYSCEWSFGDGKTGYGENVSHTYTSAGTYYVNMTATNKGGTKTYSESITVKKK
GUT_GENOME216384_006891184-1265SAKFNSVTLVGYELSFDGRKSWDNNKLVKYTWDFGDGKTAEGMEATHKYAKDGIYTVSLTVTDESGNTNTDSSEITVYGSDY
GUT_GENOME100828_02318247-317VKTGETVQLADLSTGEPTQWSWSFPGGTPSSSTERNPNVYYTRDGKYDITLTVSNADGSNSKTRTAFVSVT
GUT_GENOME269654_02035194-264PEASFTYSLDGTKVKFNSTAKYVTSVLWDFGDGTTSTELNPEHVYDDLGIYDVTLNAKGISGDASYTLRLK
GUT_GENOME263855_00116196-271VSSFTYKQDDANPNRIIFTNTSNGAVSVLWDFGDGATSTEMNPTHVYATKGRKMVTLKSTGILGTGDTKEGKVEIW
GUT_GENOME194592_0241892-193SSELNKPFAWIQGPYVAKVGDPVDIDAAASHAVSGSLTSYEWDFNGDGIYDETGTSPRITHTFSQEFSGVIGLRVTQSDGQTAVATTQVDITDDGDNTPRDQ
GUT_GENOME049450_01779134-201PVAFVNRSSFENGEIVGYAWNFGSGDADDASTEENPSKRYSAPGDYTVQLTVTTDNGREKSCQKTVTV
GUT_GENOME207254_02345772-856NKEPKAVIKSDSSVIVEEEINFDGTESKDEDGEIKAYEWDFGDGEKSNEAKATHKYNKTGEYEVKLTVTDNNGGINTESKKIKVV
GUT_GENOME092528_00176734-813AAFKASKTLIKPGEEVTFTNSSSTNTKSVSWEFEGANITESKKDEQKVTYDKPGTYTVKLVAKNEAGEDTSQTKGFITVS
GUT_GENOME096381_03869509-579GQAPLTVAFTAKATDEDGDTTKLSWDFGDGATSTEANPTHTYQENGTYTATVTAEDPSGRTGSASVHVTVG
GUT_GENOME258880_00498625-713LIDDFAVVAPKEDVSVVRVPVGEAVKFTDCSTGDPTSWEWHFPGGTPEASTGQHPTVTYSQAGTYSVTLTVRNGEEESTVARENYIVVH
GUT_GENOME283137_02222239-325VDGLTVTGASSVSQVEVLTGEPVKLANLSGDEATSYEWKFPGGTPATSTERDPEVVYTLPGTYDVTLTARNDSDSATITRPAFVKVK
GUT_GENOME018512_00092662-742QANTAPVADFGSSVSGLAVTFTDKSSDKDGDKLTYSWDFGDGSTKSTQANPNHTYAKAGTYSVVLTVSDGKASAKKTLSVV
GUT_GENOME097058_001381373-1457VADFDVPSVMQVGVEYLFDGTLTKSKNPVTEWVYDFGDGSENSSERSPIHAFGNVGNYSVTLKVKDAFGNSDSIQKEITVVEHTS
GUT_GENOME220293_01300398-464ITFEGSKVQMVNQSTDADTYLWEVPGAQPETSTETNPTFMFPQDGDYNVKLTVTNERGESSTSEKVS
GUT_GENOME044469_00166202-283PQPDFTFEMDEKKPFEVSFSNKSVFCLDYEWDFDDGSSTVNEKEPQKHVFPGVGIYNVKLTSKGTRGNLVTKEKVVHIWEDI
GUT_GENOME233268_006955840-5925KVVPKAGFTVDYEEGCFPLTVRFSDASQEATKYNWDFGDGTTSTIRSPTHTYQLPGNYEVHLTVPGPDGISSDTVGYITVYDHPIA
GUT_GENOME012340_0009549-114GLEVTFTNTSEGATTYKWDFGDGETSKEASPKHAYEGAGEYTVKLTAANADGVVNRKEQTLTIAGE
GUT_GENOME224004_00177285-349ETGDVVEFIDITEGEVVSRHWNFPGANPSESTEKMPRVYYTDDGQYDVTLTVTDAEGLSSTVTRT
GUT_GENOME097369_00697117-198MTIDFDFSIALNDIAPGVVSITNKSKGGSKYEWTFEGGVPSTCSKRFPEAITFAEGGEHKIRLRVFNGSKYEELNKTFTLRP
GUT_GENOME055461_0011820-116GSDSIEGGAASASIAVEASANEVGYGEGVTFTLQNVTESDIASVCWDFGDGETSDAFAPSHAYRIPDDYTVVASVTLSTGEVKEYKTLVKVFAEEIN
GUT_GENOME157353_01537219-307KANFTTDACYELINSQLVGFSNSQIKMVNLSTDAKSYLWEVEGAYPSTSTEKNPSFFFPKSGTYNVKLTVKNSRGSNSTTSSLDVSVLG
GUT_GENOME256084_00336635-716AGFDVSSTFIAPGESVTFTSTCSLSATDLHWTFKGADVETSTEENPTVTYSNEGVYTVQLVASNKRGEDTVIKENIITVSEE
GUT_GENOME238306_00989610-694KVRFTNLSHIMTLYNNVEEHHYDEKCDEYEWDFGDGIVASDKNPVHVFPQEGGSFSVTLRASIAEGVCFHDTMVTITIPAIGDTR
GUT_GENOME140283_02621209-282IQEGGQVHFKDTSTGNPTTWEWTFEGGEPATSTEQNPVITYKTSGNYPVKLIAKNANGSNEYSRTDFVMVKGVA
GUT_GENOME266647_010731019-1090MSLLKGETVTLTDKSRYAPTQWNWQLDSRRLSLRTEGRHPQIRMDEPGTYDITLTATNAQGSNVATRRHALV
GUT_GENOME015171_01335315-370VAFRDRSYGNITSWKWDFGDGTTSTEQNPVHEYAKDNARYVVTLYVEGPDGKDLCC
GUT_GENOME023450_00871965-1057TADFDMTNPQPRCGERTSFLPKNMAPGCTYQWSMPGAEVATATTKNASAVYTTTGGHDVTLTVIMPDGNKLTKTAVVEVMASKPEIAYTVDSI
GUT_GENOME005228_0182943-114NEYKLDYLVGSVIQFTNTSSAKGNCVWEFGDGTVVSNEENPQYKYTVAGTYTVTLKVEGEGQRNYRLLINDI
GUT_GENOME061155_03710712-781PTAGFEVYKEGYEVTFGNRSENGISYEWDFGDKTPASVLKNPKHVYAKPGVYKVLLKTSNNVGRAASAQT
GUT_GENOME172458_00633248-335IDGLEVTGVASIDGVNVATGEVVKFADLSQGAPVKWQWAFPGGTPSASTEQNPQVYYKEGGTYNVSLEVIDAQGNNSTVTREGFVTVT
GUT_GENOME191917_01350218-313VTPTSSADLASLRQFELCSFEYNPAYFVRGTVASVHWDFGDGGTSEEEIPLYAFAKTGTWTVTLTVTYEDQSTESATRQITVSGLAWSTFPNAGYQ
GUT_GENOME237433_0093846-124KADFSFSGTGNYPPCTITFTNKSQNADYFSWDFGDGGSSSQRSPNHKYNDSGIYTVTLRASNSNGTDVITKTVNIKQKP
GUT_GENOME064569_01934602-691ANYKDEAPTAVITFEHKKIHIAQKYTFSADKSMDDRGIVKYSWDFGDGTQKTGKTVKHEYLQPGDYIVLLTVTDTKGQTNTKKLKITADG
GUT_GENOME116776_00918326-392VSVTNKSTNASKFQWNWGDGSAETEEREPQHTYDRPGQYTISLNVLGQNGDNCRKEVTVVVHHPAPV
GUT_GENOME221686_015912173-2259PEVNVDFTATPMEVFAGSAVDFSASATGTGSVTGYSWIFEGGTPATSTVATPAVSYATSGSYDVTLVATLNDGSQRTMKKSNYITVN
GUT_GENOME195670_03218201-300ADITADRRMVIEGGVISFKDTSLGRPTRWNWTFEGGTPSTSNEQNPTVTYSTAGKYKVTLVASNDMNTSTAEQEGYITVLPGKDIVLFYPFEGDSKDMGP
GUT_GENOME222975_0014445-113ELGQTAVFKDASIPDEGSRIVAWLWEFGDAKKSVSTEQNPTFVYPSDGTFTIKLTVTDDNGLSATAKKD
GUT_GENOME258880_01046233-324LQITGAADVTSVEVQTGETVKFADASSGTPTSWQWNFPGGTPESSAEQNPEVCYTRDGIYDVSLTVGDGTNTSTKTLSGFVKVTGTKPTAII
GUT_GENOME039779_0020350-104YLVGSTVQFTNTSAISGTCTWDFGDGETSTDVNPTHTYSLPGQYEVKLTVGDDYT
GUT_GENOME033948_013241263-1336QTNTEYAFSASASNADRGILTYKFDFGDGTIEEKNNATCVHKYNKSGNYKVTLTVTDNFGLADTVQKTVAVNEP
GUT_GENOME024439_00055120-207KAANLAPSASFSYTPETVVVDTEVTFTDTSVDSDGEIVARRWTLPDNTTSTDASVKYTFTKGGTFSVTLRVTDDRGASSEVSKTVYVA
GUT_GENOME011023_00553643-722NLPALTLSSLLADDKTVVLTANAYSLTFTRIESYAWDFGDGATGNESSLEHQYADYGVYTVSCTVTDREGKARTEQCQVN
GUT_GENOME157468_01826477-566NLPPVAVISGSSYAKLDSTKEFNADSSYDQNGGVIKSYSWSVSGGATISSGSGTSKITVKMPSSAGEVTVSLSVTDDEGSSSETVTKTVK
GUT_GENOME100276_03443764-842GDNVSVTEGDAVKFDASGSTDNGVLNSYEWDFGDGANSTNATETHSYGSTGTYTVTLNVTDAAGNVDSDTMTVTVNEES
GUT_GENOME246951_0104738-116EAGCVVPVGKQVRFKGLSSGNPSEWTWTLDGGELANISGQDAEVVFEKEGRYNVGLTIVDNGEERMVTLDGGVLAGGTH
GUT_GENOME016866_03641346-425NLPPVAMFATDKEEYKMGEKITYTDQSTDDENNIVKAAWENNSLAFFEPGPKTVTLTVTDNHGATNTYSKVITITNETLY
GUT_GENOME284447_0116955-129ADFDFSVDKSKGSVTFTNKSIGADAYEWTFGDDQDGFSTEKDPVYTYSKPGTYKVTLVSSKGTDYAEISKDVTVE
GUT_GENOME285592_013982562-2644PKAVLNCESIMEQNVEYLFDASLSSDDTGIISYEIDFGDNTEKAAGAKAVHKYSEAGTYKVRLTVVDKDGNSSETEKEITVKE
GUT_GENOME080351_02926130-201FAPTVVYEGEQVRFRDSTAHAKSWEWRFGDGIKIDAIDQNPTYVYRIPGEKIVSLVVNGDIKYVKQKKITVL
GUT_GENOME002570_014001770-1856LDNPYACVNQEILFRNNSTDTVSTILNTYWNFGDGSRDTVWSPRHKYDKQGTYLVKLKIDNGCGWDTISSPVIIYPLPHLEIKSEDY
GUT_GENOME167045_01333258-325GEEIHFTDMSTGNPTSWEWTFPGGVPETSTEQNPVVYYTKGGQYDVTLKVADAEGEDVKTIEGFVRVT
GUT_GENOME169987_01301474-543VADFNVDMTAVAVGGSVRFTDNSWGAQITNWDWVFEGGTPAASTERNPIVTYNQTGTFGVRLTVTNADGE
GUT_GENOME252583_013161934-1998FDASGSSADEEIQSYTFDFGDGISKTITQAVMKHRYAEKGNYTATVTVTDISGKTSSVSRNICVD
GUT_GENOME157195_00727198-274HDDAKINEVIAFTARKSKKKSKGAEIVSYHWDFADGTTSNERDVKHSYDKIGTYDVVLTVTDKAGRSAKAIRRIMVN
GUT_GENOME142610_01174430-528PPVALINAPKTVKAGQEFVASAAGSYDLDGYITDYFWDTPNAKGELSTLPRGTLWYDKDHLGEQTLALTVMDDDKMTGSTSTEINVIEPKPDASIRVDG
GUT_GENOME213364_00747238-323AVTAPGTVDGINVATGETIDFVDTSTGDGITSWEWSFPGATPPTSTEKNPTVYYTLDGAYDVSLTVTDGAGQRSTATLPAFVTVTG
GUT_GENOME233265_01200970-1048FDVSHLSIIKGDTIELRDHSRYSPTSWKWQLSNGHRALLVDQPSPMVVPTAPGIYDVSLTVGNALGSNTSTRKKILIVG
GUT_GENOME100276_00912337-412NEPPTAEFEPNETRAKELDPIRFDASATTDDGRIDAYRWDLNSDGEIDAEGESVTHAFETPGTVTVTLQVVDDLGE
GUT_GENOME117761_019002564-2652DIAPVAMLPETVDAMEGEGITFDGVLSTDNIRIGRYEWNFGDGTTAVGVRPEHVYEAEGNYTVSLTVSDAAGNKDTATTTVSVGNKTKD
GUT_GENOME031621_00096571-679VENVTNKPNAWINRQYVAKIGDTLELDGAGSYSANGKIVKYEWDVDQDGVYDVTTDKPFVNYTFTKEFSGLLTLRVTDSSGLMNVATTTLTVSDDGDEHEREFDNCPDV
GUT_GENOME119498_0022734-118TDGGKELVADFDYVVEEGGTGKVTFTNKSQNATDYEWSFGDKKDGSSIMENPVYTYEATGTYKVTLTAIDAKGNYKSTEKEVEIV
GUT_GENOME056785_017242334-2415PEAILPENIMGVSGLELGFDGTASFDNVRIVTYEWDFGDGSGKKGTRPTHIYEKPGTYTVTLTVSDAWGNQDKTTTNMIIRD
GUT_GENOME132203_00827527-611VPSVAGFTSGTDGLAASFSASTPAGAGPFSYVWNFGDGNAATGQQVEHTYAEAGLYTVVLNIFDQQGELESTVNSEVDVKAIAPI
GUT_GENOME259628_010332549-2630APLAYITASTVTVEGYELVLDGIQSADNRGIARFDWDFGDATTGTGAKVKHIYSQIGTYTVSLTVTDTSGNTGSYETEVKVL
GUT_GENOME096287_01626566-638SVADGLAVTVDASPSSDGDGSVVSRTWTFGDGASGSGVTATHTYAAGGTYDVTLTVTDDDGASAQTTRQVTVA
GUT_GENOME033285_00972637-721PTAIITLDKEKFEVGETVNFSAAESYDDHTDKENLKYSWDFGDGKKSGSLEATHKYYLPGTYVVILTITDEKGRTNTAVQEIEVE
GUT_GENOME096511_01803111-171AAGTATASFNISQNGLKIKLENTSSSVNSLLWDFGDGNTSTEISPEYEYSSSGTYIIKLTI
GUT_GENOME096430_01211927-994SGLTVSVDGTASADTDGTVASHEWDFGDGSTGTGATAQHVYAAAGTYTVTLRVTDDDGATGTSEQVVS
GUT_GENOME237059_00496365-435SACDLAIKFKSTSTIERDTIKAYLWDFGDGMTSAMESPTHVYDAPGSYMVTLKLTSGNGCTSTVSRAVQVN
GUT_GENOME096430_01234416-501GNRPPTAEATITNDPASARSFTFDASRSFDLDGGALTYRWDFGDGATATGALVTHAYGGTSTEPVTVRLDVTDSVGATGTATLTVV
GUT_GENOME037952_00770622-691AGFEVSASLVQKGAEVTFTNTSSTSAAEYKWSFPGATKATSNEAEPTVVYTKPGRYDVTLTVRNRFGEDT