UHGP-MC 7854


Information


Number of sequences (UHGP-50): 172
Average sequence length: 111±10 aa
Average transmembrane regions: 0.01
Low complexity (%): 3.4
Coiled coils (%): 0.73
Disordered domains (%): 1.18

Pfam dominant architecture: PF13684
Pfam % dominant architecture: 58
Pfam overlap: 0.66
Pfam overlap type: extended

Downloads

Seeds: MC7854.fasta
Seeds (0.60 cdhit): MC7854_cdhit.fasta
MSA: MC7854_msa.fasta
HMM model: MC7854.hmm

Sequence list (filtered at 60% identity)

Protein Range AA
GUT_GENOME081552_01359 4-116 QDYLMRLIHEMVRTIIKLIFNIDEKTVNLEQELKETSDLYGKLVKLADAGKINEAENLLYEQLENGQREDLKAALGFYDHLNDYTEEFLDKADFSREEIKSGLVSVLRMYGYE
GUT_GENOME096372_00554 140-220 YRQPATVLYKLMEYCTAAGSYDEAENLLYELKGRESSGDGELPEWGLAFYRRLLACSDEKLQAGKLPRDEVESGLREWQAA
GUT_GENOME223343_00269 26-137 QDWVMRQVEMLVQFIARTLFHKDYINYEIVDEANLTGIDLLYKKLMKMIEEGQICEAENLFFDNLDEKDPEYLKLCLVFYQTLNQMSDDELETHNFSRQEVSDGLEKATKKF
GUT_GENOME188428_01308 3-115 EDWLMRQLQIVPKQLPTIKQTNNDPLIEVEDSLTGEKRSVSLTQYMHENVLLRNICEAEDVFFAHLHLLTSHQITFISSWFYRKLANISETELLQANFSVEEIRQGQDEIQRL
GUT_GENOME103863_02937 4-119 EQDYVMRLVKDLVRFLMQLITGKPQFRYETDAEEPSPCGDDYTRIIAMADAGRINEAENLLYENLDTDNEDYLLMGLSFYSHINDYKDDFLLESNYSREEIKDGIENFVKEYGVTG
GUT_GENOME099642_00572 4-115 NDTFMREINDIIKMTASIYTDDLDYEYENIDNKTKEDKIYDELMELVSSGDLNAADECIFDNLDPNSMNYLKLALNFYKKINELDDDELKSMDFERGEIKSAVLEILDAYGI
GUT_GENOME171359_04985 119-236 YVRRSFKSLQLLLEAMLHGSDRRLLPVMETTEGLLRELKSYRMPDELLERLWAWLEREGRYAEAEDALFRWVRQAAGRPEEAEQRKRQGVRFYERLAECPDEALDQGGLPREELADGR
GUT_GENOME198438_01481 4-119 KDDYVMRMIKDMARVLARLILGKDDINYVLPEDEEFTVIDNLYKKLITMADAGQINEAENILLNELEDKSSEYFEMAASFYLHLNEYSDGFLDAHQYSREEINEGMGNLGKEFGVD
GUT_GENOME030215_00463 6-120 DRMSGNIQDIARLITRLLLKGDMPQYTLPAAEADYTEADRLFKKVIGFADKGDINGAENELLMNMEDDDPDYLELALTFYLYLNDMDVDYLDDHDYSREEILDGLKSLAEDWGVA
GUT_GENOME241809_00031 8-115 IMRQIKAFAEGIGYMLSKYKNNTDTEIVFHLDEPLLPHQSELQQLIAKKQYATAAKRLLSLRYAMPEMEFFKLGVWFYDTLNQRSDTKLQQNHYSRAAIIVGLKQLQE
GUT_GENOME018865_01676 3-121 EQDYIMRLIHEMVRTVMKLVFGLDEEEEEELRVLDTMSADNGEKLNRLIDLADEGKINEAENRLYDLLEDKDPDALKIAIAFYDHMNGFDTEFLDEADYSREEIRDGICDVLKRFGYGG
GUT_GENOME224801_00329 7-115 MKQLEIVPLKLPSIKVTNDDPLIDVEDDITGETRSVPLTQLLLEKIVFLKLADAETILFDYLDKLTTSQATSLAGWFYKKLDALSDEELKRGNFSRKEILQGIDDIKRL
GUT_GENOME142287_03613 11-110 IKLMVARLLLGKKYSEYEEKDLVYNTEEEVILITLKRLVFQGNVNEAEEILFDKAKSVNSENMQYIAIEFYTMLMEKTDEELEAMNFSKQEVYQGIEDVT
GUT_GENOME208654_00946 4-113 ENDIIMRQVRDMTRMLAKILFGKNTATYEYKEEDRHTATDSLYARLIALVDAGKINEAENRLYEELERDEEGTFEVALGFYDYLNELPEEFLEEHDYSKEEVKEGAQSLA
GUT_GENOME274183_00805 14-116 DTIRLCMYLICGASQNKYKQPKFVENSAEDSIYYELTALIDAGKLNEAEDVMFDRLNPRNADDYYTMLCVYDYMNGLDDDYLEANDFTREEIEDGVQEITGIY
GUT_GENOME237487_01674 6-121 FQQDFIMRQIELIARYIAESVFKKKSTEYKIIEKENLSDTDKLHNKLVNLINENKINEAEDLLFDSLDLDNKKYLEVAIDFYSRLTKLDDATLELNDFTKEEIAEGLADVSNKYGI
GUT_GENOME217881_01477 2-120 FEEDYVMRIIKEAVRALLKLLFGIDTTNPSVELLEDAQMRDRITEMTDMVDDGHINEAENRLYEVVDVSRREDLQMVLIFYSYLNDLDDDFLDDHDYSREEIGQGLRDMIGRFGLEGMA
GUT_GENOME137125_01157 3-120 IFENDFLMREIKDMTKLLGNVFLHRKESEEIEEQDLQDEEYRKLLRQLKKKLEEQQYREAVAELKDAFQQGSMEYLTIALYCFDAINAHEENELAKAGYSRNELYNDLSFISEQYGIR
GUT_GENOME161399_01110 4-119 EDYIVRMIRDMGQMLARVLGSSAWEPEETAEQWVERTSGSTPLWDELCRLCGLGEINRAENLLFEELDFSDESTFPIALGFYEHLNHFSDKELEAWDYSREEIFDGLRDCAEQYGV
GUT_GENOME255710_00670 5-120 QDYVMRMIHQLIEVLLGDIFKGKEKVLEENINKSQEQTEKYDKMKMLVDNYEINQAENLLFDNIDIDNIEDLKLALLFYQYVNDKDNEFLKKSNYSREEIEQGIKDIAKKYGYENI
GUT_GENOME103364_00118 3-118 MFEKDWIMRQIKDLTKMIAIGVLHKAGPEYEIVDEQNLSESDKLHLQLMKLLESNRVKDAMQLLQDSITNTDLDHLKVAMDFYDRLSALSDEKRLAAGISISDIQAGLDAVTSCYG
GUT_GENOME191543_01316 4-120 FENDWMMRQISSMTKMLRMMFFHKASEEITAEEELKDEETKVYYHTIYQLIQEDNYAQAMNYLVEHFTDKNLEYLKTALAFFDYVNAKEDSELRAHGYSKDQLYSDLNFITKQYGIE
GUT_GENOME142586_02627 14-111 LGRFLFDRKEEIMEKINIEKLTPKDIFKICFNKLFHEGNYNKAEDLLFDELKKNNSPDVYEIALQFYNSLEKKSDEELYAHNFSRDEIYQGLNDIKKF
GUT_GENOME254339_00350 4-117 QDYIMRQIEMFARTLALLVFRKEETAYELREEEAETQKGRLYRLLRQLTEEGKINEAEDRLFEALDPEDLSLLEIAIDFYMRLNAMEEEELERAGYSREEIQQGLAECARFYGV
GUT_GENOME098041_00414 4-113 DYLMRQIEEMARLCSQVLFAKHTEPLPVFDEQGNVTENGVLYGRLRMLCSAGRVNEAENLLFERLDAPGGENFLPAALQFYQDIQKWEDAALEQAGFSRTEIWDGLAGVR
GUT_GENOME285167_00352 26-121 LDNDKDFIMKQVKSFAEGLGYVLGKKRNGDIEVVFQQENQQKNKFLAEIEQYLEKKKYREAIQSFFKLKYEPEPGEYLKIGNQLLDKLKARLADED
GUT_GENOME104734_01360 5-113 EDWFMRQIEMLLSTIIHILSENKETQESEQDLEVRWQTEQLLDEKDICAAENYLFEKAEQFSGNPVFLKTALDFYSQVNKMSDEELDQHNFSRDEIYDGIKEMCRINGI
GUT_GENOME207567_00219 4-117 QDYLLRQVESLAMSLSKLFFNKDMVQYKLEEKDNLSDTDILYSKLGKLVSEGKINEAENLLFDEVDKSNIEYLRVGIDFYTNLNRLEDKVLEAHNYSREEIEQGLHDLSSEYGI
GUT_GENOME158598_02105 5-117 DFILDQIDMIERFLQTVLFESPDELSGGIDNIRFFDDQELIRSDLQRKIENQRYNEAEDFLFEEIEKDPKNDEMYKLGMWFYHKLAQIDEDTLEEHNFSKDEVLEGLQEIERM
GUT_GENOME027324_00164 8-119 LLKQINVVSEFLQKLFTDMETNRKLNENEQYQKDSFEFERLLENLIEEDKINDAENILFEKLETNNLMYATIATRFYDKLKGLSDEKLQKSNYSRDEILQGLNDMCDMFGLE
GUT_GENOME046228_00221 25-114 SDIDDFNIAYSPTEEFLLITIKRLVLEKKINEAEDVLFTSLEKNRNDNMLCIAGEFYTMLMDLSDDELKENNFTRAEIKEGINDVKELFN
GUT_GENOME078308_01182 3-118 QDDYILRQIREMVRAVMKMLFQVSAVELTPDVIEDTDARQILTNLTDLADNGKIDEAENQLYEMTCDGDRQNLEIGLLFYYHLNGKDDEFLEASNFSREEIMMGIQDLAERYNLSG
GUT_GENOME034183_01202 6-118 DWILKQIDMLIQFVARLVFHKDTAAYKINDESNLSQTDLIHEKLTCLIKEGKFGDAEDFLFDNINESSIDYLEMSLDFYQRLNAITDTELEENDFSREEIEHGLRDILKKFNV
GUT_GENOME072968_00601 70-186 REDFLKRQMEGLIRVYAKLLFNKEYKGEEWEELVPTTPQGEPISLQDDLDRMLSLGEIGQAEDLLFEVLEEAVEQKVSRRECEELVTWFYEQLTRMSDEELHRGDFSREEIIDGQAQ
GUT_GENOME217916_01486 4-98 EKDYILRMIKSVGQVIMKMLQLLEMADKGAINEAENLLFRKADTRDRSYLEMAMRFYQHLEEYSDDFLLAHDYSRIEIAEGVKRMAGEYGITGLE
GUT_GENOME188425_01743 19-134 EKDYIMRMIKEMVRVLFSLMLGKKYVSVELPDENKYNVSGKGLDDFLRMVDYGQINEAENMLLENLDYDSREENLAAIRLYQYIGEKDEDFLSKCNYSKEEALEGLKMLAEKAGFG
GUT_GENOME013723_00520 12-118 QVDIIAKTIIGLIFGKEKLMEIIQDREETNDIQTKANEQYLSCVIEKLLRENKINEAEDLLFEEIKKDPTPGKLEIAAKFYDTLNKYDDQKLSECNFSHDEIIEKIG
GUT_GENOME266146_03345 2-119 FEQDYVMRLIKEMVRAILKLLFNIDTESPTIELLENKEEKETLENLIDMVDAGEINEAENRLYDLISATDMNSMEVAILFYSYLNDKTDDFLEANDFSRDEIKLGMENVADNFGLNSI
GUT_GENOME231922_02678 9-96 GENDFLLADTYSKDDIFWITLRGLISKGKIDEAEDMLFKEAEKNPSMEIYEIGEKMYSLLANKSEEELKKYNFSKEEIDLGLEDLKRL
GUT_GENOME140104_00161 3-126 YEQDYIMRMIKDMTRMIAKLLLGKDAPQYMLPDAQPDDKGLDGDSGSFYRRLIQMADAGEINEAENLLTDYLDQGSGSKEELEVALGFYVYINEMSNDFLDEHEYSREEIYQGLESLSTQFGVS
GUT_GENOME013208_00558 6-126 DWLERQIEAIGQGFAALLFGKNRVKKVFERFEEEHQEVKNQKMDEMLLDVLINQHLNEGEFLKAEETLFSFIEKEQTPHMLMTALSFYNQLSDLDDNKLKSSGFSKEKITQRIEKLKQIYN
GUT_GENOME136783_00498 5-120 DDWLLRQIEVFGLFLRRLLNGYKDEKMSMYELEQMSLTQNTVYKKVCKLIEQNKICEAENVIYEMLDNKNDRDSIEAAILFYYKINKMTELELRECNFSRNEILSGLIKISKMCNL
GUT_GENOME069637_00043 1-74 MYEQDYIMRLIKEMVRTLLKLLFNVDTDSPSAVLFYANLNNKFDTFLEQHNFSRDEIKSGLRDVVEKYEGNDFA
GUT_GENOME096272_01096 6-122 IMKIIKTTLQGLAYVLKGKQSMDNSIDENNNTNNLVLTEEQALRLTIIKYINECKINEAENLLFEAIYSHKSPEFLELALFFYEEINKFSLDKLIDCNFSKEEILDGLNTVKRIYNV
GUT_GENOME207903_02756 6-112 ILRLAEDLGKFAAKALLHKEQEEYENINLDSLSSEEILNILLKKLIREGKYNEAENILFEELNKNPSDDLVNIGKKFYNVLLSKNDEELIRGNFSRQEVLQGLKDMK
GUT_GENOME046951_01105 9-124 YENDYIMRMIHDMIRALAKIIFHKDIDEWEQISFRDEEAGKLFSELDGLLYKKDLKGAQSLLESRLDVECLENLKVALLFYDRLNQLEDSELEEYGISREDLEEQVRNVMEKFGYG
GUT_GENOME101498_00471 4-118 KQDWLMRQIEVLAVAVAQLVFGKGGVEYDLKEEDGQTRSDGVERALAELIEQDRLGEAEDLLFAGMDGVDLAGLARALDFYQTANRLSDETLGRQGFTRQELLEGLREAVERYGI
GUT_GENOME198855_02030 3-121 FEQDYIMRLIKQMVAALISVILGKENKMHELPLEDRYKASEGLLRELLTMVNDGKINEAENLLYERLEQDNRKDIENAILFYSYLNELDNDFLEKCNYSRAEIEMGVREIAKRSGVEGL
GUT_GENOME004102_00610 6-122 EKDYIMRMIKEMARVLFSLAFGKTFTAVELENEDQLRVSGKSLREWYDMVDRGEINEAENLLLDEIHYEVREDVMAAALFYQYVSEKEENFLQEHRYSKEEALFGMEDLMKKSGYQD
GUT_GENOME136644_00675 4-118 EQDYVMRMIKDMVRALALLIFGKRDIRYEIPEENARTEEDDLYTRILQMADRGEINEAENILLTELPKESSNYVVMAADFYQHIAEYSDEFLEEHNYSRDEILEGLESIAREYGI
GUT_GENOME180961_00354 4-117 TGYMMLQIRLLIQALTKLLTGQTGEQMDSTLIEAEAVGKSMGLMDLAAQGRINEGENRLFAALEQSPKDAMLLKEGLDFYGWLDDKEESWLQLFDFSHEEIRAGIRDLAVLMGQ
GUT_GENOME248075_01433 3-119 YENDYMMRTIREMLRVIAKLLGKTTVTYELPSEENDTEEDALYRRLIGMADAGEINEAENMLFEALEENGEDFGGAALAFYEHLNDMDAGFLDDNDYSREEVSLGIKNVARVLGCES
GUT_GENOME167756_01507 4-116 ESDWLMRQLRAFATGLGYTLSRRKGGETQVVFPELQDKPLPHQVELTQLIDKHAYAQAADRLLALQYAMTASEFLKLGIWFYATLNRFDDQNLASGGISRASLVAGLAQLKQL
GUT_GENOME243879_01754 3-119 TQDWMMRQIETLTLAIAKIVFRKDTAEYRPSAAGEDGILSNADRLHMTLNAALKEGRISESEDLLFDALEDGDRNCLEVALDFYCRLNELSDQELAAGNFSRQEIQDGLNDVMEQFG
GUT_GENOME253417_00652 5-117 EGYILKQIKAVAKAIAKILFNRDSYDYVLPKDGKYNQYDNLYLTIMGLADNNKFNEAEDLLFENIDGRNAVYMQIAFAFYEKLAGLSDEVLKANNFSKEEVEQGFIDVLKMYN
GUT_GENOME122437_00270 5-120 DYLVRQIDVMVKYLAETVFKNKKKTYNITAEKYFEINGAGKNVLYLYDLADQGKINLAENILYDKIDEERSLELFEVGVDFYVYLNSKDDDFLESNDFSRQEIYDGLEDLQKIYGL
GUT_GENOME093226_01584 4-113 QQDWLMRQIEAMIQAILAVALGISANEQTATQIEDSSYGKMLEKMIDDGDICAAEDLLFNDLDQSDLSWLQIALDFYSKLNNCSDDYLAMHDFSREEIDQGLRYICTLFG
GUT_GENOME076263_02508 5-116 HKEDFIIRQIQSAARALAKLFFNSETVSYTIKDESNYTQSDLIYLKIKELMEQHKYLEAQTLLLEHLDSDDLQYLKLILYFYDQLDQMNDEQLAKYYLERQIITNNFIDATS
GUT_GENOME121797_00399 7-120 KDYLMRIIKEVARVLASVMLGKKYVQVELPVESKYQISGDKLHHLKTMVDNGEINEAENELLDRIDYSDKEDLAEAMFFYEYAASKGDEFLEAHNYSLEEIKDGLQQLAQDAGY
GUT_GENOME138452_00212 6-116 DYIMRLIHELIRTLIKLLCGADPDRSEEELLPAAKKGRYLSLRQMLDDGEINQAENLLQEELDIHDRADLEMALLFYRSLNQKSDEFLEDHNFSREEIRDGISYVVDLYGY
GUT_GENOME006950_00294 13-127 QDFVLRRLEDQGRFLARLILGKDEAHYELPEYEAMDNRADALYRKLLALVDDGEINEAENILLDELDTGDLNMFEMALCFYLYLAHLDEDFLEEHRYTREEIGEGMEALAEDFGV
GUT_GENOME181699_01521 3-120 QNDYFMRQIEMLGRFLAKLIFNKETTVYEVIIDEEGNITPAGLLYLELNTMIKEGRINEAENLLFDRIEATYDNEYLEIAIDFYSQVNNLTDEFLEEHDFSREEAMEGLSKVKQLYGI
GUT_GENOME037207_00625 6-115 DVLLRQIKSFAQGLGYMLGKAGGNPGNEIVFPQTDARVFPGQERLEQFLTVKQLPEATRFFFSQRYAMAEKDFMALGAWYFERLNALSDEALADANYSRQAVEAGLKQLA
GUT_GENOME264872_00121 5-121 QHDWFMLQIQMMVQFIAKVTFQKDPFEIDALFDTEWGHTETDPLEKEIAALFAQSRFQEAESLVLKRLSSKQPAALKTALSFYNALNQLPDERLNAHGFPRERVRRGLEELTRLYQI
GUT_GENOME188419_01237 10-124 ILREISMLIRFVAKTVLQKEVDGEEIILYDQNTQQKNGTLDAELALLVRQGDINEAENLLFETLEEERTQENFASAVQFYLTLHGMGEKELARHDFSEEEIAEGFADVRKLYGIE
GUT_GENOME096414_00210 9-111 EAIKRMVAKLLFGKSFDNYEEIMISHSIEEEVLLITLKRLAFEGKVNEAEDMLFDNAKNSKVENLPYVVIEFYTMLMNKSDEEFKEFKFSKEEIYDGIEDIKK
GUT_GENOME243835_00233 4-117 QDWILRQIEMIGIVLARLLFGKDKPDYEIVEKEKLSDSDQLYIELIRMLDEGKINEAENRLYEELDSSRLHMLELALAFYSRLNQFDDAYLEAHNYSRQEIEEGLQSVTKFFGI
GUT_GENOME096272_02885 135-216 KLSQYELPIETEKMVLKYYLKKGKYSNGEDLLFSMIKKEKSEEIFSLGYEFYDTLKKKSSEELELGNFSLEEVDFGLEDLNQ
GUT_GENOME059581_01804 5-120 EDWMMRQINMMARMLAQLALGEDTSVRYDLHREIKPGEAAGDLRSMLTVMIAQGRYGEAEDLLYDSIRPDHPEDLKLGIDFYTNLNCLPDDALEAGNFPREEIQSGLEDLSRFYGI
GUT_GENOME215246_01584 4-118 EQDYIMRMIKGMGQIILKLFGKSGAEERYATNYAATPLEELYQRLLELADRGLINEAENLLFREMDRSDKAYLEMAMNFYRHIEEMSDDFLAAHHYSRAEIAQGIQMGAAEYGIT
GUT_GENOME155588_02206 6-114 LILDIVESLGRNIGKALSDEKSESETITIENLSDKDMLLLILKKMIYCGKYSEAEDTLFCFAKKNKDSDFTSICEWFYKELSLKSDKELIDNNFSRDEIKQGLEDFKRT
GUT_GENOME231391_01994 2-106 FYEQDWILWQISIIIKFIRSLLSKESNVYEQDEEQTVFKEEIEKLIELNRLKEANSLIVKTSKDKDVLALGLWFYALLNDLSDKKLEEGALAREDILKGLRTLLY
GUT_GENOME207914_01218 2-110 KNDYLLDMVESFGKNLGKCILNKNEEPAPIEFGDWSDKDTLLAILKKLISERKYNDAEDVLFEFLEQDRSENIKSIGNWFYNSLSKLSDEELVSGNFSREEIAQGYYDF
GUT_GENOME230905_01737 2-96 LTEDWMMRQVDALARSIAYLVFQKESTGYVPGGTAEDAALDELHRRLLEKVNAGDIGGAEDLLFAESDPGDRRYLELAVDFYARLNDLTDAQLEA
GUT_GENOME063305_01578 21-148 KNSIAGTLRLVGILLSGAWNDKYKQPRFVENAAQEDFYQRLLQQIDRGEINEAENELLAYVEEAESRLLETQERRKRAEDTVLNMGLSVYGYMNEKGDDFLEKSHYCRDEIEDGVRGLLMRFGVRIPV
GUT_GENOME000279_01809 4-119 NDYIMRQIEGLTRFLAKVLMQKDMGSTDVIDEQGHLDPGNFLLYRLQGMVGQGAINEAEDLLFDTIRAEPREAYLKVAIDFYSLLQGMEDDALKTADFTRQEIAEGLRDVRAIYGA
GUT_GENOME156675_01145 9-114 DLAESLGKNIANSITNDKNDSDRISIESLSDKDMIFIILKKMISEKKYNEAENILFEFIEKNKNDLISETVEWFYKELSSKSDKDLLENNFSREEIELGLNDFKKI
GUT_GENOME007418_01014 8-111 TALLAKALFKKDSPIVPDNAFPETSEPGIVLKRLRQLVVEGKVNTAENLLFDSFDKKKPYFIAIGLDFYARISEFTDEALKEADFSREEIQEGIGDMLKFYGIK
GUT_GENOME028491_01506 4-116 KDWLLNQIDDLVEFLAKVFLNQTTTEYLPENIVNSTSDELYNSLDRLICEKKINEAENLLFDKIEVQNPKHLAIAINFYDKLNKLPDETLEEANYSREEINQGLNDILDKFGI
GUT_GENOME096045_00316 9-120 DWFLFQIQLLALAIARIVFGKGSLEYETESEEIPTTTDLLHRRLLELLAKKRICEAENLLFDSLDPHNKSGLALALDFYDRLNKLTDEELEQNNFSREEIDSGLRDVMKLYE
GUT_GENOME252036_02618 3-120 KEDYIMRIIHEAVRTLLKLLFNIDEEKEDEIEFEDIELEKAHRRLKLLAQEGKINEAENQLWELLDGSDREAFKLALLFYDYINTFEEEQLDAAGYSREEIDEGILAVAKNYGYEGMV
GUT_GENOME097687_00320 4-117 EKDYYMRIVHEMVRMLIRLVFGKDIDRNDEEAVPLEVMERYRKLTAMADDGDINEAENLLLDGLDQGSRAYFELALMFYEKLNGKTDEFLEEHDYSREEVLDGIKYVVDYYGYG
GUT_GENOME027124_04550 32-142 DFDLRLTNELIETYLKIAYGIKPGCWLDAEKGLLQENTLYPKLKQMINDGNINDAEDILYEYANPINPDILKVGLFFYYDLNRMADAELEAADFTRDEVKEGVQELLKIYD
GUT_GENOME130319_01569 5-119 QDYIMRMIKEFANVLGRLLFNKKTPMYELDVTNRYHAGDDLYARLCRMADEGKINEAENVLYQEMVPGDMDYLEMALAFYAYLNCFSDEVLIRSCYSRDEIKEGIEDFLDRYGCS
GUT_GENOME048724_00134 3-116 ERDYVMRLIHEIIRTLIALVFGRDVDEETELIVRAESRQLYERLKSLVDAGEIEEAENCLWEAAENGEIDFETGLLFYEHLNEKSNDFLEEHHFSRKEVVEGIRYLAECYGYGT
GUT_GENOME194397_01314 4-120 QDWLMRQIEIIAETLARVLFRKTPARRADVVLQQKSGVSGAELTETLLAMVRAGELCAAEDLLYERLDPDDRDGLAAALRFYSALNALPDAELEARNFPRDEILSGLTDVCRRCGLE
GUT_GENOME141063_01192 3-120 FEQDYIMRMIKGETAALARIIFRKTSPQYELPEENKYTAVDDLFARLVRMADAGAVNKAENMLYQELDQSDKTYLEMGIGFYHHLNEYGDDFLVQNDYTREEIGEGIRQLLAEYGMDS
GUT_GENOME057164_00504 404-517 KDYIMRQIKMLGQLLARLLLRSDGVKYELPLKAPYSKADELHQALCRLLAEGKINEAENWLYEEVGQLGLPGLRVAIDFYLQLNERDDAYLEAHNFTRAEIEDGLKNIAQQYGV
GUT_GENOME235575_02037 3-121 REDYIMKLVKEMIRTMLKLLFDIDTDEPSAELLNEAEKQEYDALTAQIDAGEINEAEDRLYDWVEQQPERADHENARLPILFYAYLNQKSEDYLEEHDFSREEVELGLRSIAGKEGLMS
GUT_GENOME047306_00159 5-118 DYIMRKIEEWISMILEFVFKIDKNSSPEKLLKLEESKKVLKDLKSKIDIGNINEAEDSLFEMLKHKTQDSLLIGLLFYSYLNEKDSKFLNEHDFERDEIKTGIKDLLNEFNMNN
GUT_GENOME153805_00900 3-116 QDDFIMRQIEVFARTLAHLLFGKETTVYEVPAGQEDTGAGGLHTRLLALLAAGRVNEAENLLFDALESGESGLLEVAIDFYARLNGWSDEELEQAGFSREEIGQGLDACARLCG
GUT_GENOME000573_00967 10-111 IGRVGAKMLLNKTERKRESVEFGQVNSVEILRGVLKRLVAEKSYSKAEDILFSELENNTSEEVYNIAVEFYNLLIEKSDEDLESGNFSMEEAHQGLIDIKKF
GUT_GENOME284802_00851 5-116 DYLMRQIEDMARFLGKVLFVKTEETIPLFDEQGNVLESGLLYKRLCAMLEEGDANSAENLLFDEVEAHPDPAYLQVAVQFYADLQHWTDAALEAADFSRQEVLDGLAAVKEL
GUT_GENOME055807_02889 5-120 QDYVMRLIKQMMQVLAKLIFHKKEETEVTEPTLTSFGDSEADIDLFALADSGQINEAENLLYKHLDTSDLSQLKLAFAFYEHLNEYQNDFLEEHNYSREEILEGIKNIASEFGVSG
GUT_GENOME122014_00024 5-121 VEKDYIMRIIHEVIRTLFKLLCNVDIDRPEMEQIPKEVEEAFLPWKDRIDAGQINEAENLMVDGMKANSRRDFLCALLFYDYLNKKEEAFLKAHQYTRQEIREGLSYAIGFFGYESM
GUT_GENOME096512_01104 2-67 KNDYMLDLVETFGKNLGKTLFNEVKEVEEWFYNELSLKSGEALEKGSFSRDEVEQGLKDFIKSFKN
GUT_GENOME103750_02046 4-118 DYILKMIKSALNVIGYILKGKKSIEENIDLKRENIVLNEEELLELTIIHLLSECKINEAENILFEAIQKNNSPKYLSLAFFFYSEINEWSNEKLSDCNFSRDEILTGLNDVKKIY
GUT_GENOME261579_00529 4-119 DKDYIMRLIHETVRTLIRLIWNRDMDREEEGAVSFALLQRYQELAKMIDDGQVNEAENLLLDELDPDDPEYFEMALRFYEKLGGKSEEFLAAHDYSQQEVIDGLKSVVEAYGYTDL
GUT_GENOME262233_02349 5-114 MMGLAHELARILILLVLNQDIDKEEMVFPEEEKGTDLFSMLEAGNINEAECLLTKGLEPENQTHFGRALLFYDKLSGKPEAFLEAHDYSQEKILEGLKRVVEYYKFGSLM
GUT_GENOME141701_00935 5-118 EYIIRLIKTGVKAAVALFAGKDAIKSAIDIENYNMTISGDELLEFMIKKYISEGKINEAENILFEVIESQKTKKNLETALSFYKELSKWHEARLLKCNFSKLEIEQGLKDVRKL
GUT_GENOME000943_01202 4-116 EQDYIMRMIKSIVSFFIKLITGKSDTVYELPSDGLYTPCDDIFARILSLADGGKINEAENLLYANVEPDSKDYLLMGLTFYSHINEYTDNFLSDHNYSRQEIQDGIIAFAKEF
GUT_GENOME067046_00033 5-117 KDYVMRIIHEMIRTMVKLLLNIDMDDGKQEVLSDGVMADEYGRLLSMIADGRINEAENRLYEFLDEDNTDHLELALRFYDKLNELDDEALERAGFSRDEVKEGLMRTARIYGC
GUT_GENOME282985_01305 3-120 ENDYILKQIEMSTKFLAKLIFGKESPEYKLQYDEIANAKPIDLLCLTLRRMIDEGEINEAENLLYENIEREARAEYLEIAIDFYDRLSKLSADRLDECDFSRAEIFEGLENVKRIYGI
GUT_GENOME096500_01695 5-121 DYIMRLIRESIKFLAKIFLHIKDVDETNYDLVYEKEDSQTDYLHSELLKLIKLGEINKAENLLFENLDTENKKHIELALDFYNRLNHLDDEFLEESNFSRGEIVDGLKDIAKEFGLQ
GUT_GENOME098568_04121 2-119 FEEDYIMRTIKEMVRALLKFLFGVDTDAPTETLLEEEQSRAVWERLKKSIDDGNINEAENALYSFIDAGDKDALKTALLFYSYVNEKEDAYLEENNYTREEIKTGLQDVAAKYGLDGL
GUT_GENOME246110_00052 5-118 QDWIMRQIEMIIQFLARLLYGSDSVAYEEYLDEEHITSDALFRAVNERLQQGRISDAEDLLFDQMNPSNPYELQVAIDFYERLNDLSDDELEQGDFSREEVEEGLRDVIHQFGV
GUT_GENOME033208_01548 12-119 TIRGAVRVAMAIAGGAFGEKYKQPKFIENSAQDSLYRKLTQMVDEGKFEKAENILWERAEKKDLLDLQLAAAVYSYMNEKEDTFIEQSDFSREEIEYGIRCVMNMYGF
GUT_GENOME114832_01693 5-121 QDWLMRQVDMLVQAFARLLLGKDAGDTLQEEWNESSTAGEADPFAAELNALLDEGKINEAEDLLFDRVGGGDPAYIKTGLAFYARVNRRSDEFLEAHDFSRNEVREGLEDFARIYGI
GUT_GENOME239851_01492 4-115 NEDFLTRQIRAIGSGLGVMISGKNSSGTEIVFPKKQGEKLSYQVDLEMLIGRHQFAEAADKLSRLVYAVPDAEYFSLSVWFYQRLNQFSEATLMAGGFSKGQIMNRLEALKR
GUT_GENOME035511_00929 2-120 LENDYIVKQIKEMTRALINMLFHVDTDSPTEDLLDTQSAKEQYRELCRLADQGELNEAGNELYDLLEAEEKEALKIALLYYSYINDKDSAFLEAHDYDRDEIKSDLKDLVERCGIIGIS
GUT_GENOME152712_00242 3-119 IFENDWIMRQIEGMTDMLGKVLLHREKSEIRIEEELTDEDIKTYYNNIQMLLKEKKYKEAVQYLQENFATGSMEYLKVALTAFDQLNALTEAELLDGGYTRQELYNDLEFITVQYGI
GUT_GENOME142434_01585 4-91 EDNDFILRQIKSFAQGLGYILSKKDNANETVVLFQDGETSLKGYKEELAHCIEENGFSEAIQMLESWRSTRISRAQYEKLFSWLQEKR
GUT_GENOME207742_00719 6-118 DYIKRVIDSIGKMCVAMVSGKNAIESNIEDNNYDMKISEDDLLEIMVKKYVNEGNINEAENIIFEAIKSHKTKKSYEITLDFYKHINDYSDEKLKELNFSREEIVDGLNEIKH
GUT_GENOME125925_01332 3-118 RQDYITRMVQEMVRMLLKLMFGIEAESPNLEMFRDAQARERTERLLMLADEGRIDEAENELSDILESCVPEHMQTGLLFYWHLNEKDDAFLETHDFSREEIRDGLEQVVKDSGMAD
GUT_GENOME243787_00630 5-113 QDDWLTGQIDTIIKLLAQLYFQDSKAITCLDESEKRNLYQEAMALMEKQAYQQLNQLLLEKAETRSKETLKLALHIYDRFNRLSDEELEKGAFSHNRLYGAVQELEKIY
GUT_GENOME217899_01192 2-104 FENDYIMRQIRGMAAVLARLLFGKSATEYVLPAETFGEADRLHLELVKRIEEGRLEEAENLLFDRLDPADPDTLKVALDFYARLEKLDPAYLAAHGFTEDDIA
GUT_GENOME199330_00890 3-108 FSEDWIIRQIEMLIAALIDAIFEHKPSEKPYTSEQITVNDLVDKNKICEAENFLFETAKLKGTESNREFLVTVLDFYQRINSMSEDELNAAGFSHDEIKQGLIDFF
GUT_GENOME158781_01550 4-117 NDYILRQIELLGRGLREMVYHDQEDLSEEVFSGQGVFSAKNYLRHLLRRLVSEGRIGEAEDVLFAHLEREPDPAHLEIALEFYDELAALDDGTLRRGDFSREEIAEGLRDVKAL
GUT_GENOME096049_01741 6-116 DFILRQIRAFAEGLGYILSHGKGGQSEAEIVFPEKQEQKLPYQNELQELIDKKQYADAAKRLLSLQYAMTEAQFMKLGIWFYSELNEYSDDQLVEGNFSKQSIITGLNQLK
GUT_GENOME244177_00443 5-111 DTFIKEINDLINMCLGVFFSGKTYKDNENSKNNKIYRELQNLISDGRINEGENFYYDNIDPDDTEYLQIGLSFYRALNDLDDKFLHENNFSRKEIKIGLEDLLDAYN
GUT_GENOME262398_00595 24-115 YSSLNDFDLVYTPMEEITLISVKRLLFEGKIGQAEDLLFSSIEKNKSKNSIFIAGEFYTLLMEMSDKELEEKNFSRGEIMEGIKDIGWILKE
GUT_GENOME190887_00948 2-118 FEEDYIMRQIREMVRMLLKLLFQLDQEEDSEELLRGTKENEVLRELLEMVDDGRINEAENRVYELCEDGEMANLKVMLLFYDYLNGKSDEYLEECEFSREELKEDMRDLLAGFGLSD
GUT_GENOME245503_01683 6-119 QDFIMREIELIGRFYSRVLFGREMEQEQQEVRLDVLSENYLPYRLHRLVDEGDINEAENELFEAIEEHPRKEYLSAAFEFYRHLSELDPVYLKQCGFSEEEILEGLAEVKRIYR
GUT_GENOME087888_01523 34-153 YMYERDYIIRMNNEVIRTMVKLLLNKDIGTPVDISDGAIKGENRKILEAANGQSGIDNIKKMEEEILTLVDANKEGALEKALVFYGYLNELPEDELLSGNYSHEKIKKGLKMVAEHYGIL
GUT_GENOME062203_03003 4-112 DFLLDKFEGFSKALAKTLFDVKEDVDPIQFNNLSNKDILYIVLKKLLKEGEFNKGEDLFYKELYKNKTQEFYELGQWFYNYLLNQDDKLLEEKNFPRREIYQGLEDLQE
GUT_GENOME243814_01923 3-118 EQDYIMRLIREMVRTMAKLVFKVDIDDPKQIDFENEDLEEEYYNLITMINSGMVNEAENMLLDQLDLSNKKSFEMSLMFYSYLNEKNDDFLETHNYSRKEVVDGIKNVCDKFGYSG
GUT_GENOME000496_03036 3-118 EQDYIMRMIHDLVRMLAKLLLGKDTVMYEFPDEGEFTEGDFLYKRILQLLEAGRINEAENMLFDEMHTDDLGYFEMAMDFYTRLNQLDDAYLEEHDYSREEIEEGIQMAARAFGIS
GUT_GENOME056927_01341 11-123 RAVYNTLRCIGVMLFGQNRFQQPKFLENAAHKDKYEVWLGMIDEGKINEVENELIEETEQRRPDMHPSESEKMAMLEVALQIYKYINDKDDAFLERSSYSREEVEEGILSVMH
GUT_GENOME216267_00112 2-119 FEQDYIMRQIKECVAALMKLLFNINTESSAAMLIKSQEKQSQSDELIQKIDKGHIREAMSELNTVTEKKTKDDLLIGYQFYSHLCKKDDDFLELCRMDFPEIREEMKKYFSQYGSSDV
GUT_GENOME098570_02123 5-119 KDYIMRLVKEIARVLGFLIRGKKEEEDLEEEIQTLPIDGIFKFIIKLADKGNINEAENLLFTQLDRTDMRHLYVAIAFYEHINEYTDEFLMGHSYTREEIIQGLKDIAKEYGISE
GUT_GENOME258161_00543 5-116 DYILRMIEDMGRMLRQVILQEEEDMLEIIDEDGTFSDSDFFGYRVGRLLAQRQINEAENLLFEEIGRDPQPAYIAVALHFYEDLQDLTDEELEEADFSREEIAEGLEALRRL
GUT_GENOME002066_00242 2-120 YEQDYIMRINRDVIRTLSKLVFNRDTELPIDINTNDLKSEDKRNLQTLNGQVDLGEISEVEKDIYQSIERKEDRALEKALLFYTYLNEQSDDFLLAKNYSREQLRSGLKFISSQLGISD
GUT_GENOME018594_00942 8-120 MERQIENLGKTLVGFILGKKGLEGLSEKYEESFSQGTIDEDILERQLKKLIEDGEICKAEDLLFASLEENPTPQKIVVGLNFYATLDKLDKNFLKKHNFTEEEISDGIKEYQA
GUT_GENOME111115_00463 3-119 YEQDYILRQIEILMQGIRRVFFGKREKGAAAFAVSGALPGALWYTRVLERLEAGDINGAENLLFALMDPAEPQGLLAALDFYNRLRRIPEERLLESGFSLAEVEQGLLDAAALYGLD
GUT_GENOME107791_00247 3-117 QQDTILRQIEMLTQALARIFFRKEKVVYEFPEQETGYSDDDVWYARILRLLASSEINAAEDALFDGFDASSVRLRAALDFYSRLNLLSDEALAHANFSREEIAQGLDDIAARCGV
GUT_GENOME171348_01443 5-117 DWIMRQITSFIHMIGKVLFKKDSVELNIHSESNSEKVHLLYERLINLINNLKVNEAEDLLFENIETDDLIYIKIAMDFYDKVNKLSDEELEEADFSREEIKLGLEDILRLYSI
GUT_GENOME096879_03717 5-119 DYATRMTNSVVGTLLKLIFHIELGKNEDTAWKDESYFETYHQLTKMIDEGQINEAENRLFDLLDPSNTEIFKMSLMFYYYMNDKDGDFLEKNNYTKTEITDGLRHVSTLYGYESM
GUT_GENOME004784_01048 5-111 DFLMQQIKELIQFLLGLAYPMGFSDEKKTETQSSSISKKLYALALEGKYGEAEDALFDAFDEGEADLATGILFYESLLRVDEKELRKGDFSLKEIETGLRDYAKLFH
GUT_GENOME000137_02023 5-119 YEKDWFMRQISDMVRVVALTVFQKDRPEYKIEEETTTTSSDDLYRHTLKLLESREYQRALDYLSQNLSSERLSDLLVALDFYDKVNRLTEEELARYGLSREELLYSLQTIITLYE
GUT_GENOME006187_01162 4-119 EKDYILRLIYEIIRTLLHVLFQIDIARQNGPAFKDEQRMEWFRQLTAMIDEGEINEAENELLEGINANSMKDYELALWFYVYLNEKDNAFLETYNFSRKEVLEGIRLTGQIFGYRS
GUT_GENOME164334_01204 1-115 MAEETDFLLRQIKGIAGQLGYILGKRAEGTESAIIFPANKPPLPYQDVLRELLHTQHYQEALTKFTQIRYAMEQDDYVKLGLWLYAALLQLPEKERTEAGLTTAELQKGLNKLVA
GUT_GENOME143421_02323 10-117 MRQVKAAVFNPFKKVSEVQVMPMILVTDEGTGEKYSVPIQSYLSELILALQVNQAENILFSETNQMTDAQLMDLGNWFYDLVENLSDEELESANFSRTEIAQGRADLG
GUT_GENOME055438_00732 3-120 EQDYIMRQIQQIIQILMKIIFKIDTASPETFLIKEIGKREQADDMLRNIDSGNIAEAEQMLFTTIKNRTLDDLLVALVFYSHLNEKDDDFFETNNYSRSYVENSIKRLLSEYGLEHLA
GUT_GENOME034932_02292 1-94 MAEDSDFIMRQIKSFAEGFGYMVGKKDGEKTEVVFEQQQGQGDKIHRDITELLMHQKYEQAIQYVYAQKFTLEEGQYFILGQWLLGKLSDIPEI
GUT_GENOME000202_00438 13-95 LDFESLSGDDMIYIVLKKLVAEGKFNEGENFLFEEAKKNITPRVFNIGTWFYETLNEKEDEELLKANFTREEIEDGLKDFKKI
GUT_GENOME103800_03124 155-214 GKYDKAEDILFYMIKKSNKDKNIISMGINYYERLKNLSDEVLEAGNLSREEIEDSYKELM
GUT_GENOME268811_00180 6-121 DWMERQIEAIGNTFAAILFGKDKVKAILDVDEEENSATEMEDDILDRMVKKHLADKNFNEAENLIFNALERQKTARRFELALNFFNEANEFSDEVLSKYDYSHEEIEDGINTLKKL
GUT_GENOME096464_02476 11-117 MAEDLGRSLMRLIVDDDSDDSETIVVENLTDKDRILIILRKLISEKRYSEGEDVLFQFAEENQHSYVESVGEWFYRELFLKSEDELAEGNFSVQEIEQGIKDFNKLI
GUT_GENOME210257_04115 3-117 FEEDYIMRIIKDMVKALACVIFGKRFTEYEVEEEKADDTDFLYRDIIEMADKGKINEAENILLTDMDQTDKRYMEMAMSFYLHINQYTDEFLAANGYSRQEILDGVEALAAANGI
GUT_GENOME257855_01296 3-118 QQDYILRMIEDMAQFLSKIAHYHEQESMVSIVDENGVIDEIAFFEYRLTKLYYEGKYCEAEDLLFKKLEQTDGKYYINTAIQFYQKLGQTDEQKLTENGYSEDRISSGLKRLRTLY
GUT_GENOME033784_00591 5-120 DDYVMRTISDLVRAIARLALGKNEINYALPDTEDKYSDTDRIYRKLRDLVDAGEINEAENQLYENLDENDTEHLEMALTFYMYLNQLDDDTLFMANYSREEIVEGINSVSASFGIT
GUT_GENOME089277_01788 10-116 TIKKQIAKMLLGKKYSDYVDMDVVYNPDEEIFLITLKRLVFENKINEAEEFLFDRAENNPTENLPYIAIEFYTMVMEKSDDELKEADFDRSEVTTGIMDFREIMGIK
GUT_GENOME256840_00209 5-118 DYIMRQIEMLSRSIAKVVFDKDTNTVDLIRQDDVGGSLSEKEEGMLSKIQDLDINGAENDLYEMLEEGLTPGKLKLALWFYTQLRSLSDEALEKADFSREEIDEGLEGVFKMYH
GUT_GENOME089207_01974 5-119 DYILRMIQEMGQMLARILGSDALDPTDQVQLEALPAGDGLGFLEELKALCSHGQVNLAEDRLFEELDFSAPSALATVLGFYKYVNSFSDQQLEAWGYSREEIYQGLADCGERFGV
GUT_GENOME096458_00784 10-110 LGKNIGKTLMKKKEGSTEVINLKDATSSDLLPIILKGLILKKSYNKAENILFQELKNNTSQENYKIALDFYDSLMEKSEEELNQGDFSKEEVFQGLKDLEL
GUT_GENOME244210_00869 1-94 MDNKKDWLMRQVDSFAEGMGYLLSKQSGNSQSEVVFPQENAKKLPFQKELASLIAANDTRGAAQRLLKLQYAMPEEQFLQLGTWLLSEMKQDSF
GUT_GENOME257495_01001 5-117 QEDWLMRQILSMTEAIARVIFGKTDADYVEDGEFVQTDELHSRLLKLLDERRINEAENLLYEQADPDDIACLLVATDFYSRLGRMKDAELEESDFSLEEVQSGLEDFAGRYGV
GUT_GENOME000203_01604 5-118 QDWLLNQIEDLVRFVAKIIFNKDTITYEIENDSNLTECDLLYKKIKSLLQEDKICESEDLIFKNLDKDNNDYLKLAIDFYSTINKYDDKKLEVYNFSREEIIDGLKDVLNIYNI
GUT_GENOME096571_01660 14-115 VIRFLKTALISPGGGFIEEVEDIPFQSPQLEIKKELERLIKQKEYCQAEDLLYEQLEQEESDDNLCLGLWFYNRLLDEDEETMEEHNFGKDEILAGLKELES
GUT_GENOME111584_01713 12-118 INNTVRMAAIISFGELQDKMKIPKFIENCTRSSLYEQLTHMIDVGEINEAENELLDAIDANEKSDLEMELEIYDYMNDKEDSFLEQYDYTREEIEDGIKSVMVLFGY
GUT_GENOME191339_01692 1-119 MYEQDYIMRMNRDVIRTIAKLIWGRDVEGPMDINADELKSEDKKLLILNEQVDVEKIIELEEKIQEQIVNNKERALEQALLFYSYLNKQSDEFLAANDFDKEKIKEGLMFIAKQYEIAN
GUT_GENOME048800_02097 3-120 EDDYLLRLIKEMVRTVLKLLFHIDFKEEDPVSVVFESEDAEKTLYSLIHLADQGFINDAENQLYDFTQNPENMESLKTALLFYSHLNQLDNDFLEDHNYSREEIVSGVKDVLNRYHLD
GUT_GENOME092011_00125 12-114 IRKAFESLAQILTSHDIKNSDLFEKSEIKTDEAKLYIVLRQLAATGKICKAEDLLFDAFKMRPTPLCLKTARMFYRDINTLTDEQLDKCNFSRKEILDGMWDA
GUT_GENOME111298_00475 2-121 FEQDYIIRQIKECVAAAMKLIFGLDTETPASMEFQNKEKQALSDELIMEIDNGNIKNAISNMYLNTLDKTKDDLLIGLIVYSHLCEKDDDFLDSNSISFSEIKESAKEYFSEFGLSSVVD
GUT_GENOME158306_01332 5-109 VQRLAHMLARLVFHKDAVFYDTSAGTQELSALQAHLIDLLQCGEINGAENELFDAMPPDDIRYLEIAIDFYARLNELTDDELEAADYSRQEIEEGLRDAATHFGV
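
Each row of the sequence list carries a protein identifier, a residue range, and the amino-acid sequence. The following is a minimal Python sketch for splitting a row into those fields. It assumes that identifiers follow the pattern `GUT_GENOME` + six digits + `_` + five digits; this is inferred from the rows themselves, not stated on this page. `parse_row` and `span_matches` are hypothetical helper names. The pattern accepts rows with or without whitespace between the fields.

```python
import re
from typing import NamedTuple

# Assumed row layout (inferred from the listing, not documented here):
#   identifier = "GUT_GENOME" + 6 digits + "_" + 5 digits
#   range      = "start-end" (1-based, inclusive)
#   sequence   = uppercase one-letter amino-acid codes
ROW_RE = re.compile(r"(GUT_GENOME\d{6}_\d{5})\s*(\d+)-(\d+)\s*([A-Z]+)")


class Row(NamedTuple):
    protein: str
    start: int
    end: int
    sequence: str


def parse_row(line: str) -> Row:
    """Split one sequence-list row into identifier, range, and sequence."""
    match = ROW_RE.fullmatch(line.strip())
    if match is None:
        raise ValueError(f"unrecognised row: {line[:40]!r}")
    protein, start, end, sequence = match.groups()
    return Row(protein, int(start), int(end), sequence)


def span_matches(row: Row) -> bool:
    """Check that the inclusive range spans exactly len(sequence) residues."""
    return row.end - row.start + 1 == len(row.sequence)


# One row copied from the list above.
example = (
    "GUT_GENOME096372_00554 140-220 "
    "YRQPATVLYKLMEYCTAAGSYDEAENLLYELKGRESSGDGELPEWGLAFYRRLLACSDEKLQAGKLPRDEVESGLREWQAA"
)
row = parse_row(example)
print(row.protein, row.start, row.end, len(row.sequence), span_matches(row))
```

Fixing the identifier width in the pattern is what keeps the split unambiguous when the fields are not separated by whitespace, because the trailing digits of the identifier would otherwise run into the range start.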