UHGP-MC 4724


Information


Number of sequences (UHGP-50):
147
Average sequence length:
120±15 aa
Average transmembrane regions:
0.03
Low complexity (%):
0.62
Coiled coils (%):
0
Disordered domains (%):
2.96

Pfam dominant architecture:
PF00150
Pfam % dominant architecture:
476
Pfam overlap:
0.38
Pfam overlap type:
shifted

Downloads

Seeds:
MC4724.fasta
Seeds (0.60 cdhit):
MC4724_cdhit.fasta
MSA:
MC4724_msa.fasta
HMM model:
MC4724.hmm

Sequences list (filtered 60 P.I.)

Protein Range AA
GUT_GENOME135092_00563207-286GDKYKLSGWVRGENVKGQAGFYIHEGQKNIHKDQNNGSFDWKKIEFEGIIPEGVKELSFGTVLYATEGKAWFDDLQVNIE
GUT_GENOME213024_0162442-163AAIQNDALHFFRKNIGDSLIQHPLPPVDLSSRIIRISGEAKGERLSSSSMSAPYHGLKANLMVQSGKKKNWFGMAFPAGSFPWRRFDSIFVMPSDIESGTLYLGLQHAAGIAQLRNLVVESI
GUT_GENOME015047_0141837-161RLEGGAGTRDGCMKFENAKPDGYSSATLWFPARPGELYTFRAKIKGENIGEKQHPMHGVVFSLFTREGGKWNPLGKPVPLKGAFDWTDVRFTARIPDYLPDNSGNLIVGLHYASGTLWVKDVSVE
GUT_GENOME101861_0059738-147VICEVPAGSAKGIHGALFTVDLKPFRNRYIVWEVMGRGDNISKPPERHFGQKFILTYTTPDGKLHYAESNGRYGSFGNTKLAFGAMIEPDAETGFLRFGMQHVSGRMEYD
GUT_GENOME251946_00708522-683PAEVKAFAAEVEPECAGDAPLPVIDLNSPEFHRKLTSAFQVTQQDGSPVLTVVNPDRCTNVIFLPLDKKRLAGKEIILSADIRQYKVSQRSKPYNGVKLMLVLTGADGKRNYLQASTVDGSCDWTRRTLSLNIPADVRKVELALGLENVSGTVDFRNIRLEK
GUT_GENOME063123_0225860-172VKPGNETKEQHFAIREIDLAPYRGNQLFLTYEVKGENVSVPPKQWNGVKLMLHFKTPQEEVWRGAGAPTGSFGWRTASATAFIPAGATAGTINLGLQSSSGTVWVRSVKITAS
GUT_GENOME007886_0069945-167AIQLTVPENVEPAKADIRATLPVKIDQYAGKSILLAAEVRGEGISTGGKRWFGGKIILSIYNTENGTYENPTLFVPERGTFGWKRVSQKITLPAAIRSATLKCCLSGVSGTFLVRNITITPVP
GUT_GENOME232996_0077848-173DGAAVLKITVPADKNGGSVSIPLPPGSVAGKKVFFEAEIAYRNVTEPVREPWNGVKFMVPFTDEKNKRTWPQWNGFSPAVPRWGTHPRKYTFWSYTFPKKVRSCSLVLGLESSSGEVEFRNVRFST
GUT_GENOME007886_0189949-169LPTGGPNQSYALEAAGNVMPDIPLAGAAGKTVVFSGMIRGEKIEAKDPKFGGVVFLIHYKLNGKSKFAGGVSPAIKSGSFDWTYHSIAEKIPKEATDVTIRFGLQQVAGRAFFSKLSIKLE
GUT_GENOME007470_0285330-177QWKLDRGVFHRKTDGKDRIVLDVKPSEKAGRHYAEIPLDLSAHAGKYTTFSIRIRSQGWKQDKSERGRRGIRFMVNYRHPETGKSVWHHSWDTWDTVDWEKYSGQKPDEWKEISFFSRLDPGIREARLFLGTENNSGKAEFDLSTFRA
GUT_GENOME180680_00663604-722DETAAVSTLEITRKTPGSTFVTIPLPLSILGGQEVVLSGEIAHKAISQKPKSFNGVKLMLVITGPGGGTNYLQVPTGEGDSGWKNYALKTEIPYDARKLELYLGLEGVSGTAKFRNIRF
GUT_GENOME017889_0220849-173LPTGGPKGKPAIRIAGGASDANRQIQYKLDLDKVRGKVIELSADVKAENVSQPPKHYLGVKMMIVITKADGTRQYVDITTGKSGSYDWRECKVKAAIPTNAKEVAISLGLQSAMGAVYFSDIDVE
GUT_GENOME043880_0038855-173LQIRVPKQAKEIRAQNCAFATIDLNPFAGKSFHASVRLRGRDISKPFQPFNGVKVLLYYKTPNGGENWPGADLPVGTFGWRTATFSTELPRNVNEGVFRLGLQESSGEVEFDLSTLRLG
GUT_GENOME044373_0249958-173DGKGALVFRTRRNTETDWIRIPLPAGKLAGMIQLEAVVSGKDLAPSGRPFWGSKVMLNFQSAGKSRWPETSRNFGTYAWSTAGMAEVIPADAANLGLYLGIQGASGEFRVATVRIY
GUT_GENOME142616_025101016-1112MPDLPSDRNLTLSCYVKTDKVTATSSGGGAGMYFNFYKKDGTHLSQPVQPNKVTGTNDWQRISFSTTAPAGTYEVRVYFGLRDATGTAWFDCMQLET
GUT_GENOME236147_0064853-168SLLVEVAPDKVSRGITGAFGKLDTAALAGSRVRCRVETAGKEIAEPGNPWNGGKFMLEYSSGGKHFYPQGKMRTGSYDWMPVEFMVSLPFDLDSANIALGLQDTSGQIRFRNLSVG
GUT_GENOME114532_0067550-168LEIVQPDRVQGTRCITKTFRPEDLSGKILRISGEMKGEAILPGAKPYFGGKIMLVLSSGKDTQWLTVDVPTGTFDWQSFDRVFRANSNLDSARLVLGFQDSAGKLWIRNLKLESLGQPV
GUT_GENOME247081_0084950-154WFRTDIRRNLEISRYAGKLLEISGEFRWIDSVPALGKKQGMVQLIVRKRGGGDSYFGAFVTPGDTAWKRFSFPAEVGFEAESITLLIGFQRAGGCFEVRNLNVEI
GUT_GENOME213065_0249133-161QCVNENGESVLRFEIAKDAKSKSNLSEIPVELSEFAGKYVSLSAEVREIVLHADKSERSPLILNLEVGLKSGGVKIARNRIKTEKADGKWTKIGTAMKVPEGAIAKSAVKIGINRASGVAEFKNLKLEL
GUT_GENOME072384_0168739-165ENGKKILTVSAPEAVSQSFQIQNIDMTPYIGKDILLKVDIRANDVKKPKNTMYHGVKFMLILRQNGRKQYFETRMTGLRYGTYNWTTSQIMLSVPENIAKNAELVVGLQNTSGTAAFRNFSIQEFKH
GUT_GENOME019617_0039076-218LNGWQISPSNCAIISGGVLQITNPVRAEKSRAEIVKNLPIKKVAGRRIYVSVELAQDLTPSVSRWGGKIFLIEGGSDNFYVYAGNYIAPGKSDWAETSFFTDVPLESKNLRMHLGVESSSGKILFKNLKISSQDILAEFAKIA
GUT_GENOME266938_0082351-169DGVPMLKVTVPDPSAARMNAAFRPMNLNPLRNKILMITFEVKAESVTKPAHSYNGIKAMFYLKTKTGEVWRNPVHLHGSFDWREIRTGIEIPSHAEKAQLIVGLQESSGTLLVRRIRIE
GUT_GENOME255724_0027693-204PNAKEKKAVVDFPVNLKKYGGKSLYFEADVKAANVSKPDKKWNGIKFMLHFQHSKKKDKVWNSAKCDTGTFAWKRVSFNSSIPEGAKNATLSIGLQDSTGTVEFKNISYRVF
GUT_GENOME018163_0169865-182ENGKSGFLRFSASGQKSDLISIELPAEKMKGLIQLEARIRGKKLAVGSKAYFGPKVMLFHQFGEKNSWPEPEGSKLGTYDWRTVAMILEIPQNTTKVILTLGIQSASGEFDVDYVRIY
GUT_GENOME015047_0216370-187VMISPSPKIGNATLGAVLNIPVQKMRGMGVRFRGEVRYENIASDAGGPYWGGKILASNFRQGVYNFFVSPVMRGTQTEWQPVSLECSFPEDQQSVHVVFGIQQGWGKLQFRNLVYETY
GUT_GENOME021955_0252973-183FDCARRGYCVTERIPKDTVAGKTILIRADMEQKDVAKGKHFWETAKLQSSFRKAGDKHSTMRNITVQPGTSSWRPYGYILDVPADIQNDLELMIGLQNVSGTLKVKNVQIC
GUT_GENOME010116_0268435-162LETRDGKQYLRVVIRPEEKEAMNCAEAEFDLAPWADSMVELTIRARAKGVSAPVNYYNGVKFMLNFTDETGKEYWNNVVRLDGTFDWRDISFAALIGRPGKKSLLKLGLQESSGEVEFDLGSLRVTKL
GUT_GENOME237527_0109329-162TRPEVVAQLQGAEFLPDGGPDGQPAIRVRGPAKPTLRLRTEAFRDAVVSIRAKVKGHRHSQAFGECLHLRAAYPIRFREDWLYRPPAVRPLAAPDWQDYAITCHIPSYAGDFLLVFGHAGQDGDVLYSDVRLDR
GUT_GENOME237527_0007938-177RSDKFSSWSADESGTPVLKVEVPSSETDRTSILGAFRKLDLRPYRGKDIQVTVESRHVNLTVPSKSWLGFKLMLEYVGDAGVKVYPGGGNGVGTPRDTDWRTVSFSTRVHQSAEEGILHLGCQDVTGTLWFRNLRIRVVE
GUT_GENOME011076_0072945-175SLVTLEDGKKALELRIPETAEQRRNVLFWRGKGKDFQNRNVRFSAELRIDLKAPRKKWDGGQFAVWGARPDGKPGIWEDSQYIGVGKKTWKTYSFECRFPADLQSLQLMIGSNGAIGKVWVRNLKIESEET
GUT_GENOME251946_0117444-175KVISGVSLEDGFVKVAPTDGDLKSKGLLGIWKGIHFPQYAGKPVRFSAEIMLKDVLPLPGAKYAGTKCMVTFDENGQHFYPGIPEMQGTKNWKRYDCVFRVPANGFVSFSLGMQGARGVAGFRNLTIEPAEL
GUT_GENOME080649_0280754-170LPREKLSRGTVGIFRSLPVDRLRGRRVSCSIEVRAENLVPGPQAKHFHGGKFLLWIKSPGGELYPGGKIAPGSYGWQRISFRTNIPADAETVLLTLGFQNAGGKLEFRNFSAEADDT
GUT_GENOME222113_0078147-164TVAIPADAPEISRRNTADTPVWLTRFEGQPIEFSIRLRNSGISRPDNLHNGIKFQLYFRDGNGTERWTQAEIPHEPFPWRRFRFSERIPADATSGTLRLGLQGSSGNVEFDLDSLQIR
GUT_GENOME018291_0048845-165LRVTVLPTSHNKRGTNTAQMNLAPEKIRGKYLRFTCEAKGENLNIAELERPYLGFKFMLIVKSEKKTSYPGGGRNAKNFDWKKISFETPIPEDVTGAKLNLGLQNISGTVYFRNVRLEIID
GUT_GENOME102101_0338818-167LTASAGENLFRSATVQQGKYGTFDKAANTINVTIPHGTPVTEATEGVRFSFDNSMIAGKTVRFSFQLRTRDVRNIGGKPHHNAKLLLTGATKSNVYFKGSKGMTGTSDWTDVVWQVPFYGDNTRLTAVFGLQNSTGSAEFRNIRAELCDP
GUT_GENOME024210_0111037-157DVVTVDIPAGKVDSGSATCAYDFSRHNGKVFRASIRARGFGVSRPEPRYLGFKFMGTYRDDASGALNYPGAAQIGGDFPWQTVVFTDGNVGVKRAPGTFTLGLQDACGRVEFDLSTFKVEE
GUT_GENOME018611_0116246-162DGVTVLNFTAAAIGENTVAIPVDPALLAGRMVTLSGEVAQTDVSAGPHPWNGVRLALKLVDETGRLNMPQAGSVSGSAEWTVRRVSVKVPDKLKSAAIVLGLDWVAGTARFRNIAIT
GUT_GENOME007711_0230862-160YLPVDHLRSKRAVMEVEVKGEIEPLGKRSSGGRFGLIYEDSKGKKYYPGIPLPTGKFDWTKVKLEHILPADLKSINIELGAENAKGKFIYRNLKFTAQE
GUT_GENOME253124_0193553-164DFQDRVEYRTIDRKLDARKFSGRLIRLSCEARAENILRPRKVFEGIKLELSMKNDRRTTWGGIRFPSGTFGWKKFETVCAAPADLKSVNLAIGIQNSAGKFQIRNLKVESVG
GUT_GENOME120665_0081334-151KKLNGTSILRVCVPQGQKSRTHCASANIDLKPYAESLVIFSIRARAREVSKPRNYYNGVKFMLHFTDENNREQWKNAVRIDGTFDWKELSFYAMISRPGNQSTLKLGLQESSGEVEFD
GUT_GENOME019121_0164765-160IKPEWKFIRITMEMKATDLVPGEAGWQSGRLSLAFYDKDKKRMNPYPKMFEISGTTDWKKFEYNYTIPEGAARLAVDPSQYGKSGTVEFRKLRVTA
GUT_GENOME011076_0032731-169PSALAGWNDPDGNGKIVRQPDGKNALEVTASPEDSAKGRILSFELNPGKFAGKRVRLSAEIAYETGKPLAPWQGAKLMLAAETSRGWRYTSSYFPPGKYDWHSVARSADFPPGLLKLRLILGVQSAAGRIRYRKLKVEA
GUT_GENOME017889_0013451-164VRFTVPEAKKSGQNLMTIPFDIAACRGRTLRVSCRIRADKVSVPPQAFNGVKLMVSFRGTTPEWMNLARVWGSFDWREESFSFQVPDNAVNGKLYLGLQDSSGIVDFAKLRIEA
GUT_GENOME018611_0201860-187VSPEEAKATPDVVHNTRQKAPQVVRFTLDMEKMGVAGRRLYGEAEIRFNNVSKPDHRWNGVKFQLEYESGGIACYPSYFFGIPGFGTQKEYFRSFCAHPIESDVKKARLALGLQESSGTVSFRNVRFY
GUT_GENOME007470_00467162-319LKVEILASEPKVDAYYFDSLTRPDALSNCWIAPGSDVKALPGTGIRITTSREAKGKFRGLVRNIPVKPLLGKRIRISVEAKAENLEAVKDSRTGGKFMLSIQTPKETFYPQENPKTGTYDWKTYSFEYTIPPNSTQAGLALGTEHAAGTILFRNVKVE
GUT_GENOME136248_01624103-225SQVDGKTVLRCSAPFSSSGIAESVARGVELPPGSRIKLSIWVKGEKIRPVRNGGGSRVGLIVVDKTGEKAWNIAPSSVGTFDWTQKSILVEVPEGYQKAHFNLGLAGAEGTVWMRDLRLEQVR
GUT_GENOME072384_0234784-172TLSFQAELKNIVLNRPDDVWGGFTILLHGHKDGKYVALKTERVRMDGVGFRPYTLRGAIPSGLTDLYLEFILQRASGSVKIRDISMKLQ
GUT_GENOME122963_0076643-170SVCDSVLTFQIPARWMTIWGAVSLPLEFKSRKPGKPFTVRISGMVRLVDVVRNGVRKDDGVTMHLAYRRNGKKVFQGLQYVYRFGSCDWSEFSNTFSIPADAEDFSLLLGLQQCVGTLGIKDLVITEE
GUT_GENOME007470_0030345-155VNTSDKDNHSMRLWMKTYQGQKLTFKVVLSGENIAAKKRPYQGVRFFLWGRVGGKSTEFGKNLDLKGTFDWTTLTNVVTIPENFDGWLTIGLREVTGTLRIREVTVLEEIA
GUT_GENOME101537_004771381-1499PDGAKLVSAENGNALEVTGGLVVRRIAVPENETVCFSARIMAENVVRKNPKQHWTGTRFSVYMGKNGWKGVRAEQGSFGWKTVSFKVDVPAGVRSVQLQMGLKDASGTVRFQDVRAEVV
GUT_GENOME181275_01152103-218LHVKNTTQGGGSLATFPINVAEYAGSLVSITCQIKADKVIKDRLGYLGIKMMLPHTVNNIGQWPSATDGLHGTFDYKTFSLNVMLPYETSCANICLGIQDSTGDVYFRNLEIRRVS
GUT_GENOME273583_0035582-189TDRAVLASREFHVLQKYRNYMLEISVEAKAFSVTVPKVPWEGVRFVMNYDTECLWYSECCNGYSGSFDWRILKFRTRVPIDLKKATLSLGIIGNRGEILFRNLKIAVV
GUT_GENOME021955_0146263-170KTEPRLLCGISRKLDASAIAGKRITLSAEIRRDIHVSQKWQGGRFDLVVQRADGRTDTFGIYLEPGSFSWTTVSKSFDLPQEIVSATLYLGFRHATGTLRIRNLRLEA
GUT_GENOME112429_0070253-164PKQVGTVVILNIADLFTRIGACGKTVEITWESQWKEITSINKPWAGIKFMLMYKKADGSMIYAGCFPNEKVQTTQGWQIFRKTLKIPAEIQKPQLVLGLQGASGEIEFRNIQ
GUT_GENOME021955_03122192-324TTQYAEWNIIEGKSVLKVEYPADAIPKDQTFGIASFPVDVSSCMGKEIEISCLAKGSGIAPQKYPNHSARILVRFHQNGKNIYKLSRGRTGTFDWTPLSCKLKIPEGVGSIYLSIGLQKTTGTAEFKEIRIEE
GUT_GENOME080649_0175757-181DGSPAVAFVLPKRGTRWMSIGLDPAKLRGLIQLEAVVRGKELTRGPQPYFGSKVMLAISAGGKTRHPEPLRRYGTYGWTKVCIVENIPDNADKVTLSLGIQNASGTFEVAGVRIFRCVETDDPSE
GUT_GENOME021955_0161252-173MEDGKFALHLSPKPVKPGCRAMVSRQFDLNSVRGKRLTVTADVKWDIKLPFSKWMGSRFMLCITGSDKKTSYVSASIPPGKSDWKPVEFSAQIPDDAQRVMLFLGIQDAPGDLYVRNLKVRS
GUT_GENOME220189_0189645-164PEIRQDGVVLSVPQRVNNASLTLKLNPADIAGQRLNFSVEARGTGIAPAREPWNGGKFQSNIEAPGHHSWPSAKIKLGNFDWETLAFSADIPADVTNAQLTLGFQDSYGKLEFRNLKIET
GUT_GENOME044373_0143932-170DSPDSLKRWQASGTRPEFLPGGAPDGKGNAVRFVRQKDGNSKITLPLDPKKISGLISFEGYIRGSDLSGKADYLGPKFMLTVKDGKKYYYPEPVKGFGSYGWKRVRTFFQIPENCSSLSLTIGIQGGRGMFEVAQIEIR
GUT_GENOME258218_00402397-484GKRVKMTADIKSENVTGWAGMWFRVDAKNSRNSLAFDNMQDRPIKGTTDWKQCQIVLDVPADAGKLAYGVLLDGEGKVWFGNVSFEIV
GUT_GENOME011076_0213752-167EKDSGKEGGPALHLKSTTPAGTRNYPILLDPELCRGKRLMLEVRVKGLNLERGEKQHFAPNFMFRIQRRNVRKVRWAKVASELGSYDWKSFYCVEEIPKDLESICVVLVLQECTGE
GUT_GENOME251935_0305168-179LPIDLTPYRGKWVELSIRAKAEEVAKPKLPYMGIKFMLKYQGANQKWYYPERNNLSGSFDQELSFTHKVADDAGKGSVNVMLAQASGKITYDLSSLKIRVVPAPETQKAVPQ
GUT_GENOME011319_0063838-158WSDSDGMTLKVMNSDEGKQVMNVLKLKAEDVAGKTFKVSFEYKMDVEPGENKWSGAKAFFQMKSGAKYARLWCPLPCIGTKDWTARSVTLTFPEEVSAPGLYLGIENAKGTVSFRKFKMEE
GUT_GENOME025611_0196448-175PNGGPESDESAVRLSVTDPAKGAMAQFPVDVNLVRGKTVKLTGKIKGENISVPPQAYLGVKMMFEIVGADGKKNYPDVQPKLTGTFGWREAITIATIPADAKAVTIVIGLQNSSGTIYFSDLELDEVK
GUT_GENOME278390_01805647-766EKGRPAIRIDSPDATASRQLRIPVALEAVRGHAVELEAELKAEKVQRPDAAYLGIKLMLVITGADGKLRYHDLMEGKYGNYDWRSCRVKVEIPPDARRVMLHLGLQNAIGTVLFSNIRLN
GUT_GENOME236234_00467956-1089RLDWNAPEGATIEDDVLSISLSEKGTAMASADFDFSRFAGKTVKATVLSRGRGVKRSNVSYFGFKFMLYFVDETANQAQYLTGPEREGDWSWRPIEMIFDLRNMKPTKGKIYLGLQETTGTVDFNLASLRLEEA
GUT_GENOME025611_02240153-259DGKTNPCLLLWLKTDDVAGAKVRFSAQIKAENISKPPAGHLGGKFGLMVSRNDGKTEWPAATIGGGSFDWKQVSFTADIPYGVRSVAIMLGLQGVTGRILFRDLKVE
GUT_GENOME275835_0019846-166IREKDGSGALKLFLAEHKPIGTISISLKPQEIAGKRLRISAEVKGEKIAPAKERFNGGKFMMPVESASGTLYPGAQLSSGSFGWEKKAFVAEIPWDVKKASIVLGFQDSFGTLLVRNLKVE
GUT_GENOME017148_0087596-190LKEAKGYEAELSFEAKADDVPRPAEPWEGIRIALNYATEVDNYADSYNGLSGTFPWRSFRLKTRIPANLTSASLTIGILGKQGSVSFRNVKITVT
GUT_GENOME214799_0094672-181IRLDTRRSVGNLLNLPAKMYAGRLVQISGEFRGEKILKGGEYDGGKVIFSWMLDGRRRYLGIRVPTGSFGWKPFSRVVYIPGGAGDIVLNIGFQNSAGVFKVRNFQLKVL
GUT_GENOME272472_0055054-180KKSVKNGKTFLTIDVPTGSPDKGNTNCAATQINLSKFRGESLCFMIKARAKDVSEPRHGYNGVKFMVNYKDGAGDEFYYNTINLKGTFDWQNISFITLVSESATNATLYLGLQDSTGEVEFELSTLR
GUT_GENOME101537_00454152-323LSTLQVRENRNARKPLPRALRWQSSVSHRELNLTFAPGTPERAIADETPFAAVSDGILSIFISAGKGKNRQSVIRIPIPPAKFPLAGKRIRLSGEIRLTDVTVPRDPWNGVKVMLYHRINGKGNYPTLYTNQYPNYGSAPWRTFRGHTNIPENATDLLLYLGLQDSDGIVDF
GUT_GENOME157262_00442801-919GSRLIVDVPESAGPCASMPSAVVDLSPFVGHWLRLEIPIEAAHVPRPEKPYWGVKFQLSVDYADGTRDWPGRCFGEGTWSSVGTFRAHIRKPLRRAKLELGLQSSTGRVVYDLSKLKVW
GUT_GENOME276360_0142447-168KDWKPGIVTDGDSKCIKIEPGGTFGAWVELPEGETILFSVRMKFQDVKRKEEQKHWTGATFKAMLRIPGEKTAWPGRVMVGSSEWKTVTFKVNIPLGKGNAMAWIRCALEGATGTVWYRDLS
GUT_GENOME109794_0196150-169PTGGRGNSGCLFFESDSPKHSARMFLKLNPKQFAGKIRLSAWCRGENIGKSAPWFGPKVMLVCKTAHGAAYPEIPKKRGTYSWEYGECEIETPPGCSELELVLGIQNCSGKLWIDDIKIE
GUT_GENOME013145_0252735-157GAETKEKSFLIRSTAPETNRWTFARLPLHADEYAGQIIRLEAEIRADGIRPVSGKKSAGAKLMFVLSNGQKQDWPGIIIPSGTFNWEKFGKTVVFPTDLKQAELVFGLQDAAGEIEIRNLKIS
GUT_GENOME018409_0036537-164KWRMEGAAKVLEDGSVELVSSGMAERPGAIRWMPLRSGEKYVFTAEMKGENIGEKQHESHGVVFGVFVEIDGKWRQLGEKIELTGTFDWLPVTCNVAIPAEFKGSSGNLIIGLFNVAGKLNVRNVRMT
GUT_GENOME023609_0008946-163NGVFTVTLPRVPHTVNETRGVSVELDPVKFRGKSIMVQAEMRCRGIGSDLSQRHVGGKILWFYRNAGGARIFSSTPLLGDEEQWKTVSVFCEFPANIQSAGLVFGIQQGWGTLEFRNP
GUT_GENOME025611_0215745-170RYEAQGGIADSGCVLFENTRKDQSSLAAIPLEVEKLQGRAVLVEGWMKAEEVGRPELSYLGPKLMLAVTGRDGGKNHPDQEKAFGSYDWRKFTVFARIPHSAQKVELCIGLQGCTGKVWVDDVKVS
GUT_GENOME102749_0016242-163DGKTIFAVDVPQEDAATSHWFSAKFDPAPFRGKSVCFLVKFRTKGVTQSKLRHLGSKFMIVCNDGPESKTQYHDAWLPEGDQDWQYGSIFQTISPNAKNAEIVLGMQNVSGRIEFELDSVEC
GUT_GENOME095216_0139550-171EQRPAIQMTSTSPEKSALCELPLDLKQYAGKLLRFSADIRLEKIQPPAKPYYGLKLMFILTRADGSRHWAEDLKPAERFGTRNWEEYEAYLKIPADARGAVLSFGIQQAEGRADFANLKLEV
GUT_GENOME017889_0112466-162DMKQIAGKAVCFSADLRCKDIASDTEGDHVGGKLLVSYRDGSGMRFFATPSLTGTAAGWKNYSRRLEIPADAEDLTVTIGIQQGWGTLEVRNPSMDF
GUT_GENOME007908_0124870-188DGTPAIRIDVKPESAAGQHLAIRKIDLKPYQNRQLYLEYEIKAENVSRPPQPWNGVKCMLHYKTPGKEVWASPGEVYGSFDWKTVLVPAFIPPGVETGTINLGLQDSSGTVWIRKMRIL
GUT_GENOME215400_0142750-164VLRVAVPEAEKDGQFFAVRKIDLKPFRGRPVILRAEVKAVEVSRPAQHWNGVKVMLHCKTPSVEFWRNPMGLCGTFDWQCVETRLTVPHDADEGEVSLGLQGSSGTLLIRNWELI
GUT_GENOME195721_0240544-161LRNGTPVWVVEVPPEKALGTHGLKAAFDVTPYRGKQVTFMVRFKATGVTEPQKKYLGSKFVLHFRPAPGAKSCWPDANLPQRTTGWQTGAFTVNFPAEAVDGELRLGLQDVSGKIEFE
GUT_GENOME135970_01034200-286RRFQVPRGASYKISGWVKGENVAQRAGFYIHQGSPDRGIHKDSNSGTFDWRQVVFQGTLPYEAEYLQFGTVMYSTSGTAWYDDIEFT
GUT_GENOME157138_01629870-982IRLQDFKPAAGKLFSIQQQLPQMKPNTRYRLNYYVRGENIQPVRSGGGVTVNIWDAANRWFPQNKLIGTFPWILQTYEFTSSPDTGKKSAPRILLYLLHASGTVWFDGVTLEE
GUT_GENOME007461_0080250-169KLETRDGRQILSVTVPEPEKQPTNCAGATVDLKPFRGQSLCFMIRVRAIDVTKPAHSYNGVKFMLNYRTGDGEERWHHPSNLFGSFDWKLVSFTAPIDAGATTGRLMLGLQESAGTVEFD
GUT_GENOME113464_00138912-1031TEEKYSGNRSLKLTNPAQGSYYTFYAQTLAIPKGKTYTFSVKAKTVNLTSNEGGQLFVYYYNENGELVRPQSEYIQNTNGWQEYSFTFTYPANATSDLYVCLGLMGCKGSIYFDDAQLEE
GUT_GENOME044269_0141348-168YFTQDGLRLNVEQRASIVQASRSFAPDRFAGKLLRLSAEAALENVRPGVRSYEGASLQFTFERAGKRTYGGIKFPNGTSGWKRYSHVFHAPAKLDAVTLKLLFQNSSGQVTIRNLSVEELG
GUT_GENOME044373_0212029-161MDFHSPEEFRKWSGKSDRMIENGALHIRGNQILVRKIDPALIRGTVWIEAEMKVERIPNPEKPYWGAKAMLVIEEPNGRKSYPEPAGIPRYGTTGWKTCNLLLKIPEQVKSIQLWMGIQNATGDYWLRRFQIY
GUT_GENOME044269_0056945-164VTVPKHPNPPNGTRGVVLKLDPKKVAGKAIRYRAEMRWRGIGSDTSGSHIGGKILGTHHNTAGVGSWTASPSLLGTEQEWQEVSYFCQYPRALKSASITFGIQQGWGGLEFRNPTMEILP
GUT_GENOME113468_0056640-164PGAEWLQDGTLKVKADRSGRSLVSCSPDVRKFAGKFVRVSAQIKAEKVSAGKRPNHGVKVMVSRNSNGMGELWKASANRLWGDFDWNTQEVCLLIPEDVSDFRIHLGLENATGTVWFRKISIRKL
GUT_GENOME171359_0463890-201GRQSMLIDASRTSRVSIHAKAPVESGKSYRLQVWYKTENIAGGSGAYFRTSVLNASNTKLIDGPGSTKVYGTNGWRMQQVFITVPAGGAKFLVELFLENGTGKVWFDNIRLE
GUT_GENOME025611_00692177-293AEFLAGGGPDGMNCVKFSGTTGELQLDTNLLRGRIVTVEAMIRGEELGGGTLMISPAAPAGPGEYYPALRADKGSFDWRKFGYSVRIPDYAGKLKLRFLHAGKGGNAYYADVKLSLS
GUT_GENOME122963_0151937-144HCNQQETLAEFDLTRLRGKTLTFHGSVKLNGVSKPPRADLGAKFMFIIKKKHGTVYPGCRNLNGTTDWRPIRFSCLIPPDAEKGTLVCGIQRSTGTVQFRNISMNVED
GUT_GENOME251935_0154038-155TIRNGVITIEAPEPVTSRFACYDLEESEVAGTLQELSGECRAERLARLHERLHGGKVNLTGVDPAGKRFYSATNLQTGSYDWRPFLLDGYFPADGRLPRIMLGIQQASGKLEFRKLRR
GUT_GENOME279866_0338056-178TDVLIVEVPESAGNSVSAMVDIPVNLAKNGMAGKMIYGEAYLENENVKKPTKSYLGIKFMLPYNSPSQGRSYPEFLTPKQKYGTAPWRKLGSMIQIPSDVRLGHLTLGLQNTTGKVSFKNICL
GUT_GENOME021955_0001355-171IKVSPGDTGRHFVEIPLDFQKWKMDGLQVEIQGEVKMTHVARHPKSKWEGVQFRLLFPYEGAVHDPRILYGNKVPYFGTTEWFLAGNKIHLFHGVTKGKLSLGLNNVSGEIQFRNIR
GUT_GENOME219866_0020568-169GARFSLPGKEYAGKLIEFSAEVRWIKKDPGLPATGRILFRAQTPKGERDIYAGFNVSPSDRQWTRKKCAMNIPANTGRLHLLIGFEQTAGEFEIRNLNLDVL
GUT_GENOME122963_0118842-156GSRVLTVTSGNRESGKTVTFPLDTGKIAGRRLILSAEVRTDIERSTASWLGAKLMISGKRKDSPFYSRTPLPQGKTDWHRVRFTVDIPHDLQNAGLTLGIQRNKGTVQYRNVRIE
GUT_GENOME236234_0118547-173RGDIVSVALPRKGTALCRAKVDMTAWTGKVVRAAIRSKGKNVSKADETWLGYKFMLHYRDKAAGEELWPGAASRSGSWDWTETEVKIDLRGKSPEKASLAIGLQDASGEVCFDLGSLRIEESKPLFP
GUT_GENOME236883_0181943-179DALRGWRDAQPRHLEANAGKDGKSALHFVCDSHSGADWIHVSLDRPSLGEGLIWLEAEVRGQDIREPKVSYLGTKVMLTIDRGGKMSHPEPPHVTGTFDWQTLMTIQHLGKDVDQVTLSLGIQGTPGQFWVNAVRIY
GUT_GENOME007886_0110723-163QWKMSRYTKIIRQNDKSFLQISVPEKAPDIAKANCAIAQLDLKDNPLGELDISIRLEVESLSKPRNHFHGAELKVFLETADGKQLSLGTSVPGSKKTDKILRVRRLIPAGIQHGWIRLGLNDSFGTVKYDLSSLKIRFLPL
GUT_GENOME007470_0160055-175VLTIRVPKGKESPGTENLVALPLNLQQSGLAGYEIQGEADLCFSDIPKPAEGYFGVKFMLPYRFSGEWHYPEFLPENFRRWGSSEWVRVRGRALIPASQKSALLKLGLQGASGTVSYRNIR
GUT_GENOME102370_0179554-163VPENSRKGSHWVTVKSLDLKRLRGKKLLFTTFVKGEKISKAPAYYFGLKYMLIIKSESGMSYPQAPAMRGTFDWKKSYFEVNIPRDALSGELVAGLQEASGKASFRNFQI
GUT_GENOME063123_0138652-169FENGGLRIANDNPKGARLLPIRIDAAKVRGKKLKVTSMVRGENVSKPEKPYLGVKAMIYYKAASGDRQWIEHKPKKDGTFGWEEVVTQADIPADAEDVSFQVGLQESSGTVWFDDIKA
GUT_GENOME275835_0181352-170GQCLQVTRQKSVQEKGSTGVFYRFRPEQIAGKRLTITVDVKRDIGVPAVKWQGGKVMLTLNTGNKTTWPGIYMPHGKTDWKTMVLSEDIPGDLKSAILLLGIQDAQGTIWFRNLKIEVG
GUT_GENOME010633_0127056-170VPAGKGTQQRCMERVVDMTPYRGKTVTFLVKYRSIGVSRPPEAYNGIKFMLKYKPSPESDFRWPGGNGMYGDSDGWQWTSFSETFPAGATSGTLIMGLQNSHGRVEFDLSTLQVG
GUT_GENOME276360_007437-108GAYARLETERYQGHEIEITLKFRAKNLFTSNRYGAKGILEFKGGNGVRYLFAERFPNGTYEWKTVRLFAVIPKGSRESLLKFGLYGCSGKLEVDLAGSSVRI
GUT_GENOME278390_0306534-172DTPSKLVRRSGEHVTLPDGTAALKVTGNEKRPFSGVRLRLPLKPEYAEKVLYLRARLKYENVSKPLRPWNGVKFMITTRLNGKFQYHQRGSEYGSREWFDAEIYAPLDPGAEPQATFSIGLEDSTGTLYVRDIQARVMN
GUT_GENOME273687_0100634-151AKLSEGRFLKFSVSPENKSGQHCASAEFDLSSSADCMIEFSIRARAKNVSRPPQEWNGVKFMLHYRDSAGKDFWPGIRKFSGTFDWQELSFFALVGKPQGKAFLKLGLQDSSGEVEFD
GUT_GENOME011319_01950172-315AKAGVVPLKPSLFECGRFGTVNGDTLTVRVPEKTGITNQTRGANLRLDTRELAGKQVVFRAQMRTRNVESDANGEHVGAKILAISTVNDIAYYYFTPSVTGTQDWQDVSANCNFAKAQESMQVVFGIQQAWGEVDFRNISLEII
GUT_GENOME010397_0101048-171VPEGGAGGSGCIRFHRDTVGDSWLLIPIDPAELRGRAIQVEAMMKAEKIAKPSPGYMGPKLMLNLQSPGENSHSEQEKVHGTYDWRKFQVFAQAASDVEKAVLAIGIQHGQGTLYMDDVKITLV
GUT_GENOME011076_0069544-164LSFDVPAGKETHTDRTKVYMTVDFAKLKANGKKLVISGEVKMTGVTKPKDVWNGAKVMLTYTTNGKKVYPSFLTGKTFGTTDWTKFRKAVQIPADAKTGIVCFGLQDSSGKIQFRNLDLSV
GUT_GENOME275835_0053246-166SGGVLSFDLKKQTGGFPCIYRSLPPEKAAGKLIAISAEIKGSQLKRYQKKPFTGVKLQFLATADGKNIFAGPTLTKEGSYDWQTLRQVLYLPPNVQDLKLQIGMQGTTGSFQLKNLKIEDL
GUT_GENOME236147_0110669-188TVTVPKGKIPGDKMVIVAIPLKVSALKLAGSSLFSEMDLKYSNVTQPAHPWNGVKAMLVYEAGGKTEYPDINPGWQKNVGTRDWYRANCVTEFPKDTVKVTLMLGLQESSGEVSFRNIRF
GUT_GENOME021955_0004762-171AIRFSSKTYQDLLLKIPLDPRKIKGPIRFEAWTKSSTETNPGKAYWGTKLQLVVKAKGKYTYPETPRNESGSWKKVYKDLVIGPDTEELTLRFGLEQCKGSYSVADFRIY
GUT_GENOME101861_01485622-735VSVPEGKENGQHVATSTVDLRPVYDSNLVFRIKVKAENVSKPPEKWNGVKFMLSYRTAEGNTVYKHPSRLWGTFEKEICFVTDVPEGATDGVFYLGLQNSSGKVTFYLNTLQIR
GUT_GENOME275835_0209973-199LPDGNTALEIRIPRNAPIQNITYGASRELDLRSFRGKQIVISGTVSAKEVPERNEGHRCVKLMLYHRGKTGNRYIPSSYRLWGSFPESRLELAFTVPEDVGKNTLMVGVQHNTGVVLAKDLKLEVKE
GUT_GENOME021955_0287557-166IGEKQRGMHGIYKTFPVSEMKGKLIRITGERRGSNLVLPERDFLGPKTMFIITCKGGQTHYPGVSGKFGTYGWEPFEALCKIPENAERIQLFLGMQNGSGTFGIRNLRIA
GUT_GENOME015499_0013933-151LVVDAPEGSSAARKANFHLKTIDVKPFAGKHFGASIRMRCADVGTPSFRFGGVKFQLKITGKDGKAHYYQPAEIPVGTEEFVRVELDEVLPSDAVEAVFRIGLQCCPGRAEFELDSLEY
GUT_GENOME113587_0340513-188FQTMKMKLLLTGLMLCISILEGSAQNLVRNPGFDGMKAWSLTRWTKIYGRISTADRNMILANDDLKQTTMAQQGVKLKPHTEYRISFRIKGENITTDGAKNSGACIMILHQGKYVYECSPAGAWKTVSGTFDWKSCALTFKTGETTGWSTLYLVLRKSTGTVRFSDVRLEETAKPV
GUT_GENOME261169_035127-164LLIFSAVLKLSAALLPGETIWEETFATKNAVNAWISSGSKPEWIPGGGPDGKLNAVRFCRKAFGDSFLIKNLDGSAVTGKFFLECWVRAENIKGERNISFLGPKVRFSYKTGNRMYYPEPKKGWGTYDWTLVRCYIDLPENATEKNLMVGLQKGSGSF
GUT_GENOME023609_0016333-159KCAQYQNASIKFIPGAGPDGKGALRFHNDTIRHNDLKIPLDLNKVRGRGVQLQGKLRVENFPVPSRHYLGPKLMLIAKTPSHISYGEQPKKWGTTGWLFFHTFLRIPDDATEAMIRLGIEWAKGTLY
GUT_GENOME010315_0065448-176SGVKMSFDKGEKGEGGALKMETSENVHFGGLSLPVDARKLRGSVLELSGEMKGEKIAPGPKYYNGAKFMLVIDSPSGNIYGGSQMPTGNFGWKKVKLFIRVPYDATAGRLNLGFEESRGAIWIKNIKLE
GUT_GENOME017889_01141189-285SFSRIIPVRPGSRYTVSGWVKGENIRGGFGAGYFLHIGPRNNLACAPQNKWNNYGTFDWKQIEFTGTVPENADTLRFGTVLNAENGRAWFDDVKIKL
GUT_GENOME241327_0323638-175AEMIKGWNVPIGRVYDPGGGKEGKGALYFSSPEDRPGGIVIRKSLDPKVVNGRICLEATVRGKNLKPADRRYWGSKIMLSFRGIRGDIANPEPQRRFGTYDWYKAVVFTDVGNPESIKLSLGMQKGTGEFWVQSVRIY
GUT_GENOME279866_0012040-156LTVTVPDTAREKINCAETEINLEPLRGLLVRFSVKVRARDVSRPRKPWNGIKVMLNYQEASGKQYWHNVQNLSGSFAWRQAAFTTTISSTAAKGRVMLGLQDSSGTAEFDLDSLEIR
GUT_GENOME221118_012397-164ILAALLAGSFGAAGGPAGDCGLAQAEWKLGTHSRLEERDGKRLLVVEVPEKDRNGSYLSRAEIDLSPFRGLPTTFSMRVRIDGVEKEAGRKGIKFMVTYQAPSGAGIWHGGWGIYGPADWRSVSFLSPIDDYTGKAELVLGMEKNFGKVEFDLDSFRA
GUT_GENOME219941_00392261-361ASSAVNVLDRGKSAGRRVFVEADFWQNLSPSVSRWGGKIFLLDGDSKDFYVYAGKYVAPSNSDWQKIRFWTDTPLDSREVRVSLGVESSSGTVKFKNLRVY
GUT_GENOME030759_0120951-159VITVPPERKTGPNTVSAAVNLAPYRNKVVQFSIPLRAENVSVPEQKWNGVKFQLETAGTDGKPSYSGAELPPGSFDWRLATFKVAVTADTGILQLGLQQSSGKVEFDLS
GUT_GENOME236883_0233944-165VDGGAVRLALSPDSPHDKMRCVERTFPASLMAGKTFRLEFDIRGENVTKGDKPWYSVKVIFSFSDGKKEMHYIDAGGQLGTFDWRRVERDVSFPPDVSSPVRLSIGFQEVTGTIWLRRLRLT
GUT_GENOME044269_0323758-153IPVASFRGGFAELCGEYRANRLRKSGRHPHGGKINLQWRTPDGKRHYAPHSLPEGDCDWTAFRLVGAIPDNAVDLRVQAGLQQGRGQVAFRNVRVR
GUT_GENOME251935_0066443-164LKDGIITVEVPDTPEKMKQHFAVCPIDLKPLYGQTVSFSIEAKGNDISTPAKFWNGGKVMLSFRDREGKMMWPGSSRISGSFDWREFGFSATIPEGATEGRLNLELQQSSGKIEFRLDSLKV
GUT_GENOME114532_0034754-175EGKPAVKLVSTDVKKGVGCELQLDIEKIRGKLVEVTADIKGENVTQPPKPYLGVKLMFRVDLADGKIDYPDIAPAAKKIGSFDWTEYKVKTRIPADAKSARLVIGLQESTGTVCYSDLEIEV
GUT_GENOME017148_0200047-164VIKIENTNATSSAMSNAMLDAKTLQGKRLTVTALIRAEDVAEPAKSYLGVKLMIVVVRSDGSVKYFENLPAHCRFGTYRWKAAKVSAAIPADAERVFLSIGLQQTSGVVFYSGFRCEV
GUT_GENOME136248_0197735-140RGVQFKLDLAKLGACGKNLQISGEIKFDNVSRPPQVWNGVKAMLSYRNQGKMEYPSCYKGISYGSRDWTPFTSTIAIPATAGQCTLSLGLQDSTGTVSFRNIKIDV
GUT_GENOME238256_0020911-166IITMVLLVFNMVADEFAPLRLKWDLPEQYAKMEGDLLIVDIPRDKHGASAMATAIVPNDRIEGLKRFSFSIEASAENIAKPSKSYLGIKSQLHWRDAKTGADNWPNTRPRTGSFPRTVLKTYVDFMGATPGFAEIQLGLQETSGKIVFDLSTLKLS
GUT_GENOME102749_02743235-350LEVRIPESSGEKESRIQFMLNGEQLRNRRVTVSAEMKIDLHSPADTGSGGLFQLWGPTPKGKPGMWHACRIGSGQKDWKTYELIYDFPDYTKFAVISLGIGNAKGKVQIRNVKVKS
GUT_GENOME128367_0058138-163TPAGTLEIQIRPGKEANRFLTAQRTFDIIPYRGHDIELVYRLRAEKVTRPPQDFTGIKLILSCRSGHREIVADTPRQWNHGDFDWREVTIPMSVPEDARTAKLMVGLQNSSGLIEVRSITVRDLGE
GUT_GENOME007886_0070049-161SANDSPTSDVRATTSVKLDGLGGKHMILKAEVKGTALSVPPKRWLGIRFVLSYRSGGELRHETLVLPLHGTFDWKAAELLIKFENPAPHATLSVCLSGVTGKLEVRKLRLEPS
GUT_GENOME007470_00337605-696IAGKNVSLCAEIRQKDVTPAPKEWQGIKLMLFYEDAGGKKQFPQAKAMPGSSDWRAHAVHVRVPAGIRKAEILIGMEHAAGEVAFRKVRIEE