UHGP-MC 61233


Information


Number of sequences (UHGP-50):
101
Average sequence length:
296±20 aa
Average transmembrane regions:
0.04
Low complexity (%):
1.79
Coiled coils (%):
0
Disordered domains (%):
4.21

Pfam dominant architecture:
PF02514
Pfam % dominant architecture:
6436
Pfam overlap:
0.31
Pfam overlap type:
reduced

Downloads

Seeds:
MC61233.fasta
Seeds (0.60 cdhit):
MC61233_cdhit.fasta
MSA:
MC61233_msa.fasta
HMM model:
MC61233.hmm

Sequences list (filtered 60 P.I.)

Protein Range AA
GUT_GENOME262235_00654930-1166GGDPSAGIREASVRIFGDAPGSYGAGIDLALMASAWEDEKDLAEYFIQASAFAYGDKLDGKTSVREFIDNMKRVDLSSDTSVCKRMDSISCGFGIQVHGGIRLTAKLLGKRDIRQYQSLSERGKEVETTALNESVERDINATLLNPLWRESRKEEGRDGASDIMHLMQNIFAAQCACKPLDDRILDELTETYINNKDMRDWMLAENPFAAEESARRMLELHSREKWRPSPDVLSRLK
GUT_GENOME009972_00351920-1209VRMSGMYRDSLWPTVELMDDAIQAVIQLAEDDSINYVKMHYGEEIQKFGEIKMDQVQAHNLASARIFSSQMGTYGAGVNLALENRNWNTIDDLGKIFVNWGCYAYSKTMKGSFAPKSFEYQLATMDLTIKNEDNYEVNMLDSDDYNAFHGGMIAAVRALRGSVPESYCGDTSDGSSPKIRTLKEELKWLYRTQVLNPKFIDGMRKHGYKGAADLTSLVRHSFQWDATSDIMENLLYEQITQEYVFNEEVNHWLREVNPWGLKKMAEILLESIQRGLWEAKQETYEHLRKI
GUT_GENOME095248_00470956-1262LMSITGSYRDQFPALMALLDRAVAQAATAEPGNVIARNTAQIGQELRKQGVPRAQAEQLARVRSFGNAVGDYGTGLSDAVQSDGLQTNDARLGQMFLERMSQPYLEGEPVTGVAGGVAAQALGAHLRRTDAAILSRSSHLYAMVSSDDPFQYLGGLSAAARVAGRAQGLELHVAQLQDAGEPTTETAQRAIALEMQSRYLHPGWLQAQKAEGYAGTLQVLKAVQFAWGWQAVAPDTVRSDHWQSFYDVLVRDKHQLGLPEWLKEHPQAYAQSLERLVQAQRQGYWQADADTQKQLAQMYQELTRAAP
GUT_GENOME013200_00617887-1176VTICGFFRDMFSNLVTGLNKLFAMLDGLDETDEESSFAKNTRENYGMLISEGYSEDDARDLSRCRLFGPGEGLYGTGGITDAVNSSSWKEEEDLSDIFLRNMGHAYSLKRRGADIPGLMTANHRNVDVVTQMRDAVERELIDLDHYYEFFGGLCKTVEVSRGSKAAMYITDTAGPSVRTTDVKRSIEHGIRTRLLNPKWIDAMLKTDYHGAQHINDRFENVLGLAATTGAVDSSVFSDMESCYITDPGMRRRLMENNNWALMSMINRLSEASSRGYWNATDEELKTLNDA
GUT_GENOME120945_01393877-1180ELGRPRIDVTLRISGVMRDTWPDAVTLMDEAVLLVSSLNETEEENFILANIHKMELECGELGERARTIRIFGDPPGAFGAGIDLALKASAWENEKDLARYFIQSSAFAYGKELDGRKSMKEFVQNTKNIDLSCDVSSSRRMDTLACGFGLQVHGAFKIVAETIGGKKIRQYQSMSERNTVITTTTLEKQIEKDIQQTLLNASWQDSTMEDSYTGASEMMHRIQNVFDVKCTSDCVQDQILDQLVETYINDEKMRKWLLENNSYAAEEIARRFLELEARGKWRPDNELLKKLQDNYLSIEGCMEG
GUT_GENOME171583_01264965-1256SGLIRDMMPNALKWLDKAVTLVAELDESEEDNFIKKHINEDVSWLVAEGENEKDAYQKARFRLFGDPPGAYGAGVGQLIEQKNWKSIDDLADVYVTWGGYCYGDGAKTNQDKRLFRRRLATMEVTIKNEDNREAHMLSSDDYNAYHGGLIAAVRSIRGEVPRSYVGDSSNREQVKTRNLKEEVNRLVRGETLNPKFIEGMQQHGYKGASDLANVVAHSFAWDATSAVIEDWVYEGYANKYALDKDMQEWMQSVNPWALARIAETLLEAIQRGLWQAPKDMEDQLKQLYLAME
GUT_GENOME000582_00866856-1138LRVSGLFRDNFPNLMELLDEAVNLAASEEEPDEVNYIRKHINRDIRKLISRGMEQEQARRQASARIFSCPPGQYGAGINLLISSGKWDGRKELGDAYLAWSAFAYGQGREGIQMADSFADRMKDSQVVLKNLSSVEEDLMDSDDFYNYFGGAIAAAEAAGGSAVSYTVSSSDGKEEQIRTTKEELERLVRTKLLNPSWIQGLMEHGYRGAQEMSAMFDIAFGWDASTGNIDSWVYEGLVQTYLADETLKEWIKEQNPWAVHNMSGRLLEAYGRGMWNADEESL
GUT_GENOME207893_01140925-1230KLNRPRIDVIVRISGVLRDAYPNVINLMDKGTVIASSLEEPFESNFVRKNTFEIARVLKELGEDKDIKRRSTIRVFGDKPGTYGSGVNLALMASAWKDEKDIGKIFVYFSSHAYGENLNGRMAKHEFVENVKASEISYDTTISNRYDVLSSGFAASVQGGFGVVKKLLSGKEMKQYHGGTENKDNIRICTLKEEIKKTMEETFFNPLWKENVKKNGYTGASEFMRRIQSVFDWQCLSKNIDNKDIDKLVDLYVNDEEMVKWFSEHNKYAVEEIGRRFLELYERKKWNPDKEILDKLRKNYIKIEGD
GUT_GENOME141221_006511303-1592VNITGLFRDTFPNLIDMIDDAVKLVSGLDESSDENYLADNLRKEILEGMKAGLTVDEARRKSSMRIFGAPPGAYGAGVNHAIETSEWKTVEDLADVYISWSSYAYGRGVHGESMKDQFVKRFSKVGVTVKNMPDREIDLLDCDDVYTYLGGMNSFVRAYGNPDAISVMGDGSNPEHLKLRNAAEECKFVFRSKILNPKYVEGLKEHGYRGAAELANVTEYLFAWDATSDIVDDWMYEQLADKFLFDNDTKEWMMDENPHALMNILNRLHEAISREMWNADQDTLEKLKQL
GUT_GENOME002763_00071898-1189SGLFRDMMPSGVMWIAKAVQMVNALDEDHSVNYVKKHIDLDTAALVEDGMDYEEAIKATTIRVFGDAPGAYGAGVNQILEQSNWSTREDLANAYVTWGGYGYDEQGEIIPSHQQFRKRLGSIEVTVKNEDNREVSLLSTDDFNAYHGGMIAAVEAVKGIKPKSFIGDTTYRKDILVRSLEDEIQRQFRSEPMNPIFIEGMMKHGYKGAADMASYVQHTYEWDATSDMIQDWMYDAYTDEYVMNQDVQDWMRQVNPWALGAICTTLLEAEQRGLWKTSAERIEGLRDVAMSLE
GUT_GENOME038689_01034508-795SGLFRDTMPSVMQVMDKAVLLAAEQDEPEDLNFVRKHIQEDTKELEQQEGMEHDAAWRQAAFRVFGDAQGTYGAGVAALLESKNWETIDDIADVYVRWGGHAYGGKTKGKFLPQQFRKRMGSLDITIKNEDNHETNMLSSDDYNAYHGGMIAAVRSIKGSAPRSYCGDSTDRTRVTMHSVQEEAKRLFRSEAINPKFIKGMMQHGYKGAADMANMIAHSFQWDATSAVMEDWMYEKYAEKYTFDPAVQEWLRDVNPWALQRMTEILLEAEQRGLWQAKPETKEELQKL
GUT_GENOME102627_00027930-1232ELKWPRIDITVNVSCILRDNLMNCIDLIDSAVRAVAELDEPLDKNFVRKHTLESITSGMDADDALTRMFGAPPGSYISGVNLAVFASSWKEDKDLAEIFVKAKGYGYGNSRNGKPMFEQFASSLSRVNVTFDKVSSDEGDILACGGHFSNVGGLTVAARYLSGTDVKAYYGDTRDPRDISVNTLADEIRRVMRTKVLNPAWIEGMKQHGYKGAGDIMKKISRLYGWEASTKEVDDWIFDEVTATFVSDPEMRQFFKDNNPYALEEIARRLLEANSRGLWNTDKDTLQDLQNVYLDLESVLEDL
GUT_GENOME243197_00257999-1242ASLRVFGNADGAYGSNVNHLVENGRWDDEDELAETYTRRKSFAYGLKGQPVQQTALLKNALATIDLAYQNLDSVELGVTTVDHYFDTLGGISRAVRIAKGGQSAPVYIGDQTRGAGTVRTLSEQVALETRTRMLNPKWYEGMLKHGYEGVRQIEEHVTNTMGWSATTGEVAPWVYRQLTETFVLDIEMRERLASLNPVASAKVANRLIEAHERKYWSPDPEMLDVLRRAGEELEDRLEGVGVAA
GUT_GENOME254280_003741004-1313ELKRPRIDVVVQTSGQFRGAATSRMRLIDKAVRLAATDPDGEFGNFVREGSLEIVKALIAGGMSPEQAKSLGNARLFGGVNGNFGTGVTGMVQNSGQWEDTKGIAEVYLNNMGALYTDEHWGEHVPGVFKAALTNTDTVVQSRSSNSWGPLSLDHVYEFTGGLSLAARHVTGKDPDAYFNDLRTPGRARIQEAGEAAMVEARSTVLNPKYIKEMMKEGASATGSFVEVFHNTFGWEVMKPDMLEDHLWQEYKEVYVDDKLNLDIREYFEKNNPAALQEMTAVMLETVRKGYWKADAETIQQIAKTHVELM
GUT_GENOME140888_021061001-1289LRISGLFRDTFPNLIDMIDEGVETIASLDESDEENYLAMHLREDIQAALRRGLAPREARARAMVRIFGDPPGDHGAGVDVLIESSKWSTTEDLAETYVTWGCHAYGREWRGEKVPEVFKEKLSRLNVTVKNHEDREFDLLDIDDDYTMLGGMNAAVRAFGGRKPLSIMGDSSDPQRLKTRTVEEESKFVFRSRVLNPKWLDGLKEHGFRGAQELSKLVEYVIGWDATSDIIEPWMYRSITERFLFDEETRKWIEESNPYALREMASRLLEAIQRGLWEADEEMRRRIQS
GUT_GENOME221811_00218915-1216VNICGFFRDMYPSLIEALDDILEKLYERDEPDDQNYFRAHAKARYAKLIDAGYEPEEAKQLAIARVFGPKEGEYGTELTGIIETKNWQDETELGASFASSLCHVYTRRKRGQRVEGLYEDNLKSVEIVSQLRSNPEYEITDLDHYYEFFGGLAKSVEQARGGVRAKQYITDTTGSTPCTESADKSIARGIRTRVLNPKWIDAMLAHNYHGAEQIAARFENVMGLAATTGDVDTWIYDELNAKYVEDPEMRRRMAQNNPHAYMNILEQLMEYHSRGYWDATEEQLEQIRQTYLELENSLEETI
GUT_GENOME162865_01085932-1242ELGRPRINVVVQVSGQLRDIAGSRLKLLTDAVRLASEAKDEAYPNYVASGTVLQEKLLVEKGTSPKRAREMSVMRVFGPVNSGYSTGIMGYTEHSGSWEDEKEIAQGYLNNMGAAYGDENNWGEVQKDLFASALSETDVVIQPRQSNTWGPISLDHVYEFTGGLSLTVKTLTGKEPDAYMADYRNRTNRRMQETKEAIAVETRATILNPTFIQERMKGGAGSAQMFGEIFRNIFGWHVMRPSAMDKEIFNDLYRMYIQDENKLGIQDYFLRVNPASYQAMTAVMLESARKGYWKASPEQLKATAALHAQIT
GUT_GENOME098781_01429938-1241VKISGILRDNFQNCVCLLDDAIQAVSKLEEPPELNFVRKHALENQQSQPELSWEDATARIFGAQPGTYSSGINLMVYASAWENQDDISDLFMNFNGYSYGRNRFGKQSPIALESSLKHVDITYDKVMSDEHDLLGCCCYFGNHGGMTAAARKLSQKEVKAYYGDSREVTNIEVRTLSEEINRVVKSKLLNPKWIEGQKQHGYKGAGDISKRVGRVYGWEATTEEVDDWVFDDITKTFIVDENNRKFFEENNPWALEEMSRRLIEAYQRELWNPEEGLIDEIQDTYLELESFLEESMGNNAGDFQ
GUT_GENOME244169_00385904-1210ELGRPRIDVTARISGLVRDALPQAVSLVNRAVALVAALDEPDELNYLAKHVRADAAAAEAEGIDPETALRRARWRVFGCPPGAYGAGVGALLDQKNWETDDDLAATYVTWGGYAYDEDGEAHADPQSFRRRLASVDVTVKNEDTRDLNLMSSDDFNAYHGGMIAAVRAIGGRKPRSYVGDTSARTQVAVRTLAEEFQQVVQGESLNPQFIKGMMKHGYKGALELAKQASFAYGWDATSRVMNDALYNRIAESYVLDETVRQWMREVNPWALHEITETLFEAQRRGFWNMTDEMADALREIYLSTEGE
GUT_GENOME132204_01904541-835VAKHWIEDCLFYLSIGYNESYAGECAITRIFAPPNGDYGAGISKLVSMSWTWNNTDELANFYLGRMGNMYSRNYWGDTNPLVFLRALSNADTIVVSRNTNQYGVLDNDDFFDYWGGLSMTVANISGKTPKMNVLMYADTSNPYIASLENVLNKEISARYDNPEWIKGMMKEGYSGARYISNKFVTNLFGWQVTRPSAVPNDLWNKIYDTYYKDKYGLGVKKWLQSGNNAYSLISMSGTMLTAISEGYWKTDEATIRDIANTWAQATVQNGVACCDCSCGNIAMMQWAINYVNPDM
GUT_GENOME243901_00018898-1201QLGRPRIDVSINICGFFRDMFPLLLEKMDDLFHELALLDENDEDNYLAAHARKFTQELRHDRTPEEAEKIAASRFFGPRPGEYGTSLTRLIEDKSWQTEADFSDHFTADLSYLYGRDLHGTAARDFYQENLRQVGFVSQLRTMQEYEVTDLDHYYEFFGGLAKSVESAKGHKVPLYITDTTGQNPESETVDKAIARGIRSRALNPEWIEGLLRHRFHGAQAIQQRLENIMGLAATTAAVDQWIYSDLASTYVEDQEMARRVAENNPYAYKAMLEQFMEYYQRGYWKADEEQLAQIRERFLELED
GUT_GENOME120208_00457953-1272LRISGVLRDAYPNIPEYLNEVIQTVARLPEPIQENFVRKHTLENLAEDTPESWEIAATRIFGGQENTYGTGINLAILASAWQNEQDLADVFLTWSAHLYGSNGSAKTSLQQLKKQLATIDLVYNKSSNDAHDLLGSASHFSYMGGLATTAKLLQKNREIKTYYGDTRNERKIRITSLAEEIAETVQTRLCNPQWIAGMKENGYKGAGEIAKRIANVYGWDATSGAVGDWIFDAIAQTYIMDKAMQMWFKEHNPWAMEDINRRMLEAVQRNLWKPENEVLEALKRSHLEVEGWMEESVGEIDGDFQGSAVNLYTPADVPAW
GUT_GENOME096381_062521020-1324ELGRPRIDVTLRISGFFRDAFPHAVALLDDAVRLAASLDEPADVNHVRAHAQQDLAEHGDERRATTRIFGSRPGTYGAGLLQLIDSRDWRTDADLAEVYTVWGGYAYGRDLDGRPAREEMETAYKRIAVAAKNTDTREHDIADSDDYFQYHGGMVATVRALRGSAPEAYIGDSTRPETVRTRTLVEETSRVFRARVVNPKWIEAMRRHGYKGAFELAATVDYLFGYDATTGVVADWMYDKLTEAYVLDDTTRTFLQENNPWALHGIAERLLEAESRGMWSKPDPEALDALRRIYLETEGELEQDE
GUT_GENOME080362_02942906-1221VLSATGLYRDHFPNAMKQLAKAVELAARAKEADNPVHANSQRIAASLKARGVPEAAAQRAAETRIFSNESGRYGTGLDDAALATDTWKNKAEADRKMAQLYLKNMQYAFGADETAWNTKGIAGADGQVADLNLYAEHLKGTEGAVLSRTSNLYGMLTTDDPFQYLGGIGTAVRYLDGKAPELYISNLRGSGSGKVEGADKFLAKELATRNFHPGYIKGLMAEGYSGTLQVLDSMNNFTGWTTVAREIVRDDQWQEFVDVYVRDKHQLGLKNWFEQNNPHALAQTIEKMLEMARHGYWQADAKTVAELKDRYNDLAR
GUT_GENOME096468_03715961-1250LRVSGFFRDAFGNLIKLFDAAVQAVAALDEPEDLNPLAARVRSEQARLVANGLEPEQARRQAGWRIFGAKPGAYGAGVQGVVESRQWTQRQDLAEAYLNHSGYAYGGADDGAPARAELADRLGRLEAVLQNQDNHEHDLLDSNDYYQFHGGMLAAVHQQAGREAASYHGDHSQVDAPRVSTLKEALNRVVRARAVNPKWLAGARRHGYKGAFEMAATLDNLFGFDATTGLVQDHQYAMLADAYLLDEENRAFLKAHNPHALRDMAERLLEAQQRGLWQAPGAYRQVLENL
GUT_GENOME207750_00981929-1223LINISGIFRDMFPGIIADLHALFKKISCLDETEEANYFKKHTNKLYAELLAEGYKEDEAFDLACARLFGPSEGEYGTNITNLIETSNWEQEGQLGDSFTASQQYVYSDKHHGEKIDKLFQQNIEAVKVINQIRSDFEYEVTDLDHYYEYFGGLSKAIENQKGEKVEIIIADTTMERIQAENVQSSINRGVRTRITNPKWIDGLLNHPVQGAMKIAKRFDNLLGLAATTNKVEQWVFNDVYDTYIRDENISKRMEENNRFAYHSMIETLLECNQRQYWNADDEQVRQLQIKMMELE
GUT_GENOME096450_02468955-1245LRITGLFRDTFPNIIELIEEGVNIASALEGESAEDNYIRKNIFKEVKELVGNGMSFEEAEEQATMRIFGCPPGTYGAGVSNLINSKNWKDVNDLGDVYALWGGHAYGSKLHGKKVKEVFQRRLKEAQVTVKNESSMEIDMLDSDDFYNYHGGLIAAIRANSGKAPRSYSGNSSDPERTKIKDINEETARIMRSRILNPKWFEGLKKHGYKGAQEISKMVDIAFGWDATSEVIEDWMYEKISETYLFHDEKREWIKSVNPWAVHSMVERLLEAHQRGMWNAKKESVDKLRQL
GUT_GENOME100869_01104882-1168SSVMRDAWPSAIAMLDRAVHLVAELDEPETMNYVRKNAQTMGEARIFGGAPGTYTNSVGLALKASAWKDERDLARYFIDSSSYVYGDGKQGEQDVEAFVANVRQIDVSCDITSSRKTDGGASSYSARVQGGLQMAARLVGGKEIRQYMGESSAAREIRVVRMQDHVENAIADTLLNDFWREQMMGQGYSGAADLMRRIQTVFDTQCVLESVSHETLDAIAQQYFLDENMRQWFTEHNSYALEEGMRRFLELETRGKWKAEPETWRKLKSVYLQAEGDLEDTISGFGE
GUT_GENOME275835_00307854-1136MATASGAYRDQFAGLMEKLDRAQREAARLTDTENFIAQHNQSIMKRLAEQGIPESQRARLAERRVYAPAPGTYGTGVNRLTGSSGAWEKNSEIAGLYRERMSYGMDPDGSFAKSAEGFRLHLANTSVVLHSRSSTLYGVTDIDDMYQYLGGLSLAVKEAGGQAPREFIIDNRNAGRIRVSPLKQFLAAELSSRALHRDWIAAQQKENYAGARMIARTVDNLWGWQSVTPENVAGEQWNELYEVYFKDRHQLGLSRFFRRENAWAEQSIAARMLEAVRKNFWNA
GUT_GENOME236183_01629927-1214VRITGLFRDSFPNLISLINDAVSMVSDLDESDEENYLSANLRREIAEQIESGLDEASARELSMIRVFGGRSGSYGAGLNHAIENRNWNDTGDLAKMYVTWGGCGFSSDGREIPMEETFVRKLSSADIGIKNMPDRQMDVFTGDDVYGYLGGLAALAKADGREMRLYVGDGSDPDRPVVRTSNEECGHVMRSRILNPRFIEGLKRHGFQGVAQLASVSEYMIGWDATSDSVDDWMFDEFAEMVLKDNNREWMNDENPYSMMEIIQNLLEAAQRGLWDAEEELLSRLKEA
GUT_GENOME093964_00614237-525LRVSGLFRDMYPNLIDIMDRAVTMVAALEEPEDKNFIKKHVREDMEKLTAEGIPEENALDQSYIRVYGCAPGCYGTAVSHAIDSKEWDDFHDLAQIFETWSSYGYSTRAHGERQPEAFRRRMAAVSVTIKNESTAEYDMLDSDDFYGYHGGLVACVRSISGKKPMAVTGHTDDPDRPVTRDLSREMARIVRSRILNPKWLEGLKRHGFKGAQEIAAAMDSFFGWDATAELGEDWMYQSMAETFLFHEETRQWMEEVNRWSVHSVSERLLEANKRGMWNTDAETLRRIQM
GUT_GENOME222909_01080946-1224VVNMCGFFRDMFPNIIDDLNMIFKMVSNLDEPEEVNYLKASTERIYKKLLKDGYSQEEAEDFATARLFGPAEAEYGTGITSLIETKNWSSEEQIGEKYTESLKYVYSSNARGKAVPGLLNSHLQAVDIVSQIRSSHEYEVTDLDHYYEFFGGLSKSVEMAKGKKAAVYITDTTGENIETETVDKSIARGVRTRLLNPKWIDGMLKHKYHGASKISKEFENILGLAATTNKVDNWVFESMNSTYVTDDKMKKRMKENNQWAYLSIIETLLECNQRGYWQA
GUT_GENOME177031_00274942-1219ELRRPRVDVVCTMSGVGRDLLAGPMELLDDAIHAIASLPEDPEDNPIRAHAMQQVEELGLDLDSAATRIYATAPGNYGTGVNKLVQSSEWDDNSDLAEVYLHRMGHAWGKHVKGKENRTLLESSLSRVSATFQNVDSTEISLAGVDHYFEFLGGVSSVVETVSGARPSAKVSHAWQHETNVESLDDAMRLESRTRLLNPAWYEAQLKHGYQGVANIRSRFENTFGMQATARVVDNWVFDRTASTFITDDDMRERLAEHNPAAVLSMTERLLEAADRGL
GUT_GENOME194691_00073930-1240ELGRPRLDVVSRISGLLRDTFPTLITLLDEAVQLVLSLDEPIEANWGKKHYVEEVAELRAAGIDPETARQEAKLRVFGCPPGTYGGGVAVLIETKQWQSSDDLGETAITWGCHAYSQELHGRVSRGAFERKLARVEATVKNQNVIGFDLYDIDDEYIYHGGIIAAVKKLRGQAPMSYYGNTSDPRHSVVRTLSEESARVLRSRLLNPLWIEGLKRHGYRGAYDVAYNMDNVFGWDATADSVLDWNYEALASHFLGDPSNREWLKSANPWALYEIAQRLLEAAQRGMWQAKPETIEMITEIYLSCEGVLEEG
GUT_GENOME092814_002701012-1321ELGRPRVDVVIQTSGQLRDIAASRLFLLQKAIDLVAETKDGKTPNQIALGRVEAERVLLEHGVPPEQARKLSGKRIFGGLNGAYGTGIQEMVESGDRWEKESEIAEVYMNNMGAIYGDEELWGETAKGIFEAALQNTDAVVQPRQSNTWGALSLDHVYEFMGGLTLSVRHVTGKDPEGYFTDLRNRRRVKTQEIKQAIGVEARTTILNPTYVREQLKEGAGAADAIDETIRNTYAWNVMKPSAIDKELWDALYDMYVVDKHQLGTVDFFERNNPAALQDLTASMMETIRKGYWQATAEQKQTIAQLHAES
GUT_GENOME224048_00643955-1244MRISGLFRDSLSAAAELLEKAVNLAAVQEETPEDNFIRKHIEEDTDDLMEEGLTKEEAIRQASYRIFGCPPGSYGAGVAHMLEEKNWETVHDLGDVYVRWGAHVYGKKEEGQFLPKIFRRRLQGIELTIKNIDNHEVHLLNSDDFNAYCGGMNAAVQSIRGKMPRCYIGDSADRACAETRSLGEEFRRVFRGESMNPKYIQGMKQHGYKGASDLANLVVHAFGWDATSSVMENWMYEGFAKKLALDQEIQDWMREVNPWALHRMTAKLLEAEQRNMWEAQPETLAQLKQL
GUT_GENOME226825_01620714-1017ELKRPRMDVTVHISGVLRDTWPGILARMDEAILLAAAADEPPHANYIVKHLHAASMNGKKPCIARIFGGAPGTYSNSIGLALKASAWQNENDLARYFMDSSSYVYGKDKHGEKNLSAFLGGIKRTDVTCDIVSLRHTDALNSSYSARVQGGYALAAKSLGMKRRMRNYMGESTENGIDVKTLKEHLDDGVGQTLLSEAWKKKRMQQGYDGAADIMCRMQNVFEMQCVNQTFSSETLDTLAQQYVLDAQMHDFMRENNPFAAEETARRFLELESRGKWQPAPEVLSKLQKVYLKTEASMEDGLCG
GUT_GENOME207623_04971847-1116LDRPRIDATLRISGLFRDLFEAQIALFDLAVQKVAVLDEDDADNPLAAARRRGESLARVFGGAPGSYGAVAADLALGMTWETRSQLGDAYLDGSGYSFGGGHDGVSAAATFRDRVRSIDALVHPQDDRERDLLDGEEVADFAGGFAAAAASLGATPALLHLDTSRPDQPKARTFDEEIARVVRGRLTNPRWISGMLAHGYRGVAEIAQGIDALYAFAATSDSVPEHLFDLAHGALLRDEAVLDGMIARNPAAVAAILSRFEDALRRRLWV
GUT_GENOME013945_00163944-1233LRISGLFRDTFPNIVQLIDRGAGMISSLDESDDENYLVANLRADIARAISDGIPEDKAREEASIRIFGDAPGSYGSGTNILIRTSDWKDVSDIGDIYRSYGQYAYGIGRKGEARPEAFRRRLELMDVTVKNSVSREYDMFDNDDVYNDLGGFNAAVRSVRGRMPMSVIGCSADTSNIRTRTVDEEGRFVFRSKINNPKWLEGLKRHGFRGAEEISDMTEYVFGWDATSDIIDPWMYQSIADRFLFDEETAEWFRDANPYAMQATAARLLEAIGRGMWDPDDITKEQLQEI
GUT_GENOME113380_00317949-1237LRISGLFRDMYPNLVELMDAAVSCAAAQDESEEQNYIKKHINEDMLALLDEGLDANTALDQAYLRVFGCPAGGYGAGVANLIGNRNWSDYHDLAAVYETWSGNGYGRGHHGEGMQKMFKRRLSSVGMTVKNESTVEIDMLSSDDFFSYHGGLVACVKSNSGHAPISLTGHSDDPERPLVRDTAKETARIMRSRVLNPKWLEGLKRHGFKGAQEISKAVDSFFGWDASAEVAEDWMYAHIAENFLLDGETRAWIEAVNAGVIYNVAGKLLEASRRGMWHADEEMLTQLQG
GUT_GENOME096502_00246906-1196IRITGFFRDAFMNLVEYIDEVINEIALLDESIEDNFIKKHFLEDLNLFLDQDLDGEEAKRRSLFRIFSAKEGAYGTGVNTLINGKSWEEKKTLADTYLEYGGYAYGKNIYGTKARDIFKSKIEKIDLAVKNIDTKETDVLINDDNYSYLGGMVAAAKTYGKSDLEAYVGDSSDPQRVEIKNIAEETRFVIRSKVLNPKWYDSLKRHGFRGAGEISAHVDHIFGWDATTEIIDDFTYQQIKYKFVDNEDFAKWMESVNPWARQNIIERLLEAIKRGLWQADEKTKDDLIKKY
GUT_GENOME096532_03939947-1237VVHMTGLFRDMFPNLLEDLNRIFRRVSELEEPDELNWFKVHSRNTEEQLLAGGYEPDRARDLANARIFGPAEGQYGTNMTRLIETKQWSEESELGQAYADSQQYVYSLKERGQAEPELYRIRLMAVDVVSQIRSSHEWEVTDSDHYYEYFGGLSKSVAMVKGSPADIHITDSTLEQPVTESAEHSIARGVRTRLINPKWIDALLKHPYHGAQQIAKRVEYVLGLAATTGKVEPWVFDQLHETYVADEARSRQMEENNRWAYHGLVETLLESQQRGYWEPDEAQLEKLRQKY
GUT_GENOME103750_03707967-1249FRDMFPNLLNSFNEIFEKLQSLDENYEENYFKYESDVILKNLLDDVYDEKIARELSIGRIFGPEEGEYGTNLTSIVESKNWIEEEIFGKSYIDKLNHIYTKNYRGKKCDKLFEGNIKSVELISQVRSNPEYEVTDLDHFYEFFGGLSKSVEITRGEKAEVYITDTTSHIIETQTVDKAILRGSRTRLLNPKWINSMVEHKYHGVQEIKKRFENILGLAATTNKVENWIFDNMNTTYIEDENLKNILKENNKWAYFEMVETLIECNQRGYWNTSEDVLKKLRKT
GUT_GENOME147776_02662913-1224ELGRPRIDVTVRISGVFRDTFPNLIERVEDAVNLVAALPEDPDMNYVRKHVLADIKIWQARGESLQQAQQKAAMRIFGCPPGAYGAGTDLLLGSGDWQDTGDLAQSYLNWSGYAYGRGLSGQIARDVLSARLKSSSTAVKNDPVPESDVLDNDDFYSYFGGLVAAVSQERGSTPQTYIGDSRDPQTLTTRPLRQEIDRIVRGKILDPQWLAGLRRHGYSGAQQLSATVDILFGWGATTSQVPGWVYHRVAQCFLFDDAVRQWLESENPWALHAMSERLLEAAQRDIWEALPEDLTALQQIYLYMEGELEEDI
GUT_GENOME283701_004551053-1351LVTTTALYLNDYPYTLDVLDKAIRLAATANDTAYPNYVKLNSEAIYQQLIAQGYNETDARIASTSRIFSQEPGNHHNPLEDAIVMSDTWESDEKLADAYIETFGNAYGSNGSSVHMSDLYNMNLAGSQVAMFRRYVDANTLLSGDDYSAYFGGLGLAIRTASGNDPLMWISNLENPNDPHIESLSESLAKDLRTTYYNPKWIQAMQRHGSAGARAITSFVENMFMWDVTSPSSVTNGMWDDGYKTYVQDKYGLSLETWFNQNNPFAAQSMYAKFMEAARKEYWQTDEATKRDLANRWAE
GUT_GENOME266374_00566842-1153ELNRPRVDVVIHICGFFRDMYPNLIRNLNEMLQRLLQSGESEEQNFFLKHTRRMQEKLRLENEKNKTTMAEKECLKLASCRIFGPREGEYGTRLTEVVRKGNWKEAAELGSSFVEDLSFGYAANHQGIAADTLLKWQYQQVEVMSQVRNNVEYELTDLDHYYEFYGGLAKAIENEKGEKPVMLVADTTGEAVQVQDLPTTLKRGIATRLLNPAWIEGMMRHEYHGVQQIAKRFENVMGFAASTDAVDSSTFSDLTRCYAADETLRHRMQESNRWGYMKMMERLMEANNRGCYQATEEELEQIQEAYLEAEGE
GUT_GENOME014636_00512938-1216VNITGLFRDTFPNLTDMINDAVAMVADLDESDEENHIREHFRRDLVEGIESGISEDQARKEALYRVFGAEPGQYGTGVNTLVNTSNWNDRADLGNYFIDVGCHAYSRDSQGVKAEKIYRKRLSTVDVTVKNSTSREYDLFDNDDVYQYLGGLNTAVEAVRGEKAKMSVIGCSADVDCPVLRTISEESHYVFRSKVLNPKYERGLRVHGFRGATEVLKMFEYIFGWDATSDIIENWMYDRLAEHYILDESVREWIENVNPYAMREMIDVLMEAHSRGMWD
GUT_GENOME283701_004531072-1326NSEVIYNALKAKGYSDEDARKLSVCRIFSQEEGNHNNAMQNALLITSSWENEGQLAETFIDTFGNVFMGSEINSIHIEDLYSLNLNGTEVAMFRRVVNVNDLFGDSDYFGYFGGMGLAIKHVSGQEPKMWIMNVENPSNPKLESLSESLWRDTRSTYFNPKYIQAIKSYGATGAGIFADFFRYMSAWKITSPDSVNDNMFQEAYEVYFQDKYGLGMNEWFTKNNPYAQQAMAAILLDSIRRGDWKADANVVNDLV
GUT_GENOME194858_026281071-1350VLRISGLFRDTFPNVVELVERAVLAVAGLDEPPEQNFVKKHTDQERKRLVAEGLSENEALEQASLRVFGCPPGTYGAGVSKAIHSQNWESWRDLSQVYTLWSAHGYSSRFHGQAMPELFRSQLSSVGMTIKNESSVEIDMLDSDDFYSYHGGLIACVRDCSGQRPVSVAGLTADPARPETARVETELARVMRSRILNPKWLEGLKRHGYKGAQELSTTFDTFFGWDAAAEVASNWMYDAMARQFLLDEETRRWMEQVNDAAVLQMSERFLEAHQRGMWQA
GUT_GENOME141754_01168893-1188VVQVTGVYRDQFDSYIRLLDEAIQKLSLLDEQNNLVADNTKSIASELIKKGLPEKEALLMAGRRLFGNSPGEYGTGVPHLAMHSTEWDDDSALAQQFLKSLQYSFGKDHWGQSAGSENLFAEQLKGTDMAVMARSSNVHGVLSTDHPFEFLGGLSSAIKHLDGKAPQLMISDLRQQQPKTKGLEQFLSDEMRVRYLNPQWIKSMQAEGYAGTVAIVNVNNNLFGWQVMDERSVRDDQWQAMMDVYIQDKYQLDMNKWFEQHNPTAQAQMIEKMIEAIRKGYWQASKETQQALVERW
GUT_GENOME007443_020201081-1452VVQTSGQLRDLAASRLFLVNRAVEMAAAAKEDQYENQVATGVVEAEKALIERGISPKDAREMATFRVFGGANGGYGTGIQGMVEAGDRWENESEIAATYLNNMGAYYGSEKNWEVFRKYAFEAALTRTDVVVQPRQSNTWGALSLDHVYEFMGGMNLAVRNVTGKDPDAYFSDYRNRNNNRMQELKESIGVESRTTILNPNYIKEKMKGEASSAGGFAEIIRNTYGWNVMKPKVIDQELWNNIYEVYVKDKFDLGVQDFFERQNPAALEEMTAVMLETVRKGMWKASEQQIAELAKLHTELVNKYRPSCSGFVCDNAKLRDFIASKTEPAMASRYKESISKIREAVAENDKGVVMKKEELNPATEQHKTKVG
GUT_GENOME096269_01107997-1295ELQRPRVDVTVRISGFFRDAFPHLVKLMNDAVEMVSRLDEPAEQNPLRHRVLEETAQKMKLGLSEEEARESAAIRVFGSKPGTYGAGLLSLIESGSWQTDADIAEVFTQWSSYAYTNSHYGKPAAADFRHRLRLVQVAAKNQDNREHDIFDSDDYFQEHGGMVATIRSLTGRNPKMYFGDSSNPSHIRVRDLKEEALRVFRSRVINPKWLESAKRHGYKGALEMANTTDFLFGYDATAHVIEDWMYEKVSEQYVLNEDMRQFLLQSNPWALKDIAERLLEAASRKLWENPDPHTIEGLM
GUT_GENOME095590_00040970-1259VVTICGFFRDLFSNLIDELDDILHAVAALDEPADINPLAARTRQQAQAMRQSGEPEERIEALAHARIFGPPPGQYGSGLTDIVDSGQWQSPEDLATAFTAASGHVYTRYDHGSHTDGLYDSRLKDVDLVSQTRSSNEYEITDLDHYFEYLGGLGNSVRAAKGTSIPILVTDTTQDEVYTTSVGRAAAKGLRTRLLNKDWMEAMLAHGHRGVTEIEKRVTNLVGLAATTDQIDDWMFDEVCDTYIDDPEMVQRLSTLNPHSLASMAQRMVEAHDRSLWNASAEHLDKLHDL
GUT_GENOME243483_00547884-1197ELQRPRIDVVIQATSVYRDQFDPFLRLLAAAIEQLAAMPASAGNPIAGNALALQQRLQRDGLPAEQAQRLSRLRIFSNAPGEYGSGLNHAVLARSGAAGKDDAALASGFLDRLQHGYGSDGRATTLPGGNLFAEQLHGVQAAVLSRSSNTHGLLSTDHPFEYLGGLALAVRHLDGASPALYVSDLREPTPRTRSTARFLADELRAQTLNPQWIGAMQAEGYAGTLEVLKNVDNLFGWQVTAPGTVRADQWQAVHDTFVRDSRQLGLAPWFEQHNASAQVQMIERLQEAITRGYWQADDRTRAELQERLQQLQGA
GUT_GENOME034034_02173846-1124SGMYRDSLYPTVEYLDECIKCVMELDEDGAVNYPRKHIIEDAKELENTNCDYVQCRVFGSAPGTYGAGVNLAIESRNWQSIDDLRDIYVSWGSYAYGKNIKGQFLPEVFSKQLSKVDLTLKNEDHYETTMLESDDYNAFHGGMIAAVRSLSGKAPKSFCGNLSQKDSVSVQSLSEEMKQLYLSQVLNPKFIQGMKKHGYKGAADLCNIVSHSFAWDATSNVIEDWMYNDLTKMYVLSKDVNAWLKKVNPWALRKMCTVLLEAKQRGLWNTPKEIESALL
GUT_GENOME283701_00440951-1316VITVSGLYRDMYSDLVKLLDKAVRLAAQANDTTSYPNYVKQHSEAIYQSLLSEGYSEDEARSLSMSRVFSEPPGAYTPGIQEVIPASDTWNSTDEVADVYLERMSYVYGNDGWGQKSSSLFEKVLNGVEICEFSRSSNVYGVLDHPMVAAYLGGLGMAIAKVSGKYPELYINNLRESGDYRIETLNQFFNRDLLTRYLNPTWISGMQGHGQDGTRYMDTFVEDLWMWQVTTPGLVTEDTWNRVYETYILDKNNLGMKEFFDSNPYAKQSMLARMVETIRKGYWNPSAEVKTELINEFIQSVNEYGVTCCHHTCGNLVLNQMMVTGSSLSMEQLQQYVAAFASATGQNLNLGTPGSTPQSGSTPTST
GUT_GENOME075837_03745961-1250LRISGLFRDTFPNLIERIEDAVNLVAALDEPEDINYVKKHVMSDFRHFLSEGMERERAFEMAGVRIFGCPPGGYGAGVDILVNSKKWEKTEDLGQAYLTWSGHAYGKKLHGEKFQEVLSRRMSRCDVTVKNISSFESDMLDSDDFYNYHGGLISAVKAAKGSFPVSFTTNAGDPRHVETRSIHEETSRIMRARINNPKWIEGLKKHGFKGTQEFSAMLDIIFGWDATSSVVDDWMYESVAETYLLDEELRSWIKEVNPWALHSMSERLLEAEQRGMWEAREETLEAIKEI
GUT_GENOME177287_00498962-1285ELGRPRIDVSPRISGLFRDAFPNLVEMVDRAVRMVAALPEDGDDNMLRAHVEADVADMIARGIDVGQARRKATLRVFGCPPGGYGAGVEELIETKAWQDKADLGRAYIGASSHAYGEGVFGDVETERFTAALSRMDVTVKNEDTREYDMLSCTDFYNYYGGLIVAATTVRGEAPMSFVGDSSDPTRIATRTTTQEARIILRSRILNPSWIEGLQRHGYKGAGDLSKVLDILIGWDATADVVDDGLWEKVARRYALDPAMQEWFHEVNPHALHNIVDKFLDAAQRNIWQADPGTVEELQDTYADIEGTIEEVSDDPCSGTGGTPD
GUT_GENOME266065_00452868-1188QLGRPRIDVTIVPSGLYRDLFPNLMQLLDSAVHVVKQLDETDNYVRAHIQSMKRLLMKEGVKDEDLAEKLASVRMFSVPPGAYGTGISDVVGASGTWDNEQQVADVYFNRMGHLYGQGFWGDKVEDATDGLPSGMSVEIFKKALSGTDVVMHSRSSNLYGALDNDDFFQYLGATAMAIRSVDGKTPDVIVTNLSNPAAMGQETLDKFIGREMQSRYLNPEWIKKMLNEGYAGARFINKVVYNMWGWQVTVPEAIDENKWQQMYETYIEDKYQLDIREKFKESGNLYAYQTILSRMIETVRKDYWHPDDKVLERMLREFEKT
GUT_GENOME059193_00096983-1269VRVSGLLRDSFPAAMHMLDAAVQAVAALDEPLESNFVRKHTQERLAAIEADDPDAWRSATFRIFSSEPGTYQAGVNLAVYASAWQTEADLADIFLHWNGYAYGKDAFGVKRPRALEASLSTVDVTYNKVVSDEHDLFNCCGYYGTHGGMTAAASHFRGGQVKTYYGDTREPEQVQVRDLTDEVRRVVRTRLLNPKWIEGMKRHGYKGAGDISKRTGRVYGWEATTQAVDDWIFDDIANTFVLDPETRAFFDENNPWALEEIARRLLEAEARHLWKADPDVLAELREA
GUT_GENOME207893_01152813-1102LRISGFFRDAFPNIINLVDKAVQMVAFLDEPDDKNYIAKHVREEIASAIAKGKEIKVVKEEACYRIFGSKPGTYGAGVNNLINTKKWKNFKDLGDVFTSWGCYVYGEKNYGKTSPEIFKKRLSNLDVAVKNIDNRERGMMDSDDFYSFHGGMIAAVKTHKGKEPKAFIGDSSEIERVKTRTVSEEMRHEFRARILNPKWIESMKKYGYKGAGDISLTVDYAFGWDATAKAMDEWMYEDLAKKFVLDKNFTKWMKSVNPWALQNITERLLESIQRNMWNADKEMEQKIKKV