UHGP-MC 33241


Information


Number of sequences (UHGP-50):
55
Average sequence length:
370±45 aa
Average transmembrane regions:
0.01
Low complexity (%):
1.44
Coiled coils (%):
0
Disordered domains (%):
0.83

Pfam dominant architecture:
PF09848
Pfam % dominant architecture:
9091
Pfam overlap:
0.23
Pfam overlap type:
shifted

Downloads

Seeds:
MC33241.fasta
Seeds (0.60 cdhit):
MC33241_cdhit.fasta
MSA:
MC33241_msa.fasta
HMM model:
MC33241.hmm

Sequences list (filtered 60 P.I.)

Protein Range AA
GUT_GENOME096381_0265944-406GSEVRSWERSIPVLANALNDAGLGRVEVLLEYGLPLNSKRADAVLAGVHPTTGLPSYVVVELKQWSSAEPDDDDPSLCRIDAYARAVLNPVDQVRGYCEYLVSFNGALAEHPDRISGAAYLHNATRFGVSGLFAAEQNEYGRLFIGERRGEFIDYLRSKLGAESGAAAADELVNGKIGPSRQLMAVAAQEVREREQFVLLDEQQVAYRTVLNAVRRAKQSDHKEVVVVTGGPGTGKSVIALSLLGELYRQGITALHATGSNSFTTTMRKVAGARKREVQGLFKYFNSFMTTERNSLDVLICDEAHRIRETSANRYTPASNRTGKPQIDELIDAARVPVFLLDEHQVVRPGEMGTVAEIKEAAA
GUT_GENOME188079_017034-429YKSSKAEFIEDTRSCAIVEKLTANFERSEGRRASPGEERAWQSSLQYVANALDDGAIPDDAGVAVEYQIPSCAKRVDVLLSGYDAASRPNLVVIELKQWSDTAATDEDGILAAPRYGSVLVRGPHPSYQAWSYAELLRSFNAACAASSKEAGNAAVHVEPCAYLHNHAPQGAEKTVLDPRYASYLKAAPVFLAGAREQKRLKSFLKRFIVEGDRGQTIERVENGEIRPSKALADSVAGMIRGNREFVLIDEQKTIYEHVLSLVCRKETPGLGGRRRVVIVRGGPGTGKSVVAVNLLAALTSRQKLCCYVSKNAAPRAVYAAKLSGTLKKTVLSNLFKGPDWACELTPDTYDAFSCILVDEAHRLREKSGVYGNKGENQIRELMRAAPVTVFFIDDEQRIHVKDIGSVAAVREEAEKLGAQVEEFEL
GUT_GENOME143719_0168318-378EELVENMLQVDIDHSIQEQLSWKNSLYLLIQLLNEQSFGNLWLAAEYSLTADRRIDAIIFGYSSQHQPTAIIVELKQWEKLAPNIEKQKTNVNVCIGNRNEYRLHPIYQTVNYARDLKAHHEIVANKKIDLYAIQYLHNFIGDKNAFFSEMYEDYQKLSVGFFVKGEERLLVNHLKKHFDVTINGEQVATEFLEGKYIIGQVGFNGLRSVLNQKDNAIMLADQIEISAQISRLFKKFTQNHRNTAIIIRGAAGTGKTIIGIHLLFLAQQHGLKINDMVFTFAKSRMLREVVKNEAGLMQHIPYLNGIALKDYSLVVVDEAHRDTDINKTIKSLFSYSKRPKIVVFLQDDHQRVLLEEVGTL
GUT_GENOME243156_0208828-397KVPVDRSDSEYRSWKQSLPKLIKVVNDAGLGNLMLAAEYKLPTGGRIDAMLLGYSAAEHTPLAVIVEIKQWSKEKIAIKDRGFTVIHVQNDGQGYPSIHPICQTNEYIKYMNRNHGSVLDGSLSVTACQYLFNFDATYKEELFEGDFEKYRHDQDKMFCMGEEGRFHEFLKHLFSDEEKDCSEAVSVLFEDNYRITDLDMEVFKDITNRPESIRLIEDQINCMECVNMSIGRLLAGTLDRKQMFIVTGAAGTGKTIVGFKLISDYCQAVRERRIDTEYKCAYTLPRSRTIKAVLDGIGGGLQTVFLNNLKGSFELLVVDEAHRVTDFDRKKGGTGHILNQANIVVVLQDDNQRVLGNEIGTLDNYRRFAR
GUT_GENOME105097_0084912-347CYADSVLAFKQISEKKWLNLMTKNFKVVYPKFKLTKDQCTAWRDCFRVMQMALANTSHPENTTIIFEYKLPYEGGRRPDVILLSSSKIYLLEFKMSDHFVKPDLDQLLAYQRDLSEYHLESRGKNIYPILVYTGANCAKDINNKKVNDKLNIYQICNKACICSPKNLKIECSDTKPIDWKRWVKSKYEPLPSILEAVCQYAKNEPLPEIRTANSAGIGKAFKIIKDNVKYAKDNHKFVVCFVSGVPGSGKTRLGLRLVYEEAEKTKDRAMYMSGNAALVEVLQYTLAQHKKDIKKDSEALIKELFNVINQDSIDMNVLVFDEGQRAWITDDGKINQ
GUT_GENOME000314_008682-315NSCYKGYINDFLITNKESWIDEMIINYKRLCKELPGDLQINAWKDCYDKLIINLAKYKNYKLYLVFEYELPHEGGRRPDLIILSKDNIIVVEFKEKEGITRADLDQVSAYSRDIQHYHKFSHNKIIKPILVPTRSVGKIKEIGNVIVCSPDKLEEALESIEINDINYSPEEWLNSEYEPLPTLVQAANRIYTDKPLPYIRRVNSAGIPQAVETLKEIVQQAQVQGERVLALGTGVPGAGKTLLGLQFVYECFNDNKNKNSIFLSGNGPLVNVLQHALDSKVFVQPLRNYVKFYGISKRGIPKEHLVVFDEAQRA
GUT_GENOME219872_003015-433YQKTLAEFYEDQAAGIVADMLAGAGHESSSRGSAQYRAWENSLVFMESMLKGAKVSGDCGVLIEYRLPSTSRRVDFIVTGHDVKGDPNFVIIELKQWSWADVVREKPGIVVANVAGSRNEETNHPCYQAWSYKVFLENMLNTVEEHHLQAHACAYMHNYPYKGFEVDPLAADPNHELVRDTPLFGQYDRRELGDFVSQYVGRGDGEAIMELLADGKIVPGKGLVDAVSGMFDEVSPRSFTLIDEQKLAYETVMDAVRTTPVNDKHCVIIAGGPGTGKSVVAMSALVAILREFRNNEKRRNVRFVSPTSSFRIAMVEMLTSGCTNKKEKARRNALAKNLFCGSMGFFTPSPADPKVNGFTENYYHCLLCDESHRLHSYQNMYRGTNQIEDIIQAACVSVFFVDDNQALRPNDIGSVASIEKAAQKYSAKI
GUT_GENOME096556_003203-413LYAGTSTNFIALNFNNQIADLLKTEFRVQFGYNPSMNEVMSWQNSLYRVANMIDRSGLHDNGIFVEYQLPLSSKRIDIVITGTDLKKTKNSVIIELKQWSQCTLTEFDSDKVVTWVGGGNKHVLHPSVQVGNYKYYLQDNNSAFYDEKDPVHLYACSFLHNYNPQDGDPIFDKRFESALTRFPVYTAEDTDKLSSFLCDRLSQGEGMEILSKIENSKLTPSKKLLKQVSGAIKEKLKHKGEFKVFGQHKVKDDYTLLDEQLVVYDTIMSVVRKGLSQRQQYAVIVKGGAGTGKSVIGLQLLADLAALGYNAHYATGSGSFTKTLRKIVGAGTDNLFKYFMSYGTAQPRELDVLIMDEAHRIREKTGYPFKSTGRPQVEDLIRATKVALFFIDDFQCVRKGEIGSSQYIKNE
GUT_GENOME171519_0078432-398YRYEPSEYELLSWKNSLEQLCSIFRNAGLKDNGVILEYKLPLTSKRLNCIVTGKSKENNDEALVIELKEWHKTHESSGRNEVYANVGGLQKEILHPSVQAKQYVEYLKAVHTVFNNENNNIKVEGCTYLHNYIFQENDSVLDDKFNDVCRIYPLFSRNETKKLEQFIGNKVGNGKGTDILKKIECGKFKPDKKLMNHVSDVIKGLPQYVLIDEQQIVYDKVLSITESACTSSEKKVVIIKGGPGTGKSVIAINLMADLLKEGYKTNYATGSRAFTETLRSIIGSSSASTCKYFNSYMKSEENELDILICDEAHRIRKTSNNRFTKKEDKSDLEQIDELFRAAKVCVFFIDDVQKVRPDEIGSTSYIR
GUT_GENOME050342_0111642-381KSWENSLSYMAEVIKISTLPDDCTIILEYNLPISSSRIDLMITGYDFNNKEKILIFELKQWSKVDLVDNSDVLLETFVGNSQRKVLHPAYQVLTYKDMLCDYNKFIQENNAIIEASVILHNYKKDNIINSLIFDDLLSKVKMFYSDDKQNLIDYISKSFSKGDSSKIIYEIENSDFNPSKKLQNEITNLMTGNNNFRLLDDQMIIYDEILRSLLNNDNTVSIVIGGPGTGKSVIAINLLTSMLQKGKMTQYVSRNTAPRVVYSAELKGTLKKSNIDNLFKTSGAYTDTKEKSFDCLIVDEAHCMAEKSGLFNNYGFNQIKEVITSSNNSVFFIDEKQRIH
GUT_GENOME134215_0030416-398DEIAQRMENFYNDNSDAEKKSWRASLPKLIEVIQSAGLGNLYIATEYELPAGGRIDAALIGDDDNGKHRVLVIELKQWSRDGIEYYANKGFPAIKVNATNPYLSRHPVNQTKEYTDALIGNHSNVVNGQLSVSGCQYLHEFELGEKNFFIQDRYSDIDISMMFVKGEEKVFADYLKSVFSPFTDNELARNLFIEGDYVTTEMDMEIINRITESPDNIPLWHDQSKILDYIMPLLKQQAEGKLRTKHMIVIAGAAGTGKTVLLTHLVASILKERPEAKVGVVVQPNWEKTGENIFKIYGMNSRYLSVTTATKIITGNERYDVIIVDEAHKLSRKYGKQHPSFGQVYKIPGFEDCNSHLEILQKMGKQIILMYDVLQAIRPANIT
GUT_GENOME222009_0049929-401QELGIHGGGHAEQESWKQSLTRVASILDNEDIPGLAQVALEYRIPTTGKRIDFMISGQDEKGKDNVYLLELKQWETAELSNYENCVRTFVGGANRYVQHPCAQIYNYAMLLKNLNQEIQDGDISLYPAAYLHNYKEEDRKNLTDSRYEFYLKHVPLYLSSAMDRHALRNDIKRHIVKPSSKDLFRIIEHGKLKPSKSLQESILSLLEGNQEFYMLDEQQVAYSAILSYVQKASKQPGKTVILIQGGPGTGKSVIAVSLLAELIHQGYSASYVTKNAAPRNVFATELTRGHKSKAYIKNLFKSSQGFVNEAKDRYDCLLVDEAHRLGDRSTFFDKSGIDQARRIIHASKVTAFFLDEEQRITTKDMGSLAQLRK
GUT_GENOME255537_0172913-459YIGTVRALYSMSGKEIAARLRELQKNPSESECRSWENSIPILVNVLHQAGLDALTLVLEYQTPIGNRMDAVLLGEEQKTGKPLVLIIELKQWTDIGENIDNRQSTVSICISRAEGRFEDRLHPVQQTLTYAKHLRMNHSNVTDGKMEVRCLQFLHNFEDKSRLFCGAYHAYAKLQHETYVKGEEEKLASDLSSLFSHNPRPDVVEQFLTGTYVLGQISFRNLKDVLARKENAAMLDDQIEVNRRVCSLMDHLHDPQFGKHLYLISGPPGTGKTVVGLHTIYAYCSKYGPSVRNRGGCIFALPRSRTLAQVITGASGVAPAYLDAVPPGRDLVVVDEAHRIEQLESTLTSLFRKADMIVVLQDDRQRIRPTEEGTVENFQRFAERHGISYTVDCLASQKRAGYLGSYVADLDRLLYERQDQPLKRQSALELRCWKDLNDLDTHLHQLA
GUT_GENOME045659_030404-411YQETKEQFLLDVYNDKIDEKIEKLVFERLNRRTGYSEYQTWMNSMQYMYKVLENKAIPSNSIVAIEYKVPNSNKRIDFIVSGEDEQGRESAIIIELKQWQSLNKIENMDGLVETKLNGHLTPVAHPSYQAWSYVSLIEDYNEDVRKYKINLQPCAYLHNYRKKEYDDLVDLCYEYYLDKAPVFTRGDTKKLAMFIARYVKKGNPDVMYHIDSGKIKPSKSLQDSLTNMLKGKPEFTLLDEQKVTYEKALAIAKENSARKQVYIIHGGPGTGKTVIAVNMLVELINRDKNTIYVTKNSAPREVYQKKLTDGGYKKVYINNLFKGSGVFINANENDFDCIIVDESHRLNAKSGMYQNNGENQIKEIINAAKFSIFFVDDHQIVTTSDIGSSKEIKKWCKYYNAIVYEDKL
GUT_GENOME273193_000693-317NSRCCYVGTVAEFQKLNEDDWKEYMIRRFHQVYPDFPLDEDENRGQVKAWKNCFKVMQKALTSAKHPDTNYIVFEYKLPYEGGRRPDVLLISASSVVVLEFKDWDSYHVFQADQLIGYKRDLEEYHFATRGKKVYSILVLTLTKGLSCQKKGVHVRSPDMLSLDCLSSEPFDINVWLDSRYEPLPSILQATRLYAENEPLPEIRQARSAGIPEAMDFLHEMVQYAFKNKKHLLAFVTGVPGAGKTLLGLQLVYKETEHSTWFLSGNDPLVEVLRYTLKSKSLVNRIYNIKNEKNINRNVIVFDEGQRAWPEEPQL
GUT_GENOME184671_013413-330SFCTYPIETFKGLHKSEWMETMKAAHEQYGRDFPNRDFHHEQLFAWDDCFDVLQRVLSDFPYPDFYLVFEYILHAENACRPDVILVSSDQVFVLEFKHKEYAPEADIAQADMYGRFMSTCHVASRDKEVITCLVLTKLAAEEERMDGSLHFVSERALKKLLQERIKPPSRPLDIDAWQKSLYEPDKNSLRIMVEMFEQNELPHLKTARSPKIPVASTFLKNLTATAKAKKEHWLCVISGVPGAGKTLLGVQYIYEMRKLDSTYQEDNATYVSGNAPLLKVLKGQLKYPSFLMTAPSLINSHRQGQLKATKLLVFDEAQRMWSKERMKS
GUT_GENOME095664_015551-284MSARCFFQFKGKDCSDSSLRDRAKASLAEHQVEYFDLEKCQMRAWDEELDILIKIGSAYPDCTIAMEYMIPRMGKRADVVLLIDNYIFVIEFKVGGKTYPAAATDQVLDYSYDLKNFHSASENRTIIPIVVCTKAQNGRAELSIGEQGICAVQHSNASQLPHIIMQTMEAVHLPDKALNHDEWLNAQYRPTPTIIEAAQALYRHNTVEDISRNEAGLTNIAATTECINAIISRSKQKGRKSICFVTGVPGAGKTLVGLNLAATRRSEAASSSDEAAIFLSGNGP
GUT_GENOME188411_011524-320AHPRCSFASSLGTFLSVRKEEWLAQMKARESGRPLHEEQIAAWADCYDVLLHTLPAIQAQHPELILIFEYELPYEAGRRPDVILLSQEQVVILEFKMKCRVLRADVDQTAAYARDIQEYHFESRNRKVTSLLVVTRINHTIELRGSVLVSSGDRLQEALLDTLQAGTTACDATAWMSSRYEPLPTIVEAAQMIMRKEALPHIRSADSAGIPQALQCLTGIATYAQEKGKYMLAFVTGVPGAGKTYLGLQYVYESLQAAEQVHSVYLSGNGPLVKVLSSALGSNVFVKDLHKQIDEFVRYQAKDFYQNIIVFDEGQRA
GUT_GENOME092118_0075140-404EFNSWKNSMQYMRGVLSDKDIPSNMGIAIEYNIPPTGCRIDFLMSGYSKSDSHEAIIVELKQWDECTEVDRVEGIYKVNTYTGGALRDVNHPSYQAMTYANLIREYNKNVEDKKINIRPCAFLHNYYITNNDPLLKEQYQEYIKKAPIFAHSDVLKLREFIKKYIKVGDDKKVLYEIEKGKIKPSKMLQDTLENMLKGNKEFYMIDSQNIVYQYALKIATDTLKSNEKNVIIVKGGPGTGKSVLAINLLVELNKKDMTCFYVTKNSAPRGVYSTKLKGQYTNEYIEHLFQGSGNFFDKENNVIDCLVVDEAHRLTEKSGFHSNLGENQVKELIKASKFSIFFLDENQRVTLKDIGSEELIRKYAN
GUT_GENOME100408_038353-408SFYAASSENFFQDVKQGRFQQLMIENASRQNINVGEAEQRSYKASGQKIKELLKAGRIKDVYLVFELMVPYSGCRIDCMIFGCDSEDGKNVMHIELKQWSEDGIFPASSAGNFVEAYTGGKIQTAAHPSQQVKGYHGYLKAFVEAVSTDSLNLQGCAYCFNYIRKDDDGVLFNPKFDVLQEEFRTYAANEKDELAGLLHTLFSHGSGEAVFNEFCFSPLYPSVNLLENISDILTLNDKLSLVGKQIEARNDILAYLRNSDRYGKNVVIVNGGPGTGKTVIALKVLAELSAKGRYRVMFATKSKPLLEAIKCMVGRQEANLLFHNLNDFIPARCMENGVDVLLVDEAHRIELSPNNKYTKKDHWTELSQIETLIRAARSCVFFIDDRQAIRHSEIGKTGLITSCAQT
GUT_GENOME236065_010426-434YRANLSRFRRELEAIPYRLAGCKSPFDAPESVTKRSEFRAWTNSLRAMRLALDDARLPDDTGIMIEYRLPATSRRIDFIVAGNDASASPGFVVIELKQWSEAKPDPECPSMVFANTGNSHCGETPHPCYQAQSYRDFLALMNTSVEIRRLRAFSCAFMHNYPSRGAADPLARPPYAELVHDSPLFGRFDSGALGDFIRARVGSGHGEDIVNDLFRAEIAPSKSLIENIRTLFEKNAKERFVLIDEQRAAYETILREVRKARETACRRVVIVNGGPGTGKSVVAMTALVELLNRSKGRPAEERNIRFVSPTASFRACMIEMLCAAPLRKGGHIVSSKGDVGKLFCGSMGFFAENEDRKAKGQKAQHYSVLIVDEAHRLHSRQNMYRGRNQIEDIIEAAQTSVFFIDDNQALRPDDIGSVAAVRRAAGKFG
GUT_GENOME019283_0097536-403TGEAEFRSWVNFLEYMYKVLNDEAIPNDSGIAIEYNLPNTSKRIDFLISGYDESKTANVVIVELKQWSEIKKVDGLDGLVETYTGRANRRVVHPSYQAWSYAQMIRDYNENTQVEEVQLWPCAYLHNYLRTPESDPLYDPAYKDYLEVAPAFAKGDVRKLRDYIKRVVKDGDDGEILYEIDNGRIKPSKSLQDAIVGMIDNNPEFNMIDEQKVVFERVMELSRQCEKDGKKRVLIATGGPGTGKTVIAINLLARLTQEGVFAQYCSKNSAPRDVYKRKLKGHRTISCIDNMFRGSGTYVDTPRNAIGVVLADEAHRLNEKSGFYGNEGLNQIHEIIHAARLSVFFIDESQRVTVKDIGSADEIKRWAA
GUT_GENOME110899_001002-404IVYSGSKREFLRSTYNDQIALEIERNVLERLGRHTPESELNSWRNSMQYMYRVMSDEDIPMDAGVAIEYNIPQTAKRVDFMITGYDESGKPGMVIIELKQWSELKKVENSNTLVESFVGHGMRLLVHPSYQAWSYAQLINDYNTYVQQEQVKLSPCAYLHNYQRQELDPIDDEQYAMYTNDAPAFTSGQAGKLCSFIKKFIRKGDNSELLYQIDHGKLKPSKSLQNAIASMMKGNEEFVMIDEQRVAYEEVISLSERCQRDGKKRTLICKGGPGTGKSVIAICLLSELTQRGQFVAYVSKNSAPREVYAEKLKGQIKKSSVDNMFKGSGSFTTAPKDIADTLLVDEAHRLNAKSGMYSNLGENQIKEIIHSAKCSVFFIDENQRVTTSDIGSVDEIKEWAHKE
GUT_GENOME069746_0179339-408SEVKSWEFSLKQLAIIFQDLNFVDHGILLEYQIPLTSKRLDCLIAGKDSNNIDNAVIIELKQWSNTKPSDADGEVITWLQGEERDILHPSVQVGQYLKYLSNVNTAFYEDTPIELSACAYLHNYMLKDNDPIMDNKFDDIIKQYPIFTKGNEKYLRNFLKEKLSNGKGLNIIEKINHSPIHSNKKLMQEISSIIKGKSEYILLDEQKLVYDKILSIIKTAYREKKRQAIIIKGGPGTGKSVIALNLMADLLSKGYDTHYATGSSAFTKTLRTAIGKENNLCCDTQFKYFNSYVNAECGSVPVIICDEAHRIRETSNSQYTRINHRSNKKQIDEILWAGKVCIFFIDDNQIVRPNEIGSSNYIKEHALHYG
GUT_GENOME024288_0049628-420SFVEDVKTNCIADKIADKIREHGINAGQEREYISWQNSLQFMRNVVDDREIDDNVRIAIEYNIPLTSKRVDFIICGADSNNQDNVVIVELKQWQKAEVVADDMHYCVKTFVGGNDRIVCHPSYQAYSYACFIKNYSQTVLDNSIQLVPCAYLHNYDPQFKQNLSNPIYNEWINEAPIFIRNETKIFSDFIKKFVSRKSSNRDLLYTIDHGRLRPSKALQDSLSSMVKGNKEFMLLDEQAVCYDMCLKTMTKCKEDGKKRTIVIQGGPGTGKSVLAINLLMEFINKGLNATYVTKNSAPREAFLHILTKSDAKKLVNIKQLFRSPFGLTQIPNNTYDCLIVDEAHRLVDKMYGDWEGKNQVKECISASLLSIFLLDEDQAITNKDIGSVDEVRK
GUT_GENOME076214_001454-407YQNSVTGFQNDVDNNSIADKIEQNFIEKVGHTVSSSEKRSWNNSLYRMGTIVRLSKIPDTCWILIEYNIPSTSKRIDFLVCGQDKHKNKNFIIVELKQWEKATRTDMPFLVNTFIHGNYHDVTHPSYQAYSYKHFLCDMSTAVYEKNLHPYSCAYLHNYKVSKKEPLLDKQYAEFYTDTPMFFQNDASKLERYFYEHVGQGNGKGIADDFEDSKIKPSKKFIEYVSELFQNNPAYILLDEQVIAYAKIMKYACASDQRMTILVNGGPGTGKSVVAMNVFVDLLKAGKNVQFVAPNSSFKSAMIDVLAWHKVEAKNRLTKIFSGSTKFYEAPPLSYDVLIVDEAHRLKAKGTYMYKGDSQVEDVIKASRVNVFFIDDEQMIRPNDEGSMDYVEAVAQKNHSEVIK
GUT_GENOME282466_0050179-360RCLYNNNFNDFYSESNDSILGKLVNNYHGEVSTASRDAWTGEIEIMKNYLSNYLGEEGQIIFEYDIPRLGKRIDVVLLFKGIVFCLEFKVGETKIQEGDVDQVFDYALDLNNFHKYSENKTIAPILIATKHNSSSTSIQKSVYSDNVINPLVTGEKDFPKVIEAVLKTYPNESQIDRNWIISPYAPTPTIIEAARALYENHSVEDITRHEADKVSTDKTISYILKVIADSKANGKKSICFVTGVPGAGKTLVGLDVAIKQTYQGHDEPVKDEGAVYLSGNGP
GUT_GENOME194757_0101916-395RANVIADTINNEVSRRLNLNSKRSEVLSWENSLRFMMSVLLDEGIPASAGVAIEYNIPLSNRRVDFILTGKSSQRADTAVIIELKQWQSAEVTKKDAIVKTWLNGGVRETNHPSYQAWSYADLINDYNETVRSESIQLMPCAYLHNMKQGDAINHAFYSHHTGRAPVFISPDALKLSAFLSQHIKYGDSDNIMYRIEHGVIKPSKNLADALASMMDGNAEFLMIDEQKLVYETALDLAHRVAKGGKQCLIVKGGPGTGKSVVAINLLVELTKREMMTQYVSRNSAPREVFKKRLTGTRKKTHIDNLFKGSGSYVNAEPDTFHALIVDEAHRLNEKSGMYQNLGESQIKEIIVASRLSIFFIDEAQRVTLKDVGTVEEVRR
GUT_GENOME065557_0053035-398VGESEIHSWQASLSYMANVMRDFAIPDLAGVAIEYIVPNTQKRVDFIITGLDQQDKEHVVIVELKQWGEAFKVTDKDNIVSTYLGGGIHEVTHPSYQAWSYCSLIENFNEDVQTRPIELHPCAFLHNFDESISPELRDPIYSNILDISPMFTLGQMDGLRNFIKTYIPKPDTTNIMESIEHGKLRPSKSLQDSILNMLKGNKEFVLIDDQKVEFEQIKKAALEAIKNNQKTVYIVRGGPGTGKSVVAINLLAECIHKGYMAQYITANAAPRNVYSAMLQQRYTKSEIKALFQSSGTFHTRKKNELQIAIVDEAHRLREKSGMFQNQGEDQIKEIINASIFSVFFIDRNQRVTFSDAGTIDKIRY
GUT_GENOME156757_0145737-479LKSMIAKRVIYQEPQSIFFKDVMTNTFVDKMIKTAMYYNINPSDSEVRSWGNNAPKIKDLLELSGITDSYITFEHLVPYNMKRIDCILYGRNLLGQGNVVHIELKQWSNKGVQAAISEGNFNIDEISDVTYKVQAYTGRGNRLVAHPSQQVRGYNDYLTGFIEVLSNKELLIEGIAYCYNYRKNVKPNALYDARYDELQKTYKTYAGDEVQELAKRLQSALGRGDGLTIFNKMINSPIRPSKKLLESAASLIHEGNSSEFALIEEQIIARNVILDKIRKIGKKKSVIIVKGGPGTGKTVIALHILSILAKHKKKYNIHYATKSKPLLEGIKYRLPRGSKAKYLFSNITQFLPANFERDSLDVLLVDEAHRISNNANNQYTPSNKRTNLSQIQTIVRASKISVFFIDDKQAIRSVEIGSSELIRDCAKEYEADIVEIELKSQFR
GUT_GENOME000609_012898-391YCAAVQDFLAMKSYEIAGHLKEITKGHTSESEYQSWCRSLPILLGVLHRAGMDRLSLALEYETPLGKRIDAVLLGEGRVTHRPLALIIELKQWSSIGENTKGKETFVRVCISPKEKLFEDREHPVQQTLTYKKHLKRNHSNVSSGKIDVLCCQFLHNFEQKEQLFEENYSDYAFLKKETYVKGEEEKLEAFLRDTFSSSPNQEAVDLFLNGEYVFGSCDYEALRSVNEGKENAVMIDEQKEINKRVYKILEQLKEDPSHRELVVISGAPGTGKTVIGLHMLWLYGIIFEKTRQHGLKCVFSMPRSRTLAQVIQGASHIPIVYLENIAAGMDMAVIDEAHRIERLDEAMRTLFTKAKIVVVLQDDHQRVRLSEEGTVENFVRFAG
GUT_GENOME179847_01807114-373QKNAWLKEISILQNQLTDYPEGEISFEYTIPRIGHRIDTICIIDGIIFLLEFKVGSSKYTKNADDQVTDYALDLKYFHEASKDRYLIPIVVATEGAVQPVSIQLMHDKISMPLHCSQESIATAITATLSILHDAPLSLSTWQNARYAPTPTIIEAAQAMYRNHSVYDLSRNDAGAQNLTATTMAINRIIDHCKRFHEKGICFITGVPGAGKTLACGTGACSSAVAAHFEKGLIEKLQNVVSHDFAVMTYTEAIEALKSSG
GUT_GENOME046289_026284-316CFDSDIKKFIEIEQDQWLGEMINNFKLIFTGEFPSDEQVMAWKDCFVKLQKELKKISFLEGHIIFEYLLPMEGGRRPDVVLLLEDKVFILEFKMKKTYSHSDLDQLKGYYRDITGYHRESQQLTVIPFLITTMIEDKKIVIDGRYNICSCNMLVDTLNTLLSGKNLNIDPKIWMESEYSQLPTLINAAIDIFNNNDIKELRSAKSAGIYTALEKLKTISEWTTDVMKNTSKSNSISFVTGVPGAGKTLLGLEFIHHSKGGSFLSGNGPLVKVLQYALQSRTFVTDLYKFKSEYTGNSKQPHTNIIVFDEAQRA
GUT_GENOME260204_0043916-400RRDVLVEKIETNYRQKIGGVKESEKRSWENSLRRVVKILNNHDIPDDAGIAVEYKVPETDNRIDFMISGYNEEGKGIIIIVELKQWSKVDEVPDSLLVRALVSKNHIDEKAHPSRQALNYAGYLSNLIEPVIEDKIRLYPCSYLHNYSLKENDPLIDSKFNSLFRQSPVFTIDNEDEIGEFIKKYIKKGDRKKVLFELDNGNIVPSRRLQDDIANMIEGKKQFYLLDTQQDIYEKALNMAQETFKTNEKHVLIVKGGPGTGKSAVAIQIFTELYQRKIIAHYTTMNGAPRKVYFEELKNTEDQEYVRNLFTGAYAYDVAKTNEVPVIIVDEAHRLKEKTYVGKVRGNNQIREIINAAKFSIFFIDEDQIVTDNDIGTIEEIKRCC
GUT_GENOME137526_0024746-334AWKSEILFLAAQLSNSKLSDDVRIIFEYMLPNENTQRPDVILLFKEKVIVLEFKTGGNKVTLDYVAQFMDYQSILKTYHSVVNKKNMDVESYLVMCAYNACDIDLTELEYKLDENDKKRILGKDTFRTLVNKMEEVEPLSENEVYEWVNGQRVRGYKVWEVGQRLKEDLKDGTKNIYSKISSIPYQYLHTTQDKIFDLVKKDEKNIIFVSGVPGAGKTMIGLITLFGCHGEHMNARYYTGNGALQNVLSKTLSEKSEIQMLSSFRTNYENSNFCEERVLIFDEAQRFWK
GUT_GENOME000338_0475921-418YISNAGEFKNSVETNKIAEKIETAFREKMGYSVGHSEKASWNNSMQFMERIIRNSGVPDDCGIFIEFNIPNTRMRVDFMISGHDKMGNKNFVIIELKQWSSANSTDKKDVITYVGGRNKPVSHPSYQAWSYKQHLMNLNEGIHSNGLNGYSCAYLHNYIEREFEPLNEEQYCDYLEDSPIFFSNDYNKLQEYLFKLVGKGDGMEILQTIATGKIKPSQKLIDHVAGLFKGNQEFILLDEQKVAYETIMSLVTEQDSKKTIIVKGGPGTGKSVISMNSLGTLLNRGFNAKFIAPNASFREVIVERLIKENPMDRARIKGLYSGSGQYYNCKIDSFDVLIIDEAHRLKQKGAFQYKGENQVEDMIKTAKTSIFFIDDTQRIRPEDIGTVAEIKRVAEQYG
GUT_GENOME115382_006703-438QIVYCETMRDFINSCLVSKDIGQKVKQSMYNAGYPHVMDNWVTSWQNSLPEIALALKDSGIDDDIDVAVEYRLKHSMERLDFLIYGLDENNHKNMVIVELKQWSQVQRVDSLNKVHTMVAKGVFEDHFHPSYQAYNYAGQLKSFNEYVQNEKVGIEACSYCHNMDDGFRSVMDDISMFPFISNSPAFLNGDGEKLKEFVKKYVSKKCHEILFEITNAKTVPSDDFAKLIRDALSGNQMYTLDDRQSYALSTIVDTVRDAIYYDQKKTIIIKGGAGTGKSIIAVNALGQLNSPKKKSERVTTFYVTVNAAPKKTMFNGLKYGRAFKTDDLKELFKYPTAFVNRPKNEVPCIMVDEAHRLFKWKGGVGLKSGVNLLKEIINTARVAVFFIDDNQAVTTEDYATIENIKALARECHSKVVIGEELELTSQYRVQGGYNY
GUT_GENOME058155_0022790-410ANTHQYAFCSTVETFLSMSEAQWISDMQDGFNQSFKLPLGNIQIGVWRDCFRVLKDALPRFNAGYPNFHIVFEYALPYESGRRPDVVLVSNEFLIILEFKKKNSVLRSDIDQTAAYLRDMEEYHYESRNKNIFALLVVTEMKDQYYKEDTISVVSGDLLYTGLADCVAGRTTACDIDTWLHSRYEPLPTIVEAAKMFVKKEELPNIRRINSTCIPQAIECLNEISEYAKSNRKHVVAFVTGVPGAGKTFLGLKYVYDICNDYENVNSVYLSGNGPLVSVLTDALHSSVFVKDLHKVVIQYITNEAPDFNNNIIVFDEGQRA
GUT_GENOME071764_0036238-412SEILAWRNSLRNMYSVLNTDAIPNNAGIAIEYMLPYQYSRIDFIVTGKDNFNSDHVIIIELKQWTEAKAISDKHLVETYVGGGNRELTHPSYQAWSYAETLKEYNKTVQDDNIQLHPCSFLHNYVVIGNDALLNNEIYPEITDAPVFTRDDAKNLQNFILKYIKSGDTNNIIFRIDSGKLKPSKSLQDSIAKMLEGNREFIMLDSQKMIYENIMYYVNHMKYEDNKKMVIIVKGGPGTGKSVLAINLLADIINSDKSVAYVTKNSAPRNVYFEKLSKNFKKNVVNNLFKSSGIFYDSPKNAFDILLVDEAHRLNEKSGLFSNKGENQIKEIINASRISVFFIDEDQIVTSKDIGSIDLIKSFAKQLNAKIKTLEL
GUT_GENOME165701_0278730-386THNFEEQRSWRESLYRLIQIVNTSGLSDLWLIAEYSLIGTHRIDAIICGFSRLFGRPLALIVESKQWNKIGDGGAQHADQVKMNFANEYRMHPVAQMFQYEHQLETNHSVVKADNIQLAVLAFMHNLPESQKPKLFTDEYRAFAQYAHLVFVKDDEARLTHYLGSLFKKCSGEQTAKQFLAGHYELSRADFFGITKILQRKDNAVMLADQIEVSLAFDQLCAEFKKNPRHMAMIVQGSAGTGKTIVGLHLEYLAVQRHLIPLRRTIFTFAKSRMLHNILGQAMSASGDPRQLPYLDMLSEHDYSLVIIDEAHRMSDVPQHIYTLVGPNTRPKILVFLVDDHQCVHVKECGTEFVLKC
GUT_GENOME102038_0151524-298RYYYSDTISMFLDRSTDEIIGKLALASQHDINDETSNSWLEEIESLRNVLVPYKNKGSIYFEYNIPRMGKRADVILLINELIFILEYKTADSKFTHDAITQVWDYALDLKNFQEGSLERIIVPILVAPSEKDKNCIFVLHNFGDDVYEPLLTNANHLDEAISIPLSQIPHSVIEHSAERDERWAKSGYEPTPTIIEAAVALYEENTVEDITKHGGDIDKASAELRRIIDYCRENSRKAICFITGVPGAGKTLIGLNTAIDQFNRGEKAVYLSGNF
GUT_GENOME096513_0488127-394SEINSWNNSMNYMYKVLNTPIIPDDVGVAIEYKVPTTSRRIDFMLTGLDKQDRYSVVIVELKQWTALNKIESLDGLVETVINKRLGKHTHPSYQAYTYARLISDYNETVQNESISLYPCAYLHNYIREENDPLTDAVYESWIAEAPVFTKGDALKLRGFISTYIKKSDRKKSLYLIEHGKIKPSKSLQDSLLKMLDGNEEFLMIEEQKVAYEEALLMAKKAQSTDRKQVLIVEGGPGTGKSVLAINLLVKLTSQEMVCQYVTKNSAPKRLQQGYRKTVINNLFKSSGVYYEALPNEFDVLIVDEAHRLNEKSGLFKNKGENQMMEIIRAAKCSIFFIDEYQKVSMQDAGNKEQIKQFAQLYDAEVVEI
GUT_GENOME096495_021704-404YAKTVGEFIDEVENRTFLAALEKNYRVLVGKVQAGEYRAWQHSVQEYMYAVIKKAGLPADAGIAVEYQVPGLSERVDLIITGFDDNGEMTAILIELKQWSAIECCDSAVSYKVQIPMSGGNELRDHPCYQINYYKNVIAGGISEEDFPLHLCPAVVLHNYSLQENKDPLFQNGYKNYMLNVPVFFKGKDEKLESYLAEKIKSGDNGSVLTYIETKCCGISLTLLELLKKLSVNDSSLRLNAEQQRMQGTVMKMLNTAKRDKRKKVVLVQGGPGTGKSVVALKILAACLCEHSDMNCYYTAKNKALRAVLRFDLSQIAGMERFSSSIVYPPFLEKLERNKASVITVDEAHRLSEEKQIVSIIKAAVVSVFFIDEGQIVSFEDIGTKEVIMECAKRQDADIID
GUT_GENOME053977_003184-425YKNTLNGFIEDCYPKGNTVRNIADIVKEKMTLCGLGGTNDSQVQSWRASLPEVAKVLEDSLFDRAIEVAIEYKPSISKERIDFLVCGTDEFGNKNVVVIELKQWSQASRSNLKDFVWTFGGHGLDDYWHPSYQAYNYTEIIKNFNEYVQTADVGFHACSYLHNMPEELRGLLDNEDTYPLVKTSPAFLQNDILKLREFVGKYVTKPFKDPLTGFSLLYTIDNSKMVPAPKLVSTLTEALKGNDFFSYDEGQANAVATIVNQVRAAKEKTVIIIKGGPGSGKSVVAINALGKLVSGELAFNKKQKQKPLTCAYVTANSAPKVSYSQELIQNNFKYATLKELFKGPASFKESRTNDFDCLLVDEAHRVFEYDGGSFGLPKTGLNALELLIRAAKVCVFFIDEDQQVTTHDFATIDVIKKTVKKM
GUT_GENOME069336_0111973-471LLENLPNDVGIAIEFNIPLTSKRVDFVISGFDLHHMPIIILFELKQWEYVDDVKNQDAVVKTLISNKEKPSLHPSYQVLCYSELLENYNQYIEKSKVKVIPIVYLHNYILKEHDILFANKFKKYYKKAFIFGKEDSQKLKSFIEKLIVFGDDLNLIQQIDQSKTKPNKKLLESLDMMIENNKIYNLLDEQKTIVEAIKYEAKEAFTTNKKKVIIVKGDPGTGKSVVAIYLLGELLKLGLMGAYVSKNMAPRKVYKNSLVHNNEISVHELFKSSGYFFKDSQNKYDFLLVDEAHRLQEKSGIHNNIGENQIKEIIHASITSVFFVDEKQMITLKDIGSIANIKYFAKFYHASISELELHSQFRCNGSDSYLDFIDSFLYNKRGNYKFNFDFQVLDTPNEL
GUT_GENOME020796_0070194-459EVRSWSNSLTYMGMVLENSTVPSDTGIAIEYNIPYTSKRVDFIMSGYNAEGHNSAVIVELKQWEHADSVPGKDGVVRTYINGGEHETTHPSYQAWSYATAIGDFNADVQDLDVKLQPCACLHNYIERTPEPLLDSMYEYYIQRAPVFSKKDGKKLKQFLENILKKGDKGETIFMIDRGKLRPSKSLQDVLSSMMIGNREFVMIDTQKVVYEEILHSMRQLSKSDRKKTIIIRGGPGTGKTVLAINLLAEIIAEGSSAAYVSKNSAPRSIYNCKLKGSMKRSRIDALFKGSGSFIDAPENTFDVLLVDEAHRLNEKSGLYASLGENQVMELIRSSRLSVFFIDEDQRVTIKDIGTESEIRKYARRFN
GUT_GENOME041657_00002141-515EKMGYHTSESERKAWNNSMQYMMKVLIDNKIPGNVGIAIEFKIPNTSKRVDFIITGKDGQLKNTAIIIELKQWTEADMVEGKDGIVQTFTGHALREVAHPSYQAWSYATTIEDYNETVQDRQIDLHPCAYLHNYLAVTPPTLLSDDYKTYLEKAPAFIKGDVEKLRDFINKYIKYGDDKETLYMIENGKIRPSKSLQDALSSMLKGNEEFVMIDDQKLVYETALDMARKSYRDGDKRVLIVKGGPGTGKSVLAINLLVKLTNENMVCSYVTKNAAPRNVYATKLSGDFRKTRINNLFKGSGSFVESSENEFDVLIVDEAHRLNAKSGMFQNLGENQIKEIIKASKFSIFFIDESQRVTLKDIGSVEEIQKYIGQA
GUT_GENOME199378_014834-403YHASKKTFINDVFNNTIADEIENAFLAHLGRHTSYNEVLSWRNSMIHMYKVIDTPDIPNDASIAIEYQIPLTSKRIDFIISGFDNENKGHVVIIELKQWEQAKLSPKSALVKTRFQHGESEVAHPSYQAWSYAYMLINYNETIRDQGINISPCAFLHNYQTDDVITNPIYSEYIEKAPVFLKTDAQKLQNFIKDRIKYGAKDDIVWLIDKGKLRPSKQLADALTSMIKGNQEFVLLDDQKVVFETAIEMANKANAGKKHVLIVEGGPGTGKSVVAVNLLVQLTKQGIVTQYVSKNAAPRSVYTNKLSGSFKKSYIDNLFVGSGKFIDVPESTFGALIVDEAHRLNEKSGLFSNLGENQILEIIRSAKFSVFFVDDKQRIHIKDIGTKREIKRIADSYNAV