UHGP-MC 99853


Information


Number of sequences (UHGP-50):
420
Average sequence length:
72±6 aa
Average transmembrane regions:
0.1
Low complexity (%):
6.59
Coiled coils (%):
0
Disordered domains (%):
10.27

Pfam dominant architecture:
PF01176
Pfam % dominant architecture:
71
Pfam overlap:
0.86
Pfam overlap type:
equivalent

Downloads

Seeds:
MC99853.fasta
Seeds (0.60 cdhit):
MC99853_cdhit.fasta
MSA:
MC99853_msa.fasta
HMM model:
MC99853.hmm

Sequences list (filtered 60 P.I.)

Protein Range AA
GUT_GENOME165981_014603-79EVRAGMAAVSMAGHDTGRYYIIVRAEDGYVYLADGTLRTCEHPKKKKMRHVQINRRISPEIESILSEGGEWKNETIK
GUT_GENOME090322_010297-78GMFCESRQGRDKGRYFMICNILDERYVEIVDGTMRRIERPKRKKLKHLRLTPEISPIGEKFLSGAKVFDAEV
GUT_GENOME258643_0013318-83LKEGCVVKSLKGHDTGRIYTVIAVIGDSFVLVADGKYRKLDNPKQKRIKHLEIVREVALPTSLTDS
GUT_GENOME096502_003488-84KVGQVVSSKNGRDAFKVFVIVEILDDKYVLIADGKRRTLEKPKKKKVNHLNIYKKVFEEIKSKACGKYQLNDAYIRR
GUT_GENOME245203_006296-82IEPGRAVLSMAGRDAGRRFVALRVDETYAYVADGDLRKVETPKKKKRRHLRATPEFFPGIAEILKDGSLPSNAEIRR
GUT_GENOME258557_012348-71RGRVVVSRAGHDAGRWYAVLDESGGWLLLVDGHARGLAKPKKKRLKHVRPLPLTVAVAGTGASG
GUT_GENOME046367_004353-68IVKGRIVKAKAGRDKDKFFVVVDIYKNTAYICDGKTRRIEKPKRKNVLHLAPTAEMADEFNSNRKI
GUT_GENOME030350_0059313-81DRWETGMLAKSLAGHDKDKVYVIVGLNEQFTYLADGERKTLQSPKKKKKKHVQLIREQHEISEADDVKL
GUT_GENOME091104_012943-75EFCTGMFARSLAGHDKENLYIIVRREEQFAWVADGKTRTLEKPKKKKWKHLQVFYEIPGPICELQAQGKPLRD
GUT_GENOME239198_015042-75TGNIAISMAGHDKGHAYIIIKEENGFVYLVDGEIKKLSNPKRKSLKHVRINKQYPTDELTKKLLKNDPLYDHEI
GUT_GENOME092907_007324-83NFDFKVGQVVKSLKGRDKGEVMIVVEVISSSLVKVANGSNRPLSKPKLKKSKHLQIYNDVLKDFSLNPLSFNDSNVRKLL
GUT_GENOME056772_0032720-99LGRLVKSTAGRDCGRLFIVVGEIDEKYILLADGDLRSIESPKKKKLKHVKVYDAQAEQVSGQLCEGRGILNAELRKTISL
GUT_GENOME235523_008631-83MKIGDVVISTYGHDMGDWYIVEDVLNEYVFLIDGKNKPLDKPKKKKVKHILTTNYFAEDIANKLITKQNIQNAEIRKALNFFK
GUT_GENOME234163_019024-75VKTGMLAKSKAGHDKGQVYVIVDTDDAYVYLADGKIRTLGRLKKKKKKHVQPILREFDMTAADDAAIKQILK
GUT_GENOME087607_006397-81IQIGELVQSLSGHDAGGYFLVTGREGGMLCLCDGKRRKAVGQKRKNPKHVAASGMVCDWVRQHPECVNNTSVRRA
GUT_GENOME130650_012973-75LKIGVVAKSKAGRDKHRFFIVTAIDGRYAYLCDGKIHPLKRPKKKNVKHLSLTADTIPAGTFQTDKQIRKALF
GUT_GENOME029237_007752-70ERGQIVKSLAGRDAGEWFAVVSAGEDFALLANGRERPLERPKKKRRKHLALTSGMLDEEALRSNRSLRA
GUT_GENOME114602_000844-65FTPGAVVMATAGKEKGSKFVVVEDFNSRQVLIADGRHRKLARPKRKSVRHLVLLGRLPEVPA
GUT_GENOME254699_011575-82PVQLGSVVLSRAGRDNGRYFVVVGQVDDEHVLIADGRLRKVESPKKKKLKNLTPKPACMSEIQIKLDKGELIQDAQIR
GUT_GENOME220305_007295-82KTHIVVSLAGRDKDRPFVVVETENDFVYLVDGKLRKLEAPKKKRRKHVAFAGQLGERLSGKLLREDKVLNSEIRRALA
GUT_GENOME213313_008642-78QAGDFAVSAAGRDKGRCLAVLSVSDGYCLLADGGVRKCEKPKRKKIRHVTLTGERSEYLAERIKNGEPVTNSMLRKE
GUT_GENOME281143_007287-66IGDAVQSIQGRDTGKVYIVLNVDCDYVRLANGFDKHCNSPKKKNVKHVRLIKPGVLDPLT
GUT_GENOME242942_007627-73LQVGQIVKATRGRDEGRVFVIKEIVDEKMVKIVDGKTRTLEKPKLKKVSHLIISKEKIDLEEVTSDK
GUT_GENOME251530_0095517-95EDPRVGEIAISMAGHDAGLILVVIAGIDGQHVLAADGRTRKLMLPKKKKMRHLNVIARLSEEDVRLLRAGKINDSFLRR
GUT_GENOME147776_015074-70LQIGTVVKSLAGRDKERHSVVVALENEYVFIADGKMRKLETPKRKNKKHIWPTGRLIQMDGATNKSI
GUT_GENOME141704_033794-80ECQIGRLVLSKAGRDKDNSFIIIDILDDNYVYICDGDIHTLDKPKKKKIKHLIFTNIICEDIINLKATSNNIKNSVI
GUT_GENOME081377_020413-68LEVGTIVRAIAGRDADGYFVVTALDGEFVWIADGDRRKLDKPKHKRRKHLRRTCTVMDLNGITDKK
GUT_GENOME031246_0019010-81GYVATSKAGRDAGRSFLIVEEVDENYILLVDGTLRKLARPKKKKRKHVRIHGECAENIKIKLLDGKQVFDAE
GUT_GENOME137639_002362-68KGRLAFSKAGHDKGKLYLIIREEGERVWLADGRTRGVLSPKKKNRKHIQPAGQEFSEEEVDLFFEKP
GUT_GENOME063692_026675-71RSKSGHDKNQIYLIKEKDEKIVYLVNGTTRTLDMPKKKNAKHIQIIKNLPIEVTEILEENLSDLTVK
GUT_GENOME253443_004061-77MNIGDIVVSTAGRDAGKYFVVVGIADGQYRFVCDGKLRKTAKPKKKKIRHLKYFCSDADAQTALTLNQPVSDKFIRS
GUT_GENOME276075_000914-69EKGDIVISIAGRDCGQIFLVWSLQGDDVFLVNGKLRKINSPKKKNVKHIMFKNKSPYFKQMCDKGQ
GUT_GENOME112053_000272-84DNLIGNLCISTYGRDAENEYLIYAHTGNYCFLVDGNAKKIINPKKKNIKHIKLLGIESKAIKEKIEKNKKVFDSEIYSFIKNY
GUT_GENOME043547_0163822-89GSLVRSRSGRTRGSVFAVVATETDAKGCLYALVSDGRTRPLSHPKRKSAAHLETLLPAVGRTPFGTDE
GUT_GENOME237866_020166-86MVGMLASSKAGHDINKVYVIVAEEGEYFWLADGRCRGVLKPKKKKKKHIQLIKYFSDDVLDECVQQGKNFSDVEIKRVLKS
GUT_GENOME153580_007526-69YCPVKSLAGHDKNQYFIIVSDEGEYITMANGTTRKVDDPKRKNKKHIQAGKTPLFAALPTDEEI
GUT_GENOME150267_0003812-73LGISKAGHDRDRLYIVLGARNGMFLAADGVHSHVNKPKRKNEKHLQLIYNVTPEIRDLLNEA
GUT_GENOME233273_024231-77MELRKGMLAISKAGHDKDSWYIVLSIEGNHVFLVNGKNRTLDRPKMKKRMHLQPVNTVPKVLQEKLDKNMQWTNEEI
GUT_GENOME238973_003423-67MKPGSLVRSKAGHDKGELLCVVGLEEKTVWVCDGRQRPLNRPKKKNPKHLGLLPLTLSAEEMQND
GUT_GENOME081289_011532-80EIGVGSVVCAAAGHEKGGLFLVMDTDGDFVLIADGKRRKVQKPKRKKKKHVIPIELPPEENLLAGGTITNTQARKALAK
GUT_GENOME239041_005782-73IGQYAFSKAGHDKGTLYMIVAEEGDFVYLTDGRLKSPEQPKKKRRKHIQPVNSFAGEALRKRMESGEIVRPE
GUT_GENOME219301_007205-85LLGRIVQSKAGRDKGRRFAVVAIEDDAHVRIADGDTRKIARAKKKKLKHLSFERARIENLESMLTATGNTADAALRKALAH
GUT_GENOME096873_0068316-93LLGKVVISKAGRDTGDYYIVIEQIDDNYVLLADGRLKTIEKPKKKKLKHLKMTDVLATDIREYIMSNKRILDVMIKKF
GUT_GENOME244169_004024-72QAAVGRVVKMTAGRERGRYYVIIGVESEQYVYVADGRKRTVAHPKRKNVRHIQAVSPQRANPTPVHEWR
GUT_GENOME093909_001236-79GLLVVSTAGRDQGKYFIVTKVIDENFVLISDGKVRRAEKPKRKKIKHIKSMGLHSERLVEKLGSGSKLTNPELR
GUT_GENOME235148_007521-70MKTGDVVISVSGHDKGRLYVVVQAGEFSLLCDGRRKLLHNPKKKNSRHLKETGEWVDLSSYNPLYDAHIR
GUT_GENOME264627_008345-74TGMYAYSKAGHDKGTLYLIIKEDKESVTLCDGLLKTIEHPKKKNKKHIQPVKRSDDALRKKLLAGDAIRN
GUT_GENOME236884_004451-66MVKSVAGRDGGSYAFVVGEPEPGYVLIADGRIRKVENPKRKKLKHIRFIADAPRLDDRKVTNRMLR
GUT_GENOME179148_012624-73MKTACLVKSLAGHDKNEFYIIIRVQGEYAYLSDGRKHPLDRPKRKNIKHLQPLKEYDKGLREKLIMGSRV
GUT_GENOME009122_000284-76QFTAGDIVKSVAGHDKNRIFLVVSIDKIGYPVIIDGRFRVRNKLKTKNPKHLVKLAHDEMIYNKFLSPIVTDT
GUT_GENOME207678_019857-72IGQIVVSQAGRDQGQIYQVVAITKEGKLLLADGRKRGLDHLKQKNPRHVRIVHSIATGEATQGSFS
GUT_GENOME021926_002505-87NFQIGRLVRSRQGRDAGRYFVITGLPEENYAYISDGAVRRLSAPKKKKLKHLELCPEEIPAVRAKLEAGSKVFDSELKSAIEA
GUT_GENOME236326_005142-69LQRGAVIRSLAGRDKGRLLAVMRAEGRTVTVCDGSERPLNRPKSKNIRHVEPAGVSLPESELAVDRAL
GUT_GENOME279992_014825-80KVGNLVRSKAGHDKEELFFVIDIVGEYIYLVNGKTRTVSHPKKKKLKHLQCFNYQDEKLAQMKEQGISFRDEDIRC
GUT_GENOME013604_000753-70LKTGSVVAAEAGRDCGGFFVVTDIDENRVYIADGKSRKLKCPKAKNIKHIRLTNSMIDLNDITDKKLR
GUT_GENOME122083_005694-68LQVGSVVKSLKGHDKDKIFVVVEVVSDKYVRIADGKTRKISAPKLKKVKHLESNGEILEKVQQII
GUT_GENOME217925_016932-70VEVGTVVQSLSGKDRGTLLVVVGGDEHRILVADGGHRMLAHPKAKNPRHLRQSGIRLTADRYRTDRQIR
GUT_GENOME079930_000787-83VLGRVVVSRAGRDRGRAFLIVGVVDEAHVLLSDGATRRLARPKKKKLMHLRIEPAVAGEIAEKLAGGAALLDADIRK
GUT_GENOME027392_001067-78LSPGTIVEATAGRERGQWFLVLRCQDASYVELADGRKRPVAQPKTKNRRHVRVRVAADSDAAYRRTHPHSPW
GUT_GENOME062401_004885-83FMTGDIVESTAGHDHGKYLIILGVNEGRIVVADGDTRTVAKKKEKNPAHLRFAGKSDEGVLLKIREGRVEDHEIRKQIK
GUT_GENOME258383_014957-81EVGRVVMSRQGRDRDRCFVILSVVDDQFVMMADGLTRKLDHPKRKKVKHLHAKPVKMQDLAQRLASGQVLDSDLR
GUT_GENOME170192_002732-68IKTGSIVRPIAGRNQDRFFVVIQTDADCVLLSDGRRRKLNHPKRKKQKHLEDTGYTLDLLKLTTDKQ
GUT_GENOME070127_006475-81FKKGYFARSKAGHDKDTLYVILDRDDRFVYLSDGRLKEVNNPKKKKEKHLQFIASQAEGIAAKLDSGLEITNEDVRR
GUT_GENOME010958_013685-83LIGCFASSIAGHDKCKTYIITDADDLYVYLADGKYKLIEKPKKKKRKHVQISGIKDINIQLKQDCNSKIINEDIKRAIK
GUT_GENOME034610_011186-74IAAGMVVKSMAGHDSGSYYAVMRVENGFAYIADGKLRKVERPKKKNPLHLQKTLTTVELAEITNKKLRS
GUT_GENOME142591_004057-81FSPGDIVRSRRGKDEGQLLVIIGLESERIALVADGDLRRFDRPKRKNVLHLEPIGIASDEVASSLRETGRVTNGK
GUT_GENOME055312_011265-72LGHLVVSLAGRDKGCICAVVGYADDEGYVLIADGRTRKVEHPKKKKLMHIKPLEPELPVAALPESRLT
GUT_GENOME254341_003021-72MELYRGCVVKIRAGRDADTFMAVADFDNSRVYLVNGKDRCLNKPKSKNYKHITLTKMVLDEETMSADKKIKR
GUT_GENOME109282_007452-67IGTVVFSKAGKDKGSLMVVVAEENGVYFVCDGKKRKLNSPKRKNPRHLCFTDKKISEDKIITNRLI
GUT_GENOME045628_018003-75EFRIGGLARSKAGHDKNELYVITSQDETYVYLSDGRCHPLEKPKRKNRKHLSPETWYDESLAMKLTQAGGARN
GUT_GENOME251709_007744-71KRGQIVFVNAGKQKGQFMIVLSADEKSVLLIDGKRHPYDRPKRKNVKHISATQTIVSENDLAANSRVK
GUT_GENOME086476_000461-68MVISLSGKDRGRLLAVLGGDDRRVLVVDGKHRKLGSPKAKNPRHLRPTAWTLVPSQMRTDRQLRRALR
GUT_GENOME096045_023464-68IQGMIVKSNAGHDKERFYLVVRLENGFAYIADGKRRKLERPKRKNLSHLSSTKRIVDISLYDTDR
GUT_GENOME240840_002477-84LKIGQIVKSKAGRDKDRVFVISRILDEQHVLVCDGDLRKLSSPKKKKVKHLVIYNTVLTEFANKLQSNENLEDAFLRR
GUT_GENOME212238_005638-73LKGQVVRSKKGRDEGKVYIIMEIIDDDLLLLVDGKLRKLDRPKKKKVKHLYIYKDVIDTEVSDFSD
GUT_GENOME102210_002549-82GGICVSTQGRDEGRYYLIKEVLPNGFILVCDGNYKKLASPKKKSLKHVKLLPETAEAIAEKFSYGGKVFDSEVY
GUT_GENOME237426_001484-74LGDKVCSTAGRDAGKSFFVVGILDEQYVLISDGRTHKVDKPKKKKIKHLAFEGFSEEVKQRLITENALTNP
GUT_GENOME096472_0239123-103VLGQIVQVLRGTCAGEYAIIIRKDNSRTLWLVNKDRFTIDHPKQKNVKHVQPTNWIAEDILEILQKNGRLTDAMLRYALNQ
GUT_GENOME181476_000902-71EFQVGDIVRSKAGRDEGAFLAVVGIEDGYPLLCDGKHRPLERPKRKNPLHLAGTNRRLDMQSMETNRVLR
GUT_GENOME208951_000973-76KRGCVVVSVAGRDKGLPLVVLDAEDVSNRALVADGRKRRLEAAKWKNVRHLRETGLSVRDASMATNREIRRALA
GUT_GENOME091787_013579-78IRPGMLAYSLAGHDKGSLYYIVKADERFVWLSDGKLKNTDSPKRKNRKHIQVIKKGTEEVPAPLTNEAVK
GUT_GENOME019241_002976-79IGCFAKSKAGHDKDEVYIIVNSDKNNVWISDGRLKPVAEPKKKNRKHVQLVNDKDINIEEKLRRQEPIADEDIK
GUT_GENOME222413_003758-85GLGKVVRSKNGRDEGREFIVTALEGDFAYVADGDTRKIERPKKKRARHLFVTEETISSLQEKLEAGAPVENCELRKAL
GUT_GENOME013723_000245-72NEMQKGLVVKSQAGKDKGCYFVVIGVEGNCALVCDGKLRPLSRPKKKNLKHLVVTNTILLCSEYGTNR
GUT_GENOME125681_005865-89KFSRGDVAISLKGRDKNKFFIITDFDPENGIALITDGRIRKTARPKKKNAKHLKKIQSAVCTELAERIYNGEPVSDRKVKTAIAR
GUT_GENOME096493_002566-76QVGRLVYSKAGRDKDKCFIIVDVLDDKYVYISDGNMRTIENPKKKKIRHLVFTSILCNEMSEREELVTRIN
GUT_GENOME104556_006934-71IKVGSVVKACAGRDKGRWFVAVAVNGGFIEIADGKERRLEKPKRKNIKHISPTDTLIDTDGLTDKKLR
GUT_GENOME064676_048223-68GKLALSKAGHDKNTLYLIMKEEQETVYLTDGCGRDIRHPKKKNKKHIQLINAGISREELEKYLANP
GUT_GENOME005952_008724-82YETGCLAWSLAGHDKEGIFIIIKEDAEYVYLADGRLRTVGKPKRKKKKHVQASRIRDEKLCDKLMSGSTVTDEEIKYFI
GUT_GENOME119110_0016513-68LEKGCYALSLSGRDSGRIHVVLRCTEDGYAYIADGRCRTVSKPKKKKFRHLRRIGL
GUT_GENOME256395_004266-73LLGSIVTSLCGRDKGRSFVVVEIIDADYVYIADGRLRRVESPKRKKIKHISVLARTDEAGGARMDVEN
GUT_GENOME162978_006637-79KGSIVKAVAGREKNGFFVVLDCDSVFAYIADGKRRKVEKPKKKKLIHISPTGTVIEGSIKTNPQIRKILNNFR
GUT_GENOME138783_022324-76IEAGMMAQSRAGHDRGTLYLILKVEGDFLYLSEGRLRPKEKPKKKRQKHVQVIRRVKSRDFDGTAITNEEIKY
GUT_GENOME258449_0080310-81PGQIVKSKAGHDKGCVFFVVEVLDDEYVLIADGGRRKYDSPKKKKVKHLQPYNRINKTIAEKIDSGQRVENI
GUT_GENOME017092_011375-82CAQIVRSLAGHDRGGLFCVLDTDGPYLLLCDGKRRKLENPKRKKAVHTAPAGDFQHPVLDQLRTGGNLSNRDIRRALA
GUT_GENOME223302_008102-70VGYLVYSMAGHDKYKVYAVIQEDEQSVWLTDGDNRTLDHLKKKNKKHVQLIRHKVSASADAITNEMIKR
GUT_GENOME236871_006271-67MIGQFVISKAGHDKDEVYIIIEENDKQLLLSDGEYKKTGNPKRKNRKHVSFTTEYLKPEQTEKLKLL
GUT_GENOME088989_006642-68KRGQIVFSKAGRDKAHFLVVLSVNGNRCAVADGRLRRGGKPKCKNIKHLGATNGSIAEEDLFSNRLV
GUT_GENOME164598_000077-79LIGCIAVSTAGRDCDRTLVIIGICDDDSVFVSDGRLRKVATPKKKKMRHLKVVSKTDAGSRERIAAGLATDSF
GUT_GENOME009216_008124-79FSIGEIVESTQGRDKGNLYIVYGYENNKALLVNGNNKTITKPKVKNLNHLTSLNFVQENLKDKILNKKTVFDAEIY
GUT_GENOME114249_015645-78PGSIVYSKAGRDKGGYFIVMSQTGEYISICDGKGRKTDKPKRKKIKHVKAGVGFSEFVSKKLAAGEKVTNTELR
GUT_GENOME180753_004233-86VFAGDIVYAKAGRDKDKPFVVIEVLDEQYVLLANGRQRRVDKPKKKKLKHLLKSGHASAYICEKLQSGVKVTNPDLRKVLAEFS
GUT_GENOME280822_005009-88DLGSVVFSLKGRDTGLCYAVVGYLDSSTVLVSDGYKHKLAQPKLKNIKHLRFEGAVLTTIAEKFVAKKKVFDSELRSALR
GUT_GENOME130738_003195-70KGVVVRATAGRDQGMFFVVLERNGTTVLLANGKDRPLHAPKRKNVKHIQLTNRVIALDHLTDKALR
GUT_GENOME170592_005593-67RGLIALSLAGHDKNNFMVVLDANEKEALVCDGKSRKLEKPKRKNIIHLKATTKSLDEKILESNRS
GUT_GENOME153166_0114412-92IAKSNIVKSTAGRDEGDLFFVLDIQGEFLLLADGKSRRVEKPKKKKCKHVSFVGESHSVVAEKIRSSEKITNSELRKALAA
GUT_GENOME111115_000835-83VKIGQVVRSTKGRDAGRLFLVVGFADDEHALLCDGDLRRIDSPKKKKRKHFLPTQELISSLGERLTRGEAVCDAHIRKA
GUT_GENOME114621_000512-72IKQGSIVKPTAGKAQNRYFVVLSIEGRRAAISDGRKRKLQNPKQKGIVHLQDTGCQLDLSEVTTNKQLRKA
GUT_GENOME200541_000565-81YSLGQIVFSKCGRDQGRPFIIVSIEEEYVYLVDGALRKVDSPKLKKKKHIQRTLTTVEWIKQKIIEENRLTNSDVRR
GUT_GENOME120672_003141-79MELGQVVYSKQGRDSGRYYAVVEIVDDTYVKIADGKLRRVKSPKLKKVKHLKTKGDMLDKISEKLKQNAQVFDTELRSA
GUT_GENOME128748_003117-82MEPGRVVQSTQGRDAGRYFVVLQVVDDRHVLMADGQSRKIDHPKKKKMMHLRPKPIVVNVEPQALENKHLQDSDLR
GUT_GENOME250975_004645-62SAVISKAGSDCGKVYLVVGYDGRGYALCVDGKRHKLGSPKSKNVKHIKATGTEVELPG
GUT_GENOME261151_0103113-75CAGTVVYSVAGRDKKRPFVIVGVADEDSGTVYIADGLLHTLPSPKRKKLKHLRITGSVLRNFT
GUT_GENOME158229_005854-70ECGTVVKSLKGHDSGEYYVVVRTDGGFAYVSNGKTKKLAAPKRKNPKHLAVLDKRIDISEITDKVLR
GUT_GENOME239661_023898-77VGTGRVVRSIAGRDAGVFYVVLEDRDGRVLLADGKRRPLERPKCKSRRHIRKTNTVLELSGAATNKMLRR
GUT_GENOME270901_013296-79PGEFAFSKSGHDKDNLYIIIKEEGEYVYLSDGKYKTADNPKKKNKKHVQTVHYEDECISEKIKNGVPLNDVEIG
GUT_GENOME103937_002941-76MKLERFEVVESTAGRDEGCIYLVSEILDENFVLLIDGKTKPISKPKRKKVKHLKTLGAVESELCGTFEDKSRTNDG
GUT_GENOME170512_016625-79IKEGNVVFSKKGRDKGYPFVVLLSLDDDFVLICDGDRRKVDKPKRKRRKHLSATPHEAPEILSLYAMNRLKDSDV
GUT_GENOME000604_0256611-81FQPGLLVSSKAGRDKDKLYVVTGMCGEQLLLSDGTGRTANNPKRKNIKHVQRISLTVWQCVGYSRHPEVPD
GUT_GENOME275209_006593-68IKEGSVVVSRAGHDKDDFFVVLKVSGKNVIICDGKRRTLEKPKVKNEKHLIVTKEKLDASFMQTNC
GUT_GENOME024124_009836-81YSLGTIVLSLMGRDRDRYYIIVGVCQGDFVLIADGDLRRVSAPKKKRLKHLKGTPMRAEGIAEKLSQNKPVSDSEV
GUT_GENOME131790_006304-70SIVVSRKGHDTGRAYLVAAEVGGDFLLLVDGRYRTLDKPKLKRAKHVKYAGKCALDVTSATDADIRK
GUT_GENOME258233_005266-80EGDVVVSLSGRDKGRLFFVLSCDDKAALLADGRVRRVARPKRKNRRHVAPSAYENPVVREHLSRGAPLTDAMLRK
GUT_GENOME121521_000453-66EYKLGYFATSLCGHDKGKVYMIVKQEGEMIGLSDGGRRTIDNPKWKKKKHIQMIRSERMAEAFP
GUT_GENOME007813_0103112-78VGDVVQSIKGRDRYRVFLVVDMDESDKVSPVVIADGRLHKLDEKKHKNPRHLRIISEHDKIDKTAMS
GUT_GENOME047723_006745-67IRIGSVVYSKRGRDAGGYFMVSEIRDADFVFIADGHTHKLARPKKKNIKHLKNSGHVLEGIAE
GUT_GENOME017737_017443-75GKLAISRSGHDKDSVYVIVKEEEIWCYVADGRLKPVERPKKKNKKHIQIIKRLPKEITEMLPQDREFRNEEVK
GUT_GENOME008483_001558-79RRGDATVSLAGKDKGLPLAVLYVEDGGVFLANGREHPLARPKRKNPRHLAAPVAVLDEGSMATNRELRRALA
GUT_GENOME025296_001771-69MHRGTVVRSLAGHDNGRLYIVLDFRDGYAFVADGRHHTLEKPKKKKQKHLKDTGDYAELSKYEPLYDAH
GUT_GENOME215504_002968-79LGGIALSLTGRDKGRCFIIEEIIDDNYVYITDGMLRKVAHPKKKKIKHLELKPLVFESIAEKFKQGTKVFDK
GUT_GENOME100949_0271910-83FRPGQMALSNAGHDRGKLYVILDIQGEILFLTDGSRKTVAQPKKKNVCHVRRMNYIDEGIHAKLENGYPLSNED
GUT_GENOME111314_003211-62MIYEVIDDKYVTIVDGQLRRLYRPKKKKLMHLSLMPNVLENIGEKIKEGKKIFDAEVRSALR
GUT_GENOME209032_030824-75IKKGSICRSLAGHDKGTYYIIVETGNVIKVSDGKYRPLSNPKVKNLKHLEIVDYEDKEIIRMIENNRLQNEN
GUT_GENOME254154_001995-65RGYIVRSTAGRDKKRVFLIVGEKDGRVLVADGTLHTLSRPKEKNLRHLAVLAAGGDGAISV
GUT_GENOME057222_001734-83MKISVGSIVLSKAGRDKGRYFVVTEVVDENYVRISDGDLRKAEKAKLKKVKHLKFSGDNLPEFANMQNKDKGVVNAELRS
GUT_GENOME117761_022122-76IGKIAVSMAGHDKGSAYVIVKEDHEYVYLCDGNLKLLGKPKKKNRKHIAVNKTFDTGETGEKLQRGEKVYDHEIK
GUT_GENOME269177_0028013-71LHSGSFVLSVAGRDAGRVHVVCSCEGDEYVYIADGGCRTVGKPKKKKRKHLRLLPVPPY
GUT_GENOME096399_014964-72IREGSVVKSIAGRDKERYYVVVSPANGYVFIADGKVRKLEHPKKKNIRHVWPTEGVIPLGPETTNRQIR
GUT_GENOME024303_007795-78PGRVVESTAGRDKDRYFVVLSVLDNAYCILSDGKARKVDMPKKKKIKHLRVTEFFLETIAEKLAAEQTVTNSML
GUT_GENOME175197_009574-75IRGSVVRARAGRDKDGFFVVLERKGSFAFICDGRRHSLLHPKKKNLLHLTLTRNVLDEGSMETDRAIRKALG
GUT_GENOME229869_012306-79EIPVCTVVKSCAGRDKEGLYVVVGKLDYPYVWIADGRKYKLDKPKKKNCRHLQIVGTSVSRGNAGPLRISNEWI
GUT_GENOME130648_012694-75WKQGMLVRAERGREKGRLLCVVGQAENRVLLCDGKERPLSRPKAKNPKHLTVIGHVLSERQMQSNRALKQAL
GUT_GENOME001900_010326-71IPVGMTVLSLAGKDKGRVYAVIGTAQGPYVYVADGRKYPKSDCKKKNCRHLRPLGPSANPDACRDD
GUT_GENOME130638_017172-76IGMFAISKAGHDKDQMYLIVKEEGDFFYLADGRLKGIEKPKKKRKKHLQIVKTGIDEALAEKLKKGQRIYNEEIK
GUT_GENOME285934_013823-84INPADLVCSTAGRDKEKYFMVLSVIDSQYVLLCDGKMRPVDKPKKKKIKHVKLLDYSLDRIKEKFAECKKINNADVRKALKS
GUT_GENOME111606_0051219-102SKGQIVFSKCGRDKGRAFIVYDFNEEYVFIVDGDLRKLEKPKKKKKIHIQISKAVNEEIKNKLENELYILNSDIRKALAKYNNK
GUT_GENOME218104_006383-76ISKSDIVQSIAGHDKGSLFYVIETDGVYLLLANGRERTVEQPKRKKQKHVCKVPRPDSTLTGKIRNGERVLNSE
GUT_GENOME055427_014255-69ERGRVVRSLKGRNKGKSAVVLSVEGNAAFIADGKEYKLAKPKLKNIKHLEPCNVILNESDFAFDS
GUT_GENOME066397_014462-79ELVMGTVVRSKAGRDKGDLFVILRLEGEYAYIANGELRKVDQPKKKKLKHIQRTNYVSGFIADKLAAVGKVTNSEVRK
GUT_GENOME078809_023413-71YHKGQVVLSKAGRDQGLYFAVLGIEGDMALIANGKERRVAKPKAKKQKHLAATNKTLSEEALATDKLLK
GUT_GENOME190945_0196211-76YPKGQLVRSRQGIDQNKLYIVTASDDQFLYVANGVKWTVSNPKRKNPLHAQKINIRMDAEITDADI
GUT_GENOME172932_037246-79VGEIVISKAGHDKGEHFVIVKSDSEYVYLVDGVFRTVDRPKKKNKKHVQLVHFKDSNVIKKYVNNEKITNEEIK
GUT_GENOME239296_001173-70FKKGMVVLSRAGRDSGYPLVITEIKDGFVYVCDGKERPLENPKKKNPKHLAKTNQTIPLDEITNRKLR
GUT_GENOME235154_000803-83FSQGEAVCSTAGRDANKCFLVVEVVDEQFVLIADGKLRKIEKPKKKKIKHLVSLEKISQETAQKLTDRQLVTNGEIRKVLA
GUT_GENOME049363_015643-74ELTAGMFARSLAGHDKGKLYIVTAVDGDVVYLADGKIRLTTNPKKKKRKHIQPDFTVSDTILEREMHGLPLR
GUT_GENOME239069_007231-69MYSKAGHDRGGMFLVLRTDGEYAFLADGKVRKLDKPKKKKQKHLQKTNLRIALDESVQTDAGIRKALAQ
GUT_GENOME257504_006249-65LSVGNIVMSTKGHDKGRLYAVVAVVSENFVLLADGALRKVDNPKLKRIKHIKLLRCA
GUT_GENOME214510_0054326-94GRLVRSKAGRDKGRYYLVLARAADILYLADGRKRGIANPKKKNIRHVQLVHKVAADLVSKKNGELPSDL
GUT_GENOME160083_011886-77GYFATSLAGHDKGLTYVITGVDNEYVYVTDGRLRQVDEPKKKKLKHIQLIKICDDTIKDKISRNIKLTNEDI
GUT_GENOME018599_000716-70DRGMVVRTCAGRDKGYFQVVLSVEDGCVWLCDGKSHRLERPKKKNLIHIRRTRTFLEEDCLQSNM
GUT_GENOME010791_002713-70LKIGTVVQSLRGRDEGFFMVVAFDGMFAYLANGRSRLLEKPKKKNLRHLRVTGTVLDPERVQTNKQLK
GUT_GENOME096512_0302916-96LIGRLVISIAGRDKGSCYVVLESLSDNLLLLVNGSSKTILLPKKKKLKHLTLTNVIDGEVRDSILSQSRNADLIIKRFIKL
GUT_GENOME252797_0079912-84FRTGDIVIVRRGKYAGKPFAVMGASDDRILIANGAEYTAARPKKKNVIHLQQTHYNLEDVAGRVAGGKPLDNG
GUT_GENOME111731_003053-78FRKAQIVRACRGRDSGRLFCVLEEAGGWLLLADGKRRRVDHPKRKRAKHVLLAGEWEHPAMEKVRTGLPVRDRELR
GUT_GENOME249375_010373-66LTEGSVVKSTAGHDRGRFGVVIAFRDGWPLVADGKERKLEHPKRKNPKHLIGTREKINLREIHG
GUT_GENOME139959_002419-80IGYFASSLAGHDKATVYIIIKEDGDNVYLADGRLKTVDNLKKKKKRHIQIIKKQDAVIKSKITNNQTVTNED
GUT_GENOME114383_001974-80KISLGSVVKSKQGRDKGKYFMVYSYDGCQYVNLVDGVYRKQQAPKRKKIKHIDLTGIILEGLAEKLQGGVHVFDAEI
GUT_GENOME159078_001422-71VERGCVVISLSGRDKGRLMAVLRTGPDGLWLADGRRRRVEAPKRKNPRHAALAGCTVSEAAMATNRELRR
GUT_GENOME176398_004641-77MQEFVVGDLVVSTAGHDMGQFYVIVDIIDKNYVALADGKNKPITSPKKKKIKHILSLDCRDVDFEQKISKKDVDSNA
GUT_GENOME246348_007714-83IRTADIVLSLTGRDRGQLMLVVAEEGDFLLLANGRARRAENPKRKRRKHVSLQGPCDERTRLKLQSESRLTNSEIRKALA
GUT_GENOME013494_001552-65RIVRSLAGRDKGRLFLVVGVTDKGELLLADGKMRKVETPKKKKEKHTEPLEGRTLLEDCPAEGM
GUT_GENOME158620_001841-75MDRLTPGQIVLSLAGHDSGKLYVVLRREGACFLLADGRNRKLQNPKKKNGKHLERRSDSPLAEAIGGGVVTDKMI
GUT_GENOME096513_0006410-86KVGQLVRVLKGKDAGSVAVVVAIVDTKFVLIADGHKRKFDQPKRKNVQHLELTPIISSEVVHSLQESGRVTNAKLRY
GUT_GENOME099135_006974-54IPGWQARSLAGNDQGKVYVITEDRGEYVYLQKEGGKTFRKNKKHIQVIKKY
GUT_GENOME234846_000581-79MKLEKGTVVCSLAGHDKGDFQVVIEFDDKYAKVCDGKYRPLERLKKKKLIHIKMTNTILSEENLKTNKSVRKSLRPFIE
GUT_GENOME283126_012851-69MIGRIVCSKSGRDKGYFMAVIKADNNYLYVCDGKERPLERPKRKNIKHVALTNTYLDENSYSSNKSLRR
GUT_GENOME240252_008305-88DLSLGQVVYSKAGRDKGRYMVIIEKIDAQYVKVADGGTRKVEKAKKKKLKHLSKTNHILSTVHKKLEAGQQVRNEEILKQLNQL
GUT_GENOME041767_012517-89IGRVVVSLAGRDAGRSFMAVAAIDAEHLAVADGKLRKIERPKKKKRKHIRIENAFDERIKEALLRGERVLDADIRKSLRKLGY
GUT_GENOME100707_001164-82IKAGDVVYCGCGRDEGYFAVMKAEGKFLFLADGKRRKTENPKKKNIKHAAYAGVLPGDIADRIRTGGKVGNERLKKILK
GUT_GENOME186905_021254-74ELALSLAGHDKGHYYVILREEGDFVYVADGILKLCEQPKKKNCRHIQRIKRLPAEVTELLLNQEDRNDLNV
GUT_GENOME114281_010145-79VTVGTVVYSRAGRDEGNYFMVSEIIDDGFVAIVDGDIRKLNAPKKKKIKHLLVTSDVLEDVAEKMRIGDKVCDAE
GUT_GENOME237530_024378-81MMAVSRAGHDKGKIYVVLECGENTCSLADGETKLLGRPKRKNQKHVQLIKHIPPELLADMAEIKDDADIRRILK
GUT_GENOME031521_012444-83EYQIGQVVYSKSGHDQGDMQMICAIEGEYLLLADGRRRKLEKPKRKKKKHVQPTFYVEKDVAAKLQTGEYLLDADIKKAW
GUT_GENOME097828_0017916-85SIGRLVVSNNGRDKGNFYIVANIKEEYLYLVNGSTKTIEKPKKKKKKHVILTNVIDVNIKNSIESQDKNA
GUT_GENOME285123_003043-75VQRGQVVRSLAGHDKGGFLAVVQVAPPFALVCDGKRRPLERPKRKKLFHLAPTATVLPEEALRTNRQIRSALR
GUT_GENOME215875_012385-84SGWIVRSKAGHDKGTLLCVVGEAEDFLLLADGKTRKAAAPKRKKRSHVEVADAGTFGHPALEKLRGGQPVFDSELRRALA
GUT_GENOME000994_010702-69IDMRTGMLVKSKAGHDKDQIYLIWKEEGSYIYVVDGVHRMIDKPKKKRKIHVQPICRYYEISEADNVK
GUT_GENOME100455_010263-74WKTGDIVVSKAGHDKGSIYVVVGRQEDGRLFVADGKRKKIETPKVKKHKHLEKIGMIEETLAEALESKTGVF
GUT_GENOME127158_001224-67FTTGRAVLSKAGHDKNRFFAVVGTDGDYVLIADGHERKIESPKRKNAKHLQLTNIYFTAEEMQA
GUT_GENOME188699_003716-77MEVGRVVFSKAGRDAGHYFVVVAVLDKDHVAIANGCQRKVDNPKKKKIKHLVAKPEVLEEIREKIFAKKRIF
GUT_GENOME140605_0146512-84ELAIGQVVRSKAGHDKGKVMVVLEQVDAKHVLVVDGDSRPLEKPKKKRVKHLQAFHHVLPLEDARLSKTGFQN
GUT_GENOME118758_001106-68GLIVCSDAGHDKDRFGVVVRLDGCYAYIADGKHHKLSNPKRKNVRHLKPTAVTVDLSQCCTDK
GUT_GENOME112417_003748-73EGTICVSAQGRDKGTTYAVISVVSPQFVLVADGNRKKLSCPKRKNLKHLILTPRTAAEFGADISNG
GUT_GENOME112775_002194-74ISVGDTVICLAGRDKGQAFVVLEVCEKFCFIANGKTRKTDSPKKKSLKHLIKVNSEMSQDITEKISNGLAI
GUT_GENOME142396_006454-78YKAGMMARSLAGHDAGKIYVIIGSDSAYVYLADGRLRTVDHPKKKKKKHVKPVCVRCGEAALDDVAVKRILRNYL
GUT_GENOME060734_003407-77IKVGQVVISKAGRDKEGFFVVLEVIDDRYLLLADGKRRTLDNPKRKKAMHLQKTHSLVDLKPEDRPLNDSY
GUT_GENOME044648_004975-88TGQIVQATAGRDKGGVFCVVGMDHRQERLLLADGKRRKVSRPKAKKLGHIRQLAGARQEYDHPAIQKLRQGEPVSDRELRRALA
GUT_GENOME113180_009311-71MTGKLARSLAGHDKGTCYVITGADEKYVYLADGRLKSVASPKKKSLKHIQIINTTVDEALLVKLGEQTKEA
GUT_GENOME006163_0027811-87LKKGDFVISLCGHDKDSIFVVLDTEQNYVIVCDGKNRKSDKRKRKKFSHVRFLNLHTDVIDKVPSYAVDANVRREIK
GUT_GENOME111857_002461-69MVGEIVFSKAGRDRNKAMVITAVTENYLLVCDGKERRLERPKRKNPKHLQFTGLSLEPHRFETNRALRK
GUT_GENOME199380_024674-79EPGALAFSRAGHDKDGVFIIWKECGEYVYLVNGSDRSVGKPKRKNKKHVQITHMRDDSISERLKQGGTVTDEEVRH
GUT_GENOME145988_008435-81DFQTGRVVISRKGKDKGTFYAVIGMDRLKNRVYVADGWRHSVSRPKPKNPKHLQMTNWSLPRLEERLRDSSKNNDEW
GUT_GENOME047433_006528-93IQISDVVMSTAGRDQGEWFYVIDADPVYLFLANGKDRTIEKPKRKKRKHTKKVLRSETRVAGKILSGDKVLNSELRRELAVLGREF
GUT_GENOME127701_010197-81CIGRVCRSKAGRDKGKYFLVKEIVDEQFVLIVDGAAHKIASPKRKRIKHLECRGDVAEAIAAKLKEGAKVFDAEI
GUT_GENOME000050_004165-68EGMIVKSKSGHDKDRYYLVVSCDDSVAYISDGKRRKLDKPKAKNYKHLQPTTHLVDVTLYTTDK
GUT_GENOME217902_006606-79LSTGQLVVSKKGRDKGKLFVVVCIVDDQFVLIADGKYHKVKKPKRKNVKHLIVLQQVDQSIVERFKANLALTDE
GUT_GENOME014633_007945-77QGDVVIATAGKEKNQIFVVLDADTHYCYLVDGKRLKLTKPKKKSLKHVQKASKIGFDAQKLKSGQEKVNAEIR
GUT_GENOME274744_000745-79LAEFVYALSGRDKGKCFVVLSQKDNYLYLCDGKRRKAQKPKCKKIKHVRLTGDYDENLRKILADVGRLTNKEVRF
GUT_GENOME193232_014193-69NIGDIVVSTAGHDRGEYYLVIECDKDFIYVANGRLKTLDKPKKKNIKHVSRLGKSDEFIDIRKSDNN
GUT_GENOME009057_023954-74ELAKSLSGHDKDEIYLILKQEERFAWLVNGTTHTLLKPKKKNTKHFQIIKQIPTEVVSQIEENAWNDDTIR
GUT_GENOME233572_001412-70TGELATSKAGHDKHRLYLVVKEDDESVYLCDGRLRGVTNPKKKKKKHIQIIHYSLPTELLQRAANATCD
GUT_GENOME112096_001181-62MRTGSAVVSKAGGDKGRTYVIVGFDEKGYALVCDGRRHPLSAPKKKNVKHLSDTGLTLSGFR
GUT_GENOME070693_001592-68KAGQIVRSVAGRDKNIFFVVTAVDGAYAVICDGKSRPVEHKKRKKLIHLSKTNHFVELEKVKTNKEF
GUT_GENOME096449_000406-73LIGKIAYSKAGRDKDKWYIIIDRIDDNYVLVADGKKKTISKPKRKRLKHLNLTCEVAEEIKNSLLSDE
GUT_GENOME242999_012683-69FEKGQLVKSKAGRDKGEYFLIYDIVDDKNVLIVDGKIRRLEKPKLKKKIHLSKVNKKSNILDTIDKN
GUT_GENOME252665_005065-80KADIIVSMAGRDKGGLFYVLRVEDGYAYLVNGKQRTMENPKRKKLKHLRFAARIDSNVANKILRGDKVASSELRRD
GUT_GENOME057520_009598-82SLGQKVESITGRDANKIYLVVAIKGKEVFLANGRERKLANPKKKNIRHVKVYKWIAEAVADKFASNKKVTDEDIR
GUT_GENOME096518_008368-79VEGQVVRSLSGRDKGKFQIVIKLLDNNFVQVVDGKRRKLERPKLKKTKHLQKTNSVFDMTYVTNDSHVRKLL
GUT_GENOME243851_005187-80IKEGTVTKALAGRDKGKGFIILKILSEDYVLIVDGKRRTLENPKKKKIKHLALYKDVIELNVEYLNDSYIRKML
GUT_GENOME235153_013553-69LKIGQVVISAAGHDKGDLLVIAGFEKMQVLVCDGKHRRLEKPKCKNPKHLEATDRYLDKDSMATNKM
GUT_GENOME110727_016733-66YGEGELVRSVAGHDKGNYYIILAEDGEYVILVDGISRTMEKPRRKNKKHVQLIHRKGGVPATDE
GUT_GENOME147168_002644-46MLATSKAGHDKNQTYIIIEEMNDFYLIANGTTKTVAKPKKRKD
GUT_GENOME270034_016133-75EIIKGTFCESVAGHDAGSIYVIIGKSDALYVCDGKLKTIESPKRKNRRHVRILNYRDEMLQNKIEQNRVNNED
GUT_GENOME228687_005983-71ITAGMIIISTAGHDKGQLLLVTGADGRFVYLADGKERRLSAPKKKNLKHVRPTAEFLDPAAMTDKRLRT
GUT_GENOME208123_0113013-85GYLAYSLAGHDKGNLYVILSVEGDIAYLADGRLKLIDRPKKKKLRHLQVVKKDFRQVLTESNETAGSITNENV
GUT_GENOME231391_023572-79LTYDVGHVAISLAGHDKESRYLIIGINEDILYLADGDKRPLAKIKKKNKKHIQIAYVESEKQIKIKQAIALGTIKDEE
GUT_GENOME188395_004317-89FQYGMLAYSKAGHDKGNLYVIIKTDYEYVYLVDGIRKKINNPKKKKIKHIQIINDIPEIIKECMCSGKKVTDEDIKRSIKIFS
GUT_GENOME246481_003806-77VEVGRVVYSKCGRDEGRKFLVTGVIDEDFVYIADGDTRRMTKQKKKRRKHLKATQTLDQKLRDRIVNGDIPL
GUT_GENOME027327_004256-74VPGQIVRSKAGHDMFKHFVIVEVVDNDFVLISDGKTRKLETPKKKKIKHLAILNDNAFKDESFSLTNKN
GUT_GENOME024820_007893-82IDAGSLVYAIAGRDKDGLFMVLKRDGAYVYIADGKSRKSESPKKKKIKHTRLVGTASEDIKNLLLEGESLTNAVIRRSLA
GUT_GENOME139860_016242-69VGMLATSRAGHDKDTTYVIIGEEKEYVYLADGRLKTVGQPKKKNKRHIQIIKKVQLQKNEDGWNDLEI
GUT_GENOME000700_028127-82EAGTLVKSVAGHDKGSYFFILKEEGEYIYLVDGKYKRLDCPKKKKKKHVEPLLWEKHFPIDMIRENKKVTDEEIKH
GUT_GENOME060949_009974-81LKVGQIVRPLAGRDKGQVMVVYEILDQNYVTICDGKLRKVSNPKKKKVRHLAKTNQIVTILHEKLKNGDKVNNAEIRK
GUT_GENOME215700_012722-63QTGYVVKSLKGHDANRIYLVLAVLSADFILLADGQYRKIENPKQKRIKHVKILSEQRHDFEC
GUT_GENOME183752_021001-67MKLKPGCLCRSLAGNDRDRYYIIIDDQGDYAAVADGEKMTAASPGRKNKKHIQAEKTPVLSGSSLTD
GUT_GENOME044229_000991-81METGNLVLSLAGKDKGRLLTIVGSAKEGYVLVADGKLRRLCKPKLKKLKHLAPWGDGAGWSPEEMPQTDRSLRKQLAQMRR
GUT_GENOME104784_0069417-88GSVAISLAGRDKGVRYAVTGITEEGYALVADGKLHKAAYPKRKKLKHLQFDGVRIENIDDILSSPGGTADAA
GUT_GENOME243775_005254-83FKTGMLAVSKAGHDKGRLYVVINADQEYVYLADGKNRSVCQPKKKKQKHIQINYHIPMVLAKALETEQELKDEQIKKAIK
GUT_GENOME040305_003011-65MQVGDLVIATAGKEKNEIFLVVKVDQQFVYIADGKRVSVTKPKRKNEKHIQKVSKQRFLLGDDIT
GUT_GENOME253848_012813-62MGAVVRSRSGRDRYRLYVVIGVTEDGRVLIADGRLHPLQRPKRKNLRHLTVLAAGPAAGF
GUT_GENOME192988_012204-73VQAGMAAISMAGHDRGHLYLILKLEGDYAYLADGRLRTAERPKKKKLMHMQVIKSVPEGFRDSIDREYKN
GUT_GENOME083297_005642-82EYTKGQVVYSISGRDKTMPFIVLAEEGDYLYLADGKLRTLEHPKKKKKKHVQKTKKVCYNIKDMLDSEAYLLNADISKALK
GUT_GENOME130504_0001112-76GDIVKSTAGRGSGRIYLVVKSLENGYVHISDGKTRKINNAKLKKNKHLVKLGNVDDLGFLNAPGG
GUT_GENOME095922_008278-73LEVGDVAVSRAGHDAGRYYLVVAETGEDFVLVCDGRYRKTDDPKVKRRKHLVFVAKAGADIGADDK
GUT_GENOME038094_0135614-84EFAKSMSGHDRNQYYLIIKKDDEFVYLVNGTTKPLTKPKKKNRKHIQIIKHISDEITGLFDGEPTDITIRK
GUT_GENOME071741_000774-76CKGFVVLSLSGRDKNNTFVAVATDDGYAYCADGRIHPVASPKKKKLKHIRILGKSSVTDISAVTNKQLRAALR
GUT_GENOME103749_027094-81FEAGMLAWSRAGHDSGTLYVIMRTDEEYVYLTDGRLKPVDKPKKKKKKHVQVMRTIPQELADMPEKEIKNEDIRRVIR
GUT_GENOME044786_015325-84MTGCFARSLAGHDKTEIYIIVGEEPGYVYLSDGKLKKVENPKKKKLKHIQPVKRADALIAEKLETQKELHNEDIKRAIKE
GUT_GENOME142594_003601-69MTAGQIVCSKAGRDKGRFFVILRTEEDFAWIADGDLRPLAHPKRKRQKHLAPTSQRANEEDVQTDKRLR
GUT_GENOME262257_002573-82YTVGQVVYSKSGHDKGTAFIVTEIDGEYLYLTDGKCRKMDKPKKKKIKHVQITNYCDADLKTKIENGEYLLDADFVKALK
GUT_GENOME021775_009088-78GRVVVATAGRDKGKPLVIVGTDGTEYVLLADGRRRKAQSPKRKKRRHIRATAYRIDPALFGDNAADAHLRK
GUT_GENOME020663_000965-82PKAGDVAVSRAGHDKGLVMVILSVPEEGVALLTDGKTRPLEKPKKKKWMHLTLYPTRIDVSMKALVQGETVQNADIRR
GUT_GENOME000582_002114-76ETGGFAISKAGHDKGKLYAVIGYREGKYLLADGRIRTLEKAKPKKEKHLQFIHAQDERLAELIRGGKPVRNED
GUT_GENOME057641_027253-83DFTKGMLARSKAGHDTGKWYVVMDVDQEYVYLADGEARTLDRLKKKKRKHVQICYKIPESLDKILKDGTQIRDEHIKRAMK
GUT_GENOME238266_004277-71SVGDVVKSVAGRDKDKIFIIIRTQEKSVFLVDGRTRKINNPKKKSAKHLEMISVAKGFDLAIKIQ
GUT_GENOME113641_001526-78KRADIVMSTAGRDKGRLMFVLSVLEGGTLLLADGKCRKVENPKTKKEKHAELFERSENPVSRKILSGEVPLNK
GUT_GENOME104483_007566-89LTVGSVAMSAAGRDSGRAFVVLGEADNDYVLIADGKLRKLEKPKKKKRRHLKDTGVRADDIINALNEGRLLDADVRKRLAQLGF
GUT_GENOME003671_013032-88VDIEAGSIVKALRGRDENRLFVVLDADGEYAHITDGHSRPLEKPKHKKLKHLKYLGNCNESKVYEKIVNGKHIENAEIRKVINLFSS
GUT_GENOME149704_019473-77EYGIGMMARSLAGHDRGKMYMILSQDEQYVYLSDGVYRTVGKLKKKKKKHIQIDCRIPEEIQKLLDEGRPIQNSD
GUT_GENOME191892_006738-80CFAKSLAGHDKETVYIIKKMDSEYAYLTDGKNKLIDSPKKKKIKHIQVIKKKDSNLESKHNNHISIQNEDIKK
GUT_GENOME096480_0095010-85IGEIVRICRGRDAGNYAIIVGFEDDRYLLLADGNKRKFDTPKKKNISHVEFQHAIAPDVVKAMHENNRVSNGKIRF
GUT_GENOME118378_008844-73VRGLVVRALAGRDKGGFFTVLEASDDMAVICNGKRRSLEHPKRKKQKHLLITKTILEEGSLQTNREIRHA
GUT_GENOME180136_002413-83MFPIEVGSIVRSKAGRDEGRLFVVIEELDENFVRVADGKLRGMERLKKKRRKHLKPTGSTVSDLKNRLADGPAVDNHEVRS
GUT_GENOME251190_0229432-103LLAVSRAGHDKDTLYVVLAEENEYFWLADGKRRGLESPKKKKKIHVQLIRHLPQEVAAQLQSIELDAHLRKA
GUT_GENOME000203_032497-82RVGQVVEGLSGREKNKIFFIVKVIDNKYVLISDGNKRKLDRPKLKKVKHLKLYNLYNNRVEKKIASEEITDAFLRA
GUT_GENOME259941_000686-73IEVGSVVKSLAGKDKDRFFCVVGFTGDEKAYCLLRDGKLFKVEKPKKKNVKHINATGFKLQSEALQVN
GUT_GENOME286468_01013254-325GWTLCRSSRTYAVLDVEGEILALADGSHRTVANPKRKKSRHVAPTATVLGNELLKSDQPISDAIKAYDAEHP
GUT_GENOME110869_007933-59MKPGFLARSLAGNDKGAVYEIQEDLGTHVLLLGPAGRPFKKNKKHVQLIKKTKESYT
GUT_GENOME129041_000373-65WQQGSIVLATAGRDAGMYFVVVAADEHFVWLANGKRRTLANPKRKNKKHVQQTATVIALTSWT
GUT_GENOME214656_008205-65RIVRSKAGRDKDRLFIVLREDTDGYLEIADGMLRTLDKPKRKKLRHLETVLESYDCGESAL
GUT_GENOME252152_007447-83IQTGDVVKSAAGHDKDKIFAVIGTENGFALLCNGKDRKAQKPKRKKLKHIKKTGIRLAWISETPEKVNNTSVRKALT
GUT_GENOME091227_014956-86ILTGSVVISTAGRDEGRTFLVVDSLDELYVLISDGDTHKMEKPKKKKRRHLKLVTEPDNDIVSRLSSGQPVFDHEVRRWLS
GUT_GENOME160447_0049510-81GSIVTSKAGRDKGRTFVVTATEGEEFVMLADGETRRVSRPKKKKLKHLSFEEGRLEVAKLPEDAMLADAAIR
GUT_GENOME247085_008619-74EKGSVVLATAGRDAGRLFVLLESSDGEYLLADGRTRRLSHPKRKNPRHVQMTALRLTPQQYAADGR
GUT_GENOME259251_005426-78LEVGRIAISRAGRDAGRKMLVVQELDADFVCVADGRHRQMARPKKKRRSHLKPTAHVDMALRERLLRHEAVAD
GUT_GENOME095978_010052-86DYNVGDIVISKAGRDKGSFLVVIGFLDDDKLLLADGRLRKTENPKKKKIKHITKINAKSTLICDKIKCNEKIPNALIRKEIERIK
GUT_GENOME255416_005225-76GQVVFSKKGRDKSLPFIVVDFDETYVYLADGKVRLLAKPKKKKIIHVQMCCDIIEEIKFKLENNLYLEDADL
GUT_GENOME066321_006743-75WSRGQFVVSKQGRDNGTCYVVLKIEDQFCYVADGRKTSYLKPKKKNVKHLQATHWISPEIEDTLRQGNAPSDM
GUT_GENOME158618_002395-81EKGGIVRSISGRDAGRLYFVLQTQSDRVLIADGQLRRLEKPKQKNVKHLEFVSDGRSTRVYAKIMEGARLENAELRK
GUT_GENOME114914_0107160-136ILGQIVISNAGHDRDSHLVVIGHEGENFVFVADGRLRTVEKPKRKKLRHVLRTDIYSPKISKKILQGMPVTNAELRK
GUT_GENOME096230_041973-78YGIGALAKSLAGHDKDNLFIIIEESQEYVSLMDGKFRTRQNPKCKNKKHVQVIHDNDEPQRRKLIEENGLTDEYVK
GUT_GENOME237062_014314-73KVQTGDVVVSTDGRDKGIFYLVVRTQENIAYVTDGRKRKVNNLKKKNFKHVKTVVEQAEIGLAMKIQNGE
GUT_GENOME170511_007003-71LKTNSVVISLAGRDKGKFLAVMRIDGGHVFVCDGKERPLLKPKRKNVKHIRATQYVLSSENTVSDPALR
GUT_GENOME258967_0163522-92IGTVVRSRTGRTRGKLFAVVGFVMKGTKEFVLVCDGSNRPLSRPKLKSADHLETVCGGNSKTDVTSDKELR
GUT_GENOME245657_005093-72IKPGMVVRSLSGHDQASFYLVVKTDQNFVYIADGRRRKLFKPKRKNPRHLQKTNRIVSLEEADTDRKIRR
GUT_GENOME258569_0044423-100DYSVGSVVRAVSGRGAGNLYIVVGMDGDKYAWLCDGKYRRIKNPKRKNIKHVEMERPADGEMTSMIENRIDVLDSEIR
GUT_GENOME217893_002276-85TEPGRVVISTQGHDAGRWHVIVSVLDARYVLICDGEARRLAQPKKKQVKHLSALPLTIPVEGKGASGGPIADSDIRAALR
GUT_GENOME218105_010655-73RGQVVESTAGRDKARYFVIVGACDEAHVYIVDGVTHKACRPKKKKLKHLRFCQKCLDMDKKLEGGAGTL
GUT_GENOME260725_014743-83VLPGEFAYSAAGRDKDNLFIILCVKDGYAYLADGKVRKVGTPKKKKLKHLKLTGTNDRFISEKLSLSEGLTNKEIKYSISN
GUT_GENOME001177_034015-79EPGYLAKSKAGHDKGEIFLIIREEGEYVYLMDGKSRTMEKPKKKKKKHLQPISYRNETIAAKLMADETVMNESVR
GUT_GENOME015025_006691-87MNPTELYVVRSRCGRDRRRLYLTTGAADAPYPAVMIVNGALHPVDRPKVKNIAHLVWIAPLSEDERAALAMAYTDKTVAEILAKYEH
GUT_GENOME157273_022765-77VGMFAISKAGHDQGKWYVVLNEDENYVYLTDGNLRTTEHPKKKKKKHIQPVTFGINQELAERIQKMQPVNNEE
GUT_GENOME074528_004038-72RGRLARSKAGRDKGRLCVVTEVLDENFVLTRDGRLRTLDRPKKKRKKHLAPLSARNEDIAAGRTV
GUT_GENOME015268_006424-71KLGMVVRSIAGHDQGNFLVITAIEDDFAYIADGKERKLDSPKKKRLKHLRFTNTVIDTDNLTDKGLRR
GUT_GENOME235884_018316-71QTVISRSGRDSGSFLSVVATDERYVYVCDGKFRPLDNPKRKNPAHLTPLKTVLGDDAFRSDRALRK
GUT_GENOME158694_005353-55LKKGWMAKSLAGNDRGRLYTVEQDAGTYVFLTDDTGKQIRKNKKHIQAVKRMK
GUT_GENOME255913_022787-75LARSKAGHDKNHVYLIWAQDAEYLYLVNGTTKRLSAPKKKKRMHVQLIKKIPPAVREFLEEQNMLGDET
GUT_GENOME025821_013772-78VGMLAAAKAGHDKDTLYIIIEETDEYVWLTDGKYRPVAKPKKKNKKHIQIWKKENEALKELRQKLEAKESVRDEEIK
GUT_GENOME074913_0041224-91GKLVISNSGRDKNQMYVIVDNIDDNYYLLSNGKTKTIQMPKKKKIKHFDVLEDAEDHIKTALAAKDKG
GUT_GENOME114245_005594-79HIAEVVCTLAGRDKGKYFLVIAQEDNFLYLCDGKGRKVGSPKKKKIKHVSFTDVKNDFVSSKLAQGGKLTNKEVRY
GUT_GENOME283701_006254-66IEVGRICMKISGREAGEKCVIVEIIDDKFVEIVGTNVKNRRCNIKHLEPLDQVIEIKSENPDE
GUT_GENOME170155_010295-72EPGTLVRSLAGHDKGKLYIIIEERYPMLLLADGALRTMEKPKSKKVMHIQLIHEKLETEELTDDSIRL
GUT_GENOME159364_003016-84PSVLVKSVAGRDKGGLFVVLSILDEQYVFISDGKKRRVEQPKKKKKKHLRCLGCNSEWLAGKLAGGGRGANAEVRNALA
GUT_GENOME140009_001673-75LVKGSVAVSLMGRDKGVLMCVVETGENRVWVCDGKCRKLNKPKAKNPKHIKPLNKKLSEEQTASDKSIRKALK
GUT_GENOME257430_004384-82TKGEIVLSLKGRDAGGLFCVVDAQDGFALIADGKGRKLAAPKRKREKHLRRVGTSAHPAIVRLQRGESVSDRALRRSLA
GUT_GENOME029380_007603-73IKVGSVVYSKSGRDAGSFLAVVGFQEERVLLCDGKQRPLARPKAKNLRHISAAKTVLTEQEMATDCSLRKA
GUT_GENOME062358_010707-84IVVGQLVKSKSGRDMMRLFLVYEVVDNEHVLLVDGKVRKLEKPKLKKIKHLIVYHTIVEDFQKKVEENSKLTNSQIRK
GUT_GENOME230896_015853-68IKPGILVRSKAGRDKDHVYAVVDLDEKYVYVADGVEKTLRHMKRKNSRHLQPILKISLRGAPDDAV
GUT_GENOME261029_001415-86DRGSIVLSLSGHDKANFFFVVGFEEENYLLLADGKFRPIDHPKKKKKKHCKLIKKIGNERIADKLASGERVSDADLRKALKL
GUT_GENOME005147_000687-78LETGQVVLSTCGRDVGKLQMVLEILDDSYVLVVDGKQRKIEKPKKKKIKHLQKYNKKLQLENLTDKKLRYML
GUT_GENOME273439_017945-77FAKSKAGHDKDQLYFILKVEEEFVYLVNGTTRSLDKPKKKRKKHVQVIHRIPAELCSILEAGITNESVKRSMK
GUT_GENOME253371_005645-70DIVLVISGKEKGKYYLVLDADTRYVYIANGSNRTISNPKKKNIKHVQKVGKTTNPIGDKVLHDCDI
GUT_GENOME209633_00515258-324GKLAVSLAGHDKGSIFLVIREDGDVIWLADGISRLYQSPKRKKRKHVQLVLNGGMDSSELEDLFQNP
GUT_GENOME233266_017553-72FELGTVVISSAGHDSGHWFVVTGADGGFVTVADGKERKLCAPKKKNIKHVRVTATSLELEGLTDKKLRKA
GUT_GENOME094066_004593-82FRPGEIVRSVAGHDQEGIFYILRAEGDFLWLADGKRRTVKASKKKRRKHVVSMGLWTHPVTGRIQNGEPVLDSEIRRALA
GUT_GENOME235296_011527-77CRGAVVRSLCGRDRYRLFMILQRCDGGSQCVWIADGALHPLSKPKKKNLRHIAVLAAADGDKTSLPANDGE
GUT_GENOME139741_0058331-103GAIVQSKAGRDRFRRFLVTDILESADPTPRAAVVNGTTRSTDHPKIKSLRHLIPVGMSDEAKSLIEKGALTDD
GUT_GENOME257096_015882-80DVCIGSVVISKAGRDKGRNFVVVAMEGEFVYLCDGDLRKSDSPKKKKLKHLAVTNTVCDYIGDKIKENKKVTNTEIRRA
GUT_GENOME131740_002053-76EIEVGSICLARAGRDKGKYFIVSEVLEGGYVYIVDGKTRKLSKPKRKKRMHLRPAGAAPGELVQALKAGIAQDA
GUT_GENOME276442_003676-88FSIGDIVQSKAGRDSGGYYIIIRMDGIFAYICDGDLRKADKPKKKKLKHLKNTSVRSEYAASKLSQGLKVTNAELRRELEEFK
GUT_GENOME070122_005496-88IDRGCVVTSVAGRDEGREFVVLDLTKDGQYAVIADGKLRKVEKPKTKKLKHLRFYAASAGEDISGKIQNGTLPGNAEVRKALK
GUT_GENOME051307_006147-80DVALSLAGRDTGNIYLVSELIDEKYLLLIDGKSRTISKPKRKKIKHIQVIGNAASEFEGVFEDKSKTNDAKIRK
GUT_GENOME096495_035193-70LEPGMLAKSKAGRDKGCIYVIIDVNAEYIYLADGAKRTVCRRKRKNPKHLQIIKKADSPSTTDDETIR
GUT_GENOME256243_005075-67CIGLLAYSKCGRDKKRLFCVIGVLDEDFVLIADGRLRRMEKPKKKRLKHLSFTDAEPLALPLS
GUT_GENOME070065_011412-78EVTSGSIVLSKRGRDAGRYFIVVRREGEFAFLADGKVRRVKSPKKKKLKHLEPTGETHEGIGKKFASGATVFDSEIY
GUT_GENOME127408_009343-73IEKGCVVRSISGRDADRFYVVMELQDDCALICDGKVRRLERPKRKNLRHLRPTKTLLALEDLTTNNQLRRA
GUT_GENOME237871_002724-69TMARSIAGHDKTEVYVVIREEGDCVILANGKNKTVASPKRKKLKHVQLIKNIPAEAAEVSEKENDF
GUT_GENOME026900_020447-76GMLVRSLAGHEKDRLYIIIRQDEEYVWLVDGRNRTMEKPKKKKKKHIQVIHRIPEPVKKMLESGETLSDE
GUT_GENOME198368_008465-81MCGMLARSKAGHDAKTLYLVCRTEGDFVYLCDGRLRKFDSPKKKNKKHIQIIHKIPEALANWNGEALRDEKIRSILK
GUT_GENOME232390_010423-70IRVGSIVKSASGHDKGIYHAVVKLEGDFAYIADGKHRHIDNPKKKRLKHLKPTNAYTDTADITDKKLR
GUT_GENOME263665_021563-73EYQVGDIVCSCAGHDSGSYYIIMKQEKEYVYLVDGVYKKIKKPKKKKKKHILFVTRPDKRMGDQANDLEYK
GUT_GENOME150859_00202256-326DFVRGQLVRSKAGRDKTRTLAVLAVDGPTLLVADGDLRKLDNPKRKKMQHVAPTTTVLENELLKSDQQLRD
GUT_GENOME026810_003442-65AEFCSGAVVKATAGREKNEIFVILKIDNNFAYLVNGKSRPLECPKKKNFRHLQLLKSNSGLNLN
GUT_GENOME011133_005287-81FKRGDVVVSFRGHDRNKFLVVTESFGDSVIVADGRTRPLEKAKMKNVRHVRFVAHSREIAEAIENGMLSDAEIRR
GUT_GENOME096130_013413-69IQTGHIVRALAGHDSGNLYCVIATEGDFLLLADGKGRTLTQPKRKRRKHVELAGEFGPMSAIPDSDR
GUT_GENOME088580_006129-73PGCFAYSLAGHDKGETYIIIECNDEYAFLANGKNRTINNPKKKKLKHLQINKHKNELILEQMYGL
GUT_GENOME000609_011433-74LVKGRVVMSTAGRDRLTFQIVLEAAPDSALVVDGKARPLQRPKRKNIKHLRLTSTVVPEEQMNTNREIRRAL
GUT_GENOME257519_012675-78GRVLRATAGRDRDKYFIVMDEERDGYVLLTDGKSRPVSRPKRKNVRHVAETSLRADEIERLLADGTPLTNKQVK
GUT_GENOME237605_013257-77GGAAYSLAGRDKGELMIILRVEAGYAYLADGKKRRVENPKKKKLRHLNVTNFVSDAVLKAADSVENHTVRK
GUT_GENOME188419_006811-71MELSRGQIVKSLTGRDKGRYFIVLEAQGEFVFLADGKLRKLEHPKCKKLKHIQITKTIVPCAEPLTDKQLR
GUT_GENOME233278_014963-69IIEGTVVKAKAGRDDERNFVVTEVCGDGRYVLIADGKTRKLDKPKRKNIKHLAVSNTVIDLNEITDK
GUT_GENOME276070_005647-84ELGGAAESLVGRDKGRLYLIVGMRGETLLLADGKYRAAENPKPKNRKHIRLLPRFYPEIAVRIGQGKDENSEIRAALK
GUT_GENOME236237_012314-74ITKGCFCISRCGHDAGTLYYVVNVSDAAYVCDGVLKGVDNPKKKNPKHLQVLNYRDAALQEKAMSQTLRNE
GUT_GENOME103754_0364810-85IGQVVRIMQGREAGQYAVIIKLLEDRYVLLADGEKRKYDRPKKKNLHHIEVMDYISPEVQKSLLETGRVTNGKLRF
GUT_GENOME243299_006562-62IREGQIVISKAGRDINRKFVILKIIDENYVLISDGRLRKIDKPKLKKNKHLQKTNSFIDLE
GUT_GENOME282985_011676-67GLVVRAKCGRDKNRFFVVISCDGEYVYIADGKTRKIEKPKRKNMKHLFVTDTVIPDGSLNTN
GUT_GENOME167727_001496-81RGMLAVSMAGHDRGKVYVVWSETEDTVLLTDGDLRPLAKPKKKNKKHIQIIKKNQETWQLSQNQQVTDEQIKYMLK
GUT_GENOME069861_001182-79KIGDVVLAKAGKEKGQLLIIVALDDKYAYLSDGKRLKKDKPKKKSFKHIQKFTNLSLCESDLHDQNERVNAKIRKFLS