UHGP-MC 79161


Information


Number of sequences (UHGP-50):
132
Average sequence length:
218±32 aa
Average transmembrane regions:
0.02
Low complexity (%):
1.75
Coiled coils (%):
0.17
Disordered domains (%):
2.06

Pfam dominant architecture:
PF12897
Pfam % dominant architecture:
76
Pfam overlap:
0.01
Pfam overlap type:
shifted

Downloads

Seeds:
MC79161.fasta
Seeds (0.60 cdhit):
MC79161_cdhit.fasta
MSA:
MC79161_msa.fasta
HMM model:
MC79161.hmm

Sequences list (filtered 60 P.I.)

Protein Range AA
GUT_GENOME143282_0320146-218IEAIHESVTSNFTRYTKESMKQGLPSWTAPYAKPVILHHNEKDGKVIGRVLSAEYRENGEAGLPCVVIQAVIYDEDAIEQVLDGRLLTTSIGVRVNDVRCSICDHHIISEKDGCPHHARGTFYNGQLCTWDMNDMYVKEISYVVVPSDKFSKNTRILENNSVKSVTFSESANF
GUT_GENOME000050_0038215-268LQKINQYTRRNFTEKELFSFTVVLCDNEVDRDGEAFTTQSLQKLKALYIGTTGIFDHSQKSEDQAARIFHTWVETDPQHKTQYGEPYSCLKAKAYMVRTEKDKDLLLEIDAGIKKEVSVGCRVLRKTCSICGADLSYGCCEHKNGEFYQGKLCYTKLEQPVDAYEWSFVAVPAQPAAGVTKQYHQIETQPKRKARTNEEVVKSLGTGTISVEEQKNLKLYIDRLEEKALLGREYLQDLKSDIISLSFLSNSPLD
GUT_GENOME110045_0105217-300VKSDSKTSVPTKELELINKLTRRSFSEDEIYTFSVVLCDNDIDRDNDRFTDEALETLSELFIGKTGILDHDPKSQNQTARIYECHTEAVPDKKNILGKPYKRLVAKAYMPKSEKNENVILEIDSGIKKEVSVGCKVDKKTCSICGKNLNEEPCNHVKGRKYKKNGEYQLCHRVLELPTDAYEWSFVAVPAQRMAGVIKAFTPTQNGGETKMEDIIKSLNSGEEVTLSQKDASKLLDLIDNLKEKAKIGSCYKEDLKNEMLKLAAIVNPEINSQVMKSVAEKLNI
GUT_GENOME089520_0292816-203DMRKINAVSSHPLTADQVFIFGMRVCSNDVDKDFERFTTKSLHSLAKMYIGKPGIIKNEESARIYDTVVVSDIERTTRSGDIYCELIAYAYILRSEVSKQLIEDLNADMVKNVSIGCSVASTTCSICDEETCSHVKGSVYDGKLCFKNLNSPVDAYEWAFVVEPKKPDAAMALDEVIQHCYKVAERLR
GUT_GENOME245947_0250766-229ETENYKFEIRLCDTKIDRNFEKFTLPCLRKLSEMFVGKNGFVGQDSIAKILSTEVLKGKDGEWFIKANVSIKNIPENFKVIEEIKSGKRKKVNIGCSVATRTCSICRDSTGSCKHKPGEYYNGKQCFMELNNPIEVFEWTFVATPVKEETNMDKHTSESKEPFL
GUT_GENOME018580_001443-156ACDDRTDRDSERFTGAALESLAKLFVGKPVLQDHDWSASSQCGRIYAGAVGTGEDGRRQLVLRAYMIRAADTEARIRAIDAGIIREVSVGCAVASRTCSICGKDYGSCRHRRGEDYGGQRCEVVLDQATDAYEVSFVAVPAQPGARTTKGAETP
GUT_GENOME253512_0041313-209LKSHSLKETDLSVINELTLEPVGADDIYTFDVVACDNDIDRDFENFDDEALEQLAALFPGKTFIKDHRWSADNQFARAYAAEVQDTGEKTSDGRDLKCLVVNCYMLATGENEHMISEIKAGIKKEVSIGFAAKSVICSICGADNRKTLCMHYGGKDYEGQTCHFTLSDVSDLYELSFVAVPAQRRAGTRKDYLSVEW
GUT_GENOME158620_0104210-258VPEAGLPAEELEAINAYARTALTAEEVFAFPVLLCDNEVDRDFERFAEETLAELRELFVGKTGIADHDWSTEKQLARIYRTELATDPGRKNSLGLPYQYLKGWAYMLRTAENASCIAEIQGGIKRETSVGCAVKHRLCSICGKELGSPDCGHRPGQEYQGRLCFGELREATDAYEWSFVAVPAQRGAGVLKGRRGRPNCLKELVESPQGRGCAGEYRRLKQEAALGRKYMDGLRREVLRLSLLCDRQLY
GUT_GENOME104570_011166-265VQAGDLEKINAYTRRELRAEEVYVFNVLLCDNDVDRDGESFTVPALQKLAELFLGKTGIFDHNPKGENQTARIFDTEVRYDTTRQTAYGAPYAYLKAKAYMVRSPKNEDLILDIDAGIKKEVSVGCSVGRRTCSVCGADGHERTCEHQIGQWYEDTLCYVMLEEPTDAYEWSFVAVPSQRQAGVTKGFCSKRMRIAGKEERNLLNRNYEELKKALAGESGGMALSGKEQALLAGRLAELEEQAALGKAYLTSLQERVVKL
GUT_GENOME175902_0037517-249DDYIMAKINVHTRKELSKDEVYAFPVILCDNEVDRDNERFSIDSLNVLKEMFVGKTGIFDHNPKGENQTARIFDTEVKTDKSKKTSIGEDYTYLLAKAYMIKTDKTENLISEIEAGIKKEVSVSCKVDSEICSVCGADMRVKPCSHIKGRTYGKKFCHTVLEKPTDAYEWSFVAIPAQPNAGVSKEFSGSYDIQKNTDKSYDELINLINEKDNIISKLYEDLKKDVISMKYLC
GUT_GENOME103454_0121818-268ITQDELKLINSYTRREYTENELYTFSVVLCDNDIDRDGEAFSLNALTQLSKLFVGKTGIADHNPAANNQTARIYNCQVQEVPNRTTAFGGKYYQLVAKAYIPVLKSTENIIQLIESGIRKEVSVGCSADSIVCSICGENIRQNPCSHYKGNGYNDSICYHILDNVSDAYEWSFVAVPSQKNAGVIKSYGYNKEENMQDILTEIKKGNYSVINSKNAEKLAGCINQLQELADCGKEYRQELEKQYIRYSSLC
GUT_GENOME117244_0138216-267DYEKIRQFTRKEFAKDELYVFEAVLCSNDIDRDYEKFSVGTLNELAGLFNGKTGIRDHSMKSTDQAARIFETRVEKEDGRKTADGEDYYVLKAKAYMVRTNENQSLIKEIDAGIKKEVSVSCRVNERICSVCGADKAKEYCNHIAGRKYDEKMCFITLNNATDAYEFSFVAVPAQRDAGVEKAFGNTHRTDAQSIEKQLKEGETLLSAQEAQTLFDYIGSLKENARLGEDYKKSLEKEILSLCRKALPEMDI
GUT_GENOME220761_0223267-248VSAKEYYGCNNNGDAFTEEDLKNTHQLFVDNANVFLQHVNKDPAKGIGKPVFSWYNDTMHRVELILRIDKSKPDATGTVRKITNGEPLYVSMGCTVKYDVCSICGNKAPTRRDYCDHLRFNMKKILPDGRQVYALNPDPKFFDISIVAKPADPTAHTLDKRASLRGSCEPGDSFTSSAELGE
GUT_GENOME001307_014744-246INRYTRRQLSEEEVYTFPVVLCDNEVDRDYERFSCKALQGLCELFVGKTGIFDHNPKGENQQARIYRAEVVRDSDRTTSAGEAYHYLLAQAYMVRGEHNRDLILEIDAGIKKEVSVSCRMDRAVCSVCGADQREGPCAHEKGKRYPHPDGALCHHILDGPSDAYEWSFVAVPAQAGAGVRSATSKHFCQEDLDGLLRGGALEGEAAERLKGYLRQTREAARLGGEWRRRQEDELVRCGLLSLE
GUT_GENOME246575_0181513-210AELDEGELALVNAQTLRKLEAEEVFTFRLAACDNQVDRDHERFTDETLAELAERYVGRTVLLDHNWSAGNQTARVYAGAVEDGPGEGVKRLVLRCYMLRNEQTASTIAAIEGGILRECSVGLAVRRETIFFQFQLCSICGADQAATLCKHIPGREYDGQVCHMDLDGVADVYEVSLVAVPAQPGAGIVKGKRYGGQEP
GUT_GENOME030509_0094847-307GKSALAHAELEAIARYSRRKLTAEEVYTFPLVLCDNDLDRDYECFTDAALEKLAVLFRGKTGIFDHDPRGEKQTARIYRCAVEEEPGRLNHRGEPYKALKAWAYMVRTAANADLILEIDGGIKKEVSVGCSIGKKVCSICGADRQEATCGHQPGKKYGGRLCYTLLEDPTDAYEWSFVAVPAQPAAGVTKSHAARKALHLTPERAIAEMERGELHLDAAAAYGLRKHIAALEEKSEEAGRYRQSLCEAIRRRTMLAQPQMP
GUT_GENOME066862_0019136-220EDLKLINKYSKAELKAEDVYTFKVACCDNQVDRDFESFSDKALFKMAELYVGKTIVKDHSVSTDNQLARIYKTAVETGENDVKRLVASAYVPISDSTKDFITAIESGIKKEVSVSCSVTSCVCSICGKNFIECNHQKSKKYDDYTCYVVLDDVSDVYELSFVAVPAQKNAGVLKSFKSYKPQPTE
GUT_GENOME044057_0099315-198DPDERAMQAINALAIQPLTPESVYRFSLRACDDQVDRDGERFTVDCLRALAPMYIGKTIIMDHCWSATNQIARVYNAEVVQDGTVTYLRADAYLPRTESTQPIIDKLAAGILREVSVGCSVNRATCSICGLPYSSCNHRRGERYGESLCVVELSEPTDAYEISFVAVPAQPAAGVIKAASNRKP
GUT_GENOME026365_0015912-228SGAPSAEQLEKINRLTKKAVTAEEVYVFSVRLCDDQVDRDDERFSESCIRELAALFIGKTGVMDHVWSAEKQVARIFDTEVLAENGATYLKGWAYMLRGAETDGLIREIDGGIKKEVSIGCAVKRRMCSICGAEYGACEHVKGETYGAERCAAVLCEAVDAYEFSFVAVPAQKQAGVLQKQTGGGVPVDEFALKKLQKEAELGKAYRQELETRVTRA
GUT_GENOME229667_0088213-214GVVVDEEMLAKINTYTIRQLSAEEVYVFRVAMCDNDIDRDEEAFPRASLTELAELFRGKTMIADHKWSTSGQVARVFDTVVEETGGQNKTGEAAARLIAYCYMTRSEKNEDLIDDIDAGIKKEVSIGCAVNRCECSICGSDRRKTWCEHVPGVEYDGEQCYIKLLEVVDAYELSFVAVPAQREAGVVKSYGAQKPREAEEKA
GUT_GENOME093696_007493-263KISSNAPHGASEAELALINKFTLKELKADDVFVFSVILCDNEIDRDGERFDDSALLKLKELFVGVTGIFDHAQSSKNQTARIFETEVISDKTKKTSYGGEYKFLKAKAYIPDTEKNRPLISDIASGIKKEVSVCCSVKEHICSVCGNDMRSEKCTHMKGAVYSGKLCHCILKDPADAYEWSFVAVPAQAGAGVIKSAEKTKVAKSFCDGENAFKKGDEIIISNALSKQLGQKLSELEKCAEEGKAYRRNLTKQAIKYSSLC
GUT_GENOME094049_0196012-210AGATLNEAQLEKINALTLRTLTSEEVFTFKVAMCDNEVDRSYEVFPLATLQELAPLYLGKTVISDHSRSADRQVARIFDTAVIPGEGITKSGEPYHQLVAYCYMLRTDTNQDLISEIEAGIKKEVSVGCAVQSAVCSVCGTDNRKHWCDHMPSKEYEGQTCYFKLEGAWDAYELSFVAVPCQPAAGTTKSYGAEKPQEK
GUT_GENOME096435_0412849-206LIVKMEAIHVGKTRNSTYYTEEGLKGGLKSWTQPYNKPVLTHHNQYNGEAIGRILKAEYAEQTLSGKSGLMFTVEITDPDAVEKVLDGRYQTVSIGASTDKVTCNICGTDRTQEYCGHFPGDTYDEQQCHFIIGNTFGREVSYVNTPADENAGNRSVE
GUT_GENOME012126_0001827-203KLKNMTESSVETYIDENSLMVDIEAIHSRVTRNNTYYSPECLKDSVPYWTNPYERPVIMHHNEKDGVIIGRIKSVEYKEAETRTGTPALVFTANIGDEEGKKGIKNGTLSTVSIGVIAHDLRCSICGTNLAEEGLCEHEKGETYDGKLCYWIINKMEPKEVSYVIVPSDIYAHNLRV
GUT_GENOME026466_0034317-258GAPGAEDLAAIRRLSRAELEADQVYTFALRLCDDQVDRDDERFTPEALERLAELFVGRPGLYDHEWSAKGQFGRIYRTEVVEEPETTGEDGGPGHALKAYAYLLRTPGNEELIAAIEGGIRKEVSVGCAVARRTCSLCGHDIGDRARCRHVKGQVYDGRRCVVRLEEPTDAYEWSLVAVPAQRRAGIVKGLAPDAHRALSDEDLRTLEAEAAMGRRYAARLREDVVRMSLLAGTGVRAETLR
GUT_GENOME034902_0105215-264VTEEQLEQISAFARRPLAADEVYVFSMILCDNEIDRDYERFPQPSLKKLGDLFVGKTGLFDHNPKGENQTARIFKTDVEFDAQRRTQNNEPYCALRAWAYMVRCAKNADLILEIDAGIKKEVSVGCAVKGAVCSICGADTRTEPCAHIPGQQYNGTICWHELIEPTDAYEWSFVAIPAQKQAGVTKGYSSENTRPKRAPDGSLTLSKAQADALDMRLSELETLAREGRARLEQLRRDFVRMAAFAMPALD
GUT_GENOME092612_0040219-221LTDEELEKINKFALKTLSKEDIYTFKLRICDNEIDRDFEVFPLSTLEKLKELFIGKTIIKDHSSRADNQVARIYDTELITESGRTKTAEPYTSLVAHCYMVKTKSNEDLITEIDAGIKKEVSVGCAIGEVVCSICGTDNRKRWCEHWNGKEYDGNMCYFELKSPTDAYEVSFVAVPAQPKAGTTKNYGPDQEDQENEEVITEN
GUT_GENOME052537_0144217-267SPEELDLINTWSKKPLTAQEVYVFSVRLCDNEVDRDFERFPAQTLEALAPLFLGKSGIFDHNWSARNQSARIFSTQVISEPERVTQAGDPYCWLKARAYMVRTDDNRGLIAEIDGGIKKEVSVSCAMGSAVCSLCGADRRSGGCSHVPGRSYDGKVAFTVLSDATDAYEWSFVAVPAQREAGVVKHYDSSKGDKTVTENDVRKALQNSKGAVTLTCDQARALARQLDTLEQEAQLGKSYKQELASQVVRLC
GUT_GENOME234009_0110818-244LAQLNQFTRRTLTQEEVFLFDVRLCDNEIDRDGERFSLEALEQLKTLFVGKTGIFDHNPKGENQTARLYAAELVQDPERITAAGEVYTFLKGHAYMVRTDANRDLIREIDGGIKKEVSISCAAASQTCSVCGSDRRNSPCSHRIGQRYGDKLCHVVLGDVTDAYEWSFVAVPAQREAGVTKQFRSETDGAQRCKQLEQQLQNRDVLLNRMQNALRQDVVRLRFLVEG
GUT_GENOME000203_0227920-236YEVTEEDLKKINKFTLSPVTAEEVYVFKTVVGDNELDDRNFEPFNLNALKDLKELYIGKTVIKDHRRSADNQVARIFDTELIQDASKLTGAGEIYTKLVTKQYMIKTSKNEDLIAEIKGGIKKEVSTGCKAKHAYCSICGTDNVTTYCNHYWGKEYNTVEGKKICYFTLDGAKEAYELSLVAVPAQPRAGTTKNYGAKVKDFINESEKENKKNSEIN
GUT_GENOME221922_0086711-235TPTQAELEQIRRYTRRDFAPEELYVFPAVLCDNQIDRDGERFSRDCLRELAELFVGVTGVFDHRASAEGQHARIFAARVVEDPSRPTNCGEPYTALRAEIYMPRNEQTAGLIAEIDAGIKKEVSVSCAVSRSVCSVCGEPAGTCSHRRGKRYGEIPCHVLHCSASDAYEWSFVAVPAQPLAGVTKSIRRTRAGVGAEGAAGDGDLLELGRAYQKQLQEEYVRCSL
GUT_GENOME108363_0008610-247PDKAAMEKINLYTRRAYKPEEVYTFSVVLCDNEVDRDGECFTKETLEELAKLFVGKTGILDHEPTSKNQTARVFDAAVKEIPGKVTSLNEPYAQLTAQAYVPRNDGTKAFIESIESGIRKEVSVGCAVKKRVCSVCGAESCVHVPGKTYNGKRCVRILSGAADAYEFSFVAVPAQRAAGVVKKFSPRFEESEKKKEVKTVYDIVKKLADGEDSVTVAKEELNMLKTELKALFDRAECG
GUT_GENOME246798_01325286-560LNKEELALINRYTRREMSEEEVYTFSVVLCDNDIDRDFERFDEQALHTLKELFVGKTGVFDHEPKAENQTARIYDCAVERITARKTRTGEDYLRLIAKAYLPRSKRNEDFILALDSGIQKEVSVGCAVSGNICSICGADRRKNACRHQKGKTYGGKLCHDVLTGVTDAYEWSFVAVPAQRSAGVIKSWKTQEGQVKELKEITKALSFGEDISLSAQEVKKLYSYLQSLEADAQYGRQYHEDLKSEVVRLSVLVQPDIGKRMMESLADKMDIAELK
GUT_GENOME026009_0027815-241PDAAELEKINRYSKTPLTAEQVYRFQVRLCDDQPDRDLERFDRASLPRLAELFRGKTGILDHEWSAKGQVARIYDTEVREEGHEAWILASCYMLRTEKNADLIADIEGGIRREVSVGCAMGKATCSICGQPYGVCEHRKGERYGGRQCLAVLSEPTDAYEFSFVAVPAQRQAGVTKGRKGGVSMTLKELVTSTGTPEQQDSLRELEQAAQFGRVCRKELLDEVVRLG
GUT_GENOME221775_002603-252INKEASVVSAGVPDAAQLEKINRFAKTPLSAEEVYVFSVRLCDDQPDRDFERFDTAALPRLAELFVGKTGIVDHAWSAENQLARIFAAQVEQDGGAHYIKAWAYMRRGVQTEPVIEQIEAGIKKEVSVGCAMARSICSVCGRPFGSCEHAKGERYGDTVCTAVLCDPVDAYEFSFVAVPAQRAAGVLKAWKGGEGMTLTELIQKTADKGKQQEFQALCKQAELGRQYAAQLRQEVVRLGLVLDVGLSEKT
GUT_GENOME214432_0066018-257LEQINRYTRRPLTAAEVYAFPLVLCDNEIDRDQEAFTKAALETLAGLFLGKTGLFDHQHTPLHQTARIYHTEVVEHPDRVTRCGEVYCTLNAKAYLVRCPETEGLILSIDGGIRKEVSVSCQITRKTCSICEKPAGSCSHRPGQEYGGKPCCIRLDAPSDAYEWSFVAVPAQREAGVTKQHEQSAPLRLAGNTPQELVKGFAQTEEVTLSASQQGVLKDYLEKLEQSATLGEEYHRSLQK
GUT_GENOME076852_0017913-261DDKELSLVNQFTRRPFEADELYLFSVILCDNDVDRDFEKFTPRALEELAELFIGKTGLFDHSMKSGDQTARIYKTWVEKQPERKTADGECYYCLKAGAYMVRTPENEALITQIDAGIKKEVSVSCTMHSAVCSICGADRRKSRCGHVPGGEYDGKTAVTVLSEAGDAYEWSFVAVPAQRGAGVIKSYNTVKGEFDLNEVVKSLQTCGDTVTLSKKQAHQLADYVDSLEQEAQLGKTYKKNLVRDVVSLC
GUT_GENOME194152_004906-258KESRGVKNSAVTNEELSAINTFTKRALGADEVYTFAVRLCDNEVDRDGERFPRATLEELAELFVGKSGIFDHEWSAKGQAARIYRTEVVEEEGVCSAGEGRCYLKGYAYMLRGGANDALIAEIEGGIKKEVSVGCSVAHCRCSVCGEEMGKCAHRKGERYGGKLCCAELTEAQDAYEWSFVAVPAQPRAGVLKRCGGGEAGTLKMLVKRRGSRANLEELETLEKQAALGARYMSALREEAKRWMLVAEQEADG
GUT_GENOME047960_0032512-261PDEAELAEIHKFTKRTFTADEIYAFSVVLCDNEIDRDFERFTPDTLEGLAKLFVGKPGIFDHSMKGRDQVARTYSCKTERTGELTSDGEPYVRLIAKAYVPRTSKTKDFILNLEAGINKEVSVGCAVGEEICSVCGADMRRGACTHRKGEFYGDGANRKQCHALLKSPTDAYEWSFVAVPAQRKAGVVKSFNYVGKEEQPMKAEEIIKSFSGGDVTLTEAQTKSLKAEFFSMQAIVDEYKAQLKKNAVRC
GUT_GENOME245500_0165343-306GVPTPEEMEQIGEYTRRKFGPEELYAFTVVLCDNETDRDLERFSIPALQKLAELFRGKTGIFNHTPDARNQAARIFNCRVETVSGRKTEAGEPYTRLVARAYLPRGGENDRLILALESGIQKEVSVGCAVKKQVCSICGADRKTTSCSHRKGHVYDGRLCVDILEEPTDAYEWSFVAVPAQREAGVIKRFSWEERRKGDLNMVWKEGAAHTELAKELRKAQDGLRFSGEECEKLAAYLENLEKDASAGRAYQTQLRENIRRLGG
GUT_GENOME059445_002087-257EQDIELIRQYTQKEISQDDIYTFNVQLCNNDIDRDGERFSVETLKQLAKLFKGKTGIFDHSMKAENQKARIFDAWVEKSNDKKTVDGKDFYLLKAKAYMVKSDENKNLIREIEAGIKKEVSVSCKTEKSICSICGADRYKTRCEHIRGRVYKNKTCYFTLENATDAYEWSFVAVPAQREAGVTKSYAQFETDFKNFKNISEEEITISKETAKSIAEKIEKLEKENTFINEYRQELAKDILKYFSANLPEMN
GUT_GENOME008139_0048727-226AQEVTDEELQAINRFALSPLAPEEVFTFRAVLCDNELDRQYEQFSVSALQQLQKLFLGKPVIKDHVHRADNQVARIYRTELSQDGGRTTNSGDTYTQLVAYCYMVRTAGNADLIREIEGGIKKEGSVGCSVSGVVCNICGADNIRANCAHYPGRTYSKDGEWRVCHFTLTGAKDAYEFSLVAVPAQRAAGVCKSYTGETI
GUT_GENOME235153_0122010-259KELELINGYTRTPLTADDVYIFSVTLCDNEIDRDFECFSRESLNKLKELFVGKTGISDHSMRSKDQAARIFYCYTQTDETKKTSTGESYTALKARAYALKNESSKALIDEIDGGIKKEVSIGCAVNNITCSICGKDMKAHACQHIKGRKYKGKLCYGILENPADAYEWSFVAVPAQRNAGVTKAFNISKSTCENTQDIIKSAVAGNAISHEETQALLQYISILEKDAKQVQTYKEHLAKDISRFAGIIMP
GUT_GENOME014297_00134417-644IGEEDLQKINTLTRREFSAGELYCFRVLLCDNDIDRDMEQFDVQTLTQLSNLFLGKTGICDHDPKSQNQMARIYDVSVENFPGKVNALGEPYYALIAKAYMVRTEANRDQILEIEAGIKKEVSVGCSVRESICSVCGKNRNVLECGHVPGQVYDGKLCYTILHDAQDAYEWSFVAVPAQRQAGIIKQAGCSREKNGVQKLWEAARTQNGVWIDREETSQLKELIQNLL
GUT_GENOME096866_013677-239LKSALKPEAGDLEQINRFTRRPFTAEEVYTFKVVLCDNEVDRDGERFASQALYGLAELFLGKSGIFNHSMDAKTQSARIYDTAVELDQNRRTSCGEPYARLVAKAYMPHTQGNEDLILEIDSGIKKEVSVGCSMGSAVCSICGTDRREKDCGHQNGQNYQGQTCCTVLGDPQDAYEWSFVTVPAQREAGVTKSCRTCTSSEQPEEAEWGRRWKNLLLGETAKYFAVACPGVPV
GUT_GENOME090682_010077-225QMLEKLNRFTRRELSADEVYIFDVVLCDNEVDRDGECFSLNALNKLKELFVGKTGIFDHNAKSSGQTARIFDTQLVSNSKSKTSRGEDYTCLKASAYMVRTDANADLIREIDGGIKKEVSISCSAKTSRCSVCGQNRQQKSCPHLKGKEYCGKKACVILDDITDAYEWSFVAVPAQVNAGVTKKSYTEAETGKFSTEDAETVEKLCENLRKDVSRLCLL
GUT_GENOME008627_0197420-280LEAIASFTRRPFSPEELYVFSVALCDNEIDRDYERFSLNALKRLAALYIGKTGIFDHSMKGSDQTARIFSCKVEQVPGRNTQAGEPYTRLTAQAYLPRTAKNQELILELDSGIKKEVSVGCAVAKATCSVCGADLRQGECVHRKGRPYPGHDCPAHVVLDEPTDAYEWSFVAVPAQKEAGVIKHYKPGQAGSSPAEQGEEQMDSVQKMLETAGEELTLTKAQAQTLRETMKALQSRAADGEAYRADLQNEVLRLCALSPFG
GUT_GENOME170648_0020617-220SAADIALINQFAVKELSPDDVYCFSLVLCDNDVDRDTERFTDQTLERLAKLFVGKTGISDHEWRASGQIARIYRTEVVATGEKTKLGAPLKQLRASAYMVRSDATKPIVDSIDAGILKEISIGCRVKSCNCSICKKPLKMNWSTWKYQCETGHIKGETYPEGLCVGDLEDAVDAYEFSFVAVPAQKGAGVTKSVGCVDDAFEVL
GUT_GENOME121122_0100613-256GEKQLAAINRFSRRKLSPDEVFVFAVNLCDNEVDRDGERFTRPALEALAPMMLGKTGIFDHNPQGAGQTARIFYTQVVEDPSRETSCGEPFCQLRALCYLPRCEKNTDLILEIDAGIKKEVSIGCAVAKKSCSICGAAAGSCGHKAGERYSGQLCCTLLEEPTDAYEWSFVAVPAQREAGVTKAYGAALSPGELKKKLALGEVTLKSGEAKALLTQLERLEPLAQAGEEYLSSLRREVEKALVL
GUT_GENOME190431_0020155-253RNYTRYTPNCLKKSVPSWTSPYRRPLIKHHNEEDGEIIGRICEAKYVTKNTRSETPALLFTVNIPGEQAKADVKSGLLETTSIGVIAHSVKCSICGQELANGETCEHERGAIYNGETCYWDIHEMEAKELSYVIVPSDMYAKNIDIYPATASNNKTKAFAESLNQDLNLLKGDTEDMSEELKVKLQESEAKVTELTNTV
GUT_GENOME110843_0025263-225ADDDYTFTVKLCDNEIDDDNERFSSQCLKQMAEMFVGKYGCIGETRIAKIVSTEIVTDENKWATFYERYQWLKATVIIPRSNETEQLIEQIEHGEKPEISVACSVNTSACSICGKTNRSCTHKPGEYYGGKLCYMTLYYPREVFEWAFVEPAKSVEKKRLVEH
GUT_GENOME210309_0097414-257VSAETLEKINRFTKKQLSAEDVYVFSAVLCDNEIDRDGERFSVEALHMLAPLFEGKTAILNHSMDAADQSARTFETQVVTDTARKTACGEDYTYLLARSYLPRIAKNETLIAEIDAGIKKEVSVGCAVAKRVCSVCGADERQNPCGHRRGKTYDGKVCHYILDEPTDAYEWSFVAVPAQKNAGVTKSYRDADALCKAICEKQADGMTLSRAELTALADRLEILEKSAADGAVYRDTLRVRAVKG
GUT_GENOME066178_0350016-214NDIALIHQFTRNELKPEEVYCFSVTLCDNEIDRDMEQISDEALHKLAELYVGKTGIFDHNWTAKGQVARIYRCEVVETGKRNSLGAPYRYLRGSAYMLRNDETRSMIEKIEAGILKEVSVGFAAKPVVCSVCGAEMRGWECQKHHQKGKVYDGKNCYGIITEPTDAFEFSLVAVPAQRAAGITKAFSNAWDAFTVLMES
GUT_GENOME089210_0006919-265EEELAAVNRFSKTPLTGEQVYLFGVRLCDNQVDRDEEYFDRDGLERLAQLFVGKTGMFDHAWSAKNQAARIYRTEIISEAGVCTETGEPSCYLKGYAYMLRTEENAGLIAEIEGGIKKEVSVSCAVSRVVCAICGNDISDRSRCSHVKGRIYEGKRCVAKLEGPTDAYEWSFVAVPAQRNAGVVKGLRQAGPETSFRQLMEELPQGVWSRQLAQMEREAALGRRYLAMLGTETVRLGLMAQVGLDKG
GUT_GENOME198457_0040618-280LELINKQSRRQLKKDEVFVFSIILCDNEIDRDFERFTTESLHKLAELYIGKSGIFDHSMKGKDQMARIFACEVEPVTGKKTSTGEDYMRLKARAYMPKTSRNEDLRTEIDAGIKKEVSIGCSVAKTVCSICGNDMRSNQCKHVKGRQYKSNGAKKLCYAELINPADAFEWSFVAVPAQPKAGVVKSYGTNAYAKGEDNLDIATTIKSLSAVNDNGVLLHKSEAEALGDYIRKMEYMAGIGETYLRDLKAEVLKSCSIAKPELK
GUT_GENOME158598_0010815-263VTDEDLKLINQYTRRELTAEEVFVFETILCDNEIDRDGEAFSLACLHKFCELYPGKTGVFDHNPRGESQMARMFKAWVEHRPEQRTRYGAPYAALHGKAYMVRSGRSADLILEIDGGIKKEVSVGCSVTRKLCSICGANWKKERCSHAAGEEYGGKLCFAILDDPDDAYEWSFVAVPAQPAAGVVKAYTQKGEFQLNQIDQIVKSLDPNQGEELKAYLSALEIEAQAGRAYLAKLKKEVLSLSCLSGAK
GUT_GENOME175447_0181412-268PPESELAEINRYARSPLKQEEVYAFTVTLCDNEIDRDGERFTAETLEQLAALYPGKPGLFDHSMKGRDQTARTYRAWVERDEAKRNSLNEPYTALRARAYMVRTPENQSLIAEIDAGIKKEVSVGCAIATKACSICGANRHTEPCAHLPGRFYEGQLCHTVLSGAKDAYEWSFVAIPAQPKAGVTKTVKAYSTQEGGSKAMKDWHSVLKNAENGVTLTTEQAGSLLAHIQSLETRAQEGEQYRSALSQEVVRLCALH
GUT_GENOME237425_004974-234INKYSLRELNKDEVFCFSVILCDNDIDRDFECFDEDALRTLEKLFVGKTGILNHSMKSEDQSCRTYKTQLIRDNTRQTLTGEPYMYLKAWCYTVRSEKNESLIKDIESGIKKEVSISCASKTKICSICGSQNCNHISGRIYDGEYCHKTIKDITDAYEWSFVAVPAQRRAGVTKSFKKEKTMENILKSIKEEKSLTLGENELKKLSSYMDNLKSMSTDGEKYRESLITDAK
GUT_GENOME014667_0109715-221AEITDEMLAKINLYTRKTLTADEVYIFPIVMCDNEIDRDYEHFTANTIKDFAEMFKGKTVICDHNYKNANQCARIFDTEVIHNPTVKTFDGQDLYQLKAFAYMLRNDANAEIIANIDAGIYKEVSVGCSVKSATCSICHNNYNDYNECRHWAGYEYDDGKICNVALDGAKDAYELSFVAVPAQPGAMVTKSKCYDGDDTAKKERKSK
GUT_GENOME247679_010599-235EALLEKLNRFTRSEVKAEDVYTFPVILCDNEVDRDGERFSADALQRLSELFVGKTGIFDHDPKGENQTARIFDCEVKAEQNRLTKAGEPYSCLVAKAYMIRTEKSKDLIAEIEGGIKKEVSVGCSVKRKLCSVCGTDLREEQCGHIKGKYYDGKLCSVILDEPDDAYEWSFVAVPAQKNAGVTKTYGSKAEEASLKALKDRLEKQEELNARACGILKREIMSLSFLC
GUT_GENOME125245_0026012-260SSGLLPGHLEDINRLSRASLSAEEVYVFSLCLCDNEVDRDQERFPEKTLEQLAPLFVGKSGLFDHSWSARGQAARLYRTEVVREPERLTQAGDGYCWLKGWAYMVRTPDNQGLIAEIEGGIKKEVSVGCAVKRAVCSICSTERGQDCGHKPGEVYDGARCFFQLEEAVDAYEFSFVAVPAQPGAGVVKGLCPAGEPAQTLRELAAGRDLCIRELDGLEREAALGRKWLFTLREEVVRMGALADSGLDRT
GUT_GENOME006324_0167810-260SGAPTPQELELINNYTVKPLSADEVYTFGIVLCDNEIDRDFERFDIPALEKLAELFVGKTGIFDHSMSGRDQTARIFSCRVETDESKVTSAGEKYTKLCARAYMPRSEKNAALIEEIDAGIKKETSVGCSVGRSVCSICGKDGRTDPCAHIKGREYGGKLCHRILCDPTGAHEWSFVAVPAQPAAGVTKSYRADEQTVKTVKRLSCANEGVTLTKAEAAGLYGYIDELEQLAREGKEYREELICDVIRMGA
GUT_GENOME026472_0050111-252DKELELIGRFARKKLSRDELYTFPLILCDNEIDRDNEKFTIPALKTLAELFVGRSGIFDHNMSGKDQTARIYSAVVVSDPEKKTADGEPYTYIQAKAYMVRTDKNKDLIAEIDAGIKKETSVGCSVRDISCSICGKNIKTEGCEHQKGKYYGGKLCCYLLSEPADAYEWSFVAVPAQRNAGVMKSFAAGGRSASQEELARFAEAYRAELRSEIIKNAAKVLPTMKSETLEDICSVLDLRRLR
GUT_GENOME203829_0052359-257RNFTRYMPKCLKNSISLWTTPYRRPLIKHHNEENGEIIGRICAAEYKTSKTLSGTPALVFTVNVPGEEAKKDVKNGILSTASIGVMAYDVRCSICGTHLEDGDECEHERGREYTVNGKKEVCYWDVYSMEPKELSYVIVPSDIYAKNTKIYPATNSRNKPIIKECLNNKGVKKMPNNDDLEKELKEAKDKIAKLEQDIK
GUT_GENOME108085_0047413-247ELSKEDLALINTFTQKEMTKDDIFTFCVILCDNEIDRDYERFTEDSLHKLADFFIGKTAIRDHSMKSSDQSARTYKTEVIKDSTRKNSLGEDYVYLKAWCYMPRIKKNEELIEEIKAGIIKEVSVGCAVKSCICSVCGKELGKSECSHVRGGVYDGRVCFGGGELQNPTDAYEWSFVAVPAQKNAGITKKFGFTADGDANRQLTFLKSENERLKGLAKAGQLYRTEKQTELIKAF
GUT_GENOME096453_0145820-210SEICRSDIDAINGFTLEELSDEDVFTFKIAMCDNNVDRDNEAFDEPALKQMAGLFVGKTIIKDHNHKADNQIGRIYACSVEQPGGLSDTGEPYMQLVAKCYVLVNDANASLIADIKGGIKKEVSVGFRLGSYICSVCGTDNAEAWCKHVPGREYDGRKCHFTMSRIEDAYEMSFVSVPAQREAGVVKLFGG
GUT_GENOME260243_0108315-261EGQLEKINALSRRVLKREEVYVFSVVLCDNEADRDFERFSTEALEKLSELYIGKTGIYDHSMKAKDQVARVFACTVEKVESRYTSTGEPYCRLKALAYMPRSVKNEDIILEIDSGIKKEVSVGCSVGRRYCSICGADKNSHECRHTVGKYYTVKGKRVLCCTVLDDPLDAYEWSFVAVPAQKEAGVTKSFNPDGASVQEDVIKSLYTSGDTLFTQKQKQGLCALFERLKGYERTAMDCTKQLRDEVL
GUT_GENOME091463_0091613-250DSDLQKINALARRELSADEIYVFNVTLCSNEVDRDYEKFSIESLKQLAPLFIGKTGISDHSMKSSDQKARIFDTYIEKQDGRFTVDGEPLCCLKAKAYMLNNEKNASLIEEIDAGIKKEVSVSCSMSSSKCSVCGNDRKKGGCSHIRGREYNGKLCFDTLSNAADAYEFSFVAVPAQREAGITKSFKFTQEENMQDILKSIENCGSDITISKSQAHKLQSYIDDLSEKASLGEAYKEE
GUT_GENOME087324_0014311-253AAPDMEQIGRYTRRAYSPEEVYTFSLVLCDNEVDRDFERFSREALEGLRELFLGKTVLLDHQRSAASQTARIYDTGLEELPDKTTQAGEPYVRLTAKAYLPRTEKNRQVIELIESGILKEVSVGCSMKRSVCSLCGKEQCQHIKGRIYGGRRCHRVLCDPADAYECSFVAVPAQRAAGVVKHYEGGMDMDIEKRLEEAPEEGVLLTKSQAGELLAQVKAWKEEAQWGRSYRERLQGDVLKYSA
GUT_GENOME078723_005078-258SGYILEKSKIENGDMEKINQFTRRKFAESEIYTFSLVLCDNEIDRDYDRFSLRALRQLQTLFVGKTCIFDHERKSANQTARIFDTKIQTQERQMPWGERYTQLIAKAYLPMTDKTQDLITAIESGILKEVSVSCAVSESRCNLCGKSSCMHQKGKKYKGEICHQIFEGISDAYECSFVAVPAQRAAGVVKYFAGQEVKQVKDILKNIYEREGMSMDAEQVNQLAEYLKKLEEKAEWGEKYHEDLSRKVIKY
GUT_GENOME004279_0085312-257PEESDYEKIRKFTRRDFEKDELYVFEVSLCNNDVDRDNEKFSIAALNELAKQFVGKTGIKNHSMRAEDQSARIFETRVERIPGRKTADGEDFYTLRAKAYMVRSEATAQLITDIDAGIKKEVSVSCSAEKRICSVCGKDKSREYCGHVAGKSYGGKKCFTVLDGIADAYEFSFVAVPAQKEAGVEKSFGGLAEKGDVEKMLSGGGEITLSSGQAESIKAFLDNMSDDAELGREYRKSIIGELVRLC
GUT_GENOME160436_0169311-210ISSLKLSDEELAKINKHTLSVVTADDVFVFKAMIADNEQDDRNYMPFTTKALQDLKSLYVGKTFVFDHKGSAEKQIARVYDTEIVTSEDKTELGENHAELIAKIYMIKTASNADLIKDIAGGIHKEISTSTVPSKLICNICGVDNTKEFCNHWNGRKYLVDSKEKICKLIIDGCKEAYELSFVIVPAQPRAGTVKTFEKM
GUT_GENOME017973_0097313-200SFAPDEADMALINAQALEPLAAGDVYAFRVDMCDTAVDRAFEHFSDEALGQMAKLFVGRTVIKNHDRKCDNQVARLFRCEVEDRGDRRALVGYAYTLDNEANATFVAGLKAGIYREVSVSFACKSATCSICGTNNVERYCKHYPGAEYDGKTCTYELSDVSDAYELSFVAVPCQPRAGATKSYGDEPW
GUT_GENOME006471_0024327-204EGLDTGNVIASDSVMVDIEGIHVGPTRNFTWYTEEALRSSVPTWTKPYQRPLILHHNEKDGKIIGRVLAAHYTDMNTRSKTGALVFTCNVPDDDGKRGVRDGRLKTVSIGVIAHDVRCSICGHVISEYGECEHERGMEYDGNVCYWMIHKMEAKELSYVIVPSDIYAHNIKIYSPGEK
GUT_GENOME000137_002945-267RQGTILKSAVPAKEDLQKINRYTRRELEEKELYCFNVILCDNEIDRDGEAFSVPALHKLAELFLGKTGIFNHDPKGENQTARIYDTKVCVKEGTATQNGEPYTYLLARAYMVRSEKTADLILEIDAGIKKEVSVGCSVDSVTCSICGTDLRGRSCEHQKGEIYNHRVCCAVLSEPTDAYEWSFVAVPAQKNAGVTKRYGAFRTEEQPQEAGNTVCREPEEVLKLLKLTEKELVLNRLEAQKLTGYVNRLKQEAKLGKEFRRRK
GUT_GENOME085017_0094814-263VTAEDLEKINRFTVKALSADEVYSFNVILCDNDVDRDGESFTNEALMQLAELFVGVTGIFDHDPKSSNQSARIYHAECVSIPGKKTVTGEGYRCVKARAYMPLTEKNADLIGDINAGIKKEVSVSCAVGSFTCSVCGSDMRYDPCNHVKGEIYDGKLCYCKLSDINDAYEWSFVAVPAQVNAGVTKSYKKEIETMENCIKAIKDGNAVKLSESEAKQLSEYIKRLEKQAQDGIIYRRSLTEETAKFAVLS
GUT_GENOME179803_0173816-208PDEADMEKINALTLRNFTPEEIFTFKVKLCSNEVDRDYESFTAECLNELAELFKGKTGIKDHHQNTDNQVARIFDTKVVTEAKKQTALGEPYTYIEAYCYTPRLPNTADFIAEVEAGIKKEVSVSCSILKKFCSICGKEYKSPWATGCAHLPGRDYENVKCVARLAKALDAYEFSFVAVPSQRDAQAKKEYTV
GUT_GENOME103484_0101022-273DKEMELINAYSRRKLTKDEVYVFGVVLCDNDIDRDNERFTVESLFELEKLFVGKTGIFDHSPTAKNQTARIFACSVESVDGRKTATGDDYFRLTARAYIPKTKGNDEIIQAIDSGILKEVSIGCAAGEVKCSVCGESLNHCSHIKGETYGGRKCYGELCGIYDAYEWSFVAVPAQRSAGVIKSFKGKEMKMEEILKSIYTEKDINLSGEECKKLCSYIDELKKSAADGIYYRDSLTSEVLRLSAVVQPDISR
GUT_GENOME199518_01742406-629PAEAELAAINRFAKSPLRAEEVYTFSLRLCDNEVDRDWERFDTAALNTLGDLFVGKSGIFDHQWTAEGQTARIYRTEMVREGAQVTAAGDGYCWLKAWAYLLRTEKNADLIAEIEGGIKREVSVGCSVARRVCSICGAEGGTCQHTPGQRYGEQLCYLELRDPTDAYEWSFVAVPAQRKAGVLKRYGHENQGMAQLRAQAELGRKYLRELRREVTRLAMLADDS
GUT_GENOME013624_0161218-262EEDLALINSFSQKKLAKDDVFVFSVILCDNEVDRDFERFTVESIEKLAELFVGKTAIKNHSMNSEDQSARTFKTEVIRDEAKLNSLGEQYVYLKAYCYIPKIKKYETLIEEIQTGIKKEVSIGCSVEKSVCSVCSKDVRLGACTHKKGRKYNGQLCYYELISPTDAYEWSFVAVPAQRNAGVTKKFDLGEKRQMNDVLKAINSAQGEITVSAEEAREIASYINGLTEKAKDAEMYRESLENDTVK
GUT_GENOME029380_0115219-281KDMELINHYSKRPLKRDEVYIFSLILCDNEIDRDFEQFSENSLKTLAEMFKGKSGIFDHNPKAENQSARVFDTSVIEHCDKKNSVGETYFTLNAKAYIPVTEKTRSLIEEIETGIKKEVSISCSVSEKICSICGCDVRSAKCSHTPGETYDGKKCFGILENPTDAYEWSFVAVPAQPAAGVIKAFKDKEEKVKVNDIKEKIFSAKGSITISEKESCALKKEFERLEKLSLWGEEYRRALTEDTVKLCAVTVPELLPDTMRSIC
GUT_GENOME244786_015319-226SQVNKINELTRRNFSKEELYAFPIVLCDNEIDRDGEQFSVNSLKQLAKLFVGKTGIFDHNPKGENQTARIYSTALITDNTKKTQSGEDYTYLLGYAYMIRTQKNSDLIKEIDGGIKKEVSVSCAVKEKLCSICGKNQNVKSCSHIKGKSYNGKLCAHILNNAFDAYEFSFVAVPAQPKAGVTKKYSAGLKETPLEEDNETTKKLNELENKLRKQTLGS
GUT_GENOME016866_022335-193TSEQLAKINQKALQPLTDDQAHVFQARIIGTKRIDKYKMKVTPNFLRKMADQVKEGVALLVDHPWQKWDALSFPYGRTFDSRIVEEGGELELYGDHYMAKGLEANGISTDQLATGIDSGTIFDTSAGFVTTKHNCSICGGDYFRGAECSHMRGQEYDGKECLVLADDGYIMENSIVFDGGYEGAGITRG
GUT_GENOME178066_0075967-230YSYDSLKQNVINHDWTNYSRPLLRHHNLEDGTSCGRIDKSFFYDHSTKEVTAEFSDSKLKKDVLDYFESKGAFDKGTASTIVELSVDEYTYERMKNGLDNTVSQSSMMSKATCNICGKDYFDGCQHIAGKTYDIEDGDNTVQKTCILQCSDFYPVELSIVNCPG
GUT_GENOME245503_0197613-267GSVSPEEMGKINQYSRRELDPGEVYTFSVVLCDNEIDRDGERFTINALKSLSGLFLGKTGIFDHNPKGENQTARIYDACVLTDSSRKTGVGELYTYLKASAYMVRSSKTEELILEIDAGIKKEVSVGCSVQKMTCSICGADLKHGRCPHKKGEIIGGKVCHTVLDQPSDAYEWSFVAVPSQRAAGVVKNCGETPRNEEMLLKSLYAAERDLVLSPAEQQELTGLLEQEHRKAALGERYVEELRDEVLRLSCLVDS
GUT_GENOME030140_01443210-424LCGKNDEIEKSGALEKLAELFKGRTGIFDHDPKSSKQTARIFDTWVETLPEKTTTDGEVYRRLMAKAYMVRTASNGDLISEIQGGIKKEVSVSCTMGKKLCSVCGADMYKGGCDHEKGGEYGGKLCYHILDEPLDAYEWSFVAVPAQVNAGVTKRFALREKQESTDKSYELALAREAFEKDVLRLSYFCKPFMSAKRVKELAELMTVTELIDFRG
GUT_GENOME092238_0051418-278EAELEKINRLSRRILKADEVFVFSVVLCDNDVDRDFERFSEQSLKTLAALYIGKTGIFDHSCKGRDQVARIFDTQVEYPDGQQTKTGERYCRLKARAYMPNTAKNADMIAEIDAGIKKEVSVGCAVAKAVCSVCGKDQRESPCSHRKGRFYGSGADRTLCYYNLEQPTDAYEWSFVAVPAQPKAGVTKGFSQRKGGEQMNFDQIEKELKKGSVTFTEEQCKLFREELSELKKLASFGKCQLEKLRKEVSRLGGITNPDIGA
GUT_GENOME017195_0195116-243SPEELKAINAMSKKKLRPEEVYAFAVRLCDNEIDRDNERFPAATLEELAPLFVGRSGLFDHQWSTRNQAARIYRTEVVRESWMTEAGEPYCYLKGCAYLLRTEGNRELIAAIEGGIKKEVSVGCAVERSVCSICGEEFHTCPHEKGAEYGGRRCWAELVGATDAYEWSFVAVPAQRNAGVMKHMRMEQEAALGRKYLESLRGEVARLGGLAGLGLEHAVLRGIADKLG