UHGP-MC 100580


Information


Number of sequences (UHGP-50):
92
Average sequence length:
286±15 aa
Average transmembrane regions:
0.02
Low complexity (%):
2.99
Coiled coils (%):
0
Disordered domains (%):
2.01

Pfam dominant architecture:
PF04960
Pfam % dominant architecture:
9239
Pfam overlap:
0.92
Pfam overlap type:
equivalent

Downloads

Seeds:
MC100580.fasta
Seeds (0.60 cdhit):
MC100580_cdhit.fasta
MSA:
MC100580_msa.fasta
HMM model:
MC100580.hmm

Sequences list (filtered 60 P.I.)

Protein Range AA
GUT_GENOME212738_0033829-306EDFRVSDANPDDFAITVMLTDGTVVEAGDVDKAFAMGNIAKVPVYTTLFSQNESAKEVMEKLNGGCGCNSKGKGCPSTPKPHHLPVNRRGVRAVSIIEPTGDSDGKFDIISSMMIGLAGDSPVLDDNLYKRLTADNQAADVENRLAESEFVLYDNAQIAIDLYTRLTAMRATTRQLAEMGATIAAGGRNPETGNELFDSSVSRRVVAAIAAKGMHKMSMQWLVVSGVPAQAGFGGGVLGVVPGVLGIAAYSPKVNGKGVSPKAAHAVAYIANQLGVNA
GUT_GENOME172923_0119612-306CWKRIDEGQVATYIPVLAKVDPYQLGVCLYNVDSGKKDCAGASNVRFAIESVSKVIALLYAIERLGLTTVEDQVGTRQTGFAFDTILNMEITKETKPLNPFVNSGAILVSSLIEEEDHLSSFDQILNFTREICNDPQITLNEEIYQSELRTGDMNRSLAYYLKAKETLANDVTTSLETYFKQCSMMVTCESLANLGAVLANDGIAPWNGERLISSDAATYTKSVMMTTGLYNQSGTYSVKIGVPTKSGVGGGLVASAPKYGIGIFSPALDQAGNSIAGLALLEMISRELDLDIFR
GUT_GENOME251398_0078931-319QCRPYASEGRLASYIPELTKADPNALGVYLIGSDGKHYCAGDYTKKFTIQSVVKPILLLLALMDNGIEQVRSRIGVEATGKPFDAINVSAQSLLSEHLNPMVNMGAIALCPMIRGNDYNERFERLLDLTRRLAGNPDIALNESVYLSEKRTGNKNRALAYMLKSQGMIADDVEDVLDCYFKACSISVDCRDLAKIGWVLASHGKPLKRLFPYEYARYVNAVLMTCGMYDGSGEFAVRVGVPAKSGVGGGIMAVVPTRMGIGIFSPALDEKGNGYAGIMLLQKLSEQLFL
GUT_GENOME159493_0059319-311ELNKSEKGGTIDPRVASEAKAGSFGISVVLTDGRSVDKGDADALFALGKIARIPLAVVLKTQQAAAAKDGQGTHCGCGCDKGGACSCAGHEKKEKLPLGRCGIRAISAIVPQGDPEGKYGVISDMLVALANSEPAFSDNLYKLYQEEVASMDIVNAYKNSELKLADDVAQCIDMYTRLMSTELSARQLATVGATIAADGRNPYSGEYAFDGSIAADVVALLATRGKHFARHWLRKTGLPGIHSFAGAIIAILPGFGAIVAYSPELNDCGVSIKAANAIEYIATQLQLNVFASA
GUT_GENOME257007_004369-307RLVDQCLPAARQGKVASYIPKLAEADGNQLGVYWMEPNGAGVGAGQWQKRVTVQSIVKVAIFLQALADCPLEQMTQKISLNATAETFNSIVDLEEKNQHRPLNPYINSGAIATLSLVQGGPGQRFQRVLELLQALTGNSRLTVEEEVYRSEKATGDRNRALAYFMRSTQILSEEVEPLLDDYFRLCSVLVDCRDLAAFAATLAKGGVDPLTGRRCASREHCRTALAVMVSCGMYGQSGDFLVRTGVPAKSGVGGGIMAAVPGRAGIGVIGPALNATGNSSGGMELLARLAQQLDLSILS
GUT_GENOME095248_0110722-305RAHKRGQVAHYIPQLASVDVHQFGISVCLASGEQLSAGDAATPFSIQSISKVFSLAIALGRHGDRLWKRVGKEPSNYTFNSVIELEQEGGKPRNPFVNAGALVTTDAMLDAPDAGGGLDELMDFVRTAAGDDRISIDEQVAASEYRTAYRNFSLAYFLRSCGNLHTECERLLQIYCRQCAIAMNCQQLASVGRFLAGFDSAAHLLNPAQARSINALMLVAGHYDGSGDFAYSVGFPGKSGVGGGILAVVPGHASIAVWSPGLNAFGNSLLGTRALQELSEFTGW
GUT_GENOME006135_0140614-292DSVRDIKDGEVADYIPALKDADPNLLGIAMTTAEGHTYSAGDVEAEFSIQSMSKPFAYAATLTDRGIDFVETKIDVEPSGDAFNDLSIEEGTKRPKNAMINAGAILAHQMLIGEDATMQERVDRVTDFFSALAGRKLTMDKEVFQSEMDHSHRNLAFAHMLTSYGMMEGKAHEAVAGYTAQCSILVNVQDIAMMSATLATGGVHPVTGERVIKRSVVRHVLSVMASSGMYDSAGEWFTDVGIPAKSGVAGGILGAIPGQLGVGVFSPPLDSHGNSVRGV
GUT_GENOME096246_0043034-331EQAHAKYKNLNDGKVADYIPALASYNPKNFAIVLATVDGKIYQAGDVNKAFPIQSISKVFTMGLVMQQSGPETVLEKLGANATGLPFNSGVGLEASKGTLENPMVNIGAMSAVSLVRADNNADRWTKIKSNLDAFANANLAVNEEVFKSEMDTNQHNKALAHLAKSFNRFYSDIDGAVEIYTRQCSVDANTLQLARMGAVLANNGKSPFNGQQLISQQYMPYVLAEMAIAGLYDGSGKWLYQVGIPAKSGVGGGILAVVPGKYSVAVYSPPLDEAGNSVRAQAAIKYIADATQANIFV
GUT_GENOME096268_0279528-331FKKQTVEGEVASYIPVLAAADREALGISIIGKNGGTIKSGTIESKFTIQSISKILTFIVACMERGLSYVLNRVDVEPTGEAFNSIMHLEMTQPKKPFNPLVNSGALTISSMLEGNTSDQKLETLFILLEKIIGYRPEINVEAYESERDTSMRNRAIGYYLLEEGYLESDLGITLETYFKQCSIDVTIDDLARIGLVLSNDGLDPDTEEEIIPRKVTRLAKALMLTCGMYDASGKFAAYVGIPAKSGVSGGIMAVVPPRVREEEFPFLEGCGIGVYGPALDNQGNSMAGIKLLRHIANQWDLSLF
GUT_GENOME029119_0161521-300LKDNNEGVADKFFATDNENAFGIAVVLADGRVIKRGDADMPVAMDSIAKVPVSTVLLTQYTPDEVIRKSGYLDPKDPAVKKAKVPTGRHGLRELSAIEPQGDGDGKMDILSERLTDLMGVSPVLDDRLYKAYLDEARNADTVNVLAASGYYLYDDAELTVDIYSRLRALKASATQLATMGATIAADGFNPLTKENVFDGVVSAPIVAMMAVKGPHKMSKPWLVGAGIPAKNSRAGAMAGVLPGVMGIAAVSPRLNEAGIPVKAALAIKEIANELQFNVFN
GUT_GENOME000530_0187820-300RGEPSGYLEDVAGQHVDGLAVAVCTVDGHQAAAGETEHAFALQSVSKALTYAVVLQEIGLDHVLEDVGVEPSGESFNHLSLHDDGRPCNPLINAGAILVHALLPAGDEDERTALLLRRYSAMAGVELEVCEDVLDAERTRADRNLGLAHLLADAGRLPISAREAVAGYLAQCSVLVTAPQLAVMGATLAAGGTNPVTGERVLDEWVAQQAMSVMLTCGMYDASGQWVAEVGVPAKSGVSGALLGAVPGALGIATWSPRLDEQGNSRRGIGLFEGLSARWDL
GUT_GENOME096459_0220240-322GAVADCIPTLARADPGSFGVALAEVGGEVHEWGDSRLEFSIQSVSKAFVYALVCQELGHRAVHDRVGVNNTGLPFNSLMALELNGGHPGNPMVNAGALVTTSLVPGATPARQWDRVRRCLGAFCGRELDVDEATYRAESRTNGRNRAIAELLASYGHIEGDPVAVADVYTRQCSLAVTTRDLAVMGATLADGGINPVTGERVVDPGVCRDTLAVLASTGLYERSGEWLFEIGLPGKSGVSGSILTVSPGKAGIAAFSPRLDAVGNSVRAQRVTAHLSRSLGLD
GUT_GENOME176996_0027930-321RKYLHEGAVADYIPELAKVEANKVAISTIEDGKLTSVGDSKIKFSIQSIVKVILYALAMKNYKVAELKKYVGVRPSAKPFNSVIELELSEKRIPVNPFINAGAIIIVAILHNVYREKTFDVILKKASEFLGEEVDYSREIAESEKESSFTNRTLIYLMLAKGILPSDTKVEEVLDTYFKACSILVNTENLAHMSYVISNDGIDLEGKEIITAKEAKVLRSLMATCGTYDYSGDFAIRVGLPAKSGVGGGIVTASKNKNGLAVYAPRLDKHGNSYSGVRMLEFLSQKLDLSIY
GUT_GENOME096430_0342623-304EDDQVADPGAGPDRFALALSELDGTEHVAGDADVGFEVQSVVKVFALATALQHVGEEVFRRVGVEPSGDPYDSLVQLEREHGMPRNPFINAGAVVVTDVLVEATGSADAAQAAVTDLLSDLVGERLEPDQETVEKERAGSSMNLAMGHLMSSFDNFTHPVEEVVDVYARLCALRVTTRQLARAARFLANDGVDPRTGNRVLTEPLARRVNAVMMTCGTYDAAGQFAYDVGFPCKSGVDGAVMGDVPNRFGIAAWSPPLDSAGNSRAGRAALHMLSERLGLSL
GUT_GENOME143137_0037728-323LAYGRSFLGSAKPADYIPELAGANPMQLGISIITADNQAFGYGDWEQHFTLQSISKVIALILAVEDMGMEAVFEKVGMEPTGDPFNSIIRLEKASRKPYNPMINAGAIVVCSCIKGNCANDRTQRFLKFARKLCGNPELKFDEAVFLSERMTGDKNRALAYMMKTNGILEGEVEEHLDVYFHLCSLMVCSRDISFLGAVLAGNGCDPVTGERYMSGNVLKLVRSLMVTCGMYDASGEFAAKVGLPSKSGVGGGILSVVPGVMGIGAFGPALDEKGNSVGGVKMLEYLSQKLELSIF
GUT_GENOME000584_0124112-306EYGKNFTSQGKLVDYIEILKDADKNNLGACIIDKDGKVYEAGKSREEFAIMSIVKVLLFEIVLENYELSEIKEYMRMEGSSKAYNSILDLEAEAGKPVNPFINSGAITSAYLIYKKFKEKSVEILMNRARLVMDDDSLDYSQTLLDTARTGGENNLALAWILKKHGTIDRYTSVEDVINLYNLACSIMVNTRDLAVFASMISKDGKNLEGDQIIKKENARILRTLMAICGTYNYSGDFAIDIGLPAKSGVGGGIMATTNKGFGICTFCPGLDSFGNSVAGVKMLEKISKELDLNF
GUT_GENOME236863_0092037-315DWAKDYACFGELASYIPQLANVDPSKSAIVVGDLLGNQICVGNGLNVKVSIQSVIKPFLYIYALEQGVSPNDISYIEPTAMQFNSDAVLQPESKKNRPGHPLNNAGAISSAGSIINFEDFLDFMRDLTENPDLQVLDDVFLSEMATNANNRALAMRLVATGRFSSIKHGEYALENYTKACAIGITPKELATAAVVLARGGKRGDKQLLSENNVVRAVNAMNSYGLYEHTGEISLIAAGVRALSCKSGVGGLIMDIDPGRGVFCTYGPKLDKAGNSVFGK
GUT_GENOME013250_0130820-309AHQTLQSVNEGANASYIPYLASVPSNLFGIALAFTDGEILEVGDTRYDFAMESISKVFTLACVLEEMDKETFKSKIGSNASSEPFNSVIDLELHDEHPYNPFDNAGAIATVGLIGAASPEERWNKIIGTYNRFAGRELTVNDAVYRSESDTNAHNRGISWLLKSSGSLQVDPDHACDIYTRQCSVAITCRDLAIMGGTLANSGIHPVTGKRVIKEENVPPILAEMMMNGMYENTGNWLYSAGLPGKSGVGGGILAIVPGICALAAFAPPLDVHGNSVRGQKAGIFLSEIL
GUT_GENOME229539_0006847-328GHVADYIPALANIDPKQYGIAIATLDGQIVGTGDYQTKFSIQSISKVFTLAMVVRHMGGDLWKYVGREPSGTPFNSLVQLEHEQGIPRNPFINAGALVVTDKVMNLYHRPKEAILQFVRSVAGNDDIYYDKTVAQSEFEHASRNQALGHFMKSFGNIDNDVDSLIDVYCNQCSIAMTCEDLAKSFLFLANHGINPHNNESILTVSRSKRLSALMLTCGFYDESGEFAFRVGMPGKSGVGGGVVAIIPGKLSIAVWSPELNEHGNSYMGIETLERFTTNIGIS
GUT_GENOME094995_0105723-301AMNEGTPDARLDGVDPEKFGLSITLVDGTTINIGDTDTRAAIGGIARIPALLALMQQTDNPVEAAEKSDVGAGCGCKRDRSIKPRIPVGAYSVRIVSAIEPNGDPDSKWNEIINGMISAMGSTPVLDDALYKKLTKINIDSNAEDVLADTGFYLYDDAAPSIDLYTRLTSICASASQLSAYGATIAAGGYSPTAKTQVFDSSIAPRIVASMAAKGPHRASYLWLVGTGLPARSSFSGMIVGVLPGVLSIAAYSPRVNDSGISVRGAAAIEHFMNSLHLS
GUT_GENOME068725_0178910-307VKKVVDETYEKYKDLDKGTVDKNYGDNDSADFAIVVTLTDGTEITRGDADKRFVMGALVKIPVMAELFSQFDGARHYMKMLKKANGGTCGCGCSDTNRPAEQPVCPRGVRSVSAVEPRDDRDGKMTVISDRIVAMTGSDLVFDDKIFEARMKAVETDDTVNKFAQAGYFLLDNAEDSIYDYTNLESLPATTRQLAMMGATLAADGVNPVTNRPAFDGKYAQDLLATMAVHGPHAMKRWLVKAQLPAKSGRGGAMLAVLPGVMAIAAYSPRLNDCGVSVRAHKAIARIARKLQLSIFAS
GUT_GENOME096389_0187763-355SDAEPVRHALAVVTAAGAEYAAGDREHLFTIQSISKPFVYAMALDEIGHDGVQQAVGMEPSGEAFNELSLEGGTGRPLNPMINAGAIATHQLISGAHGEPTVLPETASRAGRTDEVRRRTARIIAGLSAFAGRRLQVDWATADEEYEEGYRNFAIAHMLRTHHVFDAAPAHVVRGYIDQCAVLVTVQDIARMAATLANNGVHPESGTRVISARAARQTLSVMATCGMYDASGRWLAEIGIPAKSGVSGGIIGVLPGQLGLASFAPRLNDEGNSVQGVEMFKRLTDDFGLHLMA
GUT_GENOME097761_0109511-306ENNKGLISQGAVATYIPELAKVSKDYLGAVIAFPDGTMLSAGDTKVRFAIESISKTVVLALALLDNGEEEVFKHVHKEPSGDAFNSIKKLETEPDHLPRNPFINAGAIMTASLIKGKDPEDKFNRILEFMKKISEDDSLELATDIYLSEKATGDTNRGLAYYMKGQGVLSGNVEDILDVYFKQCSIYVTTESLAKIARFFANGGVLSTGERVIPKKYAQIVSGLIATCGMYDQSGEYLDNIGIPGKSGVGGGIISPVSSKKIGVAVFGPALDAEGNSVAGMGIMKDISKEMELDMF
GUT_GENOME022480_0051121-299YKNLEKGAVDKRLAAYSDAKNYGVTIVLTDGRIIEKGDTNVRAALGNIATVPVHAVLLQQNSVKELVKKAGKTENAKINDENVGVCPHTVRAISAVEPQNDADGKYDVIMNMLVSMLGNEPVLNDKLYQANAAANIAANVENKFADANYELYDNTATVLDTYAKLGALTVTTTELATLGATIAADGVNPVTKEIVFDGENSAPLTTIAALHGDADRIRREMLKSGVPEVYSYSGLVLAILPGLGAIAAYAPEVGRHGRSKKGVRAVRYITNAIDYNVFA
GUT_GENOME096544_0068422-310LQGGANASYIPYLASVDSSLFGVAVVTAEGEVYSAGDADFEFALESISKVFTMARAMADVGPEAFHQKIGADPTGLPFNSVMALELHNDKPLSPLVNAGAMSSASLVKAAGAEQRWQRILETQQAFAGRKIQLSPEVNHSEQTTNFHNRAIAWLLYSGGAMYSDPMEACDVYTRQCSTLVSTKDLATMGATLAAGGVNPVTQERVVAAELVPHMLAEMTMEGLYTSSGDWAYTVGLPGKSGVGGGVLSVAPGKLAIAAFSPPLDPVGNSVRGMAAVAQVAQRLDLSLYK
GUT_GENOME141323_0095722-301GHLADYIPELANANPNRLALAMSTVDGEIYSVGDDDVEFTIQSISKPFVYAYVLQQLGIDAVLAKVGVEPSGEAFNEISLGKDGRPKNPMINSGAITVHSLIQVKHGLHSAEILRRFMSELAGRELSFDESVYDSEVKTAYRNLSIGYMLRTVGILETDPVDIVNGYIRQCAIMVTVKDLVRMGSVLANGGVDPKTGKRLLNRSVVRQVLSVMMSCGMYDAAGDWLSTVGIPAKSGVAGGILGVLPGQVSIAAFSPRLDEHGHSVRGIDILERLSRDMGL
GUT_GENOME007545_0118422-302KSIKEGTVDARVAGESRENAFGISVVLTDGRVFTKGDTEQQVSLGPVAKVPLSVVLLSQNSADALAKKGACCGGTCKCGCAKFEGMPFGKHGLRAVSAVVPQGDPDGKFSVISDMIYALSSADQGFNDNLYKYLSEKAKAAKVAEQLTEAGYKLYDDAAQSIDVYSRLLAVRLSVTQTAAMGATIAADGRNPESGEYAFDGSIAASVVALMATRGKHFIKPWMVETGLPAKKSYTGLMLAVLPGFGAIAAYSPELDESGVPVKAAAAIKYIAQTLGLNVYA
GUT_GENOME114679_0237121-308HKNDMGGTPSPKVADMNQGMFGISVRLTDGTKFDVGHSQAQFAMGAISRLPIAIQLLTQMQPADLVKKMGLGHKGCGCGCGPKPDEPKQKIKGMHAKGVRAASLVEPVGDFDGKMDILSNLMIGLMGSSPVLDDKLYEATQQKAAEKDIVNSFAETGYELYDEAGLSVNLYNRLRSMLVTTEQLAEMGATIAADGVNPATQAVVFDGELSATVCAMMAAKGPKHMGKPWLIMTGVPAMSGFGGGFVAVIPGLGSIAAFSPELNEAGVPMKAARAIKDIVTSLQLSAFS
GUT_GENOME178919_0149214-306LSGVNGKVADYIPELAKVNPALLGAYVIDIRDGRGYASGDADAAFTIQSIAKVIAFACALLDNDLEQLAQTISVEPTADGFNSIASLETRNTHKPLNPMINAGAIATIPFVRGASYAEKFERILALMQTMAGNPGLSVNHRVYRSESLTGNRNRALAYYMYSTGIIKEDIEDLLDVYFRLCSVSVTCKDLAAIGGVLANHGLNPQSGIRLLSKDNCRIVKAVMATCGLYNESGLFAATVGIPAKSGVGGGILAVVPGQLGIGVFSPLLDDKGNSIAGMSFLADLSRELDLSIY
GUT_GENOME031610_0028118-304DKNFAETAHGKVANYIPILGIVDPQQLGIAIYNVDEDEIGTAGMAGTRFAIESIAKVVTLILAIKRLGHKRVLAELENGSADYSLSSVLLDDELTEQVHRINYLNNSSALLTTALIDQLMGQNSFNALLGFCREICNDPCISLNERLFRSAIMNDKDIHALAYYMKDKDILETVDQTLITYFMQSSMMVTSQSLANLGAVLANDGIKPWDNERLISHEDNELVKKLLTTVGSFEESKEYTIKIELPIKSGTGGGLLACAPQKCGIGIFSPALDQHGNSLAGMSLLQD
GUT_GENOME096502_001368-306AYEEGLKYTDQGQVATYIPQLAKEDGSKVAIAAYDVDGSYFEKGEVDKRFSIQSVSKVITYILCLEEIPAGKIQKAIGVKPTALPFNSVLDLELGDGKPRNPLVNAGAIAATALLYEKFGEDTFDVVLNKIRDLADDEDIVLDKSIYESERSSAFTNFALFNLMVSRGNLSADIPIQKVADAYFKACSVLVNVKDLARIGYVIANDGFDKISNEEKFSKDIARRVRTVMAMAGMYDYCGEFAQLIGLPAKSGVGGGIMTASKTGLAIATYCPGLDSYGNSLVGIKMLEYMAKEMDLSIY
GUT_GENOME243488_0329733-327EMIERKDRGDVATYIPELAGVDVNSFGLVVVDSSGHIAAGGDADTSFSIQSISKVFTFTLALGMIGDKLWQRVGREPSGSPFNSIVQLERERGIPRNPFINAGAIAVTDAILSGHQPRETLGEILRFMQFLSGDSSIKIDESVAASEQRTGFRNIALGNYMKSFGVIENPVELTLGVYFHQCAIAMSCRQLAMAGRFLAHSGRNPSTGFLVVPPERARRINAVMLTCGHYDGSGEFAYRVGLPGKSGVGGGILAIAPGKASIAVWSPGLDANGNSRLGRDALETLSKRMGWSIFG
GUT_GENOME020599_0128914-303RKFLLQGKVADYIPELGKANPVHFGLCIKTEDGRHIEFGDSEYRFTIQSICKIVSLAAALQYIGEEKVFRNVMMEPSGDAFNSILKLDTASNRPFNPMINAGAIEVVSLLSGRFSFEQMLEFIRTLCNDNEIVLDESVYRSEKATGDRNRAIGYLLKSKGVLKGDVLQTLDLYFKLCSLSVNARSLSELGLLLANGGISTTTGKRLLDKKIVRTLKTLMLTCGMYDRSGEFAVRVGIPSKSGVGGGILSCVGGRMGIGIYGPALDEKGNSIGGGYTLEYLSEHLNLHILD
GUT_GENOME186824_0251521-309RVMSADGKVADYIPELSLMSAELFSLSCQGIDSSLMETGDREQFFTMQSISKILALSFAIENFGRENVFRHVGMEASADSFNSLMRIEMTSSKPSNPFMNAGAIAVCSLIHKAYKEESVERLLSFMESVTGRENGFDERVFSSEKRSADRNRALAFFMKSMGFLHGDVESILDFYFALCSLRCTSGDLAKIGAMIAAGGVAVHSGERVIQKETVFTLMGLMSTCGLYNGSGEFAVRVGLPGKSGVSGGILVAVPGRMGIGVFSPALDAKGNSVAGIKALELLSEKLDLR
GUT_GENOME042600_0155914-334LDNAYAYAKTVQGGKNASYIPALAQVPSDLLAIAVVTVNGDLLTAGSANTPFAIESISKAFNLAYVMDLIGMKQLRAKIGADPTGEPFNSVMAVELHGGKPLNPLVNAGAMATVSLVNGSDSDEIWGNMIHNFNNFANAALTVNQEIYKSESATNQHNRGIAWLLARVDGLARRLAGKNGRKGAWLLDSYGYFYNTPPMIVDLYTRMCSLNITASQLALMGACYANGGVNPVSKKRVVKEENVPPILAEMCMSGLYDSTGDWMYKAGLPAKSGVGGGLVAVAPGKLAMAAFSPPLDPAGNTVKGQAALQSIIRELNLNLFR
GUT_GENOME103868_0047810-304QQLLEQVKPYTKKGKLATYIPELGNANPDDLGIAIFHKESEYIHAGNSQTLFTLQSISKVITLALALLDRGEEYVFSKVGMEPTGDPFNSIIKLETTSPSKPLNPMINAGALAITSMLAGKDNEEKTERILHFVREITDNPTINYSSKVANSELETAYLNRSLCYYMKQNGIIDCDIEELMDLYTRQCAVEVNCIDLARIGLIFAMDGYDPYKKKQIIPKHITKICKTFMVTCGMYNESGEFAIRVGIPAKSGVAGGIFGCVKGEMGIGIFGPALDANGNSIAGFKILELLSAQE
GUT_GENOME096273_0222056-334HAKVLYGADPTRMGIAMTMADGHTYRVGDAGVEFSIQSISKVFVYALALMDAGFDAVDSKIDVEPSGSAYNDISSESGSGRPKNPLINIGAIAAVSLVRPDEGQTVFDRILETMSACAGRDLSLDQDVYEQDLAEGAHNRGLAWFLSSWGIIDGDPGDAFDDYTRQCAVSVTAADLSVMASTLANLGVNPVTGRRVFTEDVVERVLSVMTTCGMYDDSGDWVATVGLPAKSGVGGGIISVLPGQLGIATFSPPLDEHGNSASGIIVHEEMSADLGLHFV
GUT_GENOME000319_0010114-310ECRPYTAQGKVADYIPELAHVKSDLLGVAICLPDGTYIHAGDVHHKFTIQSVSKALTLCYVLMEFGEDYVFSKVGMEPTGDAFNSIAKLAETVPSKPLNPMINAGALAVTSMILGETAELKIYQFRQFLATLLNRSVEEVTYDEKVARSEYETTDLNRALLYFMRHHGIVEGDVDEIIDVYTKQCAIEIDCFDLARIGRIFAGNGEDPDTGEELIPRRVVRIVKAIMTTCGMYDASGEFAVRVGLPGKSGVSGAILAVGNHELDLKNVGFGVFSPALDSKGNSIAGMKLLELLIERY
GUT_GENOME000530_0165424-286DGLGVAMLMVDGHRYVSGSMTPVPLATLASPLFHAQALEDLGAVRVGERVGAAPVRDMDHRVEVDAVTGLPHNPLQNAGAVATASLVKGRGGRDRTARMMQLLSALFDREVTVTESAARAENRAQHHTRSAAWLMKSLDTLDADPETLLEDIATVRAAAVTVEDLALLAGTLAHGGVHPVTGDRVLSEETVRGVLSVMDSCGMDTRDSRWAYDVGQPGWASTRGGTVLVVVPGHLGLALQSDTTDENGLGAAAMAALRTLVED
GUT_GENOME243516_0068713-311LEEAYRRGKEAIPGGNVATYIPELGKADPQDLGISICTREFEHFNIGRTDKRFTIQSISKIVSLAIALDMFGPDKVFRLVDMEPSGEAFNSLIDLDTKSNKPMNPMINSGAITVASMLVGHISVEEETEIIRKICIDDEITINNQIFESEMGHLSRNRAIAYLLESKGLISNVEETLEFYTKICSLEVTAKSLATMGMVFANDGKTLRNKDDRIFSKRTAEIVKTIMLTCGMYDRSGRFAVDVGIPTKSGVGGGLVSVADGLAGIGIYGPALDDKGNCIAGPPALKYLSQELQLHLFRK
GUT_GENOME075303_0213312-304EAVAQACEKYRNFTCGAPLAPSPRVADMNSGCFGMSVCLVDGTRFDAGDTQVPFAMGSMLRVPVYLQLLTQMGVDELVGKMRFEKCPCRNSSAGDDKPAAVRPKGVHAKSLRMISLVSPQGDADGKMQIISDMMIGLMGSSPLLDDDLYKANVARSAEKDVINVLANAGYELYDATDVALGLYDKLDSMLVTTGQIATMGATIAAGGYNPVTQTPVFDCELAAPLTAMIATCGHKHFRKTWMLSTGVPAVSSYGGGFLAVVPGFGSIAAFSPDLAEDKVPVRAAMAVRDVLQR
GUT_GENOME244293_0178541-332RDQDGGATADYIEILKNADPDKLGLALCTTAGHLYAVGDCDYEFSIQSISKPFVYALALDIYGPAEVHKYVGVEPSGEAFNELSLDDETGRPANPMINAGAITVDQLIGGPTILVADRVEKIREYFSRLAGRQLCVDKQIFESEMAGADRNMAIAHMLREVGIVTDEANDAVSSYISQCSVLVTVKDLAVMAATLANGGTQPVTGEKILTAEACRVTQAVMVSAGMYDASGRWMVDVGIPAKSGVAGGLIGTMPGQMGIASLSPRLDKQGNSVRGVKIFSELSNALGLNLMS
GUT_GENOME178200_008531-253MERKISLSDLRKAVDEAYEANKSLKEGEVDSRNACADVKAFGISVMLPDGTLINKGDTDVKSPLGSISKVALSTVLFSQNSPMELIKKSGQCPCKKVPEKPKGLSFSAHGIRAFSAIEPVGDPESKWNIYENRMIDLMGSAPELCDKIYQALQKEATDTKLIDTLAADGYYLYDDASLSANLYLAQKGMKSLTVPNWYDEGKEVTIPLDLRFSPSQNAQNFFKNYKKKQTAARMLVDLLAEGEKEIAYLETVL
GUT_GENOME207903_0037344-338ENNRHWTSKGTVASYIPELAKADPNILGICVTTLDGREYHSGDYDTKFTLQSISKVITLMLAILNNGTEYVFSKVGMEPTEFSFNSITNLESRPQKKPLNPMINAGAIATSSLISGKNSEEKFNNILNFTRKICENNLIDVNNEVYKSEKATGDRNRALAYYMKSNGIIEGDVDEILDVYFKACSIEVTCKDIARIGAMLANEGVLMNQERVISREACRIIKTIMVTCGMYDESGNFAVHIGIPAKSGVGGGIMAAVPRRMGIGVVGPALDAKGNSIGGIRVLENLSKELDLSIF
GUT_GENOME157398_0043542-336IRREIEPYALEGKQADYIPALANVNPNQFGICLNTIDGKTYKIGDANVKFSIQSISKVFALAIALSAMGPDLWKRIGKEPSGSAFNSLVQLEYEHGVPRNPFINAGALVTADVLLSYVEKPKEFFLRFVRTLCGNPDINYSDEVAQSEYSCCYLNASIAYLLKHHGNLHNEVDDVLRLYCTICSLEMSCEDLSRAFLAFARSNEPFSYAGVNLTVSRVKRINAIMQTCGFYDEAGEFSFLVGLPGKSGVGGGIVAIYPSRYSIAVWSPRLNAKGNSIMGLKALELFTSETVESIF
GUT_GENOME108642_0041719-302EKYKDFEVEGAVDSRLQGVDPSKFGIVVKLTDGRVIRVGDTDVLSPLGAIARVPLFVTHRVQKLDKDNKCHCNKHAIKQPQCCKPKGLPVSAEAVLLTSKIQPKGDADGKYGVIEDMTEAMIGSAPVFDDELYKTMAKANVHDDVENKFAEAGFELYDDAPVAIDVYTRLTALRATVDQLATMGATLAADGRNPETGEYAFDGSISPKVVAYMAVKGPHHLSKPWLIKSGLPAKSSFGGAIMGVFPGVMSIAAYSPLVNPDGVSVKAYKAIHHIMKHLELSVFD
GUT_GENOME000886_0184118-307KGVIRYGSVASYIPELAKADKNKLGICLYTIDGNQFETGNTEDRFTIQSISKVMALCLALETFGAEFVFDHVGVEPSGEAFNSLVELDNRSNRPFNPMINSGAITVASLLVNQYSIEDMQKYMQEVCEDPEIAIDEAVFQSEMATCARNKAIAYLLKSKDIIDTDVEESVTFYTKMCSMSVNARNLARFGLLLANDGVQLSTGKRLISSQTVRMVQTIMLTCGMYDGSGEFALRTGISTKSGVGGGLLSVSKKKMGIGIYGPSLDKKGNCIAGCELLGYISEALHLHIFD
GUT_GENOME065654_0074623-312RTSETCIGEGKVASYIPELANVNPDNFALSITSVTGESFNYGNYDYDFSIQSICKVLLLIMALHDNDPEAVYNKVGSEPTKYEFNSLVPITDKASNPFINAGAITTASMILGNSVDDKFDRILDYYKKLSKHEKAYLMEEVYSSEMETTDRNKAIAYYLKSKEIFDDDPEMVLDLYIRACSISTNVVGLSRMGAVLANKGFEIDSHYNLLSKSEVQIVLSQMATCGMYEKSGRYLMNVGIPSKSGVSGGILGVVPGVCGIGVYSPRLDETGNSVRGKEIFNMLGQELDLS
GUT_GENOME096506_0044417-309RPFASSGSVIDHIPGKDQSHLTKLGITVISNDGEVYSAGDDDYVFSLQSISKVINLLIALEDFGEDTVFQKVDKEPTDDFFNSISNLEDYEHQKPYNPMLNSGAIAVASLIKGTTVDERFHRVLEFVRTITGNEKVDMDEGVYKAEVKNGARNRSLAYFMESLGILPHQELEAALDLYFRVNSIMVTSHELAEIGCFLARQGKKEGKQMIDPRHVGTTLAIMMTSGMYNESGSYAVDVGFPMKSGVSGGITGVVTGRMGIGLIGPAINKKGNSIAGGKALRKLSQDLNLTLFH
GUT_GENOME000530_0151422-305GETAQYIPVLAEADPDRFGIALATPTGRLHCAGDADVEFTIQSASKPFTYAAALVDRGFAAVDRQVGLNPSGEAFNELSLEAESHRPDNAMINAGALAVHQLLVGPEASRKERLDRAVEIMSLLAGRRLSVDWETYESEMAVSDRNLSLAHMLRSYGVLQDSAEEIVAGYVAQCAVLVTAKDLAVMGACLATGGIHPMTGERMLPYIVARRVVSVMTSSGMYDAAGQWLADVGIPAKSGVAGGVLGALPGRVGIGVFSPRLDEVGNSARGVLACRRLSEDFRLH
GUT_GENOME207623_0243338-322HDGTVANYIPELGKADPVHFGISLAALDGHVYEVGDSRVPFTIQSMSKPFVFALALDTLGSDAVEQVIGVEPSGDPFNSIRLNANNHPFNAMVNAGAIACSGLILKARGADAFESIRDALGRFAGRRLDVDDAVFASEAATGDRNRAIAYLLRTSHVIKEPVEDVLAVYFRQCAVLVTARDCAVMAATLANRGVNPVTGEQVVTPYAVSRTLSVMTSSGMYDFAGEWIYRVGIPAKSGVGGGILASLPARLGLGSYSPRLDSHGNSVRGIKVCEALSAHYGLHML
GUT_GENOME066843_0053313-258LEKGRNVLYEGKVASYIPELAKADSSNLGVCLMKKDGTIYKAGDYNIPFTMQSISKTFSLILALQTAGYDKVFSKIGMEPTGDRFDSILQLELKDWRPFNPMINAGAIVTASCVETEDPFGSFLELVRKVCNNPRISLNEEVYQSEKRTGTRNRSIAFLLKSDNVLDGEPEDILDIYFRMCSVMVTAKDLARYGMVLSNHGIDPDTGEQLIDPTIVRIVTTLMMLCGMYDESGEYAVKVGLPSKSG
GUT_GENOME096502_0141112-307AYGEKYLDQGNIVDYIPGLKDVDPKAYGLALIDEDGNFYEAGAIDTRYTIMSITKVFLYILALETYGLEELRKYVGLKPSSKAFNSLLDLQLEDNIPVNPYVNAGALTVSYLLYKKYGAKAIDKVLEKIRTLAENEEIDIDEEVVITSEHAGYANKAMIFSLQNKGPISKDVDVFDVLSVYNRACCIRVSTRDLAKLSFVLSNDGKNRNGVQLIDPDHARISRTLMATCGTYDYSGDFAVDVGLGAKSGVGGGIMTSTKAGLGLATYGPKLDSRGNSVVGIEMLKYISEKLNLSIY
GUT_GENOME212983_0198315-300DKAYEDFKSDRSGAPNPDVQTPDADALGIVVALTDGRVIKKGDTAVPSVMGALNKLALATVLLTQNSPEELVKKHYAGCCCKSKCPKPQIGISAHGLRMVSAVVPQGDRDGKMAIISQTLEGLVGSAPVLDDKFYESLRAAQAAEKAVDKVAEAGYTLYDDTEASLDVLDRLTALQLTAEQAAAMGATIAADGRNPLTGDIVFDGSIAASVVTVTALGGPHHEKRAFALEVGLPVKRGRAGLLVALLPGFGAIATYAPRLDDNGVSVKGRKALTQIARSLGLNIYA
GUT_GENOME096099_0304671-365LDQVRPLLGRGRVADYIPALAEVPADRLGIAVCTVDGQLFSAGDAAERFSIQSISKVLSLTLAMHRYSEAEIWQRVGKEPSGQPFNSLVQLELEQGKPRNPFINAGALVVCDMLQTRLSAPRQRMLEVVRGLSGSDDICYDARVARSEFEHSDRNAAIAYLMKSFGNFDNDVITVLHNYFHYCALRMNCMELARTFVYLANRGRALGLDTPMIGERQARQINALMVTSGLYDGAGEFAYRVGMPGKSGVGGGIIAVVPREMSIAVWSPQLDGCGNSLAGVAALECLAQRLGRSIF
GUT_GENOME103818_0261415-309KKYIKDGHVATYIPGLATVDPNQLGVAIYDLDTNKEFFAGDYDKRFAVESISKVPTLILATLDNGLDEVFEKIGTEPTGFPFNSIMNMQINRSKKPTNPFVNAGAIATTSLVKGKDTEERSARILDFFKKIMGDDGVKLNTEIYLSEKRTGDINRSLAYYMKGNDMLDGDVPDILDTYFRQCSMEVTAIGLARMGAVLANEGICPWNNERLFPVKTATVVKSLMVTCGLYDESGEFSVHIGIPSKSGVGGGILSSVPNKCGVGLFSPNLDKQGNSVASMKLLKDISDELKLDIFR
GUT_GENOME096506_0299712-310DWLENVRPLARDGKVADYIPALAKQDPDELAVAVYDLDGECISAGNLVCCFTLQSISKVLSLALALMDNGEEYVFERVGMEPTGDPFHSIYRLEQHAPSKPLNPMINAGALAVTNMIHGATPDEKVGRLLSFIHEMTDDQSIRYNEDVAVSEFQTAYLNRSLCYYMKQHGVIEGSVEENLDAYTKQCAIEVNINHLARIGAIFANDGKDLESGRQIIPRHLARICKTFMVTCGMYDASGSFAIRVGIPAKSGVSGAVVGSVNGFGGIAVYGAALDEKGNSVVGLKLLELLANRYDLSIF
GUT_GENOME139178_0124010-303LKKAVDEAYELNKSISGGASDPRVPAVKANTFGISVMLTDGRTVDKADADVPAPLGNIARVPVSVALFSQTSPDKLLANVRCGCGCGCKDVKGLKHEVPFSLKTLRAISAVEPHNDPEGKYGIIYNQLASMTTGEALLSDALLETLKAQADSSKALDKIAEAGFALYDDPAISLNAYLRLEALQLTTKQLASLGATIAADGRNPLTGEYAFDGTLATPVVTLMATGKHERPWLMTVGLPVAKSFSGAILAVLPGFGAIAVYSPEVDERGLSVKGAKAIAHIAKKLGLNVFASAR
GUT_GENOME274350_003568-298ENVKKAVDAAYQANKANADGAVDPRLDAAPADFAISVVLTDGRKYEVGDSTKGAPLGDIAKVPVAITLLSQLEGKECGCGCGCKCGDAKKPKPEIPVSRHGVRMVSRIAPSSDADGKWDVIMDTVLNMTAEAPVLDDALYKSLTAQNEKEDTVNRLAQAGYELYDDATIAVDLYTRLTALKFTTGNLAALGATLAADGRNPFTGQFAFDGKYAPTAVAAMAVKGKGKIHRRWLVRTGVPAKAGFGGLMVAVLPGFGAIAAYSPLLDCNAVPAKAAAAIKDIAFTLGLNVFA
GUT_GENOME035747_0114614-299LDEAYESVKSLKEGEIDPRNHEAKAGQFGITVTLADGTTISKGDSEIKSPMADIVKVPLSSVLLSQNSPAELVKKSGQCPMHKVEKKPHHLGARGVRAFSAVEPVGDPDSKWNLFVNRTIDMMGGDAPVLNDRLYEAEKRAAEENQVVDKLAEAGFYLYDDAAMSVDLYNRARALTASTRQLAMMGATVAADGVNPATGKIVFDGAISQNIVGLMAAHGPHKMNGPWLVLSGLPAKRSWGGALLGVYPGVMAVAAYAPELNAAGVSVKAARAIIEFMQRLDLSVFA
GUT_GENOME177666_0096019-304DCSHGHLASYIPELAKINPNLKALSIITADGEEINIGNYNYKFTIQSISKIFLLALALMDNNEEEVFSKISVEPTGDPFNSTLRLETNQKVFNPFINAGALASASMIKGKSPKEKIERLKNFISRFTESDDIDISNEIYESEYKTANKNRAISYLLKSINVLETDVEDNLSFYTMACSIILSTKDLAKAGLVLASGGINSQGERIIPVKICKIMVALMTTCGMYDHSGSFAASVGIPAKSGVSGGILGVVPKKMGICSFSPVITEQGNSIIGVEIYKDLSDIYELN
GUT_GENOME147550_0112426-316REVEGGAVNTSIPELAGADPGTVGIAVATVDGALYQAGDTKHEFCIQSISKAFTYAQALTDRGADGVFEKIDVEPSGDAFNEISLQPETGRPSNPMINAGAIAATSLVRNTSHGTRMERILRLYSACAGRRLRINKNVQAQERRAGDRNRALGWLLTSRGIIDGDPTGALDDYFGQCAVMLNCVDLARMGATLAAGGRNPVTGERVLEPAVVSDVLSVMSTCGMYDDAGRWALRVGLPAKSGVSGGVIAVLPGQLAVAVFSPPLDRHGNSVRGVAACERLTQDLDLHFART
GUT_GENOME117163_0160815-282RCFIKLGSPASYIPELARVNKYQLGACIVNLDGTVMECGDTRTRFTIQSISKLASLILAISDKGSDYLFHDKVGVEPTGDPFNSIVKLETKTKPFNPFINAGAITVADCIEGNSSEEKFERFLSFVRKLCGDEEISLNEAVYLSEKATGDRNRALAYYLKASGILEGNVEECLDFYFKMCSVNVTAVDIAQMSAVLANHGACPVTQEELIPKESAKEVRALMLTCGMYDGSGEFAMTVGFPAKSGVGGGILAVADKKMGIGIFGPTVW
GUT_GENOME096459_0122438-331DEVAGVEEGHLSPVYPTLTRADPRCFGLALARADGPLHEVGDSRVPFPIMSVAKPFVFALACQEHGTAPVRDLVRVDATGLPFNSATAVERSPGGRTNPMVNAGAIATTSLTPGADLESRWEHLRAGLSRFAGRELLPDAETLETSRRTNTQNRALALLLAGAGAIRGDAMEATELYTRQSALEVTAVDLAHMGACLAAGGVNPWTGERVVGEEAARVALAVMTVAGMYESSGSWLVDVGLPGKSGIGGGIVTVSPGKGALGSYSPLLDAHGNSVRGTLAAQWLSGELGLDILA
GUT_GENOME244348_0120130-313GAMADYIPELANQDPTLFGLSLCTPEGIVYSAGDSDTQFSIQSVSKPFVYAMALADRGLERVNESVGEEPSGDPFNSISLDEETGRPDNPMINIGAVTTHALVHSKGASREEREKRILAGMSAFAGRELSYDDAVYKSEVDKAWRNLALAALVRANDLIISDPAEVVRGYTRQCSVAVTAKDLAVMGMTLASGGVNPQTGERVVPEWVAQQVLSVMTTCGMYDAAGDWLTNVGIPAKSGVSGCIMGSLPGQMGAAAFSPPIDKFGNSVRGVDAFERMSEDLGLH
GUT_GENOME104183_0136920-315CRPIAQKGVTASYIPALKSADSSMLGICIAGNNGELLLAGDAEKIFSMQSIVKILILTCSLLDSGIERVAEKVSIEPSSDRFNDIINLETKNNHKPLNPMINAGAIACLTLVRGDTFQEKTERIVSFAKQLSGNSTLNIDEEVYISEKLTGSRNRALCYYMQSTGVIDGEIDVEALLDAYFYVCSIAVTCVDLARIAQVFANNGQDMRTGDTLFSRNISRVVRAAMVMCGMYTESGSVAVRIGLPTKSGVGGGIISLVPESAGIGIYGPALNQAGSSIAGLALLERISNVMDASIF
GUT_GENOME096202_0424320-314ESRKLTSRGNVASYIPELAKSPASALGIHLLNGQGEHLSAGDCGLPFTMQSISKVFTLILALMDHGEETVFSKVGKEPTGDDFNSMLKLELVEPGIPFNPLINAGAIAVSSLIKGDGPEAKSARVLSFFRELAANDKLDYDEAVYRSENETGHMNRSLAYFLKENEVLDGSVEEVLQVYFRHCSIRVTCGDLSRMALVLAFDGQDPLTGKTLIPRRFVQIAKTFMTTCGMYNASGEFAISVGLPAKSGVSGGILTLVPGRYGIGVIGPALNRKGNSVAGVDLLEKLSGRFDWSLF
GUT_GENOME176533_0021417-303RPGDRGAPASYIPELANVDPERFGVSLATIDGKVYGSGDTETEFTIQSIAKPFVYALALEDRGFRDVLKRVSVEPSGEAFNEVSLDEEGRPLNPMINAGALTTHSLVGGDDWTQGQRLQRIIDGLSAFAGRRLQVDERVCGSELDHAHRNLSIAHMLRSYDIFPQDPRVIVEGYTRSCSLLVSTRDLAIMAATLANKGINPITGEKVVSGRVVRQVLSVMTTCGMYDAAGDWVTQVGIPAKSGVAGGLIGALPGQIGVATFSPRLDEHGNSVRGVRLFERMSRDMGI
GUT_GENOME199944_0037613-313ALQQAYDCTKETNAGKNADYIPFLAKVPSELFGISACLPDGEVIAVGDTDYAFGIESVSKVPTALLAMEQYGPQKVMNRIGADATGLPFNSIMAILLEKDHPSTPLVNAGAISACSMVRPTGDAEGKWKAIVGFIEGLAGSEVQLLDELYRSESATNFNNRSITWLLKNYNRIYDDPELSLDLYTRQCSLGITARQLAVMGATIAFGGRNPVTRKQLFRAELTPKIVALMATVGFYEHTGDWLFTTGLPAKTGVGGGIVGVIPGVMGIAAFAPPLDASGNSVKAQLALKFIMDRLNLNIFN
GUT_GENOME096500_0285114-305YLVQYGKVADYIPALKNANPNEIGICIMDVEGNLYSGGDYNKKFTIQSISKVLSLMLAVMDKGEEGVFKKVDMEPTDESFNSLYKLDLPYGEKPSNPMINSGAIVTTSLIDGRGEEKFNRFLEIVRKITQDDNIIYNKEVYISEKETGDKNRAIAHILKNKKLIEEDIEDILDAYFKQCSIEVDCIDLAKIGIFFANGCKIPNTGEALCDEEIGTLVTAIMTTCGMYDFSGEYAAKVGIPSKSGVSGGILGIVPGRFGIGVYGPALDKHGNSIVGYGILKGLSKELGLSIFK