UHGP-MC 117197
Information
- Number of sequences (UHGP-50): 169
- Average sequence length: 70±8 aa
- Average transmembrane regions: 0.01
- Low complexity (%): 0.34
- Coiled coils (%): 0
- Disordered domains (%): 0.84
- Pfam dominant architecture: PF00908
- Pfam % dominant architecture: 7929
- Pfam overlap: 0.37
- Pfam overlap type: reduced
Downloads
- Seeds: MC117197.fasta
- Seeds (0.60 cdhit): MC117197_cdhit.fasta
- MSA: MC117197_msa.fasta
- HMM model: MC117197.hmm
Sequence list (filtered at 60% identity)
Protein | Range | AA |
---|---|---|
GUT_GENOME096527_00223 | 109-180 | QQLWIPAGFAHGFLTLAQDTTVLYHCSDYYVPNDQITLLWNDPDINIQWPNSHTSPVLSPKDQAGMTLKEYI |
GUT_GENOME142662_00893 | 111-182 | LFVPRGFAHGFIALEDFTKVSYMVDNFYNPKSERGILYNDPQLKIDWGVNDSEVLISDKDLYLPTLEQIQDF |
GUT_GENOME137243_01316 | 106-174 | LSASNKKQLYIPEGFGHAYLALEDSKVLFKVTAHFVPGDEIGFSWNSKAFNVNWPLSANEIIFSDKDRE |
GUT_GENOME264149_00067 | 113-185 | IYMEKGFAHGFLVVSDNALMSYTVSGVHSNSGESGIFYKDEDLKIDWPIKDMKIIQSDRDAQLDTFKKYCEKV |
GUT_GENOME096381_04457 | 100-177 | FFELSGETQATLYIPAGCAHGFQALTETADTSYRIDRPHDPAEDVTIAFDDPELAVPWPLPVTSMSQRDREAPRLAEV |
GUT_GENOME014067_00697 | 114-174 | LYIPEGFAHGFQSLSDEVEIFYLATQAFCPEADSAINALDPTINIKWQFEITNISTKDKNA |
GUT_GENOME272387_01981 | 106-181 | HRQLYLPAGVAHGFLALEDAMVYMKVTTHYTAGDEIGFLWNDCEVGIEWPRMGDMKPILADKDMKWRNFKDMISQL |
GUT_GENOME006937_00599 | 111-186 | VYIPEGFAHGFIALEDNTIFSYQSTGRYMPEYCGGIRWNDQELNIPWPLKEYGIEKVISTEKDANWPGISEYRKRS |
GUT_GENOME046287_01536 | 104-177 | SKENKILYIPEGFAHGFLSLEDSKVIYKSSNYYFAEDQYGISIFSEKLKINEILKRYKIEEILITEKDKKLEII |
GUT_GENOME188398_00063 | 120-180 | HGYLTLTDAMVSYQCNTNYQPSYDHGILWNDSALGIEWPVARENLIISEKDMKNMSFEQFQ |
GUT_GENOME248616_00580 | 117-186 | QLYIPEGFAHGFLTISETASVFYRCTDFYHPEDEGAILFNDANLGIRWPDVGEITVSDKDRRAMSFFEWR |
GUT_GENOME013402_00212 | 101-187 | VELTPENGRQLYVPRGFAHGFVSLKDNTLFQYLVDSDYKPSHENGFLWNDPELSIDWKLEQYGIENPILSEKDKARKTLNEINPNFY |
GUT_GENOME056484_02467 | 127-212 | VELSEDNHRQFFIPRGFAHGFSVLTEIAVFQYKCDNFYAPQADGGIQLIDGSLGIDWRIPTNKVILSEKDLKHPLLAEYDSPFDYN |
GUT_GENOME236436_01286 | 120-199 | LIGEKHNEILVPAGCAHGYLVLEESIVSYKCAEKFYGEFDDGIFWKDPEIAVDWPLDLVGGEDRVIISDKDRDLQTFAQF |
GUT_GENOME274355_00011 | 102-180 | ILSADNRKIMYIPRGFAHGYLTLDENVLMQYCVDNDFCADATRSVRFDDSDIGVKWIFEPDISTLSAKDAGAARLEDVY |
GUT_GENOME014048_00813 | 113-184 | LYIPSGFAHGFLSLEEGSIVHYLQTKLYAPQSDSGILYDSFGFDWQKVASQAGIQELIISKRDREFERFSKD |
GUT_GENOME258050_00551 | 112-180 | IYVPCGYAVGTYAVEDADFICMCGENPFVPEYASGIRWNDPTINIKWPIEKDREPIISKKDQKLPAFKN |
GUT_GENOME284814_01892 | 107-186 | NCLYVPKGFLHGYLVIDDSVVTYKCDEEFYKDGDSVIRYDDKTLNIDWHIDKFIGNSTKIILSEKDNNAPSFEYYIGKFE |
GUT_GENOME147684_01758 | 109-184 | KQLFIPRGFLHGFSVLSKDALVMIKIDRYFALGESIGVKYDDEDLNIDWKIKKESVIVSEADKNLLAFKDIKSPFI |
GUT_GENOME153732_01659 | 113-187 | QLLIPEGFLHGFYVMSEEAVFSYRCTRFYCPSDEYGVMWNDPEIKIKWPTSNTASLILAPRDCNNHSFAEFKMKL |
GUT_GENOME093429_00359 | 109-182 | KELFIPEGFALGTYALQDSIFECKCGEKFYAEYDDGIRWDDGDLNINWPIKINFGELIISDRDKGLPSFQHYIK |
GUT_GENOME019806_00757 | 105-176 | MEEMYIPRGYAHGFITLEDNTELEYLTDNKYCCETAKAIKWDDLGIDWTVGGKVEVRTDLLSDKNKYAQSLI |
GUT_GENOME090447_01440 | 110-184 | RQLYIPREFAHGFLVLSEQAKFLYKVDNGYAPESELCIRYDDPQLAIPWESWGIAPEELILSDKDLRGISLADYT |
GUT_GENOME243214_01363 | 110-198 | KQLYIPEGFAHGYLALENETIVHYKCTEYYYPAKQCGIIWNDEDIKIKWKGLIYNSDHPNESKLKDESKIILSHKDINNMSFRNLFDKE |
GUT_GENOME009088_00998 | 111-178 | TSLWIPPGFAHGFLALKDLTVVQYKVTKLWVKESEGSIRWNDPDLRINWPDLGSAVTLSQKDLDAPLL |
GUT_GENOME063685_00622 | 106-190 | ILSESNGRQLYVPRGFAHGWLSLTNDVHFTYKVDNYYAPEAETPIAFNDPEIGIDWQFDTSKAIMSEKDKKNPLWRDAGITAKYG |
GUT_GENOME185643_01397 | 104-186 | VRLSAENKKQFFVPKNFAHGFIVLSEEAEFCYKVTDFYHPNDEGGILWNDAEVGVEWPMPEGMTAADLILSDKDKVQKSFAEY |
GUT_GENOME096547_00836 | 151-237 | VVELSDVNRRGLFLPVGVGHGFVVPAGAGPATVCYLVSEGYNPGAEHGVCPTDPGLGIDWSVAGLDPESLVLSDKDRAAPGLHEVEA |
GUT_GENOME139671_02174 | 71-144 | NKQQVLLPRGVGYGFVTLEENTALLEKVDQYDAPMEVTKVRWDDPDMGIEWGVEDPIIADDFKEGQFFKDIEDS |
GUT_GENOME251899_00060 | 114-187 | QAIFIPRGFGHAFLTLEDDTFVYYKADAPYAPGQEGSIRWNDPTLQIEWGSERFAQTPILSEKDAQAPLLGDWL |
GUT_GENOME007920_00129 | 114-186 | LYIPRGFGFGFLALENDSLFEYYVDNEYAPRLDDGIPYNDKDINIPWDEIKKEYNIDEFILSNKDQNLTSLEL |
GUT_GENOME257273_00056 | 114-166 | YGNGYYVLSDTAVYSYKQSTYYGEYKQFTYKWNDSRFKIKWPAEASIISERDA |
GUT_GENOME162907_01161 | 122-194 | DKNQLWIPPGFAHGFYVLSSKALVTYKCTDFYHPNDEGCIKWNDPDIGIDWPINSNPTLSPKDACAPSFRAFS |
GUT_GENOME112873_00433 | 109-182 | QLFIPRGFAHGFAVLSESAHFLYKCDNYYCPEAEDGIAYNDPELGIDWQIDLAEAIVSEKDSHRPTLSQWLNKK |
GUT_GENOME175022_03312 | 1-56 | MVLSDVAEFSYKLSDFYHPEDESGVLWNDERLGIQWPITEDMQIITSERDSSNLPF |
GUT_GENOME072282_01078 | 112-191 | FFIPKGFAHGFLTLEDDTEFQYKCTDFYAPQYDSGIMWNDKDININWNFEKYGLKEEDIVLSEKDKKHQSFKEYTEKYIG |
GUT_GENOME100806_00145 | 110-183 | KEIFVPRDCALGTFAVADSVFSCYSDGKYVAKDCSGIIWNDTDLNIDWPLKREPVLSKKDQDLCKFVEYKKYEG |
GUT_GENOME236039_00222 | 103-174 | LNSKNHKQLYIPKGFAHGFITLCDNCELTYLMSQKYMPGYSCGFRYDDPKFNIDWPITVNCISEKDKNWSLL |
GUT_GENOME025996_01442 | 111-170 | LLIPQGFGHGFQCLCDNCIVTYQVDQYYNKESDRSIKWCDPQIGIQWPIDNPILSKKDRK |
GUT_GENOME000700_03161 | 117-179 | LYIPKGCAHGYVTLEDDCQLLYLMSEFYEPGYSYGYRYDDPAFDIKWPEYGKLTISDKDQKLP |
GUT_GENOME012135_02514 | 101-173 | VELSEENGKMLYIPPYVAHGFQTLEDHSMICYFVGAYFVPDAYGYLRWNDPFFNIKWPECQNRIMSEKDKNIP |
GUT_GENOME227742_01826 | 102-179 | ILKDVDNAALYVPKGFAHGFKCLEDNTLMLYQVTSIYDKKHDAGIAYDSIGYDWKIDKPIVSNRDLGFIKFNEFESPF |
GUT_GENOME194820_00169 | 94-166 | LSESNGKCAYIPRGYAHGYLTMEDNTLVQWCVDNDFNSNAAKCISYESVDIEWPVSRQCFVISEKDKNVERLS |
GUT_GENOME030216_01999 | 115-180 | EGFGHGFVTLSDFADIAIGVTKAFDKESERCIRWDDPALEIDWHGIIAPRVSLKDLEGKLLYEAEV |
GUT_GENOME142155_03036 | 109-176 | NIAYIPNGFAHGYQALEENTIVIYKLSSPYSPSNEGGIKWDSMDINWNKINPIVSNKDEELPTFAEYI |
GUT_GENOME142012_01033 | 110-182 | QLLVPRGFAHGFVVLEDDTVFAYKVDNYYSPECDRGIAYDDESLNIDWILKKEELNLSAKDTKQPKLNETNDL |
GUT_GENOME086287_00434 | 104-180 | VKEKGKALYIPIGCAHGYKVLEDNTITLYMATEVHSPKNDTGLKFDSFGYDWKIDNPIVSKKDENLLSFKEFKKIYI |
GUT_GENOME140365_06267 | 111-175 | LYVPEGFAHGLMALEDETHVTYQVSYPYTPGAESGLRFDDPSIGIVWPGPVTVISEKDASWPDFM |
GUT_GENOME244030_00472 | 112-186 | LWIPPDFAHGFLSLAEKNVIQYQCTDSYDMEYEEGILWSDQDLNIDWKLDEYGFSEEELIISEKDKKQKKFVDYE |
GUT_GENOME137556_00828 | 115-186 | EILIPSGCGLGTLAIEESLISCICGENYYRDYDDGILWNDKDIGIEWPLERLDGTVIISDKDNNFKSFKDFK |
GUT_GENOME000203_02741 | 17-82 | GFAHGFLVKSKEAIFTYKCSDFYNPEHESGIIWNDKNINIDWPIDNVDNLIISEKDKNLKTLYEVD |
GUT_GENOME147152_03600 | 110-179 | LFLPKGIAHGFISLADNSVMVYKTSTTHSPMNDAGIRWDSFGCDWGIESPVLSVRDKNFIQFSDFESPFF |
GUT_GENOME062906_02018 | 112-185 | FFIPKGFAHGFLVLSDNALVSYKCEGAFSKETDTGIVWNDPELKIDWPTNQVDEIIVSEKDRRLQTFSHFKEKG |
GUT_GENOME158388_00590 | 109-180 | ECLYLPDGVAHGYLALERSMVAYSCGAGYDPAREGGVRWDDPSLAVEWPLEGDARPILSERDRALPGLRELC |
GUT_GENOME079237_01013 | 110-181 | QLLVPKGFAHGFVTLTPNVNFMYKCDNYYNAAADGGIAFNDPALAIDWPIAPEKAITSEKDQKHPTLKEFEA |
GUT_GENOME141754_02666 | 33-105 | LYIPEGCAHGFLTISEDAIVHYKVAERYYPQEEQTLVFNDPDIAIEWPLHALPVDIPLILSEKDRQGKTFSQL |
GUT_GENOME096390_04269 | 111-181 | VYIGKGFAHGFCTLTDDCEIVYKTDQPYAPQHEGGVIWNDESLRITWPILKPVISERDLSLPSLDEFKRIQ |
GUT_GENOME025519_01961 | 53-141 | VELSEENHRQFFIPQGFAHGFLTLTDNVEFRYKVDNVYNKESEGGMRYDDPTCNVDWGSLLNGIEPVLSEKDQVGPTLEESNNQFVYEE |
GUT_GENOME214432_00527 | 111-180 | MYVPRGFAHGYLTLEPDTVLQYCVDNDFCGPAAKALRFDDPDIGIVWPAKPDETTLTEKDRNATLLQDLS |
GUT_GENOME208036_03530 | 116-179 | KGFGHAFFSLADDTEVMMRIDSSFINEYSRSIRWDDPEINLAVPDRNPILAIHDQQAKLLKDSD |
GUT_GENOME248142_00456 | 471-532 | RGFAHGFQVLSEQAVFAYKCGDVYHPEDEGGLRYDDADIGIVWPGQDTPILSEKDQNWPALR |
GUT_GENOME085776_02670 | 141-216 | LYIPQYFGHGYLVLEDSVVSYKCGEVFYGEGDAGIMYNDPDMAIEWPFDKIGGIEKLIISDKDKNLMSLKEYIEKV |
GUT_GENOME151948_00284 | 130-196 | QLYIPEGMGHGYLALTDARVLFKTTTHYIPNDELGFAWNSEQITIDWGIESPIQNDRDRNNRNFSEV |
GUT_GENOME116433_01313 | 108-175 | QLYVPPGFAHGFCVISDTVLFQYKCTDIYNPASERGIMWNDPDLNIPWPTEKPVISDKDTKHPFFRDL |
GUT_GENOME273588_01039 | 389-471 | YLSSNNNTQLFVPRGFAHGFISLEDDTIFQYLVDNDYAPELEDGIYWNDPDLDINWQEIFKEYNIKNISMTAKDTMYQKLSDK |
GUT_GENOME137663_00908 | 101-179 | VELTPESGLALYIPKIYAHGFLALEDNSRVLYKVDAPYSPSHERAFNYQHPRLISELQKYLPLDQLIISDKDKAAPTLL |
GUT_GENOME047838_00386 | 414-477 | GFAHGFLVLSDVAEFCYKCTDVYDPTAEGGIPYDDPTVNVQWPDCGCEHKTSAKDKEHTPFAEQ |
GUT_GENOME104704_00440 | 106-175 | KSNFSLYIPKGFAHGFLALEDSIFAYHCDGQYEPNNCGGLRWNDPKLDIKWPIEENGITKLIITDKDNNW |
GUT_GENOME000729_03622 | 103-184 | ISMNEVERKLLYIPEGIANAHLIKRDNTKVLYQLGSKYMPDYDEGVRWDSLGIDFSIRNPIITDKDASLPSFDEFESPFIYG |
GUT_GENOME033457_00922 | 116-175 | GYAHGYVTRTPDTLVAYKVDRYYAPESERSVRWDDPGLSVRWDVTDPILSAKDAAAPLLD |
GUT_GENOME162191_00571 | 106-187 | ILDSDKGNMAFLPKGLAHGFCVLSESALMLYKVTSEYAPMYDMGILWNSAGIPWPVKKPILSRRDQDFVTLSQFKSPFYYEK |
GUT_GENOME011031_02000 | 115-172 | KGFAHGFQALCDNCDVQYKVDAAYQPKAERCIRWNDSQLNINWPISAANLSKKDSQGM |
GUT_GENOME243474_00110 | 111-177 | MYTPRTYANGFVTMENCTKLEYFTDNKYSFQHAKSIRYDDPDIEIDWSMKGVITVRKMIMSEKNLNT |
GUT_GENOME139064_01151 | 650-703 | GFAHGFCVLSEEAEFHYKCSEFFDPKDDRGLPWDDPAIGVAWPVEHPILSKKDT |
GUT_GENOME013450_00476 | 129-201 | QFFIPRGFAHGFLVLSDEAIFTYKVDNPYAPQAEASIRWNDETIGIKWPIEGENVLLSEKDLNKALSFEEAEY |
GUT_GENOME096221_02333 | 112-181 | NSLLVPRGFAHGFQALEDDCELLYLTSTPYAQAAEDGLNPTDSRLGIAWPLDMTECSDRDRNHPPLTDAF |
GUT_GENOME048651_01234 | 105-181 | LSIENHRALYVPKGFAHGFVSLEDETIMLYQCEGAYDKETDTGVLFNDPEIGIEWPIEEKDAIHSERDLQLMSISEY |
GUT_GENOME109493_00488 | 106-167 | QLFAPDGFVHGFYVMSDQAIMHYKCSSYHDKNLEYAYRYNDPTFHIDWPNDTFILSERDQRH |
GUT_GENOME112582_00713 | 413-471 | HGFAVLSDVAHFHYKCTDYYHPEDEGCIRWDDPDLAVDWPVSGPQCSSKDRQGMSFREY |
GUT_GENOME031470_01455 | 107-179 | LSGESAEALFIPERYAHGYLTLEPETLVFYKVSELWHPEAERHCRWDDPTLGIAWGIQHPILSPKDAQAPWLV |
GUT_GENOME098314_01119 | 112-181 | IPAGFAHGFQVVSDENALVVYQCIGKYDKESDAGIRWNDKEIGIDWPLKEKCIVSERDSEQMTFKTFKET |
GUT_GENOME183430_01490 | 120-189 | RGFGHGFVTLTDHVEFMYKADNYYVPEADGGIRWNDPTLDINWGIENPILSEKDIKSPFFDELENKISFT |
GUT_GENOME018819_00295 | 111-177 | LYIPPEVAHGFQTLEDETYVYYQLGEFFKPEYYKGVRYNDSAFGIDFPIKENIIISEKDKNYQDWEV |
GUT_GENOME204217_00131 | 115-173 | LYVPAGYAHGYVTTEDDTIVLYKVDKPWVPGSEASCRWNDPTLNIDWRCENPILSEKDA |
GUT_GENOME133659_00900 | 110-175 | RKQFYIPEGFAHGFYVTSEIAVVCFKVSNYWHPNDEIGIPWDDSELNINWPIKENEMPIVADKDLH |
GUT_GENOME071145_02884 | 105-178 | RAARYLYIPRGLAHGFLALEEGSIVNYAQTSCYAKDQDCGIAWDSFGFDWGIQRPIVSGRDLTFEKLENFKSPF |
GUT_GENOME199135_01760 | 113-172 | LYIPPGVAHGFLTLEDNCEIFYQIREPYQAGYGRGVRWNDPAFGIDWPDTPVCIAERDSE |
GUT_GENOME096525_02161 | 98-160 | GFGHAFLSLEDHTNVVMKIDHYFHQDYRRGIAWNDPDLCIPFPIDKPLLAQHDLEAPFLKESD |
GUT_GENOME118404_00500 | 111-179 | RQFYVPPHFAHGFLTLADDTEFLYKCTDYYHPEFDGGVRWNDPQIGVDWKLEHYGFTESELLLSDKDRV |
GUT_GENOME130217_01826 | 132-201 | QGLYLPEGFAHGFLVRSDFALVSYLCSGAYDPKGDGGIRWDDVDVSVDWGLPAGIRPIVSQRDENLPTLA |
GUT_GENOME243068_01610 | 43-93 | HGFVVLSETADFFYKCDDFYSPKDELSIRWNDPAIGIKWGVDKPQLSAKDA |
GUT_GENOME142546_02678 | 111-179 | NSRMMVIPEGFAHGFQVLEPESELLYLHTAFYTPEAEDGLRHDDPSLGINWPLAVTDLSARDGSHPLID |
GUT_GENOME281551_00751 | 100-182 | LIDLNEDNKNLIYIPSGCAHGYYIKGNKHALVYYKVTKPFVKDQRGGIHWSSLDIPWDFNYNNIEVETEDSNWPKFEEFNSPF |
GUT_GENOME172378_01408 | 178-265 | MKQFYISEGFAHGFLVLSEKAEFCYKVTDFYHPGDEGGLAWNDPEIGIDWPMLTGTYKGSADSAGYLVDGVALNLSDKDKKWGTLKEL |
GUT_GENOME237865_01157 | 101-176 | MAELSPENAKALFIPGGFAHGFAALEDSLVLYKAAGAYCPQAERGFRYDDPKAAIPWDLPFEPILSQKDAARGALD |
GUT_GENOME140365_05098 | 103-181 | LSSGNRRQLYIPEGFAHGFQTLSDDTELGYLISTFYAPASAAGLRYDDPAFGIDWPLPVTAISENDRTWPDFEKALLPM |
GUT_GENOME000537_03024 | 112-183 | IYIPRGFAHGYLTVSDMAIVGYYADNRFASEYDGGILWDDPDIGIDWPLEKVDNKIVISPKDELLETFKSFY |
GUT_GENOME239690_01599 | 110-176 | SLYIPKGFAHGYKTMTDNTIIHYLMSENHTPSHESGVRWNDQAFGIKWPETEHLIISKKDESWEDFD |
GUT_GENOME018218_01102 | 163-233 | QLFVPRGFAHGFAVLSDEAVFSYKVDSPWTPNAERCIRLDDPAIAIEWPIPPGERILSEKDSRGISLQEWI |
GUT_GENOME062725_00006 | 109-176 | SLYIPKGFLHGFLTLSTTAIVSYKCDSFYCPSSEFTVQFDDPDLNLPWLRNYVIQSDKDKAGKSFKEL |
GUT_GENOME188391_03283 | 110-179 | KEIYVPGDCAFGSLALEDSVIACSCGSEFIAKYSDGIRWNDSVLDIDWPLWKLEGEPIISEKDERLSNFN |
GUT_GENOME237174_01202 | 103-184 | LTPDNHRSMFVPRGYAHGFVTLRDGTELQYLTDNTYSYGHAKSVRFDDSDLDIDWTMNGTVPLRPDILSDKNRNAPYLRDTV |
GUT_GENOME001039_02379 | 110-184 | QLLIPKGFAHGFMTLTDDVEVQYKVDELYAPECDRGIRWNDPTIAVDWPIHDIQPVLSAKDEKAPLLEEAENNFT |
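
The rows of the table above can be checked programmatically. The following is a minimal Python sketch, not part of the original resource. It assumes that the Range column uses 1-based, inclusive coordinates on the parent protein; the page does not state this, but the rows used below are consistent with it. The names `SeedRow`, `parse_row`, and `span_matches` are hypothetical helpers introduced only for illustration.

```python
from typing import NamedTuple


class SeedRow(NamedTuple):
    """One row of the sequence table (hypothetical container)."""
    protein: str
    start: int
    end: int
    sequence: str


def parse_row(line: str) -> SeedRow:
    """Split a 'Protein | Range | AA |' row into typed fields."""
    cells = [c.strip() for c in line.strip().strip("|").split("|")]
    protein, rng, seq = cells[0], cells[1], cells[2]
    start, end = (int(x) for x in rng.split("-"))
    return SeedRow(protein, start, end, seq)


def span_matches(row: SeedRow) -> bool:
    """True if the range length equals the sequence length.

    Assumption: coordinates are 1-based and inclusive.
    """
    return row.end - row.start + 1 == len(row.sequence)


# Two rows copied verbatim from the table above.
TABLE_LINES = [
    "GUT_GENOME096527_00223 | 109-180 | QQLWIPAGFAHGFLTLAQDTTVLYHCSDYYVPNDQITLLWNDPDINIQWPNSHTSPVLSPKDQAGMTLKEYI |",
    "GUT_GENOME142662_00893 | 111-182 | LFVPRGFAHGFIALEDFTKVSYMVDNFYNPKSERGILYNDPQLKIDWGVNDSEVLISDKDLYLPTLEQIQDF |",
]

rows = [parse_row(line) for line in TABLE_LINES]

for row in rows:
    print(row.protein, len(row.sequence), span_matches(row))
```

Rows for which `span_matches` returns `False` would be worth inspecting by hand, since a mismatch could come either from a different coordinate convention or from a damaged row.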