UHGP-MC 104817
Information
- Number of sequences (UHGP-50): 191
- Average sequence length: 65±9 aa
- Average transmembrane regions: 0
- Low complexity (%): 2.4
- Coiled coils (%): 0
- Disordered domains (%): 0.02
- Pfam dominant architecture: PF02677
- Pfam % dominant architecture: 8063
- Pfam overlap: 0.38
- Pfam overlap type: reduced
Downloads
- Seeds: MC104817.fasta
- Seeds (0.60 cdhit): MC104817_cdhit.fasta
- MSA: MC104817_msa.fasta
- HMM model: MC104817.hmm
Sequences list (filtered at 60% identity)
Protein | Range | AA |
---|---|---|
GUT_GENOME231144_00191 | 27-99 | TRVVLHSCCAPCSTAVLECLLQNKIKPVVFFFNPNIHPQDEYLKRRDEWLHLCELLKVETVIGDYEPRCFFEA |
GUT_GENOME203491_02382 | 221-291 | LLLHVCCGPCSGNVLKEISEFFDITIYYSNSNIYPDTEYFRRFHELENFIQRFNQDFNQNIQVVEKEYQPK |
GUT_GENOME112096_00656 | 25-78 | LLHVCCAPCMSAGIVSLASHFRVAAYFYNPNIFPEEEYDKRLGEVKRLIAALGY |
GUT_GENOME041664_00480 | 1-67 | MTLIVLHACCAVCAGYPLELLHETGFEPTVFYFNPNIHPKAEYDRRLEELVRYCGKKNIKLITGDDD |
GUT_GENOME009245_01151 | 10-79 | ILLHVCCAVCSAFPVIKLKNLNYTPILYFYNPNIYPKSEYFRRLKELILYSKKLNIKLIIDHFQPIVWQN |
GUT_GENOME134442_00041 | 26-81 | KPRLLLHCCCGPCATYPVSILYKYFDLTLYYSNPNIYPIEEYDRRFETLKRFIEEY |
GUT_GENOME111119_00239 | 2-67 | KLLLHACCCPCTLEPLRLLLEEGHDIAIAYMNSNIHPAEEYEHRRAVLLDFAREQGLEVIEGVYDP |
GUT_GENOME142012_01800 | 1-80 | MLVHICCAVDSHYFLEKIQEEFPDEKLVGYFYDPNIHPYSEYRLRYLDVEYSCKKLGIPLLEGPYNLEEWLKKVKGMEHL |
GUT_GENOME238202_01401 | 1-68 | MSKIVLHTCCAPCSGAIIEYMVQNGMRPLVFFSNSNISPREEYEKRRAEVLRYAGSFGLEVVDDDYGH |
GUT_GENOME256437_01190 | 7-91 | KLLMHCCCAPCFTYIENDIKLNGILNEEGKKEKVDLTACFYNPNIHPRVEYERRKNAFIKFCKIKECKNVIIDEYNMKDYVRFVV |
GUT_GENOME111522_02146 | 24-99 | RLLLHACCGPCSSAVLERLCRYFDITVLYYNPNTWPAEEYHRRGEELERFVAAAHPLGVTVVEDRYDPQEFYSAVA |
GUT_GENOME042231_00524 | 44-116 | KLLLHVCCAPCSSYCLEYLSKYFDITVYFYNPNISIADEYNYRLSEEKRLVSLMPFEHPVKVVEGEYLPKDYF |
GUT_GENOME113174_00267 | 6-70 | KILIHACCGVCFSYPLILLREMGYEPVVYFINPNIYPKEEFERRYLELEKYCSVNNVELIKEEYN |
GUT_GENOME074402_00941 | 5-76 | RLLLHICCAPDATIPWPELISEEYETAGYFYGGNIHPAEEYERRLEAVRILASETRGELVVPAYETDEWFAV |
GUT_GENOME172377_00777 | 5-68 | NTLLLLSCCAPCSGGVLARLAAENRPVALLFYNPNIYPAEEYKKRRDEQKRACEHFHIPFIELP |
GUT_GENOME189150_00139 | 28-96 | KLLLHCCCAPCSSHCLSVLAEHFDITAYFYNPNITDYDEYIKRFRELDRFVHEVYVEGVDVELAEHEPQ |
GUT_GENOME238203_00589 | 18-95 | LLLHVCCGPCATGCVDPILQMKRKIILLFSNSNLNTSEEFQKRLDSVKIVARHWNLELLVDDYRHEGWLKSVSEVQHY |
GUT_GENOME154921_01150 | 57-123 | RLLLHVCCGPCAVMPLRRLLDEGYAVTAWFMNPNIHPLTEYLRRREAAGECAAQLGVPIVYADATWD |
GUT_GENOME243297_00101 | 3-70 | KAAKGRLLLHLCCAPCGTVGWTDLLAEGWDVTAFFWGANIHPYGEWVRRREAVVQLSKALGGPALLRP |
GUT_GENOME011771_00229 | 5-66 | NKILVHCCCGICAGYPLTLLGEDCVAFFYNPNIYPLEEYNKRLEAMKTLCKKLKKELIVSQY |
GUT_GENOME253371_00380 | 4-50 | RLLLHTCCGPCLLGSIDHILDYDITVLYYNPNLSDENEYNIRLQALK |
GUT_GENOME112054_01577 | 46-110 | KLLLHACCAPCSTYCLTQVLPHFDVTLYYANDNITCREEWDKRLGELTKLVDIVNSGKFVVATPL |
GUT_GENOME176079_00966 | 1-62 | MKLLMHICCAPCANMPIDALRADGIDLTGYWYNPNIHPFTEYRARRNCLQEYAKTIELPLLM |
GUT_GENOME065114_01298 | 2-61 | KLLLHACCGPCSLEPVRLLRAAGHDLTIAYLNSNIAPADEYARRLATLRAWAADEQVPVV |
GUT_GENOME238261_00075 | 6-79 | RILLHTCCGVCAAHCIRVLKAEGWEPVLFFSNFNLYPAAEYARRRAAAVALAQAEGVEWVEDVPEHAQWREAVA |
GUT_GENOME055813_00422 | 108-176 | ILLHACCAPCSAAVLEWLLKNNMAPTVFFFNPNIAPRTEYEKRKAELVRHSAFLNVPMIDGDYDHTTWL |
GUT_GENOME108492_00791 | 1-62 | MEKVLLHSCCAPCSAAILEWMGQNGYEPVILFYNPNIFPEEEYLRRKNEIVRYAHETGVAIV |
GUT_GENOME180765_01335 | 1-64 | MKMLLHACCAPCSLEPTRILLERGADITVYFSNSNIAPKAEYDKRLAELRTFASDIGLHLIEGE |
GUT_GENOME070248_00450 | 17-81 | RILLHSCCAPCSSSVLWYLMPLSYVTVFFYNPNVMPYSEYAKRLTEQRRLCSIMGVNLVEGDYDA |
GUT_GENOME238996_00849 | 1-62 | MKLLLHICCAPCSAACIKVLREENIDIVGYWYNPNIHPFKEYDNRLKALKEYSKMINLNVIY |
GUT_GENOME130633_00181 | 69-138 | KLLLHACCAPCSSAVLERLGNLFQISILFYNPNITSEEEYTKRLVELEKFVGRFKTKYKIDVIAGRYEPR |
GUT_GENOME236517_00435 | 4-63 | KLVIHTCCAVCMCYPKTLLENENYETIFYFYNPNIYPIKEYERRRDEFINYAKSFNIAVH |
GUT_GENOME023195_01618 | 1-61 | MRLLLHMCCGPCSCYPVKKLREEGIEPVGYFFNPNIHPYKEWDMRLKTAKEFAEKVNMEFY |
GUT_GENOME109794_02464 | 1-66 | MKKLLLHVCCAPCASACVERLLESERDIVLFFSNSNIHTEEEFEKRLFHVRRLAEIHSLELLVDEY |
GUT_GENOME121272_00688 | 55-113 | RLLLHVCCGPCAAGVLPRVTPHFDVALFYYNPNILPKEEFIKRLDTLKRLLLHFPEVKL |
GUT_GENOME032767_00011 | 192-271 | RILLHSCCGPCSSAVLERLVKTADVTVYYMNPNIHPEAEYRRREYVQQKLIHKFNAERGTDVYFLAAPYDPSSYFEAVKG |
GUT_GENOME252255_01396 | 1-62 | MKLLLHSCCGPCSCYPTQELTEKGVDFTLLYYNPNIHPYREFKHRLAALRELAEKKEYKLII |
GUT_GENOME112247_00573 | 55-123 | KMLLHSCCGPCSTVVIERLKDSFDLTVFYYNPNIEPKEEYEKRKVEQQKVCKKFDVPFVDFDYDNDGWR |
GUT_GENOME113180_01844 | 60-125 | QIGETKQAPTLFLHSCCAPCSSYVLEYLSSYFRITVFYYNPNISFDEEYKRRVAEQKRLIRAFNEA |
GUT_GENOME199838_00483 | 265-343 | RLLLQSCCGPCSSYVLEYLTRYFRVTVLYYNPNIQPRAEYDRRLYWQQELIRCLPTPEPVALLACDYDGQRYIDAVRGL |
GUT_GENOME110771_01281 | 19-84 | HVCCAPCASGCLGMLEEGGRQFTLYFSNSNIWSREEYEKRLRSVEKLASLCRLDLQVDPYDHASWL |
GUT_GENOME070478_00539 | 1-73 | MKMLVQICCSVDSYYFLTRLKQEFANDEIIGFFYNPNIHPQSEFELRFCDVARSCKQLGIELVKGEYTPLDWL |
GUT_GENOME139644_00604 | 7-71 | RLLVHACCAPCLVAVYDDIVSSLDEFGLQTIEDFDVLWYNTNIHPKVEYEHRKNTLKEYLSLMGK |
GUT_GENOME244103_01329 | 11-72 | KVLLHACCAPCVAVGVKILLQEHWDVTLYFYGGNIHPPAEWSRRLESLQKLAADYAVPLVVR |
GUT_GENOME017603_01032 | 1-65 | MPRLLLHICCGPCSLMPIVHLRDEGWEPTVYFFNPNIHPADEWRKRRDAAREAASRLNVPFLEEG |
GUT_GENOME177933_01579 | 274-342 | ILLHSCCGPCSSYVLEYLIQYFDILLYFFNPNIHPETEYIKRLETQKDVLIKMKLNDTVKLIEGKYDPD |
GUT_GENOME141421_01659 | 149-220 | KILMHVCCAPCSTYTLEYLSQWADVTIYFANSNIHPKDEYYRREYVTQKFVHDFNKNTGYSVQFLSAPYEPN |
GUT_GENOME254219_00218 | 23-83 | KLLLHTCCALCFSSAFMQCKDYFEITVFFYNPNIYPNAEYEKRKQETLRLIEIFNQEYHLN |
GUT_GENOME025996_01331 | 1-75 | MKLLLHICCGPCSLYVIDALRSRLPQWEITGYFYNPNIHPYDELIRREAGAVKACDYKGISLIRDTEYDLAAWKG |
GUT_GENOME014164_01229 | 14-83 | ILLHCCCAPCASAILEAMVEAELDVTVFFSNSNIFPFEEYLTRKDELVRYCERLGVPYVEDVYAHEDWQK |
GUT_GENOME081001_00539 | 2-73 | KLLMHTCCAPCSVYCIDRLRKEGIEPTVYWYNPNIHPYKEYEARRDCLKEYTKSINVEAIFEEEYGLREFCK |
GUT_GENOME146460_04542 | 25-97 | NKLLLHSCCAPCSGEVMEALQASGIDYTIFFYNPNIHPQKEYLIRKDENIRFAEQHGVPFIDADYDTDNWFER |
GUT_GENOME120672_00218 | 25-78 | KLLLHCCCAPCASYCLEYLNEYFDITALFFNPNIMPKEEYDLRLGAFDKLLTNF |
GUT_GENOME044087_00835 | 1-59 | MKVLLHTCCAPCSIKCIEKLKDDNINPTVFWYNPNIHPFKEYQIRRDTLIKYVKNNNID |
GUT_GENOME109192_01455 | 27-106 | KLFLHACCGPCATYPVTFLNNFFDITIGYFNPNIAPFEEWDLRLKELTRFINEFNKENGSNISIVTMPYDHQIYLDAIKG |
GUT_GENOME205742_00114 | 316-389 | LFLHACCAPCSSYVLEYLSRYFTITIYFYNPNIMPEAEFSHRLAELRRLLQEMPLEGTVSVIAGTWEPERYLAQ |
GUT_GENOME285829_01219 | 47-105 | LLHACCGPCATACVERLAPDYSVTVYYYNPNITDSEEYYLRRDNLRKFLNGFNRDHEGE |
GUT_GENOME238207_01096 | 1-75 | MATLLHVCCGPCASACVPALRAAGRAPLLLFANSNLDTEEEWARRLDAARALAAAEGVELLVEPYDHDEWLEQVA |
GUT_GENOME010663_00005 | 191-256 | QNKLLLLSCCAPCSCAVIKKLAEEHADFTVVFYNPNIRPKEEYDKRCSENKRVCELYGVPFIELEY |
GUT_GENOME190844_00985 | 7-72 | KVLLHACCGPCSIMCIQSLRDEGYDVTGYFANPNIHPVSEYFRRREAMEQVAEQMDLPMLWQDDVY |
GUT_GENOME018140_00754 | 19-87 | TLLLHICCGVCSVYPLIYLRQYFKITILFTNSNIYPYEEFLKRLDALNQYLEYLGDKEIKLIVDKYDYD |
GUT_GENOME039046_00062 | 190-274 | KLFLHSCCAPCSSYCLEYLRQYFDVTVFYYNPNISFGEEYRHRAEELRRLVKELNEEAAGSETLNQIRLEEGAYEPEKFFEATKG |
GUT_GENOME096069_01365 | 1-71 | MRILLQLCCAPDGTVPLDILLCEGNEVICFFYGHNIHPQEEYERRLCAARQLAEFFRVTLVVPPYDPDRWL |
GUT_GENOME120666_00183 | 172-246 | LLLHSCCGPCSSYVLKFLHDFFDITILYYNPNIFPRDEYNHRLDTQKEIVKKLGYNIKIIEDVYDHKSYLDYIKG |
GUT_GENOME065882_01877 | 3-74 | RILLHICCGPCSLSCIDELQREFPGAITGYYANPNIHPYDELLRRQGSARAACAAKGILLLTENTYDFAGWA |
GUT_GENOME283577_00277 | 36-96 | PKLLLHSCCGPCSTAVIERLMNEYEVTVFFYNPNITDRDEYVKRRHAQLSVIDRFNEMTEC |
GUT_GENOME214923_01757 | 29-104 | KLLLHSCCAPCSSHVLTLLAHIFDVTILFDNPNIYPEEEYNKRYDELCLLIQRMNVNIRVIKVNYNHQRFLDAIQG |
GUT_GENOME106058_00374 | 14-82 | RLLLHCCCGPCSSAVLEYLSARFDLTLLWYNPNIYPEAEYEKRYAALTELIEKMELAGRLAVIREPWQS |
GUT_GENOME238994_00147 | 23-84 | TKVLLHTCCAPCSSAIIECMLQHNIRPTIFYCNPNIYPKAEYEIRKNECVRYAKSLGLDVVE |
GUT_GENOME139231_01004 | 46-129 | IVLHVCCIVCACWPIDFLKENGFDVTIMYNNSNIWPKEEHDHRLNELKRYLKERWHDEIPLIVEPYDYDTYASTILKNRNNDPE |
GUT_GENOME286090_00861 | 25-101 | ALLLHSCCAPCSSYCLETLSEDFAITVFYYNPNIYPEEEYWKRVREQERFIGLLPAKHEISFLEGNYEKERFYETVK |
GUT_GENOME079795_00649 | 50-106 | RHPKLLMHACCAPCSAFPLEFLSGVFAVTIYYNNSNIYPAAEYQRRLEELKRYLEEV |
GUT_GENOME208129_00084 | 28-106 | LYLHSCCGPCSSAILAYLAPFFQITVYFDNPNISPRAEYDKRLFYQQKVIDALPEEYDIRLATGLYDPTRFQEAVAGYT |
GUT_GENOME253512_02026 | 10-67 | KLLLHACCGPCSMEPVRLLRERGIEPSIFYANSNIHPDGEYEHRLDTIKEWSVREDLA |
GUT_GENOME237440_01691 | 51-130 | TLLLHACCGPCSSYVLEYLVKYFQITVFYYNPNIYPPAEYERRFEELKNLYKVFPPAIQGNVQIIEKTYNPQEFYDAIQI |
GUT_GENOME286625_00811 | 52-120 | VLLHSCCAPCSSAILEWMRGHGFAPTVFYFNPNIFPEAEYLVRKRECQRYCEKLGVPFIDADYDHARWR |
GUT_GENOME051151_00293 | 3-65 | NQKPKMLLHSCCGPCSSAVIMALKDDYDITVLYYNPNIYPEEEYLHRKREQIKLIESVNKEGE |
GUT_GENOME062725_00039 | 23-88 | MNKLLMQVCCAPCCGAVLECFKYHNISPTLFFYNPNIHPRAEYEKRRDELIELAKLLGFDYIIPEY |
GUT_GENOME154089_00814 | 505-582 | LLLHSCCAPCSSYTIEYLSQFFKITVLYYNPNISPKEEYEKRKSEQLRLIGEMKTKYPVSFLDCDYDYNEFLDIARGY |
GUT_GENOME121411_00678 | 45-118 | VLLHSCCGPCSTSVIERLAHDYEIIVYFYNPNIDDPKEYEKRKKTQKEFIDKYNKDNKLDISFVEGDYKPDDFH |
GUT_GENOME034176_01222 | 42-123 | RLLLHSCCAPCSSYVLEYLSRYFTITLFYFNPNIYPPQEYQERIGEQERLIREMECAGALGIPVEFQGGEYEPEEFYRAVRG |
GUT_GENOME214847_00037 | 193-255 | MKSLLLHACCAPCSLEPVRLLREEGFEPTICWTNPNIQPRDEWQRRLDELRRWCADGDIELIE |
GUT_GENOME236867_00696 | 14-83 | KILVHVCCGPCATSSIKRLLDEGWEPLLFFSDSNIWPEEEFEKRYSTLLQVASFYNLPVIKDEYHHEKWR |
GUT_GENOME109793_00361 | 3-64 | VLLHACCGVCASHCVDFLLGRGDEVVLFFSNANIFPHEEFLRRLGGAETLAAHYGVPLVVDA |
GUT_GENOME176680_00948 | 272-328 | RLLLHSCCGPCSSAILERINEYFDIDVFYYNPNIDREEEFYRRADEQVELVKNLGLE |
GUT_GENOME237934_00914 | 29-107 | ILLHDCCGPCACFPLLFLCKHFDVTIYYTNSNIYPEEEFDKRLGEVKKLIQYLKDTWGYDVKLIVSPYNHEEYMKDLRS |
GUT_GENOME238256_00453 | 2-65 | EVVLHTCCGPCASACVKRLQDENSKVVMYFSNSNIDSREEFERRAAAAKKLAAADGVEIIIDEY |
GUT_GENOME157276_01703 | 40-98 | RLLLHSCCAPCSSAVLDALCADFDITLFYYNPNISTEAEFTHRVDELRRFCHETGIDVE |
GUT_GENOME054737_00237 | 13-73 | KSKAKEGSKLLLHVCCAPCATYCLTRLLDVFDITLYFSDDNIFPDAEWEKRLGEVKKLVDI |
GUT_GENOME238249_01286 | 22-86 | LLLHCCCGPCATACLERLAAQGRRCALFFSNSNLDSQEEFRRRLEALETVAAHFGIQEVLVDPYR |
GUT_GENOME220356_00557 | 25-97 | SVLLHCCCAPCATSVTERVIKTVKPVLYFYNPNIYPENEYKKRLHELEKLAAYFSLELIAEDYDEGEFLSAVG |
GUT_GENOME051358_00500 | 33-110 | LLLHVCCGACSAFPLIYLIDLFNITIYFSNSNIYPLEEFNKRKETLEKYVDIINKKFNKNIKIIADNYNYEEFKKDLI |
GUT_GENOME157169_00634 | 27-102 | IKLVLMSCCAPCSAGAIQQLVNGDIAGVSDFIVLFFNPNIFPESEYTKRLDEQIKYCKKLGVKYAIGNYDHAAWRA |
GUT_GENOME095596_01246 | 28-93 | KILLHVCCAPCGEYPYVKLLQEGFEVKVLFYNPNIHPLEEWQLRKHNVEIFSHLHHVDCSYDDTYL |
GUT_GENOME117452_01636 | 18-92 | KILLHACCAICSSYPVSFLKDAGYEVVVYFYNPNIHPAEEYQKRLDAQRTLCKHFGVELIEEKYEPEEFFEYVKG |
GUT_GENOME014927_00870 | 21-82 | KLLLHCCCAPCTLGVIERVIEHFSVTLYWYNPNIMPLGEYEKRLSELEKVAGIFSVPLIIGE |
GUT_GENOME004606_00444 | 27-103 | LLLHACCAPCSSAVLEKLTAHFKITVLFYNPNIYPEAEYQKREAELKRLISEMPCTKEVALVDLPYVPEEFFTAVRG |
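
Each row of the sequences list gives a protein identifier, a residue range, and the amino-acid fragment for that range. The following is a minimal sketch, not part of the original record, of how such rows might be parsed and sanity-checked in Python. The helper names `parse_row` and `range_matches` are hypothetical, and the check assumes the range is 1-based and inclusive, so a fragment spanning `start-end` should contain `end - start + 1` residues.

```python
from statistics import mean


def parse_row(line):
    """Split 'ID | start-end | SEQUENCE |' into (id, start, end, sequence)."""
    parts = [p.strip() for p in line.split("|")]
    protein, rng, seq = parts[0], parts[1], parts[2]
    start, end = (int(x) for x in rng.split("-"))
    return protein, start, end, seq


def range_matches(start, end, seq):
    """True when an inclusive, 1-based range has the same length as the fragment."""
    return end - start + 1 == len(seq)


# Two rows copied from the sequences list above.
rows = [
    "GUT_GENOME112096_00656 | 25-78 | LLHVCCAPCMSAGIVSLASHFRVAAYFYNPNIFPEEEYDKRLGEVKRLIAALGY |",
    "GUT_GENOME253371_00380 | 4-50 | RLLLHTCCGPCLLGSIDHILDYDITVLYYNPNLSDENEYNIRLQALK |",
]

parsed = [parse_row(r) for r in rows]
for protein, start, end, seq in parsed:
    print(protein, len(seq), range_matches(start, end, seq))

# Mean fragment length over the parsed rows only (not the whole family).
print(mean(len(seq) for _, _, _, seq in parsed))
```

Running the same check over every row would flag any fragment whose length disagrees with its stated range, which can help catch truncated or mis-copied sequences before further analysis.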