UHGP-MC 33832
Information
- Number of sequences (UHGP-50):
- 141
- Average sequence length:
- 52±5 aa
- Average transmembrane regions:
- 0.02
- Low complexity (%):
- 0.3
- Coiled coils (%):
- 0
- Disordered domains (%):
- 1.92
- Pfam dominant architecture:
- PF12850
- Pfam % dominant architecture:
- 8936
- Pfam overlap:
- 0.29
- Pfam overlap type:
- reduced
Downloads
- Seeds:
- MC33832.fasta
- Seeds (0.60 cdhit):
- MC33832_cdhit.fasta
- MSA:
- MC33832_msa.fasta
- HMM model:
- MC33832.hmm
Sequences list (filtered 60 P.I.)
Protein | Range | AA |
---|---|---|
GUT_GENOME273764_01968 | 117-178 | ADIVMYGHTHRPVIEIDSDIIAINPGSLSFPRQENRKPSYIIMEIDDAGDAHFTLNYLDYNF |
GUT_GENOME236237_00499 | 100-160 | LGCSVVMYGHTHIPDIDDVGEIICLNPGSLSEPRQIGYRHTYIVATTDDEGGLSYELKELE |
GUT_GENOME122083_00717 | 98-149 | KELGVDLALFGHTHTAVIENLDGVTLINPGSVAMPTNFEKSYAYVVVNDKKI |
GUT_GENOME030479_00870 | 823-876 | LLHGHTHIPAAQSFGDGYYYINPGSLSIPKEGSKNSYMILEGRTFTLKSVEDGS |
GUT_GENOME022298_00646 | 106-154 | QADLVAFGHTHRRFIKTEGDLWVVNSGSIGLPRDNRQGTYAIVSIEDGA |
GUT_GENOME278174_00356 | 86-139 | MASKHADFYVYGHTHVQDIRTYQGMYILNPGSTTFPRDGSIGTYLVLSWDEKGE |
GUT_GENOME284174_00337 | 111-157 | GEIFIQGHTHIPMLKKENGRILANPGSITRPRGCDLRCYLLLDEEKI |
GUT_GENOME047484_00434 | 97-149 | CDIVCFGHTHRSYIEKRKGVFLLNPGSTWMSRDGNPPSYAILDVDTSISAQIV |
GUT_GENOME096501_00386 | 102-160 | KSLDCDLAFFGHTHAFDDRVVDGIRLINPGSCSWPRGGDGKKSYVILTLDDDGGMGVER |
GUT_GENOME179657_01213 | 108-162 | RAALFGHTHVPLCETRGGITIFNPGSIGVPYPPRKPSYGVITIEGDLVMYEIREV |
GUT_GENOME142252_00723 | 98-147 | EVDADIVIFGHTHTPFREIKDGVLYINPGSTSLPRGVSYKSFVIMDIEED |
GUT_GENOME055679_01774 | 97-149 | DFLKKYNGDVLISGHTHMPLVKEKNGLWLINPGSTSWPRGGSKRSYAVMTIDG |
GUT_GENOME067866_00479 | 101-152 | MECNIAMYGHTHVPDNSVYGGMIIVNPGSVSLPRQMNHKPTYAVMKVDEQGR |
GUT_GENOME095922_00205 | 117-169 | GDVLLYGHFHTGFAFKKDGMVIANPGSVSLPKEGTAHSYIIIENGEVLLKDLK |
GUT_GENOME121306_00602 | 129-181 | ADILLFGHTHQPLVDRSGDFWVMNPGCIGPSVRRTYGVITLEDGKVDCAAFRL |
GUT_GENOME284015_02060 | 103-158 | QCDIALFGHTHKPLLEETDPAVLILNPGSISSPRQENGRPSYLVLELEADQRPVAE |
GUT_GENOME096480_00351 | 98-150 | KEKGVQVIVFGHTHQPLVEVMDGILFINPGSLVKPRGVPQGSYAMLSIEKGRW |
GUT_GENOME114554_00151 | 99-147 | ANLVCFGHTHCPMIKEIDGLMVINPGSLKNGNYALIMINDGKITAELKR |
GUT_GENOME087578_02869 | 94-160 | ALRYFGLQNHADIVMFGHTHVPYLEKTDDLILLNPGSISKPRQDNKIPTYTVMEIREDGKIDFRMCE |
GUT_GENOME160823_03113 | 104-166 | QEMGADVVFYGHTHCPAFHYYEKEGVTVFNPGSIALPRQMTPAGPTFLIIDLADDGRLTPELY |
GUT_GENOME216297_01346 | 105-150 | RADVLLFGHTHEPLVDYDDGLYILNPGSLSYGRPTYGLLDITEAGI |
GUT_GENOME253443_00956 | 97-146 | ENKNADIVLYGHSHIPSIDWLDGRLFVNPGSLCRPRDGYKPTYCVIDIDG |
GUT_GENOME282114_00786 | 95-147 | KEVGADVAFFGHSHLLGAELVDNVLFVNPGSLLKPRGIADKSYVVVDFESGKW |
GUT_GENOME256402_00686 | 136-190 | LHCSVLLFGHTHISIVNNDGRVLAVNPGSPSCPRGGRSPSFALLEIEDGDVRAEI |
GUT_GENOME282538_02103 | 103-151 | NSDICLFGHTHIPLLTSFNDVVYMNPGSISCPRSEFSNSYGIIEISNDK |
GUT_GENOME170594_01246 | 95-146 | EELGCNVALYGHTHIPKLDYDGKLYIINPGSPSCPRGGAPRTFAILTAECGS |
GUT_GENOME092719_00405 | 95-156 | LQEANCDILLSGHTHVPMYKEVDGYTLINPGSTTLPRGGSNRSYCVIYVEGKDLKVEFKNLD |
GUT_GENOME276849_00878 | 95-135 | FVFGHTHHHLVEKIDDCYIFNPGSVSLPRDGTNGTYLVLDI |
GUT_GENOME175656_01798 | 110-157 | AEVLLFGHTHIRYERYVNNLYIFNPGSIALPRDGRPSFGILEFRGNDI |
GUT_GENOME121337_01894 | 125-172 | GHTHIPRGETVDGVHFWNPGSTTLPKGGFPASYGVFETGAFRVFGLDG |
GUT_GENOME146130_04481 | 116-167 | LNQNDVLVYGHTHLPVAEQRGEIFHFNPGSVSIPKGGNPAGYGMLDNDVLSV |
GUT_GENOME103750_00006 | 103-150 | DIVLYGHTHIPIIDKLENIIFMNPGSVGKPRNAHSTAGIIEINDKNEV |
GUT_GENOME248738_01736 | 102-161 | EMDIVMFGHTHRPVIEVEDNITLINPGSISYPRQADGIPTYIMLEIDDNNEFSFMLKNVY |
GUT_GENOME233272_01140 | 108-162 | DIVMNGHTHRKTLYRDAGVIYLNPGSIAYPRDGIASYAVIDDGGIRLYDLETQEV |
GUT_GENOME158805_02713 | 97-149 | CDFVFYGHTHIFSSQQRDGVILVNPGALSRNRDGTPPCYAVITIDGNDVSIER |
GUT_GENOME015192_00323 | 110-163 | GIVLYGHTHVPRIESLDGRLIVNPGSAKRRLAADSRSATYAVLTLDASGASAKL |
GUT_GENOME000232_01625 | 101-160 | NADIVLFGHTHQPLLEEYHGITLMNPGSISFPRPLGKTPSYGVIELNQGKTDKFEIRYIV |
GUT_GENOME260243_00516 | 100-145 | READIVLFGHTHVPFEQYIDGAYYLNPGSLCRKGGSPSYGIVDITK |
GUT_GENOME164241_00536 | 135-186 | LEKNRFDAGLFGHTHVPFAKFSNDILLLNPGSISLPRNNSQKSYAVLAVTKE |
GUT_GENOME029628_00838 | 98-150 | HEKGADIVIFGHTHIKENKIIGGVEMINPGSLSLPRDGIKGSYVVMDLENGKY |
GUT_GENOME139825_00863 | 102-151 | GCETALFGHTHIPFLEREDGILLLNPGSCAYPRGGGKKSFAVLETEEKKL |
GUT_GENOME213666_03204 | 101-154 | DQECDVLLCGHTHVPHLEDRYGVLILNPGSLCFPRSSAGASYAILDTGGKRADA |
GUT_GENOME274355_01776 | 103-157 | KADICVFGHTHVPMVEKHDDLVIVNPGSLSRPRGGSRAGYAVIDIDDKGEIDVKL |
GUT_GENOME238877_02207 | 120-174 | QAEVVLFGHTHVPCVEEQEGILFVNPGSLTYPRQEGHKPSYAIMEISDSGEIRVN |
GUT_GENOME284474_00592 | 99-157 | HRLEANIVVFGHTHVPLVKWYGDVLLVNPGSPSRPRSMDGPTFGVLTLNEGEKPSAEII |
GUT_GENOME081675_00896 | 121-171 | IVFFGHTHIRVWEIYDGITIVNPGSLLFGMDGNDKSYAIVNVTKEDVQVEF |
GUT_GENOME237834_00605 | 102-157 | ADILLYGHTHQAVCSREEDGMWRMNPGTVGGEHASATYGVILLEGGKISCEIKRVS |
GUT_GENOME263097_01233 | 101-149 | IVCFGHTHVPYDEQFEDIHLFNPGSLSYNRDGSKPSYMILHFDGQQVSA |
GUT_GENOME083297_01431 | 107-147 | VLFGHTHSPICQNINNLAVFNPGSISLPRGTDFATYGFIEI |
GUT_GENOME158759_00164 | 107-151 | LFGHTHLPMNETIDGILLLNPGSLSKPAGGMQGSYALLTVSSDGP |
GUT_GENOME017990_01038 | 105-155 | DCDALLFGHTHIPMIERRDGIVLANPGSISRPRQEGHRPSYMVINIEEDGP |
GUT_GENOME274004_01742 | 402-458 | AEVALFGHTHIPHAETRSGVFLFNPGSCGRCYTGPNTYGILTLDEGKVTAYEHKEVP |
GUT_GENOME051051_00813 | 94-145 | DCLIYLFGHTHCHMVSNLDKNHYIANPGSLTRPRDGSSGTYLIITLDENGPQ |
GUT_GENOME129877_00568 | 91-142 | IDQHHPDIYLYGHTHLVKEQIYNNCLILNPGSITLPRDDWHGSYIILTIDKK |
GUT_GENOME045178_00623 | 122-171 | LVYGHYHVPLCKKVNGIYILNPSSISLPKAGTNSYGVYENHVFTIKDLDG |
GUT_GENOME000586_00167 | 100-156 | EENADIVLFGHTHIPMIDQDDGLLMVNPGSVSLPHFGKEKTFGILTIENESVSGEII |
GUT_GENOME238598_00622 | 119-176 | QVLLSGHTHIPSWQWVGDLFCANPGSVSLPRGGSPHSCLLYEDGLFRWLTLEGETFHW |
GUT_GENOME078809_01660 | 103-150 | ADIALFGHTHLPYYAYTDGVYLFNPGALSMSQSGRCTYGTLMLEEGKE |
GUT_GENOME114214_00240 | 100-144 | ADIVLFGHTHTALSEQVGGTLFVNPGTLSPYSAHASYAYLAFDEK |
GUT_GENOME178291_01480 | 304-360 | ADLVMFGHTHRPFFLQKDGMTILNPGSLSFPRQEGRRGSYMIMEVDGDGKLSFEQKY |
GUT_GENOME285915_00133 | 101-151 | EIGADVCLYGHTHIPVVDNYNGMVIMNPGSLTSPRGGSTFSYGIIKIENGI |
GUT_GENOME213226_00273 | 102-154 | EQNADLLLFGHTHVPLVDASARPMLLNPGSIGDPHRPTYGVLECKDGKIIPSV |
GUT_GENOME058112_00759 | 116-167 | FNRMGILVYGHEHIPYIKKNENMIYVNVGSISLPKNNSNPTYAIYKNKNITI |
GUT_GENOME047473_00476 | 103-152 | QVDIICFGHTHIPLITKKKGILLLNPGSLAFPERGYQNSYIVLTFSENGI |
GUT_GENOME014874_01781 | 334-391 | DCDVVMFGHTHKPFLKTVDGVTCLNPGSISYPRQVDRRPSYMVIDVSEEGDFEFRPVY |
GUT_GENOME147776_02794 | 103-148 | ADICLFGHTHQPCQQYQDGIYLLNPGSCARPAQGRPTYGVLDIRRD |
GUT_GENOME094823_01883 | 105-157 | DADMVIFGHTHVPFLECSSDMIVVNPGSIAKPRQSDHQPTYGWMEIDGGKLKL |
GUT_GENOME057581_00638 | 100-165 | EEQGADVVLFGHTHRPVYDDRGRVMLFNPGSISMPRGGTPPTFGILTIEENGRMEGAIMEYHRDMP |
GUT_GENOME164270_01285 | 105-159 | KAQIVLFGHTHIDGVAMFDDKLFINPGSISLPKGPHTAIGGTYAVLTVSDTAYSV |
GUT_GENOME037699_01388 | 102-154 | EQADIALFGHTHIPYLENEGGILVMNPGSISLPRSSCGRTFAFITIENGNITA |
GUT_GENOME096554_00136 | 99-150 | QEMQANVVLFGHTHIPMVDYREGMHLMNPGSLGMPRGSNGTYGVLDVTEQGI |
GUT_GENOME258569_00728 | 103-154 | RDVGADICIFGHTHIPHLQTEDGILMFNPGTSFRTYGVLKIDDGGNIDAEIK |
GUT_GENOME276125_01044 | 119-175 | ASLFLYGHTHLWELNCNNEGMWFCNPGSISLPKEGRPASFAVYDNGTIAIYTLSGEL |
GUT_GENOME008081_01275 | 103-154 | PNCEFKQGDIYCHGHTHIPSIRKLDQIIVCNPGSVSLPRAGFKASYMIIDDK |
GUT_GENOME003067_01009 | 121-181 | LQKKDVVLSGHTHIPITEPTDGGLFFNPGSVSIPKNGSKNGYMTLENGLFIWKSFEDGEYA |
GUT_GENOME175358_02489 | 231-289 | LKPGDIFMHGHTHVLRTEKVGDITILNPGSVSIPKEGNPPTYAILENSRFTIKTFDEEI |
GUT_GENOME018982_02054 | 102-154 | ECGCQVVLFGHTHRPFQQMSDGVLIANPGSISRPRQPGYQPSYGVLSISEKGE |
GUT_GENOME049427_01519 | 103-158 | EADVVLYGHTHVPFVEQSSQMTVLNPGSISRPRQSGFECTYAWMEFLPNGEISIEI |
GUT_GENOME096508_00471 | 98-151 | ANVVCFGHSHLPQCVARGGALFINPGSLVAPRGFPEPTYAVLEAESGKVKVAYF |
GUT_GENOME199425_01498 | 100-150 | RDFEADICIFGHTHEPFCEEQDGILVLNPGSSRFTYMVLNINNGKVEAELK |
GUT_GENOME242999_01005 | 114-156 | ILTYGHTHVKQLQKTQEGLILLNPGSTSRPRDGIKSFAYMQDG |
GUT_GENOME055197_01575 | 104-151 | EARLVLFGHTHRRFCQRDDDMTIVNPGSVSLPRDGRVGTYAVCAIADG |
GUT_GENOME100678_00510 | 119-175 | GSVLIYGHEHMPYIVNEDNRWFINPGSISLPKGDTYPSYLIYENRTFTIYDVKGQVL |
GUT_GENOME040009_00091 | 102-147 | NKAQLVVYGHTHCRDCRYSNGVYYVNPGSIALPRDGLAPSYAAIDI |
GUT_GENOME160038_01004 | 101-152 | QVQVVMFGHTHYPYLEEMEDLTVLNPGSLSYPRQPGRKPSYLVMEIDEQGKA |
GUT_GENOME096509_01997 | 96-145 | GAKVVCFGHTHVAMTEKVDDVLFINPGSIKKPRIRVEKTYCILDWEEQKM |
GUT_GENOME047596_00937 | 132-195 | VLPRGSIVLCGHTHIPVCKESDGITYLNPGSVSIPKEASPHSYMIFENGVFTWKDIEGGAYMTY |
GUT_GENOME028395_00680 | 116-173 | QRGDILLFGHTHAYMLKKMDGVVYVNPGSPSFPKNGNPPTYAVMEGLHLEIRRLDGDQ |
GUT_GENOME254219_01683 | 95-153 | QCNAFFYGHTHVPFYEYYKGIYLLNPGSLAYPRSSYGKTYAIIDLMQDRTVNVEIKSLE |
GUT_GENOME096506_03904 | 101-148 | ADVVCFGHSHMAGAEKVENRLFINPGSCRQPRDHREPTYAILSWNDNR |
GUT_GENOME096472_01164 | 306-367 | EEVNADIVCYGHTHIPIAEMEDGRLFLNPGSISLPIVAPYPTYVILTIMDDRTVQVSYRQVD |
GUT_GENOME136911_01164 | 92-136 | QGDALVYGHTHVSSLRKLPNGIFALNVGSIALPRDGAPSYLVLDD |
GUT_GENOME043515_00075 | 113-158 | QIVVCGHTHIPHMEKRGDVWVVNPGSISQPRQPGHQPSYLVMNIEH |
GUT_GENOME088990_02140 | 85-143 | DMPEATEADIIINGHSHRYRCEKKGRQWFLNPGSCGRRRFNYPLTWMILITDKGELTIE |
GUT_GENOME000232_02368 | 97-145 | REKQADIVLFGHTHRIFCEQHNQLAILNPGSIGDPRYPGRPSYGLVFIE |
GUT_GENOME163211_01161 | 119-169 | KEIGAQIALFGHTHCRYYAYEEGVHILNPGSAGAPRDGKPASYAYIDITKD |
GUT_GENOME119536_00607 | 108-162 | LFGHTHLPTLEERDGVTLFNPGSIGAPRFGGPSYGLLQLYENGEVKLQHKELIIW |
GUT_GENOME109440_00445 | 112-162 | PPAHLKKGDVCFYGHEHVPYIKENKGVYFINTGSVSLPVKNSPRSYIILDT |
GUT_GENOME134080_01689 | 102-146 | QKADIVLYGHTHIPFYERKNGIIYANPGSISHPRNSDESYGVLEI |
GUT_GENOME236231_00206 | 99-150 | EENGADIAVFGHSHVAFKEKVDGVLLINPGSLAYPRDGGGQSFMVLELHEGA |
GUT_GENOME254050_00202 | 127-181 | GDAFIYGHYHTGFIEEKSGVIVANCGSVSLPKGGTPRSYIVLDGQGLCLKDIDGN |
GUT_GENOME012853_00844 | 106-153 | NKYDVVLYGHTHIAHFEQKNSVYYFNPGSISRPRDGSGGTYGVIDLTP |
GUT_GENOME102015_00074 | 140-187 | DIVLYGHTHIPEISFYSGRWFINPGSPVRPRGGYQGTYCILTLEKERV |
GUT_GENOME247289_00384 | 99-148 | KDYPANVFCFGHTHIPYFNEVDDVTLINPGSLALPRNYPRHRTYAIYDTV |
GUT_GENOME028649_00583 | 123-164 | DLYMFGHTHLPLCEIQDDKIYLNPGSIGFPKGGHVPTYIEIN |
GUT_GENOME231418_00976 | 103-154 | DVVIFGHTHTYTQANKSGILFLNPGSVSLPRDGKASMMLMTLDNENINIEKI |
GUT_GENOME100510_00377 | 123-170 | HGNIFFFGHTHVPCLIKERNIIYANPGSLGVPRNGSISGYIIFNDDKI |
GUT_GENOME014886_00217 | 100-148 | QADCKIALYGHTHIPDIRYADGLYIVNPGSCARSRNGSNSYAVIDIRKN |
GUT_GENOME136236_01016 | 98-146 | DCSIALFGHTHIPLIAQQDGCLLVNPGSFNLPRGGSKPGYAILTLKNGA |
GUT_GENOME101761_00900 | 106-154 | ADIVMYGHTHVPYLKKVFGVTVLNPGSISLPRQEDGKKSYAIMTVADDE |
GUT_GENOME077741_00706 | 103-154 | DILLFGHTHIPYCEQVDGLWMLNPGSCGGRGATYGVISLENGEVMCYTVGIA |
GUT_GENOME161288_00739 | 102-162 | RRLEQDMVIFGHIHRPVWEEQDGVWILNPGSPSRPRDGSKAGFAVLTLKRGESPQVEFCCL |
GUT_GENOME120056_00851 | 102-156 | EVDIAVYGHSHIPEMHWEDELLILNPGSVSRPRFKNPTFAIIEIDDWGNIKPEII |
GUT_GENOME175921_01086 | 101-157 | KEAGAQVALFGHTHRSFCRSQGDVLLVNPGACGGPTGTYAVLETGQGAPTCEIRPVR |
GUT_GENOME008021_00085 | 99-146 | KEAHADIVLFGHTHISCIVYDDGLYLVNPGSVSEGRDGRQSYAVIDIM |
GUT_GENOME018906_00974 | 97-153 | DVLLYGHTHMFSDYSYEGVRFLNPGSCRLNRDGTPPSYMIINIDDDHNIDVERVDIK |