UHGP-MC 4441


Information


Number of sequences (UHGP-50):
101
Average sequence length:
70±5 aa
Average transmembrane regions:
0.01
Low complexity (%):
0.3
Coiled coils (%):
0
Disordered domains (%):
1.37

Pfam dominant architecture:
PF18832
Pfam % dominant architecture:
7426
Pfam overlap:
0.86
Pfam overlap type:
equivalent

Downloads

Seeds:
MC4441.fasta
Seeds (0.60 cdhit):
MC4441_cdhit.fasta
MSA:
MC4441_msa.fasta
HMM model:
MC4441.hmm

Sequences list (filtered 60 P.I.)

Protein Range AA
GUT_GENOME161170_00320806-879SVITLDTTGVEIEQHEGLWHTVDKMEIENEIFYLMRHNEFGDSVAAVILNSDGELVAQELEHGFDQGAMEAIRD
GUT_GENOME239259_03366441-506LHSDEIEIEGYTGTWYVMDVQRVKDRDLFLLESEIYGDEAACLIVNENRNVVMDNVHNGFADYLEA
GUT_GENOME088277_0329121-92QITESTKGFTAEGHFGTWHSIQMQEFHNEKFFQMRHEEFGEQVADIIVNEQGQVIAEDLWHGFSPEAMKLIG
GUT_GENOME199098_01173114-188VTLDTKNLFIDGCAGTWMATEECVVESQRFFLMQHEVYKEKAPNVIIDASGKLVSTEIRKGLDEDARKKILDYLN
GUT_GENOME057671_01454114-187ISYDTADFHIEGKAGSWLAYDSIMIDGREFFLMEHTTYGAQGANVVLDADGKIVADNVFHGFDETVKQQIREYM
GUT_GENOME048741_000287-73GQSLRDFCGTYELIDVAWIRGRKYVMYRSEQRPEALRLVYNVEGRLMCRTDYNDIKTAFVEELSWEM
GUT_GENOME094003_01828459-531VDMDTAGHTVEGHDGTWHSIEELAVHGNTYFLMESEAFGQEANMVVVDCAGKLVAEDITPAMRADVREIVAEV
GUT_GENOME059608_00351347-413LNDIHIDGYLGTWHEMIRVTANDEEFIVFENDEFGNQTAFVITDKDLNPITETNGDIETAIQDYLEM
GUT_GENOME271320_0140787-162LHSHAEHIRIAGHIGRWYVIDEGDFSPRTNKNLVRHLFLLEHESYGDEAACLIVTDEGVIVLDDVWNGFDDLREAG
GUT_GENOME272664_009074-65QTDGQRFKGRRGTWYMIDMDYIHGSRYYLYESEIWGDDAAHLVINDEGRVMCETFDDLHTAV
GUT_GENOME098708_01500413-479IGINSSHIKVEGHIGTWYAIDSTQIEGKDYFLLEHEDYGDEAACLIATPQGEVVLDDVWNGFGDLEE
GUT_GENOME000660_02895333-400LTKDTVGYEIEGKDGTWEVIDYLLVEGKNYFLMEHEQYGKDVAYVVLDQKGNVLVDSTYNGFDDVVKQ
GUT_GENOME243626_00444867-936QQDYPLDYIEDGIVLEGINDTFYIKDRETINGIEYYLLESQREYEDVPNLIVNKAREIIDDDIINGFDEF
GUT_GENOME039439_02038117-184VDTKGYTIEGKKGTWQVIDYILMNERKWYLMEHEEYGPRAAYVVLSDDGAVVMNDNYNGLDAEAREKI
GUT_GENOME109627_010491-63MKQIKLEGYKGTYTKIDETWYYGFKYYLFESDVYGDEAEAVVTNKDLEPITSGYDDIVTLLDD
GUT_GENOME279987_01346118-180VNLGSSGIAVSGHYGTWHTLESHNIQGRQFYLMESDEFGRDAANVVVDGTGKLVAEDVLTGFT
GUT_GENOME218909_01278838-914GLIHGDSDHIAVEGHIGTWYAIDETEIGGEKFFLLEHEEHGDMTACVAVNEQGKLVAEDLWNGFDEDFQEAVQKYLS
GUT_GENOME216553_00899418-487SLEGEISADSDSLIIDGYESTWYVVDTEAVDGKELFLLENEEYGDETFGIIIDKDRNVLVDEAWNGFADY
GUT_GENOME059329_0025262-132VSLEATGLHVEGHQGTWHSIEQKEILGHDFFLMEHDEFGSDTANIVVDDSGKLVAEDLWNGFNQDVVRMIT
GUT_GENOME004441_02258114-188LISMDTVDYQMEGVKGNWLAIDETKVEGNVFFLMQSEQYGANAAYIVVKDSGELVVRESSGFDDKTIEQIQRYLH
GUT_GENOME090400_00978105-170ISMDTKDYEVVGKKGKWRAVDTLIIDGKQYYLLEHQEYGSRVPTVILDSYGKMIAESDKGFSEEVK
GUT_GENOME098704_00816130-188FKSYQGTWTEFDEIIYCGEKFKIFENDKYGDNSFYVFTDEFNNPIDTTFNDLLTALEDY
GUT_GENOME106647_0274437-100VDEKTSGLAVEGHFGTWHTIEKVQVDGKDYFLMEHDEFGDEAAGVVVDSNGRLMAEDVTCGIEP
GUT_GENOME104454_00923358-433SIIDENTSGLAVAGHIGTWHTIDHKEVDGHTFWLMEHDTLGDDISCIIVDERGELALSHIYDGFDDHTVDLLRQEV
GUT_GENOME215061_0049716-80ELENIKIKGHTGTWYEIDRRTLYGKTFYLMESEIYGDEAPGIIIDEAHEPVIEDVFDGFDHETVS
GUT_GENOME101338_00180330-401INIDTRDYEIEGKVGKWQSTDELILDGNFFYLMENQKYRGDAAAVILDTYGKIIADDVMHGFDEETKQKIRD
GUT_GENOME235224_00170135-213LSWISDNIKIDGHEGTWYIIDEGDFQITPDVNGKPQTLTAHLFLLESRKFGDEAACLIVDKKKQIVMEDVWNGFDDLED
GUT_GENOME226447_008315-74AAGNTVAYMVPNYQIDGKTGTWLPYDSEVVEGVCFFMMRNEQRKDAVSPVVVDSKGVFVTDAPNGFENVR
GUT_GENOME046050_00688351-418IIEEGYNIEVDGHIGTWYVIDTDMMENTKYFLLEHEEHGDSAACVIVDGDGKLVLDDVWNGFDDLKEH
GUT_GENOME130023_02380107-181AVIHADTVDYKIENHNGKWRSVDYLIFDGKQYFLMENQKYGKQSVAVILDQYGKLIVDYCKNGFDEDAKRKIREI
GUT_GENOME000252_02465589-670LDEAEVTITADTRGFEADGHAGTWHTVDEREYAGEKFFFMEHDEYGSDVAGIIVSEHGQLVAEDLWNGYDAGALEAISEYLQ
GUT_GENOME010224_02910304-383GALITMETENYQIDGKKGNWIATDTIIIDGKQFYLMEHQVYRDQAQGVILDAYGKMVVEECKKFDEKTKQKIHDYIQQQV
GUT_GENOME140278_00032110-178MEKAISMESEQIAVAQHIGTWHPIEKQEIDGRLYFLLEHDTYGDEVASVIVDEKGILYAQEVYDGFSEE
GUT_GENOME126809_009902-63DKDNIKIKGHVGKWHVIDKKKHRGKTVYLLEHNTYGDMAAGLIIDENLNVILDDVWNGFLDL
GUT_GENOME015025_0158655-120DRFKVQRHGGTWYVIDAAYCERLETMVFLLESETYGDEAAHLIVDEDFLVILENVWNGFDDLTEAF
GUT_GENOME198843_00085111-189TALLTMDTEGFQMEGRKGSWMAADETIIDGKHFFLLASEKYGRSAAYAVVDDQGRKAAEDTFQGFDEETIRQIRQTISL
GUT_GENOME074587_01232123-191LISLATTNFQLEGKEGRWLAFDNLVVEGKEFFLMEHTTYGKNAAWVVVDGTGKLIVDQVTAGFDETVKE
GUT_GENOME045530_016295-69ISLTSNNIKVSGHIGTWYVYASRVYHGRRLFLVEHETYGDHAANLILDKTGNCVMEDVWNGWEDY
GUT_GENOME094635_01159103-179ALVTLDTKDYRIDGMEGNWLAADEIIIDGRQFFLMEHQDYHRQTTRVILDCYGKKIMEECGNGFDQQTKEILHGYVR
GUT_GENOME169082_00781105-179LSVDTQNYRIDGYSGNWRVTDYIIIDGKQFYLLEHQEFREQAARIILDSYGKYVAETGIGGFDETAKQKIREHIH
GUT_GENOME133503_001701-59MRLHVEGQKFHGHLGTWHVVDTEEIRGHRLYLYEREQEPDGPWIVVDSGGAILCETEYY
GUT_GENOME223479_01162109-191AVIDIETSNYEIKGKKGVWEACDQLLIDGEYFYLMESMTYHNDAAFAILDAYGKLIVSENLKGFDIETIAEIRKSMYERRRMI
GUT_GENOME273500_00790575-650VTEETKGLLVDGHFGTWHTAEARKISGKMFYRMEHDEYGNTVAGIIVDEKGKLAAEDLEHGFDEGAMEAIEEYLQE
GUT_GENOME236884_0015037-108VVTMDTDGLTVDGHFGTWHSIDTMQIGGKDFYLMEHDEYGDEAASIIVDATGKLVAEDLWEGFTPEIVAMIA
GUT_GENOME011297_00641465-527NITLDGYEDTYHVINSDIIEYEQVYLLESETRGENAPHIVINAHGEVRADDMYSLKEYKQAID
GUT_GENOME003825_00310315-388IDMDTKAVELEQHEGLWHTVEEVEVEKEHFYLMEHNEYGASVAPVLVNGDGKVVAQDLENGLDQEAVKAIGEYL
GUT_GENOME159334_010442-81ITGESKRIKVEGHYGTWHVIDDGWYIFTPDTPEGPETITVHCFLLEHDEYGDEAASVIVTQDGRLLAENVCNGFDDLLEA