UHGP-MC 3655
Information
- Number of sequences (UHGP-50): 78
- Average sequence length: 76±7 aa
- Average transmembrane regions: 0
- Low complexity (%): 1.53
- Coiled coils (%): 0
- Disordered domains (%): 0
- Pfam dominant architecture: PF00528
- Pfam % dominant architecture: 64.10
- Pfam overlap: 0.42
- Pfam overlap type: reduced
Downloads
- Seeds: MC3655.fasta
- Seeds (0.60 cdhit): MC3655_cdhit.fasta
- MSA: MC3655_msa.fasta
- HMM model: MC3655.hmm
Sequences list (filtered at 60% identity)
Protein | Range | Sequence (AA) |
---|---|---|
GUT_GENOME277665_01569 | 104-181 | VMNATNIPLMNPDIITGISLMLLFVFAGTLIGLDPILSFRTLLIAHITFSLPYVVLSVLPKFRQMDRSLPEAALDLGC |
GUT_GENOME208070_00705 | 342-414 | NNILLVSPDVIIGASFLIFFTAVGFISLGFTSVLLSHIAFSIPIVVLMVLPKLQEMNDSMVDAARDLGANNVQ |
GUT_GENOME211297_00585 | 100-174 | MEGTNQITMLNADIVTAIAFMLFFLIVQFIPFGWATLIISHTMVCIPYVVMSVLPRLSQLNPNLYEAGQDLGASP |
GUT_GENOME141703_00556 | 96-171 | KRAMEGLLYIPVVIPEIVMGISMLAFFSSLNLPAGLITLILAHITFCISYVIIVVRARLDGFDAALEEAAQDLGAT |
GUT_GENOME243514_00091 | 111-169 | IGIGLQLIFIKLGIAGKIFAIILVHIVYIIPYSVWIMIPGFESFDEDLYNQARILGASK |
GUT_GENOME211116_00610 | 175-240 | DIITGISLFLLFVSLGISQGLTTVVLAHITFCTPYVVLSVMPRLKQMNQNIYEAALDLGATPFQAL |
GUT_GENOME135791_00769 | 102-175 | NDFPMMNPEIVTAIGLMLLFITFKIERGFVTLLLAHIAFCIPYVMLSVMPKIRSLDPNLADAAMDLGATPWQAL |
GUT_GENOME032633_00578 | 1-75 | MSTSKANMLIPDVITGIGLAILFSITIIPLGINLGFATVVLAHISFCTPYAIIIIYPRMQKMNKNLILASMDLGY |
GUT_GENOME261029_00366 | 100-173 | SNLPIINPEIVTGVSLMLIFVLVFSLFGGKLGFLTVLLSHICFNTPYVILNVLPRLRRMDVSLYEASLDLGCSP |
GUT_GENOME000338_04306 | 100-171 | LISSPLLLPQIFLGLALLQFFYYFTNSPQNFFTLVLGHIIITLPYVVRTCVNSFLGISPYIEEAGRDLGAGA |
GUT_GENOME000734_00935 | 99-186 | TVNNIPLTNADIITGVSMMLLFLLAVSAWNGSLGQLTGVKWSMGFLTLLISHIAFDVPYVILSVMPKLSQLDPNIYEAAQDLGDHGFH |
GUT_GENOME218037_00159 | 98-169 | ASMPLLNADIVTGISLMLLFLGMGLRLGYGSILMAHVVFNLPYVILCIMPQFLSLDRSCYDAARDLGATPFV |
GUT_GENOME239123_01154 | 109-189 | NVSQLPMVNPDLVTGITFMMLFAFVNSLLVKLGMSQMGDYVKLLIAHISFSIPYVIFSVMPRLKQSSNLLYEAAMDLGCSP |
GUT_GENOME258557_00613 | 103-176 | METGLTLPIMIPEIILGLAFMVVFNALGLRGNDLRLVLAHTTFCIPYVFLNVKSRLVGMDASVFDAARDLGASP |
GUT_GENOME237880_00497 | 108-184 | KKAINSSLYIPIVIQEVVLGISLLLLFDVCHFPLGIWSMILAHTTFCVPFVVIIMRGRIAGMDMSVEEASMDLGANR |
GUT_GENOME243299_00889 | 108-173 | IPMVNPDIVIGVSLLSLFVICHIKLGYVTLILAHITFNVPYVIFAVLPKLSQLDPNLEEAALDLGA |
GUT_GENOME095713_00347 | 106-173 | NQIPVLNPDIVTAISLMVLFHIFHLNLGLFSLTLAHIIFLIPYVLLSIMPRLNTMESSLPEAALDLGA |
GUT_GENOME123265_00864 | 288-384 | MNKRARGAVMAVSYVPVVNPEIITGVSLMLLFVVFNRQVSAALSWLLQREVDFQGFGLCTLLIAHITFCLPYVIFNVSPKLRQLDPAVYDAALDLGC |
GUT_GENOME133943_00337 | 109-185 | KKRQRMILLNNVPIINADIVTGVTLMIVFSIIFSELGFTSLLLSHIFFCIPYVVLSVLPKLSEIDNNLYDAAVDLGC |
GUT_GENOME215161_00877 | 124-204 | VETFTQLPVVNAEIVMALSLAILFKVLQTSYSFATLLVGHMVLTVAFVYLNVKPKLMQMDPNIYEAALDLGATPWYSLFKV |
GUT_GENOME195449_02085 | 104-169 | VSPEMVMGISLLIFFISVKLPLGFVSMLIAHTTLGLPFVALTVLARLAEFDEHLVEAARDLGATEA |
GUT_GENOME244252_00247 | 107-178 | NLMIYVPIIVPEIVLAVAMLIIFITIGLQLGMGTIIIGHCTFCIPYAVVTIKGRISGDNQSLEEASMDLGAN |
GUT_GENOME040958_00461 | 88-169 | IQNASNIPIISPDIVMGVSLMLLFTTLGVIFNFEMGFFTVLISHICFCVPFVVLNVMPRIKRMDQSIYDAALDLGCNQWQAF |
GUT_GENOME052699_00846 | 99-192 | VNAVNNIPMMNADIVTGVSLCLLFVVFFNGWGAFAGWMNSWQSLVVLPERLTMGFGTLLLAHICFNIPYVILSVGPKLRQMDRNLIDAAQDLGC |
GUT_GENOME157142_00266 | 104-179 | TVNQLPMINSEIVMAVSLMVFFSAFGFIFPQGMTRLIISHVAFCTPYVILSVLPRLSRMDPNMYEAALDLGATPFG |
GUT_GENOME179445_01168 | 105-184 | SEITVVNAEIVTAVGFFLLSIFLRDVVMIPVQKGVGWLIIAHTIITTPYVILTVSPRLNQLNPNLYEAGLDLGAGPMRSL |
GUT_GENOME098792_00641 | 109-190 | PMVMPDIITGISLVMLILQVQALLRGSNLSWLQFERGWFTIWLGHTTLCMAYVTVVIRTRLGELDQSLEEAAQDLGAKPWKI |
GUT_GENOME018544_01908 | 112-189 | LNSAVYVPIVIPEIVLAVALLCIYIKIDFPLGLWSIILGHTTLTLPFVVINVKSRLAGYEKSLEEAAMDLGANQRQTF |
GUT_GENOME149567_00860 | 109-180 | LPMIVPGVVFAVPLMALLVLMGLKKGFWAVVLGNVILMLPYMILTVRTRFLGLDRSVEEASMDLGASGVETF |
GUT_GENOME159235_01732 | 100-178 | NQVPVVNADVVTGFSICILIVVVFGVSKDTYIPLVAGHVVLSAPFVYLAVVPKLKQMDPSLYEAALDLGATPAYALFKV |
GUT_GENOME012655_01621 | 108-179 | LPMMLPGIITGITILSFLQLVGVVQGAFAVVLGHTTFLLGTVLPQVYTRLRQLDRNLLEASQDLGAGGVQTF |
GUT_GENOME047860_01540 | 99-175 | ESLSILPIMIPEIIMGMSLLSVFTMADIPLGLGAIILAHITFCIPYIYLVVRSRLSGIDKSVVEAARDLGAGRARAF |
GUT_GENOME009824_00563 | 167-246 | NNIPLLNADIVTGISIMLIFSLIMKIPGFHYIFGFPTMLIAHMYFCIPYVILNVLPKIESLDSNMMDAALDLGLKPYKAL |
GUT_GENOME063855_01462 | 15-85 | LMYMPILLPGIIMGISFLTFFSLLHLLFGYLTMIIGHSTFCIPYTVIMVYTRMIRFDTSLEDASRDLGASS |
GUT_GENOME234136_00920 | 104-180 | QSIHHTLVYTPLIMPDILIGISLLMLFVAVRVECGFWTILVAHVTFCVSYVVMTVLARLQDFDDRILEASYDLGAGP |
GUT_GENOME189239_01615 | 421-490 | TVNNIPMSSSDTIMGVTFMLLFAAIGLDKGYLTLILAHVTFCTPYVILNVMPKLRQLDKNAYEAALDLGA |
GUT_GENOME130327_02167 | 95-175 | MMGITNIPILNSEIVTGISLMLLFIACRVTLGFSTILLAHITFCIPYVILSVMPKLKQTSKSTYEAAQDLGAGPVYAFFKV |
GUT_GENOME047636_00496 | 111-181 | ELAGQIPILNPEIVTALSLSILVVAVGIGFNYFTLLIGHVVLTIPFVVLSVIPKLKQLDPNVYEAALDLGA |
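
The rows above can be sanity-checked programmatically. The following is a minimal sketch, assuming that the Range column is a 1-based, inclusive span on the parent protein (an assumption, not something the page states) and that each row keeps the `Protein | Range | AA |` layout. The names `Row`, `parse_row`, and `span_matches` are hypothetical helpers introduced only for illustration.

```python
from typing import NamedTuple


class Row(NamedTuple):
    protein: str
    start: int
    end: int
    sequence: str


def parse_row(line: str) -> Row:
    """Split one 'Protein | Range | AA |' table row into typed fields."""
    # Drop surrounding whitespace and any leading/trailing pipe, then split on pipes.
    cells = [cell.strip() for cell in line.strip().strip("|").split("|")]
    protein, span, sequence = cells[:3]
    start, end = (int(part) for part in span.split("-"))
    return Row(protein, start, end, sequence)


def span_matches(row: Row) -> bool:
    """True if the inclusive span length equals the sequence length (assumed convention)."""
    return row.end - row.start + 1 == len(row.sequence)


# Two rows copied verbatim from the table above.
sample_rows = [
    "GUT_GENOME243514_00091 | 111-169 | IGIGLQLIFIKLGIAGKIFAIILVHIVYIIPYSVWIMIPGFESFDEDLYNQARILGASK |",
    "GUT_GENOME243299_00889 | 108-173 | IPMVNPDIVIGVSLLSLFVICHIKLGYVTLILAHITFNVPYVIFAVLPKLSQLDPNLEEAALDLGA |",
]

for line in sample_rows:
    row = parse_row(line)
    print(row.protein, len(row.sequence), span_matches(row))
```

A row for which `span_matches` returns `False` would suggest either a truncated sequence or a different range convention than the one assumed here.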