UHGP-MC 108210
Information
- Number of sequences (UHGP-50): 91
- Average sequence length: 70±12 aa
- Average transmembrane regions: 0.19
- Low complexity (%): 29.87
- Coiled coils (%): 0
- Disordered domains (%): 18.58
- Pfam dominant architecture: PF04829
- Pfam % dominant architecture: 47.25
- Pfam overlap: 0.67
- Pfam overlap type: extended
Downloads
- Seeds: MC108210.fasta
- Seeds (0.60 cdhit): MC108210_cdhit.fasta
- MSA: MC108210_msa.fasta
- HMM model: MC108210.hmm
Sequence list (filtered at 60% identity)
| Protein | Range | AA |
|---|---|---|
| GUT_GENOME147138_01145 | 3852-3948 | VQGALAAAQNKNAMVSATGAATGELAGMMATELYHKDASQLTEGEKETVSTLATLAAGLAGGLTGDSTVSALASAQTGKTVVENNLLTGKDALNMLR |
| GUT_GENOME140815_00385 | 2975-3041 | EVIASAIAKSLYPDVDPSKLTEDQKQTVSTLATLSAGMAGGIASGDVAGAAAGAGAGKNVVENNMLN |
| GUT_GENOME143407_04436 | 267-362 | LQGNSAAAGGAGALSGELAAIYIKNNLYPNIETKDLTEAQKQVIVNLSSLAAGLSGGIAGDSTGSAVAGAQAGKNAVENNAVSCSTLTCLNNPLDL |
| GUT_GENOME142612_01413 | 4041-4122 | KGENVAAQATGAMTGETVGILSHSLYGKTPEELTESEKQNISAWATLASGIAGGLISDNSTGVANAAQAGKVVVENNVFNLA |
| GUT_GENOME141192_01771 | 2505-2576 | IATQMYGKPVSELDETQKQTISTLATLAAGLAGGLTRGSTADTVTAAQAGKTTVENNFLGATSSDKLDKAVE |
| GUT_GENOME143724_04826 | 230-291 | ILKTMYPGKHVSELDESDRQLVSNLATIASGLAGNLAGGDSKSTTTGAQSGKNAVENNALGD |
| GUT_GENOME140939_04368 | 2856-2949 | MQGNNVAAGAAGAATGEMAARAIAGMLYPGVKQSDLSEEQKQTISTLATVSAGLAGGLTGNSSASAAVGAQSGKNAVDNNYLSVSEKTELEIAK |
| GUT_GENOME143527_01924 | 33-98 | EMTAKLALDLYGKKPEQLGEDERQLVSALSSVAAGIAGAAVSDNTASVQTAAQAGKNAVENNAISD |
| GUT_GENOME147130_04577 | 133-210 | GNATVGAVSAFTAEAAAPAIINAMGWDKDHLTEQQKQTVSALGTLAAGLAGGLVGDSSNSAVAGAQAGKNAVENNTLS |
| GUT_GENOME144553_02946 | 2864-2930 | ELAARYIIDNYYGGRTDNLSEQERQQISMLATIASGIAGGLAGNSTSAAGTGAQAGRNSVENNYLSV |
| GUT_GENOME216800_00488 | 112-172 | KNALYGNVPVSQLSEEQKQTLVALGTVAAGLAGGLTGNGTADAVAGAQAGQNEVSNNMMSM |
| GUT_GENOME231923_04594 | 3123-3203 | GNSALAGAAGAVSGEQMARLVMKQLYPGREVSDLTETEKQTISTLGTLAAGLAGGVVGDGSGNAVAGAQAGKNSTENNFLG |
| GUT_GENOME232344_04441 | 1661-1724 | QVISERLYGTRDSSTLNEAQKQTITALASLAGGLAGSVVDGSSGGAIAGAAGGKNATENNFLGG |
| GUT_GENOME095841_01103 | 2570-2636 | FLASTLYDKSPEKLSEEEKRTVSSLSQVAAGIAGGSLSDSSDGAIIAAKTAKDSVENNSMADDVHPS |
| GUT_GENOME207259_00152 | 401-475 | LMAEMYPGKTASQLSEEEKQKVSALATLAAGLAGGLAGDDTASALAGAQTGKNAVENNSLSVDQNQDRIKELTQC |
| GUT_GENOME143496_03654 | 23-83 | IAEALYPGKDIKDLSEEEKQGVVALATLASGIAGGVVGGDVSSGIDGAKGGKNEAENNALG |
| GUT_GENOME060658_01487 | 650-751 | MAHALLSAVEFQVTGKDPLTGAIAGVTGEATAEIIARAYGKPVSELTANEKENISTLSQLAGGLASALTAKANGTTTEQGGNFLAATAGAETAKRAVENNFA |
| GUT_GENOME143092_03473 | 363-424 | MIAKTLYGTDDYSKLDETQKQTISALSMLASGLAGGLVADSAASAVAGAQAGKNTSENNDMF |
| GUT_GENOME216800_03206 | 3513-3588 | GAAGAAGGELAAKAIVKVMYPDTDIRDLTESQKQTVSALSQMAAGLAGGIASDSALGGGTGAGAGKNAVENNTLSP |
| GUT_GENOME121634_00177 | 2360-2459 | LAHAVWGAIEAQVNGGSAAHGAISAAGAELLAPQIASILYGKSEADLTPDEKAGVISMASLAGGIAGAIMNGKSEGVEIIGNTAINAQIAENTVTNNYLS |
| GUT_GENOME141707_00250 | 2326-2404 | ALAGAAGAVSGELLGRWIATEYYPDVKTEELSDEQKSTISALSTLAAGLMGGLSGGSSADAVAGAQAGKNAVENNLLGG |
| GUT_GENOME207741_00112 | 3250-3328 | AVAGGLGAGSGELSAHVILNTLFPGKKVSDLTESEKQQVSALSQLAAGLAGGLTTGDMAGAITGSQAGKNAVENNYLSD |
| GUT_GENOME208088_01434 | 1139-1229 | LSGGSTADGLKAGAIGAITASAMTDHLVSALYGDKKSSDLTPEEKRLVSSLVSIAGGLAGAAVTDGSVSMAAMASETAKVEVENNSLSVVI |
| GUT_GENOME031612_01605 | 2847-2910 | RIIAEQLYPDTRPENLSEQQKQRVSTLSSLAGGLAGGLVSGNSDGAVTGAKTAKNAVENNAISG |
| GUT_GENOME143489_00975 | 2618-2709 | MQGNSATAGALGGGGGELAARIYMDQVHPGKKVSDLSEADKRIVSAIGTLTAGILGGLSTDSSTGLITGAQAGKNAVENNALSVAQTQSLIK |
| GUT_GENOME096136_03155 | 3470-3550 | KGENAGAQSLGAFTGEAVGMLSEKLYGKEPSQLSESEKATVSAFASLAAGIAGGLVGGDTSTAANAAQAGKTTVENNLLSN |
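
The Range column appears to give 1-based, inclusive residue coordinates within the parent protein, so each span should be as long as the sequence in the AA column. The sketch below checks that assumption for the first two rows of the table; `parse_row` and `span_matches_sequence` are hypothetical helper names introduced here for illustration, not part of any UHGP tooling.

```python
def parse_row(line):
    """Split one Markdown table row into (protein_id, start, end, sequence)."""
    cells = [c.strip() for c in line.strip().strip("|").split("|")]
    protein_id, span, seq = cells
    start, end = (int(x) for x in span.split("-"))
    return protein_id, start, end, seq


def span_matches_sequence(line):
    """True if the range length equals the sequence length.

    Assumes 1-based inclusive coordinates (an assumption, not stated on the page).
    """
    _, start, end, seq = parse_row(line)
    return end - start + 1 == len(seq)


# First two rows of the table above, copied verbatim.
rows = [
    "| GUT_GENOME147138_01145 | 3852-3948 | VQGALAAAQNKNAMVSATGAATGELAGMMATELYHKDASQLTEGEKETVSTLATLAAGLAGGLTGDSTVSALASAQTGKTVVENNLLTGKDALNMLR |",
    "| GUT_GENOME140815_00385 | 2975-3041 | EVIASAIAKSLYPDVDPSKLTEDQKQTVSTLATLSAGMAGGIASGDVAGAAAGAGAGKNVVENNMLN |",
]

for row in rows:
    protein_id, start, end, seq = parse_row(row)
    print(protein_id, end - start + 1, len(seq), span_matches_sequence(row))
```

The same check can be run over every row to spot entries where the range and the sequence disagree, which would suggest a truncated sequence or a different coordinate convention.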