UHGP-MC 108210


Information


Number of sequences (UHGP-50):
91
Average sequence length:
70±12 aa
Average transmembrane regions:
0.19
Low complexity (%):
29.87
Coiled coils (%):
0
Disordered domains (%):
18.58

Pfam dominant architecture:
PF04829
Pfam % dominant architecture:
4725
Pfam overlap:
0.67
Pfam overlap type:
extended

Downloads

Seeds:
MC108210.fasta
Seeds (0.60 cdhit):
MC108210_cdhit.fasta
MSA:
MC108210_msa.fasta
HMM model:
MC108210.hmm

Sequences list (filtered 60 P.I.)

Protein Range AA
GUT_GENOME147138_011453852-3948VQGALAAAQNKNAMVSATGAATGELAGMMATELYHKDASQLTEGEKETVSTLATLAAGLAGGLTGDSTVSALASAQTGKTVVENNLLTGKDALNMLR
GUT_GENOME140815_003852975-3041EVIASAIAKSLYPDVDPSKLTEDQKQTVSTLATLSAGMAGGIASGDVAGAAAGAGAGKNVVENNMLN
GUT_GENOME143407_04436267-362LQGNSAAAGGAGALSGELAAIYIKNNLYPNIETKDLTEAQKQVIVNLSSLAAGLSGGIAGDSTGSAVAGAQAGKNAVENNAVSCSTLTCLNNPLDL
GUT_GENOME142612_014134041-4122KGENVAAQATGAMTGETVGILSHSLYGKTPEELTESEKQNISAWATLASGIAGGLISDNSTGVANAAQAGKVVVENNVFNLA
GUT_GENOME141192_017712505-2576IATQMYGKPVSELDETQKQTISTLATLAAGLAGGLTRGSTADTVTAAQAGKTTVENNFLGATSSDKLDKAVE
GUT_GENOME143724_04826230-291ILKTMYPGKHVSELDESDRQLVSNLATIASGLAGNLAGGDSKSTTTGAQSGKNAVENNALGD
GUT_GENOME140939_043682856-2949MQGNNVAAGAAGAATGEMAARAIAGMLYPGVKQSDLSEEQKQTISTLATVSAGLAGGLTGNSSASAAVGAQSGKNAVDNNYLSVSEKTELEIAK
GUT_GENOME143527_0192433-98EMTAKLALDLYGKKPEQLGEDERQLVSALSSVAAGIAGAAVSDNTASVQTAAQAGKNAVENNAISD
GUT_GENOME147130_04577133-210GNATVGAVSAFTAEAAAPAIINAMGWDKDHLTEQQKQTVSALGTLAAGLAGGLVGDSSNSAVAGAQAGKNAVENNTLS
GUT_GENOME144553_029462864-2930ELAARYIIDNYYGGRTDNLSEQERQQISMLATIASGIAGGLAGNSTSAAGTGAQAGRNSVENNYLSV
GUT_GENOME216800_00488112-172KNALYGNVPVSQLSEEQKQTLVALGTVAAGLAGGLTGNGTADAVAGAQAGQNEVSNNMMSM
GUT_GENOME231923_045943123-3203GNSALAGAAGAVSGEQMARLVMKQLYPGREVSDLTETEKQTISTLGTLAAGLAGGVVGDGSGNAVAGAQAGKNSTENNFLG
GUT_GENOME232344_044411661-1724QVISERLYGTRDSSTLNEAQKQTITALASLAGGLAGSVVDGSSGGAIAGAAGGKNATENNFLGG
GUT_GENOME095841_011032570-2636FLASTLYDKSPEKLSEEEKRTVSSLSQVAAGIAGGSLSDSSDGAIIAAKTAKDSVENNSMADDVHPS
GUT_GENOME207259_00152401-475LMAEMYPGKTASQLSEEEKQKVSALATLAAGLAGGLAGDDTASALAGAQTGKNAVENNSLSVDQNQDRIKELTQC
GUT_GENOME143496_0365423-83IAEALYPGKDIKDLSEEEKQGVVALATLASGIAGGVVGGDVSSGIDGAKGGKNEAENNALG
GUT_GENOME060658_01487650-751MAHALLSAVEFQVTGKDPLTGAIAGVTGEATAEIIARAYGKPVSELTANEKENISTLSQLAGGLASALTAKANGTTTEQGGNFLAATAGAETAKRAVENNFA
GUT_GENOME143092_03473363-424MIAKTLYGTDDYSKLDETQKQTISALSMLASGLAGGLVADSAASAVAGAQAGKNTSENNDMF
GUT_GENOME216800_032063513-3588GAAGAAGGELAAKAIVKVMYPDTDIRDLTESQKQTVSALSQMAAGLAGGIASDSALGGGTGAGAGKNAVENNTLSP
GUT_GENOME121634_001772360-2459LAHAVWGAIEAQVNGGSAAHGAISAAGAELLAPQIASILYGKSEADLTPDEKAGVISMASLAGGIAGAIMNGKSEGVEIIGNTAINAQIAENTVTNNYLS
GUT_GENOME141707_002502326-2404ALAGAAGAVSGELLGRWIATEYYPDVKTEELSDEQKSTISALSTLAAGLMGGLSGGSSADAVAGAQAGKNAVENNLLGG
GUT_GENOME207741_001123250-3328AVAGGLGAGSGELSAHVILNTLFPGKKVSDLTESEKQQVSALSQLAAGLAGGLTTGDMAGAITGSQAGKNAVENNYLSD
GUT_GENOME208088_014341139-1229LSGGSTADGLKAGAIGAITASAMTDHLVSALYGDKKSSDLTPEEKRLVSSLVSIAGGLAGAAVTDGSVSMAAMASETAKVEVENNSLSVVI
GUT_GENOME031612_016052847-2910RIIAEQLYPDTRPENLSEQQKQRVSTLSSLAGGLAGGLVSGNSDGAVTGAKTAKNAVENNAISG
GUT_GENOME143489_009752618-2709MQGNSATAGALGGGGGELAARIYMDQVHPGKKVSDLSEADKRIVSAIGTLTAGILGGLSTDSSTGLITGAQAGKNAVENNALSVAQTQSLIK
GUT_GENOME096136_031553470-3550KGENAGAQSLGAFTGEAVGMLSEKLYGKEPSQLSESEKATVSAFASLAAGIAGGLVGGDTSTAANAAQAGKTTVENNLLSN