UHGP-MC 16761


Information


Number of sequences (UHGP-50):
70
Average sequence length:
66±7 aa
Average transmembrane regions:
0.02
Low complexity (%):
4.2
Coiled coils (%):
0
Disordered domains (%):
2

Pfam dominant architecture:
PF04233
Pfam % dominant architecture:
7429
Pfam overlap:
0.1
Pfam overlap type:
shifted

Downloads

Seeds:
MC16761.fasta
Seeds (0.60 cdhit):
MC16761_cdhit.fasta
MSA:
MC16761_msa.fasta
HMM model:
MC16761.hmm

Sequences list (filtered 60 P.I.)

Protein Range AA
GUT_GENOME061278_00975243-300MYPGGFGVPHEDINCRCVLLQRARWALGEGEVTKMNNETKEIVKIKEKDFKSFKKRYN
GUT_GENOME066290_01885246-314MYPGGFGSASEDIHCRCAVLQRARWAVDDEDNSFTKWDGVNHELLNIKAKNYENFKKAYRNMVDKNQTL
GUT_GENOME027131_00591257-325GLASEDCNCRCCLLQRARWALDDEELQTLKDRAAYFDLDKTKEFEEYQRKYLNLPENADTIDINSIDIV
GUT_GENOME259106_02297237-294AMYPGGFGIAKEDINCRCCVNQRARWALGSERYKYSRFSGDIVSIKSGEYKEWKEKYL
GUT_GENOME014239_00751182-242IIHPRCVNFLTEISNYTLDDDELKTLQERAAFFGLDKTQSFNDFKQKYLKLPDNADIMNVK
GUT_GENOME234273_01395244-339TAMYPGGFGIAAEDINCRCVVLQRARWALDQSEVEKAVGNLDGMTDEQLEELAKKLGVSKDELIKGSNGIIESDGSINHRIKAQNYNQFKKKYQKK
GUT_GENOME267682_00397261-327MYPGDPSGSAAEVINCRCVLLQRAKWALDQKELDRLKERASFYGLDKRKSFDEFNKKYIGTVENSKG
GUT_GENOME085630_0047940-116MYPGDPKGKAAEVINCRCSLDDVPRWYAQQGGHCFRRDNETGEIIECKNYAEFKEKYLKVSKGLSANGNKKSKGNAP
GUT_GENOME043903_01313259-326MYPGGFGIASQDVNCRCALLQRARWALDADELKTLKERAEYYGLDKTNDFDEFRQKYYKIEDFMKEQS
GUT_GENOME107810_00274256-335MYPSDPAGGAAEVVNCRCALLQRAKWALDDDELETLKDRAKYFKLDKTENFEDFKKKYLKAAENVASQATEKSSKTVDKT
GUT_GENOME242963_00132151-224MFPSDFGIAREDVNCRCVMLHRARWALDEDELDTLKERAAFYGIDKASSFEDFKSKYINIAEIVTEDNKKTAKA
GUT_GENOME025433_00733244-317YPGAFGRPEEDCNCRCVALTRARWGLDEAELQTMKDRAKFFGLDKTEGFREFEEKYLKAAEESEKVFYNQERIT
GUT_GENOME017700_00352257-325PGIGGSAANVCNCRCALLQRARWALDDKELDILKERAEYFGLDKSKDFEEYKAKYLQLSEHVREDARNM
GUT_GENOME251368_01442262-341MFPGDPNGGAAEVVNCRCTSNTRARWALGEEELQTLKERAEYFGLDKTENFKEFEEKYLKAAKTLENDGKSGTIKLSDIL
GUT_GENOME266522_009994-76MQPGDFGDPAEDCNCRCALLQRARWALGNDYTKWSPDAPVEISDDGTTQFVKVKALNYNDFKGKYKKACNDLQ
GUT_GENOME153320_00238244-309MYPCDPNGSASEVCNCRCALLQRATWALGQKDLDKLKERAEYYKELNESFNKDESFKDFENRLKKI
GUT_GENOME225090_00178241-318YPGDEKGSAAEVINCRCVALTRARWGLDEDELKELERRAAYYDLDKSENFEDYKKKYLKASEQKAVDIQKGSKSTIDE
GUT_GENOME110728_00578252-314YPGDFGRPEQDIHCRCCALQRARWALGEKELEQLKERAKYLELDQAKDFGEFQQRYLEAQNNG
GUT_GENOME108393_00114240-296LPGDKNAPAAEIVNCRCKAVFVPRWEVDSKAPRLRMDNIEKKVIEADNFQDWRQKYY