UHGP-MC 96562


Information


Number of sequences (UHGP-50): 51
Average sequence length: 67±7 aa
Average transmembrane regions: 0.02
Low complexity (%): 1.24
Coiled coils (%): 0
Disordered domains (%): 2.26

Pfam dominant architecture: PF04055
Pfam % dominant architecture: 196
Pfam overlap: 0.43
Pfam overlap type: reduced

Downloads

Seeds: MC96562.fasta
Seeds (0.60 cdhit): MC96562_cdhit.fasta
MSA: MC96562_msa.fasta
HMM model: MC96562.hmm

Sequences list (filtered 60 P.I.)

Protein                 Range    AA
GUT_GENOME057169_00368  1-69     MRVENIKIGERYTYKELCDTLGVKCSEATNKKVEFLEKLESYCKYERPDNRHFVVTEIFETPLPTLDDG
GUT_GENOME260001_00805  14-69    NKTNEVKNYKKMCEILHEKEKRGKSEKGKQFDVWKMYFEFHNNGQKFIIDKVLDEN
GUT_GENOME247942_00561  6-87     LSQLQIWEYYKYSELCALLGEPEKQGNSKKAQEKEWARYFEFEKIGKNKGTRYIVDDIYDVEKRKAPSGNSKYIQMIESLLS
GUT_GENOME207518_03484  15-85    NKEFKYKELCGILNLKEKGRGKSRTLQLKDWERYFNFYKPNSKGQKYIIKEFYTGVKPKINNRGKSEGSRG
GUT_GENOME000716_02512  78-144   LTNGLIFKNWKEVCGFMGWKETGGDYKKAKLKELDTMCKWHKEGNKIVVDEVYENNVKFEDNTRARK
GUT_GENOME212988_05068  29-88    IIKNYKIMCELLNEKVSEGGSTKKAQINRWKRYFEFHKNGQKYIIDKIYDNPLIATDQRT
GUT_GENOME269865_00124  8-74     LNKEYTWKQICELTNIPYKTGNTKIKQMKQFESLCKFTKEKTKFTIHEIYSMPKEIEDGRKNNRGSD
GUT_GENOME191999_00150  2-65     KKINNGQKFKNYKALCEFLEEPVKTGKSKQLQMKDWERYFQYHKEGNAIIVDAVHDEVLKKKRK
GUT_GENOME019136_00014  13-89    SENINRVVENQIFPNYRALCSYVGQPVLSGNSKKAQIKQWQRYFDFSKDGQKIKIKKIYKSSKNKVDNRKFNGKSTY
GUT_GENOME125783_00537  5-71     LKVGEYSYKEMCELFHEKQKTGKSKQLQLKHWELYCRYERPTNRLFVVNEVYKTPLERIDGRKTNGR
GUT_GENOME259495_02151  8-71     GQKYTYKKMCELLDEPYKGGTAKAAQLKEWSRLYKLQKDGTKYTVVQKYDEPLPKEMRKSTSSF
GUT_GENOME227291_04951  5-71     NLHIGDVIKNYKLLCEIVEEECKAGQSKVCQLNNFERYFTWERKGQKYIITNIYDEPLKKNRVNNNL
GUT_GENOME188395_00571  11-79    KEKISDYVGVSLRYKDLTDLLGLKYYRGGNARDSQIKEIERYVDMEKNSTKYLINGIKAVPDEKIDKRT
GUT_GENOME142395_01096  11-74    YKNYKELCETLNEDIKTGKSKQLQLKNWERYFTWNKDGYGFIVTEIYSTPKKKIDGRGKSVGSR
GUT_GENOME033834_01240  11-88    VDVSQLEIGMVVKNYKEMCSILGEEPKSGCSKKSQMKNWERYFEITKVKGSQKMMVSDIYDEPLTVVDGRSRGNNSIY
GUT_GENOME090392_02204  3-73     VSCLETGMVVKNYRTLCELLGVKPVTGKQKKLQLKEFSRYFEYEKIKGSQRIIIVEIYEQPKEKTDSRKNG
GUT_GENOME207533_01276  2-68     SNLKEGQVFKNYKELCGFLGWEITNGNSKKKQMDTLSTMCEWYKEGNKIVVDEVYEKQLAKKDGRKN
GUT_GENOME225917_01382  5-72     KIYKNYKELCNAIGWDIKGGKSKQLQLKDLERYCVYEKNGIKFDVKEIFDIPKIKEKNNRNNKYGDNL
GUT_GENOME104752_01116  1-73     MKIENLTVGQEYKYKELTETLGVKYLGGKQKINQLEDFRRFFNYEKNGTKFLIKEIYTMAKEKQDKRRLGNNN
GUT_GENOME000598_03207  2-73     DTLNLKKIKIGQVIKNYKELCRLCGWKVTTGTAKKAQLKELARYFKWHKDGNKFVIDEIYDTPLAKIDNRKY
GUT_GENOME143578_00281  15-80    LSEGLEVKNYRALCELVGEEPKTGNGKKAQLKEWERFFAMEKVKGSQKMIVTEIYERPKERKDKRL
GUT_GENOME064085_02063  3-81     IDNLKVGQVIKNYKELCELLDIEAKGGASKIAQHKEFDRYFEYEKQGHKYIITKIYENPKERIDNRMNGNNTVFADDIE
GUT_GENOME262622_01604  2-81     VDVTKIEIGKIYKYGQLCELFREDRKSSNSKSSQLKEWERFFRWQNITAQKYRIEEIYDIPKEKMDGRKKNGGNSTSKYL
GUT_GENOME265945_00002  162-241  IGKTVKNYKVLCELLGQEVKDGKAKKYQLENFKRYFEWEKTGQKFIIMDIYDTPLEKEDLRKLGNNSIYTQAIEVILLQY
GUT_GENOME207758_00621  10-70    VKNYKELCEFFGVEPKAGGQSKISQIKDLERMGDFKKDPKGYGYTIYSKYKEIKPKETARG
GUT_GENOME076603_01281  21-78    KLFEGKVFKNHRALCEEFGWEYINSTNSKKAQLKVLSQYCELHKQRQKIIIGKIHESV
GUT_GENOME031091_01145  1-69     MNLNKLSNGQVVKNYKTMCEILEEPYMGGKSRVIQLDKWRLYFDYGKDGVKFIINDIYTQEKTNKLKQA
GUT_GENOME010629_01473  6-72     SIATGEVFRNYSELCTVLDEPHKTGKSKQLQLLDWQRYFTYEKQGHKFIITEIFEEPKEKVKKQNTK
GUT_GENOME012770_00208  171-232  VKNYKELCELLHEKPLKSGNNSHKAQQKRWARYFSMEKGRGRSIVILDIYDEPLPADDKRKA
GUT_GENOME141243_02789  2-68     LQSGMKFNNYKILCEHMSWKIYKSGSNSYKAQMKQLDSLCKWHKEKRIFIIDEVYTKPLVKKDNRRK
GUT_GENOME034537_00630  16-76    AVGKSLKYRELCELINEDAKQGNSKASQLKRIERYLRMDRPTTHTYVIREVYPTPLDEVDG
GUT_GENOME007399_01086  13-76    LVDNQSELNYRELTNFLELPYLRGCSKDKQLSELSKICKIEKNKTKYKITEIYNSALIKKDGKS
GUT_GENOME195312_01380  19-68    YVELCKIIKEPAKKGKSKQLHIKRIEKYTDIEKTGSKYIIHKVYENDDEM
GUT_GENOME227708_02357  6-67     LKHGQTLKNIKELCEVLKIPYKDSTDSRKAIFKDLDRYCKYHKEGRKIVIDEIFQKEQPKVD
GUT_GENOME225811_00086  14-63    YADLCEIVAEQKTQGGRNRKLQLERWLTYFEFEKKGQTYIVTKLKKPPLI
GUT_GENOME000760_01523  1-79     MKIENLILKHPYKYKELCEVLGIQPTSKANNSRIAQFKELERYCEFYKEGHKIIITSIKNTVGVKMDKRKLVKETDKRR
GUT_GENOME001757_00882  11-85    VDTSRIEEGQIVKNYRQLCDLLGEERTTGNAKKAQMANWERFFSITKIKGTQKMRIDEIYIEPLERVDGRAKGGR
GUT_GENOME103765_01936  7-68     SKLHPGMIVKNYKELCKILGIEPKTSNSKIRQLKEIEELISYQKSGHAFIIKEVYDKKTLKF
GUT_GENOME001141_01561  30-98    IENLEVGKVIKNYKELCNVMGWDVVSGNAKKSQIKELDRYCNYHKEGNKFVIDEIYENPLPKIDARLNG
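
Each row in the list above carries three fields: a protein ID, a residue range, and the amino-acid sequence for that range. The following is a minimal Python sketch for splitting a row into those fields and checking that the range length equals the sequence length. It assumes the ID layout is `GUT_GENOME`, six digits, an underscore, then five digits; that layout is inferred from the rows here (it is the split under which the range length matches the sequence length) and is not stated by the source. The names `ROW`, `parse_row`, and `range_matches` are hypothetical helpers, not part of any UHGP tool.

```python
import re

# Assumed row layout (inferred, not documented by the source):
#   GUT_GENOME + 6 digits + "_" + 5 digits, then "start-end", then the sequence.
# Whitespace between fields is optional, so rows with fused fields parse too.
ROW = re.compile(r"^(GUT_GENOME\d{6}_\d{5})\s*(\d+)-(\d+)\s*([A-Z]+)$")


def parse_row(line):
    """Split one table row into (protein_id, start, end, sequence)."""
    match = ROW.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognised row: {line!r}")
    protein_id, start, end, sequence = match.groups()
    return protein_id, int(start), int(end), sequence


def range_matches(start, end, sequence):
    """True when the inclusive 1-based range has the same length as the sequence."""
    return end - start + 1 == len(sequence)


# Second row of the list, written with separated fields.
example = parse_row(
    "GUT_GENOME260001_00805 14-69 "
    "NKTNEVKNYKKMCEILHEKEKRGKSEKGKQFDVWKMYFEFHNNGQKFIIDKVLDEN"
)
print(example[0], example[1], example[2], range_matches(*example[1:]))
```

The length check is a cheap way to catch a wrong ID/range split: if the ID were read with a different number of digits, the range would no longer agree with the sequence length.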