UHGP-MC 16979


Information


Number of sequences (UHGP-50): 84
Average sequence length: 62±7 aa
Average transmembrane regions: 0
Low complexity (%): 1.16
Coiled coils (%): 0
Disordered domains (%): 0.23

Pfam dominant architecture: PF18809
Pfam % dominant architecture: 5000
Pfam overlap: 0.27
Pfam overlap type: shifted

Downloads

Seeds: MC16979.fasta
Seeds (0.60 cdhit): MC16979_cdhit.fasta
MSA: MC16979_msa.fasta
HMM model: MC16979.hmm

Sequences list (filtered at 60% identity)

Protein Range AA
GUT_GENOME065702_01513 206-265 FFGKVYNQFKGKAKEAVNFLMRHRSGKLLGVFHRNDIGEISLVWGDEKGGLAHIISKHIV
GUT_GENOME013144_01341 60-120 LIGPEFKGVRRQAAVRKLLEEKRGHVKAAFHRKDIGDIDLIWGNDKVGLNHILKRRAEQGV
GUT_GENOME115794_01608 66-125 FGENYSEYAGKPKEALNFLIDKKSGQVRNAWERDDIGDIDIVYGKKAFGLRHIIGKHVGI
GUT_GENOME112019_01023 68-117 EYTGVKGKDAIDKLMKEKQGFVRDAFSRVDIGNVALIWGNKDIGLCHIIK
GUT_GENOME107732_00834 723-790 TPIGESDFGFVYDQFKGNAQGAIQQLMKMQDGEALGALHHDEIGDIDLVWGKAGTKKSDGYGLAKLVK
GUT_GENOME096417_00240 1028-1085 PIADFGENFSEFRGQGAKAVEKVLQERSGQVQGAFYREELGEIDVVWGDEKIGLQKIV
GUT_GENOME269292_02042 147-215 APDEWGTAYTSMSGREADAIRHLLEKKNGFVPAAFHREEIGDIDVVYGRTGKGLKDEGGYGLAHILKRH
GUT_GENOME044727_00758 54-115 EQFYGEEIKGKNLKGRRALFKMLEERKGFIRGAFHRDDIGDIDLVWGDSEAGLEHIIQRRMD
GUT_GENOME260826_00234 59-119 SETDFGIDYKQYKGNPKEAILYLCETKKGQVKGVWEREDVGSIDLIWGNRNQGLRHIIKKH
GUT_GENOME143418_00774 1482-1568 KELEDFGINFEGFKGKEAVLKLLEEKRGQVKGAFYKEGLGEIDLVWGDDSFGLKHILNKHGGEFENLARELSEAVENGKIVKDDKGR
GUT_GENOME222573_01369 768-838 DPFGPAYTEFSGKSAEAIEHLMRVREGHVPAAFHKEGLGDIDLPWGRSGKKGYGLAHIIERREQLGMDGEA
GUT_GENOME239236_00612 345-423 RKHIETRTNTTPLKEFGTNYAEFYRDGQGAVKKLLAEKQGQVAGAFYRDDLAKATGTGEIDLVWGDSTKGLQHILERRA
GUT_GENOME130507_01747 2656-2736 RAEDAAEFLEMRRGGDILGVFHRDEVGDIDMVWGDVNGGFAHILDKHVGENKSFRTAKEAAQSINDIIASGDIASMTPDKI
GUT_GENOME140785_00329 834-887 EFKGDGLGAINKLLETKKGFVAGAFYKEGLGDIDLVWGNKDYGLEHILKRRIES
GUT_GENOME187628_02026 180-246 LGPMMPEYNGKPSEGLACLRQAKTGAIPALFYHKDFGDLGIMYGKPGKPNREYADGYGLAHIDAKHP
GUT_GENOME010197_00611 531-610 SAKELMGKEYKGYTGQKAIDKLLQEKNGHIKNAFTREDIGGISVLWGDDKAGLKHIITQRTKQGFTQEKLDKFFSELGNV
GUT_GENOME237872_00557 287-350 LELIFNFKPIKEYGENLAEYYHQPYEAAREINYTKKGQIVGAFYREELGDIDLIFQRIKIKDSD
GUT_GENOME143153_00903 76-147 EFGTNYAEFYHKPIEAFKKLQAEQSGQIAGAYERKELGDIDLVWGKVWRDEKGEIQGYGLSKILDKHPEITP
GUT_GENOME237141_00433 219-287 FGKEFKGYSGAKAIEKLLLEKQGFVQGAFSRNDIGDIDVVYGEVTDPKTHKGYGLAHIFDKHPEVTPEI
GUT_GENOME113475_00975 160-217 FGQPFRGYHGTEAINKLKNEWKGYIPSAFHRDDIGNIDLIYGNECIGLKHIIKERIGR
GUT_GENOME142628_01014 32-98 QGIKRKQYPRLDKFGTNYSQFENEPEKAIEFLLKRKQGQVIGAWEREGLGKIDIVWGNDKRGLQHIR
GUT_GENOME096338_00958 283-346 LKLQHATTPLKEFGRNYPEFALKPKEALEKLLQEKNGQVAGAAYREDLGGIDFVWGTPKTKDSV
GUT_GENOME219868_00394 170-221 GKAYSGYTGQKAVRKLMREKQGYVPNAFKRKDMGNISLVWGTENLGMQHFIM
GUT_GENOME244371_00046 72-124 LGKEFIGYKGKSAIDKLLKEKRGYIKGAFKNPTLGDIALVYGNEKLGLRHIIL
GUT_GENOME243008_00689 1306-1372 QGEFGPIYDQFKGKAKEAVEFLLRKKNGEAVAALHHKDIGDIDLVWGKAGTAHSDGFGLAKLSMYHP
GUT_GENOME239130_00877 47-107 KDYPEYKGKGQKAVDFLIRQKNGCVNGAFYRKDVGSIDIVYGEVHDPVKHTGYGLAHIIDK
GUT_GENOME096039_01655 970-1043 REAIENALNIKPIKEFGTNYAEHYHSGESAIAKLISEAQAHKESGAKGEYKGQVAGAFHRKELGDIDLVWGEVT
GUT_GENOME044585_00565 52-116 RYAKDELKVFGRNYFKYAGKPKEAIDFLLKERNGQVIGAIEREGLGKIDIIWGDERKGLRHIRKR
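
In each entry of the list, the range spans as many residues as the AA fragment is long. A minimal Python sketch of that consistency check follows. It assumes the three fields of a row are written separated by whitespace and that ranges are 1-based and inclusive. The helper names `parse_row` and `range_matches_sequence` are hypothetical and not part of any UHGP tooling.

```python
def parse_row(row: str) -> tuple[str, int, int, str]:
    """Split a 'protein range sequence' row into (protein, start, end, sequence)."""
    protein, span, seq = row.split()
    start, end = (int(x) for x in span.split("-"))
    return protein, start, end, seq


def range_matches_sequence(row: str) -> bool:
    """True when the range (assumed 1-based, inclusive) covers len(sequence) residues."""
    _, start, end, seq = parse_row(row)
    return end - start + 1 == len(seq)


# Two rows taken from the list above, with fields separated by single spaces.
rows = [
    "GUT_GENOME065702_01513 206-265 FFGKVYNQFKGKAKEAVNFLMRHRSGKLLGVFHRNDIGEISLVWGDEKGGLAHIISKHIV",
    "GUT_GENOME112019_01023 68-117 EYTGVKGKDAIDKLMKEKQGFVRDAFSRVDIGNVALIWGNKDIGLCHIIK",
]
results = [range_matches_sequence(r) for r in rows]
print(results)
```

A row for which the check fails would suggest that the protein identifier and the range were split at the wrong position.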