UHGP-MC 12895


Information


Number of sequences (UHGP-50):
110
Average sequence length:
69±8 aa
Average transmembrane regions:
0.06
Low complexity (%):
2.33
Coiled coils (%):
0
Disordered domains (%):
6.23

Pfam dominant architecture:
PF00145
Pfam % dominant architecture:
74.55
Pfam overlap:
0.21
Pfam overlap type:
shifted

Downloads

Seeds:
MC12895.fasta
Seeds (0.60 cdhit):
MC12895_cdhit.fasta
MSA:
MC12895_msa.fasta
HMM model:
MC12895.hmm

Sequences list (filtered at 60% identity)

Protein                 Range    AA
GUT_GENOME036466_01639  140-205  YSIAYRILDAQGFGVPQRRRRVFVVGHLGRDWRQSAAVLFESEGMCRNTSAIQKTRAKDSRLAGTL
GUT_GENOME120297_00875  151-222  RWARSGMVRSKRCHLAWRVLDAQYWGVPQHRERIFLIACFGNRGGRPEVLFESEGMSGYFAESQSKKETITR
GUT_GENOME030077_01115  140-220  GGGRKHPNSGVVSGPKRTVAWRVLDAQWHRVPQRRKRVFVLAVAGAGNWASADALLPVGERVPGNLEACRQAWKEAAGNAG
GUT_GENOME015313_00536  162-235  GCIVGDDFSVAWRTLDAQYFGVAQRRHRIYLVADFRKECAGEILFNFESMSGYSPQSLCSWERTPNDAKDSIRA
GUT_GENOME011357_01099  112-189  DSGRWSKGGLVRSPKCNVAWRVLDCQYWGRPQRRARIFLVAGLGADSRPEVLFEPESLSRNTEESQDEGKSTARDTAQ
GUT_GENOME227890_01970  155-233  GEKWTNVGYVSGPKRTIAWRILDAQYFGVAQRRRRVFVVASARTDICPAEILFEYDSMHGDITPGGEEGETVAALTKNG
GUT_GENOME140366_00392  146-214  RKWPNAGHVAGPRRTVAWRVLDAQHCGVAQRRKRVFVVASTRKRCAGDVLFEYGSPRRDQAPRRPAWQA
GUT_GENOME215559_00747  164-240  WPDADYYLGDGWSVAYRVLDAQWWGVPQRRKRIYLVADFADHGAPKVLFESEGVSGYSAEGFRAWQRTASGAESGAG
GUT_GENOME001599_00155  59-136   WRTAGCVMGDGWSIAWRVLNAQFWGVPQSRRRIALVADFGGESAPEVLFERESVSGNSEESRAKQEGASGVSAVSVGD
GUT_GENOME261921_00150  120-190  GAILADHYSLAWRTMDAQHWGVPQRRLRISLVLDLTGGRAGEILFEPESLRGHFAPGVTPGQATAGAVENG
GUT_GENOME081986_01212  164-232  TVIWRTLDAQYWGVPQRRRRIFLVADYTGGGGTEEILFKPESLHRNTEQGNEEGKEITGHTKTGTRSTG
GUT_GENOME239661_01776  279-349  GYLLADGFSIAWRTVDAQFFGVPQRRRRIALVADFAGLCAPEILFIENGLQRYYPPGQSTAEAIARSLAAS
GUT_GENOME161981_01021  156-229  KWEKSGAIILDEGDSFSLAWRTLDAQFWGVAQRRRRIFLVADFAGQGALQILSEPESLSWNPETSRGTREGTSW
GUT_GENOME168909_01312  151-228  WAPAGVVRGRRCDIAWRVLDAQYWGVPQRRKRIFLVADFAKHGRCADKILFEPKSVQGDITQGAKETQSSAEGIEASA
GUT_GENOME069535_01080  122-201  GCLSDMDGKWSIAWRVHDAQFWGVPQRRKRIALVCDFGGHTAPEILFERKGLHGDTAEGGTAREEIARTAGNGIEGASGF
GUT_GENOME123865_03403  176-260  GVCKWKNAGIINPGPGGFGLAWRVLDAQYTRVSGFPRALPQRRRRVFLVGYRGDWRLAAAVLFDRQSLSWDPPPRREAQAGVART
GUT_GENOME232114_04370  186-251  KWSKHGCVIGRSRRLGWTTKDAQYFGVAQRRRRCFLVASARTDFDPTSVFLESGSVCRDSAPSREA
GUT_GENOME224712_01469  162-240  RWESAGAVLLGDEFSMAWRVMDAQFWGVAQRRRRIFLVADFGGTTAPEILFKQDGLFGDTQESGGPWQGAAAPAEGCPD
GUT_GENOME095520_01009  127-187  SFAQALADCGYHLAYRILDAQFFGVPQRRRRVFIVGYLGDWRPAVAVLFEPHGLRRDTPPR
GUT_GENOME207398_00423  143-201  KWANAGMVRGREFELCWRVLDAQYFGEPKLLQRRKRIFIVFDFGGFRSHKILYKPKDLL
GUT_GENOME000161_02120  173-238  GRWANAGMVRGGCPDLAWRLMDAQHWARPRLARRQRVFLVADFGGRRAPEILFKPRPMLPLPAPCT
GUT_GENOME236230_00800  68-140   GGQFSIAYRVLDAQFWGVPQRRRRIALVADFGGTTAPEILFERESVCRNTAESGEPGEAITSAAESSLSTTSE
GUT_GENOME121280_00005  233-303  KWAKSGMVQCGGVQVAWRQLDAQYWGVPQRRKRIFLVASFGCSCAEQILFKPESVPWNIAQSRQTKERTSN
GUT_GENOME072566_01171  163-238  WAYADCINGDGWSVAYRTFDAQYWGVPQRRRRIYLVADFRGGRAGEILFKREGLRGHTAQSGTQGQETARCAKNSV
GUT_GENOME031028_00082  158-231  WATSGYVLADGASLAWRVLDAQFFGVPQRRRRIFLVADFAGQSAPEILFESQSSTRNPDASKEEKQNPARAISP
GUT_GENOME089325_00822  152-227  KWQQAGCILGGHYSIAWRTFDAQYWGVPQRRKRIYLVADFAGECATEILFKPESVSWHTPKIFQSRQTVAGCATDC
GUT_GENOME096561_02473  153-229  KWNGAGEIVAETYSIAYRTLDAQFWGVPQRRRRIYLVGDFGGQRAGKILFEREGLSWDPAPGRGAWKKVTGYIDRSV
GUT_GENOME001315_00619  151-207  GRWANAGMVRGRGVDLAWCVYDAQYFGTAQRRRRLFLAADFTGHSAGEILFVPKSLR
GUT_GENOME251951_00018  149-225  GRWAKAGMVRSERVGIAWRVLDAQYWGVPQRRSRIFLIADFAEKNRCAAEVIFVEQSVPRDLEKSGTTGKEVASFSR
GUT_GENOME257704_00294  41-118   GKRWPNSGCVYGPKRAIAWRVLDAQYFGVAQRRRRVFVVASARKGFHPAQVLFEREGVHRDSPPRRGEGQDLAGTIAG
GUT_GENOME012396_00314  159-234  WATAGVVRGGAICAAWRQLDAQYWGVPQRRKRIYLVGSFGSDRAEKILFECDSVRGYLAPCGTARERTAATTEGSV
GUT_GENOME180786_00404  129-193  YSLAWRVLDAQFFGVAQRRRRVFLVGHLGADVGAAASVLFERDSVSGNTVSGKQKREELAAVPEG
GUT_GENOME160348_00152  111-185  RWSKAGAIAGNGWSLAWRQLDSQYFGVAQRRKRIALVADFGGQRAGEILFERTSMSRHPDPCIPAWKEVTGLTAN
GUT_GENOME238202_02534  150-208  QWPNAGLIKGPVRNVAWRVLDAKYFGVPQQRRRLYVIAGGKNFHPENVLFERRGNAPLG
GUT_GENOME029476_02616  154-223  YGVSWRIFDAKYFGTPQRRRRVYLVASFGSLRSADVLFDTKPASIAPRAGLGEKDLLTARNGSSISTSNI
GUT_GENOME000584_01146  157-231  WQSTGEILGDNFSLAWRVLDAKYFGVPQRRRRIFLVADFDGGSSREILFEQKSLSGDTSEGCEKGKRNTGAIKEG
GUT_GENOME046426_00028  232-296  YGLAWRVLDAQFFGVAQRRRRLFLVGHLGACPPVGVLIEPESMRGDLESSAEKRASLAEAARRSP
GUT_GENOME118800_01437  157-232  RWPSAGLVTGAKRSVAWRTLDAQFFGVPQRRKRVFVIASSRQGFNPAKVLFERPSLRGDSCESAEKRETVASFAEG
GUT_GENOME202700_00423  504-579  LVAGPKRKAAWRTLDAQHFGVPQRRRRVFVVASPVEGGGAIDPAKVLFERQGLLGHSSQSDEPRETASAFAQSSFG
GUT_GENOME171418_00480  155-226  RVQKWQKAGAIVADQWSIAWRVLDAQFFGVPQRRRRIYLIADFASERAGHILFEPSCSSRDLAQGCSEKQNP
GUT_GENOME170487_00801  150-214  WAKAGMVECESGTLGWRTLDAQYWGVPQHRERIFLVNHFGTGGGIDEILFKPQSMPRYSQTSQEG
GUT_GENOME236210_01531  141-205  KWAKAGLVELPECQIAWRVLDAQYWGVPQCRARIFLVADFGAEERRAAEMLFVAAGLQRNPCQSR
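
Each row in the list above pairs a protein identifier with a residue range and the amino-acid fragment for that range. The Python sketch below shows one possible way to split such a row into its three fields and to check that the range length (end - start + 1) equals the fragment length. The identifier pattern (`GUT_GENOME`, six digits, an underscore, five digits) is an assumption inferred from the rows shown here, not a documented format, and the names `parse_row` and `range_matches` are hypothetical helpers introduced only for illustration.

```python
import re
from statistics import mean

# Assumed row layout: identifier, "start-end" range, amino-acid string.
# Whitespace between the fields is optional, so fused rows also parse.
ROW = re.compile(r"^(GUT_GENOME\d{6}_\d{5})\s*(\d+)-(\d+)\s*([A-Z]+)$")


def parse_row(line):
    """Split one row into (protein, start, end, sequence)."""
    match = ROW.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognised row: {line!r}")
    protein, start, end, seq = match.groups()
    return protein, int(start), int(end), seq


def range_matches(start, end, seq):
    """True when the inclusive residue range is as long as the fragment."""
    return end - start + 1 == len(seq)


# Two rows copied from the list above.
rows = [
    "GUT_GENOME036466_01639 140-205 "
    "YSIAYRILDAQGFGVPQRRRRVFVVGHLGRDWRQSAAVLFESEGMCRNTSAIQKTRAKDSRLAGTL",
    "GUT_GENOME001599_00155 59-136 "
    "WRTAGCVMGDGWSIAWRVLNAQFWGVPQSRRRIALVADFGGESAPEVLFERESVSGNSEESRAKQEGASGVSAVSVGD",
]

parsed = [parse_row(row) for row in rows]
for protein, start, end, seq in parsed:
    print(protein, start, end, len(seq), range_matches(start, end, seq))

# Mean fragment length over the parsed rows only (not the whole cluster).
print(mean(len(seq) for _, _, _, seq in parsed))
```

Because the list is filtered and shows only part of the cluster, a mean computed this way need not reproduce the average sequence length reported in the Information section.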