UHGP-MC 28499


Information


Number of sequences (UHGP-50):
83
Average sequence length:
111±9 aa
Average transmembrane regions:
0.17
Low complexity (%):
1.9
Coiled coils (%):
0.92
Disordered domains (%):
17.09

Pfam dominant architecture:
PF14191 - PF14195 (architecture)
Pfam % dominant architecture:
120
Pfam overlap:
0.65
Pfam overlap type:
reduced

Downloads

Seeds:
MC28499.fasta
Seeds (0.60 cdhit):
MC28499_cdhit.fasta
MSA:
MC28499_msa.fasta
HMM model:
MC28499.hmm

Sequences list (filtered 60 P.I.)

Protein Range AA
GUT_GENOME232186_0241837-161TMYYEINEELCKRGHEQNHLITDYKPGSTTASYRAAVDAFAAKCEAAKQGCRPGHEAKLDALADRYARRLAAYYNNSAANNARHVSWFVAGPSNYNMRAHEKWSRREEKLREDWNAIQRMEDEIS
GUT_GENOME000223_008721372-1504DTPEQTAVHYYTINEGAARRAKEANSFDDYRPGSATAEYRSMVDKAVEIAQRQKKRVAPEFHDKIDQLLDTYARLLAQNMNRGYEIAARVPSVMIAGPANFPVKKKQKQNAADDKNMREWQDIQGLLDKIRST
GUT_GENOME000880_015653-109RIYFNINESGAKTAHNMMSFSDYKEGSRTAGYKVLVDKAYELGENVILKKPSEEERVANLCERYSRRLAQNINRDIQIGMMCPSVMISGAGNFPVKKKEKQVAAWDK
GUT_GENOME255140_01400574-672KYFDIDEEQARRGRESYSLFGYKEGSETAYYRQQVDYAFKLAAVAKEKTADPELQARADMLAERYAKKYAEWLNEKNKIDASYPSWIVAGPANYNTKKN
GUT_GENOME090750_013293-108RIYYHIEEGTAKTAHEMRSFEPYKWGTVTARYQELADGAWELAERVARECPEKAEAAHKMAERYARRMAENLNKANRIGTRCPSILIAGPAGVDSKKKERQCAAYM
GUT_GENOME233284_004506-117INFYPINEALALRAHESTFTSDYKMNSATNEYKSILAEFSEDINLLINAHPNNATSDNMELINQLADRYSKKLADAINELNRIDSSCVSWMLSGPANYPMKKHQKQQQARDN
GUT_GENOME263819_0059318-123MKRKYYELNEGLARTAKSINSFSDYVENTATNEYKYYCDKVYDVLEKIIEQKPNLAEKATYKVDRYCRKLADYYNAYYKNEASCPSILITGAGNFPIKKKNAQNKR
GUT_GENOME234507_011143-109YYEINEVNAKRAHEMSSYSDYKPGRATKEYRNQIDEAGKELDYVLERCKTPSQMEHAADLFDKYCKTLAFAINEENRIGCMCPSVLITGSGNFPTRKKEKQVAAWGA
GUT_GENOME172984_0164912-118YYEINEEMARQAKAMHSMDDYIANSETESYRKVVDEVHEIGQAAKERTIEDKHEYIDYLCDKFAKKYAEYVNKGLSIKMQCPSVLVCGPANFPTRKKEKQNIARDNH
GUT_GENOME206704_004831-114MKYYPINEEAARTAWQMNHFGEFRSDEAGYRASVDEAYALAETAAEARPEQKERALALADRYARKLADWTNKKYRIDSMCPSVLISGSGNFPVRKKEKQNRAIDAHWQEYEKLK
GUT_GENOME067377_014236-129YYPINESSARTAHNMMSMRDYAEGSTTAEYRRTADRAYDLADRVAAERPEEAERAYRLAARYAKKMADYYNREASIGMMCPSVMISGAGNFPVRKKERQVAAWERNHQFYEEAQKILGKIESIL
GUT_GENOME278552_00457219-318VSEDWAARAQEMRSFSSYQQGSATSSYNASVDQFDKNVNELIQRYGNNATLTDKDWEEVYSIADRYAPNLEKYTDETNRNEASYPSWFISGPARYNTKKN
GUT_GENOME124593_009182-124YYTIDETMARRANNAYSFSDYREGSATAEYRQMVDDAAALAERCKQGRGEAAAAKIDALLDRYARRLADNINARNRNTASCPSVMIAGPANFPTRKKSRQNAREDSLMRDYQEIQHILHQIRT
GUT_GENOME222937_022433-111IQFYTISEDMARAANDANSMSDYKQGSATEEYRKRVENVYAVIEKIKEKRPNLAEKAERMAGRYSKKLAEYYNSYYRNEASCPSVLISGAGNFPVKKKNKQNSRRDSLM
GUT_GENOME265631_009821-111MARYYEINEADARMAHDANSMREFKAGSETEGYRAQVDEAYRMAEEQAERFPELAEKAYALADRFARKYAEWLNEGYRIDAMCPSILVSGGGNFPVRKKERQNARRSSHME
GUT_GENOME030570_00029456-564DFAKKYYEINEENARLAQELNSFSKYETGSATRIYHEKCDFAYSILDKILDEHPEQAESTAQKIDYYCKKLAEYYNDYYRNEASCPSVLICGPANFPSKKKERQNERRT
GUT_GENOME214339_009252-102YYEINEATARLAHDNMSMRDYIPNSATNEYRAAVDRAAAVLEEVKAKCKTQAQRERAEYYFDRYAKKLAQAINQENAIGTRCPSVLISGASNFPVRKKEKQ
GUT_GENOME110648_000971-108MKYYNIDESAARLSHEMMSMSDYQKGEKTLEYRGLCDFAAEMAEREKKRKPEYAEAIDALLDRYARKLAEWMNTESRIGTLCPSVFIAGGDGVSAKRKAKQNARMDAH
GUT_GENOME079230_00681545-648NPVSYYPIDESGAKLAKEMNSFSDYEAGSATSEYQKLVNRAAEIARNQKEQVDPIYHPKIDALVDRYARKLDENMNRQFEIDARMPSVLISGGGNLSVRRKEKQ
GUT_GENOME283956_000883-113YEINEDMARRAHEMRSDREYVPGSATAEYQRQVDEARRIAEEVKAQCKTTAQRNRVDGMLDKYELTLAFAINRDNEVGTWCPSILIVGGANFPVEKKKRQGEAWSANLMNY
GUT_GENOME095032_013475-111YFPINEATARTAKELNSFGDYVPGSATAHYRDMCDDVYTAAEDIAQRLPYLAEKAEAKAQNYAKKLAEYYNDYYRNEASCPSVMICGPANFPTRKKEKQNSRRDTLQ
GUT_GENOME231248_0040370-187PISEKTARDAKRMNSFSDYIEGSATAGYRSQVDDAAYTAFRQKQRVDPMYHEKIDRLLDAYARKLAENTNAGNSIAASCPSILIAGGSNFPVRKKEKQDAGADRNMAEYNEIQGLVDK
GUT_GENOME275578_0065820-132RYYEINEDTARNAHYCVHMSDYQPGSATNGYRAAVDEAAALVEARKAKVSPHYHDKLDALLDRYARRLAQWTNDYNRNQASYPSQFISGAGNYNMKKHEKQMSREGTLWKEYD
GUT_GENOME161540_003288-117INEDAARQAKAMWSHSDYVMGSKTEEYKKAVDEAYDLVDKIKDQRPKQAEKAENIARRYAKKLADNYNKGFRIELMCPSILISGAGNFPVKKKEKQNEARDRNLKEYNQI
GUT_GENOME277800_003541-109MKYYYINETTARAAHDMNSMRDYCDGEKTARYKRRVDEATEIAEARKKRYPDEADRIDLLLDRYARKLAEWYNKESEVESMCPSVLISGAGNFPVRKKEKQNERRDALM
GUT_GENOME205974_008281-113MTNFEKIKQMKEINKMKYYEINETAARQARECWSFRDYQYGSKTAEYKAQVDKCYSLVDKLPDDLKEKGATMADRYAKRLADWYNKQFRIERMCPSVMISGGSNFPVRKKEKQ
GUT_GENOME011738_00935150-250YGVDEGLARRAHESYSFSDYKDGSESGSYRSRVTAFEANVEELRSRYKDKNYTQEELDEVEHLTEAYARNLANYTNENNRKEASYPSWAIAGPANYNTRKN
GUT_GENOME133652_021506-116INEKAAKEAKSLWSFSDYEEESKTREYQSQVDEVDTIVSELPEELQEKGEALADASAKNLADWYQSAKKLADWYNKQSSVMISGAGHFNVQKADASAKNLADWYQSVKKLA
GUT_GENOME014667_01116273-377INEELARWAHESHSFSDYKTGSATAGYNAQVAEVAELAEKVKSSAPEEFHSEIDSLVSRYASRLAEWTNRRNRIENSCPSWFVSGPANYPMRKFEKQQAALKNCY
GUT_GENOME070861_0031516-114YYEINEETAKASKLMMSLDDYEENSTTNEYRTMCDRAQEIADEQKQKHPECAETIDMILNRYCRKLAEWINRENAIGTMCPSVMIAGPAGINRAKKEKQ
GUT_GENOME053596_010514-122QYVTINEESARIAHSLMSMSDYQEGSKTASYRAQVNEAYDLAEKVIEKRGERFEKRARILADRYARNLGKYYNEDARIGCMCPSVLISGAANFPVRKKERQVAAWDKNQEYYKYCDKIL