UHGP-MC 14500


Information


Number of sequences (UHGP-50):
57
Average sequence length:
61±3 aa
Average transmembrane regions:
0.01
Low complexity (%):
0.29
Coiled coils (%):
0
Disordered domains (%):
0.95

Pfam dominant architecture:
PF10604
Pfam % dominant architecture:
6140
Pfam overlap:
0.43
Pfam overlap type:
reduced

Downloads

Seeds:
MC14500.fasta
Seeds (0.60 cdhit):
MC14500_cdhit.fasta
MSA:
MC14500_msa.fasta
HMM model:
MC14500.hmm

Sequences list (filtered 60 P.I.)

Protein Range AA
GUT_GENOME001435_0369673-131MDNVNFSGEWTGHFRQAENGTEVIFTESISLKKWWLYPFALLYLKLQQKRYIADLKRKC
GUT_GENOME000994_0022672-129MENKNLTGHWQGVFSYQNGITVLELIERVKARNIFMRPFMKLYLKNKQKKYIAYLRKE
GUT_GENOME044231_0041071-138LENGNITGHWIGLFAEMDGHTQLELTETITAKKFFMRPFVKAYLKKQQARYIADLKRALEMERGNANG
GUT_GENOME183802_0185872-141MENDNISGKWTGEFQKLAARKTKIIFTEDVIAKKFYMKPFIKSYLEKQQQTYIRDLAAYLAKNLTLKNQK
GUT_GENOME217503_0116672-132LKNENMEGRWKGEFFRADGGTKVVFTEEVRVGKFFFRPFAKAYLKARQKKFFADLRRCLGC
GUT_GENOME254341_0015772-131MENDNLTGRWKGKFIFSGGRTFVEFTEEASAKKFYIKPFLKAYLKSQQKIYIADLKKKLE
GUT_GENOME149848_0150770-129VDNSNMKGIWKGEFLTENGKTKVKFVENLKAKKVWLIPILKIYVKRQQKIYMRDLEKYLY
GUT_GENOME066014_0386685-145MENQNMRGHWSGIFESCDGGTQITFTEEVQVRKRIMNLFVKGYLKRQQARYIADLRKAIAK
GUT_GENOME094937_0203271-133MENTNMRGHWTGVFRSRGDNTEIQFTEVVTAKKWMLKPFVKGYLKKQQARFVTDLKQALQTQG
GUT_GENOME067288_0140671-131MENSNMSGHWEGRFTSTRVGTQVTFTEKVTVKHWWMGLLVTGYLMRQQKAYLEDLERKLGE
GUT_GENOME063431_0081871-131LENGNMKGRWRGVFSYADHQTTLDLIEEVTVKKALMRPFAGIYLKRQQARYIADLRRELGG
GUT_GENOME188428_0126372-131MNNDNMSGKWRGTLTKRGEQTQAVFTEEVYAKKLFLKPFVKRYLAKQQKQYMQDLKRHLE
GUT_GENOME139502_0163172-133MKNKNMQGRWIGKFVRGGNGGCIIEFTENVTVFSPVLNLFVKSYLKKQQKQYVKDLKKALGE
GUT_GENOME111314_0171972-129LENQNMEGMWTGRFIKVQRGTEIEFTEQVAVKRRLLRPFAKAYLKRQQREYVEDLKKA
GUT_GENOME073290_0154572-135MENSNMKGHWIGLFSSHNGITSIDFTENITPKKFFMKPFVKRYLKKQQAAYIRDLTAVLEAIEH
GUT_GENOME016336_0037574-133LDNANLAGRWAGEFRPEGGGTQVTFTETITVKKPWMRFFAKGYLRRQQARYLADLGRALE
GUT_GENOME256840_0228272-132LQNQNLSGSWTGLFRETAAGTEVKFIEEIQVKNPMMALLARVYLKKQQQRYTADLRKVLGE
GUT_GENOME153104_0097086-149MENENMQGHWIGTFEKTAAGTRADFTEKVTAKKLWMKPFTGFYLANQQKRYMEDLRRKLEGIQE
GUT_GENOME207914_0240872-133ITNKNMNGHWSGVFSIANKNGTQIEFTEEVSVNNPIMNLFVGLYLKKQQSLYIADLKKELGE
GUT_GENOME216267_0090370-130IENNNLTGHWKGKLYIKDGKTHIDFTEDITPKKIIMKPFVKFYLRKQQAAYLCDLKKALGE
GUT_GENOME097137_0166573-132LENENLTGTWRGEFRAVEDGTRVTFTERIQPRKWWMRLFARRYLQSQQRRYFEDLQRTLV
GUT_GENOME141059_0124771-128IENDNMTGHWVGNFTDHETYTAVEFIETITAKKILMKPFIKMYLKKQQALYCHDLKQA
GUT_GENOME119669_0099671-133IENKNMSGSWEGCLYDTDAGTQVIFTEEVRVKKFYMKPFIKGFLQKQQETYMQDLANALDKQG
GUT_GENOME066021_0082471-126MENTNMRGRWQGKFTKSGAQTDMIFTEEVTLKKFYLKPFAKRYLKKQQEKYLADLT
GUT_GENOME098569_0029771-130MENGNMAGCWTGRFADAPGGCTAAFTEEVQAKRVWMRPFVGWYLRSQQRRYWADLRRRLE
GUT_GENOME068415_00542102-161MENGNMSGSWEGIFEAVENGARLHCTETINAKRWWMRPLVPGYLKRQQKLYLGYLRQALL
GUT_GENOME103623_0097371-136IENENIKGTWIGKFYSHGNNTTLDFTENVVSKKFIFKPFVGLYLRKQQKLYFKDLKNELNCNEASF
GUT_GENOME188413_0203775-134MRNANLRGDWTGEFSERDGGCRVRFNERVTVEQFPMTILAGAYLKKQQRRYLEDLKKALN
GUT_GENOME093111_0052072-139LENSNLRGHWQGLFSATTGGTRLELIEELEVKKPLLRLVAGKYLRHAQQRYLADLRREVERQNVAKSK
GUT_GENOME112679_0159772-131ENNNLNGHWTGVFRETIYGTEIEFTESVHVRKWWMRPFAETYLKKQQTVYLADLKKACEK
GUT_GENOME032990_0097776-131LQGTWIGRFDYQDGQTILDFTENITVKMPLLTPFVWLYLRKQQKQYFRDLEKALKV
GUT_GENOME080269_0060671-128LENNTMQGHGRGVFLAKGKETELDFTEQIHCKKLILRPFIKLYLKSQQSQYVSDLKKH
GUT_GENOME202265_0127971-131LKNRNLTGHWTGLFEPETGGVRITFVERVALRRRWMTPLAALYLWRRQRRYLDDLRKKLAR
GUT_GENOME197527_0043971-131LENDSLSGTWTGLFEARDSGTRIVFEENVSVKKLWMKPLAGFYLRRQQNVYLRDLRRALAS
GUT_GENOME282802_0166372-132MENGNLSGHWIGIFTEKDGGKTEVCFTEAVEPKKSMMKLLAKPYLKRFQKKYISDLQNALE
GUT_GENOME110696_0062071-130LQNKFICGHWRGTFLPSGTSTLIDFTEEITVKKFYLKPFIKFYLYKQQKLFVKDLQNYLY