UHGP-MC 72342


Information


Number of sequences (UHGP-50):
123
Average sequence length:
128±8 aa
Average transmembrane regions:
0.14
Low complexity (%):
0.34
Coiled coils (%):
0
Disordered domains (%):
13.92

Pfam dominant architecture:
PF01641
Pfam % dominant architecture:
8455
Pfam overlap:
0.87
Pfam overlap type:
equivalent

Downloads

Seeds:
MC72342.fasta
Seeds (0.60 cdhit):
MC72342_cdhit.fasta
MSA:
MC72342_msa.fasta
HMM model:
MC72342.hmm

Sequence list (filtered at 60% identity)

Protein Range AA
GUT_GENOME030676_01702 64-181 LTPREADIILRKATEAPWSGEYEKNTAAGTYLCRQCGVALYRSSDKFDSGCGWPAFDDALPGMVRRQPDADGHRVEIVCEHCGAHLGHVFEGEGLTAKNTRHCVNSLSMRFAPAGSEK
GUT_GENOME000530_00555 34-159 RLTDEEWARRLTPEEFQVLRRAGTERPFTGEYWDSHVKGVYACRACGSELFRSDEKFDAHCGWPSFFAPLAEDRVEYLKDTTLGMERIEVRCAACGSHMGHVFAGEGYDTPTDLRYCINSISLRLA
GUT_GENOME143140_00140 234-375 TSSLDPARYQKLDDPALRYQLTKEQYDVTQNAGTERAYTGAYYDNHEQGIYVDIATGEPLFSSADKFDSGTGWPSFTKPIDPDVVVKREDGSYGMVRTEVSSRVGDSHLGHVFNDGPIDKGGLRYCINSASLRFVPLADMDA
GUT_GENOME232826_01148 59-195 FEKPDRKVLKEKLTPTQYRVTQEDYTERPFCNEYWDNKEDGIYVDIISGEPLFSSTHKYESGTGWPSFHSPIDKNNLIFKEDKKLFSTRTEVRSKIADSHLGHVFDDGPAPTGLRYCLNSASLRFVPKDKMAEEGYE
GUT_GENOME254185_00783 184-308 TDNEQEWKRKLTPLQYDVLRRKGTERPFTGRYYRFSEDGTYSCAACGNPIFMSQDKFDSGCGWPSFDKAIPGHVKFTPDFSHGLERIEVTCARCGSHLGHVFEDGPTETGDRFCINSAAMNFTPE
GUT_GENOME056632_02050 278-417 TGDPVYRKPDDARLKARLSPEQYAVTQKNATEPPFRNKYWNEYREGIYVDITTGEPLFISTDKFDSGCGWPSFSKPIQKGLIEERMDTSHGMRRVEVRSKTGNAHLGHVFNDGPREQGGLRYCINSAALRFIPKEDMEAQ
GUT_GENOME096029_00886 381-512 YPKLSEKELKMKLNVQQYKVTQQGDTERAFQNDYWNFFEAGIYVDITTGEPLFSSKDKYNSACGWPSFTKAIVPEVVTYHKDTSFNMIRTEVRSRSGNAHLGHVFDDGPRDRGGKRYCINSAAIQFIPLKEM
GUT_GENOME231300_03266 45-173 THLNVSNAEWKKILPHNLYAVAREQATEQAFTGKYWDSETKGTYYCAVCGNKLFRSDAKFASSCGWPSFFEPVRKESVVYKNDNSYGMHRIEVECGRCNSHLGHIFDDGPAPTHKRYCMNSVSLDFEPD
GUT_GENOME208009_01487 27-158 AAARTEAQWRAALSPMEYHVLREAGTERPFTGALLEENREGVYRCRGCGAELFRSTTKFESGCGWPSFYDPRDSSAVTLSTDTSHGMVRTEVRCASCGSHLGHVFDDAPHTPTGQRYCMNSVSLTFEPAAQE
GUT_GENOME011896_01234 41-166 RDDIKVSTSKLTSKEYEVLINKGTEFPFTGDLLDVKSDGVYTCKLCGNLLFKSDAKFNSGTGWPSFDDAVAENIKLVKDGCRVEVSCAKCGGHLGHVFYNEGFTDKQTRYCINSVSLNFVNKAEFE
GUT_GENOME244252_00614 255-389 KDLPSDGDLKKMLDEESYQVLRQKGTEPPHSGDFIVPSEPGLYVDKVSGKPLFSSDDQFDAGCGWPSFSMTVTTDASDYAEDYSHGMSRIEVESRGNGNHLGHVFPDGPKDLGGLRYCINSVSLRFIPREQMEAE
GUT_GENOME147523_03985 22-147 VRKSDAQWREQLGEAQYYVMRQHGTERPFSNQMCELFEPGLYRCAGCQNLLFDSGSKFDSRTGWPSFAQPMKSNAISYHLDETLARPRIEVRCNQCESHLGHVFPDGPAPSGLRYCVNAVSIEKVN
GUT_GENOME080362_02864 317-434 NQLLSPEQRRIAFEQGTERPGTGSHLDEKRPGTFVDPVSGAPLFRSDSKFESGSGWPSFFQPLPGAITLHSDRSHGMVRTEVRSASSGIHLGHVFDDGPPPSGKRYCINGKVLKFVPD
GUT_GENOME212901_03832 48-171 DADWRQRLTPAQYAVLRQQATERPYSSPLNKEHRQGTFACAGCALPLFSSSTKFESGTGWPSFWAPLHNAIGEDRDVTFGMLRVEVHCRRCGGHLGHVFNDGPRPTGLRYCMNGAAMVFVPGAA
GUT_GENOME042247_00823 43-165 PDPGPARTRRPLTETEKRIIEHQGTEPPFSGEFVSLFETGIYSCRKCGAPLFRSEDKFDAGCGWPCFDDSLPDAVTSVHAGCGRRTEITCARCGGHLGHIFNGERLTPKNMRYCVNSLSLEFR
GUT_GENOME047722_01237 213-343 ARENEPVYRKKSDEELRRVLTREQFAVTRNNATEPPFRNEYFNNDRPGIYVDVTTGEPLFLSTDKFDSGCGWPSFDDEIPGAVRREPDADGRRTEILCAKCGAHLGHVFTGEGFTAKNTRHCVNSLSLDFV
GUT_GENOME146117_02900 6-130 SAEELKKNLSEMQFYVTQNHGTEPPFTGRLLHNKRDGVYHCLICDAPLFHSETKYDSGCGWPSFYEPVSEESIRYIKDLSHGMQRIEIRCGNCDAHLGHVFPDGPQPTGERYCVNSASLRFTDGE
GUT_GENOME206579_01811 108-240 DYKKPAAESIRDKLTSEQYRITQENGTERPFENEFWNQFEKGIYVDVVTGEPLFSSTDKFESSCGWPAFSKPIEDPAVVESVDKSHGMIRTEVRSRAGNSHLGHVFTGDSESPNGVRYCINSAALRFVPYAKM
GUT_GENOME160484_01054 182-303 INSDIEIKMNPSDNLTLLSYQVTKLAMNEEAFSGKYADFDEEGVYVDITNNEELFSSKDKIKTNSGYACFSKAIDENAIDNLRDFSYGMVRLETRASKSGIHLGYKRKGHYEINSAALKFIY
GUT_GENOME007180_02932 31-160 PKQIQKTDAEWKKELTPEQYEVLRKKGTERAFTGEYYNHFEKGNYVCAACGNVLFTSNAKFHSDCGWPSFDQAIKGSVIYKEDTSFGMVRTEVMCAKCGGHLGHVFDDGPAQTTGKRFCTNSVSIKFVPA
GUT_GENOME015678_00006 154-277 VYRNEKMWKRKLSAERYGVMREQGTEKPHSGKYVVFDEDGTYRCGACGQPLFDSSSKFATTCGWPGFDRPLKGAVKTRKDLSHGMVRTEVVCSRCGSHLGHVFKDGPTETGDRYCINSVALDFE
GUT_GENOME185730_01139 174-304 MYTRKTRQELKEALTPLQYCVTQENATEEPFKNEYWDKFDSGIYVDITTGEPLFSSSDKYDAGTGFATFSKPIDPNAVIEEPDVLDENLTIEASSRIGRAHLGHIYTDGPSGQKRYSINSASLRFIPSENI
GUT_GENOME096289_03082 34-159 KPKSEWKALLPANSYRVLFEEDTEPPGSSPLNEEKRAGTFVCAACSLPLFASSTKFESGTGWPSFYDHLPDSVAMKTDYKLVLPRTEYHCARCGGHQGHVFEDGPKPTGLRYCNNGLALRFVPENE
GUT_GENOME176876_00147 180-303 VERIWDLTLEQFAVTQNAATERPFVNEYDEEFEPGIYVDIVSGEPLFSSRDKFDSGCGWPAFSRPIAGDLLTEHEDHRIPGRDRIEVRTSDTQIHLGHVFTDGPADRGGLRYCMNSAALRFVPR
GUT_GENOME136455_00751 6-116 LSYKKTYVALHNSCNQSLKEGNDNSYEDETYNCKNCNTPLFSLKNKLASRIGWFSFANAIPGKIRHQPVTENIRSEIFCNCCGTFLGYIFFSERLSSNGFRYFINALSVKI
GUT_GENOME096499_00963 41-157 NKLTPEEERVIIYKGTEVPFTGELLNNNEKGIYVCKRCDTPLYRSEDKFDGHCGWPSFDDEIEGAVKRVRDADGRRTEIICNTCGAHLGHVFLGEGFTPKQTRHCVNSISMKFIPEG
GUT_GENOME045534_02447 164-260 YEYLKRSYAKRHLSKLQYEVTQNKLREKPFKNEYYDNFKEGVYVDIIDNTPLFSSRDKINKNCGWPIFSKAIDDNLIKFIKDEEIINKIENFISKVE
GUT_GENOME236870_00618 143-253 KEFEVTQLSIDEEAFSGKYCDFYEDGIYVDVIDGEELFSSKDKVKTDSGWPTFTKPINESAITKNRDFSYGRTRIEIRSAKSNSHLGYLVYEGPNGEPRYRINSTSLKFIP
GUT_GENOME135371_00604 163-284 KQFDDMTYRVTKFKETEPAFDNEYVDNFEVGIYVDKETNKPLFSSKDKYDARCGWPTFYKSSFEDEIEKHIDLRNLMIRMEVMSKSGKNHLGHLFYDGPVNYGGKRFCINSAALRFIPKDKM
GUT_GENOME057787_01858 36-155 KNKLSADEKAVIWGGATERHFSGKYDKFFEDGIYTCKVCSAALYSSSSKFDSGCGWPAFDKAFPQAIKALPDPDGERTEIRCARCGAHLGHVFRGEGFTPTNTRYCVNSISMNFIEAKNL
GUT_GENOME096525_04513 186-320 AKQESPEELKHRLTPLQYEVTQNNATEPAFRNEFFDHREEGLYVDIVSGEPLFTSLDKYDSGCGWPSFTKPVEADKVKEKGDYSHFMVRTEVRSKEADSHLGHVFTDGPQDQGGLRYCINSAALRFVPIDKLEEE
GUT_GENOME007180_03066 18-149 PDKKVVKTNDEWQKLLTPEQFRITRLKGTERAHSSEMCSLFEPGKYACVCCGTLLFDANEKFESGTGWPSFTQPVTENAIAYHKDISYGMVRVETTCNTCDAHLGHVFPDGPGPGGLRYCMNAVALKKVESN
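
Each row in the listing pairs a protein identifier with a residue range and the amino-acid sequence covering that range. The following is a minimal sketch for splitting a row into its fields and checking that the inclusive range spans exactly as many residues as the sequence contains. It assumes identifiers follow the pattern `GUT_GENOME`, six digits, an underscore, and five digits, which is inferred from the rows shown here rather than from a published specification. The names `parse_row` and `range_matches_sequence` are hypothetical helpers introduced for illustration.

```python
import re

# Assumed row layout: identifier, residue range, sequence, with optional
# whitespace between the fields. The identifier pattern is an assumption
# inferred from the rows in this listing.
ROW = re.compile(r"^(GUT_GENOME\d{6}_\d{5})\s*(\d+)-(\d+)\s*([A-Z]+)$")


def parse_row(line):
    """Split one listing row into (protein_id, start, end, sequence)."""
    match = ROW.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognized row: {line!r}")
    protein_id, start, end, sequence = match.groups()
    return protein_id, int(start), int(end), sequence


def range_matches_sequence(line):
    """Return True when the inclusive range covers len(sequence) residues."""
    _, start, end, sequence = parse_row(line)
    return end - start + 1 == len(sequence)


# One row copied from the listing above.
example = (
    "GUT_GENOME136455_00751 6-116 "
    "LSYKKTYVALHNSCNQSLKEGNDNSYEDETYNCKNCNTPLFSLKNKLASRIGWFSFANAIPGK"
    "IRHQPVTENIRSEIFCNCCGTFLGYIFFSERLSSNGFRYFINALSVKI"
)

if __name__ == "__main__":
    print(parse_row(example)[:3])
    print(range_matches_sequence(example))
```

A row whose range and sequence length disagree would suggest a truncated or mis-split entry, so the check is a cheap sanity test before using the sequences downstream.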