UHGP-MC 22052


Information


Number of sequences (UHGP-50): 124
Average sequence length: 109±10 aa
Average transmembrane regions: 0.09
Low complexity (%): 0.27
Coiled coils (%): 0
Disordered domains (%): 8.52

Pfam dominant architecture: PF13306
Pfam % dominant architecture: 323
Pfam overlap: 0.47
Pfam overlap type: shifted

Downloads

Seeds: MC22052.fasta
Seeds (0.60 cdhit): MC22052_cdhit.fasta
MSA: MC22052_msa.fasta
HMM model: MC22052.hmm

Sequences list (filtered 60 P.I.)

Protein Range AA
GUT_GENOME013662_00785 1943-2064 KSGSVNDNVQIDPSGFVYEAVPTNRIEGVQASIYYKETVEDQYGDPHENIVLWNAEDYAQKNPLFTDENGMYRWDVPQGLWQVKFEKDGYITAYSEWLPVPPPQLEVNVAITQNKQPEVVEA
GUT_GENOME261444_00491 1699-1798 VILDPSGVVYEALESNPVEGATATLWTRGSASGGSEQEWNAEAYEQRNPQTTSGDGSFAWDTPTGQYQVRVSKDGYRDSASEWLNVLPIQTDVNIKLESS
GUT_GENOME153026_02516 979-1119 DPSGYVYEGIADNRLEEVKTTLYYRSDKNCQVTQWDASEYSQSNPLYTDSEGKYAWDVPEGQWQVKYEKDGYETAYSEWLDVPPPQTSVNIGMVSKTLPIIEWTKATTAYIEISFNQYMKASTITGDKFVIKDAGGSTIEY
GUT_GENOME185664_00491 952-1074 VLIDPSGVIYDQSGQPIAGASVTLYQWDETNGKWFDTPWDANSHLQVNPQTTGADGAYGWMVPAGVYQVRATAPGYLDAVYGTAPPLDSPEGKDVIRIPPVRTDVDITLNKIQSPAQAKAAAQ
GUT_GENOME220434_01061 14-108 IDPSGYVFEAVVSNRVEGVTASCFYRDENGDMIWWNAAEYGQHNPLSTDGDGYYEWFVPVGDWKVVYEKDGYEAAESDWLPVPPPQTEVNQSLVS
GUT_GENOME033109_00013 1287-1388 FKFVVDPSGIVINNVSKNPVTGATINLYYQGENGEEVLWDAGEYSQFNPLVTTVNGEYAWDVPEGFWKVKVNKEGYESAESDWLPVPPPQTEVHFNLKPLSY
GUT_GENOME092387_00641 1244-1363 ISWIIDPAGYVYDAATKEALQGVTTTAYWIELDPEATAEEYEAFWATPPAEDEYGTLWQAEEWGQKNPLQTDQDGYYQWDVPEGWWRVQYEKEGYETAWSDWLPVPPPQTDVNIALKPEG
GUT_GENOME096372_04034 1511-1599 IDPFGTVRDSVTGAALAGVKVTLYWADTEKNRSAGKIAGTPVSLPELAKMLPGQNKNAQTTSSDGRYGWLVYPDGAYYLIAEKDGYETY
GUT_GENOME181063_01612 234-350 NWHIDPSGFVYDAETWARLPGVKTTAFYIPVPDKDDIGDFWSIPPSESEEGTLWNAAEYSAENPLYTDEEGRYAWDVPEGWWRVKYEKEGYITAWSDWLPVPPPQTEVNVGLQSQET
GUT_GENOME010328_01887 1148-1266 KLNFVVDPSGYVYDAETNDRIDGATVTAYWIPYDDSDDFWKNKPADTEYGTKWKSEEYEQANPLLTNVDGKYAWDVPEGWWRVKCEKEGYQTRWSDWMTVPPVQTDINIAMHMIGEEKP
GUT_GENOME097793_01537 1737-1853 IKWDPQGNVYDQFGKPIKGATMTVYFSKNADGSDAVAWVNEGEAGDAGDAAYEEIGRPWMWDEPGVQKTNAAGFYAWYVPDGYYYQVVAEKEGYTTAKSDWLRVIPPRFGVDLVMTN
GUT_GENOME064659_00906 1694-1816 KGIKVDVKLDPSGIVYEAVTSNTLSDVTATIYYSEESDGSNAVVWDAQPYDEINPQITNITGAYAWDVPEGYYQVRFTKEGYEDAATEWLPVPPIQVNLKTAMISKSAPAILSAHAYPEYTVV
GUT_GENOME282382_01190 1285-1398 RWKIDPSGYVYDVSDNSRLPGVTATAYWIEYTDADSDDSFWDHAPSSTEYGTKWDASDWDEINPLTTDANGAYSWDVPKGWWRVKYEKEGYDTVWSDWMPVPPVQENVNIGMTA
GUT_GENOME117834_00200 1197-1301 RWAVDPSGYIYDADTKERVGGVTVTLYYKQDENSKAIKWNASEYLQQNPLVSAEDGEYAWDVPEGLYQVKCEKDGYQTAYSDWLPVPPPQLGINIGIKKKKSEEP
GUT_GENOME243792_01180 1463-1561 DPSGQVYEAVPSNLLSGVTATIVTKDGANAENWEAADFNQENPQTTGADGSFYWNVPQGEYQVIFRKGGYQDAATDWLTVPPPHLGLMIPMVSTESPVV
GUT_GENOME139741_01309 5-115 IVDSKTNEKMSDVTVTLYWIESDGSSTSFDEAPADDNYGMPWNIEDWEQTNDSISNSDGSYSWSIPDGWWRVKCEKDGYETVWSDWISVPPSRTDVSIAMTPIAKPVIILG
GUT_GENOME096530_03148 1695-1808 IDPSGYVFEGSMDNRLEGVTAVVQEQQFNGAWKNWDAENFGQVNPQVTDEDGRYGWDVIEGNWRVLFSKEGYDNYESRVVVVPPPETKLDVPLVRSSEPVVENVSYEDASIEIS
GUT_GENOME251633_00087 2287-2385 PNYVIDPSGTIINSVTSKALSGAQVTVYYKDANGNEKIWNADDYSQLNPVLTSVNGEFAWDVPEGLWKVKVSKEGYVSSESDWLPVAPVQTGIDFKLVP
GUT_GENOME170531_00721 1016-1136 TGNIKFAVDPSGFVYEGSEENRIENVIATAYWIESDGSDTFFQNKPSSSTYGTKWNAAEYSQANPLLTDIDGKYAWDVPEGWWRVKYEKDGYETVWSDWMPVPPPQTDVNICMTKLNDPNK
GUT_GENOME142591_02732 2369-2467 WIYDPSGYVYEVTEDNRIEGVRATAMRWNEEAERWDVWDAEWYGQDNPLYTDANGRYAWDVPEGKWKVRYEKEGYLPAESEELVVLPPHFDVNIPMVST
GUT_GENOME235695_01699 1127-1236 LTVVDPSGIVYDADSKKPIYGATVTIYYKEDLGDKNAALWDAAEYDQQNPITTDEDGFYAWDVPAGYWQVKAECDGYETSYSDWLPVPPPQLEVNIPLKKINVPHVHTRS
GUT_GENOME245069_00226 1705-1813 CLDPSGIVYEAVESNPLSGVTAELWYSASKDGTNAVKWDAENYGGQINPQLTDAEGMYSWYVPDGYWKVKFTKDGYIAAETEWMEVPPPQLDVNIGLISTAAPAVQSIN
GUT_GENOME096544_01566 1096-1213 YIDPSGTVVNQLGQPVEGATVTLERSDEPDGPFAVVPDGSDLMSPANRVNPMTTGSDGNYGWDVVAGFYRLTAQKDGCTGPDGAPVTTSAVLTIPPPVTDLDLVMLCEVVGDVTAPVV
GUT_GENOME095001_01574 2285-2409 VSVDPSGYVYETFEDNRIQGVTATLYYENDAGDMEVWDAEEYNQKNPLITDNDGNYAWDVPEGLWQVVYTKEGYETVYSEKMIVPPPQLDVNIAMVSTEPAHISKVTNLGDKLKITFDKYLLVDS
GUT_GENOME019194_00175 1617-1720 YRANCAVDPSGFVYEAVESNLVEGATATIWEANDDQGAKEREWNAEPYEQENPLTTNANGAFNWDVPTGWYQIRVTKAGYGEARSAWLPVPPIQLGIKLGLVST
GUT_GENOME011393_00051 1723-1843 DPAGVVYEGLLSNTVEGATVELWSADDAAGTNAVRWDAEEYEQDNPLATGADGAYNWNTPTGWYQVRVTKDGYEEARSAWLRVPPIQTEVNIGLVSTAAPEVASAHAYTDCVEVEFTQYMD
GUT_GENOME216267_02453 623-738 MDCIKNNQEFSFSGFIRFIIDPSGIVYEAVIGNPVEGAIVTVYFKDTETGETIKWNAEDYDQMNPLLTDNDGKYLWDVPEGEWKVVCEKEGYETVESEWMSIPPVRTDVNLSLVSK
GUT_GENOME236230_01530 5-107 DPSGYIFEAVDSNRVQGVTATVYYQANDRSEVKWDAESVGQVNPQLTDNEGRYGWNVPFGYWKVRVTGGDYETVESEWMKVPPPRVDVNIAVTSLVPAKVTRV
GUT_GENOME009803_00886 1156-1283 IIDPSGYVYEGVTTNRLSDVTTTLYYKEKKTDKKGIVWDASEYDQENPLVTDEAGSYAWDVPEGYWQVKAEKEGYETAYSDWMEVPPPQTDVNIEMKPLAAPEIKEIHSYETFTKVIFTQYMDPETVK
GUT_GENOME015498_01704 1654-1766 KIMKKTHKVQPIMDPSGFVYEAVESNTLSDVKATVYYAEDSSGTNAQVWNAEDYEQINPQITDDSGVFAWDVPTGWWQVRFEKDGYENAQTEWMRVPPPKMGLKIAMISEKTP
GUT_GENOME260991_00913 1683-1782 RLGIDPSGYVYEAVKSSRLEDVTAEVWYSASADGAGAVPWDAESYEQVNPQPTDGDGVFGWYTPVGFYQVRFTKAGYEEARTEWMAVPPVRTGLEIGLRT
GUT_GENOME258868_00443 290-410 IIDPAGFVYEGVPANRLEGVKATIFYREAVDNPLTGAIEWVERKWDAEEYAQENPLFTDAEGAYAWDVPAGQWRVMFEKEGYETAWSEWLPVPPPQLEVNQEMRRVTPPSVASANAYDSEN
GUT_GENOME220585_00400 3276-3393 IDPSGYIFEGTEENRLENVTATVYYLDTASGEWVKWDSEAHGEGPNPNISGADGKYGWDVLIGKWKVVFEKDGYYTVESIELDVPPAHLDVNMSMVSTSPAMLKAVRAGAMGAYIDFT
GUT_GENOME277793_00010 1415-1538 DPSGYVYEAVESNRLQGVTTTAYWYEPEKGETEANSKLWDASEWDQCNPITTDINGCYAWDVPEGMWRVKYELDGYDTVYSDWLPVVPPQTDVNIEMCSYAVPEIAQTSVDKLSAKIVFSKYMQ
GUT_GENOME285819_00412 198-310 AIIDPSGYVYAGVESNRVEGADATVYEVAENTGTRTLWNAEAFDQSNPYITGADGFYQWMVPSGLWSVSVAANGYDAYTTGEKDGTDAKEINGTWAMPVAPVQLDVNIDLQAS
GUT_GENOME066712_00510 731-836 IDPSGQITDAQTNEPLEGAVVTLEQKQGEDWIKWDAESCMQENPQVTNEEGRYGWQVPEGVYRLKISRVGYQDKTVETYEQADGTVSDISILPIRTDVDVALESDG
GUT_GENOME262408_01455 223-323 DPSGYVYEGIESNRLSRVTTTLYYSAGSAKPTGSASESQRWNAADFGQQNPLTTDAQGQYLWMVPDGWWQVKYEKEGYDTVYSQWLPVPPVQTEVNVGLTS
GUT_GENOME006914_00298 1505-1602 YGIDPSGYVYEAVPSNRLEGVTAKILKENSGSYTEWGDAQQYGGQSSTITTTETGEYRWDVPEGTWKVEFTKAGYFKAERENLTVPPPRTDVNVGMIS
GUT_GENOME239118_00699 1088-1196 DDVTASYKYAIDPSGFVYEAVPSNRLGGVTATIYYSPNSDGSDAIKWDASEYDQNNPIKTYEDGTYFWDVPEGYWQVRFEKEGYETTYSEWLPVPPPQLDVNVGIVSKA
GUT_GENOME078845_01469 1662-1773 DPSGYVYEAVPSNRLEGVTATVYYQDGNKMMEWDAENYDQLNPQITGADGGYAWFVPEGQWKVTFSKDGYQPADSSDVPAAAANEENTGWLPVPPPQFNVDVGMISLASPEV
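
Each row of the list above holds a protein ID, a residue range, and an amino-acid sequence. The following is a minimal Python sketch for splitting such a row into its three fields, including rows in which the fields run together without separators. It assumes that every protein ID has the form `GUT_GENOME` plus six digits, an underscore, and five digits, which is inferred from the rows shown here rather than taken from any format specification. The names `ROW`, `Hit`, `parse_row`, and `range_matches` are hypothetical helpers introduced for illustration.

```python
import re
from typing import NamedTuple

# Assumed ID layout (inferred from the listed rows, not from a spec):
# "GUT_GENOME" + 6 digits + "_" + 5 digits. Whitespace between fields is optional.
ROW = re.compile(r"^(GUT_GENOME\d{6}_\d{5})\s*(\d+)-(\d+)\s*([A-Z]+)$")


class Hit(NamedTuple):
    protein: str  # protein ID
    start: int    # first residue of the range (1-based)
    end: int      # last residue of the range (inclusive)
    seq: str      # amino-acid sequence


def parse_row(line: str) -> Hit:
    """Split one 'Protein Range AA' row into its three fields."""
    m = ROW.match(line.strip())
    if m is None:
        raise ValueError(f"unrecognised row: {line[:40]!r}")
    protein, start, end, seq = m.groups()
    return Hit(protein, int(start), int(end), seq)


def range_matches(hit: Hit) -> bool:
    """Check that the inclusive range spans as many residues as the sequence has."""
    return hit.end - hit.start + 1 == len(hit.seq)


# Usage with one row from the list, written with its fields run together.
row = (
    "GUT_GENOME220434_0106114-108"
    "IDPSGYVFEAVVSNRVEGVTASCFYRDENGDMIWWNAAEYGQHNPLSTDGDGYYEWFVPVGDWKVVYEKDGY"
    "EAAESDWLPVPPPQTEVNQSLVS"
)
hit = parse_row(row)
print(hit.protein, hit.start, hit.end, range_matches(hit))
```

Comparing the range width with the sequence length is a cheap sanity check on the split: if the assumed five-digit suffix were wrong for some row, the two numbers would usually disagree.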