UHGP-MC 12101


Information


Number of sequences (UHGP-50): 196
Average sequence length: 74±10 aa
Average transmembrane regions: 0.05
Low complexity (%): 0.6
Coiled coils (%): 0.28
Disordered domains (%): 5.05

Pfam dominant architecture: PF03432
Pfam % dominant architecture: 8469
Pfam overlap: 0.19
Pfam overlap type: shifted

Downloads

Seeds: MC12101.fasta
Seeds (0.60 cdhit): MC12101_cdhit.fasta
MSA: MC12101_msa.fasta
HMM model: MC12101.hmm

Sequence list (filtered at 60% identity)

Protein Range AA
GUT_GENOME235148_00789 1-83 MAVTKIWPIKGRLRNVIDYTVNPEKTFNPDYTFEELQALKDVIDYAMESDKTEKQFYVTGVNCTAEFAREQMVNTKRLFKKED
GUT_GENOME014261_00478 1-73 MAITKIWAIKTNANNTINYVGNKDKTAHEDNDDLDDALKYINHDTTTEEKHFVSGINCNPLTAYQEMIATKRL
GUT_GENOME237330_01392 1-65 MAVTAIWSVKGSVGKVLRYVANPAKTWNGQYAEAAKFHTVERVLEYDADAMKTEKQFYVTGINCS
GUT_GENOME275586_00154 1-52 MATTKIWSVKNRLDHVLEYVSNEDKTIDKLGSYVSRNEATENQKYMTCVNCS
GUT_GENOME199922_00008 1-59 MAVTKIWPVKDSIGRLIDYAKNPDKTAFENLETVMAYAKNEDKVIFEKGETCYLVTALN
GUT_GENOME036390_01739 32-96 LLMTSRLKEVFDYAENPDKTIGKKYVDSDLFAALQYAADDKKTDERMTATKKRFGKTGGNVAYHG
GUT_GENOME138581_00117 1-71 MATTSIWSIKNNLNQSINYIINPEKTINKDYGKNSYSYLEIENTEYNYKNEKVCYVSSLNCSEYNPYEDMK
GUT_GENOME265853_00428 1-68 MAVTKIWPVRGRVDAVIDYAANPQKTSKESLKNVIEYAVNTDKTDKQLFVSGINCDTTNIKEEFNIIK
GUT_GENOME175865_01398 1-73 MATTRIWSIKGRLDSVINYVKNPEKTDSAKYTDSELQALEDVIRYAENADKTHQRLYVTGINCSSDIARDQMI
GUT_GENOME071229_00917 1-67 MATTGFWPVKANLLALIKYTENPEKTSVLGEKEMKALHDALDYDMNSDKTEKKLYVTGINCLPETAY
GUT_GENOME129191_01103 1-65 MAVTSIWLIKGRLDVILKYASNPEKTVESNYINQSELHSVDNVIEYAADEMKTEKRMYVSGIGCN
GUT_GENOME115652_01426 28-105 IATTGIWKIEKRLDNVIDYTTNIEKTINSDYGKDSYYDLHNVIDYVEADYKTEKQFYVSGVNCNPKTALEEMTITKEQ
GUT_GENOME137647_00303 1-86 MATTKIWDINVRLDHVIDYITDLEKTKNLDYGDEFSIFHNNESLDIKTEKECFVSAINCSVDIAYEEMMITKKTYNKTGGILGFHA
GUT_GENOME041969_00952 1-61 MATTSLWHIEGQLKDLIAYVENPDKTIAAKPELQSLWEVMSYVSRPEATEQGEYVSTINCL
GUT_GENOME048334_01723 1-69 MATTSIWSVKSTLGHVIEYAADDTKTANPQWSKSEYQSMRDVMDYAMNDFKTEQQYYVTALNCDAGCAR
GUT_GENOME265704_00314 1-82 MATTKLWKVENRLDNVVNYASDKSKTENKKYQNKNINNIYELLDYTTNPDKTEKQFFVTGINCELDTAIRQMTQTKKKFNKD
GUT_GENOME186073_01436 1-69 MATTAIWKVKGSLGRVVNYAANPDKTENTAFTEQDLQGLRDVMDCATQDYKTERQHYVSGINCAPKIAR
GUT_GENOME068832_00691 1-68 MAIVFIKDIKSRLDRTIAYIDNKQKIKNENYEEAFLDLHNALDYTVDDLKTEQKFFVDGINCRVENAY
GUT_GENOME133472_00388 4-76 IAVVKIWKVVTHLDQVVDYAEDHNKTDLSNFKDLNDLMYYAMNGDKTEESLYISGINCAPNNATKEMIMIKDK
GUT_GENOME116224_00668 1-94 MAWTGIHAIRFDIGSSINYTMNKEKTALSNAVDYAFNRDKTEKTIYESAICCGKGTAYREMMDTKRRFDKKDGIQGFHLVQSFAADEVTPDKCH
GUT_GENOME006621_02219 1-83 MAYTKVFAVRNHLARAVSYAANPEKTDEAITLGASVDYALNPDKTEQRLFESAVNCQRPGAAYGEMVATKQRFSKTGGVLAYH
GUT_GENOME189574_00981 1-61 MATTSLWAIRGRIDHAIEYIENEEKTLEQVIDYATNEYKTGDKHFVTCINCNMFDPQKSMI
GUT_GENOME236174_01808 1-65 MATTKIWPVYDSLRRVLDYANNPEKTEYSSLRDVIHYAENGEKTMLTGEETVYLTTAVNCDFGGD
GUT_GENOME176105_00195 1-84 MAVSKLWSVTTRLGQVIDYATNPKKTAVRIIYTKAQYQALRDVIDYAKDEEKTEQEFYVEGINCNPETARDQFVSVKQGYGKED
GUT_GENOME198843_00008 1-79 MAVTSLWRVKGYIGKVVMYAMNPEKTTEKEIYKTDKADGGAESTLGGVVSYVERDEATNLKSLVYGIKCQKDTAVKDMM
GUT_GENOME138794_01562 1-67 MAVTKIWAVKGRVDKVLSYVKNPEKTDTTLRDALHYAVNDMKTEKGFYVTGINCDPETAEKQFTIVK
GUT_GENOME054033_00427 111-209 MATTAIWPIRGRLDHVMTYTENPDKTANPKWKQADLQALSDVVEYAMDEAKTRGVVNELEHIADEGQCQCQYFTSGVNCDPATARDAMNMTKRRFGKEG
GUT_GENOME286953_00076 1-63 MAITKIIAIRDRLDKRVNYATNGEKTTLDAGITYAVNPEKTEQNFFVSTLNCSSSETAFTQMM
GUT_GENOME230962_01910 1-74 MAVTKIKRIRGNPGNPLSYIGNPEKTRNLDFSDSDRQALADVIAYAADEGKTEKQYFTTGINCDVENAREQFNI
GUT_GENOME091627_00847 1-77 MATTKIWNIKGKIGCLINYAANPEKTDENQYNKAEMQALHNVMAYAADGYKTEKRLYVSGVNVTPDTATYKMQQTKL
GUT_GENOME123748_00893 1-67 MATTKLWHIQGRLKDLVDYVENPEKTVKPGLQDFFNVFSYAQNPDKTAGGQFVTAINCQKDIALQQM
GUT_GENOME048025_00676 1-68 MATTSLWKVSGSLKKVLDYAENPDKTSLKNVIDYASDGSKTDDELFVTGINCEVERAYEMMSETKRQF
GUT_GENOME164217_00299 1-89 MAVTAIWNVKGWIGSVLNYTVNPEKTENPEWSEKQQQSMLDVMEYAMQSAPKKVLDKIIGYAIDDFKTEKQYYVTGINCDPATARNDMI
GUT_GENOME213908_00737 1-78 MATTKIWAVKDNISRVLSYASNPKKTSLKDLEQVLKYAENTDKTKDDREKVVLVTGVNCNRETAYQEMLSIKKRFNKT
GUT_GENOME276647_00275 1-67 MAVTSIWAVHGDNVATVMKYIANQDKTDTGQFPAVASFHAVNRVLQYGADEMKTEQQLLVTGILCDT
GUT_GENOME021693_00471 1-75 MATTGFWPVKGRLKDVINYAENPDKTIERKYLDDDLAAALNYVENSDKTDRTMYVSGINCPKKRAYEQMMPTKRR
GUT_GENOME232220_05358 1-80 MATTAIWDVTDRLKRVIDYATNPDKTAQDRAEKNEGLRQVLEYAKADAKTERQLYVSGINCDPLTAYEQMQRTKRQFQKT
GUT_GENOME151837_01400 1-73 MATTGMWAVKNRLDHLVNYVSNPYKTIGLETVIDYTTNGEKTIEKRYVTYLNCLFDNPKLSMVNTKRHFHDES
GUT_GENOME237880_00756 1-65 MAVTKLWPIRGRVDSPLNYAADEEKTANPGWDKSSLQNLTDVMHYAADENKTEKQFFVSGVNCNP
GUT_GENOME048914_00280 1-74 MAVTKIWAIHDSVSRVVDYCTNPSKAKLSDLEQVLLYAADKGKTLDEGEQQYAVTGIGCRAESAAREMAAVQRR
GUT_GENOME210683_00384 1-85 MAYTSIIPVSRLDNSITYIRNKDKTTKKGQSAGSLEEAIDYAMNRDKTERSIFEDAIGCVCETAYQDMVATKKRYHKMDGVQGYH
GUT_GENOME088525_01227 1-77 MAYTKILVIHNRLDKCVGYTQDPEKTSLEAAIDYALDREKTEQTCYETAINCDRESVYADMMDTKRRWGKEGRKRKG
GUT_GENOME044209_00656 3-72 LAVTGFWPIYKNLRATLDYADNPDKTTAPEYLDEDLYAALRYAENDDKTDRKMFTGGINCSAQNAYAEMI
GUT_GENOME053414_00035 1-83 MAVTRLWKVTDRLDRVIDYAEDEEKTRKPKFSDTEWQALKDVLEYAKDEEKTEQQFFVTGINCNAGKAREQFIAVKQQYNKTD
GUT_GENOME091626_00651 1-82 MATTKIWKVQKRLDHVIDYATNEEKTRKNSNGYGMNRFDSIRQVMAYATNPDKTEKQFYTTGINCEVKDSVKQMQLVKMIYG
GUT_GENOME243525_00251 1-65 MAVCEIWDVRGRLDHPIDYAENPEKTANPKYTEADLQAMVDVMEYATNKDKTEQRFFVTGINCDP
GUT_GENOME255439_00425 1-69 MATTGIWKVEKRLDHVIDYVINPEKTMKDGDYQELHKLDGYKKLNYETEEECFVSALNCSTHRPYKDMM
GUT_GENOME037770_00389 1-65 MATTKIWPIHGNLSCVIDYISNPEKTVGNDEIYKLLHYVSNEKKTAGEDAGSKEKVMFVTTLNIV
GUT_GENOME096561_02456 17-96 VAYTKIINIRNLGHLTDTINYAANVTKTLETLTEYAGNEEKTVDGARYVSAINCCPETAAKAMTATKKKFGKEDGRVAYH
GUT_GENOME015857_00556 7-79 IATTSLWKVGGNLKGTIDYIENEEKTKIPIDSLETTLNYAENSNKTEKQLYVTGINCNVDSALKEMNEVKKSF
GUT_GENOME001517_03091 4-67 AVTSIWCVKDRLDHVVHYVEDEEKTVDAVIDYVNNDNKTNEKKYVTCINCDSFDPCKSMMNTKK
GUT_GENOME284875_03078 1-77 MATTKIWPVQSRIDHVLNYVMNKDKTENAAFETVLTDDDSEVLKQVTDYVAQDEKTKQPFYVTGLNCLPDTAVQEMV
GUT_GENOME147107_00608 1-88 MAATAIWDIRGRLDHVLGYAENPEKTKNPDWGKTNSADMTDVMEEAMRQAQARGLADVIEYATEDAKTEQQYFVSSINCAKQTARKEM
GUT_GENOME243988_00374 4-74 MAYTKIFPIRTRLDRRVRYALNGKKTALETAAGYALDAAKTESVLFTDAFNCGLAAPCAEMYATKRRWGKD
GUT_GENOME256613_00644 1-68 MATTKLWKFKSRLDRLIGYAINGEKTENKLYVSGINCMPDTAFYEMTNVKKQFFKTGGIECYHGYQSF
GUT_GENOME026955_00608 1-63 MATTSVWAVNDNLKRVLDYAANKEKTESKDDDLSDVLSYAANADKTDNMSFVSGINLDPAHAY
GUT_GENOME121961_00923 1-77 MAVTKIWPVRGRIEQPISYAMNPKKTDKNLWAGDASIEDVISYAANSEKTEIICKTAEKQYYVSGINCEPESAAEEF
GUT_GENOME170146_02234 1-76 MATTKILPVRKRLKDCLDYAANPKKTEIFRGDALDRLMHYTQNEDKTEHQLYVTGFNCDPQNACLIMEATKRRWHK
GUT_GENOME084935_01130 1-72 MATTSIWKVRGSVGKVVRYAENPEKTSIEDTLGDAEGGLDDVIRYAAQPAKTDCRQLVSGINCVPETAAEEM
GUT_GENOME087264_00872 1-71 MAVTKIWDVKSRLDHVIDYVTDPEKTYNAGYDDGEQKYFVTAMNCNATCAKRQFQNVKKQFSKTSGIIAYH
GUT_GENOME159385_00683 4-73 MAYTKIKAVKNHLKRCLDYAANPAKTKTGQDLQDALTYAHNSDKTEQQLFVTGFNCDPAHACEVMLRTET
GUT_GENOME171590_02088 1-78 MAITKIWPVKDSLNRVIEYAKNVDKTENPDFLNSDLYQVLHYAADENKTRFERQYYVTGINCAVETAGQQMQITKERF
GUT_GENOME239207_01964 10-94 MAVTKIWAIRGRIDHVLNYTMNDEKTRDREQGNSEIEFDDAEIQSLYDVMNYAMDNAKTEERLFVTGVNCDAENARSEMIKTKEQ
GUT_GENOME001315_01437 1-66 MAVTKIWPIKDSIRRVVEYAENHEKTQCSDLETVIHYAGNRKKAEQDQTIFVTGINCGRDTAFQEM
GUT_GENOME084462_00640 1-68 MATTKIWSVKGSIKRVVDYASNPDKTTNDDYDEPVGFDDVNNVVKYATNSKKTEQALFVTSLNCGSDP
GUT_GENOME155771_01574 11-90 MAYTKIFAIRVRLDDRVAYVTNVDKIKQLGEMIEYSANGEKTEEHLFETAFNCQSTKTAFREMMDTKERWHKTDGVLGYH
GUT_GENOME104031_01604 1-99 MATTAIWPIRGRLDRAIAYIKDQEKTYNPAWEQGDFPALSSLMQCAMEQAAAQGMGNAIGYVTDGEKTDYQYYVSGIRCDPQSAREAMALTKRRFQKEG
GUT_GENOME259428_00917 3-85 LATTKIWPVRDSLRRVVDYAANPDKTQYGGLAQALHYAGNGEKTNLTETVQLVSGIHCRPETAWREMRAVQEQFGKTRGVVAM
GUT_GENOME055562_00640 7-89 VATTKIWKVVKRLDHVMDYATDEKKTQKVEFVVEKGNYIVLVDDLKDVLDYAMNSDKTEKQFYTTGINCEVDSAYEEMIDTKL
GUT_GENOME272775_00877 1-73 MAVLKIHVIRHRLDDRLKYAINPEKTAQPDFPNLNGLPDTLTDAFNCYCDSAYEDMTETKKFIGKEGNVLGYH
GUT_GENOME255533_00236 146-245 VATTSIWRVKGWLGKLVVYVENPDKTENPALYEKQDMTNRDTQGLADVIDYAVQQEKTEKIETDDEGAGIMQRFVSGVNCTPATARDEMIAVKKRFGKEG
GUT_GENOME246859_00582 1-74 MAITKIFAVRSRLDDSIKYATNAEKTSAEVLEGKIEYALNAEKTEKQIYETALNCFSASTAYSEMQRTKEKWHK
GUT_GENOME056541_01131 1-61 MAVTKIWPVKSHLGELLKYVGNRSKTSADGNVNPEIMKLIGYDTDGFKTEQRLYVTGVNCT
GUT_GENOME198857_00793 4-86 IAVTKLCKIGSNLGRTIDYAEDKKKTNSNLKDLNVAIDYAMNKDKTEESFYISGINCEIDNALEQMNQTKKLFNKTKGILGFH
GUT_GENOME251527_01003 1-66 MAVTKIWPVRDRFDKVLDYAANAKKTDADIEKYHALDGVIDYAADGDKTEKCFYVSGVNCMPENAK
GUT_GENOME042361_00719 1-84 MAITKILNIKESEGRNPASHLKNALEYIQNPDKTEECVLVGGINCLPDTAFEQMEETKNIFHKTGKRQGYHVIISFSPEEKVTA
GUT_GENOME068185_01075 1-82 MATTSLWRVRGYIGKVLLYAENPEKTTLPKSIGFDDADKDALEDVIAYASREEATNQRQLISGINCCPKTARTEMLHTKEAF
GUT_GENOME130695_00866 1-69 MAVTKIWNIKDSITRVVDYAMNPEKTTDDTVLRHDLSQVLDYAENADKTDHFCYVTGVNCIPETAAQQM
GUT_GENOME135531_00365 1-69 MAVVKIWKVKSNLNKVIDYVSDENKTLKDEDSSLLKELHQVIEYTTDSFKTEEKCYVTGINCIPATAYQ
GUT_GENOME107341_01759 110-201 LAVTSIWRVKGRLERVVRYAKNPEKTQNPDLSGNGMLEYVIRYASDPEKTAAPHIHDEKIPLMKQFVSGINCMPETAEAEMLAVKKRFGKES
GUT_GENOME207165_00048 1-89 MAITKIIPIKVRLDHVLDYACNENKTNNKNYGAINYQDLHNVLDYIEADYKTEKKLYVTGINCNAEDAYEQMKITKQIYEKEDGILAFH
GUT_GENOME000059_03347 1-77 MDHVIDYVNNAEKTENVKFSRQDLQGLRDVMDYMLQDYKTEQQYYVTGLNCHPQTARQEMMVTKKQYQKEGGIIAYH
GUT_GENOME246611_00588 1-82 MAYDKIIPIRRRLDHCVAYVLNSEKTDLGRVLEYIGNAGKTIAPDGEAVLETAINCQLDTAYREMQATKRRWGKTGGVLGYH
GUT_GENOME193290_00120 1-67 MAVTKIWPVRGRLDSLIRYAENEEKTRMPDLTGLAPAKDDLAEVLQYAANHSKTTQDRQLFVSGLNC
GUT_GENOME284166_01056 1-70 MATTRLWSICSRLDLVVKYVGNKEKTTTDEDIQTLRKVLTYAENDYKTEEKKYVTGINCSPETAYKEMQI
GUT_GENOME043645_00878 1-68 MAVTKIWAVTDSVNRVLSYAANPDKTIYSDICKALHYASDERKTVIGEEKAMYVTGVNCTAETAFAEM
GUT_GENOME000612_02511 1-62 MAYTKVFAIRTRLDNRVKYVANTEKTGLGERLAYAADPEKTERRLFVSALNCESPATAYAEM
GUT_GENOME250410_00804 1-82 MATTSIWAIKGTAAAIRRVEMYIENPEKTIEQTGLEYQSSLHQIDQNGSMIHEYSPEELEQEKVCYVSGINCTAPNDACEEF
GUT_GENOME116778_01179 1-78 MATTRFWKIEGTGKNLSNVINYVNNPEKVEPGEYLELNLLHLLDYATNEEKTEKSAYVTGINCSVQNSFNEFMTVKDQ
GUT_GENOME137768_00850 1-71 MGVTKIWDIKVSLYKALRYVTDPNKTVLQDDTASAGAVSAAIQYAANGNKTQSGRYVTAMNCNATCAEKQF
GUT_GENOME062463_01092 1-77 MAVTSIWKVEGRLARVLGYAGNPDKAEGSPDEWELQGVGAAILYAADPGKTERQLYVTGVNCLPATALEEMNATKRQ
GUT_GENOME173896_00626 8-85 MAVTSIWPIHGRVDKVIDYVRNPEKTTEAGLPELASLHAVEDVVEYAADEMKTERRSYVSCLNCREDTAAAQFMETKR
GUT_GENOME090240_01328 1-68 MAVTKIWQVKGSLADVVKYAGNPEKTRTHDLARVVGYAADEHKTLDENEQFYAVTGINCNAKTALAEM
GUT_GENOME156476_01658 1-96 MAVTSIWSIKGWIGKVINYAENPDKTKENNEGRISENQSRSTQGLNDVITYGFRVLLTFADESTKTQQHAGFTTKREANAFRDEVIGQLHTGTYIV
GUT_GENOME227337_00642 1-79 MAITKVFAIRIDFKQTVRYAVNEQKTSLDGMIDYAVNPDKTEQRLFESCLNCSSVENASRDMERTKRRYNKTGGVQGYH
GUT_GENOME170705_00654 1-87 MATTKIWAIKDSVKRVIDYTCNPKKTEISDDLWAVLHYSMNPNKVVTRYEETTAFVSGVGCSTENAYQEMMQIKHLFGKTMGNQAYH
GUT_GENOME098057_01048 1-77 MAITAIWDIKDSLKRIINYAANKEKTEIENDDLMNSIRYISHDVKTENKQYVTGINCSLKTAYDEMTATKRNFGKES
GUT_GENOME079526_01910 1-65 MAVTSIWAVHRSIKDALDYAANPEKTENPNEEDLRRLMDYAENPDKTEQRYLISGVNCLPELAYE
GUT_GENOME110977_02021 1-121 MAVTKIWTIHEGSDIKQVIDYAANEGKTVLNIHVETDETYNQVERQQFQDVIDYTMAGYHDENDDMANVLDYAAKGIKTEQKKYVSGINCSPEYARDQMMLTKTHYHKKGGILLWHGYQSF
GUT_GENOME059279_00922 1-73 MATTSIWDVKDNLKRVLDYISNPEKTKNTNESDYHYNGLGQAISYTTQDCKTEKQLYVSGINCSMATAYQDMM
GUT_GENOME116475_01881 1-88 MAIIKVFAVRKQLKKTVNYITDSDKTDSDLARKIDYALNSEKTAGTAAASEQFLYESVINLPDVSSAYERMQATKERFGKTEGVLGYH
GUT_GENOME243099_00797 1-83 MAYLGIKPFKARLDKGVDYVTNEEKTKMKEAVLETLYEYDTNIDKTNHFGNKLITGINCDFEKAASEFIEIQKMYGKDDDLIS
GUT_GENOME238888_00388 1-104 MATTSIWRVGGRLGKIIDYAENPEKTTNPSVAHPSVAPDCERDLSDVINYAMQQRKTVKESGDEEQPVMLRFVSGVNCHPDTAGEEMLMVKRRFGKEGGTLAYH
GUT_GENOME158522_00881 1-72 MAVTKIWKINGWIGDCLLYIENPNKTQNPLVIGDPKLKEEEYQSLIDVIEYAEDGNKTVRDTERFVSGVNCD
GUT_GENOME237704_01313 1-61 MATTKMWPIKDSVARVIAYADNPQKTAAGLSTALHYATDEVKTEAPMEEKRLFITALNCNG
GUT_GENOME117260_02592 1-60 MATTSIWKVKNRIDHVIDYVTDTNKTIGLQQVLNYTTNEEKTLQHQYVSCINCMRNNPYQ
GUT_GENOME085353_00475 1-75 MAITKIWSVKSRLDTSLNYITNPEKTNVKPDIDAIEGVIQYIENKDKTEECKYVRAFNCSKDNAFDRMIQTQDSF
GUT_GENOME163660_00438 7-73 LAITKIWAVKDDLRRVLNYIENPDKTKEELSDGLKEVLAYTTQGYKTNEKEYITGINCEPSTSLKQM
GUT_GENOME256701_01098 1-77 MATTSLWEIHQRLDKVINYATDSEKIKNKNYDQELYKSLHNTIEYATNNFKTEKQMYVTAINCDEKTAFKEMIKTKH
GUT_GENOME284170_01181 1-61 MAVTKIWKVTNNLGKVINYSENGSKTKRTSLDDTLDYAMNKDKTEKQFFVTGINCDARIAS
GUT_GENOME275241_01075 4-81 IATTAIWKIDNKLSRVIDYTSNAEKTKNMVDPDLYKSLHDVIEYAEADFKTEEQYLVTGLNCHPDNAYEEMKLTKEFF
GUT_GENOME057742_00245 1-89 MAVTSIWSIKGWIGKVINYAENPDKTKEQTSQELISESETGQLQGLNDVITYAVNAEKTRLRQSKSMEMEVVGESEELMEQYVSGLNCA
GUT_GENOME080265_02707 1-67 MATTKLWKITDRLDHIIDYVYNIEKTSSLIDTVHYVSNEEKTLQRRYVTCINCNENDPCQSMNNTKK
GUT_GENOME149708_01215 11-101 MATTAIWDVKGRLGKILIYAENPDKTDNPDFYQSNNDQDIQSLGDVIEYAMRDNATQSEELKQQFVSGINCFQNTARQEMLAVKKKYGKED
GUT_GENOME077638_00160 1-70 MATTSIWAVKYSLKDVLDYAVNPMKTENKDYEDYQFQGLEDVIEYTTDDLKTEKQFYVSGVNCDPSNVYE
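
The rows of the sequence list can be split into their three columns (Protein, Range, AA) with a short script. The following is a minimal Python sketch, not an official UHGP tool. It assumes that every protein ID has the form `GUT_GENOME` plus six digits, an underscore and five digits, and that Range is an inclusive residue interval; both assumptions are inferred from the rows above. The names `ROW`, `parse_row` and `range_matches_sequence` are hypothetical.

```python
import re

# Assumed layout: protein ID, residue range, amino-acid sequence.
# Whitespace between the columns is optional, so rows whose columns
# run together are parsed as well.
ROW = re.compile(r"^(GUT_GENOME\d{6}_\d{5})\s*(\d+)-(\d+)\s*([A-Z]+)$")


def parse_row(line):
    """Split one row into (protein_id, start, end, sequence)."""
    match = ROW.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognised row: {line!r}")
    protein_id, start, end, sequence = match.groups()
    return protein_id, int(start), int(end), sequence


def range_matches_sequence(start, end, sequence):
    """Return True when the inclusive range start..end covers len(sequence) residues."""
    return end - start + 1 == len(sequence)


if __name__ == "__main__":
    row = ("GUT_GENOME036390_01739 32-96 "
           "LLMTSRLKEVFDYAENPDKTIGKKYVDSDLFAALQYAADDKKTDERMTATKKRFGKTGGNVAYHG")
    protein_id, start, end, sequence = parse_row(row)
    print(protein_id, start, end, len(sequence))
    print(range_matches_sequence(start, end, sequence))
```

Checking the range against the sequence length is a cheap way to confirm that a row was split at the right place between the ID and the range.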