UHGP-MC 5944


Information


Number of sequences (UHGP-50): 77
Average sequence length: 119±13 aa
Average transmembrane regions: 0.14
Low complexity (%): 5.86
Coiled coils (%): 0
Disordered domains (%): 14.21

Pfam dominant architecture: PF00574
Pfam % dominant architecture: 260
Pfam overlap: 0.48
Pfam overlap type: reduced

Downloads

Seeds: MC5944.fasta
Seeds (0.60 cdhit): MC5944_cdhit.fasta
MSA: MC5944_msa.fasta
HMM model: MC5944.hmm

Sequence list (filtered at 60% identity)

Protein Range AA
GUT_GENOME220380_00608 1-82 MAQRKALALGSGHKQHSRNRRRHAGAYREDIARYILHGVIDAKTGIHAATGRIDIDAHILARVRAIEVQQLALYHIGGIIIY
GUT_GENOME274993_00541 1-116 MDHDLRVGKRKALALRAGGQQKRAHRRGHAHADGGHVALDELHGVIDRHAVGHAAAGGVDVELNVLIGVLCLKIEHLRNDQTGSGAVDLFGQDDDTVVEQAGEDIIRPLAAACLLD
GUT_GENOME204328_00406 17-142 NLTRRYLDIRGLSLCPTHRLVNHHTAVGQGRPLAYDSYHQQYGCHRRRHARANRSYGTADKLHRIINTQSGIHRTAGRIDIDGNIFSRIHRIQIKQLSLQRIGRVIIYLGSQEYDTIHHQTGKHIH
GUT_GENOME083078_00539 11-138 LTRYAAAAYEGLVYHDLAVRQSDTLSLCTGRQQECAHRSSHAYTNCRNVALDILHGIIYCHTGCDNSSGAVDIQINILFRVLSLKIKKLSYYHACRSVVDLVAEHYYSVIEQARVNIVAALSSSRLFY
GUT_GENOME061026_00608 57-178 LGADIRCGTANTTEGLVHEDAGVRGCVALADGACGEQELTHGCCHTGHHGHHVVRDELHGVVDCHTSGHGAAGRVDVQVDVCLGVFSREQQHLGADEVSVGVAHLGAEPDNALLQEAVVDVI
GUT_GENOME199445_01437 16-134 MLGHDLDIGLLRHAHAAHQLVDEHLRVAVHLTVVAAGQDHRGHRGHDALAHRDHGAIERLHGVDERQPRRDAAARAVDDEGDGLGRTLMLQHQHGLNKAVAGRIVDLALQQQHTAVEQF
GUT_GENOME111394_00555 6-133 ACLSLHSAERLMDEYFAVGQRDTLSFRACRKQERRHTRRHTDAYCRNVAFDEVHCVVYSHAGGYAAAGAVDIQADILLGVLRFKVQELSHHERRGDVVYLLTEEYDTVVEQSGIDVKRTLAAVRLFNY
GUT_GENOME209659_00942 62-175 MDCNICRLSLRTAEGLMDHNAGMRQGVAFPFFAGCQKYGRHGGAGTDTDGRHIRADILHRIINRHACRHNTAGAVDVKMDIFVGVFRFQKEKLGNNHVGGMIINGAVKKYNTVF
GUT_GENOME084626_01162 35-167 VNFDVGRLSLHAAQGLMNHDFAVGQRHAFSFCAGTEQKCAHARRHADADGADVALDKIHGVIDCHARGDGAAGGVDIQIDVLVGVLRLQKQKLRNHQRGGLIIDFIAQKHDAVIEQAGINIIRPLAAAGLFDD
GUT_GENOME166061_00851 392-517 LEIGLLRALSRTTHRLMDHYLGIRQDNTLTGSASGKQNRTHRRSKPHANGGNITFDVLHGVIDGKAGRNRASGRIDVERDILIGINRFQVQKLRNYRIRNGVVNGLAQENDALVEQTGIDIIAALT
GUT_GENOME213297_00952 1-72 MNHDLTVRESAAFADLTSSQEKSSHTGGHANAGGSYVTTDILHRIIDRHPSRDRAAGAVDIETDILIRILSL
GUT_GENOME034822_00825 8-134 LALYATQRLMDHNLRVWQGKPFAPGPGGQQDRSHTGSGTHTDSRNIRFDVLHRIVNSHAGSDDAARTVDIQMDILIRIFCFQEKKLGNYNIGRHIFNRAAEKNDAVFQQAGVYIITPFALTGLFNDH
GUT_GENOME221874_00854 382-513 MDTDIRCLPSCAAAGLMDHDLCIGQSISLALCAACQKKCAHACRHADTDGGYIRLDILHGIINCHACRYRTAGAVDVEVDVLIGILRRQKQQLCHNQTCGYIVDLFPQKNDSVLQQSGINIIRTFATVGLFY
GUT_GENOME164134_01288 1-116 MNHNFAVRQSAALALLTRHEQERAHRRRHADAHRAHLRLDVLHGIINCHTSRHRTAGAVDVQADISIRVFHFQKEHLRHDGVRHVVVDFAAQEDNAILEQAGINVIRPLAAVGLFD
GUT_GENOME272508_00634 38-151 NFRRLSACTAKGLMNKYFAVGQRAALALRTRCQQECAHACRHANAYRRYIAADILHGVIYSHAGGYASAGAIYIQAYVLIRVIRLKEKHLRNDKVSLVVRHFPAKEYYPVFKQP
GUT_GENOME103523_00820 21-138 SLMRDMGGVGKHKALALRTGGQQDGAHRSGHAHADRGHVALDEVHGIEDGQPGGNRSSGRVDVERDVLIGIGALQMEKLRDHGIGNVVVNRLAQKDDAVVQQARVDVEAALAARRLLD
GUT_GENOME075195_01164 1-113 MNHHATVGQDLALVSGNEDERAHARCHAQADGMDGTGHGLHRVVGGKAALDLAAGAVDVERDGGLGILSLQVDETADDPGTRLGVDGARELELAVGEHLVADVEAVDALGALA
GUT_GENOME228682_01663 18-140 GSDLNIGGLTLDSAHGLMYHDSGMRQCETLAALACDKKNGGHRGGHTRTYSRDIRIDILHGIINSETGINRTAGRIYIQTDILLRIDRIKVEQLSLNDIRTLIRHLSAEENDTVHHQTGKNVH
GUT_GENOME209997_02079 19-144 SNFNIRGLSLCASHRLVNHDTRVRQCRTLAFFTGNQQYSCHRSRHTCTNRSHITRDKLHRIVNAQSGVHTSARRVDINRNILARINRVQIQQLCLKRISSIIVHLCTEENNAVHHQAGEYVELSYI
GUT_GENOME151884_00831 1-116 MDHDAAVRQRIALALFARGEKERPHRGGEPHADRVHLRTDEAHRVENPHSRVDGAAGAVDVELNVLVGIFAFQKEELSDHEARAHVVDLAGQKNDAVAQKARIKVEAAFAAAGIFK
GUT_GENOME223394_00206 54-167 MNHNVCCLTLSTTTWLVNHNISIRKSKSFALCACCEQESTHTCSHTHTNSRNITLDVIHCVINSHTVCNRTTWTIDIKTNILRCVLSFKEKKLSDNKACSYIINFLCEHNNSVV
GUT_GENOME139139_01722 471-603 MDLDLGRLSLYAAKRLVDHDIRMREGIAFALRTGGQKDSRHAGTGSDTDGGNIRMDILHRIIDRKAGGHDAAGAVDVKMDIFRRVFGFQKEKLRNDDIGHMVFNLAVKEYDAVLQKTGINIIGALSASGLFDN
GUT_GENOME023616_00431 1-117 MNHNLGIRQGEPFAPSSAGKQKRPHGTGLTDTHGVNITTDVLHSIIHRHPGGDGTAGRIYVKENVFFGIFRFQKQQLSTNQRRHLIINLSGQKDNPLLEQTRIDVKRALSFGGLLNH
GUT_GENOME278101_01188 8-134 LALGTAQWLMDHDTGMGQGKPFALGSGCQEDGGHRGCHADANGRNIGTDVLHGVINGQSSTHTAARAIDVEMDILIRIFRFEKEELGHNQIGHMIFNLSGHKNDAIFQKSGVNIITAFALAGLFNNN
GUT_GENOME198840_00845 118-231 ICTGLAASMGAVLLTAGAHRGRQPGADGRHVGIDELHGVVNTQSGRNRPAGRINIYLYVLFGIDRFEEEQLCLDDIGRVVVDRRAEEDDAIHHQAREDVHRSDVQLALLDDRRR
GUT_GENOME057900_01625 3-112 QSGPLALRSGTQQHSGHRSGHTGTYCRHVARNILHGIVNAQPSVNTSSRRIKINRYIFSRIGRVEIQQLCLNHIGGIIVDLCSEKNDAIHHQSRENIHLSHIQLPLFNNR
GUT_GENOME097503_03696 20-133 NFDIGRLALVAAQRLVNHHARIRQAKTFTFCPCGQQESAHRGRLAHTDGADIRLDKLHGVIDRHPGSDHPAWGVNVQIDIFFWVFRFQEQHLRHNQVSHVVFNLASQKNDAFLK
GUT_GENOME078968_01026 54-167 NIACLTLCAAQRLVNHNLAVGKRVAFALRAGGKKERAHRRGQADRNGADVALDIIHRVENRHAVGNAAARAVDVQRDVLMRIARFEINKLSDDIVCKVGIDFAAQKDNSVFEKP
GUT_GENOME243296_01148 20-154 DLHVGGLSLCPAERLVDHHARVREGKTLPLSPRHEDDRCHTGSHAGGDGADVAVDVLHRVVDPVAGIDRSAGTIDVDGDVLAGVARLEEEELRLYHIGDIVIDGGAEEDNPIHHQAAEDVHLSDIELSLLDDGRI
GUT_GENOME164147_01284 19-144 LDFDFRSLAKTLVDGWLVDEYAGVWQHQTLALGTGCQQDGGRRSGLTEAHGLHVRLDVLHGVIDGGHGRHGSAWGIDVHHHIAVRILAFEHQQLGHDVIGGSVIDLHAHEDDAVFKQTGVRILPFE
GUT_GENOME048366_01633 22-140 DIRSLSLSTTERLMNQHAGMRQGGSLSCGASTEKNRGHRCCETCADRSHIRLYELHSVIDSQSGRNLTSRGIDVEGYVRSGVGRGKEEKLCLDYVGDVVIYRDPEKNDTIHHKAAEHVH
GUT_GENOME067160_00497 1-117 MDHNLAGGPRQALPLLPGHQQKRAHAGRHTDAHRAHIALDVVHGVIDSHTSGYAAAGAIDVQGDIRIRIFHLQEQKLGHNQIGHIIVNFAAKEDDTVFQKARIDIERTFTAIGLFNN
GUT_GENOME068617_00743 4-135 DVRCLTLHSAERLMDHYLAVRKCEALTLCAGGEQERAHARRHADADGGNVTLDIVHGVVYCKTCADRAAGAVDVEADVLVGVLCLEKEQLSHYERCGHVVYLVCEEYYAIVEQAGVYVVCSLTSAGLFDYGW
GUT_GENOME114103_00526 1-117 MNHDAGVGKRIALALGAGRQKEGPHRSGHAHADRLHVRLDELHRVVNAQTGVDRSARAIDVERNILVGIFTFKEQQLGNHKICRFVVDFARQEHDAVAQQTRINIEAAFTAAALFDH
GUT_GENOME107712_01288 22-143 DFNIRGLPLGTSHRLVYHHAGVGQCGAFAFGTGNQQYGCHACRHSSTDGSHITTDELHGIINTQTGIYRTARRIDIDGNILARIHRIQIKQLSLQSIGCIVINLRTEENDAVHHQTGKHVQL
GUT_GENOME254054_01082 5-105 FSAGACGAIALFAVGEDQCGHACCRAECVGMDWAFDAMHIVDDAETRTDTPARRVDDELNRLMVDGVEVKQLRDNLFGGLVIDWLRQEDGALKEQGGTNTG
GUT_GENOME017838_00991 1-116 MDHDAAVRQDFALIAGDEQQRSHACRHVKANGADGAAHGLHDVVKRQAGLDFAARAVDIKRDRHRGILALEVKQAADDDGARFCIHRGHKLELALVKHLVADVEAIEAFCGFADDL
GUT_GENOME082961_00666 1-116 MDHDPGVRERVALALRAGGKKESTHARCHSEADRVHIGLDELNRIVDPEAGVNAAPGAIDIERDVLIGILALEEQELRNDEVRRFIGHGARQENHPIPKQSGIDIERALTAPALLY
GUT_GENOME049403_00972 1-103 MRQRRTFSLGSGRQQDGPHAGSQPGADRSHVRTDQLHRIVDSQPGRHRSSGRVDINLNVFVRLGRFEKQQLRLDDIGHVVINGATQKNDPIHHQPRKDVHRGH
GUT_GENOME174670_02987 32-133 DQRLMDHDLGIGQRHPLSLRPPGKKEGTEACSQTDTQRAHITFDKLHGVIYGKAGRHGSARTVDIQLDIPFRILRLQVQKLRNNQLRRCIADFLSQKNNPIG
GUT_GENOME005061_01478 1-116 MDHDATVRKHFSPSARDKDQCRHACGHAQADGGDGAGERLHDVVERKAGLDLAARAVDEHLDGSDGVGALQVEQAPDEDGGRLGVDGAAKLQLAVGKHLVTHVEAADALAGLLQDL
GUT_GENOME017745_00509 3-132 FNIGSLTLHSAKRLVNHNLAVRKREAFAFCAGTEQERRHGSSHSYADGGNITFDVVHGIVNSHACGNGTTWAVDIKTYVLVGILTFKIKKLCYNKACGGVVYFIAKHYNSVIKQSGKNVVRALAPVCLLN
GUT_GENOME209003_01537 65-197 LNFNVCRLPFGTAGRLMNHNLRIRQRKPLALGAAGQQERSHRTGLSDAHGRYVAADVLHGIVHCHAGRNNAAGRVNVQKDVLFGILGFQKQKLSANQRRNFVVNLSGQKNNPFFQQARKDVKRALALGRLFND
GUT_GENOME197981_01424 13-125 LGCLTLCTAGWLMNHDFRVWQSHSLSFGTAGKKECSHAGSHTDTDRGYITFDVLHRIVDSKSCAYRTSRTVDIHVDIFSRIFRLQEQKLCNDQVCRYVGNFLSQKNNTVVQQS
GUT_GENOME139883_00871 31-151 DLLSLYLNIGALTLAAAGGLMYHDLRIGKCQTLSLCAGGKQKCAHGGGKADADSRNIGLYILHGIVDSHARRDASAGAVNIKLYVFIGVLCFKVKKLCNNKARSRIIDFLGQHDDAVIEKS
GUT_GENOME027328_00958 1-98 MNHDLRIRKRIALAFRTAGEQKGAHARRHADADRGDIALHILHGVIDCHPGRNTAAGTVNIEVDVLGRILRLKKQELCDHKTRADITDLLIKKYNSVL
GUT_GENOME232407_01078 1-118 MNHDFRIRQSIPFALGTGGEQKGSHRCRHSDADRGNIALDIVHGIIDRKACGNRSAGAVDIQGNVLVGVLCFQEQKLCDHEGSGDVVHFVRQKDDAVVQKPGIDVIRPFTPAGLFNDG
GUT_GENOME202038_00860 59-172 VSGGAADAAGRLVHEHTGVRGQVTLALGTGAQQELAHGSTHAQAHGRHVRLDELHGVVDGHAGGDGAARRVDVQPDVLLRVFGGQHQQLSADAVGHIVLDLFTHPDDAVFEQTV
GUT_GENOME219266_00375 5-136 DFDISRLSLCAAQRLMDHDFRIREGHSLAFRAACQQKGSHACRHTDTDRRDIAFDKFHRIINCHTGRNRAAGAVDIQMDILVRVFRFQEKKLSHNEARRYIVSFLTQENNSFFEQSGINVIGTFSAACLFDY
GUT_GENOME243952_01401 17-142 DLTSLNLDIRSLTLNTTQWLVNHHTAMWQGRTLAFLTRNEQYGSHGSSHTCTDGSYITGNKLHSIIDTQTGSNTTARTVQIDGNILAAIYRVEIEQLCLQGVGSIIINFCTQEDDAIHHQAREHVH
GUT_GENOME021143_00763 561-682 DFNIARLTFCTAQRLMNHDARMRQCIALAFFACGQKERAHRRGHAEADRIHLRANETHRIKDRHAGRNRAARAVDVKRNILVRIFAFKKEQLSDDQRGRLIVYFRREENNAVTQQTRVNVKT
GUT_GENOME076549_01794 21-152 LDVRRLPLGPAQRLMDHHPRMGQRRALACSTGGQQHGAHRSGQSRTDGRHIGLHQLHRIVNSQTGRHRSARGVDINLDVLFRVGRLQEQQLCLDDIGRIVVNRSPQKDDAIHHQTRKDVHRRHVQLALFDDR
GUT_GENOME071666_01592 1-100 MWQSRTHAFFACHQQHCTHRGCHAGTDSSHLATDKLHGIIDAQTGIYRPARGIQIDIDVALRIHRVEIEQLGLNHVGSIVVHLCAEEDDTVHHQTGEYVH
GUT_GENOME254504_01174 16-156 LFRSDFEIGLLRAALGATHGLMNHDLRIRRDETLARGTRSSKHGGHGSGHAHANGGHVAANELHGIVDGQTCAHAASRGVDVERDIFLGIGALEVQELRDNDVCNIVVNGAAQEHNAVIEQSGVNVIRALAAGRLLDDVRN
GUT_GENOME197708_00734 70-191 AAGLVNHDLGVRQAEALALGAGRKQHSGHRRGHTDANGGDLRLNEVHGVKNRQTGVDLATGRIDVERDILLGILALKMQQLGNDQVGADGVDLLAQEDDAVVEQARIDIVAALAARRLLDNV
GUT_GENOME254902_01225 59-187 DVRSLSLRSAERLVNQHAAMRKGRSLSGSTCAEQYGTHRCGHTCTDGSDIRFHELHGIIDGKTGRNLSARGVQIQRNIGSAVVRSEEKHLGLNDIGHIIINRNAQEYDTVHHQTAEHIHGSDIELALLD
GUT_GENOME075971_01518 40-176 MDLDIGGLAGKAATLPADQRLMDQNFRVGQCKTLSLGAACQQEGTHGSCHAHADGGNIALDILHGIVDGHAGADGAAGAVDIQADVLIRVLPFQIQQLCHNKAGGGIIDILAQNDDAIVQKTGENIIGTLAMGRLFH
GUT_GENOME029487_01272 14-153 QDLAGMDVDVGGLSLESAEGLVDHHLAVGQGVAHTLLARAEEEGSHAGGAAHAHGGDGRGDVVHGIVDGHARRDGAAGAVDVEINLFFRVLRFEEKQLGADEGRHRVVDLGAEEHDAVFQQAGKNIISAFGAARRFDDDG
GUT_GENOME221158_00072 19-153 LGVQLHVGRLTARAAAGLVNHHFAVGQRKAHPGFARHQQDRANRGGNAEADGANLRLAVAHRVKDGHAVGHASARAVDVHGDIGVRVFHFQKEQLGDDAVGQRLVHLAAQKDHAILQQAGVNVIRTFAVRRLFND
GUT_GENOME067739_01162 7-122 CLPLCAAGGLMYHDFTVGQGAALTLCARSKQECAHGSGHSDADRGNFAANILHGIVNCHAGGNAAARTVDVKINVLVGVIRFKEEHLRNDEVGGVVIHNAAQKDDAVLQQTGINVV
GUT_GENOME113468_01915 1-117 MDHDPCVGQGEPFAFDARAEQDRRHACGLTDAVGFDIAGNYFHAVENRESGGDDSARTVDIEINIFFRVFRLQVEQFRNNEICHGVVNRSAEKDDPVFQQTGINIISALPLAGLFDD
GUT_GENOME165900_01501 291-429 TEDFTGMDFDFRSLSLHAAQRLVDHNVGMGQCIPLSCGTGSQKDCCHAGAGADADGRYIRPDILHGVVNSQSCGNYAAGAVDVKVDILGRIFCFQEEKLGNDDIGHVVFDLAVQENDTVFQKTGVNIIGPFSLCCLFNY
GUT_GENOME282976_01467 8-134 LAARASERLVNHDLTVGQRAALALHARCKQERTHAGSHAKADGGNIALDVLHGVINRHTRRNGSTGTVDVKADVLVRIIRFQKEKLCHNQIGLLIGDLAAKKDDPVFQQARIDIVRTFAAVGLFHNH
GUT_GENOME064609_01796 25-147 GGSLGTSGNLVNHYVGVRQAEAFAGSSRSQKDGTHGGRYAQAICIYVTRDELHRIVNRETRRDGSSGGVDVDVDVFFRVSHLKEQKLGNDGIRHIVVNSGADEDNAVFQETGVNVKGTFAAAV
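
The rows of the sequence list can be split into the three columns named in the header (Protein, Range, AA) with a short script. The following is a minimal sketch, not part of the original record. It assumes that every protein identifier has the form GUT_GENOME<digits>_<five digits>, that the range is written start-end, and that the sequence uses only uppercase letters. The names ROW, Hit, parse_row and span_matches are hypothetical. Whitespace between the columns is treated as optional, so rows with the columns run together are handled as well.

```python
import re
from typing import NamedTuple, Optional

# Assumed row layout: <protein id><start>-<end><sequence>, with optional
# whitespace between the columns. The five-digit gene suffix is an assumption
# based on the identifiers in this list; it is what lets the identifier be
# separated from the range when the columns are run together.
ROW = re.compile(r"^(GUT_GENOME\d+_\d{5})\s*(\d+)-(\d+)\s*([A-Z]+)$")


class Hit(NamedTuple):
    protein: str  # protein identifier
    start: int    # first residue of the matched range (1-based)
    end: int      # last residue of the matched range (inclusive)
    seq: str      # amino-acid sequence of the matched range


def parse_row(line: str) -> Optional[Hit]:
    """Parse one row of the sequence list; return None for non-data lines."""
    match = ROW.match(line.strip())
    if match is None:
        return None
    protein, start, end, seq = match.groups()
    return Hit(protein, int(start), int(end), seq)


def span_matches(hit: Hit) -> bool:
    """Check that the sequence length agrees with the inclusive range."""
    return hit.end - hit.start + 1 == len(hit.seq)


if __name__ == "__main__":
    row = ("GUT_GENOME213297_00952 1-72 "
           "MNHDLTVRESAAFADLTSSQEKSSHTGGHANAGGSYVTTDILHRIIDRHPSRDRAAGAVDIETDILIRILSL")
    hit = parse_row(row)
    if hit is not None:
        print(hit.protein, hit.start, hit.end, len(hit.seq), span_matches(hit))
```

The span_matches check is a cheap sanity test on the column split: if the identifier and range were separated at the wrong digit, the range length would usually disagree with the sequence length.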