UHGP-MC 119663


Information


Number of sequences (UHGP-50):
62
Average sequence length:
83±13 aa
Average transmembrane regions:
0.02
Low complexity (%):
2.18
Coiled coils (%):
0.5
Disordered domains (%):
1.75

Pfam dominant architecture:
PF03483
Pfam % dominant architecture:
8710
Pfam overlap:
0.53
Pfam overlap type:
reduced

Downloads

Seeds:
MC119663.fasta
Seeds (0.60 cdhit):
MC119663_cdhit.fasta
MSA:
MC119663_msa.fasta
HMM model:
MC119663.hmm

Sequences list (filtered 60 P.I.)

Protein Range AA
GUT_GENOME199330_0056151-131PLDEIMKKEEIANTRLCYKAVGKDPHRYRNAAEAMLRRVVKGKGLYRINNVVDINNLVSITTGFSISTFDVSSINPPLELT
GUT_GENOME096049_0217156-138ISANPLIHEWRQAFKKFKTKKGARFAVENLLKRAKKGNPVRSIDPLVDLYNAISLRTGFPIGALDLDKIKGGISLEVAKGGES
GUT_GENOME235546_0100690-198DEINSLGEKYKETLTTESLKEISGIAATRKVYRACGKDPSRYRPASEALIRRMLQGKELYQRDTLVDLVNLASIAYGYSIGGFDADKFEGDTLTLGVGKADEPYEGIGR
GUT_GENOME221971_0099355-138IKEIPAIAATRSCYKRTGKDPNRYRPSAEQLSRRVVRGLGLYYINALVDIGNLLSLRTGYSIGVFDRDKVGSDIVFGVGYADEP
GUT_GENOME044035_0128083-147SSEALIRRVRRGDGLYHINSVVDVNNLLSIKSGLSVGSYDLDKLHGAITLRKAEAGEGYTGIGKD
GUT_GENOME181759_0220752-143IRQDPRIMATKQIYKQAGKDPARFRPSSDSLWRRVAKGKGIYQINALVDLNNYLSLRFKFPFGSYDVDHIDGDVQLTTGSAGQQYAGIGKSM
GUT_GENOME236405_0168052-143SVSENPHVSATRKAYKALGKDPSKYRNSAEAMLRRIAKGNGLYRINNAVDINNIVSISSGYSLGSYDLSAVRGEIVWKRAPDGEKYSGIGKD
GUT_GENOME217842_0010574-165INKRPGIAATRQAYKACGKDPNRYRPSQEQLCRRIVSGKELYFVNNVVDVFNAVSVLSGYSIGAFDADKIQGDKLALGVGRSGEPYVGIGRG
GUT_GENOME142588_0090861-144IKEWRQVFKQTGKDPNRYRHSAEALHRRIKKQNFLSPVNSAIDINNFFSIKYQIPIGLYDTDKLSGGVTIRIGTETDEYVGLNG
GUT_GENOME096393_0316352-135LSLEEFSSNPVISVWREAFQKFKTKKGARCSIEALLKRVKNGNHIGTINPLVDIYNSISLRYGLPCGGEDIDTFVGDIRLTQAN
GUT_GENOME032719_003811534-1617MEDIKHQPVIFATREAYKRCGKDPGRYRPSAEALRRRLMRGIPLYQIDTLVDLINLVSLRTGHSIGGFDADKIQGTHLELGIGK
GUT_GENOME009824_0050559-154DLKLKESRDGYKILGKDPSHTRLASEALLRRVVKGEKLYRLGDVIDLGNVLSLKTKRSVCVVDADKIVGDVKIRIGNKEDNYEGIGRGMINVTSIP
GUT_GENOME212130_0293762-131PRILAVRSMYKKMAFDPSRYRPASEALIRRVLQNKGVYYVNSAVDVNNYCSIKFLLPFGLYDAAAIEGDV
GUT_GENOME036765_0085655-138IKLKNILSSRNAYKKLGKDPSRYRLSSESLVKRVVKGNDLYIVNNIVDINNLISLHTCYSVGTYDLDKVKGSITFTVGEENERY
GUT_GENOME176879_0072675-139IASWREAFRGFGAKPQRTRNCLEALTRRAEKGLPRVNALTDVYNAISVLHQIPLGGEDLHRYNGP
GUT_GENOME096374_0271059-123EGIKEWRALWKKFGADPGRYRPAMEAMMRRIRNQNYLQPLNSGVDLNNFFSLQYEIPVGLYDLSH
GUT_GENOME127701_0116053-126EIAALPRIRDAREGYRAAGKDPHRYRSSCEAMLRRITSGKGLYRINNVVDCNNIVSVESGFSLGTYDTARLTGG
GUT_GENOME265968_0139456-161TKLENIIAGREAYKALGKSPSKYRTSSEALLRRIIQNKGLYFVNNVVDINNILSIRTGCSCGSYDLQKIIFPLEFSVGATGETYKGIGKDMINIENLPVFKDEEGC
GUT_GENOME036603_01372167-281TLKENRIIQGFYDLHKEVGIPKRKILPASENLLKNLLKKQEFHKINSLVDIYNLISMDTKLALGAHDLAKTEGNISLKLTTGNENYIPLGSEEAKVVKAGIYSYIDDANDIICFS
GUT_GENOME096393_0066662-142IREWRQIFKAVGMDPSRYRPSSEALMRRVLQNKPLHWIHTGVDVNTFLSLQFGLPAGLYNRDCIEGDVTLRLGRENETYEG
GUT_GENOME238508_0062655-154IKERAGIYATRLAYKAFGKDPSRYRPACEQLARRILQGKELYHINTVVDVVNLLSLYTGYSTAALDGACIAGNNVELGIGRANEPYQAIGRGLLNIENLP
GUT_GENOME140601_0277060-128GIREWRAAFKRLGIDPSRYRPSSEALLRRLIQGNPFHLVNSAVDVNNFLSIHYALPYGIYDLSKLSGQV
GUT_GENOME195525_0000843-149NEVAPYLKTMLETTPLAQIPNLDESRKAYKAFGKDPGRFRVSSESLYRRVRQGKALYQINSVVDANNLVSLETGFSLGSYDTARIGADIVFRLGKAGEVYPGIGKDD
GUT_GENOME047484_0076641-138ANMQTSDVAKMPAIAATRKAYSTLGKSPSRYRNAAEAMIRRIVKKQGLYQINNVVDLNNLMSIDSGISIGSYILDSIEGDIIYKQADTDAYYAGIGKD
GUT_GENOME099510_0102235-127LLDEIEEVSQQLISSMQIEDIKSREHIAQTRACCKALGKDVKRYRNSAEAMNRRILQGKGLYHINNVVEVNNLISVKTGYSLGTYDLDCLQGE
GUT_GENOME114810_0021752-165KDLSTIPLAQMPGIGEARSAFKTFGTDPGRYRVSSEALYRRLRQGKDIYRINSLVDTNNVLSLQCGHSCGIYDAAAIAGNVVLRLGLEGETYQGLGKGSLPLQNMPLLSDDAGP
GUT_GENOME000677_0169558-130EPRIVAARSGYKALGKDPSRYRLSTEALLRRLIKGSGLYFVNNAVDIGNVLSVKTQRSVAVLDLDKIQGDVLI
GUT_GENOME256840_0162657-147TLPGIEAARAGYRACGKDPHRYRNAAEALLRRTVQGKPLPAINSAVDAGNLVSLVCGCPLGMYDLAAVRGRVIWRIAAEGERYRGIGRDEL
GUT_GENOME258629_0025757-130ADYPGITAWRKIFKKTGADPSKYRPSAEALYRRIKKEPSAPTGNSSIALNNFFSLQYEIPLGIYDVKAIKGDVH
GUT_GENOME017531_0088755-127NVRELPSVAAYRAAFTKLGMNPNKFMCSIEALMKRVQKSGHLPHINPVVDLGNAFSLRHQLPMGAHDVDRLED
GUT_GENOME138463_0122465-145EPVRAWREAYRRFKTKKGARCSIENLLKRVIKGRPVGPITPSVDVYNAVSLKYALPVGGEDIDSFAGDLRLTVTQGGDAFV
GUT_GENOME145988_0015954-130EELSKHPRIANWRKIYSDMGVKPSSYRCSLEALLRRVIKGEGLWNVSSVVDCYNCVSVMTLLPMGAYDAHKLRGDLT
GUT_GENOME008420_0034855-127VKQEKELQPYRSAFTQLGINPNKYMSSIEALLTRIAKKKGMPHINPVVDLGNAISLKYYLPVGAHDLDTMDGE