UHGP-MC 19559
Information
- Number of sequences (UHGP-50): 136
- Average sequence length: 76±5 aa
- Average transmembrane regions: 0.03
- Low complexity (%): 3.28
- Coiled coils (%): 2.28
- Disordered domains (%): 2.59
- Pfam dominant architecture: PF07866
- Pfam % dominant architecture: 74
- Pfam overlap: 0.01
- Pfam overlap type: shifted
Downloads
- Seeds: MC19559.fasta
- Seeds (0.60 cdhit): MC19559_cdhit.fasta
- MSA: MC19559_msa.fasta
- HMM model: MC19559.hmm
Sequence list (filtered at 60% identity)
Protein | Range | AA |
---|---|---|
GUT_GENOME140236_01417 | 144-214 | DPVFLQFLDTEDFDLRMECLKQLENRITQQELDSMYLVLDMKPERGSLQEQLYAVRRFLTMQKRYDGSRLR |
GUT_GENOME017099_00261 | 183-253 | ARRGMLQILDAESFREKRQLLVGLRPYIDKLLLHNIAAALDVVLEEGTLDDQFESLLHCVDAHARYEGGRL |
GUT_GENOME040713_00441 | 89-169 | ENSQKEESRAETIFFQFLDAESSSEKIELLEKLKPYLDVRMVNNIAASMDLPSDEDDVEAQYTFIMQNLKQRSKFECSRFR |
GUT_GENOME155739_00467 | 71-151 | FEKIEEPKETNPLLLRFLDAETYEDKLELFQSWEAYADDQLLESIAVSLDIVLGKGSTKEKYRQVLNCLKTMEHFETNRFR |
GUT_GENOME148381_01845 | 109-177 | AIQKMLAFLDAESCHEKIRILEEMQDDLTEHILNNIAVSLDLSLEDDEDGFERIMAELRMREKYESDRG |
GUT_GENOME270891_01096 | 117-199 | QENVQKPEAQDEDSFLVRFLDASEYKDKLDILTLNRARLTEEIVDIMAESMDTCVPEGDIESKYNSLRKFIMAHVKYETNRLR |
GUT_GENOME236871_00434 | 118-186 | GFLAFLNAATYEDKIMILIKNKETYTRDMLETMAASLDFELPPGDINDGIEYIKKSLEVKDRLEGTHLR |
GUT_GENOME094734_00551 | 130-201 | EQQDLLIRFLDAESNEERLELLRRYEDRISETVLDSIGLSMDFPLNSEDRGRKLRELEGFIKTKMKYEKKRR |
GUT_GENOME258411_00158 | 82-160 | EDESEGEIDPNLLAFLNAGNFHAKVELLESMQDKITDSLIDSFAAASDLEIKPGNVYDRYEELKNCLSTHARFECDRLR |
GUT_GENOME233270_00399 | 146-218 | LKPGVEEFLDADNVIDKANILENIKNIIDQDDVTIMASVMDIEIDESLPLGERIRQLMHCLDTRGRFETNRLR |
GUT_GENOME140278_00364 | 62-136 | ENGHTSDLLLRFLEADSNEEKLSILQKNRPEVTEDLLEAAAQSMDYALSGESEEMQFLDFENYLRTKIKYERKRR |
GUT_GENOME012924_01763 | 111-190 | ETSESEEQANPFLLRFLDADTYEEKYKILNEMENDITDRLINDFSVVLDVVIPEGDLTGRYIQLKQCVATMCRYETTRLR |
GUT_GENOME279992_01406 | 117-186 | QKKVMQFLDEDSYVIKLEILSELRGKLNRSMMETLAISLDFDLGEGSLEDQYFSLTQYLKTKIRYEQPRR |
GUT_GENOME065600_03020 | 111-185 | ESPNPDLIRFLEAEEMDEKLAALAQLEKSASPADMRGAAAALEIELGEGDVPEQIENIRKFLRIQAKYDGSRLRD |
GUT_GENOME018982_01493 | 18-92 | EKPPQFDNLMAFLDAETYEEKLEILYRMQLDLNDYLIDTMAVAVDVVIPDGDIDDRYRQLKGCLEARRKYEISRF |
GUT_GENOME255913_00390 | 156-233 | ELAGVNPALLAFLDTDDFEEKYKILCGLEQRGEVTNHLIDQFAVTLDLVIPEGDADDRFVQLKNCVRTRSHFETNRLR |
GUT_GENOME267520_00908 | 78-144 | SDLVRFLDADTYREKMKILESMKDDLNEHILNNMAVSLDLSLEDGVDGYSFLMSELKIRSRFEGKRG |
GUT_GENOME049637_00348 | 167-235 | GMMQLLDAESFHQKREIFKGLKKYLTKNLLMNIAVALDIVLEDGTAEEQYDSILHCLDALEHYEGGRLR |
GUT_GENOME249907_01362 | 127-202 | ELQGVNPHFLDFLDADTYGRKYDIVTEMEEELDDHLINQMAASIDEIIEDGRLEDRILQLEACLRTKARYELNRAR |
GUT_GENOME007499_01947 | 125-196 | NAHEMLMEFLDADTYAQKLNILRGMKKHIDNKTIHDMAVSLDIVLNEKSLEEQIGDIENCLKTYARFECNRL |
GUT_GENOME257723_00794 | 108-196 | VPDENTPLHPLFFEFLDADDIPAKLAAVARMINKDDQGQEHPLVGQRELDAMFLALDMKPESGDAARLLASVRKHLETQLRYDGSRLRR |
GUT_GENOME152070_01050 | 376-457 | EPEEEKPMNPHLLEFFDAMDVRDYDKMLEALARLSQKAGQKEVDDICMVLDIRPQGDSAAEQISGIRQHVRMLKKFDGERLR |
GUT_GENOME024262_00517 | 113-188 | EKPLDPFLERFLDARTTEEKLTALAEMRGHVTDEMIDTMALACDVEVGPGALSVRYDDLRDCLLTIEKYELERGRL |
GUT_GENOME070804_00004 | 227-300 | QPNPFLIEFLDAEDIEDQLAILRKMEGCVGKRELDSICVYLDISTRYGSLEEQTEGIRKYLKMQLKYDASRLRR |
GUT_GENOME014763_01242 | 140-216 | QDEQVNPLLMEFLDAEDYQQKLNTLTGMRSKLDDKLIDAMAASMEIEVPEGPIDKRYASLRSCILTHAKFECVRLRN |
GUT_GENOME237868_00565 | 106-192 | QKESTPETEESASGEPLGIDAVLDARSIAKKKELLLSMRKTITSSEIDLAAMSIGVEIEDGPVEKRFEELLECLDTIGKYEIGRDRL |
GUT_GENOME237871_01093 | 48-126 | EQDPAVRDMLAFLDTDDFDEKYMIIEHMATNDELNDTYIDNMAASIDIVIDDGPLDGRIKELLKCLDTRKRYENTRLRG |
GUT_GENOME096230_01937 | 169-249 | REDEEASAPINPALLSFIEAETYEGRLEALHNMRGRVSQDDLGIIYVALDMKKVEGSVDAQLDAVEQILSMQRHYDGGHLR |
GUT_GENOME149065_01287 | 509-583 | EQELSPLLLPFVEADSIDRQLELLMAMTGKIGQRELDILYVALDLPQHGGSVEEQIHAIRQYLEMQKRFDGGRLR |
GUT_GENOME280181_01575 | 147-218 | RRGFLALLDAETYREKRQILTGMRDYINELYLNNLAAALDIVLEEGSSQEHYDTLLHCLETYEKYEGGRLRR |
GUT_GENOME142625_02831 | 87-170 | DKGEAEQEPKEASPFLMEFLEAETYEDKLALITEKGRFASNQAVEMICEIMEVPVLGETAEEKLKDLMRNLEMQIKYDGSRLRR |
GUT_GENOME059640_00376 | 145-219 | PADGVDTRLLDFLDAPTYQDKINYLNLIRNNIDDDLINNIAAAMDITINDGPIDARFFSLRSCLMTKAKYECTRR |
GUT_GENOME172958_00264 | 86-160 | PKEETSLIFDFLELTEDEARLQFLLRNKTRIDEAFVSAACESLEYVPGEGTLEEVYQELMKLLKTRMRYEGRRLR |
GUT_GENOME123864_00373 | 120-194 | GETVDPGLLAFLDAESMEEKYDIIRMMRSEMTDRLINSFAVSLDVVIPEGSTDERYEALKNCVRTRMRYEGDRLR |
GUT_GENOME001279_00258 | 150-222 | INPILLEFLDADTLEEKMHIMTFYRNQMDEALLNSIAISLDLVVDKKGLQETYDEIMNCLSMMKHFECTNRFR |
GUT_GENOME155726_01365 | 121-194 | NPYLMEFLDALDSGNYDRKLACLARLRDHVTQKELDDIYMVMDMQPAAGDTAEQLAVIRRQLLLLQKFDGERLR |
GUT_GENOME000232_01686 | 105-181 | EEACPINNWLIRFLDAESFDEKYEIVTEMKSDINDKLIDDLAVSLDVVIPEGNLQNRYQQLKNCIRTRQKFEGGRIS |
GUT_GENOME035377_00830 | 144-220 | EEPELHPLVAEFLDQETTEGRLEVLQRMSGKVAKRDLDSLCFCLDMNPVGGSVEEELDAIRQCIRMQQRYDAPRLRY |
GUT_GENOME117513_01134 | 113-197 | NDSMPQEPEETNGVDPRLLAILDADTYNEKYKLLSSMQGDMTDRLINDLAVALDIAVDDGPVDQRYEKLKSALAMLCKYEVNRLR |
GUT_GENOME112419_00454 | 129-200 | ETAGVDPLLIQFLDADTYTDKLKVFLEMRDTINEVLLTDIAASLDISTGTGSLEEQYASVQYALHTLAKYER |
GUT_GENOME073780_00092 | 104-182 | KEESTEQINPVLEAFLDAETYVKKLEVLQENSGTITDKMIDAMAASMDIEVEQADIATRIKSLKNCINAHAKYECTRLR |
GUT_GENOME103760_01083 | 90-157 | IMEFLDLDTKEEKVEFLQRERMNMTEDFLSAAAMSLDYVENSEDLDLRYEGLMHYLKTLIRFENRRGR |
GUT_GENOME083849_01900 | 177-245 | GFMIFLDADSYHDKRLIFLSLRQYLNDTMLNNIAVTLDLVLDEGSPEQHFDTILNCLETHEHYECNRLR |
GUT_GENOME002161_01763 | 89-163 | LQDDMTMILEFLDLEKNEEKLEYLKKHRLVLSERFLTAAAESLEYAEREESLEERYAGLVRFLQMKSRYESGRLR |
GUT_GENOME243097_00723 | 113-188 | QEAPSAFLLAFLDAKTTEEQRELLLAQLGSVTQRDLNSVCTACGISERAGDIESQARSIIRALALKARYESDRHRG |
GUT_GENOME000963_00451 | 91-167 | EGKKAEPSLIMQFLDCRSNEEKLRFLYQHRTEVKESFLTTAAESLGSVLTEKTTELRYEELMSYLRMKIKYETGRLR |
GUT_GENOME000729_00736 | 103-188 | LQSQADLSPDRCLHENKNLLAFLDAGTYHEKLEVLEERKDHFSPEELLAICEIMEIGRPDSEPEEKYYAVKRYLELQHKYEGARLR |
GUT_GENOME019248_01241 | 169-247 | TNAEEEPQEESKLIRFLDAYDYKEKLDILTSMRSELNDGLIDIMAESIEVAVPEGDITDRYNSLRKCLMAHTKYEGLRL |
GUT_GENOME000582_01089 | 178-249 | VSPRFLDFLEARTFESKAVILESMREELDDELIDGLALAVDVEIPEGPLEGRYRQLRQCLQTMARYEDERLR |
GUT_GENOME153233_02218 | 95-164 | ALILKFLELEDNGDRIQFLQRHQTEIDGRFLTAAAESLEFAENGESVEERLAALMRFLRTKMKYEGRRLR |
GUT_GENOME063416_00083 | 149-222 | ESADKRLIEFLDADTFEEKRRVLINIKDGITDRLIDDMAAAIDVTVDEGDIDTRFMSLLNCINTRAKYEVNRFR |
GUT_GENOME191482_02522 | 113-190 | EEQRPNPYLMAFLEADSTAGQLEALKRMEGRVGQEEVDCLRVVLEMGPGGGDIAKQLDDIRKNLEMQRRYDGSRLRDS |
GUT_GENOME239523_01426 | 120-205 | ADSVPISTAEEEPTLDPLVLEFLDADSYEQKLNILAGLHHRITNEMITTMAISCDIEVNEGDLEERYEELKNCLLTMEKFECNRLR |
GUT_GENOME050291_01064 | 140-215 | DGDKVNPYLMGFFDCDTSASQIEYLNSIRGKIDDRLINDIAVSLDLTVDDGNIDDRIDSLIRCLKMKARFECGRLR |
GUT_GENOME143400_02428 | 96-169 | EASGGLLYDFLDAPTLKEKLKVLEDRKRTPDDLTITNMAISVDVVVEEGTPEERLEELIDCLKTMARFECTRLR |
GUT_GENOME000259_01880 | 120-194 | EEGTVNPRLLEFLDASTNQEKLDILQKIRREIDDKLMDDLAMSMDLTLNGKDVQQKYKELQNCLLTYIKYESVRR |
GUT_GENOME125804_01477 | 156-230 | ESVINPWLLKFLDADSMEQKYQIVCDIKNDITDRLIDDLAVVVDVVIPEGKMGDRYEQLKYCIRTRQKYEKSSLW |
GUT_GENOME056785_01410 | 127-197 | LDPEVEAFLDAESVYEKLNILAGLRHRITDDILDIMAAASDIELNKGSTQERYAELKNCLLMTEKYERKRI |
GUT_GENOME070914_01239 | 107-188 | PVMTDEEELEGCNPDLLAFLEADTYEEKRDILVNIRKRIDDRLINDIAASLDVTVDDGDLDMRYRSIMNCLDTMTKFECNRF |
GUT_GENOME139725_00408 | 122-203 | EKTIENADEELPNPDLLLFLDADTFEEKKNLLVSMQNRMTDELINSIAASLDVSVDEGDLETRFKSLYNCVSTMSRYEVNNR |
GUT_GENOME265999_01572 | 93-171 | VQEKEISIDPMVLEFLDADTYEKRLEILMCLKPRITDEMITTMAVACDVEVPDGGIEERFNGLKSCLSMLDKYECSRLR |
GUT_GENOME149640_00213 | 131-206 | VLDKFLDARGYEEKRGYEEKLDVLIRYKSRFTNPMIDIMAESLEIVVPEGELSGRIDSLRKVLQAHTKYEGSHLRP |
GUT_GENOME007135_00476 | 63-136 | EDPEQGLLYAFLNADSNEEKLILLQQNRSEMTEAVLETVAQSLDYTLDSSDPDRMFLDLEQCLRTKIKYERRRI |
GUT_GENOME067046_01321 | 103-169 | LERFFDAGTYKEKIEVLDSLEDTVTNSMLDSMAVSMDINLPEGQVYERFMALKRTLQMMRQYEGHRE |
GUT_GENOME174924_00687 | 6-77 | MREILFDILDAETSKEKIEKMRFYRDELDERTLGNIAAAFDVSGDCQSDDELFEQIVQDLTIKARYETDRLR |
GUT_GENOME172932_01242 | 120-194 | QSGVNSVLLEFLDARSYNDKLEILLNRKKYIDERLLNDMAVSIDCTIEEGTMEERIKGLIFALETLARFENKRLR |
GUT_GENOME285247_01881 | 88-161 | EDDRSLIWEFLDLSSSKEKMDFLQKKKSEITEEFIGIAAQSLDFVENDGTVEERYEAVMQYLRTVMRYESGRLR |
GUT_GENOME115589_00069 | 99-165 | ETTEELPDGISPKLIAFLDADSDEERYTILNEMEDIVDDHMIDTMAVVSDLVIEDGPISQRFQELKP |
GUT_GENOME001870_00114 | 115-189 | EGDPSFLMDFLDAGTYEEKLCILKERGGHASQDQLDMICEVLEVTAGKDGERTSLETIQGYLETQIRYDGSRLRR |
GUT_GENOME133324_00928 | 78-157 | DTKEDLKGVNPVLMGFFDRDTCTDKIEYIVQVRDKLDDRLVSDIAASMDITIEEGSLDDKIDSLLICLRTKAKYECNRFR |
GUT_GENOME017990_01652 | 184-255 | AKRGFMMILDAGTSREKRALVIGLQKVLTPLQMSNLAVALDIVLPDGTRQEKLDSLIHCLGTLEKYECRRLR |
GUT_GENOME061534_01052 | 57-138 | ETQEINSLHPQVFEFLDLDTMEEKSEFLHYKLQKENITEKDLDLMAVSVNITLQEGEISDKIEDLIGCVDQLAQWEANGRRR |
GUT_GENOME041603_02310 | 98-175 | EKTEEKPHPLLMEFLEAEDLGQQIESLKRMKGKIGQKELDSIYVVLDMIPSQGEPDIQLACLIKALETRKKYDGSRLR |
GUT_GENOME269664_00241 | 103-178 | EQRPEEAENMEMLFQFLDTCDLEERLSLLMQYRSQWTESMLDSMGVAMDYVLNGKDRDEKYYELDKMIRTKLQYEK |
GUT_GENOME082534_00261 | 158-238 | ENQADESKAQINPKVLEFLDTEDFDERYNILVSLRDELDDQMVNILAVALDVVIPEGRIEERYDALKNCLRTRQRYESTRL |
GUT_GENOME008455_01270 | 137-208 | MNLLLYAFLETDTVHDKLKLIKNSDNRKYIDNKCVDDMAACMDFAIDEGDIDERIRQLITCLDTMKRFEVLR |
GUT_GENOME023484_00943 | 98-177 | PVEEFFEEDNPLLEFLDASTHEERLNVLLKYKESISETMLESMSLSMDCVLTGEGPEEKYEELVKILRTKIQYERNSRLR |
GUT_GENOME008152_02260 | 154-227 | GAVHPLLMKFLDAETYDQRLGIFDEMEGIADMHMLNAVAASLDVTLGDTTIEEAFSLIRDNLATQKKYECSRWR |
GUT_GENOME147168_00726 | 8-83 | GEVNPILLEFLDTDSFEEKYKILVATPIMDFDNLLIDNMASSIDVVVEDGDIESRVQDLKNCVRTRSKYETLRFRR |
GUT_GENOME045468_00426 | 64-137 | DEDMRLIEDFLDIVENEQKLYFLQKHKKDITEKFMSIAAQSMDFAEKETSMEMRYQELLHFIRMKMKYEGGRLH |
GUT_GENOME031788_01804 | 130-214 | EEVRTQEPVQKLYEPSPAFLKFLDADTYEERMECLSAMARTAQQRDLDSLYLVLDMKPETGTIPEQVQAIGRFLTLQNRFDGKRL |
GUT_GENOME252525_00449 | 143-221 | SENPVEPQPDPGLLAFLDADSYEEKLEVFAALEGKADLHMLNAIAASLDLELSEGSLEEQYDTLKSCLMTLERYECNRL |
GUT_GENOME188395_02085 | 117-194 | ESESVDTQQALLDFLDARSYQDKLEQLDLLKKHVDSHIVNSMAISVDIVLATDTVEEQLEEIRNCLLTHIRFEDRRLR |
GUT_GENOME252036_00004 | 119-190 | DITELLMDFYDAATYEEKYNILTAMRDGITNVMVDNMAVVLDVVIPEGELDKRYEELRRCLKTHRKFETTRS |
GUT_GENOME214562_01322 | 8-77 | NLLIKILDADTVREKLILIKDNKEKLDARTLGNIAVVFDLVPDTDHPEELFVQIVQYLETRARFETERLR |
GUT_GENOME096551_01013 | 135-210 | EYGMVNKDLLAFLDAESYGEKLEILFAIRNKIDDRLMTDIEMSLDLSGHEGTIEDRFDLVKNNLQTLSKFESKRLR |
GUT_GENOME031499_01987 | 150-232 | ELHDVSDHQGVNPKLMEFLDAETMEEKYNVLVSMRDDITDRLIDDMAVVVDVVVPEGDLMTRYDDLKFAIRTRQRYEFSNRLR |
GUT_GENOME224854_01835 | 142-214 | EANLDLMSFLDAKDCEEKLEILYSIRKNIDERTMGNIEIALDLPVFEGTIEERVDIVKNKLQMMAKYENRRLR |
GUT_GENOME091089_01756 | 140-220 | MEESREEGGLDPRLLRFLDAETYEEKLDLLIRMHDGITDDLLTTMAVSLDIDLEEGELEERYQTLKNCILTLEKYECNRLR |
GUT_GENOME140242_00506 | 4-75 | ELSLILRFLDLEDPKKQAEFLTAHRGEVTDKFLTSAAVSLDYPETGKDLETRCSDLIHYLNTRVKYEKKRVL |
GUT_GENOME172393_00345 | 163-245 | SDNSCMTQEPQVSPKLMEFLEAESFEERYNILVTMADDITDSMIDTMAVVMDTVIPEGPIEKRFEDLKYTIRTRQQYEFARLC |
GUT_GENOME240528_01315 | 76-155 | ETSKEEMEKGQSIIFAFLELSDAEEKIRFMQRHREDMTEEVLSVIAESLEFVESQKDTEFRYEAILDYLHTVARYEGRRG |
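As a quick consistency check on the table above, each row's Range can be compared with the length of its sequence. The following is a minimal sketch, assuming the rows are available as plain text in the `Protein | Range | AA |` layout shown here; the helper names `parse_row` and `range_matches` are hypothetical and not part of any UHGP tooling. Two rows from the table are used as sample input.

```python
import re

# One data row looks like: "ID | start-end | SEQUENCE |"
# (layout assumed from the table above; the trailing pipe is optional).
ROW = re.compile(r"^\s*(\S+)\s*\|\s*(\d+)-(\d+)\s*\|\s*([A-Z]+)\s*\|?\s*$")


def parse_row(line):
    """Return (protein_id, start, end, sequence), or None for non-data lines."""
    m = ROW.match(line)
    if m is None:
        return None
    protein_id, start, end, seq = m.groups()
    return protein_id, int(start), int(end), seq


def range_matches(row):
    """True if the inclusive range start-end spans exactly len(sequence) residues."""
    _, start, end, seq = row
    return end - start + 1 == len(seq)


# Sample input: header, separator, and two rows copied from the table.
table_text = """\
Protein | Range | AA |
---|---|---|
GUT_GENOME214562_01322 | 8-77 | NLLIKILDADTVREKLILIKDNKEKLDARTLGNIAVVFDLVPDTDHPEELFVQIVQYLETRARFETERLR |
GUT_GENOME140242_00506 | 4-75 | ELSLILRFLDLEDPKKQAEFLTAHRGEVTDKFLTSAAVSLDYPETGKDLETRCSDLIHYLNTRVKYEKKRVL |
"""

# Header and separator lines do not match the pattern and are skipped.
rows = [r for r in map(parse_row, table_text.splitlines()) if r is not None]

for row in rows:
    protein_id, start, end, seq = row
    status = "ok" if range_matches(row) else "MISMATCH"
    print(f"{protein_id}\t{start}-{end}\tlen={len(seq)}\t{status}")
```

Rows flagged as a mismatch would be worth checking against the downloadable seed FASTA before any downstream use, since a range and sequence that disagree may indicate truncation during export.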