UHGP-MC 124707
Information
- Number of sequences (UHGP-50):
- 146
- Average sequence length:
- 72±8 aa
- Average transmembrane regions:
- 0.03
- Low complexity (%):
- 2.68
- Coiled coils (%):
- 0
- Disordered domains (%):
- 2.52
- Pfam dominant architecture:
- PF01264
- Pfam % dominant architecture:
- 7192
- Pfam overlap:
- 0.21
- Pfam overlap type:
- reduced
Downloads
- Seeds:
- MC124707.fasta
- Seeds (0.60 cdhit):
- MC124707_cdhit.fasta
- MSA:
- MC124707_msa.fasta
- HMM model:
- MC124707.hmm
Sequences list (filtered 60 P.I.)
Protein | Range | AA |
---|---|---|
GUT_GENOME071730_00265 | 262-339 | MLDEIKRAKNDGDSVGATVECVTTGLKVGIGGALFGGLEGKIASVVYAIPAVKAVEFGYGERFASARGSFVNDAMRFD |
GUT_GENOME214094_02780 | 225-294 | MKRDILKIKENGETAGGVLECIARGVPAGLGEPVFDKMNAMIAHAVCSIGAIKGIEFGAGFQVADMLGSE |
GUT_GENOME157401_00866 | 202-274 | MEKLILEVKTDGDTIGGVITCVVKGVPAGLGEPAFSKLHADLGAAMLGINAVKGFEYGEGFAGVDQRGSQQND |
GUT_GENOME243938_00480 | 195-270 | FKSEILKAKNSHNSVGASVITIIKNSPIGLGEVLYDKFDARLAAAMMGINAVKAVEIGNGIKSAKIYGDENNDEIS |
GUT_GENOME253776_00889 | 346-407 | MRSCIRACAADGDSIGGTIECKVTGLPVGVGSPMFDGIENALSRAVFAVPAVKGIEFGVGFD |
GUT_GENOME237843_00648 | 189-261 | MIREIEKARKSNDSVGAVIECVVFGAKKGLGNDYFMGLEGKMASLLYAIPAVKGVEFGLGFNLAKTRGSRSND |
GUT_GENOME181148_01501 | 195-263 | EIRMTEAILAARKSGDSIGAVIECIIEGLPCGLGGPLFEGFDGKLAFGLFGIPGVKGVEFGRGFEAARM |
GUT_GENOME053867_00113 | 197-267 | MIAAITDAKAKGDTLGGVVSCVIKGVPAGMGEPVFSRLNAQLSMAMMSINAAKGVEFGLGFDFVNHRGSEV |
GUT_GENOME255556_01104 | 194-269 | MENEILAAKSEADSVGGAVEAVVIGLKPGMGSPFFESVESRIAQILFSVPAVKAVEFGIGTSFSQMRGSQANDPFY |
GUT_GENOME275813_00356 | 202-277 | DDIATDLMKEKIDKAKQEGNTLGGKFEVIYGNLPIGLGSYVQADRKLDGKLAQAIMSIPAVKAVEIGCGVEAAERT |
GUT_GENOME095591_00002 | 200-266 | MKDEIDAAREHKDSLGGTFRVIVRGLVPGLGGTGCSDNRLTAQLGSAIFGIPALRSLDFGLGRASAS |
GUT_GENOME007837_00429 | 444-513 | DEVEQARKRGDTLGGTAKITIRGMKSGFGSFTAFDRKIDGKLAGALMSLQGVKGVEFGEGFALSGKYGTE |
GUT_GENOME164423_00355 | 154-222 | LRQRAAEARETGDSVGGRIRCTVTGVPAGLGGPDWRDTMESEISRHVFAVPAVKAIGFGDGEGFAALRG |
GUT_GENOME254183_00736 | 198-267 | MEKLILSKKQEGDSIGGIVETVAVGVPAGLGEPVFERLDGDLARILMNINAVKGVEIGFGFDVATSCASE |
GUT_GENOME243142_03877 | 189-258 | LEAYMDALRKSGDSVGACVDVVADNVPPGWGEPIYGKLDGDLAAALMSINAVKGVEIGDGFASAAQKGTE |
GUT_GENOME139743_01035 | 195-271 | MKAEILKAKSDGDSLGGVIETVISGIPAGVGEPWFDTVEGALAHAMFSIPAVKGIEFGGGFELATAKGSKANDQMRF |
GUT_GENOME056860_01054 | 196-261 | MRASVEKARLDGDSVGGVIECAVVGVPVGIGANIFSTVEGHISSALFGVPAVKGVEFGAGFDFAKM |
GUT_GENOME014781_00567 | 191-266 | MKEEIEKARADGDSVGAKAECLCVNVPRGLGGSGENGLESKLSSGLYAIPGVKGVEVGDGFDFCKMKGSEANDGIC |
GUT_GENOME221531_01243 | 194-294 | MQAAIRAAGSEGDSVGGILETIVTGLPAGIGEPWFDSVESELAHLMFAIPACKGIEFGAGFGFAAMHGSEANDPFTMRDGKIATATNKNGGINGGITNGMP |
GUT_GENOME114313_00359 | 177-251 | QKVLESAQKEGDSLGGTVECIVQGMPIGVGDALFSGLEGKIASNVFAIPAVKGIEFGEGFNISSMKGSEANDPIV |
GUT_GENOME112780_00475 | 189-256 | ERIEKARSQGDSVGGVAEVIVYDMIKGIGDNLFDGLENKIAYSLFSVPAVKGVEFGSGFDGVELLASE |
GUT_GENOME242951_00477 | 193-254 | KNENDSLGAEVELIIKGVKAGLGEPPFAPLDGELARYMYLIPSVKSVSFGRGYGFAQSLGSE |
GUT_GENOME000611_01812 | 163-226 | KALEQAKQEKDSLGGIVALTITGVPIGLGGTRDASAQSALAHALYAVGAVKSVAFGAGEKAARL |
GUT_GENOME225852_01104 | 212-280 | IRDVIDKTKRDANTVGGQVQVIATGMPVGLGSYVSADEKLDGKIARAIVGINAFKGVQFGGGFDNAEKY |
GUT_GENOME245373_01020 | 35-101 | RQEADSIGGTVECMISGLPVGLGGPLFEGLDGRIAQAVFAVPAVKSVEFGEGFGAALLRGSENNDPY |
GUT_GENOME118051_00682 | 161-237 | DASRFEELLRNAAAEGDSLGGVVACRVRGVPAGYGEPFFDTVEGVAAHLLFAIPGVKGVEFGSGFAAARSCGSANND |
GUT_GENOME040391_00769 | 196-270 | KTALAEVAAKGDSAGGTVECVVYGLPIGLGDSGTDGLESKISAAVFGIPAVKGVEFGAGFDLAEMTGSRANDPFA |
GUT_GENOME226406_01404 | 194-266 | IKERIEVARMNQDSLGGVIECYAIGVPAGIGEPFFDSLESTIAHLAFSIPAVKGIEFGKGFSITEMLGSEAND |
GUT_GENOME129166_00970 | 191-263 | MEAVIETAKKESDSVGGIVDCQISGVPVGWGEPLYGKLHAALGAAMLSINAAKGFEVGDGFALASMRGSEAND |
GUT_GENOME046051_00433 | 193-259 | FLEEKMAQKDSAGGIVECVISGVPAGIGDPVFEKLDANLAKAICSIGAVKGFEIGDGFEAAKTVGSV |
GUT_GENOME056587_01317 | 154-222 | QELLKVAQNDFDSLGALVSVNVQGVEGGLGGNLFDSLESKIGSAMLAIPGVKGIQFGLGFKFANKFASQ |
GUT_GENOME075388_00607 | 195-274 | MEQFIDRAKENGDSVGGVIETAVFGVPAGLGSFVQWDRKLDAKLACAMMSIQSIKGVEFGLGFQAAERYGSAVHDEIFYS |
GUT_GENOME242963_00645 | 191-252 | VEAVKAEGDSLGGIVWVKANGLPAGIGEPIFDKVESRLGQMIFSIPAVKGLEFGSGFEGARM |
GUT_GENOME097834_00513 | 187-257 | EEMEFAKNDNNSVGGVVETFATGLPSGLGGPMYDGMESIISPIAFGIPAVKGVEFGNGFAASYLLGSENND |
GUT_GENOME259479_00671 | 192-269 | MKEEIYLAKQRGDSVGGIIQTVIHGVKPGLGEPFFYSVESKLSSLLFSIGGIKGVSFGDGFGFANSYGSEVKDELYYE |
GUT_GENOME245179_00374 | 189-282 | IRPDMEKALRFAADNNTSVGGIVECAAVGMPAGKGRPIFESVEGMLSQLLFSIPGVKGVDFGAGFDISSMFGHESNDQWRYDETGKPITTTNHA |
GUT_GENOME136238_01107 | 195-265 | IEETRLDGDSVGGTVECKISGLPAGLGNVWFNSSESMLSEMLFAVPAVKGVEFGKGFALSEMTGSEANDPY |
GUT_GENOME092265_00336 | 195-279 | LMHTTILSAKKRLDSVGGAVECAVYGVPAGIGDPMFDGIENRIAHIAFGIPAVKGLEFGAGFKVASMYGSQNNDAYRMSNGRVCC |
GUT_GENOME260725_00971 | 59-123 | QKITETSADKDSVGGVAELVITGVKAGVGEPFFASVESRLASMMFSVPAVKGVEFGAGFSITDKT |
GUT_GENOME170478_00928 | 190-269 | DNEAMNEMIKTIKEAKINMDSVGGIITVVAFNIPVGFGEPFYSSIESRIAESMFGVPAVKGIEFGLGFDFVNYKGSECND |
GUT_GENOME075278_00184 | 190-251 | MQQRIREARAEGDSLGGIVETYIDLRAHWIGGPFFDSVESKLASYLFALGGVKGIAFGEGFG |
GUT_GENOME009748_00931 | 185-247 | MKAAIDVARQNGNSLGGIVTVGATGVPMGLGEVFPYAKRLDSVIAANLVGIPSVKGISFGTGD |
GUT_GENOME029180_00531 | 378-450 | MQDYINEVRSNKDSIGAIIECGVVGLKAGYGNPMFEGIESSISKKLFSIPGVKGIEFGQGFGVADMLGSDHND |
GUT_GENOME284400_00246 | 186-254 | DSMMTERILSASEDGDSVGGVVGCMVNGLPIGFGGIWFDALDVTIAKMMFSIPGVKGVEFGKGFSIAEM |
GUT_GENOME159606_02176 | 190-264 | MEEQVAEARRRGDSVGGRVGCLLTGLPAGIGDPVADKLQARLAMAMMSINGAKAFEYGIGTAAASSLGSDTADRF |
GUT_GENOME020682_01075 | 188-255 | ALDMLNEILAAREDGDSVGGVIRCLTEGLPVGVGEPFFDTLEGEIAKMIFAIPGVRGIEFGKGFAAAA |
GUT_GENOME096518_01283 | 196-261 | MKELIERVKEEKDSIGSIIEVGVIGMPKAVGKPIFNTVEGRLSQMAFSIPGVKGVEFGMGFDCAKL |
GUT_GENOME140605_00976 | 208-284 | AREEKDSLGGRLETMVTGLGPGYGSLFFDKLDARLAHAILTIPGVKGISFGSGFQLATMKGSQANDPMVMTKKGPSF |
GUT_GENOME042627_01038 | 248-325 | MRDTVEDARMAQDSIGGTIECCVTGIDAGYGEPMFEGVEGVIAKAVFGVPAIKGIEFGKGFALSKMRGSQSNDPFEYK |
GUT_GENOME212592_00528 | 190-260 | MERLIVDARSKGDTLGGVVTGVIAGCPVGVGSPVMDKLQASLAAAMMSINAAKGFDYGMGFAGASACGSEV |
GUT_GENOME233012_00357 | 214-286 | MEEFIKKARAEGDSVGGVIRCRIEGLPAGLGDPVFDRFEAELAKAMLSIPATKGFEIGGGFASARMFGSQNND |
GUT_GENOME004866_00564 | 196-277 | MKEKIKAARKEGDTLGGIFEVTVRGLPAGLGIPCRRRPPGSHIQWDRRLDGKLAGALMSIQAIKGVEIGAGFDCAILPGSQI |
GUT_GENOME254355_00080 | 577-670 | LNDLRLAGDTAGGVVEIRAQGVAAGTGEPFFDSIESTAAHLIFSIPGVKGIEFGAGFAAARSRGSHNNDLIADRHGTTATNNDGGINGGLANGN |
GUT_GENOME183874_00781 | 186-261 | KELILSVKNAGDSIGGCVLTRVTDAPAGLGEPLYGKLDATLAGALMGINGVKAVQIGAGVKASALKGSENNDFMRA |
GUT_GENOME236233_00615 | 212-273 | GDSIGGIVRCTITGVPVGTGEPIFNKLQAELAFAMMSINACKGFDYGTGFEGVGLKGSEANR |
GUT_GENOME014402_00523 | 180-244 | DKIDEARLFGDTLGGTFVIKVKNLKCGFGSYNGERLSSKYAAALFNVPSVKGVEIGDGFLLADKK |
GUT_GENOME104638_00034 | 264-336 | MERRMRDARDRLDSLGGIIEVCALGLPGGIGGAMYSGVESVLAPILFGIPAVKGVEFGAGFHSARLAGSVNND |
GUT_GENOME253933_01057 | 200-277 | MKARIDEAKELGQSLGGTFRIVVNGLLPALGGFEEARNRLTSKLGGALFSIPAIKGVEFGIGFKNAEVLGHEAHDEIY |
GUT_GENOME096069_00812 | 203-273 | IDEARSSGDSLGGTFAISVKNIPAGIGSYSEWDRRLDGRLAQALMSIPSAKGVEIGGGFGLSQRPGSEVHD |
GUT_GENOME246640_01150 | 192-270 | MREKIEEARSMGDSVGGIVECAVVGLPAGLGEPMFGGMEGRLAQILYGIPAVKGVEFGDGFGVADRLGSENNDPFRIVE |
GUT_GENOME244339_00619 | 171-243 | MIEQIKKAKSQKDSIGAVVRTTISEIFPCLGSPIFGSVESKISSLIFSIPGVKGIEFGKGFGAATLYGSEYND |
GUT_GENOME247252_01128 | 196-271 | MKEDIRNASHGGESLGGIIECVTINMPAGVGSPIFEGLENTIAQLIFGIPAVKGLEFGAGFQVAEMVGSQNNDPFY |
GUT_GENOME103623_00890 | 196-302 | IRNLLEEVKKDGDSIGGICQVFGENIPKGLGDPLFDSFEAKISYLSYGIPALRAVSFGLGLDAMKMRGSDHNDQFEIKNGQVKTITNNAGGVVGGITNGMPVVFDLI |
GUT_GENOME056281_00592 | 193-265 | MEEEILAAKAAGDSVGGVIECVALGLPGGLGNPRFDGVTNRLAAALFAIPGVKGVEFGDGFAGAAQRGSEQND |
GUT_GENOME253443_01127 | 179-248 | MKSEILSAKSAGDSVGGTIEVRIDGVPAGTGGNMFSSLDGKIAASVFCVPGIKGIDFGAGFSSAYLLGSE |
GUT_GENOME112278_01227 | 203-278 | MQKLIEHCRDQLDSVGGVIECAIAGVPAGIGDPMFDCVESKISSLLFGIPAVKGVEFGLGFGLSSLQGSQANDPFY |
GUT_GENOME114281_00488 | 192-259 | MTKAVIGARNAGDSVGGAIECMVFGVPAGAGDALFGGLESRISAAVFAVPAVKSVEFGLGTGFAEKKG |
GUT_GENOME058604_00810 | 195-262 | EQINKAKKNNDAVGGTIETFAIGVPIGLGEPFFDSLESLLAHAIFSIPGVKSLEFGDGIAMCNQVASK |
GUT_GENOME054080_01946 | 193-268 | MIAAIKEARDSGNSLGGVVKCVVRGCPPGLGDPVFDKLDATLAHAMMSIPATKAFAVGSGFEAAGMTGLEHNDPFY |
GUT_GENOME217414_00555 | 191-279 | MRAAIESARAAGDSVGGVVECLAFGVPAGLGGPYGEGLEGAIAGLVFAIPAVKGVEFGGGFSLCPMRGSAANDPLRIRGGAVVTETNRS |
GUT_GENOME275846_00220 | 185-259 | IEKIIAQTRQNADSIGGTVTCTVFGLKAGIGNPVFGKLDALLASACLSIGAVKGVEFGMGFDGCSRYGSEVADCY |
GUT_GENOME237525_00279 | 182-265 | ALDTTLESSQKQAILEAKNAHDSVGGVALIKAFGVPVGLGEPLYDKLDSHIGAQMLGLNGVKAVEIGEGCNASKLKGSQNNDCM |
GUT_GENOME277280_00965 | 188-261 | KKTMLDTIKKARNNNETLGGMVETFILNLPPALGEPFFDSFESILSHLLFSIPGVKGLLFGDGLDLAVKSGSKM |