UHGP-MC 75029
Information
- Number of sequences (UHGP-50): 310
- Average sequence length: 51±2 aa
- Average transmembrane regions: 0.18
- Low complexity (%): 0.42
- Coiled coils (%): 0
- Disordered domains (%): 18.45
- Pfam dominant architecture: PF04552
- Pfam % dominant architecture: 8774
- Pfam overlap: 0.31
- Pfam overlap type: reduced
Downloads
- Seeds: MC75029.fasta
- Seeds (0.60 cdhit): MC75029_cdhit.fasta
- MSA: MC75029_msa.fasta
- HMM model: MC75029.hmm
Sequences list (filtered 60 P.I.)
Protein | Range | AA |
---|---|---|
GUT_GENOME008167_01793 | 445-496 | IVQLVENEDSASPMSDRRIAEAILIKGIAVSRRTIAKYREELGIPSSSDRRR |
GUT_GENOME273599_00482 | 408-458 | VIEQEDKKAPVSDQRIVEILAQNNVQVARRTIAKYRELLGIPPTFLRRVYD |
GUT_GENOME096297_00678 | 391-435 | DVRKPYSDQKILEILKINGVEIKRRTISKYREQLGILPSNLRKEY |
GUT_GENOME074386_00565 | 240-292 | ISKIINNEDKHKPYTDDDICSILKHYGYDIARRTVTKYRLDFLDIPNSRGRKE |
GUT_GENOME237433_01566 | 423-475 | LMELVQGEDKSSPYTDEQLTKLLTDKGFVISRRTVAKYRDKLNVPSASKRKSI |
GUT_GENOME031261_02123 | 341-392 | FIADEDSKAPLSDERIGEMLKERLKVNIARRTVAKYRTALDIPSSSRRKEHF |
GUT_GENOME103694_00266 | 423-475 | IQELIQEENKKKPLSDQKIADLLKESGIEVSRRTVAKYRGEMMIADASGRKNL |
GUT_GENOME120945_02083 | 388-439 | LLHLIEKEDPAHPFSDQQLSKKLADDGILLARRTVAKYRSLLNIPSASGRKR |
GUT_GENOME021967_01284 | 386-434 | LVEGEDKARPYRDQQLTDLLLAQGISVTRRTVTKYRQQMGLPTASDRRI |
GUT_GENOME182432_01015 | 427-478 | SLIVHENKSAPLSDQKITRSLQEKGIDISRRTVAKYRYEMEIPAAEKRKMYP |
GUT_GENOME237865_00957 | 450-501 | IKELVGAENPVKPLSDSKIADILSKEGFQVARRTVAKYRDVLGIASSSQRKR |
GUT_GENOME268963_01172 | 401-452 | LQNFISDEDKKHPLSDETLRCMFTKLGLNISRRTIAKYRGELNIPSTSQRRK |
GUT_GENOME006782_00361 | 387-436 | NLIEEEDESSPYSDSRIVQLLKKQDICVARRTVAKYREEMNIPNLSERRI |
GUT_GENOME066829_00982 | 387-430 | NKMCPASDNLLVKWLYEKGFKVARRTVTKYRNELGLPAASARKS |
GUT_GENOME233276_01677 | 429-477 | IISAEDTANPLSDAEICQQLKNKGMDVARRTVAKYRERLGFPIARLRRN |
GUT_GENOME117783_01515 | 394-445 | LSRIVQAEDKEKPLSDEMISTALSTLGIELSRRTVSKYRMQANIPPASKRKR |
GUT_GENOME168406_00767 | 738-789 | IKEIIRSEDKQHPYSDEKIRQLLEEDGFVISRRSVTLYRKECGIPSSRNRKT |
GUT_GENOME096268_00161 | 383-428 | EDKKRPLSDEGVKKKLKDEFDIDIARRTVMKYRDQLGIPSSVKRRK |
GUT_GENOME236958_01721 | 404-454 | DIITSEDVSAPLSDQKIGEMLSAAGMTIARRTVAKYREELGIPSKTLRKRF |
GUT_GENOME103750_00565 | 400-451 | IISIIENEDKKKPLSDSAICNILKNEGIEISRRTIAKYRDELNILSSAQRKR |
GUT_GENOME001757_02101 | 373-423 | LIEAEDKSHPLSDQKLAELMKKDNIVVARRTIAKYREQMGILSASKRKIYD |
GUT_GENOME170487_01673 | 373-430 | LTALIQAENKQKPLSDQAIMEHFKTRNIAIARRTIAKFRTHLHIPNSTMRKRIHALQQ |
GUT_GENOME222887_00275 | 431-482 | IKDLIAAEDKSHPLSDAKICAALNEKGIEIKRRTVAKYRESLGIEAQSRRRW |
GUT_GENOME101248_01507 | 421-472 | LQELVDGENKHKPLTDDQLVDEMTKKGYKVARRTIAKYRDQLNIPKARLRKE |
GUT_GENOME188579_00383 | 373-424 | IEELIKSENKSSPLSDEKISLYFKNKGCSIARRTIAKYREELGIPSTRERKR |
GUT_GENOME109639_00963 | 389-442 | IAEIINGEDKSKPLSDGAIADILKGRNIDIARRTVAKYREQLNIPPKSMRKRFD |
GUT_GENOME243076_00153 | 407-459 | ISEFINKEDPKKPLSDQKISEMLLDRGIDVKRRTIAKYRESLEIPASSKRRRF |
GUT_GENOME142588_03586 | 390-444 | IQNLIEKENKQKPLSDQEIVRIIRENDGMVISRRTVAKYRDQLGIPSSSKRKRYD |
GUT_GENOME047786_00169 | 381-426 | AENKCEPLSDQKLMQLLQRRGMRVSRRTVAKYRAELGIPGTHVRRS |
GUT_GENOME245512_00270 | 367-420 | LKELIRNEDHKKPYSDQKLCDILLHEGYTISRRTIAKYRDEFKIPSAAKRKSYE |
GUT_GENOME141720_02487 | 377-429 | LKDLISHENPSHPLSDEKIVEILQNKSNIQVARRTIAKYRQKAGIANARQRKT |
GUT_GENOME039082_01361 | 355-404 | SIIRQEDKHKPFSDQQLVIKLSAMDLKVSRRTVAKYREQLHIPGSSQRKL |
GUT_GENOME035512_02022 | 386-436 | ISAIIEHEDSHSPLSDQMILRKLQDQGISISRRAIAKYRDEMGIPSSKARL |
GUT_GENOME233269_00086 | 451-506 | LKTLIDNEDKSKPLTDQQLTDQLNEMGYPIKRRTVAKYREERLNTPVARQRKEHIV |
GUT_GENOME275835_00006 | 473-523 | DLIENENPASPLSDSKLVSLLKERGLDIARRTVAKYREELGIQSSHLRRNF |
GUT_GENOME045540_01699 | 403-454 | IESLIAGEDKRKPLSDQAMSDRLALQGIMLARRTVAKYREELNFPPAHQRKQ |
GUT_GENOME218105_00469 | 366-417 | IAKLVEEENKSNPLSDQKIAEGLQQKGVHISRRTVNKYREQLHIPPASQRRV |
GUT_GENOME141764_01393 | 373-423 | IKNENKANPLSDDQLVANMQKNKINLSRRTVAKYRKCLGIKNSYQRKTAEN |
GUT_GENOME000979_02789 | 400-452 | LRRLIEEEDKAAPLSDQKLCERMAQEGCPLSRRTVAKYRDEMNIPGASGRRRY |
GUT_GENOME056087_01557 | 388-439 | ISELIASEDQNAPLSDHKIQALCAARGMQVSRRTIAKYRSALGLPSAAGRRA |
GUT_GENOME236868_01616 | 452-503 | IKDLVANEDSSAPLSDQAITDALMRMGMKVARRTVAKYREELRILPARLRKT |
GUT_GENOME036678_00016 | 409-460 | IIDSEDKHKPYSDRVISEMLENEGISVSRRTVAKYRESMGIYGTSMRKDFFK |
GUT_GENOME090247_01091 | 266-315 | FVDAEDRQHPLSDGDLADRIRLQTGATVARRTIAKYRGQLGIPNQASRRE |
GUT_GENOME152697_00430 | 383-435 | ISTLVEKEDKKNPLTDKEIAERISGLGTAVSRRTVAKYREELGYAPSAKRKAF |
GUT_GENOME037223_01005 | 93-144 | LQKLIDQESSHQPLSDQQLVEHFADHGVTLSRRVIAKYRKKLNIPNSYQRKR |
GUT_GENOME263459_01022 | 401-453 | LCAMIQQEDKLHPLSDEALAAALARDGVEISRRTVAKYRSQLQIPPAGGRKSL |
GUT_GENOME022784_01732 | 407-455 | IIDAEDKLHPLSDESLRRVLSSMDIEVSRRTVTEYRKEFDIPSAAYRKR |
GUT_GENOME041802_00561 | 403-454 | LQRLIGQEDWGRPYSDDKLAALLRQGGLDISRRTVAKYRGECGIRCAAERRK |
GUT_GENOME135965_00985 | 418-469 | MRRLIAEEDSSKPLSDARLAAALQEQGVNIARRTVAKYREQMKILPASLRRG |
GUT_GENOME218569_00163 | 432-478 | LAQNAGNTKPLSDQKIADELEKRGIKIARRTVAKYRANLSIESSYTR |
GUT_GENOME051611_00711 | 373-428 | MTALIAAEDKAHPYSDQELTDRLAAEHIYIARRTVAKFRRQLGIPKSCIRRQLKNS |
GUT_GENOME071666_01539 | 450-501 | IIDKEDKKEPLNDDRLVEMLGEHGYKIARRTIAKYREQLGIPVARLRRSVMQ |
GUT_GENOME052132_01065 | 358-409 | IKDFIAQEEKAFPYSDYTLYLLLKKEGYVISRRTVTKYRKLLNIPSSYERKE |
GUT_GENOME276852_01192 | 345-398 | NLIYNENKTKPFSDESIKKALAKENIHISRRTVTKYREQLLIPCASKRKSKQNG |
GUT_GENOME063438_00760 | 408-456 | LIEAENPRHPLSDARIEAELSRIGISVARRTIAKYREQLNILPASMRRR |
GUT_GENOME236234_02964 | 447-496 | QIIDEEDRKQPLSDDAISKQLKSLGIKCERRTVAKYRESLGIPGATKRRI |
GUT_GENOME079490_00183 | 115-170 | LEEFIAAENKARPYSDSKLVDLFKEKEILISRRTIAKYRKELGIKSSFARKELVDE |
GUT_GENOME279992_00699 | 389-438 | NIIEKENKKCPLSDQNIVNKLEKMGYHLSRRTVAKYREEMQIGSTRIRKV |
GUT_GENOME008828_00132 | 359-410 | IVELVDNEDKEHPLSDNQILKKLEELELHCSRRVIVKYRQQLNIPSSTKRKM |
GUT_GENOME158713_00181 | 384-435 | DIISSENKSSPLSDQEITDQLMNRGIRISRRTVAKYRDGEGIPSAQRRKKYI |
GUT_GENOME096568_01473 | 447-497 | IQEIIAAEDKRKPLSDEAVASLLAERGLKVARRTVSKYREQLGIAKAGLRR |
GUT_GENOME254256_00026 | 387-438 | IGELINAESRIKPLSDQKIVDALSQEAITVSRRTIAKYRMELGVPGTSVRKE |
GUT_GENOME009394_01831 | 403-451 | MMEEENKKHPLSDQQMADLFAKRGTKISRRTVAKYREELGFANCRERKE |
GUT_GENOME001636_00926 | 390-442 | LKELVAAEDKKKPWSDEQLAGLLQDAGMPISRRTVAKYRMELGIGGAFQRKDG |
GUT_GENOME038080_00822 | 420-466 | VEDEDKRRPLSDEAICKILRADGYDVSRRTVSKYRDRLGIPAGRLRA |
GUT_GENOME142418_01455 | 409-457 | LIESEPAARPLADEAIAGLLSRQGVNIARRTVAKYREQLDIAPARERRR |
GUT_GENOME245263_00041 | 391-442 | MRMLIHEEKKSNPLSDQAIADQLCREGITISRRTVSKYREEALLPAATARRE |
GUT_GENOME234239_01242 | 387-435 | LIEAENPKVPLSDNKLAEQLKKFGISVSRRAVTKYREQLSIPGSYDRKG |
GUT_GENOME283183_01295 | 405-453 | VIAREDRDKPYSDQKLSEILSVMGIEISRRTVAKYRDELGIADCRARKF |
GUT_GENOME215209_02038 | 362-414 | LQRFIKHEDKEHPYSDQELMLILKDEGVEISRSAIAKYRKILNIPPAYKRKNF |
GUT_GENOME096508_03324 | 397-448 | IKEMVQTENKQRPFSDEQLARALKSKGVKISRRTVAKYREELMIPSSTKRRR |
GUT_GENOME069566_01386 | 475-526 | MRSIISGEDHKAPLSDRRICELMAERGVPIARRTVAKYRERLDIPVARLRKK |
GUT_GENOME014975_00890 | 354-403 | IVDEENKLKPYSDEKIVTILKSMGLNLVRRTVTKYREELGIPSSRDRKNN |
GUT_GENOME092451_00962 | 392-442 | IINQENKSKPLSDQKICTLLEAYAIHISRRTVNKYRSEMNIPDKIGRKAWA |
GUT_GENOME162155_00665 | 378-429 | ISEIVSGEDKAKPLSDQKITERLCGEGYLISRRTVAKYREQLQILPAAKRKL |
GUT_GENOME141041_02222 | 395-449 | LKQLIAKENKEKPLSDQKLVDYLQKEHQIEISRRAIAKYRDELRIPSSSKRRRYA |
GUT_GENOME098264_00422 | 365-418 | LLSLIDQEDRSKPYSDNALTKLLVAQNIPIARRTVTKLRLQLGIPNSTIRKAHN |
GUT_GENOME149298_01131 | 422-473 | LRQLISQEDPTKPLSDNALVQQLAKHNILIARRTVAKYRDLLGIPTSQVRRR |
GUT_GENOME005466_00158 | 409-460 | LRQLIGNENKQKPLSDEAIRQRMEALGTPISRRTVAKYREELGIPAAFARKV |
GUT_GENOME127105_00369 | 411-462 | IEQIISREDPSEPVTDEHISALLKENGYDISRRTVSKYRLAAGIPGCAQRKR |
GUT_GENOME000625_04595 | 398-450 | LGELIKNEDKKKPYSDSVLAELLAGQGLKISRRTVAKYREMMEIADCRGRKEF |
GUT_GENOME151695_00106 | 399-451 | LKRLVDQEPKAKPLSDAKLASALEQQGFAVARRTVAKYRESLGIAPASERKNL |
GUT_GENOME218226_01678 | 422-472 | LRQMIEAENPRKPLSDAKLTDQLKQAGIAVARRTVAKYREAMNIPPSHERV |
GUT_GENOME093879_01143 | 367-418 | IQNIVKKEDPYHPLSDDKISNLLNDLDYQISRRTVAKYRDILTIPSSSKRRK |
GUT_GENOME036651_00552 | 379-430 | LGRLVAEEDPAQPLSDEQIAARLAESGVPLSRRTVAKYRVILGIPCAYDRKR |
GUT_GENOME218245_00692 | 378-428 | LVEGEEKHNPYSDAELVEKLGEQEIHISRRTVAKYRESLGIPGCFDRRRYD |
GUT_GENOME066223_00535 | 395-446 | ISRLILGEAKDKPLSDMSISKMLAEKGVRISRRTVAKYRDAMGIPAAAARKR |
GUT_GENOME274864_00024 | 429-484 | IKQLIEAENKKNPLSDQAIQESLQAQGFDIARRTVAKYRERIGYPVARLRKTYRND |
GUT_GENOME146011_02570 | 402-447 | ENKNKPFSDQKITELLHSRDINISRRTVAQYRDSLSIPDCRIRKLY |
GUT_GENOME183375_02189 | 416-467 | LLRFVQAEDPAKPLSDEALRAALEAVNLPVARRTVAKYREELGIPSSSARRR |
GUT_GENOME062039_00402 | 398-449 | LSQLIESEDSSKPYSDEALVKLLKQRGIEIARRTVAKYRESLDIPPAHKRKF |
GUT_GENOME069124_01353 | 439-490 | LQKIIDGEDKSQPYTDDQLVGLLREKGFSIARRTVVKYRDQLGIASTRYRKV |
GUT_GENOME237873_00166 | 421-470 | DLIKNEDPDNPLNDADIVDALAVKGIKVARRTVSKYRERAGIPVARMRRK |
GUT_GENOME143139_02689 | 307-359 | LSNLISEERREKPLSDQALADELVSRGVSISRRTVAKYRESLGIPSSPQRKHR |
GUT_GENOME238206_01648 | 443-495 | LTECVDGEDKRHPLSDDELTKILSSRFGYDIARRTVAKYREECGIPVARLRKK |
GUT_GENOME025324_01188 | 427-475 | LIEVENPENPLQDEEIVEELENMELYASRRTISKYRKELNIPNSKQRKK |
GUT_GENOME102034_01672 | 434-485 | MKDLIENEDTHSPLTDQALCEYLNSRGYDLARRTVSKYREKLGFPVARLRRQ |
GUT_GENOME097834_00264 | 382-434 | IAEMIEKEDGSKPLSDQKLALELSKMGMDISRRTVAKYREEMGIGSSSKRRKY |
GUT_GENOME096220_00179 | 373-421 | LINEEDKSHPLSDQDICDQLLLQGINISRRAVTKYRTKMNIQNSYWRKL |
GUT_GENOME113654_00373 | 428-480 | IENLIKNEDKRKPLSDSKIVMLLEKEGISIARRTVAKYREKLRIAPASERKSI |
GUT_GENOME096878_00208 | 375-426 | IQTIIAEENQTQPLSDEKIKQRLNQQGFPISRRAVTKHRSNLGILSTRERKE |
GUT_GENOME085141_01895 | 393-444 | LKSLIEQEDKKKPYSDSKLVEELKKRGIILSRRAIAKYREEMNIKGSFDRKE |
GUT_GENOME235150_01477 | 423-474 | LKEIIEQEDKSRPLQDEELAEMLQQKGFNVARRTVAKYREQLGIPTSRMRRE |
GUT_GENOME138818_01736 | 393-446 | IQEIIRSENKQKPLSDAKIAEQLEKKGIRISRRTVTKYREQMQIPNTQMRKEYV |
GUT_GENOME095250_01698 | 488-539 | IKQFVAAESPTKPLSDSQLAEMLKEQGIECARRTVAKYREALKIAPANLRKA |
GUT_GENOME253660_00652 | 388-439 | IAQLVAEEDSRCPYSDQELMERLQAEGIPVARRTVAKYRQQLNIPVARLRKT |
GUT_GENOME029144_01446 | 539-591 | MKDLIDHEDPANPLNDESLVELLKKQDILISRRTVAKYRKDLGILSAGKRKRY |
GUT_GENOME207678_00520 | 408-460 | LKDLIAAENPKKPYSDQMLSELLGRKNLVLSRRTVTKYREELGIPSSSKRKRY |
GUT_GENOME140104_03019 | 409-457 | IVEAEDKRKPLNDGEIQARLAGQNIHIARRTVNKYRQELGITDKAGRRI |
GUT_GENOME231259_03970 | 412-465 | LRRIIQGENKKKPYSDRLLGEKLAEEGISISRRTIAKYREEEGIPDASGRKEYL |
GUT_GENOME090246_01761 | 407-459 | IQQLIRQEDKHKPHSDSSLVTELERQGIHISRRTVAKYRAELGIPTAGCRKDV |
GUT_GENOME125765_02289 | 115-167 | LAQIIAEEDPQHPLSDEQLSKHLAAQHIVISRRTVAKYRSELGIPTTAVRRQI |
GUT_GENOME234142_00873 | 457-507 | DIISHEDRTNPLSDEKIGQELAGRGFNIQRRTVAKYRDHLCIPTAAKRKDK |
GUT_GENOME282926_00339 | 448-498 | LISNEDPHKPLSDNNIAEILKKDGLVISRRTIAKYRESLRIATSSQRKQLL |
GUT_GENOME167939_00939 | 400-451 | LQSLIDDEDKHHPLSDQTLCDKLNAMGIPVARRTVNKYRTKLGIPGKSARKV |
GUT_GENOME096435_01356 | 375-427 | ITEFIDNENKQKPLSDKKLMDKLYQEHGIEIARRTIMKYRKQLNLPSSAKRKE |
GUT_GENOME199821_01289 | 477-525 | LVDHENQQKPLSDAKIAELLAARGINVSRRTVAKYRDLLCIPATYLRRA |
GUT_GENOME238256_00392 | 123-172 | LIDDEDKSCPLSDDKISELLKEYGYKVARRTVAKYRGMINVPGAVERKNQ |
GUT_GENOME157767_00755 | 367-411 | DAEAPLSDQQLSDRLKALGCPISRRTVVKYREKLRIPDSRVRRQW |
GUT_GENOME024782_00736 | 388-441 | IQDLIQAEDKTHPISDQTITEKLKQEGLQISRRAVAKYRDELGIRSSFDRKQFD |
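
The rows above can be checked programmatically. The following is a minimal sketch, assuming the Range column gives 1-based inclusive coordinates on the parent protein, so that `end - start + 1` should equal the length of the AA string. The helper names `parse_row` and `range_matches` are hypothetical and are not part of any UHGP tooling.

```python
def parse_row(line):
    """Split one 'Protein | Range | AA |' table row into its fields."""
    # Drop the trailing pipe, then split the three cells and trim whitespace.
    cells = [c.strip() for c in line.strip().strip("|").split("|")]
    protein, rng, seq = cells
    start, end = (int(x) for x in rng.split("-"))
    return protein, start, end, seq


def range_matches(line):
    """True when the range (assumed inclusive) spans exactly len(sequence) residues."""
    _, start, end, seq = parse_row(line)
    return end - start + 1 == len(seq)


# First row of the table above, copied verbatim.
row = ("GUT_GENOME008167_01793 | 445-496 | "
       "IVQLVENEDSASPMSDRRIAEAILIKGIAVSRRTIAKYREELGIPSSSDRRR |")

protein, start, end, seq = parse_row(row)
print(protein, start, end, len(seq))
print(range_matches(row))
```

Running `range_matches` over every row is a quick way to spot truncated or mis-copied sequences before using the table downstream.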