UHGP-MC 23819
Information
- Number of sequences (UHGP-50): 165
- Average sequence length: 77±4 aa
- Average transmembrane regions: 0.1
- Low complexity (%): 0.91
- Coiled coils (%): 0
- Disordered domains (%): 10
- Pfam dominant architecture: PF04326
- Pfam % dominant architecture: 7394
- Pfam overlap: 0.2
- Pfam overlap type: shifted
Downloads
- Seeds: MC23819.fasta
- Seeds (0.60 cdhit): MC23819_cdhit.fasta
- MSA: MC23819_msa.fasta
- HMM model: MC23819.hmm
Sequences list (filtered 60 P.I.)
Protein | Range | AA |
---|---|---|
GUT_GENOME135944_01355 | 408-485 | ILVIYVPRAKREQKPVYINNDLFGGCYRRDWEGDYHCSKAEIKGMLRDAADETEDMKIVEQFDISAIDTESLKGFRNH |
GUT_GENOME108333_01343 | 102-179 | VILVVDVPRAQRERRPVYIGGDIFSGTYKRNSSGDYHCTKGEVRSMLRDQGSQTVDTKILTGMNLSVLNPESILAFRM |
GUT_GENOME046407_00498 | 101-175 | VAIKVPRADRTQRPVHINNSMNAGTYRRNGEGDYHCTMEVIRAMVRDSYDGALDKEVVERVGLDALSSDSIRRYR |
GUT_GENOME182910_00182 | 107-185 | IIAIHVPMANYTMRPIYINKNLIGGSYKRNFEGDYHCTDEEVKSMLRDANENGNDGTLVENYDLNDIDMPTLHAYRNRF |
GUT_GENOME136042_01521 | 108-181 | VPRATRREKPIYINNNPMTGTYKRNHSGDYKCTEREIRRMFSEQSDIARDNVSLEEVTIGDINIETLVSYRKRF |
GUT_GENOME257279_00696 | 99-177 | VIAISIPRADRHVRPVFIDNNPLTGTFRRNRGGDYRCGYETVRSMMRDAADEPQDLRIVEDMVASDLDSDTVRRYRNRY |
GUT_GENOME206725_01538 | 103-180 | LIVMDVPRADRMERPVFTHGNPRNSFRRNGDGDYKCTMEEIAAMMADSSRTATDMSIVRTSMIGDLSKKTIESYRNSY |
GUT_GENOME270652_01246 | 113-189 | LVIMEVPRAERHLRPVYVNGRVEDGTFRRGPEGDYLCTLDEVDSMVRDSRDPIDGMVLDHMGIQDLDGETLLEYRSV |
GUT_GENOME237878_00055 | 107-184 | VVVIEVPRADCTQMPVYVGDNPIKGTYRRNGSGDYHCIENEIKAMFRDQVEVPSDRKIFYPDLSLNDLESESIKRYRQ |
GUT_GENOME000504_01041 | 101-177 | IIEIKIHQASDQRKPIYLNNDINQSFIRNGSTDTKADKDSMKTLIRLSEDQLDTQVLRNYNLDDLDLKAVEDYKNEL |
GUT_GENOME263228_02684 | 103-176 | IMIEVPRAHYTEKPIYLNGNLYQSYKRNHEGDYKCSENEVKIMVRDSSDDSLDNTLVNGFSIEDLDLITITSYK |
GUT_GENOME022512_00787 | 104-180 | IFIIHVPKARLELRPVHINDDMINGSYKRNHEGDYHLTRAEISTMLRDASPVSYDMKVYDNIPLSVLCDETIAKYRR |
GUT_GENOME018685_01308 | 102-177 | IAIEVPRAPRQDRPVYIRNNLASGSFRRNGEGDYHCTSEELKALVRDGGDDIDRLVLEEFELGSLCEETVASYRSA |
GUT_GENOME207812_03439 | 103-181 | VLIINVPRADYKQRPIYLNENPYKGTYKRNAEGDYKSTEDEVNAMIRDASEEGNDGAIFEDYTINDLDEDTIKKYRNRF |
GUT_GENOME097844_00907 | 106-194 | FFLLFRIPRASYDIRPVFLTRNPFGNTFKRNHEGDYHCTDSEIRQMFADAHHSTLPFDNQILPNYSMSDIDTETLRGYRQRFILRKNNH |
GUT_GENOME047119_00576 | 103-185 | IIIIHVPRAERSIRPVYIGTDPRKGTFRRNHEGDYHCSPYDMSAMFRDAYQITQDKKVLSDIALDAICLDTVHSYRNRFNVVH |
GUT_GENOME255836_01453 | 154-232 | VIIVIHVPAAHREQKPVYINDDIFGGTFRRNWEGDYHCTRIQVKTMLRDQGDDTIDMSVLDTIDIDIFNSDSVRGYRNI |
GUT_GENOME212393_01027 | 106-181 | IIIHIPEVSVQEKPIFLNNELSQSYIRKHSGDYKISTDELAALLRDRSDHLDYELLDNYTIEDLDIDSVERYKFEL |
GUT_GENOME164234_00240 | 98-181 | IIVIEVPKAERRARPVYIGENPFNEGKHSGTYRRNHSGDYKCSKEEIQRMIADQLEESQDSLILEGFEIEDLNKETINSFRNRL |
GUT_GENOME094301_04260 | 110-189 | VLAIHIPQAMRKQKPVHLKKSPFGNTYQRLHEGDRVCDDMTVKRMLAEQIHDSRDNEVLSEHYTFQDDIDLDSLKVYRNL |
GUT_GENOME100287_00953 | 167-251 | IMVIRVPEADYRQKPIYLNKNRERSYKRTFEGDYLLTDEDIAMMVRDSMTEASDFSLMEHCGMSHIDAETLRKYRTAFNIRNTGH |
GUT_GENOME152600_00720 | 101-179 | ILVIIVPKADRRDRPIYVGENPFTGAYRRNGEGDYHCTQEEVRMMLRDQAESSQDTHLIERMGMDVFDYESVARYRNRM |
GUT_GENOME025112_02461 | 113-188 | IVVIEVPRADRRKKPVFIGSDPYMGSYRRNGEGDYRCTREEVESMLRDADAGSQDGQIFMDLGTEALDEGTMERYI |
GUT_GENOME102909_00634 | 105-183 | IILFHIPRAARTQRPVYRTTNPYNGTFKRNDEGDYKCTEQEVRRMFADADDSSPRDSKILKGFTIDDIDNNSVRQYRNL |
GUT_GENOME127937_02328 | 111-180 | ADRSIRPVYVGQNPLSGSYRRNGEGDYHCTKEEVTAMYRDASRISQDQKVMTGKDVSAFCMDTVHSYRNL |
GUT_GENOME175139_01921 | 101-178 | VIRIEVPRANYSDKPIFLNGNPYNGTYKRNYEGDYKCSVSEVNSMIRDSSDDNYDSVLIENYDINDLDMETVHRYRNM |
GUT_GENOME011015_00507 | 101-177 | IISIHVPRAQRTDRPVYINQNPFSGTYRRSGEGDYRCTEEEIQAMLRDAAQRTHDMQVYEELSPDVLMTDTIRRCRA |
GUT_GENOME143140_00003 | 100-178 | VLVIEVPRAKRQQRPVYISGNPVRGTYRRDHTGDYSCGMSEIEAMYRDASNESADLRLHESLSVTDLSTETVDSYRRRY |
GUT_GENOME111876_00389 | 106-183 | IVVIRVPRAERTSRPVYVGADPRSGTYRRNFEGDYHCSREEVALMIRDSALVTDDNTLLTDLDTSVFCQETVKSYRNI |
GUT_GENOME020734_00031 | 106-169 | IPRAERRKRPVFINGSMENGTYKRNGEGDYHCTVNELKQMLRDSSDSSQDSEVVTGITIEDLDM |
GUT_GENOME209236_01462 | 132-209 | IIVITVPKAKRFDRPVYLDGSIHNSYRRNGEGDYRCTLDQINAMVRDSEIKTQDMNIIECMTASVFNTDSLHSYRIKM |
GUT_GENOME269801_01088 | 103-178 | IVVINVPQADKTFKPIYINNNPLFAYRRNHEGDYRCTKLEIQSMLRDQDEQSNDSYMIEDMDLSIINQDTLAKYRT |
GUT_GENOME141754_00498 | 116-198 | VLFIGIQRASREEQPVYLHDNPNHAFIRQHEGDYRIQKHLLSRMFADKALDCFDDVVLPYRDLSHLDTETLRRYRHLFTVFNP |
GUT_GENOME111744_01462 | 122-192 | ATRDQRPVYVGLDPMTGTYRRDHEGDYRCEPSTVSQMFADRKNSEVLQEAKILPNYSWDDIDLTSFRQYRT |
GUT_GENOME148679_01267 | 103-182 | MLIIVNVPNANRQDKPIYINNNPITGTYKRYHDGDYRCNKKEIQVLFSESTEESKDEIILNEFNINSIDKETLESYRKRF |
GUT_GENOME176357_01533 | 104-179 | VLAFHIPVSANKPVYFGNNLNNTFVRIGSGDQRATDIEIDILMREKTFGMKSEMEVEGTAFNDLNTQSLQTYRRRI |
GUT_GENOME212940_02928 | 100-181 | IILMHVPIASREFKPVYINNNLLSGTYRRNNDGDYHCTQLEINNMLRDQSDKTQDYRIIERYSLPQVNLETLDMYRKRFMMF |
GUT_GENOME129476_03149 | 134-209 | IISIFIPEQEQSKKPVYLNNNYAYTYIRKNEGDYIVSDDELRRFIRNASDDLDSELLDEYTLEDLNLESVLAFKNI |
GUT_GENOME034989_00656 | 108-186 | LLVINVPRAERDMRPVYQGLDPYEGTFKRNFEGDYKCSRREVRRMFSDANLLYSSDQRILDNYSFEDDIDKESLLQYRQ |
GUT_GENOME267034_00061 | 106-184 | LLVFHIPRAPYNLRPVYLTLNPFGNTYRRRHEGDYVCTDDDVKQMISDANSLRSSVDSRILRGYSLDDIDLPSLHQYRR |
GUT_GENOME238202_02633 | 112-180 | AERQDRPVYVGEDVFKGTFRRNGEGDYRCSREAVKAMIRDQGEITADACMLEKRAVTDLNADTLRRYRT |
GUT_GENOME143284_01707 | 106-183 | LLIVPIPKMSYSEMPIFLKDNDRYTYRRIGRDDYLCDRKTIEMMIRDSLEISFDSKIIKGYSINDFDIDTINEYRYKF |
GUT_GENOME105900_00413 | 101-179 | IVVIEVPRADRRDRPVYVGGDPVKGTYRRNGEGDYHCTPDEVRAMQRDSREGTLDGQILSALPPSSLSEPSVERYRKRL |
GUT_GENOME016523_01759 | 101-178 | IVIINVPRAERSYKPVYVDGNPLCTYRRNGEGDDRCTKEEYQAMVRDASVKTQDMLVLNEMDLTVFNQESVRSYRQRM |
GUT_GENOME187639_02068 | 110-189 | ILICEIPRASYELRPVYINNNPLGNTYRRNHEGDYVCTDVEVRRMFADAEHDRHPQDGRILVGYNFERDIDIETLHQYRQ |
GUT_GENOME113208_01165 | 104-180 | VLAVYVHEAQLNQKPVYLNGSLSSTFIRKHETDCRASREELSAMLRNQSDDLDSELLNGYSIDDLDMTSIANYQARL |
GUT_GENOME236436_01221 | 108-176 | ADRRDRPVFIDGNDRNTFRRRGDGDYKCTVDEISSMISDAMSVATDRIPVMTSDISDFNPDTVKSYRNS |
GUT_GENOME080327_01366 | 119-195 | LVFNVPRAPRNKIPVYLHNNPENTFKRNYEGDYRCDASEIQRMFADADITEHPRDYKILPEFTIEQDIDKATLEQYR |
GUT_GENOME090270_00455 | 109-193 | VIYINVPEADYRQKPIYINQKLEQGTYKRGHEGDRQVTKEELALLLRDSSDNTDSQIIEHYGMEDIDAETLEKYRRSFNILNPGH |
GUT_GENOME286267_01361 | 114-191 | VVKFDIPQATHEQRPVYLKSVPYDGTYRRNDSGDYKCERDEVRRMMADADLSHPTDSRILKGCTWDDIDISSFEQYKR |
GUT_GENOME281920_00409 | 104-183 | MVVAIHVPRAPRDERPVHINDNLFAGTYRRRGEGDYRCTRQEISAMLRDQGTETMDSKVLDDLTVADFNADTVRAYRTRF |
GUT_GENOME128842_00630 | 96-174 | VIKIRIPEAPRNEKPVYLNSNISDAYIRIGDGDHKADINEIRSMLVDNSTENYDYLPNKMNHGFDSVNLETLRSFRNKY |
GUT_GENOME096927_02005 | 103-177 | IIVCEIPEAPFDDKPVYLNNHLANAYVRRGSGDHKITKDELSSILRDTNHELDRELLNNFTIDDLDMTSIIKYKA |
GUT_GENOME098095_02071 | 104-178 | IIIIPVNKIDYKDKPIYLNNNIASTYFRQGTGDFRCSQEQINAMLRDSAKESFDSTLVKDFSILDLDRETINRYR |
GUT_GENOME001334_01199 | 100-182 | IIMVIYIPKAGRDERPIYINGDMFKGTYRRNYEGDYRCTKDEVMAMLRDQPEETMDMKVLDYFDIDVLNEDTIKAYRNRHVLL |
GUT_GENOME192814_00279 | 101-183 | VLVISVPQAERHQRPVYIGDSPFTGAYRRDGEGDYHCSPDEVRAMLRDRDDAPADLAVLESRGPEVLSGETLRQFRLCMVMRQ |
GUT_GENOME018467_00336 | 119-192 | MVLAFHIPSSASKPIYFNSIKNTYIRSGSGDRKATESEIASMFRDQAFGTRSEMPIADTDISYLNEQSLRSYKA |
GUT_GENOME084579_00225 | 107-183 | LIVIHVPRADFNMRPVYVGENPYKGTYKRNHEGDYHATEHEIRGMIRDQNPEGNDNQIIEYYTMDDIDKETLRKYRQ |
GUT_GENOME224893_03018 | 111-176 | ITVSQANRKQRPVYLNSNPMTGTYIRLHEGDRKCTPDAVKIMLAEQMTDTLDDRIFEGFTIDDLDK |
GUT_GENOME259595_00236 | 99-171 | LLAFYIPEVEMKPVYFGNPANTFIRMGSGDQRATEGEIRAMFRNQSFGKKTEETIPESGIEMLNLDTLHNYRF |
GUT_GENOME167851_00599 | 101-179 | IVLIEIPRALACQKPVFLGSSPFSGTYRRGGEGDYRCQISEVQIMMENRFRELPDGAPVVSAVLDDFKPSSIQAYRQRY |
GUT_GENOME237448_01195 | 111-181 | APIEYKPIYINNNILTGTYRRNYEGDYRCSPSEIKAMLRDAESKSQDLLCLDALGIDALCKDTIISYKQRL |
GUT_GENOME237854_00876 | 109-178 | ASHDARPVFLDANPLMRTFRRNGEGDYRCTQREVSAMLRDNSDGVQDRAVLPHLEKDAICPETVRRYRNA |
GUT_GENOME285908_00492 | 105-179 | IVMDVPAADRTLRPVYIRNVNSGTFKRNGSGDYHCNASEIAAMYRDASPESRDTFVARDSELGDLSSESIEAFRN |
GUT_GENOME112596_01224 | 104-181 | LIVIRVPRADRKDRPLFINGNSRNSFRRNGEGDYRCTPDEIDSMITDSMSGPTDRVPVNTSGISDFDPGTVSAFRNSM |
GUT_GENOME157975_00096 | 203-281 | IDVPQASYHQRPVYINGNPLKGSFKRNHEGDYHCTEEEVKSMLRDASDTGNDGGLLDGYTMDDIDAETLKSYRIEYELH |
GUT_GENOME085874_00744 | 110-187 | VVVIYIREASIRQKPVYINGIDSYQHVYIRKYEGDCRATQEEYRRFVRNTQDNVDEELLNYFTVDDLDNESILLFKNI |
GUT_GENOME196404_01020 | 182-260 | VISIDIPRAAREHLPVYVGPNPMKGSYRRNGDGDYLCDGETVHAMLRDSDLLPLDRAIVEDMGVDALNADSVASYRRQF |
GUT_GENOME283696_00960 | 100-175 | LVVIDVPRAPSSMRPIYLDGHVDNAYRRRGEGDYRCTQAQVFAMARDAAPEGADGTVLEEFGLDALATESIASYRR |
GUT_GENOME059169_01269 | 103-176 | INVPRAPRQLKPVYVSGDQARGTYRRNGEGDYHCSADEIVAMVRDASPSPLDALVLENFGLEALDMDSVERFRA |
GUT_GENOME273362_01305 | 123-201 | IDVPAADRHDKPVYIGTDPMKGTYKRDYEGDFLCAEEAVRAMFADQRDVSGDVEVLEEFGLDVLNQDTIKGYRIIFEQL |
GUT_GENOME244253_00860 | 109-186 | LIIIEVPKANYKEKPIYINNNPSLTYIRQGDGDYQCTDDILRTMYRDSNNESYDSKVIRNFSLDDFDDKTIKNYRNKF |
GUT_GENOME274093_00160 | 116-193 | VVVIHVPRATYEERPVYINNNLMRGTYRRRHEGDYHCPEPIIRMMLRDAYDDGNDRMFLEHYTMDDIDIPTLEAYRNM |
GUT_GENOME237435_00997 | 103-178 | VIIDVPRATRSSKPIYIGTDMFKGTYRRNHEGDYLCTDLEVKTMIRDSLETSADSAVLDNVGLDALNSDSIKSYRS |
GUT_GENOME275148_00495 | 102-175 | IVITVPRASRQDKPVFIGGSPFGGTYRRSGDGDYRCSYSEVENMLRDAKITTPDTFLTDLPLSALNEATIRRYR |
GUT_GENOME095588_00311 | 102-181 | IIRITVPRANRSDKPIYINGNPIGGTYRRNFEGDYKCSQEEYQAMVRDNGSSVNSMDQSPLLQHSVDDFDMETVRSYRNM |
GUT_GENOME127137_00694 | 101-177 | VVVLTVPRAERQDRPVFLDGDPFRGTYRRRGDGDYRCGPEEVRAMQRDARRRSWDTRPLGKLGFRALDPKSLASYRE |
GUT_GENOME151975_00818 | 103-177 | IVKVYIPKVPFKDRPVYIKGDIKNVFKRVGTGDRLANDSDIKAMLRDAAGDDSMETLDGFDISDLNLIDLQNYKA |
GUT_GENOME096534_00359 | 102-176 | VIIHVPRADRHNRPVYLGTDPFSGAYRRGGEGDYHCSAEEVRAMMRDQADLPRDLAPLEGLDQNALCPAAVTRYR |
GUT_GENOME159606_00923 | 113-190 | FLIFFVPRASREERPVYVGQNPMRSTYRRNASGDYLCKEWEVAIMLAEQRPKLAMDAEILEGYSLDDIDKESLRGFRQ |
GUT_GENOME284320_00668 | 111-202 | VLIIRVPPASFRYRPVYIGTDMLKGTYRRDHTGDYRCMPEEVKRMLADSLPDKPDSLVLEHSTIEDLDLPTLDQYRNILRSVKPMHSFLTLD |
GUT_GENOME006402_00986 | 120-198 | LVILHIPRALYNQKPIYIGSNPYAGTFKRNFEGDYRCTKESVNSMIRDSFQNAGDNEILEWLNIGDLDKNTIAVYRTRF |
GUT_GENOME008627_00483 | 101-187 | IVVIEIPRASRRQKPVYIGGDPYRGTYRRSGEGDYHCDEAEITAMLRDSAAAPQDGELVLMLTQDALCRETIGRYRERLALMRPKNR |
GUT_GENOME031977_00616 | 691-772 | IIVIKVPRADRADKPVYVDNDTRNTYRRSGEGDYRCTYEEYQTMVRDAAIQSPDIHLVRGMGLDALNSESIRSFRQRMKLYR |
GUT_GENOME096545_00338 | 105-182 | ILVITVPRAPRELRPVFVGGNPLTGTYRRNGEGDYRCSPESYQAMVRDASASPLDTLLLDEVTTDAISPDTLASYRNM |
GUT_GENOME209188_01733 | 118-191 | IPSSELKPIWYGSPKNTFIRSGSGDQRATDMEIAAMYRDQAFGTQSEKTVEGLTIADLNAASFASYRRYIQTFN |
GUT_GENOME130359_01216 | 102-178 | VVIQIPRAGRHERPVYIGTDPFSGSYFRDGEGDYRCSPDEVRSMLRDRTDEPQDAMVLEEFSTDCLDPGCISRYRVR |
GUT_GENOME013849_00667 | 102-179 | ILVVEVPRAERTIRPVYKGQDPRNGTFRRWNEGDHLCSVEEVGSILRDASFSALDATPIKDMDMSVFCNDTINGYRNV |
GUT_GENOME091497_01348 | 104-182 | IIVIEVPRADRRDKPVYINGDILQGTYRRSGEGDYHCTEQEIKAMLQDKEEITRDTLPLAELGSEAFDEATVRDFRGRV |
GUT_GENOME001304_04282 | 104-180 | VVVIEVPRADRHYKPVFINDNLFAGTYRRNADGDYHCQREEVKAMLRDQADITLDSTIIENVDWQELDKDTIYRFRM |
GUT_GENOME136203_01481 | 111-187 | LLYFYIPRADRHSRPVYCGQNPYQGSYRRDNEGDYHCSHEEVTSMFADAKIESADGRIQEHYGMDDLDIASIEQYRR |
GUT_GENOME252875_00024 | 112-185 | PRAPVTDRPVYIGRDPLTGTFRRGHEGDYRCTPSEVRQMFADANNDVPADSRLLENFGMDDIDLETINQYRQRM |
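
The Range column appears to give 1-based, inclusive start and end positions of each fragment within its parent protein, so the fragment length should equal `end - start + 1`. The following minimal Python sketch shows one way to parse a row of the table above and check that relationship. The names `Row`, `parse_row`, `range_length`, and `is_consistent` are hypothetical helpers introduced for illustration, and the inclusive-range interpretation is an assumption inferred from the listed rows, not something the page states.

```python
from typing import NamedTuple


class Row(NamedTuple):
    """One row of the sequence table (hypothetical container)."""
    protein: str
    start: int
    end: int
    seq: str


def parse_row(line: str) -> Row:
    """Parse a 'Protein | Range | AA |' row into its fields."""
    # Split on the pipe delimiter and drop the empty cell left by the trailing pipe.
    cells = [cell.strip() for cell in line.split("|")]
    cells = [cell for cell in cells if cell]
    protein, rng, seq = cells
    start_s, end_s = rng.split("-")
    return Row(protein, int(start_s), int(end_s), seq)


def range_length(row: Row) -> int:
    """Length implied by the range, assuming inclusive coordinates."""
    return row.end - row.start + 1


def is_consistent(row: Row) -> bool:
    """True when the range length matches the number of residues listed."""
    return range_length(row) == len(row.seq)


# First row of the table above, copied verbatim.
example = parse_row(
    "GUT_GENOME135944_01355 | 408-485 | "
    "ILVIYVPRAKREQKPVYINNDLFGGCYRRDWEGDYHCSKAEIKGMLRDAADETEDMKIVEQFDISAIDTESLKGFRNH |"
)
print(example.protein, range_length(example), is_consistent(example))
```

Running the same check over every row is a quick way to spot truncated or mis-copied sequences before the table is used downstream.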