UHGP-MC 21754
Information
- Number of sequences (UHGP-50): 195
- Average sequence length: 74±8 aa
- Average transmembrane regions: 0.02
- Low complexity (%): 0.67
- Coiled coils (%): 0
- Disordered domains (%): 2.11
- Pfam dominant architecture: PF04616
- Pfam % dominant architecture: 4769
- Pfam overlap: 0.19
- Pfam overlap type: shifted
Downloads
- Seeds: MC21754.fasta
- Seeds (0.60 cdhit): MC21754_cdhit.fasta
- MSA: MC21754_msa.fasta
- HMM model: MC21754.hmm
Sequence list (filtered at 60% identity)
Protein | Range | Sequence (AA) |
---|---|---|
GUT_GENOME222477_01916 | 1-68 | MSNFRNGEPWLDTDGNVIHAHGGHMLHHDGWWYWYGENRTENNYVSCYRTKDFKSFEFRGNIITTETP |
GUT_GENOME003390_00677 | 8-67 | KDNHGEILHAHGGQILIENGIYYWIGENRTQRNKVSCYKSKDFLNWEFCNHILTLDSEVE |
GUT_GENOME019191_00214 | 498-567 | SQKIFQPGKDWKDTAGKNINAHGGCVVKDGDYFYWIGDKRSKNNCVGVSCYRSKDLLNWTDMGFALELKG |
GUT_GENOME030759_01864 | 7-77 | FHPGELWPDDRGTHINAHGGCLLRHGGRIYWYGEHKVAGRAGNLAQVGVHVYVSEDFYNWRDAGIAFDVRN |
GUT_GENOME085116_01530 | 6-69 | FNGVPWFDQNGNTVNAHGACIVREHDTYYLFGEYKTDDKNMYNGFSCYSSKNLADWKFEGMALA |
GUT_GENOME093120_00690 | 2-77 | ICPGEVWLDTNGAPIQAHGGCILAANGRYYWYGEDKSGKTRNRRIDVVGIHVYSSENLTDWTDCGLALAGSDDPES |
GUT_GENOME116694_00813 | 41-115 | NQIENGVAWYDNNGQMIQGHGGNMLTYGDTYYWVGENKSANDSNFHGISLYSSKNLADWTYLNDIITPDTTAPEY |
GUT_GENOME122387_00016 | 47-123 | WHDTDGNPIYSQGGGIFKFTVDGVTKYYWYGVKYDESEIYYNDSSKAQKNNHFAGVTCYSSTDLTNWKYEGDVVVPE |
GUT_GENOME003173_00073 | 76-136 | DNGGTHIQAHGGQVVKVGDAYYWYGEDRSNGYDNSPGVHAYMSTDLYNWTDLGVALRAVTS |
GUT_GENOME247681_04462 | 59-130 | KTGEAWKDTAGKQIQAHGGLIQKFGDTYYWYGEDKTRGGRPIDGVRAYSSKDLYNWTDEGTVLKVMENREQF |
GUT_GENOME251930_00292 | 6-67 | QNGTLWRDTDGNELHAHGGHILFHGGFYYWYGEDRRDDIYVSVYRSRDLVNWEFRAHCLTVS |
GUT_GENOME007886_01169 | 7-90 | PALESRWKNGAIWPDNHGVHINAHGGHIVHFGNLWYWYGEHKIEGWEGRLAWHGVHAYSSPDLRHWTDCGIVLPVVDDPRSPIC |
GUT_GENOME105347_01104 | 46-141 | ILSASGAEAKDIVPGELWLDTSGNPINAHGGGILYHDGKYYWYGEYKKGKTILPEWATWECYRTDVTGVSCYSSPNMADWTFEGIVLPAVPDDPTH |
GUT_GENOME171685_06007 | 39-135 | FWHDTDGNPIYSQGGGIFVFEDPNTGKEKYYWYGAQYEEAENYYNNPTGKSSTSQFVNVTCYSSDDLVNWTFENNVLTKAEVDLPRENPDSWWAGRL |
GUT_GENOME235611_02519 | 37-117 | VNGVPWYDQNHQPVNAHGAGIIRDNGKYWLFGEYKSDTSNAFPGFGCYSSEDLVNWHFERVVLPVQKDGILGSNRVGERVK |
GUT_GENOME239663_02484 | 5-82 | IRPGKIWLDTSGKPIQAHGFSVFYNETENLYYWYGENKEKTKGGPSNTVWHWGVRYYTSRDLYNWEDRGLLIPPRPEN |
GUT_GENOME252875_01000 | 36-91 | TDGTADNAHGGGVLYEDGYYYWFGENRKGMASNGVSVYRSADLVTWENLGLALTPS |
GUT_GENOME053806_01733 | 4-87 | NGKMWADDVGKPIQAHGGCIIQHDGMWYWYGEHKGAKNCPGTTRVDVIGISCYSSKDMNSWRYEGLALDVKDSPKGSLIQPENV |
GUT_GENOME110697_01335 | 1285-1352 | LDSDGKPIQAHAFQIVRRDSLYYWYGENKEKTIPGSNVSTYGVRCYTSRDFEHWDDRGLIMTPDTTDV |
GUT_GENOME006163_01804 | 9-82 | IRPGELWLDTDGNPIQAHGGGIIYINDTFYWYGENKEKSKPGSGIWHWGVNCYSSKDMYNWKYEGIIVPPTTED |
GUT_GENOME257473_00745 | 3-79 | NGEAWVDSDANRIEAHGGCILKWDDNYYWYGEDKRKMISPGRSELCGVSCYRSKDLKDWEYRGTVLRPETDDPSHPL |
GUT_GENOME096294_02228 | 25-103 | FAQKQTVIRPGEIWPDTDGNHIQAHGGGITQIGDTYYWYGEARAQSQDPDRRYVGCYSSKDLTNWKFRGNVLEMANPDT |
GUT_GENOME168258_00322 | 26-100 | PGKTWLDQDGEPINAHGGQIIKVNNQYYWIGENRVAGRKNTGKINEYSSNDLYNWKYEGVILDLSRSPRKISIER |
GUT_GENOME249512_03744 | 309-386 | SFVPGEPWLDVDGEVINAHDTGILYYEGIYYMHGVHRVAGSAGNKCQVGVRCYSSKDLYHWKNEGIVLPVAPEGSGSD |
GUT_GENOME214037_01087 | 817-901 | VDGAVMYDTEGNVIQAHGGQIQQITYDYDYNGNGTFDDDEHTFWYWVGEDKTNDYRPCPGIRGYISKDLYNWKDMGNILKTATDW |
GUT_GENOME238203_01589 | 54-120 | WPDNNGVHINAHGGCMLYQNGTYFWYGEHRPSNFSPMGERKYGVHLYTSTDLYNWIDKGLVFETTDG |
GUT_GENOME116681_02014 | 378-447 | IKNGAMYKDDRGEWVQAHGAGFIYVGDTWYMIGEDRSRTWNPDVNMYSSKNLVDWKYEGKIIDGSTHSAL |
GUT_GENOME243755_00604 | 1-93 | MHNGTLWLDQDGNPIQAHGGMIAKFKDRYYWYGEHKGADNCPHQTRVDVIGIACYSSTDLHTWKYEGLALSADEQADSYLHPSCVCERPKVIY |
GUT_GENOME083545_01822 | 24-94 | DTSGKVIQAHGGGLLYENGVYYWFGENKEYTLGDGKIWTYGINCYSSTDLINWKNEGLIIPPDTENPHSSL |
GUT_GENOME064787_01880 | 249-320 | TDFSGTEGRKMYATNGELIEAHGGQITKWGDTYYWYGEDRSQGYYSVGVHLYTSKDLYHWEDKGLVMKMMSS |
GUT_GENOME124713_00690 | 1-79 | MNTKRYDRFYPGKVWLDTEGKRIQAHGGSIMYTNGKYYLYGENKEKTLPGSEVWHWGVRLYSSDDLYNWTDEGNIFEPS |
GUT_GENOME057183_01623 | 21-88 | NVTFRPGQIWNDTSGKAINVHAGYIVYEDGYYYWIGDSRTGNECNGVGCYRSKDLYNWTNRGLIIPLS |
GUT_GENOME067317_00172 | 482-561 | KVQSIRPGKVWVDDTGEAIQAHGGGILFDEKTKTYYWYGEHKGFDNISTGAETGTPAVGISCYSSKDLYNWKNEGVALPV |
GUT_GENOME063594_00242 | 2038-2143 | LIPGTTGTVIKADTGLPMQAHGGSAMALKEGTGDGCVNYDLDGDGDITEGKAVYLWFGEDKTNNTRPVDGVRCYSSTDLYNWTDRGTILYTQNTILPIEEGTEKAI |
GUT_GENOME254720_00898 | 113-201 | YHSFRPGKEWSDTQGEPIQAHAGAIIKVGNTYYLYGENKEFTDTNTPIWTWGVRMYSSKDLYNWNDEGLIINPVLNDPNSSLFPEKCID |
GUT_GENOME200288_02879 | 485-552 | TPGQLWPDLDGKPIDAHGAGFFYDEQTETYYWYGEYHKGGWPAVGVRVYSSKDLMNWTDGGMALTTLQ |
GUT_GENOME096513_02979 | 1894-1972 | VWFDTNAVPIQAHGGQVIWVDHVAWNGNEPSYTADPANGDGAWLWVGEDKTFGGRPIGGIHTYVSKDLYNWVDMGVALY |
GUT_GENOME237059_00650 | 383-448 | LRLGMNWYDEEGKHINVHGGCVLYKDSTYYWFGENYRPSPVKSNGIGCYTSKDMYNWKFECMAWEC |
GUT_GENOME011528_00908 | 523-597 | RYSSIKPGEVWLDTDGKPIQAHGFQVTYRDGRYYWYGEDKTATLFGTNRVFCGVRCYSSDDFYNWKDEGDILAPS |
GUT_GENOME275293_01953 | 40-118 | VSSAPLYNGGEWYDTDGNRINCHGGNIIKTDSLYYWYGEHRPGFDANYQKGVACYSSPDLVNWKNEGIVLATVNDTASI |
GUT_GENOME151329_01712 | 26-90 | GGVWKDNTGKHINAHGGNIFNYKGTYYWYGESRSENGKPYSSLGVSCFTSKDLKIWKNQGLVLPV |
GUT_GENOME254644_02147 | 26-101 | IVNGEVFPDTEGNHINAHGGAIMEYDGTYYWYGEHRGEGRPGKGQRGVACYTSTDLRNWTNRGIVLSVTDEAGSPI |
GUT_GENOME185768_00558 | 103-183 | AVRYDAFHPGQVWLDTNGNPIQAHAGAVYYEDGAYYWYGENKEYTDGKNPIWTWGIRMYRSTDLYNWEDLGLIAEPDLTNL |
GUT_GENOME014133_00468 | 42-120 | TTIYNDTFWKDTNGNNIYSQGGGIFKFGDTYYWYGVHYKGAETYAANPSKNNNDYTFVSVSCYSSKDLVNWKFENDVLT |
GUT_GENOME038331_00911 | 1-64 | MRNGHIWLDSAGQPIQAHGGWVLPCEDRYLWYGEDYGDDGACRGIHLYASRDLSSWEDLGIVLA |
GUT_GENOME158419_01116 | 5-89 | FRPGKAWYDTNGKKIQAHGGSVLYAENTFFWYGENKEGITGTATGENCPYWHHGVRLYSSSDLYNWKDEGIIMHEQAEPDHPFYP |
GUT_GENOME096530_03145 | 45-127 | GADWKDTDGNLIEAHGGSVQRLNENEIGYDVDGDGNLSKDFWFWYGEEKTSATRPVEGVKAYVSEDLYNWTDMGTVLPTHDKL |
GUT_GENOME067917_02226 | 90-160 | GASWLDTDGNVIQAHGGQIQLMPVPGENGVKTEKYVWVGEDKSSGHLGNDVAVYTSDDLYHWEFQGDVFRA |
GUT_GENOME142610_04455 | 18-78 | DMNGNPVHAHGGHMLCTDGYYYWFGENRMDRRRVSCYRSVDLVNWEWRNDVLTLDSPVQPI |
GUT_GENOME129743_01127 | 6-78 | IPGRPWFDSRGDRIQAHGGSIIEVDGRYYFYGENKVNTVPGSGVWHNGVNCYSSKDLINWTFENTILKAADDE |
GUT_GENOME102034_00072 | 36-131 | GEKWLDTDGNPINAHGAGMLYHNGKYYLYGEYKVGETVLPPDATWERYRTDVSGVSCYSSSDLRNWKFEGVALEPEESPSADLHKSKVVERPKVVY |
GUT_GENOME229830_01025 | 718-811 | QSIEPGTLWKDTRGKMINAHGGGILYHEGIYYWYGELKGDSTYWNPNVPNWECYRTEAGGISCYSSKDLYNWTFEGVVLKPDLTDKTSDLHPSK |
GUT_GENOME008677_02506 | 282-356 | WTDDEGNVIQAHGGQVQKLTYIDRETNEQVTKWWWVGEDKTQGAHGGICANSSDDLYNWHNEGIVMRNVSSREQL |
GUT_GENOME168217_00484 | 8-83 | GHPWYDTNGKRIQAHGAQLFFEKGTFYWIGENKEFTTGEDEVWTWGVRLYSSNDLMNWQDEGLIVPPDLENKESIL |
GUT_GENOME185947_02250 | 40-108 | GFPWNDREQNPINAHNGSILYHEGTYYWYGEAHTAPSFLPFQINCYSSKDLLNWNFENIVFPSLTDKAI |
GUT_GENOME098568_01830 | 15-94 | WYDEDGNRINASDGVIIYADGTYHWYGLSLRPLPFAPEGEGGQTTTTGVVMYESVDLYHWKYEGVILACSPEPDSELYGP |
GUT_GENOME099386_01503 | 797-869 | LTTYNSGEMWLDTLGSKISAHGGQIIKQGNKYYWYGEDNKISYPLTTGISCYSSEDLKNWTYEGIAFKAFDDG |
GUT_GENOME147168_01002 | 1641-1728 | KTEKEITTYSSFSGAKGAVYTDTNGNVIQAHGGQIQKLTVDGVTKYYWIGEDKTNDYRPCPGVHLYSSEDLYNWTDEGLVLRTMTSES |
GUT_GENOME209325_01461 | 30-103 | IKSGEIWPDNQGVHINAHGGGILYHNGSYYWYGENKSDSTSSAMVGIMCYSSKDLVKWKNEGAVLPVVTNDSLS |
GUT_GENOME258259_01372 | 291-368 | YTAFHPGKIWEDTDGVHINAHGGGVLWHDGRYYFYGEHKSEHTSKALVGVNVYSSEDLYNWKKEGVALSVMPEGSGHK |
GUT_GENOME199560_00658 | 28-107 | NNKLWQDSNGNYINAHGGGILFHEGLYYWFGEHRPENGFTTEVGVNCYASSDLQNWTHKGIALAVSEESGNDIERGCIME |
GUT_GENOME034178_00470 | 19-97 | VLSQRYQAITSGVAWFDQNNKEVNAHGGCVVKEGDKYYLFGEYKSDTINAFSGFSCYSSTDLINWTFEKIVLPVQKDGL |
GUT_GENOME007505_00682 | 23-96 | NTITPGARWNDTSGQHINAHGGCVVFHEGTYYWFGEDRTGSVSNGISCYTSTDLYNWKRRGLVFKASQALDPET |
GUT_GENOME037207_00680 | 4-87 | ANVNGQPWRDDQGNLIQAHGGSILPYAGDYYWYGEDKGVANVEGTSRVPFVGISCYRSTDLLTWQRIGDVLTAANDPSQQVQAE |
GUT_GENOME143497_03465 | 3-85 | KNGTLWYDTDGNPIQAHGGMILPHGDTYYWYGENKNGPTSLAPSGDIRVDFIGISCYSSKNCIDWKNEGIVLPASSAPGSDLY |
GUT_GENOME214037_00176 | 507-583 | VPVGQTWLDTEGAPIQAHGGGFLQQTDTDGKPIYYWVGEDKSHNTSNFNGISLYSSKDLLNWTYRNTILAPDAENAG |
GUT_GENOME048463_00516 | 18-100 | GMVSALENGTCWTDDCGRAVQAHGGWVLPVVDGLGGTCFYWYGEDKSAPTCGRRADAVGIRCYASTDLLNWHDCGLVFKADNR |
GUT_GENOME050993_02300 | 23-107 | SFTPGAIWKDNNGVPINAHGGGILYDDGTYYWFGEHKTEGRAGNVAQVGVHCYSSTDLYNWTDRGIALSVSDDPASDIVKGCILE |
GUT_GENOME006253_00995 | 6-68 | NGVPWYDQHQQVVNASGGCLIQENGNYYLFGEYHQPDSITFAGFSRYVSTDLEHWKDTGLALS |
GUT_GENOME149085_00452 | 3-80 | NGTLWLDADGRSIQAHGGSILQHNGIFYWYGENKDTDTHNRSVDFVGFSCYKSRDLLHWENCGLVLSAVNEVGHDLHP |
GUT_GENOME002705_01677 | 816-888 | KVSSIKNGELWYDDQENVIQGHGGNILVHGGRYYWVGEYKNNANFSGIALYSSDDLMNWKFENMILTPETPDE |