UHGP-MC 3655


Information


Number of sequences (UHGP-50): 78
Average sequence length: 76±7 aa
Average transmembrane regions: 0
Low complexity (%): 1.53
Coiled coils (%): 0
Disordered domains (%): 0

Pfam dominant architecture: PF00528
Pfam % dominant architecture: 6410
Pfam overlap: 0.42
Pfam overlap type: reduced

Downloads

Seeds: MC3655.fasta
Seeds (0.60 cdhit): MC3655_cdhit.fasta
MSA: MC3655_msa.fasta
HMM model: MC3655.hmm

Sequence list (filtered at 60% identity)

Protein Range AA
GUT_GENOME277665_01569 104-181 VMNATNIPLMNPDIITGISLMLLFVFAGTLIGLDPILSFRTLLIAHITFSLPYVVLSVLPKFRQMDRSLPEAALDLGC
GUT_GENOME208070_00705 342-414 NNILLVSPDVIIGASFLIFFTAVGFISLGFTSVLLSHIAFSIPIVVLMVLPKLQEMNDSMVDAARDLGANNVQ
GUT_GENOME211297_00585 100-174 MEGTNQITMLNADIVTAIAFMLFFLIVQFIPFGWATLIISHTMVCIPYVVMSVLPRLSQLNPNLYEAGQDLGASP
GUT_GENOME141703_00556 96-171 KRAMEGLLYIPVVIPEIVMGISMLAFFSSLNLPAGLITLILAHITFCISYVIIVVRARLDGFDAALEEAAQDLGAT
GUT_GENOME243514_00091 111-169 IGIGLQLIFIKLGIAGKIFAIILVHIVYIIPYSVWIMIPGFESFDEDLYNQARILGASK
GUT_GENOME211116_00610 175-240 DIITGISLFLLFVSLGISQGLTTVVLAHITFCTPYVVLSVMPRLKQMNQNIYEAALDLGATPFQAL
GUT_GENOME135791_00769 102-175 NDFPMMNPEIVTAIGLMLLFITFKIERGFVTLLLAHIAFCIPYVMLSVMPKIRSLDPNLADAAMDLGATPWQAL
GUT_GENOME032633_00578 1-75 MSTSKANMLIPDVITGIGLAILFSITIIPLGINLGFATVVLAHISFCTPYAIIIIYPRMQKMNKNLILASMDLGY
GUT_GENOME261029_00366 100-173 SNLPIINPEIVTGVSLMLIFVLVFSLFGGKLGFLTVLLSHICFNTPYVILNVLPRLRRMDVSLYEASLDLGCSP
GUT_GENOME000338_04306 100-171 LISSPLLLPQIFLGLALLQFFYYFTNSPQNFFTLVLGHIIITLPYVVRTCVNSFLGISPYIEEAGRDLGAGA
GUT_GENOME000734_00935 99-186 TVNNIPLTNADIITGVSMMLLFLLAVSAWNGSLGQLTGVKWSMGFLTLLISHIAFDVPYVILSVMPKLSQLDPNIYEAAQDLGDHGFH
GUT_GENOME218037_00159 98-169 ASMPLLNADIVTGISLMLLFLGMGLRLGYGSILMAHVVFNLPYVILCIMPQFLSLDRSCYDAARDLGATPFV
GUT_GENOME239123_01154 109-189 NVSQLPMVNPDLVTGITFMMLFAFVNSLLVKLGMSQMGDYVKLLIAHISFSIPYVIFSVMPRLKQSSNLLYEAAMDLGCSP
GUT_GENOME258557_00613 103-176 METGLTLPIMIPEIILGLAFMVVFNALGLRGNDLRLVLAHTTFCIPYVFLNVKSRLVGMDASVFDAARDLGASP
GUT_GENOME237880_00497 108-184 KKAINSSLYIPIVIQEVVLGISLLLLFDVCHFPLGIWSMILAHTTFCVPFVVIIMRGRIAGMDMSVEEASMDLGANR
GUT_GENOME243299_00889 108-173 IPMVNPDIVIGVSLLSLFVICHIKLGYVTLILAHITFNVPYVIFAVLPKLSQLDPNLEEAALDLGA
GUT_GENOME095713_00347 106-173 NQIPVLNPDIVTAISLMVLFHIFHLNLGLFSLTLAHIIFLIPYVLLSIMPRLNTMESSLPEAALDLGA
GUT_GENOME123265_00864 288-384 MNKRARGAVMAVSYVPVVNPEIITGVSLMLLFVVFNRQVSAALSWLLQREVDFQGFGLCTLLIAHITFCLPYVIFNVSPKLRQLDPAVYDAALDLGC
GUT_GENOME133943_00337 109-185 KKRQRMILLNNVPIINADIVTGVTLMIVFSIIFSELGFTSLLLSHIFFCIPYVVLSVLPKLSEIDNNLYDAAVDLGC
GUT_GENOME215161_00877 124-204 VETFTQLPVVNAEIVMALSLAILFKVLQTSYSFATLLVGHMVLTVAFVYLNVKPKLMQMDPNIYEAALDLGATPWYSLFKV
GUT_GENOME195449_02085 104-169 VSPEMVMGISLLIFFISVKLPLGFVSMLIAHTTLGLPFVALTVLARLAEFDEHLVEAARDLGATEA
GUT_GENOME244252_00247 107-178 NLMIYVPIIVPEIVLAVAMLIIFITIGLQLGMGTIIIGHCTFCIPYAVVTIKGRISGDNQSLEEASMDLGAN
GUT_GENOME040958_00461 88-169 IQNASNIPIISPDIVMGVSLMLLFTTLGVIFNFEMGFFTVLISHICFCVPFVVLNVMPRIKRMDQSIYDAALDLGCNQWQAF
GUT_GENOME052699_00846 99-192 VNAVNNIPMMNADIVTGVSLCLLFVVFFNGWGAFAGWMNSWQSLVVLPERLTMGFGTLLLAHICFNIPYVILSVGPKLRQMDRNLIDAAQDLGC
GUT_GENOME157142_00266 104-179 TVNQLPMINSEIVMAVSLMVFFSAFGFIFPQGMTRLIISHVAFCTPYVILSVLPRLSRMDPNMYEAALDLGATPFG
GUT_GENOME179445_01168 105-184 SEITVVNAEIVTAVGFFLLSIFLRDVVMIPVQKGVGWLIIAHTIITTPYVILTVSPRLNQLNPNLYEAGLDLGAGPMRSL
GUT_GENOME098792_00641 109-190 PMVMPDIITGISLVMLILQVQALLRGSNLSWLQFERGWFTIWLGHTTLCMAYVTVVIRTRLGELDQSLEEAAQDLGAKPWKI
GUT_GENOME018544_01908 112-189 LNSAVYVPIVIPEIVLAVALLCIYIKIDFPLGLWSIILGHTTLTLPFVVINVKSRLAGYEKSLEEAAMDLGANQRQTF
GUT_GENOME149567_00860 109-180 LPMIVPGVVFAVPLMALLVLMGLKKGFWAVVLGNVILMLPYMILTVRTRFLGLDRSVEEASMDLGASGVETF
GUT_GENOME159235_01732 100-178 NQVPVVNADVVTGFSICILIVVVFGVSKDTYIPLVAGHVVLSAPFVYLAVVPKLKQMDPSLYEAALDLGATPAYALFKV
GUT_GENOME012655_01621 108-179 LPMMLPGIITGITILSFLQLVGVVQGAFAVVLGHTTFLLGTVLPQVYTRLRQLDRNLLEASQDLGAGGVQTF
GUT_GENOME047860_01540 99-175 ESLSILPIMIPEIIMGMSLLSVFTMADIPLGLGAIILAHITFCIPYIYLVVRSRLSGIDKSVVEAARDLGAGRARAF
GUT_GENOME009824_00563 167-246 NNIPLLNADIVTGISIMLIFSLIMKIPGFHYIFGFPTMLIAHMYFCIPYVILNVLPKIESLDSNMMDAALDLGLKPYKAL
GUT_GENOME063855_01462 15-85 LMYMPILLPGIIMGISFLTFFSLLHLLFGYLTMIIGHSTFCIPYTVIMVYTRMIRFDTSLEDASRDLGASS
GUT_GENOME234136_00920 104-180 QSIHHTLVYTPLIMPDILIGISLLMLFVAVRVECGFWTILVAHVTFCVSYVVMTVLARLQDFDDRILEASYDLGAGP
GUT_GENOME189239_01615 421-490 TVNNIPMSSSDTIMGVTFMLLFAAIGLDKGYLTLILAHVTFCTPYVILNVMPKLRQLDKNAYEAALDLGA
GUT_GENOME130327_02167 95-175 MMGITNIPILNSEIVTGISLMLLFIACRVTLGFSTILLAHITFCIPYVILSVMPKLKQTSKSTYEAAQDLGAGPVYAFFKV
GUT_GENOME047636_00496 111-181 ELAGQIPILNPEIVTALSLSILVVAVGIGFNYFTLLIGHVVLTIPFVVLSVIPKLKQLDPNVYEAALDLGA
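
Each row of the sequence list carries a protein identifier, a residue range, and the amino-acid sequence for that range. The Python sketch below shows one way to split a row into those three columns and to check that the range length (end - start + 1) matches the sequence length. It rests on an assumption inferred only from the rows shown here: identifiers have the form `GUT_GENOME` followed by six digits, an underscore, and five digits. The names `parse_row`, `span_matches`, and `SeqRow` are hypothetical helpers, not part of any UHGP tooling. The pattern accepts rows with or without whitespace between the columns.

```python
import re
from typing import NamedTuple

# Assumed identifier shape (inferred from the listed rows, not from a spec):
# GUT_GENOME + 6 digits + "_" + 5 digits, then "start-end", then the sequence.
ROW_PATTERN = re.compile(
    r"^(GUT_GENOME\d{6}_\d{5})\s*(\d+)-(\d+)\s*([A-Z]+)$"
)


class SeqRow(NamedTuple):
    protein: str   # protein identifier
    start: int     # first residue of the range (1-based)
    end: int       # last residue of the range (inclusive)
    seq: str       # amino-acid sequence covering the range


def parse_row(line: str) -> SeqRow:
    """Split one sequence-list row into identifier, range, and sequence."""
    match = ROW_PATTERN.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognised row: {line!r}")
    protein, start, end, seq = match.groups()
    return SeqRow(protein, int(start), int(end), seq)


def span_matches(row: SeqRow) -> bool:
    """True when the inclusive range length equals the sequence length."""
    return row.end - row.start + 1 == len(row.seq)


if __name__ == "__main__":
    example = (
        "GUT_GENOME032633_00578 1-75 "
        "MSTSKANMLIPDVITGIGLAILFSITIIPLGINLGFATVVLAHISFCTPYAIIIIYPRMQKMNKNLILASMDLGY"
    )
    row = parse_row(example)
    print(row.protein, row.start, row.end, len(row.seq), span_matches(row))
```

Fixing the digit counts in the identifier is what lets the pattern separate an identifier from a range that follows it without a delimiter; if identifiers of another width exist in the catalogue, the pattern would need adjusting.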