UHGP-MC 104991


Information


Number of sequences (UHGP-50): 80
Average sequence length: 69±9 aa
Average transmembrane regions: 0.06
Low complexity (%): 1.5
Coiled coils (%): 0
Disordered domains (%): 5.58

Pfam dominant architecture: PF17802
Pfam % dominant architecture: 71.25
Pfam overlap: 0.37
Pfam overlap type: shifted

Downloads

Seeds: MC104991.fasta
Seeds (0.60 cdhit): MC104991_cdhit.fasta
MSA: MC104991_msa.fasta
HMM model: MC104991.hmm

Sequences list (filtered at 60% identity)

Protein Range AA
GUT_GENOME235994_02342 560-617 LPLGSYVVKETKAPKGYVLNETPQAVTLKYVNQTVELVSEKASFTNERTKLEVNVNKE
GUT_GENOME105876_00148 1833-1893 AQTEELPLGKYVVKEVNAPKGMVLDSSEIEVSLVYKDQNTPVVFESTSKVNARQKVTLNLN
GUT_GENOME027775_00951 800-859 DLPLGEYRIVEKTAPEGFVLDTTPHIVDLTYKGQNISVYTEPVTVHNERQKATVSITKAI
GUT_GENOME180112_00002 896-959 LPLGNYYAKEITAPNGLVISDDRIDIALDYKGQETDIVIGNADFTNERQKVELNVSKVDADTGE
GUT_GENOME215230_02330 722-790 DANGDAEVKNLYLGDYYVKEITPPEGYLLDEEEHDVVCDYEGDMVAEVSRSTTSKEQVIKQPFQLIKVS
GUT_GENOME200597_01476 733-794 LQLGKYSIKETTTQTGYVLDENSYNFDIEYAGQMIDVVEIKQSYVNERQKLDLQITKTFEDE
GUT_GENOME018569_02496 221-282 NAISKPLYLGKYFVTETKTVSGFVLNKKSYTVHLKYDGENVEVTNQNLSVRNDRQKAEISLK
GUT_GENOME274042_00188 977-1054 VTKEDGKAVTEPLFLGRYHVVETKAPYGMILNPTVQTVELTYAGQEVEITETSTGFYNERQKVEIDLNKVMEQNEVFG
GUT_GENOME245501_00383 888-955 ATTKKLPLGKYEVVEVEAPSGMVLNKNSQTVELKYKDQYTAIVMSSSSFVNERQKVEAKVIKVDSEQS
GUT_GENOME153230_01393 810-864 NLYLGKYTVTETKQPAGYVLPDQTWEVELTYEDQNTELVTETLEVENNTTVVIID
GUT_GENOME039897_00614 1197-1273 TGNNGFAKSRELYLGKYEIEEIKAPYGMVLNDQIHIAELVYAGENVSITETGASFLNEKQKVEISLKKALEVNDLFG
GUT_GENOME088305_00619 1296-1359 AEVDQLPLGKYKIIETVAGNGFVLNKEIQEFSLEYAGDWVEVVYHDSEYVNERQKVSISLKKTA
GUT_GENOME096163_02965 1009-1073 LYLGSYRIVETLAPEGYVLNGEPVDVTLSYAGQEVEAAYAESEFGDVPQKGVIDVTKVDAESGLP
GUT_GENOME208589_01580 414-496 VTDSNGTAKSKLLYLGEYTVKETKAPYTFALNNEEQHLALTYEGQEISTISRSVSFYDQRQTSGIELTKVLEQNELYKVGMND
GUT_GENOME055972_00427 1619-1690 LPLGKYIIREVSAPSGYVAEDKGYEVELSYKDQFTPLVWETLNLENKYFTVEIDLEKVFETAFESGNYAAGS
GUT_GENOME273713_00508 123-182 EPLPLGRYIVTEVSAPAGYVLDETRHEIALTASDHTTPVVKARLSAVNEFMAARITLTKE
GUT_GENOME000721_00501 1047-1130 PIIRYQKDDLVATLTTDADGSSVVNNLPLGSFYLKETKAGDSFVLNTEQKEFTLSAGNDTAAVVYETVDYKNERQKVELSIQKK
GUT_GENOME254096_01893 883-960 ITTREGYAESKPLYLGSYLVTEKTASHGMVIHPEPNEVSLQYAGETVEVTVTETSFVNERQKVKINLKKCLEQDNLFG
GUT_GENOME159394_00723 1368-1440 TDITGTTQSRDLYLGKYFLRETKAPNGFLVSEEEYDFELTYEGQCVTVYPEFLTLENARQNVIFKILKQFQTT
GUT_GENOME244487_02450 685-765 TGSDGTATSKKLYLGTYRVVEIEAPEGYVLENGETLVTLEYEGQEIAVTNTGTSFKNNYQQVNITLEKYLESSEKYGINGE
GUT_GENOME096561_02478 698-761 LYLGKYLVIEESVEGNFVLEQTPHEVVLSYADQVTPVTNETLELENQRQKGHVQLKKRAEHFSG
GUT_GENOME142595_00876 784-839 LYIGSYQLIEKSVPAPLVLDPTPIPFEVTYEDQDTQIVATKAETTNARQKAVITLK
GUT_GENOME253242_01919 876-933 LYLGQYRLVEKQAPEGMVLESTPHMVKLEYAGQEAAVTETVIALTNQRQRVQIEMQKL
GUT_GENOME053089_00184 675-736 VTDLPIGKYIVKEENPPTGYLIDKKTYTVDLKYKDQYTKIITGNANSTDKVKEMRIHIFKSG
GUT_GENOME226628_02495 1399-1476 TDESGKAELDNLPLGEYKIVEKTAPEGFVLNEEEQKISFVYADQDTPVISQSAEFINDRQKVEVSVVKKDAENEKGLS
GUT_GENOME241809_00713 838-909 TDANGQIKTPQLYLGKYSVKEVAAPNGFVLNDKPIDFELKYAGQNVEVTSTSLKATNDFQKLNVTVNKQAEQ
GUT_GENOME117515_00373 467-540 TTENGALSDLLYPGKYTVTEVTAPTGYNIGDNTFDVTIKPDDDKVQVVTETVTGENDRQTFEITLKKEMEQSEY
GUT_GENOME265465_00429 715-774 DELYLGKYIVREKSTVDGFLLDRTEYPVELKYEDNKTAVVMETVSIKNERADIAVEVLKE
GUT_GENOME072321_02045 902-982 NQKGEVVDTVTTDATGTAVSRELYLGKYEVREITAPHGMVLNPEPHCVELAYAGQNVAVTETATSFENERQKVEISLVKSI
GUT_GENOME059239_02395 414-481 NGEVSTKELPLGQYKVVETQVPENYILNTKEYFVTLKYKDQNTSIVSESTTIPNEEQKGKISLKKSLD
GUT_GENOME282760_00284 760-816 IDNLPLGSYVLTETKAPAGFVIDTDPVDVTFTYAGQTVDIVKDSKTVEDERQKIAVD
GUT_GENOME140278_00046 1076-1152 TDAEGKAITDLLYLGKYTVTEVTAPYGMVIASDAVTVELAYAGQEVEVTETSAAFYNERQKVSVSVDKLLEVDETFG
GUT_GENOME205416_00317 809-890 TGEDGTAKSGLLYLGRYRLEERQAPSGCVLNPQPEYVELTYAGETVEVTQAATGLYDERQKVDVTLFKAMETDDLFGLGMNE
GUT_GENOME265977_01247 908-964 ELPLGEYSVREVMAPENFVLNDETKDVSLKYKDQNTAIVFNNTSFINARQKVDISVC
GUT_GENOME121724_00500 996-1072 TGSDGMATSKELYLGRYVITETHAPFGMTLNTETVTVELVYAGQEVKLTETSAGFYNERQKVQISLEKSMETDKLFD
GUT_GENOME199904_00132 833-914 TAENGVAKSKELYLGKYTVIEKTAPNGYVNSNEQYDIELTYAGQDIKVTSTALSVFNERQKVSVSLLKELSQDEKFKLGMNN
GUT_GENOME000050_01649 897-976 TDENGIASSKELYLGSYQMEETNVPAGMVKPEPHTVELTYAGQEVEVTELEESIYNQRQRVSLRWNKAMEQDKLFGVQGE
GUT_GENOME201308_00190 477-554 TDETGVAKLTDLPLGKYYVKEKETANGYVLDGETREVDLTYRDQNTAEVTYSADWQNKRQKAEVKVLKKAKDSDRVLE
GUT_GENOME071146_00184 845-910 LYLGSYIVTEKTASEGYVINDQKFEVTLTYGGQDVELVNEDLSVENVRQKVEISFKKSIEIDDVYR
GUT_GENOME035511_00989 788-867 GTVIEEMITDTSGKASLTELPLGSYHLEEIQAAEGYVCNKKLDAFTLEYEGQNVEMSSYGSEFKNERQKVALRLLKTSSE
GUT_GENOME118381_01075 944-1011 DENGYAVSEKLPLGVYLVREKETPHGFVTDEKGIEIRLEYQDQETPLVTASIEYENIRQSLRIKLFKC
GUT_GENOME139578_00124 253-311 IDNLLLGEYYIKQTEAGEGFVLNTEQKDISLPYVGDKEAISYAEVSYENERQKIHIVLE
GUT_GENOME079930_00248 888-946 LYLGKYVCKEKTTNSGFVLDTTEYEVVLSYADQNTAIVTETLTRENQRQKAVVELQKNA
GUT_GENOME114174_00606 1079-1156 VTGADGTAITKPLYLGQYEVKEKTAPYGMVLNTESAIVELEYAGENVSITTASTEFHNERQKAVIELTKSLETDTMFD
GUT_GENOME166134_00716 987-1063 ITSETGEGQSKPLYLGRYRIEEAESVHGMLLAEPQIVKLVYEGQFVELTEASASFYNERQKVSIELEKVIEQDETFG
GUT_GENOME210816_03748 456-517 TDENGKASATKLYCSKYLVKEVTAPEGFVVDNTGKEVTLTKEGETVKFTDKRQKVSLQVHKL
GUT_GENOME163579_01599 1219-1287 TDKSGKTQVDNLYLGSYIVRETQAPEGFVISDKEYEVTLSYKDDHTAIISDSVTYLNDRQKVHIDLRKV
GUT_GENOME043379_00491 722-782 ALSEELFLGTYLVKEYDAPDGYELDPTEYEVTLAYKDQNTEIVTEELEVTDQPTTVTVIKK
GUT_GENOME234162_02315 882-947 NGSATTKELPLGKYKVYEVVAGDGFVLNKEVKEVTLTYKDENTPVVSESSEYENKRQTVNLSVIKL
GUT_GENOME129198_00034 942-1017 LYLGKYKISEKTAAYGTVLNTEPKYAELTYAGENISVTETSVDFHNERQRAMLNLRKLMETDEVFNIGNNDEILSV
GUT_GENOME121424_01256 549-625 SDEEGRAVSKSLYPGKYNIVETKAPYGHVSDDMIYQAEITAKGGAAEAGTAVLDVENQRQKVLINFTKNIEEDRLFG
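
Each row of the sequence list carries three fields: a protein identifier, a residue range on that protein, and the amino-acid sequence covering the range. The sketch below shows one way to split a row into those fields and check that the range length agrees with the sequence length. It assumes the identifier always has the form GUT_GENOME, six digits, an underscore, and five digits; that pattern is inferred from the rows listed here rather than taken from a documented specification. The names `parse_row` and `range_matches_sequence` are hypothetical helpers. Whitespace between fields is treated as optional, so rows with or without column separators are accepted.

```python
import re

# Assumed layout: identifier, start-end range, sequence; separators are optional.
# The identifier pattern is inferred from the listed rows, not from a specification.
ROW = re.compile(r"^(GUT_GENOME\d{6}_\d{5})\s*(\d+)-(\d+)\s*([A-Z]+)$")


def parse_row(line):
    """Split one row into (protein_id, start, end, sequence)."""
    match = ROW.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognised row: {line!r}")
    protein_id, start, end, sequence = match.groups()
    return protein_id, int(start), int(end), sequence


def range_matches_sequence(start, end, sequence):
    """True when the inclusive range start..end spans exactly len(sequence) residues."""
    return end - start + 1 == len(sequence)


# First row of the list, written here without separators between the fields.
row = ("GUT_GENOME235994_02342560-617"
       "LPLGSYVVKETKAPKGYVLNETPQAVTLKYVNQTVELVSEKASFTNERTKLEVNVNKE")
protein_id, start, end, sequence = parse_row(row)
print(protein_id, start, end, len(sequence))
print(range_matches_sequence(start, end, sequence))
```

Running the same check over every row is a quick way to catch a mis-split identifier or a truncated sequence.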