UHGP-MC 102031


Information


Number of sequences (UHGP-50):
50
Average sequence length:
68±8 aa
Average transmembrane regions:
0
Low complexity (%):
7.42
Coiled coils (%):
0
Disordered domains (%):
0

Pfam dominant architecture:
PF03773
Pfam % dominant architecture:
200
Pfam overlap:
0.22
Pfam overlap type:
reduced

Downloads

Seeds:
MC102031.fasta
Seeds (0.60 cdhit):
MC102031_cdhit.fasta
MSA:
MC102031_msa.fasta
HMM model:
MC102031.hmm

Sequences list (filtered 60 P.I.)

Protein Range AA
GUT_GENOME076888_033531-66MNRTIKEIQKAAPFIENSSFIFLLGKLIIDVLILDSFGVIVIRHTTNSIGEHPLKRDRLLSSLRNP
GUT_GENOME191285_00664103-168VVDERKLELDRGIKIIEEVTPVLKNRILIILLRQLIVNIHEAHRLGVKMIVHPADAVLCHLPVSDA
GUT_GENOME088423_01783177-245VVDKGQLKMDTAVKVIEEITPIFKNSVLVLILRQLIVNVIKTDGFGIILILHPADPISCHFPVGDRFLR
GUT_GENOME076242_02736744-819VDEGELDEDRAVEVVQKVAPVLKDGSFVLVLRKLVVDVLKGDGLRVEAAVYLADTVAAHLHIGNGLLGGLADFLCL
GUT_GENOME159186_017021-66MDRSVKIIEEITVALKNLRFVIRLCKLVVNIKKLHRLGVELIRQPADSIPVHFLIGNTLLDRLRGV
GUT_GENOME225210_00151106-172TVKIVEKAAVPVKNGALVVTGSHSVVNILIGKGLCVVALPHLADTVFQHFLVGNSLLCRLRPTLFRL
GUT_GENOME068964_0029968-137IHKGKFEVDGTVEKVQKSTPLFKNRRFILLLGQLVVDILKLDCLCVVVVPYPADSVREHPLKRDGLLCGT
GUT_GENOME068270_013561-63MDGGVHIVIEVAQVFKGGRAVVRLGKLVIGIGKLNALGKRTLVQPANAVLIHDLIGDALLGGM
GUT_GENOME129578_01725642-712IEKRKLEGNRAVKVVDELAPASKNRVLVLVFGELIIDVLKLDRLVVAVRVHPADAVRVHPLIGDRLLRADA
GUT_GENOME070149_021211-63MDGRIQIVIQIAEIFKNSRLIVRLREQIVDVKKLNALRIAPVSEIADAVRVHDLIGYRVLRGM
GUT_GENOME082520_00119144-213MNPAVKVIEKITPAFKDGGLVIILRQLVIDVLKLDGFGVIAIGDTADAVRPHPLIWNGFLRGVGAFLLLL
GUT_GENOME113650_01779177-247LVDERKLHKDCAVKVVQEITPVFKNSGLVLVLGKLVVDVVKADGFGVKTAVYLTDAIPAHLHIRNGLLCGF
GUT_GENOME080325_02070298-366EERKLKRQGAVKVVQTGAPPVKDGRLIFGLGELVVDVLIFNGFGVIAVLHPAHPIPVHFPVRECLLGGG
GUT_GENOME066487_00317574-641VHKGKLKVNGTVKEIQETAPLIEDGSLIFLLCQLIVDVLKLNRLGVIAVSDTADAIREHSLKGNGLLC
GUT_GENOME093223_01492546-635VQEGKLEMDGAVEVVEKITPALEDGAFGVIVRKLVVDVLKLKRLCVPATRQTDAVRPDAVIRDGLLRAARLPAVGAVFPDDGFDLFLFLA
GUT_GENOME238304_0072297-166VHERQLEANGAVEEIQEAAPFLKDCGLVLLLGKLIIDILKLDGFRVIVVRHAADAVRKHSLERNAVLCGL
GUT_GENOME092036_01120675-747VQKGQLELNGAVEVVEEIAPALEDRCLVLVLRELIVDVLELDGLCVAAACHTADAVRPHPFIGDTVLSRFFLF
GUT_GENOME051084_01549645-715GCHVHKGQFKLNAGIEEVQKTAPFLENRRLILLLGKLIIDVLILDGAGVVVCFHPAGAILEHPLHGDGLLG
GUT_GENOME161352_01249636-712IHERKLKTNAGIEVIEEVAPAFKDGVLVLILCQLIVDIVESDCFGIQMFLHPADTITPHFQIRNGTLHGEPLFLFVP
GUT_GENOME115449_01861176-248GTAVDEGELKRDRCVKIVEKRAPAVEDGRLIFCGRHGIVDVLIGHGFGKQAVRELAHAVRQHPHIRDGLLGRE
GUT_GENOME031061_0201165-125GIKVVEEVAVVFKNQRFVLILCQLIVDIEKLQRLGEKVFVQPTDSIRIDFLIWNGLLCGSR
GUT_GENOME034694_017991-66MNAAVKIIQKITPVLKNGIFILILCQLIVDIRKPDRLGITFILHSADPVPRHFFIGNSLLRGQPLF
GUT_GENOME015738_0136590-157VDERKLEADGAVEVIQKVAPSVKYCGFVFVGVQHIVDVVKAYGFCVTVVPCAADSVREHSLKRNGVLR
GUT_GENOME222353_00020105-166NGSVKVVEKIAPVFKNGGLVVCLCKLIVNIFKGNGFTVFLFRYLAYPVRVHGKVGDCLLRGV
GUT_GENOME245594_01335177-241VDKRKLDVDGSVQIVVKVAQVFKDSGFRVGLRELIADVQKFNALGKRVGCHPAHAVLIHGLIGDA
GUT_GENOME070149_0044143-110LDRAVKIIEKVAPALKDRGFIFVLTELIVDVLKLNGLGEVAGVHTTNPIGEHSLKRDTVLRRFLLFIL
GUT_GENOME214945_004509-70MDTGIKVIEEITPVFKDCIFIFILSQLVVDVIKADTLRISFSLYPADSISSHFLISDRLLDG
GUT_GENOME018972_02279178-273IHKGQLEMDAGIEEVQEGAPFLKDGGLVLLLGQLVVDILELDCLGVVPIRDTADTVREHSLKRNRLLRCPGNAVVPLGFLYRCFQFLLLFPRQLFR
GUT_GENOME073448_00050648-719VIDKGELERQGAVEVVQERTPAAENGRLILGGRNGIVDVLIGDGFGVVAVPHPANSVFQHFQVGNGLLGGEG
GUT_GENOME196582_01654563-647VQKRQLKPDGGIEVIEEFAPLFKDSGLVLVLRKLVIDVLELDGLGIPAIRQAANAVWPHLLKRDAVLGGLFLFVGAVGTGNRCLD
GUT_GENOME264204_01461298-357HRGVEVVEELAPAVEDGVLVLALAQLVVDVLKLYGLGIEIALHPADTVREHSLERYAVLC
GUT_GENOME281479_02196581-648MHAAVKCVQKVTPALEDGAFIFVLRQLIVDVPELHGFCVVVRADAADAIRPHALIRNRLLHGMRRFLP
GUT_GENOME247085_01285154-224ERELKMDGAVKIIEEVAPRIEDRGLVLVLVELIIDVLKLHRFAVIVIRYPADAVREHPLKGNGILRRFMPF