UHGP-MC 52836


Information


Number of sequences (UHGP-50):
62
Average sequence length:
79±15 aa
Average transmembrane regions:
0.09
Low complexity (%):
3.26
Coiled coils (%):
0.63
Disordered domains (%):
9.3

Pfam dominant architecture:
PF11839
Pfam % dominant architecture:
161
Pfam overlap:
0.43
Pfam overlap type:
extended

Downloads

Seeds:
MC52836.fasta
Seeds (0.60 cdhit):
MC52836_cdhit.fasta
MSA:
MC52836_msa.fasta
HMM model:
MC52836.hmm

Sequences list (filtered 60 P.I.)

Protein Range AA
GUT_GENOME131085_025741-101MAIRWRGGPYEKFDPSKLLAKEAAVILEGDPGARDGLSVYICFAPGKTKRIATYEDMEENIDRISSDILDYLKEGTEEAIDNADTAANKANIAAEKVNVAI
GUT_GENOME237874_011131-104MAIQVRRGKEADFNAARMLPGEWAVSTDKKIVRMCFAPGVCLRMATYESFESDMEIVRQIVVDCQNIQEAIERIQEEIGIKAVEVETNARLSAASAQTATDKAA
GUT_GENOME009076_008561-129MAIYVRRGMEKDFDPEKMKAGEWAVSVDPDRKKQKIWMCFAPGVVKRLGTYEDFMDQIAEINADMMEQYIKQFNDILKQVENDKDIVASDYEFIAEFKKILEETYMPNINTASSNATSAAQTAQTAQEG
GUT_GENOME037523_003781-79MAIQMRRGQLKDFDANKMLPGEFAVTIDEAPENQKVFICFSAGTFKTLATREDFEQDLANIQQAIEDAREASKTANEAI
GUT_GENOME091729_010491-87MAITMRHGPYNKFDPQKLRTAEIAVVTEGDPHASDGKAIYQCFSPGDVKRMATYEDMLDQIDEAGGEVIDNHIEEKVGVALKACEDA
GUT_GENOME000604_056448-94MRQRRGAYGAFDETKALPQELQVVTSGDPNTRDGKAVYIPIEIGDLRRIVTSDELDQQTLDVLQQVLADLEPTIKNTQSAATYANDR
GUT_GENOME143497_002421-73MAIQNRRGDYARFDPQKLLPGEWAIVLTGDSNAADGMACYMCFSPGVVKRMATYQDMVENMGKLSADVVKQVM
GUT_GENOME231099_020686-66TIQFRRGMYADFDTSKIRPGEPVAILGNDPSVPSGKALYIAFAANDVRRLCSIEDISEMVN
GUT_GENOME221470_005561-86MAIQMRRGIYEKFLPKNMTPGEFAVVLSGDPNGKNGTGVYICFTPGSVKQLATMEDMAGTVSDMILSSTNGVIQQLTEAVNSAEQS
GUT_GENOME014513_014791-101MAIQMRRGKYADFDPTKMLPGEWAVSLDEESGKQIVWMCFAPGVTKRMGTLEDFDVEIWDMFKPHLDSIKESETKAAVSETNAKQSENNAKDSETKAALSV
GUT_GENOME239780_001011-99MAIQMRRGIYSKFDPDRMVAGEWAIVLSGAQDAANGQAAYVCFAPGQVKRVATVEDMATIIANAELEVAESITTAASKFLTDTNEAVKLAESKRVAAEN
GUT_GENOME256972_013571-95MAIQLRRGAYSDFDPEKMLPGELAVVTSGDPGADDGRSVYACFVAGDVKRMATYEDMEENIDQATQDIQEAFSAQLTQKISQADTAISQAQTAIS
GUT_GENOME213442_015631-77MAIQHRRGVFDRFNPNKMRPGEWAVVVEGDPSADDGRACYICFAAGNVKRMFTYEDASVEVRGLVDKFKEQFVDGVN
GUT_GENOME058330_021721-86MAIQDRRGEYDHFDPQKMLPGEWAVVLRGDPNVRDGKATYVCFSAGVVKRLMTEEDLTIELDERTQEVINRLVGEVGEAVKNAVEA
GUT_GENOME080422_009391-91MAIQIRRGPVIKLDPNKLLPAEFAASLDTGELHFCFTPGETKQLVFSEDVVEIIDAHTGEVVATLTADVKKAVADAESAAEHANSSAQKAV
GUT_GENOME000582_028421-93MAIQNRRGSYEDFDPYKMLPGEWGVALDSDPKCQDGKSVYMCFSAGTTKRMATYDDMVEDVKESTKEIAEQFTGEVRQATDDANTAAKGAQEI
GUT_GENOME142587_040328-64IKHRRGNIEDLLRSELLPGEYAISTDGTITICYAGGKTKDLASVEEVKRVAAELGEN
GUT_GENOME031003_008341-98MAIKKTRQRQRGGPYSKFDKTKTLPREFQIVESGDPNSKDGRAVYIAFEAGEPKRLSTYEDMQEDIDNATEEIQGTLTAGVDEKIKEADAAITSAKSA
GUT_GENOME234769_002601-70MAIQVRRGLQKDFKPDKMLPGEMAVTTDTRRIYATFAPGDCKRIATYEDMQDDIKEANEEIITELTESVT
GUT_GENOME256877_002121-94MAIQVRRGLFSKFDPTRLVAGEWAVVIGDDPSCDDGRAVYVCFGANIVKRVAMYEDMLDWFADLKENTIDQVIYDAMAGIREEYGLIKNETLDA
GUT_GENOME186025_001971-77MAIRNRRGAVKDLVESKLLPGEFAICTDGSIVICYAAGKTAKLSSVDDIAKLRAEQTSYKETIDRVISDFKEYMSET
GUT_GENOME152291_011421-75MAIQFRRGDYDDFDKTKLQEGELAVVVRNDPNTLSGKAVYICYAAGDETQVKRLIHSEDYENDFEELSAELRTEV
GUT_GENOME085041_028401-79MAIQNRRGNYDDFLPEKMLPAEWAIVLSGDPVVTDGKSAFICFSPGDVKRISTYNDMVAQFNAINADLIEKLTGDVTDA
GUT_GENOME102990_0219437-93LQMRHGKESEMDKSKFVPAEIGVVTDTKRAVVAFAPGDTKDVAFKEDIPEQDKNYNN
GUT_GENOME207525_007221-79MAIQARRGLKKDFDPNKLLPGEPAVPLDTREVYMAFAPGDVQKLATHENVKEMVEEATDEVIADFTEGVNNATEYATEQ
GUT_GENOME073435_011191-96MAIQMRKGIFDKFDKTRLLPGEYAIVLSGDPVAKDGRAVYICFAAGAVKRLATHEDMYDFLLEARDDTIEYIETVATADVKASYNQLVSQLTENEA
GUT_GENOME026545_008671-83MPIQMRRGIYDKFVPSKMVPGECGIVLSGDPYASDGRAAYVCFAAGLVKRVATYEDMSDYFAQVRQETVEWIVDTANAGFKEE
GUT_GENOME219857_021081-111MAIQMRRGAYAEFDPLKMKAGEWAVSTDSDTKKQQIWMCFAPGIVKRMGTVEDFDVEIQRLIQNYLDSIEQSVSQAQESAQTATEKANSASNSASQAQKSAQTATEKANSA
GUT_GENOME233228_006591-92MAIQMRRGKFTNFLPSKLQPGEWAIVQGDDPSASDGLSVYVAFAAGVVKRMATYTDMVDNCRDAIEQSIGDIKAKLTEKVEATNATVEDAEA
GUT_GENOME000721_050711-64MAITMRIGLEKDFIPERMSVGELAISTDTGLMRYCHGPNKIKLIATDEDIAEMRKMVNDFDLTV