UHGP-MC 1098


Information


Number of sequences (UHGP-50):
92
Average sequence length:
127±10 aa
Average transmembrane regions:
0.01
Low complexity (%):
0.41
Coiled coils (%):
0
Disordered domains (%):
0.58

Pfam dominant architecture:
PF06838
Pfam % dominant architecture:
5652
Pfam overlap:
0.28
Pfam overlap type:
reduced

Downloads

Seeds:
MC1098.fasta
Seeds (0.60 cdhit):
MC1098_cdhit.fasta
MSA:
MC1098_msa.fasta
HMM model:
MC1098.hmm

Sequences list (filtered 60 P.I.)

Protein Range AA
GUT_GENOME046366_00315310-448QMFQGLFLAPHIAIEAVKGAVFCARVMELSGFEVLPKYNEKRSDIIQAIKFNDEEKLIKFCKGIQKASPIDSFVECEPWDMPGYNDKVIMAAGAFIQGSSIELSADAPIRPPYIAYLQGGLTFDHAKIGVLVALNNIYK
GUT_GENOME286692_01517532-672ELFMGTFHAPNVTGEALKTAVFTAALFEGFGFDVTPRYDEPRADIIQAVQLREREALCAFIRGIQQGAPVDSFVVPEPSPMPGYDCDVIMASGSFTFGSPIELSADAPLREPFAAWMQGGVNFNSGRLGAILAAQSMLNAG
GUT_GENOME127701_01794311-424AVFSKLGYEVFPTASAKRSDITQAILTKTENELVELIRAVQKASPIDSNVVPYPWDMPGYQDKVIMAAGSFIQGSSIELSADAPIKKPYVAYFQGGLTLENIKLALMLALTYMK
GUT_GENOME178492_01189284-418LYKGLFYAPHTVAQALKTAHLAAYLFEALGYKVEPRWDEPRYDIIQTVVTGSAEGLCAFCRGIQQGSPVDAFVTPEPWQMPGYSDLVVMAAGGFTQGSSIELSADGPLRAPYTAFFQGGLTYESGRFGIMCAAEC
GUT_GENOME233154_00628313-429VWGAWVMENCGLEVFPKWNDPRSDIIQAVRVGSEEGQKAFCLGIQKSGPVDHSACPIPLQQPGYDDPVIMAGGTFIQGSSIELSADCPVRAPYGCYMQGGVSFAHVQIGVLRALQEM
GUT_GENOME224461_02472281-420YQGFFLAPHVVAQALKGAVFTAAFMERLGMETKPSWEARRTDLIQTVTFHDARQMTTFCQAVQMASPINSHVLPYPSPMPGYEDKIIMAAGTFVQGASLELSADGPLRPPYTLYIQGGLTFEHVKAAMIIAVNRLAEKKL
GUT_GENOME035504_01231139-255IFASKILEELGYNVEPKYNEKRADIVQTIHLGSKEKLIKFCQGIQKGSPIDSNVIPEPGEMGGYEDKVIMAAGTFTEGSTIELSCDGPIREPYIAYMQGGLTYEYGKYGILKAIEEM
GUT_GENOME211787_00465269-402NASYYPFFEGIFVAPHVVACALKGNILLGKVLSKLGINSIPKVGFIPHDTVRSIMFNDKDKLISFCQLVQSLSPIDSNVTPIPWLMPGYKDEVIMAAGTFIQGSTMELSCDGPIRPPYICYIQGGLTYEHVKIL
GUT_GENOME030161_00422447-586TPRDMYLGFYYAPGVVCEAIKTAIYAQCMLELLGKKPIPRYCEAHNDIVTCFDAGTAEALIGFCQGIQAASPVDSYAAPEPSPEPGYTDEVVMASGSFTQGSTIELSCDGPLRAPYTCYLQGGLNFAASRAGVLLAVQKA
GUT_GENOME220494_00682521-661QFFQGLFMAPTVTAAALKGAIFAANLYERLGFKVIPDSNEPRYDIIQAVELMKPEYLIAFCKGIQAASPVDSHVTPEPWDMPGYDSKVIMAAGTFVSGASIELSADGPLRPPYSVFFQGGLTWGHAKLGIMMSLQKLTESD
GUT_GENOME285156_01575293-401AAAVFSKLGYEVKPAHDQIRHDIIQAITLNSDKNLENFCVAIQVNSPVDAYVVPEPAHIPGYQDKIIMAAGTFVQGASIELSADGPMREPYNVFMQGGLTFEHGQLAIN
GUT_GENOME173470_02196332-470FYQGLFMAPHTVCQAIKTACLAAAVFESLGMTTTPPALEERADIIQAIQMKTPERLVAFCQGIQMASPIDSMALPEPWAMPGYQDQVVMAAGTFVSGASIELSADAPMREPYTAYMQGGLTYIHGRVALAKALERMVSQ
GUT_GENOME014217_00282296-410LLTSQIFRDFGYKTLPSENVCQSDIVACIKFADKNKLIEFCRAIQHSSPIDSNAVVEPCDMPGYENQVIMASGSFNQGSSIELSCDAPIREPFIAYIQGGLTYAHYKIALINALS
GUT_GENOME036138_01312301-421VYTAALCEEMGMKVAPKWNDPRTDIVQTVTFGEPDPMVKFCAAIQHYSPMNSFVDPIPYHQDGYEDDVVMASGSFTEGSTIELSSDGPLRAPYRLYIQGGLSYAHDKIAITHAVEETFYKN
GUT_GENOME215920_00875288-408ARLFCEVFSALGYSVVPRSGEQCGDIIASVNFNTADELIKFVQAVQTVSPVDSHVIPEPWDMPGYNHQVIMAAGTFVQGASIELSADSPIKAPYVAYFQGALTYEHAKIALKRCLETLFKY
GUT_GENOME212224_01440287-422FQGLFMAPLIVQQALKGAIFASAMLQNRGFAVSPTPAEKRTDIIQAVRFEREAPLLAFAEGIQAASPLEAYVHPEAAFLPGYDDAVIMAGGTFTQGSSIELSIDAPMRAPYIGYMQGGLSYAHVVYAVTLACRRLE
GUT_GENOME242929_00964309-434DALLGANLVSKCFEKMGFNSIPKSTEKRSDIVTAIELKNRRLVEIFCEAVQDSSPVDSQFTPVAWDMPGYEDQVIMAAGDFIEGSSIELSADGPMREPYYVYFQGGLTYDHVKIAVLNILNKINES
GUT_GENOME202694_01533319-453ASRGARATGVEIVPPAVFAARVMELLGYETEPVSSAVRHDIIQMIHMKEPEALKKFCKGIQFGAPVDSYVTPEPWDMPGYDCQVIMAAGAFIQGASIELSADAPMREPYTVYLQGGLTFESGKLGVLLAVEGLLN
GUT_GENOME242940_00981282-420LQGLYMSPRVVCEALKTQSLFSYVFTKLGFECTPEFNEHKSDIVCAIKFKDPDILIKFCKSIQKAACVDSFLTPEPWDMPGYENKIIMASGSFIDGSSIEISADGPIREPYIAYFQGSMSLEQGKLGLMIALQDLVDAG
GUT_GENOME030353_01664279-402GIGREAGSYFGSYLPFYQGFFSAPHVVAQSLKTSALFAKAFEYLKLPTMPPSDAVRSDIVQSLRFDTKEELIDFCACIQAVAPVDGYVVPEPWAMPGYSHDVIMAAGTFIQGASIELSADAPIC
GUT_GENOME153272_00444313-443VNAAKGAILCGQVYHDLGYEICPSLTDVRSDIIQSVKLGSPEAVVAFCQGVQAAAAVDSYVSPEPWDMPGYEDQVVMAAGAFVQGSSIELSADAPIRPPYIVYFQGGLTYEHSKLGVMMSLSRLLHDGLVQ
GUT_GENOME103858_00680281-416VLQGLYYSSKTTIESIKVALLFAQAFDNLGFEIIPTMDDPRSDIVQAIKLKSSERLELFAKAIQESCSVDCNLTPIADDMPGYEDKVIMSSGGFVEGATSELSCDGPLRKPYTVYLQGGLNKFHGKLALMKVLEKY
GUT_GENOME053673_00924296-412LLSECMSISGFETLPDKHSKLGDIICSIKFNDKQMLIDFCKLVQSNSPIDGYVTPEPWDMPGYENQVIMAAGCFVQGASIELSCDAPVKSPYTAYLQGGLTYEHVKIVAQKAVEYFL
GUT_GENOME057214_00707567-701SFFKGIFMAPNAVKSALKTAVFTAYMLEKLGYKNVSPAYNDERTDIIQTLELGSKENLVSFTQGIQSTSPIDSFVRVMAAPMPGYPFDEVMAAGSFTQGSTIELSADAPVVEPYTLYMQGGLTLEYGKLSILLAL
GUT_GENOME208799_00960308-428MMSAEFCSSVFKILGFSVTPEPGHLRTDLITAITLDTEENMCRFAEAVQSWSPVDSDATPIPGPMPGYVDPIIMAAGTFVQGSTIEMSADGPVRPPYIMYFQGGLVFEHALLAVMGAAERI
GUT_GENOME256578_01092301-427NALKTATFTAAIMTQLGFECFPKPNEPRGDIITAVLMKNPENLIAFCQGVQKGSPVDSFVAPEPWDMPGYDSQVIMAAGAFNNGASIELSADAPLREPYAAYVQGGLTYATGRMGILCALQEMFEKR
GUT_GENOME041645_00325359-495MFQGVFMAPSIVSEAVKGAILASQVFEDMGFISTPKPHDVRTDLIQTIKFGKKEPQLEFCKTIQECSPVESYLTPIPDVVPGYEDDLIMAGGTFIEGSTIELSADGPVRPPYAVYMQGGLNYAHVKIALTRTVEKIL
GUT_GENOME039789_00778340-483FYQGLYMAPHTTAQALKGAALFARVFEMLGMETMPSSNAKRSDIVQAVRFEEAEGLISFCRSIQSASPVDSAAVPEPWDMPGYTSQVIMAAGAFVQGSSIELSADAPIRKPYTAYVQGGLTYEHCSLAAMLVLTDLIKKGIAKL