UHGP-MC 104029


Information


Number of sequences (UHGP-50):
142
Average sequence length:
95±12 aa
Average transmembrane regions:
0.01
Low complexity (%):
2.18
Coiled coils (%):
0
Disordered domains (%):
1.5

Pfam dominant architecture:
PF00484
Pfam % dominant architecture:
9014
Pfam overlap:
0.47
Pfam overlap type:
shifted

Downloads

Seeds:
MC104029.fasta
Seeds (0.60 cdhit):
MC104029_cdhit.fasta
MSA:
MC104029_msa.fasta
HMM model:
MC104029.hmm

Sequences list (filtered 60 P.I.)

Protein Range AA
GUT_GENOME007189_01789187-282QEVDPTYFKRLADLQAPEFLWIGCSDSRVPANQIVGLLPGEVFVHRNVANLVVHTDLNCLSVIQYAIDVLKVKHIMVVGHYGCGGVTAALQKARVG
GUT_GENOME155645_014723-116DIAKIIETNKDYVKHHPELPKAGLTSHPTKKLGIVTCMDTRLVGMLEEALGFHRGEIIVIKTAGNSVTQPIDNIVQSLLVATYGMEIEDVLIIGHENCGMIDFSATTFMESMKA
GUT_GENOME243042_003304-114IERLLKGFERFQRRYFEARPELFDALRTGQNPQTLLIGCSDSRVDPGLLLGCDPGELFTVRNVANLVPPCGDGASGRLHGVSAAIQFAVEQLHVARIIVMGHAGCGGIRAL
GUT_GENOME096200_00252128-214NNSERVREAALGQYPKAVIVSCLDSRIPVEDVFHRGIGDIFVARVAGNVVNPDILGSLEFACKVSGSKLVVVLGHEHCGAVKSAIDD
GUT_GENOME100276_0205013-116NARHVESLAADHFEGVREAQSPAVVSVSCSDSRVPADAVWSADRAGELFTSVNVGNQVWTEIDGRLVVNDAVGYAVSALKSTDIVVLGHTGCGAVTAAYETVTD
GUT_GENOME282114_022285-116LDEILSFNESFVKFKQYEPYITSKFPDKKIVILTCMDARLVELLPKAMNMRNGDVKIVKSAGAQVSHPFGAIMRSLLVAVYELQADEIYVIGHHDCGMSAIDPEKMIEHMIE
GUT_GENOME194587_0078636-127ELPRDSARRESLAREGQYPYAFLLTCSDSRVVPEFIFSAGLGELFVTRAAGNVLGTSILASAQYAAQDLGAKLFVVLGHSQCGGVKATLSGE
GUT_GENOME237844_0194841-155STPEEALAELIAGNERFANEKSIHPHGDIDRVEETAPHQAPFAAVVGCSDSRVPVELLFDQGVGDIFVIRTAGNNVNSEMVMGSVDYAIEHLGVKLLLVLGHGSCGGVTGAISEG
GUT_GENOME096290_0219112-103FDDLLNANADFASRFDLAGFDGVARAGVAMVTCMDSRIDPLGMIGLQPGDAKILRNPGGRVTDQALVALVLGVHLLGVERILVVEHTRCAMA
GUT_GENOME096365_0130353-139QDMARVDELKSGQQPFAVIVSCSDSRVSPEIIFDQGLGDLFSIRTAGNVMADFEEGSIEYAAEHLGSTLIVVLGHTSCGAVKAFMDI
GUT_GENOME141762_02835563-645LTDSQDPYELFLTCADSRILPNVITASGPGDLYTVRNLGNLVPTDPDDRSVDAALDFAVNQLGVSSVVVCGHSSCAAMTALLE
GUT_GENOME018685_0224442-139FLQGIHDHEMHSEDKREGLIRDGQHPFAVIVTCSDSRVQPVVSFSMCLGDLFMVRSIGNFIDRVEEASVEYGVGHLGAPLVVVLGHTHCGLVHAAMTG
GUT_GENOME172372_0122019-116ALKKEFYAHLHLGQNAEVLIISCCDSRVVPNIIMKAEPGEIFITRNIAALVPPYDESHTSYHATSAAIEYAVKGLKVKHIIIMGHTKCGGIESLVKNA
GUT_GENOME207650_0067635-117ESRLRARLTRGQDPRVIVLACSDSRAPIEHVFNIGFGDAFVIRTAGHILDNAVLASLDYALENLNANLLVVMGHQSCGAVGAA
GUT_GENOME029386_0388215-118AFPKREALFKQLATQQSPRTLFISCSDSRLVPELVTQREPGDLFVIRNAGNIVPSYGPEPGGVSASVEYAVAALRVSDIVICGHSNCGAMTAIASCQCMDHMPA
GUT_GENOME142593_0240753-150VAGILAHNRAFVADGEYERYETDKYPDKKLAVVSCMDTRLTELLAAALGLKNGDAKIIKVAGAEVAHPFGSVMRSLLIAVCELGVEDIMVVAHTNCGA
GUT_GENOME141041_0190011-107EEFIQRIKEQDPTFFDELKKGQTPEFFVLSCSDSRVSPSVITQMPLGHMFVHRNIANQVVTEDESFSASLYYALKHLKVKKIVIKGHTDCGGVKAAW
GUT_GENOME095248_0117210-109RRFHDDAFPQYRQQFQALVDEGQHPTTLFIGCSDSRLVPYLLTGAGPGELFLVRNVGAFIPPYDGSHGHHGTTAAIEFAVLNLQVRRIVVCGHSHCGAIK
GUT_GENOME125510_002527-116QILKANESFVKQSLSQGGFDEVSKYPSRNLAVLTCMDTRLLSFLEPAMGLVRGEAKLIKVAGNTAFEDFDSVIGSLMVAVYELHVHDIIVMGHDDCGMLKTTADSLCRHM
GUT_GENOME156113_0193039-161MTNFHELIEGFRNFREDYLLREAEFFETLKSGQNPRTLVVACCDSRVDPALLTGCRPGDLFVVRNVAALVPACGEAARADAVMAAVEYGVKHLEVEHVIVMGHSNCGGIHGLMHPEAVADEDY
GUT_GENOME096512_0141476-185LKEGNKRFVEDKSQPNNVDAKRRKELESGQHPYATIVSCSDSRVTPALVFNAGLGDIFDVRLAGDVVDDSALGSIEYAVDHLHTPLIVVMGHEKCGAVTAAYDDVVKGQK
GUT_GENOME141118_0331877-167KMQQHDYLAQKRSSADGQFPAAVILSCIDSRAPAEIIFDTGIGETFNARIAGNISNDDLLGSLEFACAAAGAKVILVMGHTACGAVKGAID
GUT_GENOME060912_00259109-219AEQALERLRAGNREFVGTHTNTSVISQEIVRHMFEQGQHPFATVITCADSRVAPEHIFMTGLGEIFTIRTAGNVVGEMELASAFYAADHLHTPLLVVMGHTHCGAIEAAQC
GUT_GENOME142546_0049119-121FQTNYYHPENFRFEDLQHGQQPSTMVIGCADSRVDPAMLMGCEPGELFVVRNIANLVPPCEDHAHETHHSVSAALEYAVTSLEVERIIVLGHGCCGGIRALMD
GUT_GENOME080363_0285621-121LKEGNQRFVNNLKANRDLLEQVNATREGQWPFAVVLSCIDSRTSAELIFDQGLGDIFSIRIAGNFVNQDILGSMEFGCNVAGSKLVVVLGHTKCGALKGGL
GUT_GENOME237874_0073726-124RYLSRQLSEKTDYEEIRQALAEGQKPFAIVLCCADSRVAPEICFDQKLGDIFVVRNAGNVVDETVLGTIEYGVEYLGIPLVVVVGHSRCGAVAAACKGG
GUT_GENOME226261_005414-114IDNILQINNEFTSKIKINSLKKSKYPQKKLAIISCMDTRLIDFLEPALGIHRGDAKIIKTAGNIITNDFNDIIRSLLICIYQFNIKDIIVIGHYDCGMHTTSAESLVDKML
GUT_GENOME171366_007235-128MESILEFNKEFVDKREYEQFLTNKFPDKKMAIITCMDTRLVELLPRAMNLRNGDAKIIKNAGAIISQPFGSVVRSVLVAIYELGANEVVVVGHYGCGMTGLNSEGIIEKAKARGVQDVVLDTLT
GUT_GENOME096373_012802-101TLLNQILKYNKDFVASKAYEQYSTSKTPSKKAVLLTCMDTRLQDLSTKALGFTNGDLKVVKNAGATISHPYGSTMRSLLVGIYALGAEEIIIMGHKDCGM
GUT_GENOME018196_0162443-149LKRGNHDFVTVRAHERDLSSERIEELYAYGQNPFAVVVSCADSRTVPEHMFMMGLGDLFVVRTAGNVVGPVELASVVYACEHLGVKLVLVVGHTGCGAIQATIEACG
GUT_GENOME039239_0002912-105MEGNERFVTGETYARGWSIASRRKELFEQGQKPIAVVVCEADAYMPPELVFDSDFGELFTIRLPELSIDKNVLDAITYAAEKLHTPLCVVLAHD
GUT_GENOME208007_0045719-116NARFASGTTRHPNSDEDRRLDLSSGQAPFAAILACADSRVPPEIVFDCGLGDLFVVRSAGQMVENAVLASLEFAVHKLAVPLILVLGHDNCGAVAAAE
GUT_GENOME140366_08526110-190RATAGGQFPFALVIGCMDSRVPPELVFDQRIGDIFSARLAGNVAEDDVVGSAEFATKLAGAKLIVVLGHSECGAVKGAIDG
GUT_GENOME098208_004771-99MIDEIIQYNQTFTAGKGYLPYQAEKYPKKKLAILACMDARLIELLPAALGLKNGDAKIIKNAGGMVLDPYDSAVRSLLIALLELKAEKIMVIAHTDCGV
GUT_GENOME044180_0134827-119AVALKKTPHRRCAIFTCMDARLVEMVEPALGIRRGDAVVLRNAGNIIGTLEGTMIISLLVAVFMQNIEEIIVVGHEDCGFTHASSAVLLDKMR
GUT_GENOME109114_0063013-116LSAGFRRFRRHWYCEEHNIYEDLLEGQSPHALVIACSDSRVDPALILDCRPGDLFVIRNVANLVPPYSPDSGHHGVSAALEYAVRHLRVDTIIVMGHSHCGGIA
GUT_GENOME223636_002664-118LDDILAHNREYVEDQNTGYVDTDTKCSKMPSREMAIVTCMDTRLVNFLEDSMDIGRGEAKIVKTAGNCITGPFDGVVRSLLVCIYELGVNEIFIIGHHECGMAKTTAKDLTEKML
GUT_GENOME229516_0174936-135DSRLYARLAKKGQQPHTLVISCADSRVVPEMIFNAQPGELFVLRNIGGLTGPDAVQAGIEYALNTLNIRNIILLSHQDCGAVKALKEKEQLPPSLKKWLA
GUT_GENOME208134_0055186-165EELVSGQYPYAVVVSCSDSRVPAEEVFNAGLGEIFTIRTAGEALNSTDIGSVEYGVEHAGAKVVVVMGHSGCGAVAAAVE
GUT_GENOME096078_00489561-650KNERLQRDIYRQIRVTADQGQHPIAAVLGCMDSRAPTEMLFDVGIGDLFSLRIAGNVAGQKVIGSLEFACQAKGSKVVVVLGHTDCGAVT
GUT_GENOME096381_0107621-109DRDSYGRLAEGQSPEALFITCSDARVQPAVFTGARPGQLFELRTAGNAVPPHETDGRPTGESATIEFALGVLNVPEVVVCGHSHCGAVG
GUT_GENOME243108_0103965-160RYVEGVSRRHDFKHEREALATGQNPFAGILSCADSRIAPEYAFDTGRGDLFVCRVAGNFASDEMIASLEYGSSVLGVPLFMVLGHDSCGAVDAAIK
GUT_GENOME236870_0018740-123EEMSHHAQKKLAILTCMDCRLIDFFEPALGLKRGDAKIVRNAGNSIVGEDAIRSIGAALYNLGAEEVLVVGHTECGMAGADVNA
GUT_GENOME225911_0024032-118LLSHQERTDMAQDQNPFAIILGCSDSRVPAEIVFDQGLGDLFVIRVAGNIVAPSQVGSVEFAAERYDCAVVVVLGHSHCGAIQATVD
GUT_GENOME171660_0366318-115QRYANETLRYRQLAEQGQTPETLVIACCDSRAAPEIIFDTSPGEIFVIRNVANLVPPYEPDGEYHATSAALEFAVQSLKVKNIVVLGHGRCGGIKAAL
GUT_GENOME074459_0075617-115KRFINATSNPGDVSLERRVDTLKNGQHPYAIVLCCSDSRQVPEAIFSAGIGELFVIRVAGNVVDSHQLGSIEYAADHLDCKLVVVLGHNHCGAVEAAIK
GUT_GENOME226783_003126-115IINKLKDGNKQYVKGGIIGDVSKNRRIDIYLNGQSPFALVITCSDSRVIPEVIFQQGIGDLFVVRVAGNVVDEFVLGSIEYAIEHLNITCVVVLGHKNCGAVSSCLNHAS
GUT_GENOME096213_004078-114IFENNRKWQETKRQQYPDYFKDLADGQNPEYLYIGCSDSRVAAEELMGFEPGDVFVHRNVANLVHGLDMNAASAIEYAVSHLKVKHIIICGHYNCGGIKAAMKPQDL
GUT_GENOME283178_006075-99LQEVLEANKKYAAEFGEKSKLPIPPGRKFSIVTCMDARLDPAKFAGLEEGDAHVIRNAGGRVTEDIIRSLIISHKLLGTQEWFIIHHTDCGMLTF
GUT_GENOME176916_0051410-114QLIDGNRRFAEGKSRFSGYSVDLRESLVAEQHPHTVIVSCSDSRVPPEIVFDAQLGELFSVRTAGPTLDDMVLASIEFGVVNLGIRHVVVMSHTNCGAVAAAMDA
GUT_GENOME038074_0090917-118FYFDAENDYYTSLNKGQHPKAIVIACSDSRADPALLMGCDPGDIFVVRNVANLVPHADDALRRDAVLAVLEYGVHHLKVEHIIVLGHSGCGGIQALLNPESL
GUT_GENOME096443_031993-102LLQEILDYNQEFVATKQYEAFLTSKYPSKRAAIVACMDARLIELLLQAMNMKNGDANLVKVVGGDVSQAYEGVIRSLIISIYEFHVEEIFIVGHHHCGAQ
GUT_GENOME062976_016653-77ASRSPKPRLTVVACMDFRLDPLAVLGLDDGEAYVIRNAGGVLTDDVRRSLAITQHALGVEEIALVHHTDCGMEGL
GUT_GENOME079554_0126919-121LLAGNRRFAEGKNEHPWQDPQTRESLIDRQNPDAAVLSCSDSRVPPEIVFDAGLGDLFTIRTAGQVIDDAVLASLEYAVDSLHVSLLIVMGHEGCGAIDLATH
GUT_GENOME103752_0197333-120RRDPATRQQLTSGQNPAVVVVGCSDSRVPIELLFDAGFGDIFVIRTAGGCVDSAVSASVEFAVDGLGVDLVLFLAHEKCGAVGAAVEA
GUT_GENOME096381_0158727-108RKFTDPGMDARPVRRVAVVACMDARLDLHAALGLRLGDCHTIRNAGGVITDDTIRSLTISQRALGTRSVVLIHHTGCGLLDL
GUT_GENOME261969_002994-118LDDVLKANETYVENILAEHGEEGSHAAKKPQKHIAILTCMDTRLVDLLEKTMGVDRGDANVIRVAGNCITDVFSDVIRSLIVSIFELGAKEIFIVGHEDCGMEKTCPNELAERMV
GUT_GENOME126180_0146444-147RFVTLKEKHPDDNIERRQEMLKGQHPFVVILSCSDSRVPPELIFDQGLGDIFEIRNAGNVLDEHVIGSIEYAVMHCGVKLIVIMGHQDCGAIAATLSGKSETKY
GUT_GENOME003530_0084724-116FAGDISMEKRLALTGGQSPRAVVIACADSRVIPEVIFSCGLGELFTIRIAGNVIDAHQLGSIEYAVSHLKTPLVVILGHTGCGAVQAALHGEA
GUT_GENOME106452_0006633-136RLMEGNSGYKTIPANPAVLSQELREGTAQNGQSPLAVIVCCSDSRVPPEHIFHAGIGELFVIRNAGNLISDFALGSAEYAVEHLETPLVVVMGHTGCGAVGTAL
GUT_GENOME155112_013237-112LLEANLEWVEKRLDLDKDYFKNLSKGQNPPFLYIGCSDSRMPIDTFTQSEPGSFFIHRNIANQVFSNDMNFLATLEYAVEQLEVEHIIVSGHYECGGIKTAHDKCR
GUT_GENOME096469_0288644-124LAAGQHPFATVVGCSDSRVSPEIAFDRGLGDVFVVRNAGHVLDDATLGSIEYGVHVAGTPLLVIMGHEACGCIAATMAAAE
GUT_GENOME103718_0331930-115DGQRPPVVSVCCSDSRVSQEGMWAVDRPGFLFTAGNIGNRVSDRVDGERVLAGSVAYPLAYTETDALAVVGHTGCGAVGAALSAAR
GUT_GENOME098613_0177721-105QDYFDKIQGIQTPWLTLVACCDSRIHTNIMMEDATNEVFEIRVIGNVIENALGSVDYGIYQLHTPMLFIMGHSDCGAIKGSLDER
GUT_GENOME143153_001782-119KELFEGAIKFREEDYNDHKELYESLKKHQDPHTLFISCVDSRVVPNLITNTLPGDLFVVRNVGNIVPPYKDSHRPQDLREGFLATTAAIEYAITILEIKNIIICGHSNCGACSAIYEP
GUT_GENOME132078_0107155-161LLQGNELFQKNYFRKNESQLLDLVSSGQHPKALFIGCADSRVIPSLITNAPPGQLFVLRNVGNFVAPYKPDEDYHATASGIEYAVTTLNISEIIICGHTHCGAIEAL
GUT_GENOME095246_00357122-196GEKPFAVIVGCSDSRTAPELIFDCNLGELFIVRAGPTVGREALGAIVYAVERLGASLVVVLGHTQCGIVGAAVDV
GUT_GENOME096202_047432-101STIHEILAHNRIFVQEKQALPFETIRYPETKIVILTCMDTQIPELLPKALGIRNEDAEVIQNAGAVVSHPSGSILRNILLAICKLKAKEVYVIGHHECGL
GUT_GENOME212304_00621276-374VTMFDDLMDGNARYAESFDLADVPGQAARGVGIVCCMDSRLDPPAIFDLKVGDAKVLRTPGGRLTPDARIGLVVGSHVLGIDRIVMLAHTKCAMASGDD
GUT_GENOME055201_0080132-122CERYPEKMRELFQRQSPGAMMLSCCDSRTDPALLFSCHPGDLFVHRAIAALVPPLDSPVGACIRAATNYAVESLKVRDLIVLGHTACGGIS
GUT_GENOME194757_0109720-111AADPELFSRLHQGQAPGALWIGCCDSRVPAEQICNANPGELFIHRNIANLVDEDDANVMSVLEYALKALEVEHVIVCGHRGCGGIQAAISGD
GUT_GENOME275813_0019217-113RFVNNASIHPNRCQETKNSLIAKQSPYAIILSCSDSRVPLEIVFDAGLGDIFAIRTAGHVLSQEVMGSIEYAVVHLGVKLIMILGHDNCGAVHSAVD
GUT_GENOME107773_0124521-102LVEHGQTLMRAVVTYSDSRVNPEVIFSAGLGEIFVIRTAGNIPLEGSLASIEYVIEHLDLNCVLVMGHTHCGVIDAAIHGEG
GUT_GENOME140359_01524122-227ERFVAGKPQHPSQSVEHRASLAAGQSPTAVVFGCSDSRVAAELIFDQGLGDMFVVRTAGQAIDTAVLGSIEFAVSVLNVPLIVVLGHDSCGAVKAALGAIEEGAIP
GUT_GENOME095246_0044150-133DHLRKLAEAKHPFAIIVGCADSRVSAQLLFNQRLGELFVVRLAGNTVDRVALGSIAYGASVLGCSLIVVLGHAKCGAVTAAVEM
GUT_GENOME011013_0054023-128RYLESGLACLQKGALPREHTRALVIACSDSRVDPALLFGFDLGEVFTIRNVANLVPPYEPDSTTSHGVSAAIEYAVRALEVEAIIIFGHSNCGGINALLHMDEHGG
GUT_GENOME004073_0106510-93NEAAYAEKFPGEAPHLPEMKVAVVTCMDCRIDVETMLGVEPGQIHVIRNAGGLVTPDTIRTLAASQRSLGTKYVMLIQHTDCGM
GUT_GENOME243313_0230225-112KLALAQTPDALFITCSDSRVVPDLLASTHPGDLFTMRNVGNLIPPATAEGVSTGDLSEASAIEYAVLVLKVANIVVCGHSECGAMKAV
GUT_GENOME233297_0056810-118MRRLLDGYRDFFNGNPARKGFCAIHAEQVACSQNPHSMIISCFDSRVCPEAIFNTHDGHVCVHRNMLNQVCKRDESMVASLRFAVDALKVQNIVILGHSDCNAVQKLKH
GUT_GENOME112019_0065925-115NSGYDAFLETSDSSLYKDLAKTQNPHTLVIACSDSRVVPEYIFNAKAGDLFSLRMPSADTDVSAAVDYAVNTLKIKNIVILEHSDCSGLQD
GUT_GENOME142012_015396-112LIKGNKKFREASFSKYETDLKQLVKTGQKPEVLFIGCSDSRVTPDLMLDTKPGDMFILRNVGNFVPPYNPDNDYHGSSAAIEYAVNVLNVKHIIVCGHSHCGACKSL
GUT_GENOME006483_0133029-118QDLREELKNKGQKPYATIITCSDSRVPVQHIFSSSMGELFIIRNAGNVIGDFEIGSVEYASEHLGVELIVVLGHTHCGAVHSTLHNEGHS
GUT_GENOME011907_0058431-137LDKLLEGNKNFVNGTPTTKNMSLETLKKYAFHQEPYACVLTCSDSRVVPEIIFDCGIGELFVVRVAGMTTGPNVVESVEYAVKKLEVPLVILLGHDDCGVMKYAKEH
GUT_GENOME121631_0132528-148EKACAFLEQILQENARFTAELPESYIHRNDKVSKYPAKQLAIYTCMDTRLVDFLEPAMGISRGEAKVIKAAGNMITGPFDEVIRSLMVAVYELGVTEIMVVGHEDCGMQHSTSESLKKRML
GUT_GENOME022301_0124211-122LEEILAANKAFVAHGKHDYTEEDIAASKLPKKKMAIFTCMDTRLTEILEPAMGIQRGDAKIIRTVGNYLTGEFDTVIRSLMVAIYELGVEEIFVVGHYECGMAKTTADSLAA