UHGP-MC 33785


Information


Number of sequences (UHGP-50):
106
Average sequence length:
113±24 aa
Average transmembrane regions:
0
Low complexity (%):
2.62
Coiled coils (%):
0
Disordered domains (%):
0.02

Pfam dominant architecture:
PF07603
Pfam % dominant architecture:
189
Pfam overlap:
0.25
Pfam overlap type:
shifted

Downloads

Seeds:
MC33785.fasta
Seeds (0.60 cdhit):
MC33785_cdhit.fasta
MSA:
MC33785_msa.fasta
HMM model:
MC33785.hmm

Sequences list (filtered 60 P.I.)

Protein Range AA
GUT_GENOME111943_006213-123KVILLSSVLLAAPFMMNGKAQAADCVTQDCPTLGYTSVSNTGNCVKCPFGNFWACPKGNSSDGENEEKAILGQCTGYAKKCNLGDILNSDGTCTTNKVSGKTPIAIVIYLSGNCGEALALQ
GUT_GENOME111227_0160028-122MTLEAVLILLAAPFYAGTAGAEVCIAEEDCNNLGYTEDKSCTDGLKCPFGEKWHCPDKKCQIGWILNSDMSCTENVESGKTPIAVVVYLDDKGNG
GUT_GENOME101933_0075517-113FVNNVSATDCITAPDCATLGYTVDASSCSGAALYCPWDLSKAACKMGSVTCAVGSILGGDQLCYSPDSGTPEGVEPIGVVFDPENSLAVALTDINSD
GUT_GENOME007709_0073073-220RMTRKTFPAALLITTALFSFDARAATEECVEAPSCTDLGYTRTADECPDGSVKCPWNTGLVFCECGKAYKYACEGDNETAGTDKCGNYYAACDCDDGYIWKNGACVSSRPSCNIGDIFYTDNTCAAAANHDSSKTVLGIVVHVNDNGV
GUT_GENOME202102_002066-172LMSSVLFVAPFVFGESGEASAQCVATQDCASLGYTEASCPNGGIKCPFGNTWSCKGDGTTKPSCDSSYKYTCTGANQKPGADSCDGKYKSCKCASGYEWKNGVCSKKEIEPILGQCTGYAKNCAIGDILNSDGTCTVYKVLGKKPIGVVVYIGDDNCGQALALEDLG
GUT_GENOME246162_005439-155SVLFVAPFVFGESGEASAQCVATTDCASLGYTEASCPNGGIKCPFGNTWNCGNFTNEKCLALGFKYNCSGSNEISGTGKDCNGKYAQCNCKNGYSWIKEQCQLQTITCEIGSIYYSDKTCSSSVINGKTPLGIVVYVDGKGHGQALV
GUT_GENOME018669_0044320-125MRFTNILFSITLFTATFGVHNALAQVNCKQIPDCAELGYSNDTQTCNGSWLYCPFNSNYKKCVPNGASCDDFTDTDKTEWCNDIIPCDGDSSLTLCASSFRCETGY
GUT_GENOME219403_0108714-93LLYYGNVHAQCVATQDCAALGYAEASCPGGGVKCPFGNYWFCGEKKGVCQSCKAGMILNSDMTCSRAKTSGKTPIGVVVL
GUT_GENOME111366_010557-105CTCILSVLSCSAQATTCTIPPTCTSLGYTQSASDCEGKNTIRCPTDSSALFCGGTTDGSITSPAKILYADHTVSSIFNPNKKAIGVVFDETKRRAVGLE
GUT_GENOME049246_003757-136LTTAAAVMLLTAAPAAAQTCIKTPSCADLGYTKTAADCAGKTILKCPLDNTQVYCPGYEETLRTYKVGDTYVDANGIGVGVVTSVDSSGLHGRVISSSGYRGTASEAQSLCVSKTTGDLDWGVASGDAVC
GUT_GENOME219403_0014659-170RMTKMYVLLSLITLLPTLALAETCTATPDCKSLGYTQTSCPDGGVKCPWNQSLMFCNKKCAPACEEKKCQTGDVLYGNKKCYTCSDNYKLPGLSPIGVVLRTGMAVALNDVG
GUT_GENOME219403_0054353-156MTVLFLLVCFPSVSQAETCTPTPKCEDLGYNQSSCPDGGGVKCPWNTNLMYCCKKCEEKSPCQGCYVGRILNSDMTCSENKVNGKTPIGVVASQVITSKGAMIK
GUT_GENOME079424_0146120-114TTALLITTVLFSFDARAMTCTATPECAALGYKYTEAECEKGAVKCPSGSSYYCPNPRPKCAIGWIYYSDGTCSAPAAYTTSKTVLGIVVHVNDNG
GUT_GENOME286320_0030325-107TCATPPSCETLGFTKSETDCDGLEALKCPFDQSKLYCPQGGGNMSAAAPGAILYSDGTISDSVIASKTPVGIVAYVNGSTGFA
GUT_GENOME210365_005328-106ILLGGALAAMLPFSASADITCTATPDCASLGYSKTAAECPSGGVKCPFNSNLMFCLKNTTAYNFQLQVPTALYNVVYHDGSTSASYSYNANKTPIGIVA
GUT_GENOME038164_015396-154KFLLLGTSLSLIPNLTQAQCVATTDCATLGYTEKSCPDGKGLKCPFGNTFACPATEESICEKNGFKYTCNGINESKPETDDCGGKYSSCYCQNELEWNGGMCHERTHPQCEIGWIYYSDNTCSENLVKRKIPLGIVVYLNPNGGGQALA
GUT_GENOME133724_016791-120MKLSILLAGSILTSAIVGNAYAQTCGAQPSCSDLGYTYTGSTSDCLSPALKCPFNTSYFNCTKKADAIKNMVLDWSKKKSFNPTSSTYYPTSYGIVIGRIQDIDNVGAGVNINGISVSST
GUT_GENOME112860_009511-111MKKSCLIIFGSVLSLSGIPSSNAALNCTASPDCASLGYTMSAADCTDGVKVACPTDTSKVFCKTGVQPVTCAVGSILGGDQLCYKDKLPDNVKPVGIVFDTTNKLAVALTD
GUT_GENOME027163_0139415-110LTTAFSAFSVNAQTCAVPPTCESLGYDKSADECSGLAQLKCPFDQSKYFCTAYRNAGEGEPIAVGDIAYSDGTYSAVPIPTKIPVGVVYSTSGQIV
GUT_GENOME244479_0108115-107LLISVATASAVDVNCTKAPDCATLGYNKTATDCPKGGVKCPFDSNKMFCLKNTAAYDFQITKAVKLYDVVYHDGTTSTSTSVSGKTPIGIIYY
GUT_GENOME275683_0155136-129CTAAPDCASLGYTKSVDQCPDGGMKCPFDSSKMFCVSAGNMDFVFKNPIAVGHIVYSDGTTSASYNSKKLPIGIVVYVHHSPQKNHGLILAIDA
GUT_GENOME111227_013074-96TKLLSVAFLGALFPLAATAECLPTQDCKALGYTDAACPEQGVKCPFGNEWFCVGDKNTSSLDCQIGWIYYTDNTCSYDLVSGKKPLGIVVYTY
GUT_GENOME200661_001276-106KQMMGAVLLMASFSQSGSALATSCAVPPTCEQLGYKMKVDDCGENPVLRCPFALSDDNQVYCTESAQNQVVSVGAILYGDGTVASGLVTGKTPIGIVFDTA
GUT_GENOME202157_0031810-105LFCGVTMSPALALGECKPTQDCRALGYKEAPCPEQGVKCPFGETWYCPLRCKDGQVWKDGKCTVKRPKCQIGWVYYSDNTCSADVDKNKEPIGVVV
GUT_GENOME201297_0112422-115LWSSAESEAQTSGSEIACTQTPDCSTLGYTKTANQCPDGGIKCPFDGNKMFCVSGESVDYVYQNTLGLHHVAYSDGTTSASYNANKLAIGLIYY
GUT_GENOME008742_003461-122MNKIILTLLMTILFTPAANAQNCVNSSRCDELGYTKTTADCAGLDTLVCPFDENKVFCVFSEKDCEIGDILYENKKCYGKAPDGKTAIGIVFDAGNRLAVALDEKKLAWGDTSYKAAMEYCQ
GUT_GENOME285946_0151016-119ATAVTLLLGGEAAALTCTAAPDCADLGYTKTSCPKGGIKCPFDKNKMFCIREGGASDYKFVNAISRYQIAYSDGTTNKYYNTDKTAIGIITYVHQNKDRKHGIM
GUT_GENOME107121_0076623-106LAQTCVTPPSCENLGYNKTAAMCAGLDILRCPFDDTKVFCSKKYMAQIAIGDILFSDMTTSSNPVTGKTPIGIVFDSNQKLALA
GUT_GENOME008691_016701-175MRKVILLSSVLLAAPFMLGEIGEASAQCTTTPSCSSMGYTSGVNTGNCLKCPTGNYYFCPPKIEPSCDSSYKYTCTGTGYTGGSGEECGRKYKQCDCAAGYTWNGTACEPGAILGQCTGYAKNCKIGDILNSDGTCTSGKVSGKNPIGVTIYVGDNNGKTCGWAIALTDESIANV
GUT_GENOME010169_016894-99LFTPAANAQNCVNSSRCDELGYNKKAENCIDDDILKCPFDSNKVFCRTQEQPAATNCLNAKIGDILYSDKTCSSDYIAGKTPIAVVISTDKRLAIG
GUT_GENOME219403_0005618-113EQCTATPDCKSLGYTETSCPDNNGVRCPWNTSLWFCNKNDICRVKKCEIGDVLFADKKCYECFPGESKDLPHIGVVCAPGKAVNLIEMTTPWVTEY
GUT_GENOME110231_002491-114MNKKCKIWIGAMLFMTAFSVSTSRAQTCIQPPSCDTLGYTMTADQCGDAVKFLKCPLDQSKMFCLTQEEIDGNAVGHVGDILYSDKTFSTELIKSKTPIGVVFDEANHLAVSLD
GUT_GENOME039892_009563-128FKMNMITSTLLAAGFSLLAGSAAAQTCVANNCAALGFTKSASNCEGDIIRCPFDTSKVFCKEKEVYVMEAGDILYSDKTTSHYYTDNNKTPIGVVVDPSRRLAMALNCTLKQWANNGYNKNNISGI
GUT_GENOME068815_014291-136MKLKFLLLGTSLSLIPNLTYAQCSPAQSCVELGYKETTNKGGCLKCPFGNSYYCPERCEVSYQYTCTGANEQPGTDKCGDKYKSCNCTSGFEWKDGKCQTKGAILGECTGYAKNCAIGQILNSDGTCSNDKVSGKT
GUT_GENOME173198_004979-106FGVFLPALIAFVPAAQAATCAVAPTCEQLGYVQSSSDCAGAKNILKCPFDLSKLYCSKASAEVKVGSILYGDGSVSDGYEAGKTPIGVVFDAENRLAI
GUT_GENOME285458_013919-117LLGASLSLIPQYATAQCAVTDCQQLGYTSLKSCEGGLKCPFGEYWACPKVTKAELGSCNGYAQNCKIGQILNNDGTCSNDKVSGKTPIGVVVYIGNDNCGQALALNSLG
GUT_GENOME208565_002573-109FNLKLSLNLLAAGFSLTYLAAPAQAACVSGDCSALGYNKSESACSGDIIRCPFDTSKVFCKENKIEIGDILYSDMTTSHSFNIISGKTPIGVVFDTEKRKALALSEF
GUT_GENOME203591_0061417-169TECLAAIAGAILMFGAGAAQADCVATPDCESLGYRETSCVGGGVKCPWDTSKMYCNPNMCSLTLTKEKCAEQCREVGEQSCVKNGVTYYAQCGVSKCPEGQGCKAGACVTCDNTCAVGNILYSDFTCSSCLLNDKTPIGIIAYSSGSMRLAVQ
GUT_GENOME136298_014105-107MLLIAGATLLISAFSAGLIRAQTCIQPPTCEELGYIYSADECVNGNNILKCPTDLSKMVCSFDAIGNDGDILYADHSTSSEVINAKTPIAVIFDAGNRLAVSL
GUT_GENOME075988_009331-117MKKLSFILTLASGSLLSFGAPAEEISCTASPSCDTLGYTQTDCPKGGIRCPFDKSKMFCVSGFGGEDFSFKNAIYQYQVVFSDGSTGTFYNADKDAVGIVTYVHPNTEGNHGIIMAL
GUT_GENOME219403_0027488-216RKGFALRGMTVLFLLACFPSVAQAETCTPTPDCKSLGYTQSSCPDGGGVKCPWNTALLYCGENGKKICADMGHPYTCTGANEIPSGKACANKYYTTCTCASGYEWKNGKCEKNPTNGLVGDLYYCNGTV
GUT_GENOME219403_0129839-141MTVLFLLACFSSFAQAEQCTPTPDCKTLGYTETSCPNGGGVKCPWGNFWFCGEKCNKCKLKSCEIGDVLYSDKKCYTCPDIYTLPGLPPIGIVFTTGKAVGLV
GUT_GENOME068815_0061510-201HVSLAVLGAIALSAVPGVVSAQCVATTDCASLGYKSSKDDGGCLKCPFGNGYYCPERCEASYQYTCTGANEVKPSSAACSNRYQKCDCNSGYEWNSGACKQCASKYKYTCTGANESKGADSCGGKYSACQCATNYEWNSTSGKCESVAVLGVCTGKAQSCSVGQILNNDGSCSNSVVSGKTPIGVVVYISGG
GUT_GENOME051047_01620319-418VLGMMISGRAMAQAETTPTCVKAPTCEELGYDKVASDCGELAKLRCPFDPDKYFCTAYTDQNGKKLAEIGDFVYVDAISAEPISGRIPVGIVYDLSGKMM
GUT_GENOME219403_0130469-167LRMTCLLGTLFLLPSLAQAEQCTPTPKCEDLGYNQSSCPDGGGVKCPWNTSLMYCCKKCAPSCEVKTCQIGDVLYSDKKCYICLPSASPEQAPIGILFT
GUT_GENOME007680_009381-120MKQNILLYGLTAVLLSGISPARADVTCTATPDCTTLGYTKSASDCPNGGVKCPFNNSKMFCLKSGGGVEFSIKNAVKTGDIVYSDGTTSASYTYSTTKIPIGVALVTETNSAYNHGIIFQ
GUT_GENOME277749_013316-156KFLLLGTSLSFIPQTVNAQCVATQDCATLGYTQSSCDGGKGVKCPFGNTWACFPSEEDICKDNGFIETCSGEGQVGIGHPCGKLYSSCSCSPNYQYTCSGTGYKSGSGQACDGKYASCACASGYDWKDGACKQKDRDYSACKIGTLFYSDN
GUT_GENOME020081_010788-107LMSGVATVLFPLFSSHAQVCTPTPDCATMGYNKTEEQCVGLSVVRCPYDINKLFCTDGQTGGDVNVGDILYSDGSTSSLLISNKVPIGVVFYVTNSRKDI
GUT_GENOME202157_013552-103RRIFLMSSVLFVAPFTAGNAAAQCITATDCASLGYDMPSCSDGKGIKCPYGNKYACPICASSYRYSCTGSYQSPQGTPCKNKYESCNCSDNRTWTNGSCTCN
GUT_GENOME050524_004936-123KMTIGAMLLLTTAFSAFSANAQTCAVPPTCESLGYDKSADECVDKAVLKCPFDENKIFCSNGIADGVAAEDYGGGTGPNGSYQVGDQLTCKGKPVGYVISVSTNGSGGKIAALDYESN
GUT_GENOME272660_0162529-117ESCAVPPTCDSLGYTKTASDCQDMSTLKCPFDTSKVFCVSADELPTGSCGTPEVGDILYSDMSCSTDVVKGKRAIGVIFDPSRRLALAL
GUT_GENOME007709_00731270-335PITDCGTLGYTKTADQCPDGYLKCPYGDTVFCPNVPKAGYILYSDMTTSAEIISGKTPIGVVFDQE
GUT_GENOME038164_013768-158LYLLCGVTMSPALALGECKPTQDCRALGYKEAPCPERGVKCPFGETWYCPIHCEESYRYTCTGANESKGEDSCGDKYSACQCKTNYKWNSTSGKCEQIEVLGVCSGAAKNCKIGQILNSDASCTDDKVSGKTPIGVVVAINKNCGYAMTAS
GUT_GENOME111227_007189-119SVLFVAPFVFGESGEASAQCVATQDCASLGYTEASCPNGGIKCPFGNSWSCKREKENVSDDPKDIQCYNNCCIGYIYHSDGSCSKMPNTAKVPLGVVVLLDGKGHGQAIAT
GUT_GENOME246162_004506-174KFLLIGTSLSLIPNLTQAQCVATTDCATLGYTEKSCPDGKGIRCPFGTTFACCSDNCVKNGFKYECKGTGYKSGSGQACNNKYTSCTCTEGYEWKSGKCEKISEPEKAILGQCTGYAKNCKIADILNSDGTCSNDKVSGKTPIGVVVYIGGNNCGQALALKDLGRMYWS