UHGP-MC 3334


Information


Number of sequences (UHGP-50):
106
Average sequence length:
124±16 aa
Average transmembrane regions:
0.06
Low complexity (%):
1.27
Coiled coils (%):
0
Disordered domains (%):
6.29

Pfam dominant architecture:
PF00759
Pfam % dominant architecture:
5660
Pfam overlap:
0.28
Pfam overlap type:
reduced

Downloads

Seeds:
MC3334.fasta
Seeds (0.60 cdhit):
MC3334_cdhit.fasta
MSA:
MC3334_msa.fasta
HMM model:
MC3334.hmm

Sequences list (filtered 60 P.I.)

Protein Range AA
GUT_GENOME178133_00876348-469WGSARYNTDAQFLALLYSKYKNDNSYAQWAKTQMDYLLGDNTAKTCYVTGISENSVKYPHHAAATGFTDPDSKAEHKYVLYGALVGGPDKKDKHVDKTSDYQYNEVAIDYNAGFVGASAGIN
GUT_GENOME115156_00360504-628GYGSARYNTAAGLQAMVYTKETGDMIFAEWAKDQMEYILGDNPMGYAYEVGYGNNFASQPHHRSSHCSPTQSMEDPIVQVHTLWGALVGGPDLDDVHVDITKDYIYNEVTDDYNAGFCGDLAGLY
GUT_GENOME021812_011941-114MAYDLTGNTEYEQKAYEQISYVFGKNPLGMSFVTGHGLNYPQNIHSRIAKAKNISLPGGLVGGPDSYREDIISSKIDASVPDAKVYKDDYECYSTNEITIYWNSALIHLLYRLD
GUT_GENOME106563_01611502-635GYRCVSNWGSARYNTSMQYTGLLYDKMMSIETYRGWSEGQVKYLLGNNAAKQCFVIGYNAYGPKYPHHRAASGYTGKDQGTTAQKNMLIGALVGGQKIDGSYVDSASDYVCNEVAIDYNATLVAAAAAIYEKYT
GUT_GENOME170224_001461795-1946WGNMRHNAAYQTSALVAAKYASGETKDNLTNWAKTQMNVILGNNTAKNANGDVGVCFITGFAKNSIQNPHYRATSKDDSSVVMKLKDDKAVIDDDLRVENILIGGLAGGWFSENQNSFEDRRSDYQQTEVACDYNACLTVAAAGLYNFYRTG
GUT_GENOME205607_00637367-487WGSARYNAMAQMIACIYDRANNSSTYSSWAETQMDYLLGNNSNNLCYVASFNSKSVKQPHHRAAQGQSTAEASGYSHELTGALVGGPKSDGSYKDKTVEYEYTEVALDYNAGLVGAAAGLY
GUT_GENOME122428_00695493-606VIGRPRFRKYAKMQLDYLLGVNATGYSFVTGVGSFCVNYPHHRQAFADKIEECFPGLVSGGPDSHRDDKIAEELIPEGTPPMKCYSDNMECYSLNEVTIYWNSPAVFLLAGLSE
GUT_GENOME284397_003871944-2096NGSNFTVIGDNWGSIRHNTLAQTVALIYDKYQDTKSYTDWCAGQMNYILGNNSVATNGNSSTCFVSGYADNSVKNLHHRAASGLTVDSNWSNWNSWDGIYTSDGHVIVGALSGGNGNNSDYKDNCKDHVGNEVALDYNAGLVTVAAGLYSVYK
GUT_GENOME026706_00467636-781TKQGFKMINGWGSARYNASAQFMGLIYDQANQKLSMTDGNYSSWATGQMKYLLGNNKAKRCFVVGYNENAAKYPHHRAASRSNDAGQVREDHYTLLGALVGGPSDEYDTYADNQADYNCNEVALDYNAGLVGAAAGLYLLHKGEEA
GUT_GENOME194942_02938658-768LAHDFTGERKYLQAMSDAMDYVLGRNPLDQSYVSGYGSRPLKNPHHRFWAHPVDPASPLVPPGVMSGGPNSINFSDPVASTMKGKCTGQTCWKDEIGAWTLNEVTVNWNAP
GUT_GENOME006069_00712404-522WGSARYNTAAQLCALVYDKYNNNGKPSEYSEWAKEQMQYLMGNNPMNRAYIVGYSENAAKYPHHRAASGLTRAEDTREQRHVLYGALVGGPDASDKHNDITADWIYNEVTIDYNAAFVG
GUT_GENOME157308_01162444-549ACSSAKQQKYHDFAKKQVDYALGSSGRSYVIGYGENSPKNPHHRTAHGAYSNNIGEPAQTKHILFGALVGGPDSNDNYTDDRNNYINNEVACDYNAGFTGALAKMY
GUT_GENOME009676_01208627-776NGGLVNIANSSKDKYYCYSEWGSARYNCNAQLMAMIYNKKMNGQTAYPDWAKYQMSIILGNNSLKKNLVVGYNENSPKQPHHRAASGLNGWDKFRDASVVSKYTLFGALVGGPRTSDFSTYVDLMDDDHTSEVTLDYNASFVGALAALYL
GUT_GENOME147168_02117354-474EWGSVRYACNQAFIDTIYLGLDKADKAHVDGANAYIERTINYTLGSNGQNFMVGYNASSPKNPHHRAAHGCWSNNLNAAPENSRHTLVGAIVGGPGSADDSYKDVRSDYQANEVACDYNAG
GUT_GENOME190764_00595324-459SEWGSARYNSGVQLTALVASKYPAAQVNYNNWAKGQMNYILGDNPANTCFVVGYADNSAKYPHHRGASGYSSSDEFNGKTEYSSNGHTLTGALVGGPTNSSGAYNDSVADYQANEVALDYNSVLVGAAAGLYSAFK
GUT_GENOME239411_00609311-441VNSSSNSYFFQNAWGSCRYNTALQQCALAATKNSSADYSAWCKYQMAYILGQNPANKCFVVGFTDNSAKYPHHRAASGLTSWDEFNNSGGVCPNGHVLIGALVGGPTDQGGSYVDSVKDYQANEVACDYNA
GUT_GENOME125907_00313306-471ITKSASDWAKVTTYLDSKATSESQYYCEDSWGSARHNVAVQMTALITSKYKESGKDYSSWAKAQMGMILGNNSTGKNLIVGFNENSPKYPHHRSASGHAYDPTDEGTPVWDTTNGHVLVGALVGGPTGTDFSTYNDSITDAVSNEVALDYNAGLVGAAAGLYTTYK
GUT_GENOME128142_00301649-767GEAFLAAAKLTGDAKYIEYAQKMRDYIFGVNPTGYCYVTGYGTLTPNATHHRPSQVLKETMPGMLVGGADNNLEDPYANAVLYGIAAGRAYVDNEQCYSCNEVTIYWNSPLIYLLAGLK
GUT_GENOME237059_00804367-498GYFLNKKVGGWGKLRFPANQAFVYGLYSKLKGDMSSINQYSLTTIEYIMGKNSGNLSYITGFGEKSPKYVHHRNYYREDTDKETGVTVKDKFRQFGYLVGGDLDGSYNDVPGADYTYSEGGIDYNAGLVGAL
GUT_GENOME011004_01167305-424SNYLFLDSWGSARLNCSMQMTALISQQHSGRNYSEWCKGQMNYILGDNPANTCFVVGFASNSASKPHHRACSGTSSAEDESPSKYTLVGALVGGPTDAGGTYQDSRADYKCNEVACDYNA
GUT_GENOME237440_01132417-539WGSNGAICDKAHILILANSIMPKKEYLTVAKAQADYILGCNPLNYCYVTGAGSKSPEHPHHRPSGASKKVMPGMLVGGPCASLQDEYARKHLAGEPPLKCYADAEPSYSTNEVAIYWNSALIY
GUT_GENOME211058_01615318-471FYVMNSWGSARHNTLMQTCALVATKHKEESGADFADWCQSQMNMILGDNNANVSLVVGYNSVSATSPHHRAASGLYVDDGWSQWGSWSGDYADVPTSHVLYGALCGGPTSTDFSTFRKLDAKDATSNEVALDYQIGLVGAAVGLYSAYGTGEVV
GUT_GENOME227344_00798358-491WGASRYNCSLQMMALGLANGNASSDYARGAKYQMDYILGNNSFGYSFLVGYGDKWPTHIHHRAANPGSGSAEANTSATYTNYGMLIGGPDSSGNYQDNQNSYQFTEPALDYNACFALACAGLVNLYGGDASALK
GUT_GENOME236868_00900543-647AVFMLYDMTGDEQYLNVALDNFYYNMGANPWGLSFIMGAGSRNENHPHNRASNPDGYNAGALPYEYRCPKGALMGGSAPHKTLKDDWNDYTATETCIDFSSQFVI
GUT_GENOME233268_02039471-592MAIASERAMTERAEKAMQSVRNDVHYLLGRNATGYCFVTGVGSKSPMHIHHRPSAADGIEEPVPGFLVGGPNLVQPTDCGGDGSERGIYPAKGYEDVECSYSTNEIAINWNAPMVFTLLGVK
GUT_GENOME008334_00391575-710NCSSNSPYGVAITKFNWGSNMTVANAGIILGAAYKVTGNTDYMDAANAQMNYLLGTNPVGECFFTGYGTVSPENPHHRPSMAVGKAMKGMLVGGVNQNLEDSAAKAYCQGLPAAKCYVDNSESYSTNEITIYWNSP
GUT_GENOME147168_0102287-223LKENFIWGSNMELLKYLMVLTVANELKPSTEFVNTITSGLDYLLGCNSNDVSYVTGNGEKAYKNPHLRPTAVDDIDEPWPGLVSGGPNTGLQDEQAQKLPKDTAPMKCYLDHVDCYSLNEITIYWNSPLVFVMAGLL
GUT_GENOME025619_01840626-760GYSWLCKWGSARYNCNMQMEGLVIDNHNGKNQYTDWATGQMKFLLGNSKDKRCYVVGYNENSSKYPHHRSSSGYGGFPDSGYQHTVQAHVLLGALVGGIEDASGTYHDSSADYYCNEVAIDYNAAFTGAAAALYL
GUT_GENOME236868_02367472-590AFLLTKDKNYLNAAQQTLDYLLGRNPLNITYVTGFGYRSPKNPHHRISEADFVDEPIPGMLVGGPHLGKQDIDLTGKEHWKCPNYASANLPALAYLDDDCSYATNEVAINWNAPLAYLT
GUT_GENOME165490_01494590-719LYSAYAWGSNADIANNGLILALANDIRANVNYVNSAADHLHYLLGRNSLNLSFVTNSGIHSPQHLHHRPASAANALLPGALVGGPDSALEDPVTQSLFNGDTAPAKCYVDDAESYSTNEVAIYWNSPLVA
GUT_GENOME193642_01068345-498TGSGYKDTNAWGQSRYNCGYQTIALAAANHSGSGVDTNNVKSWCKSQMNYILGDSNTNGTVLVSNFSANSTQKPHFRAGVGKRCDMTVKDDTTIIDGYDNDVFRLIGGMVGGPQYTGGSYVDRRSDYQTNEVASDYNANLVGAAAGLYHFYKTG
GUT_GENOME133685_01587419-530RLTGQKEYLDAAAQQLHYLLGRNPMGLSYLTGCGTDAIKHPHHRPSCFVGRAMPGMLSGGPCNWLADETVKEIFDKDIAPAKALADMTGSYSTNEVTIYWNSAFIQLLASVT
GUT_GENOME233278_00016182-282YKEAAAEQLNYLLGKNPLGNCFVTGYGTKYSENPHHRPSRVAGVTVPGMLVGGPDAALEDNIAKAYLKDAAPAKCYIDHADSYSTNEVTIYWNSPLTALIA
GUT_GENOME157226_00660488-619WGSNSEKCAGQGIALLYAYTLTNDHKYLDAAVGDADYLLGRNATGYCFVTGFGTKQVLHPHQRISHADGIEAPLPGFLCGGPNAGQQDRETCKTYPSTAPDESYTDDMNSYASNEIAINWNAYLVGFMSWLD
GUT_GENOME036564_00722402-547IGGWGGSRYNCAWQMYALTYAKYSGSSTYNEYAKSQMDTLLGTNSANRTYLLGCGDTWPQHIHHRAANPNKDTMTYTLYGALVGGPDANGSYDDNTDSYSCTEPALDYNGCFALAAAGLYGVYGGSADKADATIASASEIKSDYVF
GUT_GENOME236864_00294397-510AWMLTGEDKYRVGAEKQLHYLLGVNPVGYCYVSGFGSRPVAHPHHRPSVALGVCQPGMLSGGAASGLQDACARARLQHQPAGKCFIDRHESYSTNEICIYWNSPLVALMAGLTR
GUT_GENOME200541_01531320-467YNMTPTAGGLKIRNSWGSLRYAAAASAIAMQYYNISGDSKAKDLAMSQINYILGDNPSSMSYVIGFGNKYPSKPHHRAANGYQGFNDNNHMKDAKNVLVGALVGGPGEGDSYSNDPNKYTETEVGIDYNAGLVMALAGIITDGKISEI
GUT_GENOME018550_00651760-900YEWGSNSMVMNNAITMALAYDIDHEAKYINGVSTAMDYILGRNVIEQSYITGYGEHCLKYPHHRWWSGQLDAERFPYAPDGVLSGGPNSEMQDPMIQGAGYKAGSLAPMACYLDNVEAWSVNECTINWNSPLVWIASFLED
GUT_GENOME011004_01298318-449WGSARLNCSMQMSALIASKHSNADYSSWCKGQMSYILGQNQFNTCLVTGFSSNSAKNTHHRAASGYQGYEGEYGFHEGVKTYHPTNGKVLIGALAGGPDKSGNYSDVIDDYKTNEVAVDYNAGLVGAAAGLY
GUT_GENOME147257_02543636-756IYANDFTGDIKYVKAAANAMDYLLGANPMNLSYITGYGTNAAKSPHHRFWAHAADENSPAPAPGALVGGPNSTSYSDPVAATLKGLCVGQTCYRDSINAWSFNEITINWNAPLVWVASALD
GUT_GENOME034781_0041651-177WGCLRYATTAGFLASVACDTVLKGTDTTKYQQFYEDQINYCLGNNPDGQSFVVGYGEKFPQNPHHRTAHGSWKNALDTPETNRHILYGALVGGPNEDGSYEDDRQNYINNEVACDYNAGFTALLCKM
GUT_GENOME105783_00765354-483YNAIGGGWGSARYNTSYQLYTLAYAKETGDTQYVSKAQKQMDYLLGENNLGQSYLIGYGNKYPTHPHHRGSGQNLKDANDTGDQLYTLWGALVGGPGGDDSYQDITSDYNKNEVALDYNASCVGALAGLY
GUT_GENOME211350_00781343-468NWGSARYNTAQQLVGLVYDKYNNTSTYKTWANSQMEYLFGGNNSGNCYMVGYSNNSVSHPHHRASSGLNTYLKENDSREHKYILAGALVGGPDQSDNFSDYTDAYKYTEVALDYNAAFVGALAGLY
GUT_GENOME250981_00461425-555NWGSNSELANKAWILLVAARTAGEKDYINYASEIADYFLGKNPLGKSFITGMGNNRVMNPHHRISTADSIEEPVPGLLAGGPNAQMNDKNFVKYDSHLPAMAYMDVEPSYASNEVAINWNAPATVIFAILD
GUT_GENOME235783_01155324-458TSSSYYFETKWGSARYNTAQQFAAILASKYGAKDYYSWSKGQMDYILGNKSVGSASATCFVVGLTDNSAKYAHHRAASGYSSFDEMGKNSQYSSNGHVLVGALVGGPLDANGTYNDTVQDYEANEVTLDYNAALV