UHGP-MC 35678


Information


Number of sequences (UHGP-50): 50
Average sequence length: 88±8 aa
Average transmembrane regions: 0.02
Low complexity (%): 7.5
Coiled coils (%): 0
Disordered domains (%): 1.83

Pfam dominant architecture: PF13360
Pfam % dominant architecture: 200
Pfam overlap: 0.12
Pfam overlap type: shifted

Downloads

Seeds: MC35678.fasta
Seeds (0.60 cdhit): MC35678_cdhit.fasta
MSA: MC35678_msa.fasta
HMM model: MC35678.hmm

Sequences list (filtered at 60% identity)

Protein                 Range    AA
GUT_GENOME090984_00685  1-89     MRAAGVYFGGTPLNQRASGVANGTSGIDHVVNQQRALAAYVADDVHDLGHVGLGAALVDYGKRRVQPVGKVTGTGYGAQIRGYDHQVIA
GUT_GENOME223343_02219  4-86     AGVDLLRTLVHDGAGRVAEGSGGVDHIVDKHDLLAFYVADDVHDLAHVGLLAALVHNRERAADTGGEIARAGNGAQVGGYDHV
GUT_GENOME230054_00332  11-88    AAFFQGLGSVAKCSGGIDNIIGNDASAASHFTDEVHHFCHIGTRTALINNDHVRVEGLRHGTSANNATDIGGDHNRVF
GUT_GENOME101406_00304  68-141   FFSTKFLQGTCAFGQGACGINHVVYHDAGFAFNVANQVHNLSNAGFRTTFVDDSQRCIQHFSEIAGTAYATKVR
GUT_GENOME239436_00034  34-137   MRAASIYFLSTQLLQSLGTLAERTSSINHIINHDTGLVLYIANQVHNLSAARTRTTLLYNSNASTQQICHVAGTGYTAQVRRNNNHILQIQLAEIISQRWSSSQ
GUT_GENOME081757_00410  12-90    ALLNQCLCCMYQSTSGIDQVVDQHNLLTAYVTDDVHNLGDVCLRTALVDDCERQIQLLSKLTCTVDRAKVRGNDDEIVA
GUT_GENOME194202_00842  6-97     KDALCTALLEGTGAFAEGACRIDNIVDDDGRLALDIADEVHDLCAARFRTALLDDGNRCLQVVSENARTRDTAEVRGDDDDIVELLCAEVVR
GUT_GENOME168603_01198  1-81     MSTGRNNFGRAPFFKGFCAAAQGAGRIDNIINDNTGFPVYITDNIHHFRNTRSGPPFFNNRNGCVDPIGQITSARNAPQVG
GUT_GENOME071594_00890  1-98     MRCAGVDLLRAADVLYGLCRAAEGAGGVHDVVKQDARLSLDVADDIHDLGLVGALTALVNDGHVHAQLHGEGAGARGAADVGRDDHNVLIALAEHLHK
GUT_GENOME243532_00294  1-95     MSAADVNFFGSAHLDKRFRALSNGAGGINHVVDKQAGFVFDVADNVHDFHDVGFGTALVHNRKRAVKLVCHCARTGDRAHVGRNDDKVVALDERL
GUT_GENOME083836_00097  44-116   KRICGMTERSGGINDVIDDYASAAFDFSDDIHDFGNVGSGTALVDNCKIAFHALCNGASTNHTANVRRYDNQI
GUT_GENOME049034_01152  122-205  MAYGCIDISCTSFCKGFGTFTERACGIYDIIDDKAGSVLHITDNVHDLGYARFRTTFFNDGDRSTDTVSQLSCTGNAAEVRRND
GUT_GENOME026414_00517  1-94     MGSAGVDILGTADLHQSLGGVAQGTGGIHQIVVQDAALTPDVADDVHDLGNVGLGTALVHDGKTHVHLGGKIPGAADGTDVGRDHHEVLVVVLA
GUT_GENOME218345_00144  1-85     MRCASINFDCSCIHQSLSSQRDRAGRINHIINQNAALAINGTDNVHDFSNVCTRTTLIDDCNRRLQEFSQLTGARHTAVVRRYDD
GUT_GENOME072818_00556  1-90     MGTAGVNLSCARVRKSLCPLRDCARRVYHVVKHKAHLVLYVADDVHDLHDVGFGTTLVDDCQRSVEFRRHGTAARHAAHIGRYNHQIVVF
GUT_GENOME086476_00279  11-89    PAALHQSSSGVADGAGSIDDIVHDDAGLILDIADDVHHFRYIGTGTALVDDGDGSAQTVGKLSGAGYRTQVRRYHNQIL
GUT_GENOME194824_00900  536-628  GAVNLGCAAALYQSVRCVYERACGVDQVVNQNALLALYVADDVHNLGYVGLRTALVNDGKRHIQLLGELAGTGNRAQIGRNDNGIIGADAELA
GUT_GENOME194177_00668  35-120   GAGRHALGSARLNGAGRIAQRARGVDHIVHDDHVPAFHLADDVHHLRNICPWTPLIHNGQRNVQPFGKGTGAGHAAQVRADDRHVV
GUT_GENOME245007_00749  1-98     MRTASVNLACTAVDKCLCAVCDCAGGVNHIVYQNSGFAFDVADYVHNLRNVCFRTSFVDNRDGSVKTLCKVSRSCNTAEVGRYNNDIIQNVFHFLSEE
GUT_GENOME088765_00248  25-127   GAGIHVLGSAHFHQSGSGVAQGTGGIHHVVVDDAGLAPHVADDVHDLADIGLLTALVHDGQAHVDLPGEHTGTAHAAHVRGDDHELIVVVFLLRELLQEVVHE
GUT_GENOME272522_01242  14-100   FHDSLCSVAEGTCGVNHIVNEDNVLILNVADDVHNFAYVCLLTSLINDSKRSVKLSGKVSCSGYGAKVGRYDDGIVLDVLELLNEVG
GUT_GENOME120994_00915  1-106    MGGAHVDVAGAQGLELADVGAQGACGVDHVVVDDAGLALDVTDDAHDLGLVVVRATLVGDREVAVEHIGQLLGCLGTAHVGGHDNEVLLVEVHSVVVIGEQWQGGK
GUT_GENOME017721_00824  1-91     MRASRINFRRAEIFQSLRALADRAGGINHIVHHKAPLALHVADDVHHFHHVRFGAAFINNRNRHAEFFGNFSRALHAAHIGRNHHVDIAFF
GUT_GENOME171567_03933  9-91     LGTALFQRFSGFTQRTAGIDHVVNQHAVAASYVTDDVHHLRNVSAWTTLVDDRHVGIVKQLSDSTGTYHAADVRRNHDWVVQV
GUT_GENOME090015_00620  4-83     AGINLFCAADFPNCLCGVAEGSCRVDHVVHENDVFVLYVADYVHDLADIGTLTALVHNGDGSVKTAGEVSPSCNRAEIGG
GUT_GENOME105273_01632  1-90     MRAGCVHFLGAAFLERRSCVGDGAGGVDHVVDEDDVLAVDFTDDVHNFRNVWFWTTFVDDGQWRMQEICEFTRTGNATDVRRNDNNVIEA
GUT_GENOME088344_00587  1-83     MSCTSFSKRIRRMTDCTRSINHIINQDTCFTFNVTNNIHNLRSVHFINTTTLINDCNWCFQTLSKFSCTSHTTMVWRNNNNII
GUT_GENOME044016_00101  298-379  MCTAGIYLDCTSFNQRIGCICNGTRGINHIIQQDNLFTLDVTDDIHNLADICLWSAFINNCNRNIQLFRKFSDSCYAAQVGR
GUT_GENOME118203_00199  16-97    VRAASVNLVRAVVIKRFRALTNRAGSVYHIVYHKASLAFYVADNIHYFDLVSLGTTFIYDSHRTVEFRSDVSRSRNRTYVGR
GUT_GENOME214896_00636  1-89     MRGAGIDLLRAADLDERFGSAAEGTGGIDHIVEQDDGLAFHVADDVHDLGFVGDLAALIDDGEVHVELHGKLAGAGHAAHVGGDDHHVG
GUT_GENOME101964_00348  11-102   SAKLDCSLSSVAECTCCINHIVNKNNLLALNITDNVHNFTYICSWTTLINNSNRATEACSKVSCPCNRTKVGGYNNIIIWVVANHLLEVVSK
GUT_GENOME252783_00630  9-96     LGSSFFQCLGCLGQRTGGIDHVIHDHTVSAFHIADDVHDLGFVRPWTALVDDRQVTLETFCQCSGTDDSSDIGGDDQKILVVFLLEIL
GUT_GENOME237153_00418  1-86     MRADGVNFTRAAFFDFLGYGGQRAGRIHDIVKNNRHFVFYITDNVINHGFVRFRAAFINNHQPRVHTGGKFAGAFHTAHIGGDNHD
GUT_GENOME244166_01006  1-88     MRSGNVDILRSPFGQKVCRIGNRAGGINDVVNDDAVLVFHIANDIHYFRNVGFRTVFFNDGDGQTEQMRKSACACDAAQIRGNHHPVV
GUT_GENOME072069_00733  16-103   VRTASVDFFRAVVEKCLRTLTYSSCGIDHIVDYKASLAFHVTDYVHNFDLICFRSSFIHYRHRTIEFSGDVSCSGNGTNVRRNDHEIV
GUT_GENOME195331_01503  1-100    MAGAGVDLVGAAHFHDGLGSVAQRACGIHHVIKEDAVLALHVANDIHNLALVGLLAALIHDGQLHVQLLREGAGAGHGANVGRNDHHILALVAELLGIII
GUT_GENOME222354_00847  4-82     TGDNIGCTADFHQCLCCIAQSACGVDHIVHDDDLSAAYVTDDVHNFADVCFWSSFVDDSQRNVAALSKFSCSGDRTQVR
GUT_GENOME040245_01160  17-89    GLGRVAQRTGGVDHIVKEDALLVLDVADDVHDLTLVGLFAALVDDGEVHAHLVGKRAAAGDGADVRRDDDHLL
GUT_GENOME260207_02578  20-89    GGVDHIIHQQSALAFHIADDVHHLGHVRLGPALIHNGQRGVQAVGKLTRAGNATQIGRNDHQVFPVEAPL
GUT_GENOME241989_00124  49-140   DALGAALLQGTGGVAERAGRVNDVVGDDAGAALDFTDDVHDFSDVGARTALVDDGEVDFESLGHGAGADNAADVRGNNHQVVVLAFDEVVEK
GUT_GENOME218263_00909  1-100    MACAGVDLLSAADLDDRLGCAAKRACGVDHVVKQDAGLALDIADDVHDLGLICSLTALVDDSHVAAELEGEVAGSCGAADIGGYDHDIVVALAVNVKIVL
GUT_GENOME229920_01247  34-132   MGHAGIDIFGAEFLQSLSPFAQGSCRIHDIVHHDADLAFHIPDQIHDFCHTGFRPSLVDDGQAAFQPAGNVPGTADATQVRRDHYHVFRSEALFLHIIC
GUT_GENOME164942_01164  1-85     MRSAYVYFFSALLLQRSYSSTQGASGINHVVVNNADLAVNVTDKSGDFSLVMTRTAFVHDSQVGVKHVRKLLCSLSAANVGANHA
GUT_GENOME157142_01192  1-83     MRAASNNFLSTVVCKCLCALRNRACGVDHVVEHKAGLALDVADDVHYFHHVCLGTAFIYDCKGRIELACHCAAAGNTAYVGAY
GUT_GENOME012563_00362  1-82     MSAGSEDTGSTHFFQGFRATAQGTSRINDIVDEDAGLAFDVADDVHDFSNARTGTAFFDDGDRCIDAVSQIAGTRYATEVRR
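
Each row of the sequence list carries a protein identifier, a residue range, and the amino-acid string for that range. The following is a minimal Python sketch for splitting a row into those three fields and checking that the range length agrees with the sequence length. It rests on assumptions inferred from the rows themselves, not on any documented format: identifiers are taken to be `GUT_GENOME` followed by six digits, an underscore, and five digits, and the range is taken to be inclusive. The names `ROW`, `parse_row`, and `range_matches_length` are hypothetical helpers. Whitespace between fields is optional, so rows with fused columns parse the same way.

```python
import re

# Assumed row layout (inferred from the list above, not from a documented spec):
# GUT_GENOME<6 digits>_<5 digits>, then an inclusive start-end range, then residues.
ROW = re.compile(r"^(GUT_GENOME\d{6}_\d{5})\s*(\d+)-(\d+)\s*([A-Z]+)$")


def parse_row(line):
    """Split one row into (protein_id, start, end, sequence)."""
    match = ROW.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognised row: {line!r}")
    protein_id, start, end, sequence = match.groups()
    return protein_id, int(start), int(end), sequence


def range_matches_length(start, end, sequence):
    """True when the inclusive range start..end spans len(sequence) residues."""
    return end - start + 1 == len(sequence)


if __name__ == "__main__":
    row = ("GUT_GENOME223343_02219  4-86  "
           "AGVDLLRTLVHDGAGRVAEGSGGVDHIVDKHDLLAFYVADDVHDLAHVGLLAALVHNRERAADTGGEIARAGNGAQVGGYDHV")
    protein_id, start, end, sequence = parse_row(row)
    print(protein_id, start, end, range_matches_length(start, end, sequence))
```

A row that fails the length check would suggest that the identifier pattern assumed here does not hold for that entry, so the result is best treated as a sanity check and not as validation of the underlying data.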