UHGP-MC 85091


Information


Number of sequences (UHGP-50):
55
Average sequence length:
106±14 aa
Average transmembrane regions:
0.38
Low complexity (%):
0.36
Coiled coils (%):
0
Disordered domains (%):
37.75

Pfam dominant architecture:
PF12224
Pfam % dominant architecture:
182
Pfam overlap:
0.48
Pfam overlap type:
reduced

Downloads

Seeds:
MC85091.fasta
Seeds (0.60 cdhit):
MC85091_cdhit.fasta
MSA:
MC85091_msa.fasta
HMM model:
MC85091.hmm

Sequences list (filtered 60 P.I.)

Protein Range AA
GUT_GENOME112185_0104985-201MISHQDARRNLSPGELIYANSMVAEEIALENKGKMLEGYSKERREEKSSSVQMDGGCNESKSHTSPTHTREQVAKMSGVGTGTVARYDAIMKTDDEDLKKKVQTGEVKIGTAYREIR
GUT_GENOME230680_0113482-183MCNNQLGRRNLTPNNKKYLIGKRYEAEKMAQGGDRKSEEHKSTDKKYPLKKDSSHYSRMKIAAESGVSEGYVQKADEFAKGVDAAEEVVPGIKKEILSGQIK
GUT_GENOME232919_01697104-198NLNPYQRSELALRTEPLLREAAKKKQATSTGGHDPQLLETSRKAEKKIHTNKELASLAGVSDNTIRKARRIVRDADDAVKHELRQEKLSVNKAYT
GUT_GENOME124788_00964340-458FESREEVLAWICKNQLGRRNLTPEQKKFLVGKQYSIEHRKPGGNGNNQYTTATQETVQEELCQNDTIPPVSSETSIRRKIAEQHHVSESYVSRSEKFMKGVEIMEELVPGMQEKILSGQ
GUT_GENOME116224_00687102-222LAAMEWIIRTQKGRRNLTTYQIGQVALKLRDKIETRARAKMVAGGGDKVSAASRAGSATLPNPLAADSVSEDPIDTRKELAKTVGLGERTMGKIMKIDESAPELVKQAVAANEITVNAAYD
GUT_GENOME056437_0244089-196EWMVNTQKGRRNLTTYQLGQIALKLKDEIEVRAKEKLAEYHGNQYEDGPLTTLSKVQNDAVNTRKELAEAVGVGEVTMGKMMQIEESAPAPIKEALAKDTVSVNKAYS
GUT_GENOME096041_0199178-196WMYENQLGRRNLEPFARAEMALKVKPYIEERAKERSRQNLKQNAGNAQNFTESQKISTNTEVLNSAPREKGLKTVEILAKKAQVGKDTIQRVEKILEKAEPEIIEKARSGEISINQAYQ
GUT_GENOME000729_0195785-197QLGRRNLNSAQRIALVKKFRPIYEKQAKENQSKAVSESNKNRAKSALVNLPSMDSKLKPINTSEKLASIAGVSDKTYRMGEKVLNSDNLELKEEMLSGEKSVSAAHKELQRLE
GUT_GENOME074516_0441284-189QQLGRRNLTTLQRIAIVEKYRPIYERKAKENKSKSGGDRRSVSQNSSTPISNKDKIDVRAECAKDANTSTDTYSKGVKILNSNNTELINDINSGNKSINKAYKELN
GUT_GENOME120534_007611-78MEWMLDIQLGRRNLSPIQRIAIAEKYRNIYEKQAKENQLSGLKNQSNYVLPNLVKRDSIDTTKKLAEVADVKSSIRGF
GUT_GENOME000256_0460988-177NLSLAQRCELAMKLKPAIQKKAKENQEKAGGAVPAKSPKPVDTREELAKIAGVSSNTISKVEKIHEKGTPEQIERARKGGKGNSVNAIYQ
GUT_GENOME021842_0040895-218RYACLAWICKNQLGRRNLTPENRKYLIGKRYGAEKMSHGGARNASSVQNEHLKTGVKIAAETNTSESFVRRADQFAKGVDAGETVVNGFRDVVLAGEAKVSDYEIASIATCPEQERKQLVEELL
GUT_GENOME211803_0020595-197QKSRRNLSINELCKIALKLRPEVEARAKEKMAMVMAENRQFNPNNTNEQISTLVSKSVPDTLDTRKELAKTVGVGEVTMGRAMKIEDEAPQPVKDAVDSGDLT
GUT_GENOME103697_0078781-190EWMINTQLGRRNLPPQQRIAVVKKFEKKIQEQARENQSEAGKKFGNGKNSSSPNGEKLNDKKIRTDKELAKLAGVGTGTIARFNRVMSSDDEELKKKLLADEVKINTAYE
GUT_GENOME098713_00184110-207MLEQQLGRRNLSETERYKIVQKFKSVFEKKAKDNQSSGGKGLSNLSKVNTRKEMAKAVGVSEGTYQKIDKIMKSDNEDLKQKLEKKEISVDKAYKKIK
GUT_GENOME058701_0157883-186QKNRRNLTKYELAQIALKFKPVIEAKAKENQGKRNDLTSVRNLTKVNDENTGEIETIDTKKELAKIAGVSHDTIHKVEVIEEKAPEEIKQQVKKGELTINSAYV
GUT_GENOME224923_00782599-710LAWICKNQLGRRNLTPEQKLFLIGKQYEAEKSSHGEARKESHDENGRFHRSSQTDNSGEAMKTCERIAEENGVSKATVLRASKYMKGVEIAESLIPGMREKILNKQVKVSKA
GUT_GENOME018905_0021781-197EWMLDIQLGRRNLSPIQRIAVTEKYRPIYEKQAKENQRLAEGGDRKSVKYKENQGVVNLPQVDYTKERNPTTDKKLSDIAGVSEKTYRMGAKVLNSDNEDLKQRVLSGETSISAGYK
GUT_GENOME051843_0083984-191ICKNQLGRRNLTPEDRKYLIGKQYEAEKQSVGAPEGNQNRSFQWYQNDTIENRRTSERIAKENNVSRSSVVRAERFAQGVDAAEEAVPGARKEILSGRVKATDKAIAS
GUT_GENOME273401_019549-93FLSILYQIQRIAIAEKYRPIIEKKAKENQACGQGGVLLLTNSTKAKESVNTRDKLAKIAGVGQDTYCKGKRILDSDNEEIKDKLK
GUT_GENOME215540_0176981-192ICKNQLGRRNLTEAQKSYLRGKQYEAEKMAQGGDRKSEGFSNGQNVHLKSRREIKDGTAGRIGKEYGVNGRTIRRDAEFAKGIDMAEKTAPGIRDAILSGEVKVSKETVAQL
GUT_GENOME048656_0186319-75QWMLSHQEARRNLSPEEKASCVHVDTGSKERDFSTNTRQQVAKMSGVGAGTVARYDA
GUT_GENOME261789_0133838-133EWILQNQLGRRNLTDFQKNEIALRYEDVIAKRMRERQMEAAAKGGSSYSPVLVRKGMTKWSEPCLEPTTKRKELAKIAGTSQGSIQRSKLILEKGT
GUT_GENOME114536_0030671-202FGSNEEAVDWICANQLGRRNISEETRKYLIGKRYEAEKIIGYRKTTQRLLQPDDEPMLPEDEVFGERKAGVKYKTAQRLGKEYHLSHSTVQKYGKYSRAIDEIGRKEPALVPAILSGRYKISHENLLELASS
GUT_GENOME007813_0159076-190VIAWICRNQLGRRNLTPEQKKYLIGKQYEAEKQRQGTNNQYIKAKSESCQIDNFHSAQKTCERIAAENGISSRSVCRAEAFSKAVDIADAVEPGIRSELLAGEIKATEKDIRELV
GUT_GENOME148614_0043391-198LSPIQRIAVAEKYRPIYERQALLNKQLAMSEARKSNENNKNEQFSQNSATTVDKIDVRAKLAKTAGVSTDTYSKGKKILDSDNEKLKKEVLSGEKSINAAYKVLQDEK
GUT_GENOME143765_0090879-186WMIRNQMGRRNLTDFQRTELALRLKPLLEKKAKENQNSGLETSGKRHEGKVLQNSAKPIERINTTEEIAKSAGVSRDTVQKVQKVLSKAEPEIIEKARSGEISINAAA
GUT_GENOME059474_0352077-185EVIAWICCNQLGRRNLTPQQKKYLIGQRYEAEKQMGSFRGNQYALADKSGCSQFGNNQKSERSERTCERIARENNISKNSVIRAEHFAKGVDAADEVKPGIKQEILTGA
GUT_GENOME158628_0084482-188ALVWICNNQLGRRNLTPVQKKMLVGDRYEAEKMAHGSSERFCGNHDDSPCGQNGHMGESGLSRKKIAAETGTSESFVKRASQFSQGATAAEEVSPGFRQEVLSGKVK
GUT_GENOME180328_0226782-185QFGRRNLLPFQRSELALKMKNVIQAKAKEKQATSTGGINPQLLQISEKAEPIHTDEELSKLAGVSRDTIRKAEKIIVEGNDEQKERARTGGKGNTVNAIFNEIV
GUT_GENOME088219_0110072-162VILWIIINQMSRRNLTPFQRSELALRLKPKLTQEAKERQGERVDLKEDISVKSCESNRTGRTDHKLAKIAGVSEKTIYNTETILTKGTKED
GUT_GENOME283126_00931254-382AFNNKYEAIAWICKNQAGRRNLTSEQLSYLIGKRYQAEKQSRGSQERFQRKDSDESPKGKIYPLEDTHATAIKIADENGISEKSVRLFGEYAKGVDAAEAACPGVKQELLSGSFKPTKPEVIALSKLPA
GUT_GENOME163953_0012884-196DWMHTHQLSSKNLTAGEKLAMTMELQKEIALENEKKRKETEGRPRKENCTPIGVQNTDDNRSRSNTWTDSQTAKKAGVGVGTVARYNRVMNSDDENLKEKVKSGEVTVNKAYE
GUT_GENOME152415_0140884-198LLEAKQWALDTQKGRRNLEKWELGKIALKLKPEIEAKARANMVAGGQNYRPKEGLTTLSNLPSVETAVNTRKELADAVGIGEVTMGKVMQIDENAPEVIKEALDKKELSINKGYD
GUT_GENOME122265_0069589-184HFSRRNLTAAQKIHIVYKHKDDIAKLAKERMKNGGKQKRHLDGEAIDTRAVLAKMAGVGKETLRQYEVVIKNGTQELINRMITGEKSIFGAYKEVM
GUT_GENOME282543_0088081-194EWMINTQLGRRNLPPQQRIAVWDKFRRIVEDDNAKKKSEKISLSNKNRKSNDVQLDNNGLIKNDTSNYTRQQIAQKANVGSGTIARYDKVMKSNDEEIKKKMENGEISINAAYE
GUT_GENOME163498_0053674-196RWIILNQFGRRNLTKFQRSELALKLKPMLAAQAKERQKIYCGNQYDKKSGLRQNSVQVQKGKTSDDIAKIAGVSRDTISKVSVIQEKGSPEQIQRARTGGKGNTVNAIYHEITTKSKETKVCN
GUT_GENOME014197_0145674-187ENRYEVISWICLNQLGRRNLSGLQKQILIGRKYKAEKSAYSETHDFRGNQYTEVVSPQNEGIAKCKNPTAEKIGKELGVSHATVERAEKIVDGVDAAEEVLPGVTRDITSGKIK
GUT_GENOME281910_0106784-192QKARRNIDDGTLFKIAEKFRPYYEKKAKERQIEAGKNFGKGFQKSEKPINKEDNNTVPINTTKELAKTAGVSTDTMNKVIQVQKHAPEPIKKAVENNVISINKGYELTK
GUT_GENOME270972_0008770-205FESREEAIAWICTNQLGRRNITEATRQYLIGRRFEAEKRLGARNPVGYNQYVQRELSPQNEGKPLAIVSKYGIATNLGVEYGVAHSTIERYGRFADAIDQIKTASSDLATSILSGRVIVNQHDVLDMADLNSQQIR
GUT_GENOME018231_0085787-195ICKHQLGRRNLTPEQKKFLIGKQYHSEKSTRGGNHGNQYTQLANCQIDNLPSVENTTERIAKENNVSPSFVIRAEQFMKTVELMEKYCSGIQEEILSGKLKLSQREAAV