UHGP-MC 127045


Information


Number of sequences (UHGP-50): 87
Average sequence length: 104±12 aa
Average transmembrane regions: 0.1
Low complexity (%): 5.31
Coiled coils (%): 0
Disordered domains (%): 10.46

Pfam dominant architecture: PF00861
Pfam % dominant architecture: 6782
Pfam overlap: 0.89
Pfam overlap type: equivalent

Downloads

Seeds: MC127045.fasta
Seeds (0.60 cdhit): MC127045_cdhit.fasta
MSA: MC127045_msa.fasta
HMM model: MC127045.hmm
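
The seed file listed under Downloads is FASTA, so the sequence count and length statistics reported under Information can in principle be recomputed from it. The following is a minimal sketch using only the Python standard library. The function names `read_fasta` and `length_summary` are illustrative, and the use of the population standard deviation is an assumption, since the page does not state how the ±12 aa spread was computed or on which file.

```python
import statistics


def read_fasta(handle):
    """Yield (header, sequence) pairs from a FASTA-formatted text stream."""
    header, chunks = None, []
    for line in handle:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # A new record starts: emit the previous one, if any.
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        else:
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def length_summary(handle):
    """Return (count, mean length, population standard deviation)."""
    lengths = [len(seq) for _, seq in read_fasta(handle)]
    # Assumption: population SD; the page does not say which estimator it uses.
    return len(lengths), statistics.mean(lengths), statistics.pstdev(lengths)
```

To apply it to the downloaded seeds, open the file and pass the handle, for example `with open("MC127045.fasta") as fh: print(length_summary(fh))`.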

Sequences list (filtered 60 P.I.)

Protein Range AA
GUT_GENOME169939_00157 52-156 HIRNKVSGTAAKPRLCVCRTGAHIYAQVIDDEAGVTLASAATVEKDFRAQKLSANIKSAEVIGDAIAQRALAKDIKTVVFDRGGFPYHGCVKALAEAARKAGLEF
GUT_GENOME090001_00140 168-274 IRRKVRIRKKVQGTADRPRLVVYRSNMHIYAQIVNDQEGATLVAASTLSLAKTEAGMHCNLAGAEKVGMEIARLAKEKNITQVVFDRNGYLYHGRVKAVADGAREGG
GUT_GENOME187669_01603 6-117 DANKARQKRHTRVRGKISGTAECPRLNVYRSLSNIYVQLIDDVAGVTIASASTVEKEFTQYGGNVEAAKAVGKTLAERAKAKGIEECVFDRGGYIYQGKVKALAEAAIRAGC
GUT_GENOME021980_01813 23-135 KRIFQKRKDRVRSSLKKAANGKMRLSVFRSGLHIYAQIIDDAKGVTVVSASTLEKAIRDNIKKSSTVEAASYIGKVVAERAVKSGVSEVVFDRGGYIYHGRVKALAEAAREAG
GUT_GENOME212980_00168 6-112 ARRNKIKTRIRGRVQGTAERPRMSVFRSNKAIYVQLVDDLKGETLVSASSKGLEGGTKTEIAAKVGEAVAGKAKEKGIETVVFDRNGYLYHGRVKSLADAARKGGLK
GUT_GENOME143011_01212 9-116 KLTLRIKRKKRIRAKISGCENFPRISVFKSNRTLYIQAIDDVKAVTLAAVDGRKLGVKANKEGAKKIAAEFAKTLKAKKIEQAVFDRNGYVYHGVIAALAESLRENGI
GUT_GENOME084369_00297 6-119 SRSEIRVKKHNRMRNRFAGTAERPRLAVFRSNNHMYAQIIDDTVGNTLVAASTVEKEIKAELEKTNDKAAAAYVGTVIAKRALEKGIKEVVFDRGGFIYQGKIQALADAAREAG
GUT_GENOME027360_00671 9-117 DRKRLHRKVHIRKTVYGTADRPRMTVTRSNKNLSVQVINDDEGTTLASISTLEKEFEALKPNIDGAKKLGEAFGARLKDKKITKVVFDRNGYLYHGVVKALADGTRSAG
GUT_GENOME207329_00595 25-136 KEQKLKTRKYRVRNKVSGTAQKPRLSVYKSNTNIYAQLIDDDAQVTLASANTLQKEVNEGLENCANIEAATKVGEMIAKVALDKGIEEVVFDRNGYLYHGKVKALAEAAREN
GUT_GENOME056864_00300 7-138 KNSIRQKRHQRTRRHILGTAERPRLNVYRSLNHIYAQVIDDVNGVTLASASSLDKAIEGYGGNVTAASTLDAEVAAQLEGKTKSEAAEIVGEMVAKRAIEKGVKQVVFDRGGYLYTGRVQKLADGARKAGLE
GUT_GENOME091990_01317 13-127 KRAGLKRRERRVRSKICGTAERPRLSVHRTNAHIYAQVIDDVDAKTICTASTLDAEFRATGNLGSNKEAAEFVGKMIAERALKAGVETVTFDRGGRIYHGRVKALADGARSAGLK
GUT_GENOME039288_01193 5-89 PNTNAQRQKRHKRVRSKVFGTTQRPRLNVFRSEKNIYAQIIDDVAGNTLVSASSLDKEIEGNGGNKTAARAVGKLVGKGSSMPGK
GUT_GENOME239356_00745 7-122 KKAALRLRRQRSIRHTLRGTAERPRLCVFRSNSHMYCQLIDDNAGATLCALSTLHQSVKALVDGKKPVEQAKILGAEFAKLCAEKSIKQVAFDRNGFIYHGRIQAVADGAREGGLE
GUT_GENOME243912_00134 7-127 KKAARIHRSHRQNGIRGTEACPRIVVFRSNRHLSAQLVIDNVPVTVEGHTNDTASKVLAYSSTEALSLQGATVENAKKVGADLAQKAKDLKVENVVFDRNGYLYHGVIKALADALREGGLK
GUT_GENOME057972_00520 92-207 MVSKKSRNSMREMRHARIRKQLHGTEAAPRLSVFRSNTGIYAQIIDDDNKKTLVSASSLDKDLKLKNGSNIEAASKVGESIAKKAKEAGITKVVFDRGGYLYHGRVKALADAARSN
GUT_GENOME207154_00611 51-148 RLRVKEYEKDHIAEFIEENVSGADRYVPMETLLILAERRKAIEGYGGNVTAASAVGKLIAERALAKGIENVVFDRGGYLYHGRVQALAEGAREGGLEF
GUT_GENOME001284_00831 8-105 KKARAKRHKRVRSKISGTASCPRLCVFRSTSNIYAQIIDDVAGVTLAQASSLIRKSQAMAETKRQLRLLASSLQSVQKKRISLMLYSTEAVTYITAVL
GUT_GENOME223868_01416 7-121 KQAALARRHRRVRGKIAGTAARPRVCVTRSNSNMYVQVIDDVAGKTICGVSTLGPDFKATGKSGATVEGAAALGAIVGKKAQESGVTEVVFDRGGNLYHGRIKALADAAREAGLK
GUT_GENOME174580_00642 239-337 KNVASSEIYPTWPVEALKANIYAQIIDDVAGNTLVAASSLDKAIEGNGSNKAAARAVGKLIAERAKAKGIDTVVFDRGGYLYHGRVAELAEGAREGGLE
GUT_GENOME095216_00081 5-118 NRKAQRVRRHIRIRKKVSGTAERPRFCVSVTTNHIYCQFIDDVNGRTLASTSSLDAKFKAENARPNMAGAALLGKLAAEAAKAVNVTEVVFDRSGYKYHGRVKAIAEAARENGL
GUT_GENOME138014_01595 210-306 SGTASRPRLAVYFSNRHVYAQIIDDTVGRTICSASTADASIADPSKLANCATATKVGNLIGERAKAANVTEVVFDRGGFKFCGKVKALADAARETGL
GUT_GENOME188675_01894 5-125 KIERRERIKMRIRKIVSGTPEQPRMTVYRSNKQIYVQFIDDLAGVTLATASSLDKEVAEAAAGKNKCDVAALVGKLAAGVAHSIRNPLTGLKLRLFSFTRGMELSPAQQEDVQAMNEAVRH
GUT_GENOME186320_01275 16-105 RRHARLRKRISGTPELPRLVVSRSNRHMVAQVVDDTKGVTLVSASTLQADFAGFKGTKTEAAKKVGELIADKAKAAGITDWEDPKSIFKK
GUT_GENOME105044_00485 31-130 DKSRLVVRLSNAHATVQVIDYAPEGDITLASAVSKQLANYGYLGGASNTSAVYLTGYLCAKRALAKGVDSAILDIGLKSAIRGSKVFAALKGAVDAGLDI
GUT_GENOME279359_00298 5-115 IDKNIARNRRHARIREHVLGTASCPRISVFRSHKGIYVQLVDDVNHVTIASSSSLALGLTDGGNVEGAKKVGADIAKKALKLGIETAVFDRSGYLYHGRVKAVAESAREAG
GUT_GENOME110584_01048 6-114 SKKVLRNKRHMRTKGIKGSATCPRISLFRSNRYITVQLIDDVNHKTLVHVSTEKLGLADGKNIEAATKLGALLAEKAKAANITKCVFDRSGYVYHGRIKALADAAREGG
GUT_GENOME232348_00748 8-114 RKAKHLKITNKLSKGTALVPRVVVFKSLHAFYAQAVNDELHTTIASSSTQILENKANNIDAVKLVAKDFAKKLLAINVKSIVFDRSGYIYHGKLAAFADTLREEGIK
GUT_GENOME140356_04145 22-131 RRVSRLRRHARLRKKIAGTPERPRLVVNRSARHIHVQLVNDENGTTVAAASSIEDDVRSLQGDKKARSVRVGQLIAERAKAAGIDSVVFDRGGYTYGGRIAALADAAREN
GUT_GENOME237864_01459 8-138 RRIKIKYRVRNKITGTSARPRLTVFRSNKQIYAQIVDDSWENTEVTLVKKVRKDHKIVEIPYHPYGKTLVAASSLGMEAMPKCEQAAKVGEAIAKKAIEAGISEVVFDRNGYLYHGRVKEVADGARKGGLK
GUT_GENOME037767_00733 7-99 KNEARLRRHRRVRNKISGTAARPRLDVFRSAKHIYAQIIDDVNGVTLASASSMEKGFEGCGGNKEAARLAGIDVDKLTVMVYTFCGFCAAIAA
GUT_GENOME176473_00235 10-121 KVAARKRRHLRVRKRVFGTTERPRLAVFRSNRHMVAQVIDDQNGRTLAAASTLEAELRSEGDSKTEAARKVGQAVAERAKAAGIEAVVFDRGGNKYHGRIAAVAEGAREAGL
GUT_GENOME283701_00634 24-131 RLKLIGLDKHRLVVRITGNHTIAQIVDVHLEGDQTLVSAHSQELKNMGWLASGKNTSAAYLTGYLCGKKAVKEGITEAVLDMGLATSTKGSRVYAALKGALDAGLDLP
GUT_GENOME048252_01621 6-88 NRKMERARRHARVRRKISGTAERPRLCVYRSNSNIYVQIIDDVAGNTLVSCSTLDKDIKTKHANKEAATEVGTMIAKKALEKN
GUT_GENOME218093_00940 5-120 KEQRMRRARQTRIRIAQQGVARLSVNRTNLHIYASVFSEDGTKVLATASTAEAEVRAQLGAAGKGGNVAAAALIGKRIAEKAKAAGVEKVAFDRAGFAYHGRVKALAEAAREAGLQ
GUT_GENOME032633_00423 7-117 DKKQQRDYRHKRTTNKIKAIANDKPRLVITKTNAHIIAQLIDDNQQKTLTSSSSIQLKLPNGNVKNSELVGKDIAKKILALGITEICFDRGGSKYHGKISKLADAARAEGL
GUT_GENOME109793_01122 8-121 NKLLQRRRWRIRKAVVGSPERPRLVVKFTNLHIHAQIIDDAAGRTLIGASTTQKAMREKKLLPNVAGATEFGKLFGEAAKNAGFSAVVFDRAGRRYHGTVKAFADAAREAGLQF
GUT_GENOME206037_00001 12-101 QERHLRIRQNLAGTALRPRLNVFRSNKQIYVQLIDDVNGKTLCSASSLDKELALTNGANVADGNVLIPDYLETQLEALLDRSDRIQINCD
GUT_GENOME139027_01542 4-114 RKTQRAKRHLRLRQRVIGTAEKPRMSICVTNKHMYVQFIDDSEGKTLASASTISTIKEKKANLETAKALGSAAAEAAKAKNITNVVVDRGGRKFTGRIAAIVDACVAGGVS
GUT_GENOME047527_01532 17-101 RVRNKISGTAARPRLDVFRSAKHIYAQIIDDEQITIEDIAVTFCCLRFSGKGFLSQQFKQVVSGKINYGVTGIQQLFCDAVGQKS
GUT_GENOME172381_00927 23-117 NPGKIRLSVFRSGRHIEIQAIDDVRGHTVCAASSKEKDFKGKGWNVAGAELVGKAFAARAVAAGIAGNCYFDRGGYRYHGRVKALCEAIRAGGVR
GUT_GENOME100276_03601 14-128 RREVRTDYHQRLRLLKSGKPRLVARVSNKHVRAQLVTPGPQGDETHAAATSADLDEYGWEAPTGNLPSAYLTGYLAGKRALAAGVEEAVLDIGLNTATPGNKVFAVQEGAIDAGL
GUT_GENOME147684_01834 12-118 ALRIKRKKRVRGNIFGTAEKPRVSIFKSNRYVSAQAINDVEGVTLAAVSSKTMGLNVNKENAGKVAAQLAENLKAAGIESVVYDRNGYLYHGVVAAFADGLRANGIK
GUT_GENOME120688_00313 5-119 KKKQLAQKRRWRVRNKVSGTPERPRLSVCFSNKHIYAQCIDDTVGKTVVALCTTSKELNGEKILPNVAGAQTLGAKFGDKLKAAGVSAVVFDRGSRTYHGCVKSFADAVRAAGIN
GUT_GENOME275842_00182 8-121 QRLEQNRRFRIRSKVKGTLERPRMALCLSNKHIYVQCIDDTKGNTLKALSSLSKDLRSQKLKPNCAGAEAFGKLFGAFLKEQGILSVVFDRAGRSYHGCVKAFAEAVRSQGINF
GUT_GENOME103572_01521 22-105 GATRLVVHRTPRHIYAQVIAPNGSEVLVAASTVEKAIAEQLKYTGNKDAAAAVGKAVAERALEKGIKDVSFDRSGFQYHGRVRF
GUT_GENOME046496_01073 1-82 MINKTSRNEARKARHARIRNKVSGTSELPRLCVFRSLKNISVQIIDDEKGVTLVSASSLDKDLNIKNGGNIEASKLVGALVA
GUT_GENOME096321_00366 9-114 KKALRDRRKLRIKSKLLGDKLRPRVSVFRSNRYFYAQAIDDVRQSTITHIDGRKMGFKNTQEDAKKLGALFAEELKKAGIERAVYDRNGYLYHGVVAAFAESLREN
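
Each row of the list holds three fields: a protein identifier, a residue range, and the amino-acid sequence covered by that range. A minimal Python sketch for splitting a row and checking that the range span equals the sequence length follows. It assumes that the identifier ends in a five-digit gene number and that the range is 1-based and inclusive; both assumptions are consistent with at least the rows for GUT_GENOME039288_01193 and GUT_GENOME046496_01073, but neither is stated on the page. The names `parse_row` and `span_matches` are illustrative. The pattern tolerates rows with or without whitespace between the fields.

```python
import re

# Assumption: identifiers look like GUT_GENOME<digits>_<5 digits>.
# Whitespace between the three fields is optional.
ROW = re.compile(r"^(GUT_GENOME\d+_\d{5})\s*(\d+)-(\d+)\s*([A-Z]+)$")


def parse_row(line):
    """Split one row into (protein, start, end, sequence)."""
    match = ROW.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognised row: {line[:40]!r}")
    protein, start, end, seq = match.groups()
    return protein, int(start), int(end), seq


def span_matches(start, end, seq):
    """True if an inclusive, 1-based range covers exactly len(seq) residues."""
    return end - start + 1 == len(seq)
```

Rows for which `span_matches` returns `False` would indicate either a different identifier layout or a range convention other than the one assumed here.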