UHGP-MC 49041


Information


Number of sequences (UHGP-50):
56
Average sequence length:
216±24 aa
Average transmembrane regions:
0.03
Low complexity (%):
2.99
Coiled coils (%):
0
Disordered domains (%):
2.61

Pfam dominant architecture:
PF05594
Pfam % dominant architecture:
2321
Pfam overlap:
0.26
Pfam overlap type:
extended

Downloads

Seeds:
MC49041.fasta
Seeds (0.60 cdhit):
MC49041_cdhit.fasta
MSA:
MC49041_msa.fasta
HMM model:
MC49041.hmm

Sequences list (filtered 60 P.I.)

Protein Range AA
GUT_GENOME231435_045322021-2231PGSSVLVETDPRFTNFRQWTSSDTMLSQFRNDPGATLKRIGDGFYEQQLIQQQIIRATGQRFVGDYTSNEDEYKALLANGTAAGKAFGLNVGTALTEAQMAQLTSDIVWMVKQTVTLADGSKQDVLVPQVYLRAKDTDITGGGTLMAGSNVSFNAKGDATNTGTIASRNVTVVTADNIVNTGTLAGSTLLAQATQDINNLGGHIQGDQVLL
GUT_GENOME032367_005181486-1711LFETNIEFIDQSKLYGSKYFLEQIGYDSSKTTTVIGDAYYEQKLINDMIKNAIGYSSNIGSEEIKYLLDNALILGKELGLEIGKPLTPEQLSKLDQDIIWYVEIEINGVKALTPQLYFSKESRIKIAESQGAGGTSTIKAKENIDIKSEEFNNLNGNIVSNGNINIKSKNDITNSSTGGLNGGIVSNGGDIILDTENDINNIGGQISGNNVNLSSAGNIIIESTLG
GUT_GENOME207475_009941399-1627VYETRTSMINQDDYFGSEYFFDQVGYNPTEPVTVIGDNYFISELIRRQLNQSVGTFFSVRDGVEGADLVQMLMNNASLASDNIALGLEVGKPLTEEQRANLDQDIVWFVNQNIDGTDVLVPMVYLSAQTMDEMEKGVQQGSGSAVIHANNGVYIDATEVNNGNGTISAGKDVIIRSKGDINNVSYGMNGGIKAGGTIAAISAEGSINNDGAAMRAGQDIILSAEKGSID
GUT_GENOME023903_008361224-1451SRLYTLTRDPSARYLYETDPRYADYRNFLSSDYLLERVKADPEKVSKKLGDGWIEQRLVQEQLLALTGNPYLSGTGSSMETFKKLMDNGVTQAETLHLSIGVELSKEQTAALTSDILWLVKEKVNGEDVLVPKIYLARLSEKDITPKGAVITGSDVEIYSGKTLKNLGTLHGGHTLSLAAHTIENREGTITGNRIHMNADGDILNLSGTISAKDSLSLKAGNITNETA
GUT_GENOME171513_008712514-2775NPKLNGLGQLDPALFADLNAMLGVKPSSTAPQETRLAFTDEKQFLGSSYMLGRLNLNPDYDYRFLGDAAFDTRYVSNVVLNQTGNRYLNGIGSDLDQMRYLMDNAAAAQQSLGLQFGVSLTADQIAALDHSLLWWEKATVNGEAVMVPKLYLSPKDVTVNNGSVIAGNNVTLKGGDITNSGSSLLAKNSLTLDSQNSISNLNNGLMKAGGDLNLSAIGDINNISSTISGKTVALESLDGSINNLTQVEQIDINAGGKYGKIG
GUT_GENOME019688_006541446-1677LLKPADPTADYLIEMDPRFTDYKNFLSSDYLLQRISRDPQKVMKRLGDGFTEQQLVREQILNLTGKQYLAGFTSDEEQFKELMNNAAIAADKFGLTVGIALTAEQMANLTTDIVWLVEQEINGQNVLVPVVYLASVRQGELKADGALLAGGNIHLITGDTLTNTGTIAATGNSSISGTDIGNSGTITAGKDLELTAAQDITNSGGSISGTNVSLKAGNNIVNETTTNDIQYR
GUT_GENOME003284_0076159-271VSSKIYKLNNDPSAKYLIETNKKYADYHEFLSSDYLLERVKADPEKVSKRLGDGYFEQKFVIEQITKLTGRPYLGDYGSDMEQFAALMEAGAVAAEELNLEIGVALTAEQMASLTSDIVWLVEEEVNGQKVLVPEVFLASVRAADLTNDGALIVGGDVAIYSKENIENIGTIKADGTVDLKGENINNLGGRITGGNVKLDADKNITNKGGSIR
GUT_GENOME105947_000391048-1300SSLYKLHPESTAPVLVETDPAFTNKHRFLSSDYMYRQMTWDPQRVTKRLGDGFYEQQLVRDQIVNLTGRNRLDGYDNGEEEYKALMDNGLAYAKEFHLTPGVSLTKEQMAALTSDMVWLENTAVTVGGRTYTVLYPRVYLKPSSLRLTDDGSLVSGNTLTVDTEKDMENQGTLLGNTIIMKGKDLVNLGDILGKDIRLRADNDIHENGYIQGEDRVSLTAGKNISMENTILHGKNQDILGRTAGMAVNGDHGV
GUT_GENOME147195_032451059-1323STPNGLFILSPDPKGKYLIETNPLLTNMGNFLGSDYFMSSVGFNPETDVKFLGDAFYDSRVISQAIFEQTGQQYLNASVGSQLDQMQQLMDAAASQKKSLNLTAGIALSSTQVASLTQDILWYEEIEVNGQRVLAPKLYLAQATIDNLSSGAQIAGSNVSIDAGDIENSGQLSADSSLGVNSANSITNIGGSIVGGDISLTAKKDIINLSGDIKGHDVSLEATNGSMVNETRVKTSTASLGNNHGTFTDVGKTATITSTGNLALK
GUT_GENOME098791_014362945-3166LFNLNKGTDSQYIVETDSRFTNKREWLSSDYMQNALSVDHNNTHKRIGDGFYEQRIVRDQIIQLTGNQYLDGHNDNEQQYHALMDAGIRFSEEFGIKPGVTLTSEQMAMITADMVLLTSREITLPNGSVEKVLVPQVYARVLPGDLQNSGALLSGSEVILNNIPELKNQGTILSKNGLQVSTDNLINQGLIKGKTVDLTALHDIKNEGGQIAGQSRVNLQAG
GUT_GENOME227837_00613838-1098LPSALYVVAPDVTAGYYIETDPAYTNRHQFLSSAYFLEALQADPDRTAKRLGDGYYEMQLVKDQLLNLTGQRYVGSYASEEDQYKSLLNAAVAFAKESKATIGVALTPAQQAQLKEPLVWMVETTVLLPNGQVVTALVPQVYVTKAMMDEHNPHVAVISGKNIDIEATNEILNTGTVVAGDVGVLSATNINNVKGTLRGQSLGLRATDSIHNLGGTIEAVKALSIVAGNDITVTSTTRTIKEAIHSGTMVDSLGSLRVTGE
GUT_GENOME207743_006171549-1776IFETRPSMIDQSGYTGSDYFFNQVGYDPQQPVNVIGDNYFTSELIRREISSSVGSFFAIRDGLEGDALVQYLMDNAGVAAKDSELGLVVGEPLTDEQRNNLDSDIVWYVNQNVNGVDVLVPVVYLCPETLHEMETGEVNGGTATVHAGGEMNVDADTINNANGSISSGGNMTLVSDGDINNISNGMNSGISAGGDINMSSTSGNINNNGAAIKADGDVNMSAAQGDIN
GUT_GENOME013452_006771048-1273NPFATGKETNPGSTAGKETGGTLSFIPDSSLYKLHPEEKAKYLIETDPAFTDKKKFLSSDYMYNQLLWDNDKVNKRLGDGFYEQELIRNQVTQLTGMRYLNGYTNDEEEYKALMDAGIAYAKEYNLKPGIALTKEQIAALTSDIVWLETTTVTVNGKTYTVLYPRVYLKAGTAKVLTEDGSLISANTLITDTKGTLTNQGTLKGNTIVVKSKNIVNTGTIFGNDLS
GUT_GENOME207741_001121998-2183LFRQNLSPDNPFLIVTDERFTNRNKFISSDYLLDRVGYDPAQVHKRLGDGFYEQRLVREQVLKLTGRPSLYEGDAMAQYQYLMNNGVKVASDFQLRPGVALTPEQIAALEQDIVWLVSETVETAQGPQTVLVPKVYLANKTLDLLSHGALVSGNNLHLSAESIDNAGQLLANKGLEIEANQFTHQG
GUT_GENOME090869_011211002-1242TAANLTSSLYTLHPETTAKYLVETDPAFTNKRNFLSSDYMYQQLQWDPDKAPKRLGDGFYEQYLIRSQILDQTGRRYLGDYTDDMTQYKALMDAGITYAKAMGLVPGVSLSAAQAAALTSDMVWLETKTVTVDGQKETVIYPHVYLRAGSRQQLLADGSLISANTLIVDTKDAVTNSGILIGKNVQVQAGSVDNAGHVQGQDVAIVSENNIHQTGLITAADRLRMQAKGEITLENTVDHLA
GUT_GENOME136934_005851242-1448TLPSLYFVHPETTARYVVETDPRFTNKKQFLSSDYMTEQLGWNPDRVQKKLGDGFYEQELIRNQIMALTGKRYRDGYSSDEESFKALMDQGIAFAKEHKLTMGVSLTAEQMAQLTSDIVWLESKDVTVNGQVYTVLYPHVYMQPGQGMTLQTDGSLVSGKNLVVKTTKAIENEGALLGNTIVLNGSAVRNSGLITGGSILANSTGNI
GUT_GENOME007946_001551503-1749IQPDTRLPTQSLYKINPNVDSHFIIETDPDFTNKNRWLSSDYMLKALRNDPQNMLKRLGDGYYEQRLVREQMNRLTGRHFSGNNRTFTEQYKALMDAGITFAQKFNLAVGVSLTPAQVAQLTSDIVWIEKQTVILPSGKKVEALVPRVYAVVKKGDVDGNGTLISAEKVYVKGSELSNQGTIAGKQFTQINTESLINAGKLAGGVLNATVAGNLDNIGGVLEADRAMILNIGGDFSHRSTMQTNEIQ
GUT_GENOME231670_012122017-2240SHYLVETDPRFVNQKQWLSSDYMQNALTTDSSRILKRLGDGYYEQRLVRDQIIQLTGNRYISGYGSDEDQYRGLMNNGVAFGKAYDLELGVALTPAQMALLTSDIVWLVNQTVTLPDGTQQTVLVPQVYAKVKQGDLTGDGALIGGGSVALNARNDITNSGTIKGRDVTQLTAQSLTNSGYISGNTVSLTARQDITNLGGVIGGDKQVSLLAGRDITSQSTLRG
GUT_GENOME214138_039151491-1729ASLAHVAGLPDTSARSNPQKYLIETNPVLTDLKQFMSSDYLLSNLGYDPDTSAKRLGDGLYEEKLIDQAVVARTGQRFIDGQTSDDGMFKYLMNNALESKQALNLQLGVSLTSEQVAALTHDIVWMENETVNGEQVLVPVLYLANANNRLAANGALVQGSDVNLIAGKDLSNAGTLRASSNLTAAASNDLVNRGLLEADNRLDALAGNDLTNKAGGIIAGRDVGVTAVQGDVINERTLT