UHGP-MC 100510


Information


Number of sequences (UHGP-50):
109
Average sequence length:
85±5 aa
Average transmembrane regions:
0.01
Low complexity (%):
4.22
Coiled coils (%):
0
Disordered domains (%):
0.54

Pfam dominant architecture:
PF03319
Pfam % dominant architecture:
9266
Pfam overlap:
0.95
Pfam overlap type:
equivalent

Downloads

Seeds:
MC100510.fasta
Seeds (0.60 cdhit):
MC100510_cdhit.fasta
MSA:
MC100510_msa.fasta
HMM model:
MC100510.hmm

Sequences list (filtered 60 P.I.)

Protein Range AA
GUT_GENOME022033_00223214-297MIIGRVYGSVVSTHKLEGLVGYKFMLVQCIENKNLVDKFLVAVDGVGAGIGEDVIITTGSSARVAIGDANSPVDATIVGILDEK
GUT_GENOME261579_008801-88MIVGKVTGTVVCTQKDKGLQGQKMLVVQPVNIENLKSSGGKMVALDSVGAGEGELVVVVGGSSARMAEGYSSTPVDYCIIGIVDSIEV
GUT_GENOME250828_000871-86MELAKVIGQVVSTVRCPGLPYNSLLLVDLLNEKGESIGRSQVAADPIGAGEGEWVIVSRGSSARFAIDKDAPLDLVIVGIVDHVNA
GUT_GENOME165683_005211-84MLVAELVDTIWATRKSEALNGVKFLLAEVKGGRRSGELLVVVDMIGAGIGDRVIVATGSAARRMMENDEMPVDAAIVGIIDENY
GUT_GENOME153011_00307103-185MIAARVIDNIWSTRKADCLVGIKFMIVEVIDGKDAGRRFIAADLINAGIGENVIVSQGSAARQMFEPDTMPVDAAIIGIIDEE
GUT_GENOME238250_014381-86MRIAIIKGHVTATVKHPTLEGWRMLIAQPVTPDDSPDGPPQIVIDPLGAAVGQKVVINSDGAEARRLIGAKHSPARWTVLGIVDPA
GUT_GENOME176978_019591-87MYLAKVTGALVSTTKHASLNGAKLLIVARLDEHYQPTGTAQVAVDFVGAGNGETVIVTTGSSARMTTSKEHSVIDAAVVGIVDSLDL
GUT_GENOME096098_007981-87MFLAKIVGKIVSVTKNEGLHGKKILIAVPINMNDEVIGGEIISLDNVGAGIGDKVLIANGDVARFAFDDVKDYPIDSAIISIVDSVE
GUT_GENOME261579_008731-85MYIGKVIGTVVSTCKEQNLKGLKLLIVRNLFEKDPDKSEIAVDAVGAGVGDCVLVTIDGGAARMAAGVKDAPINNAVIGIVDHPE
GUT_GENOME018630_001921-89MMLGIVVGQVVATRKDENLTGCKLLIVQPCPYADEASNKMPPIVAVDTVGAGVGETVLYVRGSVAPRAMHNLDAPVDTSIVAIVDRVDR
GUT_GENOME262233_014811-85MFFGKVVGTIVATRKDIHLEGRKLLIVQRTDNQGNPQGDMLVAVDYVQAGRGDFVYLAKSKDAGFPVPERNAPIDAGIVGIIDHT
GUT_GENOME232032_021151-89MHLAKVIGSIVATQKIGSLTGKKLMLLRMISFDETGEEMLWGAAEVAVDLVGAGEGETVLIARGSPVRHLFPEPNHGIDLVIVGIVDSV
GUT_GENOME139234_013151-87MVMGKVTGSVWATKKDEHLTGQKLLVINLIKNGKTTKDEIVATDMTGAGQGDVVLVALSASARFTLPSPDAPVDAAVVAIIDRIEIG
GUT_GENOME242963_003241-79MVLAQVIGKIWATKKIKSLNSYKMLTVQEEQSGKIMTAIDTLDAGKGDRVVITRGSSAMKNEINYNLPIDATVIAIVDD
GUT_GENOME245203_017441-85MIIAKVVGNLWATRKEESLVGRKLMMVQPASLEGDVQGECFVAVDTIDAGVGEMVLVAQGSSARKSLGQTDSPVDAAIVAIIDIH
GUT_GENOME123912_008951-86MLVGIVVGNVWATRKEDALNGLKLMVVQRLDLAHNKLAESFVAVDCVGAGTGEKVLITTGSSARKALFNEEAPVDAAIVGILDQED
GUT_GENOME092251_001951-89MTFARVVGNVVATHKKEDLRGAKLMIVQPVDNFLKDNGDEMVAIDTVGAGIGDLVLVIYEGWAARTCFPTQNPLAPIETVIAGIIDEYV
GUT_GENOME138783_012571-87MEIGTVIGSVWATKKHEDIEGQKLLVINIRKTRKEGKEALIVAADTTGAGVGDVVLVCRGQAARCAAGREKIPVDAAIVGIIDSMEL
GUT_GENOME207678_011371-85MHLAKVIAKVVATQKIDKLVGGKLLVIRAIDTDQNTVDNEPLYVAVDSVGAGVGDCVLVDWGGSVDNDCRMVGDMSIVGIVDRIE
GUT_GENOME062906_001761-90MHLAKVVGNVVSTQKDSNLVGCKLLIIKKINENGEFEKYSSQSTAIAVDSVGAGIGETVIVTTGSTARYVYGDKLAPLDMTIVGIVDEIQ
GUT_GENOME061859_008061-88MQTGRVIGSIVSTQKHESLVGLKLMIIQYVDGNQEPLPSYEVAADTVSAGIGEYVLLTRGSSARHVFGDGQDINSAVDCAIVGIIDSF
GUT_GENOME142595_025301-87MQLAKIVGNAKSITKSDELYGAELLIAVPVDMETMQASGQPFLVADKLGAREGQIVVCAAVCTFQEGDAAINMVVAIPEALTWDGEK
GUT_GENOME143731_003601-90MYMARVVGSVVATQKDPSLVGKKLMIVQQINSDQQPVRFEQVAADTVNAGIGDNVLIVRGAGARRADKERDEDQVRDVNDCTIVGIIDRF
GUT_GENOME044231_011001-86MELGRVRGSVWCTKKSGELSGCKLLLVEGWDPLDEAGGGKLRVCADVIGAGVGEGVLVATGTAARRAIGADKAPVDAAVVAIVDGW
GUT_GENOME096561_017571-89MQLARVVGSLVSTRKSDKLQGMKILVAVPVDMDTFEEKGAPFVTIDAIGAGEGEVVMCVGGSSSRQTDLTDGKPVDNTIVAIIDSVDVQ
GUT_GENOME110003_01501156-233MRAGLVLGAVWATKKSPALTGQSLLRVRCGETEYLAADLVGAGPGDRVILAFGAAARAGRPDVPVDAAIVAILDETEA
GUT_GENOME278799_0150115-99MQLAKIVGNLTMASAHESLKGNALFLCQPIDENGNDAGAVVAAISPFGGGLGSKVVIVADGSQARRYVGHEHSPLRHCIVCVLDD
GUT_GENOME026228_000201-86MYTGEVVGCVVATVKDAGLANIPLLIVQLIEKGKKSKMIVAADATRQAGRGDYVYLIGSKEAGRMFRQKLTPADAAIVGFIDRYNV
GUT_GENOME018858_009641-89MNLGRVVGTVVSTSKCPQLIGFKLLIVEPLDEHLQRCGKTQVAVDAVGAGKGEVVITCDGSAARHLFQDEKPDSTPIDAAIIGIVDTVE
GUT_GENOME128997_000661-76MKIGVVTGSVWATKKCPALTGQAFLTVQLDTETVVAVDFVGAGRGDTVLVTLGSASSREIPAPIDASIVGILDKEE
GUT_GENOME096414_01353544-630MILAKVVGRVISNQKTPDLMGAKLLLVSKIDEFQNLKEGITYVAVDKVGAGQGDIVLVGDSATNERKDSYQELYQDMSIVAIVENIQ
GUT_GENOME171359_029631-80MKMGKVVGNMVSSRKYDGLQGYKLLLIELCYTEPKAYIVAADTIGAGLDQLVLVAEGSNIQQALTKPAPIDALVVGIIDS
GUT_GENOME000607_012741-77MFLCKVICSVISQQKEPCLTGKKLLLCETAEGRGKSRLVAVDLVGAGPGSQVLVSRRYGGSKEGDYIDDIIVGIVDE
GUT_GENOME282991_000081-84MELAKIIGKVWATKKADGLDGQRFALAQFITADGALSPRTLVACDTIGAGVGDTVLVAHGHAARAVLGRDVPVDCAVIAIVDCV
GUT_GENOME072377_022161-85MVVAKIVGNVVLACAHPSALRNALFLCQPLDENGDEISDPIVAISPFGGGIGSKVLVSTDGSAAREYVGDPNSPIRNSIICVIDE
GUT_GENOME238250_005471-93MLIARVEGSVVATKKNDKMTGRRMVLVRPFVVSEPGATAFKPSSSTLVAYDALGAGAGELVLVVQGSSARLAAPDKDTPVDAVVIGIVDSVDC
GUT_GENOME063689_001601-87MLLGKVTGSLWATRKDEKLNGSKFMLVKTWNMNLEQAEGLLVAADNAGAGVGDLVLITQGMAARISAENEGIPIDAMIVGVVDSVET
GUT_GENOME021831_006211-86MRIGVVIGSVWATRKEPKLEGLKLLIVEPLDYKMAGNITREPYIAADVVDAGIGDKVLIVTGNPARYAVGTSVPIDAAIVGVIDDT
GUT_GENOME118758_002274-86LMLIGKVTGTIVSTRKCESLIGSKFLEVQLIHNGVESDSYIIAIDSVGAGIGERVLLTTGSGARLALRDTNMPTDAVIVGIVD
GUT_GENOME256537_005371-77MRIGKVCGSVWSTKKAEQLTGAKFLVVRFSDKTEAIATDTVGAGVGDTVLVIFGSTAKALCAMPTDAAVCGIVDRAE
GUT_GENOME103760_025411-91MMIARVVGNIVATQKHGDYQGQKLLLVRAANLAGELYGPETVAVDGADTDSGIGDLVLVIQEGGSARQAARCGHNGPIDASVIAVADSIET
GUT_GENOME157967_000651-85MLKGKVVGNIVSTNKFDSLRGYKFLEIRLIEQDRLTDRYIVAVDRSISAGIGEEVLVVTGSSARVAAGETDAPVDALVVGIIDKG
GUT_GENOME146009_043571-86MKLAVVIGQIVCTVRHPGLESDKLLLVEMIDREGRPNGEVAVATDSIGAGNGEWVLIVSGSSARRAQHRETSPVDLSVIGIVDEAV
GUT_GENOME173859_000241-81MWLGIVIGNVWCTKKVSALTGQTFLLVAPEGHQGPALVCADQAGAGPGDRVLVTRGSGARVAAGESIPVDAAVVGIVDRVE
GUT_GENOME097725_000911-90MRLAKIVGVAQSLTKSDELYGTDILIAIPVEVDTLKENGASFLVMDRLGAKTGQIVICSSGSASWDGMAAGAPDERVVAIPESLEFEGKN
GUT_GENOME051534_007631-84MVIGKVTGSIFSTRKTESLIGSKFMVVSLQKSGGNKPEYVVAVDNLGAGINDTVLVTMGDSARYGCPDINVPVDAVIVGIIDQS
GUT_GENOME222837_009811-87MYIGKIKGVVVATTKDKELVGKKLLIVQPLDVEYNPIGNCEIAVDFVGAGTGEIVLVATGSSARQVSGSSKAPIDRSIVAIVDNIEV
GUT_GENOME096235_037521-93MILAYVLGNVWSTRKEEALKGFKFLVVQPVTLTYEESGRPRFEKYGNTLIAADRIGAGETEIVMIASGSSARQSLEDQRAPIDAVVIGIIDKE
GUT_GENOME090356_014491-85MQLARIIGSVVSTEKLTSLEGTKLLVMQPIDSSGKDVGSPIVGVDTVGAGTGETVFYAKSKEGAMTLADPSACADAGVVGILDYY
GUT_GENOME257519_010911-89MYAARVVGTVVCTSKEEKLTGLKMLVVQPVNVLNMKNEGKCAVAIDAVGAGHNEIVLVVGGSSARQTEVTTNKPVDATIMAIVDYIEIE
GUT_GENOME264510_010331-85MKICKVVGSVWATKKDVKLEGAKLMIVMPLYGSQGDPLIAADYVGAGIGERVLVITGSTARFVSAKEGAPIDASIVGIVDSIEIA
GUT_GENOME283050_028631-85MQIAKVIGTVVAVQKDERLSGVRLMVIQPVDSKGNAEGKPIVAGDAIGSGIGETVIYAKSKEGAFCLPDPKACCDAGITAIVDSM
GUT_GENOME086587_036991-88MKTGKVIGNIWATRKEERFSGMKLLIVQPFNPMDDTEIEYPVVASDIIGAGIGERVLYVNGSSARTAAGGQDIPVDAAVVAIIDDQEI
GUT_GENOME137256_005441-90MYLGKVIGTVVSTVKNPSLTGCKFLIVEKINQDLTAKKQTEIAVDTVGAGDGETVIVVGGSSARMSGDGETKQAIPVDAAIVGIVDTVEV
GUT_GENOME096561_017561-89MLIGRVKGTVVSTNKVNKLTGSKLLIVQPVDIETFEEKGEYVICIDDVGAGDGDIVMCAYGSSARQTDTSKKYASDYSIYGIVDYITIK
GUT_GENOME096235_012281-86MRIGRVINSIWATRKADSLIGTKLMIVQLLDRPHGELGPIIVAADIIGAGIGEKVLVTEGSSARNMDNFNDSPIDSTIVGIIDEEK
GUT_GENOME155166_006511-87MFLAKVVGKVVSTTKDEGLNGKKVLVVAPIDMDGNVISDKRVVSIDSVGAGIGDQVLVTTGSVSAYPFAENGAVAIDSAIVAIIDTI
GUT_GENOME207678_032981-88MWLGKVVGTVVATPKDESLTGCKLLIVQPLRLCSNQGLSPVVAVDTIGAGTGESVLVVTGSSARHVTGNPQSAVDAAIIGIIDTIELN
GUT_GENOME005110_003631-88MLICKVVGHVWATKKEEGLEGLKLMVVQEIDGTGRKKGNTFVAADVVGAGIGEQVLTVSGSTARKAFGRDTVPVDAAIVAIIDTVEVN
GUT_GENOME207289_007851-80MQISRVVRDLVTTKRVFWLASKSLRVVEDPAGNLDVVVDPIGTKPGDYVITIGYSAARAAAGNPNITTDLTIGGIIDDWS
GUT_GENOME207472_010591-81MKICQVEGTLVATARIPGLLNRRLLVVKERGSSAKQVAVDPVGCKPGDWVIAVGSSAARDAAGSKDFPSDLTIVGIIDYWD
GUT_GENOME066321_004891-94MRIGRVIDNIWSTRKEESLRSAKLMVVQILDTPSDLENPANNGGKVQIAADIIGAGIGELVITVGGSSARRAGGYDSNVPIDLLIVGIIDENEW
GUT_GENOME096283_018161-85MFIAKVIGKVVSTQKAEKLVGSKLLVIKSLNEHTKFSEEGALVAVDRVGAGMNDIVLVDWGDSLYEEAKLAADMAIVGIIDEIQL
GUT_GENOME090555_003884-88MYICKVQGKCVSTIKDEHLKGCSLITMQRMNKSGGPSGEMLVAVDTIGCSVGETVLVTRGSGARAVLGADSPADMVVVGIVDTYD
GUT_GENOME141675_002621-82MILAKVVGHVIATQKCDALKGSNLLILNALGDDLTPLKDRTYVAVDCVGAGQDDIVLAEKYLALNKESYKAMSIVAIVEKVY
GUT_GENOME244103_004921-86MQLAQVIGTVVSTKKSDSLRGCKLMVVRIQNKDLKTFGEARVAVDTVGAGVGELVLCVSGAAARNAVAAPASPIDTAIVGIVDTID
GUT_GENOME012053_004831-94MFLAKVIGQVVSTQKDSRLVGSKLAVLRPLQIGDEKSPELVETKNTVVAVDGCSAAVGQIVLYAQGSSARQAEGMKELPIDAAVIGIVDNVEAF
GUT_GENOME000153_00017220-301AMILGIVKGTVVATRKMPELVGYKFLLVEPVFGSKKDTIVAGDNIGAGVGEMVLVTTDETTQHGLDHPSPIDAFIVGIVDNP
GUT_GENOME097732_011441-78MEIGTVTGSVWATRKARELGGHTLLVVRTDMGKLVAADFVGAGAGDRVLLVTGSTARLYCPESPVDTAIVAILDQMEV
GUT_GENOME254531_008811-76MKCYEVTGMVSAEKRLSALDELSLITLRSSEGEGLVAADLLGVRPGDRVVVSTSGAQAVLGTNLPVDALVLCVLNR
GUT_GENOME006873_02034133-209EEMKVGKVVGAIWATRKAACLQGQTFLVVESNGEKIVAADQVGAGTGDKVLLATGTVASKYCMDAPVDAAVVAIVDE
GUT_GENOME246144_012261-75MKLGVVTRQVAVKKQAACWQEEKIFLVELEGSSLAALDAAGAEPGDPVLLVMGNAAAAYCMAAPTDAVIVAVAEK
GUT_GENOME004087_0082316-101MRLAKIIGTVVATRKDNSLVGYKLMIIRRIDGHGNFIDSEEVAVDYVGAGIGETVLIGSGSSVRVDQSKREAVIDMAIIGIVDTMD
GUT_GENOME013238_003661-95MKLARVIGRITLSKKDEALAGIRLIIASPLDKSQMTGENSSQLSANKSNLIVCDCFGAAMGDIIGYVEGAEATAPFDSPTPVDAYNVGLVENLRY
GUT_GENOME098162_016231-87MYIGRVIGTVVATRKDEKLVGSKLLITQPLNIELKPIGEPLIGVDTVGAGIGELVIYVKGTASRIAARKMDSPIDISIVGIIDSMDV
GUT_GENOME105885_027661-85MDIGRIVGKTVCTVKQESMGGLRLCLVQIPPEGENSRIVVAADAVQAASEGALVYLIDGSEAADAMRRGKVPVDLSVVGLVEHYG
GUT_GENOME140601_039901-87MFLGKVIGSVWSTQKEAGMENLKLLIVQPIDWKETEGGQTVIAADRIGAGVGERVIVSRGSVARSLFQEKNVPVDAVIVGIVDSFEI
GUT_GENOME096439_037921-85MIIGDVMGSVWATRKDEKLNGLKLLIVKPIYNDDLASTFVAADLAGAGVGDTVLVTKGSSARSAFGKERLPIDSVIVGVVDSIDM
GUT_GENOME000181_015121-93MTLGKVVGHVVSTQKDAGLTGAKLLIVRCLEVNPDNSAFVESSMAMVVVDTVGAGAGETVIMTSGSNATKYVKGFEHFPTDMTIVGIVDSAEL
GUT_GENOME140566_02522166-254MHLARVTGAVVSTQKSPSLIGKKLLLVRRISADGELPLTPLTGDEVAVDSVGAGVGELVLLSSGSSARHVFSGPNEAIDLAVVGIVDTL
GUT_GENOME219686_028941-87MRIGRVTGNMTSVIHDPSHEGYKFLTVRFVNAEGQEEDAEAVFADAAQAGIGDLVLVCEDGGAAGMVFALEGSVCVLDGVIVGVIDR
GUT_GENOME093237_002801-89MRLARVVGNVVSTVKDPCYTGYKLMLVEYLDPDTRQPDGARQIVFDCVDAGVGDIVLVNIDGGAANMLLNDKVCIADQTICGIIDSYTS
GUT_GENOME096519_011431-88MVIGEVVGNLWSTKKVDSLSGARFLLVNVTDQRVDNQNWQTRQIVACDIVGAGFGEKVLVVEGSGARVVEKKSRAPIDATVIGIIDSI
GUT_GENOME238250_014391-91MRAGTVIGRVVCTVMHPAFRGDTFVLVLPWNTKTWKAGGKADFDNSLVAYDELGAADGQTVAFAESGEAAAALNPPKPVDAYCAMILDAVD
GUT_GENOME230525_004591-87MFVGKVKGSLWATRKDENLNGLKFLVVERQLNEHQSDPALLIVADCIGAGEGDQVMVTTGSSARMSLNKTNIPVDMVVVAIIDKVDY
GUT_GENOME260354_023651-90MLVGRVMGAVASSTKKTELTGMKLLMVKEIDVATLKDKSDLWIGIDTVGAGEGDIVMLVRGSSSRCLPGYKDTPADCTVVAIFDTIDIHG
GUT_GENOME122716_004531-79MRLAYVVGSIWATRKSDGLCAYKLLLVRDAHDGEFYTAADTLDAGEGEMVITAGGSSAARNEDNISLPIDASIIAIVDE
GUT_GENOME145988_012231-93MKLGKVIGQIVSTRKDERLVGHKLLLVQFLEPTKDGKLAIARSDGRVEVSVDLVGAGVGETVLLCSGSSARNATGVIDAPIDYAIVGIIDTVD
GUT_GENOME011747_001671-86MIIGKVVGHVVSTRKNENLIGQKLLIVEPHESLKGNMGSSRFIAIDNVGAGIGETVLVATGSAARVGCDLKNAPVDAAIVGIIDCP
GUT_GENOME129090_004671-88MYVCKVIGKIISTVKNEKLVGHSIVLVQAAALNDRGSLEADGPVFAAADTIGCGEGNFVLVTRGSNARYACKCAEAPVDMAVVGILDG