UHGP-MC 76549


Information


Number of sequences (UHGP-50):
180
Average sequence length:
103±17 aa
Average transmembrane regions:
0.04
Low complexity (%):
2.31
Coiled coils (%):
0
Disordered domains (%):
4.28

Pfam dominant architecture:
PF03212
Pfam % dominant architecture:
2167
Pfam overlap:
0.03
Pfam overlap type:
shifted

Downloads

Seeds:
MC76549.fasta
Seeds (0.60 cdhit):
MC76549_cdhit.fasta
MSA:
MC76549_msa.fasta
HMM model:
MC76549.hmm

Sequences list (filtered 60 P.I.)

Protein Range AA
GUT_GENOME062055_01948349-442WGTSGKNGGKLTFNANNQTLKGNVQVNKGSSLKLLLKNGSELTGNINKTKTSGTVSLTIDGSSTWNVTGTSHVTVLKSSKAHLKNIKSNGHNVY
GUT_GENOME194101_00850223-323WGQAGSNGATVKFTGIGQQLAGDVVADTISSVDLHLTEGTTWAGAASIEQNAAGSTSDAPMTVNIDGSSQWTVTADSTVSSLNVAEGAKVVDADGKTVTIV
GUT_GENOME243079_00751480-605ASGAAFWLQGSNHALTVNGSDVTAGDGSGRLLRVSDTVFTDGSSVATSRVDFSADASTLRGDVVVDSATADVHMVLGNGTTFSGALRNDSGYQVAQLALDGSSQWNVRASSSVGALEHAGTIAFDA
GUT_GENOME045257_01169374-467FGDAGTNGGDLTFMANNQGLYGNVSVDASSTILMDLQNGSHWSGAINTDNIGKKVDVNLDASSSWSLTADSHVDGFTDGTTALTNLKGNGHTLY
GUT_GENOME179101_01256337-430WGKEGQNGAVLEFIGNNQPMSGDITVDELSSVTMTLQNESHWYGSINEKKTAKSVKVGLDGKSTWSLSGDAHVNVLIDGDTTLKNITGNGYCIF
GUT_GENOME090533_00009521-616WGKEGANGGDVVLDAADQEIDGQILVDDISTLKLNLTDGSSFTGAVNPDGAEGTVDVTLSKDSTWTLTGDSYVSSFTGSTDNVITNGYHLYVDGEQ
GUT_GENOME096531_00673385-480NLLQAEGTNANNWGSAGSNGATVSLTGIGQTLAGNVQVDSISSAEVFLLQGSTWTGATQVTENASATQASGGSIAVDVDGTSTWVVTGDCAVADLN
GUT_GENOME132069_00892477-590DGVLLRVVGNSASHGWGTAGSNGAQVAFTADGQVLTGDIVVDTISTLDMTLTNGSTFTGTTSIIDNAQGGTAVSDNAVVTIDSGCTWTLTGNCTLTSLTNNGTINFNGYTITLA
GUT_GENOME143496_00158502-573SQVNLSASNSHMEGEIIVDTGSVVDVYLNSGSEWQGITASTNSVNIDDSSLWTLTGDSTISQMNSSGTTHFT
GUT_GENOME029599_00190365-450LVLKAVGENLTGHIVADRTSKVEVKLENSTLTGAINAGNEAKSVELYIGKGSTWNVTADSYVTKLANEDRTNSNIHTNGHTVVVDN
GUT_GENOME141067_00975181-259NSIALENCDVIGNMSDTKGSSGSNGAQVEFTADSQTINGDIIVDNISTLDFSLTNGIVLNGTINFNGYTITLADGTVLK
GUT_GENOME237883_00969389-456SISTIKLDFASNTDFTGSINEDQQAKSVDLTLAKDATWDMTADSYLHTISDAADNYSNIASNGHNIYY
GUT_GENOME244174_00027316-409DGGALTLATAKQSWSGNVVADKTSTVMVSLQDGAVWTGTMNGTAEAEKADVNLDRSSQWILTGPSHVTVLSDADRSLQNIRSGGDDLYYDRQAA
GUT_GENOME221133_00319421-517WGTPGANGGHVELIADEETLNGDIVVDTISDVNLTLRNNSVWTGAITIIPNAQGGEKYKTNADIFIGAGSVWNLTADSQATTVNNLGTINFNGHPIT
GUT_GENOME251527_01102543-643WGQSGSNGGTVTLSFKNQKAAGNIVVDSISSLDFTLSSGSYFEGAVNADNTAKTLSITVDASSKIKLTGNWYLTSFTDADSTYSNIDFGSYNIYVNGVAIK
GUT_GENOME015857_01298105-177NTIVNNDTTGNFLRVQKDSWGNSGSNGGDVTLNLKNQKLTGDSYITSLINENSSNSNIDFNGYKLYVNGVAIN
GUT_GENOME021467_01036520-635WGSQGSNGGNLILNATNQILNGNISVDNISTASFILKGSTLTSTINSENNAKEVNLSLDSSSKWVVTGDSYVTTLTLENNDLSLIEDHGYTIYYDASANSWLNGQTITLSNGGTLT
GUT_GENOME283623_00706395-465LTGTMESDAMHTLELHLSHDSLFTGSITAPDSTITLSRDSRWILTGDSAIGTLLNDDPTGENVILNGFALK
GUT_GENOME101275_01189468-562AEGCDATIAIKDSEVRGDILVDHETKLTLDIQSGGFYKGFIDQSMYSQDINIKIADDGRIELCSDIYVKEFITNDFNFNNVIDNGFSIYYDDTAE
GUT_GENOME098570_02177509-595TLNANEQILEGKIIVDAISTLDMQLQNNTAFTGTINADGIAGTVKVTLDETSKWVLTGDSYITEFTGSLDQVDTNGHHLYVDGQLVK
GUT_GENOME280032_01504332-426WGEDGSNGGRVTLNMTNQNVKGNVIVDDVSTLVFNLSNESVFQGAIDYADSAKKVSLKLSKDSVVSLTSDSHIDFLENEDSTNTNIYLNGHKLFV
GUT_GENOME070225_01769291-356NALRVDGATNGNSATAIRSDRGGGTVNVTLDDTSTWTLTADSYVTSFTGELANITANGYHLYVNGE
GUT_GENOME117625_00345519-617WGNAGSNGGNLEFYAENQSLAGDVEADEISTISMVLTNSELKGAVNAENTAKEVSLSLDSDSIWEVTGTSYVTVLTNEDITCGNIHSNGYDVYYDTDAA
GUT_GENOME000963_04596429-524SWGTAGKNGGDFTLTGIAQKLQGDIICDEISRVSVNLTEQSTLKGTVNGENKGQNVDISLDKDSKWELTGDSYVNVIKNDDSKCSNIQSNGHSIYY
GUT_GENOME258665_01366277-373ASGGAFTLRCSYQALAGALSFGEGGSLLLVLQNNSSFTGSLPVEARLPVTLTLDGTSRWFVTGDSRLMALTEGEDALSCIESNGFTVYYDAGRAENA
GUT_GENOME064934_01543428-511LTLRASNQELSGNIVADSISTIALDMANGSSLVGAINTDNTAKEITLKLSKDSTWTLTGDSYVKTLTNEDTTNSNIHLNGYKLV
GUT_GENOME039964_00173490-610WGKVGSNGGNVTLNAESQTLKGNVTCDNISTVSMSLASSTVFTGAVDSGNTGDVTITLDKTSKWNVTADSYVTAIVDADSTFSNIFSNGHTIYYSTSDSRNSSLNGKTVTLSDGGKLVPVK
GUT_GENOME255603_00250371-471RVLADVRADEQKNGGKLLLYGVNQKYSGDITCDKSSKVKLVLTEGSSYSGAFDAKGAAAYSKVYLSRNSAWNVTGDSHIDSLRNGDKKCRNIKSNGHTVYY
GUT_GENOME044271_00624287-374GRLSLLCDYQALEGELSVADGATLSVTLQNNSTFTGSVSADERMRLSLSLDATSRWFVTGDSYIEALSDADATLQNIESNGFTIYYNS
GUT_GENOME113718_00612368-455GKAKAVLLRQKAEGKILTDSVSSLDLTLDSGTVLKGSISDEDSPLSGSGTVLHIGKGARWELTGDSVIRRLDNQGKIETNGFKLTVRE
GUT_GENOME018428_00586406-500WGTSGSNGGTLSLLAKKEALAGDIYTDGISSVDLYLTKGSSWTGQTDGEANTSVTVTKGSKWVVTGDCTVSSLNLASGAKLVDENGKTVSVVANG
GUT_GENOME194114_01143379-462WGTPGKNGADVTFTGLDQELAGNIDVDTISSLDLYMLDGTTYIGATTISENAAASEQSDAPISVSIDGDASWIVTADSSVTNLH
GUT_GENOME014081_02446543-640SRGWGTSGCNGADMSLFATKQTLTGKIIVDESSSLTLTMSDNSSFEGSINENQEGGTVAVSLDDNSTWILTGDSYVTSFEGSMDNVNLNGYTLYVDGV
GUT_GENOME105230_00355466-578NSNKRGWGQAGGNGANCVFTAIGQQMTGDILWDSISTLELNMTENSTLMGAIVDDESNAGDGGNGYANVTVDATSKWVVTGDSIVTVLNNSGTLIDAEGRTVSVVGLDGVVYV
GUT_GENOME212955_00018381-474WGTKGANGADLTFNADNQNLEGLITVDRISTLKLNLKSSTLKSTVNSANSGKELDISLDKNSTWNVTGTSYVTVLTDEDTSLDNIKDNGNTIYY
GUT_GENOME154167_00184471-569WGNSGSNGGDLELNATNQKLSGDIILDSLSSVDINLKSSTLTSTVNNNKKAKEVNISIDKNSKWNVEGNSYINTLKLTNNDLNLIKDNGHNVYYDKEAN
GUT_GENOME261089_00027284-373HAGTLTLSGEHQAFSGDLVCEQGSALRLRLASNSTYTGRLSNEDYMDVTLALDATSRWFVTGDSYLEALEEGEGVLQNIESNGFNVYYNS
GUT_GENOME005485_05239551-645WGTPGSNGGTFQLTGIKQTFEGDIICDSISTVSVCLTDSSVFKGAVNEENTAKEINLSLDSTSTWEVTKTSYLTTLTNDDTSCSNIISNGNTIYY
GUT_GENOME046138_02263515-598TLKGTGQTLAGDVTCDSISTFSLILNEGSAYTGAVNSENTAKEVSVSLDASSTWTLTGDSYVTVFANDDTSCANIASNGFHIYY
GUT_GENOME164214_0008525-104SGNSGTNVGNLYFQKYSYCLKDNLSLTIIDNSAKEISLTLDKSSKIKLAGDTYVSSLSDEDSTYSNINFNGYKLYVNSKA
GUT_GENOME159031_01530481-576WGNDGSNGGHAVINAQNQKLSGDIVTDGISSVTLNLKAKSLYSGALNPANYTGAMNISLSADSMWELSADSYVTVFENEDAQCGNINTNGHNITYD
GUT_GENOME180136_01608562-674NANRRGWGQTGANGADCTFTAAEQTMDGDVIWDSISNLSLNLTEGSVLTGAILDDESCAGEGGDGACALTIDAASKWIVTGDSVLTTLSCNGEIVDADGSAVTLAASDGTVLS
GUT_GENOME081485_00442351-439NGGKLLMYGVGQTFNGSINCDLNSKVKLVLSEGSRLTGAVNPGDSAYYSKVYISDDSEWELTADSYTSSIVNELEDCSNIESNGHNIYY
GUT_GENOME237516_01717329-428QGAGSSGANVNFTGENQELTGNFTISSGSTMNLTLKDGSVFNGAVNPYSSYGGNVYVDLEGDASWTLTADSYISGLTCSKTAVKLNGFTLTVNGEAYTEG
GUT_GENOME019364_00554352-430AENQTLKGNIEIDNISTLKMNLTNSQYTGTINGEQTAKQIDIKIDSNTKIKLTGNSYITSLEDEDTTYSNIDFNEYTLY
GUT_GENOME256611_00136505-608TDSWGKSGSNGGNVTLNMTNQKAIGNIVIDSVSTLVMNMKSNSYYEGAINTDNSAKSITLTLDSTSTIKLTADTYVTSLENADTTNSNIDFNGYKLYVNGTAIN
GUT_GENOME098083_01966122-224GNLMQVSTWGSGTPADTDISYAVSNQTIAGDVTADGNSSLKLALRDKTYFKGALNKDKKARAVSLSLDKDAVWEVTGTSYVTAFTDEDTTLANVHSNGYTIYY
GUT_GENOME238332_00379549-637GSTLNITLKNQELTGDITPDYNTKVNLNIGSGSSYKGAIDVDSRQVVNVKLATDAKLTLTNTSYVDALELEDTAFNNLDLAGNILYYNM
GUT_GENOME252032_00404417-510RGWGTPGENGGRCTLNAIAQALSGTIRVDESSSLDMTLSDGATFEGTVNPEGQGGEVNVTLAQGCRWTLTADAYVTSFTGDLSCVETNGYTLYT
GUT_GENOME190604_00025415-552NDGKRGWGYAGSNGGEADVFFDNTTLSGNISVDQYSRMNLILQNNSTFTGTINLTDNAAVTAKASEIRVSADTSTDSDEDTPSIPDVITRNRAEVVVGAGSTWNLTADAHISSLMCMGTINYNGHQIVLADGTVMTGA
GUT_GENOME172411_0016478-155LAGDIEADDISTVSLSLTSSTFEGSINNSNSAKEINLTLDKNSTVKLTADTYLTSLNDDDSTYSNIDFNGYTLYVNSK
GUT_GENOME090341_01041467-580LLSVCDDGWNGAGNQAEVNAFSQVLEGSILVGDNASLSLSLSEGSTFTGSISGEIANSRGETVSTQVGEVSVTLDGDSTWTLTGDSCVTSFTGDAAQVISGGYTLYVNGVALEG
GUT_GENOME026646_00497392-495NSASRGWGKAGSNGGKAAFTADHQKLDGTIAVDTISRLTMTLGQGSTWKGQAEILSNEKGTSNDAGIYLTIAKGAQWDLTGDSTVTTLDNKGKINTHGYRLTVL
GUT_GENOME135373_00404397-486TNSQVRLQANQQAINGDILADRGSKLDIKLSQNSTFTGAINPNNSAKQASLSLSKDSVWNVTGDSYLTELNNDDSTNSNIHTNGHTVKIV
GUT_GENOME079997_04109337-431WGTPGSNGGNLTLTAIDQKLKGRIICDEISTVNLKLKESSSFSGSINSDNKGKYIKVSLAEDADWKLSGDAYVNVLTNADQSCSNIDSGGYSIYY
GUT_GENOME024260_00168523-617WGRQGENGADVKMSLASQSVSGDIVVDDISTLDLTLSSSDITGAINSDNSSGNIVLTLDENSSITLTGDSYVTEFNGDISQINAGNFHFYVNGEK
GUT_GENOME017984_01346348-442WGSQGSNGGKVNLRASGQVIDGDMLVDDVSALNLYLNDGSTFAGAINPTGQQGDVYVELDANSTWTLTGDSYVTSLTCVQGAINLNGHKLYVNGV
GUT_GENOME184942_01332357-461NLIEVTSDRWGTEGSNGGDFEFTAAKQSLKGDVVANNISTVSVSLTNGSNWSGAMNPKHTAKVAALSLDASSVWNVTGDSYVSALTDEDSTLGNIRSNGHTIYYD
GUT_GENOME141064_01109474-572WGEEGANGGNFTLNATKQELEGDITCDEISTVALSLADGSSYKGTIDGSHTGKEVSILLDEDSVWEVTGDSYVAAITDADESLDNLKSNGHTIYYDASN
GUT_GENOME213427_01883396-481TLKAVSQDLKGIIKYARGSSVLLKLTKGSTFKGTFRASSHSGSASITLDPSSVWTVTSDCHVHSITNRDTSCRNIKSNGHTVYYNA
GUT_GENOME191308_01415381-465WGTKGKNGGHITLTATKQKLSGNITVDSISSANVKLLKNSVYTGKTSIVANSYASSKAKTPLTMNISNSSKWVVTGNSTVTNLSV
GUT_GENOME263044_00199432-552WGRAGSNGADVTFTGLGETLTGDISVDSISSLDLYFLDGTTYTGQIQSVANAGGQNDSVSNAGDQNDSVSNATVTVNLDQDSTWIVTGDTTVFTLNAEDGAQLVDADGKTVSIVADGSTVV
GUT_GENOME264265_00026126-221SWGTGTPAQANVAYAISGQNLTGDVTADQGSTLTLSLQNGTHFKGAFNAGQKAHAAALSLDQDSVWDVTEDSYVTALTDTDTSLANIHSNGHTVYY
GUT_GENOME163677_00327624-760SDITLKDVDITYNNDNEYFLRCTGNNNERGWGESGANGADCDFTAISQDMEGSVIWDTISQLDFYMTDGSNLTGAIIDDESFAGNGGDGYCNVYVSDDSTWTVTGDSTVSKLSNAGTIVDDSGKTVTVKGTDGTVYV
GUT_GENOME001870_03982463-546LTLTANSQELKGDIICDEISGITLKLTVNSVLEGAVNGEGQSKAVEVNLDPTSSWIVTGDSHIAVLRNEQADNSNIQSNGYTIF
GUT_GENOME000603_03076462-546FTLTGISQNLEGNVKCDQISTVTLNLTEKSAFKGSINAGNSAKSATIHLDKSSTWTMTGDSYVSAITNKDTACSNIKSNGFTLYY
GUT_GENOME023425_01120123-209GDSANTIDSDGTYSNESYNSTGDDENALRVDGATVTIKNASDQTLDGDIVVDTISTLDMTLSDNSTFNGTINFNGYTITLADGTVLK
GUT_GENOME246796_01409463-560WGTAGANGGTAAMTASKQTLSGKIVVDKISALTLTLKDNSTFTGTINPDGQTGTVKLTLESGSTWNLTADAYLTEFNGSTENIAANGHHVYVNGTQLI
GUT_GENOME096505_00128416-484ASVTLNASAQPLTGDVTADSISSVAVVLSNGSSLSGAIDNANTAKSASLTLDANSTWNVTGDSYLSALI
GUT_GENOME009117_00683527-639NSNKRGWGSSGQNGADCKFTAKNQEMVGDVIWDSISKLNMSIISNSKLQGAFVNDESNAGNGGDGYANLSIDKNSTWVVTGNSTLSKLTCKGKIVDSKGNKVTVVDSSGKVLH
GUT_GENOME046057_00121290-400AEAGWGTPGQNGAQATVIAEAQQLTGDIVVDTQSQLALRLKEGSTWTGSVQILQSAEESAAEETEETAVVTIEKGSRWELTADCVISALYNEGEIDYNGYTITLADGTVLS