UHGP-MC 125840


Information


Number of sequences (UHGP-50):
112
Average sequence length:
155±18 aa
Average transmembrane regions:
0.11
Low complexity (%):
33.79
Coiled coils (%):
0
Disordered domains (%):
10.96

Pfam dominant architecture:
PF04829
Pfam % dominant architecture:
43.75
Pfam overlap:
0.33
Pfam overlap type:
extended

Downloads

Seeds:
MC125840.fasta
Seeds (0.60 cdhit):
MC125840_cdhit.fasta
MSA:
MC125840_msa.fasta
HMM model:
MC125840.hmm

Sequence list (filtered at 60% identity)

Protein Range AA
GUT_GENOME147138_01145 3794-3956 WGVGSDFQRGMQAATAALQGLAGGDLTQAAAGAAAPYLAQMVKQQTDDGISRVMAHALVQGALAAAQNKNAMVSATGAATGELAGMMATELYHKDASQLTEGEKETVSTLATLAAGLAGGLTGDSTVSALASAQTGKTVVENNLLTGKDALNMLRELEEAEKT
GUT_GENOME143407_04436 200-357 YGIGGTYQKIAQAATAALQGLAGGDMTKALAGASAPYLAQMIKDVAGDDNEAARIGAHAVLGAVLSHLQGNSAAAGGAGALSGELAAIYIKNNLYPNIETKDLTEAQKQVIVNLSSLAAGLSGGIAGDSTGSAVAGAQAGKNAVENNAVSCSTLTCLN
GUT_GENOME142612_01413 3968-4129 SDFGTGGKYNRAIQAVTAFTQGIMGGNIVTAIANGSAPYLANEVKNQIQGNSVESDIQRTLAHGLLNAGLALAKGENVAAQATGAMTGETVGILSHSLYGKTPEELTESEKQNISAWATLASGIAGGLISDNSTGVANAAQAGKVVVENNVFNLAGRKQKDI
GUT_GENOME156716_03460 2517-2682 QSGTGSDIQRAIQAATAAAQGIAGGDISAALAGAAAPYVAEIIGHRSGLDDGMEKAAAHAVASAVLAAVQGKDALAGATGAAAGELAGTLALEMYGKDVAALSESEKQTISALATLAAGIAGGLTGDSTASAVAGAQTGKTTVENNTLAHVLAAAEANKAGTIEQW
GUT_GENOME141306_02816 4593-4774 QWGMGGDKSRALNAVTTAITGALGGQTDLQVAANTLAPYAANMIGEKFGHGEDKNKAAQLVSHAILGATLAYLNGGNPAAGGSAAVASEAAADYFANQYNDGKTAINPETGKFDANLLPENIKSGIRDLTAAIGAVVGGTVGDSSSNAQLAGVIGQNAVENNEFSIITKGVEKKLAENKKEK
GUT_GENOME010376_00720 709-884 TYGIGSEKGMAIRAVTAALQAAAQNDTAGSLVALASPYLNKTIHEMTAGDTAKDKATNLMAHALLSAVEFQVTGKDPLTGAIAGVTGEATAEIIARAYGKPVSELTANEKENISTLSQLAGGLAAALTAKANGSTTEQGGNFLAATSGAETAKRAVENNYLWQEEQKEFEKKMLEC
GUT_GENOME145497_00783 2623-2790 FGTGGKYQQAIQAATAAVQGLAGGNLSAALAGGAAPYLAEVVKTMTTDPVTGEVNKAANVAAHAVVNAALAVAQGNNALAGAAGAATGEVVGMIATQMYGKPVSELSETEKQTVSTLATVAAGLAGGLVGDSGASAVAGAQSGKTTVENNYLSVSEKTELEIAKQTLK
GUT_GENOME171513_00871 3690-3871 TYGTGSAMQRGIQAATAALQGLAGGNIGGALAGASAPELANIIGHHAGIDDDTAAKAIAHAILGGVTAALQGNSAAAGAVGAASGELIATAIARQFYPDTDPSKLTEEQKQTVSTLASVSAGIAGGIAGGNTAGAATGASAGKNAVENNYLSVSEKTELEIAKQTLKNSKDPAEREKAQQKY
GUT_GENOME231238_00885 2923-3084 EWGNEGKYRRALDAITSAGVAALTGQSAQGIAVTAASPYVNQAIKNATTDEQTGKVNKVTNIAAHALWGAVESNALGGSSTAGALSAGGAELVAPQIAKVLYDKAPNELTSSEKQRVIALSGVIGKAIGGITSAAKGGDTYAISKNSDISGNIAKNAVENNS
GUT_GENOME147130_04577 59-223 AMATYGTGSDLQRAIQAATAVTQGLTGGNLGQALAGGSAPYLAHEISKYLPADQNQTANLMAHAVLGAVVGHFNGNATVGAVSAFTAEAAAPAIINAMGWDKDHLTEQQKQTVSALGTLAAGLAGGLVGDSSNSAVAGAQAGKNAVENNTLSSKDEKLRQDAKWS
GUT_GENOME257704_03663 2733-2895 YGTGGKYQQVTQAVTAALQGLVGGDIGSALAGASAPYLATIIKQQTGNNDTARIMAQAVLGAVVAQMQGNSAVAGAAGAAGGEAIAKVIAEQLYGVKGNDTSGLSEEQKQTISALSTLAAGLAGAAIGSDTAGALAAAQAGKTAVENNYLSRRDVDELAEKAR
GUT_GENOME143092_03474 2855-3024 QWGTGSAIQQGIQAATAAVQGLAGGNLAQAASGAAAPYLAEVIHDMTTTKDANGKEVVNVEANLMAHAVVGAVTAYAAGNSALAGASGAAMGEYIAQQMYPGVKREDLTEEQRQTISALGTLAAGLAGGVTGDSTAGAVAGAQAGRNAVENNWLSVEEADRKAVLERKER
GUT_GENOME052644_01327 2890-3071 MAKYGTSSEIQRGIQAATAAIQGLVGGNLAGALAGTSAPELAHLLKSTEKDLAVNAIAHAILGGAVAAMQGNNVAAGAAGAATGELAARAIAGMLYPGVKQSDLSEEQKQTISTLATVSAGLAGGLTGNSTASAAVGAQSGKNAVENNYLSADQIDNFAARAKGCEARGDCGQIVKEMEDLS
GUT_GENOME144553_02946 2782-2959 YGTGSTPQMVVQAITGVLGGLNAGNPGQVLAGGLNPAVAQLIKQATGDNREANLMAHAVWGALAAQLGGNNAASGAAGAFSGELAARYIIDNYYGGRTDNLSEQERQQISMLATIASGIAGGLAGNSTSAAGTGAQAGRNSVENNYLSVSEKTELEIAKQKLKNSKDPAEREKAQQKY
GUT_GENOME146004_00560 3257-3421 TGSDLQKAAQAVTGALTALAGNNLAGALASGASPYLATEIKKLTTNPLTGEVDVAANAMAHAVLGAVTAQLNNQSAAAGGLGAGGGELAARYIAGQLFPGKTKEQLSESEKQQVSALSQLAAGLAGGLATGDTAGAVTGGQAGKNAVENNYLSNQQRSDRDKEFD
GUT_GENOME007946_00945 2293-2552 SEVAEWESGGKYHRAADALTSTIIGALSGQSATSIAATAASPYVNVGIKNATTNEEGEVNTVANIAAHALWGAVEAKALGGSGTSGALAAGIAELSAPVAAKLSQIINEVSSDESLPSTTAKMKAIVNKIKTIDKGMDPSELSNKEKEMLVGITSFIGQVVAQATSKARGADSDTASKNVKIGGIVAKNAVANNYLSRTEIEQYYKDLKDCNGKEECEKDVKQRNIALSAKHTEELELACGGDKRSSAECSEHREKARDG
GUT_GENOME231626_03420 3573-3745 WGIGGSYSMAAAAVTGVLGGLGAGNLGSAAAGGMAPYIANKIKHATSTFVNGQEQTNVLANTMAHAVAGAVLAQLAGNNASAGAAGAAGGELMARAILRTMYPGKQASDLTQDEKQVVSALSQLAAQLSAGVASGSIEGGIQGAVAGKNAVENNFLSVKKAETFNKSIEEQKA
GUT_GENOME143518_00644 3705-3842 ALAPYAAYFIGSKLDSNHGSDPNATLQLLSHAVLGALLAEANGGNAGTGAVSAAGGELAAKVLTNTLTGGDPSRLSPEQKEMVLALSQAVGALASGLSGQDLAGIALDAGIAKNSVENNFLGNDDHARMVHLREKAKR
GUT_GENOME095843_00426 109-229 PYVNQVIKDVTKDIPSLNLPAHVIWGAIEAELTGGSATTGAISTAAGELGAAYLAEHIFGKKAAELSPEERSKVRDAAKAIAGIAGGLSSAMQGQDLVSSLNDTSVGLTVANNAVENNYLT
GUT_GENOME095841_01103 2475-2643 KEWETGGSQRLVIDSALNVISTALAGRPAAEVVASGLSPTVNNQIKKATTDAKGNVNTALNLTAHALWGAVEAYAGNRNVAAGAAGAAGGEAAAHFLASTLYDKSPEKLSEEEKRTVSSLSQVAAGIAGGSLSDSSDGAIIAAKTAKDSVENNSMADDVHPSDERKQNI
GUT_GENOME096384_02217 1848-1993 QSVSGILAGAAGGDLKKALAGGLNPLMAQTIKGATTEDGKVNESANLMAHAVWGALAAQLSGGNAAAGAAGAFSGELAARHIAAEMFPGKDPGDLSQDQKQVVSLLGTMAAGLAGGVVGNSTASATTGAQAGKNAVENNYLSDKDI
GUT_GENOME102159_00067 2438-2626 EADKWQTGGEYKRKIDGAMNAISAALGGLPAAGIATSALSPEINHQIKLATEGSPMANKVAHAVWGAVEAYSANQNAAAGAGGALAGEVMADVIAKELYGKKPNQLNREEKEVVSSVSQAAGALVGGAAANSSQGIGVGLTTAKNAVENNFLSDASRARLNALKTKYHRGEKLTNKEKLEFRDLIESDQ
GUT_GENOME121634_00177 2303-2463 VGEWGYGGGNRRAIDTVTALFTSVLSGQGASATTVATLSPTVNKLIADNTHDKATNALAHAVWGAIEAQVNGGSAAHGAISAAGAELLAPQIASILYGKSEADLTPDEKAGVISMASLAGGIAGAIMNGKSEGVEIIGNTAINAQIAENTVTNNYLSGWQA
GUT_GENOME231557_01047 2949-3104 RAMQAATAAVQGVMGGDLKAALADGAAPFIANEIKKQIPDEEADANLKRTIAHGIANAALALAKGENVAAQATGAMTGEAIGILAEYIYNKQPGELTEREKENVSAWATLASGLAGGLAGGDTQSVANAARAGKTTVENNLLSPEKDDRRFKALEA
GUT_GENOME032708_00173 2427-2562 GLLGGETLTQAGLNASAPYLASTIGQTFGEKGTHPNREAQLMSHALVGAIMAYVNGQNPGVGAGAAAGSEFVAKYVAKALYGVEDPDQLSYDEKRSVASAVSALVGLGTSLTTSDMISAQTAGALGKTVVENNYFE
GUT_GENOME143489_00975 2542-2720 AQESFGIGSSFWTAGMAVSAALTGLAGNADIGSISSAAVAPYLAGQIKKYTTDKDDKVNKTINILAHAILGGVVAQMQGNSATAGALGGGGGELAARIYMDQVHPGKKVSDLSEADKRIVSAIGTLTAGILGGLSTDSSTGLITGAQAGKNAVENNALSVAQTQSLIKEMSQCQGGKVC
GUT_GENOME171447_01643 50-207 EYGVGSDFWRNGTALTGLLAGALGGNVTGGMATGAAPYVAGQIKSVADGHESARIALHTLASAVLVQLQGGNAAAGAAGGFIASSGSEALSLAFYNKEPDKLSPDEKTVIVNLVAALGAAGGSVAAGNSSGTGSGANAARVEVENNYLSSTEKSRQTY
GUT_GENOME231670_01212 3259-3409 QKVAQTVGSILTGLVTGNAGQAVAGGLNPWAAQLIKKETTDASGNVDVATNAMAHAVWGAVSSQMSGGSAAAGAAGAFSGELATRYIVEKYWGADTPEKIAALGQEDREQLSLLGTLAAGLAGGMAGNSSAAATSGAIAGKNAVNNNLFGG
GUT_GENOME096136_03155 3407-3552 DNRRMVEAGTALVQGLASGDVNKAIANASAPYIANEIAKNIGEDNKAGRLAAHAIANVALALAKGENAGAQSLGAFTGEAVGMLSEKLYGKEPSQLSESEKATVSAFASLAAGIAGGLVGGDTSTAANAAQAGKTTVENNLLSNKF
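
Each row of the sequence list carries a protein identifier, a residue range, and the matching amino-acid segment. The following is a minimal Python sketch for splitting such a row into its three fields. The identifier layout it assumes (GUT_GENOME, six digits, an underscore, five digits) is inferred from the rows listed here and is not documented by the source; the names SeedRow, ROW_PATTERN, and parse_row are hypothetical, and reading the range as 1-based and inclusive is likewise an assumption.

```python
import re
from dataclasses import dataclass

# Assumed row layout, inferred from the listed rows (not documented by the source):
#   protein ID : "GUT_GENOME" + 6 digits + "_" + 5 digits
#   range      : start-end
#   sequence   : uppercase one-letter amino-acid codes
# Whitespace between the fields is optional, so rows whose columns
# run together are parsed the same way as space-separated rows.
ROW_PATTERN = re.compile(r"^(GUT_GENOME\d{6}_\d{5})\s*(\d+)-(\d+)\s*([A-Z]+)$")


@dataclass
class SeedRow:
    protein: str
    start: int
    end: int
    sequence: str

    @property
    def span(self) -> int:
        # Number of residues covered by the range, assuming it is inclusive.
        return self.end - self.start + 1


def parse_row(line: str) -> SeedRow:
    """Split one sequence-list row into protein ID, range, and sequence."""
    match = ROW_PATTERN.match(line.strip())
    if match is None:
        raise ValueError(f"row does not match the assumed layout: {line[:40]!r}")
    protein, start, end, sequence = match.groups()
    return SeedRow(protein, int(start), int(end), sequence)


if __name__ == "__main__":
    # Only the first ten residues of the row are used here, for brevity.
    row = parse_row("GUT_GENOME095843_00426 109-229 PYVNQVIKDV")
    print(row.protein, row.start, row.end, row.span)
```

Comparing span with len(sequence) for a complete row is one way to check that the identifier and range were split at the right position.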