UHGP-MC 5799


Information


Number of sequences (UHGP-50): 70
Average sequence length: 144±11 aa
Average transmembrane regions: 0.03
Low complexity (%): 0.75
Coiled coils (%): 0
Disordered domains (%): 2.78

Pfam dominant architecture: PF12831
Pfam % dominant architecture: 286
Pfam overlap: 0.29
Pfam overlap type: reduced

Downloads

Seeds: MC5799.fasta
Seeds (0.60 cdhit): MC5799_cdhit.fasta
MSA: MC5799_msa.fasta
HMM model: MC5799.hmm

Sequence list (filtered at 60% identity)

Protein Range AA
GUT_GENOME224607_01940 12-149 FQDHGQWKVDTQFTHLMGSSYLLACHTPGKPVQDATTRFAVPKNGRYRIWARTKNWYYPYAPGRFELLVDGQNSGVELGTLASTEWAWQIAGDYALAEGQHNLALHDLTGYFGRCSSLIVTDDMDFVPARPIEEFERQ
GUT_GENOME116174_00105 372-496 ADSFGELGTWKRDGVYLQGTSVADAETRPAVLEVLVKASGTYRLWVNARDFSTNQPGTRYFNVAVDGIESQTTFGQHGGDGFKWTDGGTFELEKGIHKIALLDTSRFYARCSGILLSQELDFQPY
GUT_GENOME212984_01117 10-161 LTAVLLCAAGANARNASCLIEGESFQFKGKWVVEKSSECLGSAMLRVYQDNRTEASADALTVIDIPEAGLYRVWTRSQDYAHSPRPRTFTLSVDGKAMNPSGAHGVAGFSWECVGEVELQSKPTLLRLTDSGLYFGRCDAILLTSDPSVDPN
GUT_GENOME000496_02809 27-144 MGQGYLLALDRPGIPVAPAHTNILIDTPAVYRIWVRTKNWLRGHAPGRLKVHVDGNPLPGELGTQPNGQWYWDIAGDADLGCGQHTISVEDTTGYFGRFSAVILTSDMDYTPEHQVGR
GUT_GENOME256537_01670 153-286 FTDLGDWTRDGAAMKGRMTRDTFLEEGKEIPAGKDAVATINVTKPGKYKVWILARDFAKNQQGTRYFTVSVGGVRSAKTFGKHGQEGFVWEDAGEFDLQKGANEVRLIDTSAFHARCGGVFITNNLKFTPGSDT
GUT_GENOME256537_00811 654-793 YLIFPDDFSSLGSWAKEPYSGSLSGLALAGRSSTQAEAKDATVSIYAETGKTCRLWVRSRNFADSAGKRTFHVAVNNVQSERQFGTTAKDGFFWEDGGLISLKKGENTLSLIDTSGYYARLDAVLITDDLSLNEPKNNFN
GUT_GENOME099628_01445 18-149 KQEMLLVEAESFDHYGGWVLDAQFMDQMGSPFLMAHGLGEPVKDAYTEVEFPDLGEYKVFVRTRDWAGPNGPGHFSLLIDNVPLEKDFGNGGLGKWFWQEGGVIRVDTPKVKLALRDETGFNGRCDAVLFVP
GUT_GENOME256537_01513 26-167 AADSVYIYLRPENFSAGSWTPMENEAGAIDGLNLKGKIDQLPKTAVPASAQVEIPKDGEYYVWARGMDHETVPGTRNYQVALGEKVLPKKMATHGINGYKWELAGSVTLKAGKSFLKLLDTSCYYARCDCVVITDDKNYIPP
GUT_GENOME256537_00817 163-304 TLILLKAEDFAKDIGTWTIANGIQGATMNVLFGKDTQKPEECKPANATVKIEKDGTYYIWVRALDYSENQPATRYFSVAVNGEQLSENGGTHKTQGWKWQLLGSKELKAGDNTVSVIDSSAFYPRCEAVAITSDKDFSGPGE
GUT_GENOME025525_01094 7-141 LVEAEQFDDYGGWVLDAQFVDEMGSPYLLAHGIGVPVRDAQTTITIPEEGDYRIWVRTKDWVPEDHPDTFSVLIDGVPLAGDFGASGKGWSWELRDRVPLKKGAVRLALHDLTGFDGRCDALYITNTTSTPPEKA
GUT_GENOME171650_02966 2-154 EKESKRSMVLVSMGDFEDYGGWSLDTQFVTTMGVPYLLAHGLGKPVKDAKTTLYFEEPGLYHIKVYTYNWVAPWKPDLTAGIFQIGIDGKMLGEKFGTTGCCWGWQEGGSIEIAKQQVVLNLHDLTGFEGRCGLILFSRDKEFHPPVDLDKLN
GUT_GENOME256537_00700 353-507 SLGSWKIVNQDTKNSVLPNIMMGNNDNKDSTSNPATAKFGIPKDGFYKVMVHTRDFSSSPNTRLFKVKAGENPEIVFGALGKDAWGWQESAVLPFTSGEKELKVIDYKGYNARLDMIIITDDLYFEVQDSAENFKMLSEKRYVEGSVTKQKQDNA
GUT_GENOME118058_01329 14-166 LGTLVAPTSAQDLWLEAESFADHGGWSVDQQFMDEMGSPYLLAHGWGVPVKDAATKIVIPQNGTYHVFVRTSNWTSAWSDKRGPGRFSILINGQTVGGVLGDRGGLWHWQKAGKIQLKARTNTLSLHDLTGFDGRCDAVLITRHLDPRLIPSD
GUT_GENOME154048_00225 200-346 WFGPNSFAELGTWTRQNDNLIGISTKATLEEAQTALANGELNPEPAILKFTVPEDGTYRLWVSARDFTTNQPGARSFNAKIDQKMSGVLFGQHAKNGSGKDGQFVWEDGGLFTLTAGEHTVYLLDTSCFFARCQGVIISEDPFNPNS
GUT_GENOME011131_01748 9-163 VLAALFCVSAFARGNDRTGADETEFVLVEAESFVQKGGWCVDQQFMDQMGSPYLIAHGMGVPVEDATTSVQLKKGLWNVWVRTYNWTSPWTDKEGPGAFSLSVGGKALKTVLGTTGSEWGWQHAGAVRVRGGKTEIRLHDLTGFDGRCDAVLLVS
GUT_GENOME214044_01512 355-495 SAKGDSILLAGSAFDNKGGWISDPQFMEVMGSSCMLAHGLGRPVEDASTSFYVDKEGDYYVNVRTRNWTAYWSDRPTPGIFRISVDGRMLEKVFGEGSPEWHWQEEVKPVHLEHGNHRICVHDLTGFDGRFDSILLSLSPG
GUT_GENOME258018_02101 9-151 IDAIEFEERGGWKLDSQFTHLMGNPYLLACETPGQPVSDATTNVVLDKPGKFRIWVRTKNWYATHSPGQFQLKINGETSGVILGNLPTNDWYWHCAGDFDLKEGKNEICANDLTGYFARFAAILLTDDMDYVPPRPVNEFIAE
GUT_GENOME111204_00885 5-153 LIEAESLKNKGGWLVDSASMETLRSAYLMAHGMGVPVADAYDTVEISRDGSYYIWALTRDWTATWDIADPAGKFEILIDGEPLSQTLGTNGKEWSWQLAGQRFLSKGSHTLALHDLTGFNGRCDAIYITAEAVCPDSSRDGMNLLRQRL
GUT_GENOME255980_00703 6-174 RKAVSLFTTLALLLSVFGGYSAVYAADAFSVDTLIEAEDTSLGAGALVTEGETASGGKYITSEGERVDDPATVKNPDVVFTVDIPADGTYAVYAKVIIADGGRDSYHFKWDGDAWETVHPGEKGTDYVWLKLSEKQLTAGEHMFYWTHRETLAIYDAFFVTADSSKLPA
GUT_GENOME061575_01844 362-496 DSVLLPGTYFRHKGGWTADPQFMEQMGSSYLLAHGLGTPVEDAVTKIEIPQSGQYRIFVRTKNWTAHWADKEKHAPGAFRLRIDGRDCDTLFGTGDPEWHWQAGGTTYLTEGVHQVALHDLAGFDARCDAILFTL
GUT_GENOME256537_00588 1010-1143 YWFRPDSFSELGTWRYDSNAEGAFDSLTLIGRNDRKPANTKPAVAYFNADSSDSVWMWVRTRDYTTNYTGERVFKTRLNGELIDYTFGNHGTDGYQWVKLPYRVKLKKGENKLELVDTSAYWARCDSVLLTSDD
GUT_GENOME255303_01819 6-166 KFFCFFCLIIVSMHIKCYADTLAIEAEDGTIDGGFAVQGSQLASGGSYIFAQNTTGDVRSEAELGDVEAEYKFTVQNEKNYSIYLRVMATNDGDSNWVTIDGTLFALHYGAIEPGTFEWRKVGMYKLSPGEHVLRLYCREAGACIDKIIVSSNMLFTPRGM
GUT_GENOME217674_05616 34-179 LWVEAEQFEHKGGWVADAQFMDQMGSPYLMAHGNGHPVEDATTSLQIDKADDYHVWVRTYNWNAPWDDKQFPGIFKLFIDNRQVGDILGTTLQWGWQYAGKSRLQKGKVQIKLHDLTGFAGRCDAIVLSTDAALKLPASGEKLSVL
GUT_GENOME277990_02171 30-165 ESFATLGGWTVETQSIRQEGSSYLMAHGYGVPVADAETTVQIPSDGTYAVWARTRNWNAPWTGGAAGRFQVVLRGAGTPAWTSAELGTKGADWHWQRAGEATLKAGACRVALHDRTGFNGRCDALFFASAGERPPA
GUT_GENOME256537_00058 39-200 ADKVYYCITPESLDDIGGWSFSSDNTADSYRSKIMYDLGNASTKKEAKTKLVLPKDGAYTIWLHTRDFASNKPGTRTFTFSVDGKEIGTGGNHGNEGWAWQSYANKLFTAGDHELVFNNKGLYSRFDLILITDDANFVPGNTEAELKALETDHLYDPSKVEI
GUT_GENOME122366_00453 846-986 AEYEHFLLRPESFIKRGTWGLTSDYVGAYDGTVLNGLTNKDKTAEDATTKITVKKAGEYRVWVRSRNFEQSPQARHFALSINGTRLSKTFGQVGTEGFVWEDGGTVTLSEGENTVNLIDSSQYYAKVDSVMITNHTKMTPP
GUT_GENOME140284_01722 278-417 SSSGTDLWIEAEDFTTQGLWTKKQDLNNVVLCGVTDKDKPCNKDGLEATTHIKVKEPGTYYLWVRSRDFSASPAKRRFKVTVNGQITETEFGKHSKEGYEWEFGGTFSLPKGKISLGLLDTSQYYVRTDKLLLTTDSEFK
GUT_GENOME285291_00043 7-156 ILMLLSAALTSRADEVLIEAEQMTNKGGWVTDQQFMDLMGSPYLMAHGLGVPVADASTFFQTRTSGTYRLFVRTYNWTAPWTSKPGPGKFRVAVDKTYTSQPVGDRGEGWQWVDLGTLRLKSGRHRLVLHDLTGFNGRCDALYLTTGSTH
GUT_GENOME140017_00165 12-148 AESFEDYGGWSLDTQFETEMGSPYLLAHGLGRPVADATTHLSCPTAGRYYVWARVRDWVPKYHPGRFDLLVNGQKLMTLGESGENDWIWEKAGTVELREGENEICLHDLTGFEGRCDAIYFSTDPEDAPPEEGKQVD
GUT_GENOME256537_01627 389-549 IPPQMIMLTPESFYDTGTWQLEGVVSGSFDCGVLRGAAPSGGNAGPEDGDPSKTTPAKARFTVAKEGTFRVIAHCRDYTTSYPSARKFGISVDEKRVGGDLGTHGKNGFYWQEAGTISLSAGEHTLELLDTSGFYARCDMIVLTDNKAYEPSLMYNEMLSV
GUT_GENOME256537_01555 182-354 SEKHMKNTGITPEIPLVYKPNQVFLPGKFYTAFGVSDFHEKGTWKVEKTGTYNGETAGKDMNFIGLPNGKPEESKDATVTFAVPSPGKYYIWIHSRDFVERQGLRTFSAGIDDTVLPGVYGDHGIDGWGWERKEIENLTRGYHTFHLIDSKAVYARCDYFLITNDEAFVPQST
GUT_GENOME058155_00068 6-149 IEAESFSFLGGWVVDSGSIHEIGSPYIMAHGMGIPVEDAKTIVFLPEGDYTVFVRTRDWSAPWKHIKKGKPAGRFQLLINGIPLQETLGTNGAEWDWQKAGNVHLNSGRTEISLHDLTGFNARCDAICFTQRDITPPKESDELK
GUT_GENOME036137_01415 23-172 LLVETESFQQKGGWVVDQQFMDQMGSPYLLAHGMGVPVEDAKTEIIFPETGTYYIFVRTFNWTSPWCPEKEGPGKFKLKIGENQIDQVLGTLGTSWTWQKAGQIAVTETNIKTTVSLQDMTGFEGRCDAIYFTTEKDFVPPSDIAELEKF
GUT_GENOME117076_01553 4-145 IFIDARDFEQLGGWKREYQFITTMGMPYLLAAGIGTPVEAAEVKFQVPKAGKYRAWVRTRDWLPEYSPGGFHLEIGAEKSAVLGQNKCEEWDWQAAGDWELPAGENTVRLIDDTGYYGRVAALFFTTDLDFVPPADVEQIRL
GUT_GENOME256537_00132 541-671 IVTESFSDLGSWTKSANVNAFENLQLHGVTSGPNTSIPAKLSFSAQKGTYRVWVHSLNFTDRPAARYFNIGINGSTLAKTFGQVGTDGFSWEDGGTVELNGETKLELQDTSGFFAKLDAIVLTKDLSFTPP
GUT_GENOME069659_02421 25-167 QKKDFVLVEAESFNEKGGWVLDQQFMDQMGSPFLLAHGIGKPVQDASTEVMIPKKGTWHVYARTWNWCSPWKMKEAPGRFKIAVNGAVLDNELGMGTQWDWEYAGSVEIKNKSNSVTLKDLTGFEGRCDAILFTKDKNVVIPN
GUT_GENOME256537_01621 448-591 APIIYYVDGTGITELGSWTLQSDGYIQGCTDTANEGKGYTGGTPAKVSFNVTKPGKYKLWIRARDFATNKPGSRYFNVAFDGKEYPEKMGTHGKDGFIWTEAGVYDIQAGKHTVEILDTSGFYARCSGVLLTNDIDWKVPDNDA
GUT_GENOME010808_00115 14-161 LLEAESFDTLGGWVTETQSMQTIGSAYIMAHGMGVPVADAETTIELAEAGVYAFWARTRDWTAVWGRGKPAGRFTLGVDGQTLDTVLGTNGSEWKWQKAGSVALSPGKHRVSLHDLTGFNGRCDAVYITNDDDEPEQDNGKMTEFRRR
GUT_GENOME163893_01881 40-200 ARQACHLFVEAESFAEKGGWKTDQQFIDQMGSPYLLAHGMGVPVPDATTHVHFPAPGVYYAYVRTYNWTSPWSKAKGPGQFRLSVNGKRLEAVLGDTGDSWMWQEAGKVNVKNVDATVSLHDLTGFDGRCDALYFTTRCGALPPSDPVGLAEFRRQTLRLP
GUT_GENOME015077_00309 6-146 IDAADFKSRGGWQLDTQFVRETALPYLIACNRPGEPAENAVAKFTLTKPGTYRFFVRTKNWKPAFSPGKFNLLINGSPLPAVCGARPQYSWYWDIAGDLYLPAGTHTLALKDLTGWLSRTAAVIVTNNMDFFPAGEPERFL
GUT_GENOME063070_00704 26-165 AQKDKYLIEAEAFQFKGKWSTDRSADCMGSAMLRLNGGGSLDEQFDALTVVNIMEEGDYNVWVRSADYDKLPGTRLFRLSVDEKPMKESGKHGKVGFYWENAGCVQLAKKQVLLRLHDTKRNFGRCDAILLVKDHSINPN
GUT_GENOME114269_00223 35-207 HLLLADDFTTLGSWTVENQQTAYNQKNLKGRTTYDPTVPGTDAEAKFTVTRAGTYQIWVHAKHGNTSPYGENPWEFKVGVDSDNLNTVFGGAANHANFAWTKGNSMKLTTGEHSLRLIDFSANYARCDAVLVTSDANLTPEEDYTKLQKQLTAQTVTEPKVDHILIRPDNFAD
GUT_GENOME139680_01027 9-148 AEFKNKGGWRLETQFVRAVGQSYLIACDIPGEPVRDAVTEFEVNKDGRYRIFVRTKNWKYPEAPGQFNVFVDGKALPSICGKMPTNNWYWEIAGDTELKAGSHTVTLHDLTGWLTRCAAVIITDDMDFTPSPENERLQKQ
GUT_GENOME013513_00552 543-676 DFAEKGAWKVDTQFTHLMGSAYLLAPGVLKPIGSAKTRMHVEKAARFAVWARVKDWVPAFHPGRFALALDGQRLPHVLGASGRRGWTWEKAGEVTLAAGVHELALEDLSGAFARCDAVFLTDDLKETPSEVLTE
GUT_GENOME256537_01457 198-342 VKGAVEFKDDKTYIVLNPESFSGNLGCWTYITPNDAGGYATSYLKGTHENAENANYASRKIVIPHDGYYYIHSFAREYDTYVGQRFFDIQIGDSVRRRLGTHGQNGWEWESAEALPLYAGEYDLKVIDSSSNYARLAMIIVTDDP
GUT_GENOME238256_00690 40-175 DELLIEAEQFDSWGGWVNDSQFMDQMGSPYLLAHGLGKKVENAKTTFAVDKPGRYCIFARTRNWTREWNAERTPGAGQFAILLDGKKLDYVFGNGSKEWELEGGVEVELLSGEHTLELEDMTGFDGRCDCVVIAKT
GUT_GENOME243602_01196 36-177 ESKAQTCLIEAEAFQYHGGWRVERDEQASGKSLLMVTGGGNSAVDGITVIALKQAGDYTIWSRTKDFKQQAPRTRISRLVIDNTELEKQGMHGQDGLYWEKVGKLFLAKGNHVLAARDVNANYARLDAILLTADATLDPNKL
GUT_GENOME127907_02176 8-149 ADFQNAGQWKLDTQFTHLCGSPYLLACHTPGVPVRDAVCRFAAPFEGMARVWVRTKNWFLPDSPGRFTVSLDGHESAELGDMATHDWYWQIACDVHIEAGEHTLVLKDKTGYFERCAAVLITDDMDFVPARPMHEYVKLRAA
GUT_GENOME256537_01839 325-464 ILVEAESFSSLGGWVLDQQSMDLMGSPYIMAHGLGTPVKNAATAFNAPKAGQYRMWVRTKNWTELFGTNGAPGQFKVLLNGSETKETFGTHGADWFWQNGGVVTLKSGSNSLELKDLTGFNGRCDAILFTTDTSFTPIDS
GUT_GENOME025984_01949 4-151 YFLEAEAFENLGGWVIDQQSVEQMGSPYLMAHGLGVPVADACTTFEVAEEGLYRVFARTRDWTAVWHAPSSAGRFMALVDGYPLETVLGTNGPQWAWQAAGKIHLAAGRHTLALHDLTGFNGRCDAIVLSDDPALCPPDGGDALAEFR
GUT_GENOME256537_00090 161-292 FDVLGCWTMESTTVLKGKTNPEGQKTDVTNDDAIASFDILKDGKYKVWVNSKDYAKNQPGSRYFNVAVDGKRADLRLGAHGSEGFKWQEIGTYDFTEGQHTVALQDTSGFYARCQGIIISGDMNYVPSDNYD
GUT_GENOME238207_00692 49-188 VVEAESFADRGGWAVDQQFMGQMGSAFLLAHGIGRPVKDAVTRIALPPGRMRLWVRTRDWAPPQGPGKFEVFANGRSLGAFGVGGSGRWEWWQGGEFASPGGAVELRLHDLAGFEGRVDALAFLPPKARPPAAFSREWRA
GUT_GENOME195036_00556 22-172 AQSLLVETELFEQKGGWVVDTQFFDQMGSSYLLAHGMGKNVDDAYTTVTFPETGEYYLWVRTKDWAPFPKGPGKFQLSIGQILLDSVFGASGQPGWKWYSGGKINIVNKETEIRLKDLTGFEGRCDALFFSKKNVKLPDNLIAEKAFRKLF
GUT_GENOME256537_00815 303-445 FSENETNLVLLPDDFATDYGTWKNTKDGSTQILMGATSSSADVSDATAKIEIPADGTYYVWGYAKDYTSNSQGSRAAKIGVDRTALSGLIGIFGANTQTGASGAYGWDLAGKATLKAGTATISLKDVKKNYSRVAAIVLTSDK
GUT_GENOME264037_00254 5-177 MKTKWIAICCALLSLQGAVTAQVLVETESFAQRGGWVLDHQAFDKIESAYLMAHGMGRAVEDASTTVKFAEGGAYHVYVSTYNWTAPWYSGEGPGAFQLKVDNRVLTDKLGVTGSCWGWQYAGQVTLSAGEHEVRLHDLTGFNGRADAIYFTKTKEAPVGDYHVFAAERRRLS
GUT_GENOME175996_00566 40-168 FEAENGKLSGAMKKTKDTGASGGYFISVTEGARVDEPAKMQPEASYVFTVPVDGEYQFWTKSYMKDEGADSFWARFDENAYIYCGGAVTDGYTWKNHSGVTAKLTAGTHTLDIVHRENNTKFDMICITT
GUT_GENOME256537_01285 176-304 VTAYSFTDLGEWTLSDGYMLGTTAQTHPSTSINVKMPGTYKVWVYSKNYSDYNRTFSVSMDGTQLPNLVGNHKYNAETSVGSPSTWSAWQEAGTVELTAGAHRIDLKRMTGWVRCSNVFVTSDLNMVPD
GUT_GENOME256537_00610 206-340 NGEAVYTVLDYTSFNPLGTWEVQQDSGTQLDSFLFSSTKSSGDADNAKAVFGVTQDSVYDIYVHAKDSTVNPGYRSFYLSVDGGEEVLAGGHGKEGWAWEKVNSVPLFAGEHTLSLRDYRGNFSRVDMILITNDR
GUT_GENOME049363_01326 31-185 ESQINYLLRPDDFEKQLGTWTIQSDQLAFESSNLIGLHENQVHDPTKSKDAETEISVQKEGTYNLWVHAKKNAAGETVSSNAWTFQVGVDDKILEPVFGGKASATGYVWQNGGTIEMEEGTHTIKLIDSSANWARCDAVLITSDANLFPSNDYTQ
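
Each row of the sequence list carries three fields: protein identifier, residue range, and amino-acid sequence. A minimal Python sketch for splitting a row into those fields follows. It assumes the identifier has the form `GUT_GENOME` plus a six-digit genome number, an underscore, and a five-digit gene number; that layout is inferred from the rows listed here, not taken from a format specification. The names `SeqRow`, `parse_row`, and `range_matches_sequence` are hypothetical helpers introduced for illustration. Whitespace between fields is optional, so rows parse with or without separators.

```python
import re
from typing import NamedTuple


class SeqRow(NamedTuple):
    protein: str   # e.g. "GUT_GENOME000496_02809"
    start: int     # first residue of the range (1-based)
    end: int       # last residue of the range (inclusive)
    sequence: str  # amino-acid sequence covering start..end


# Assumed ID layout: GUT_GENOME + 6-digit genome + "_" + 5-digit gene number.
# The fixed digit counts are what make a separator-free row unambiguous.
ROW_RE = re.compile(r"^(GUT_GENOME\d{6}_\d{5})\s*(\d+)-(\d+)\s*([A-Z]+)$")


def parse_row(line: str) -> SeqRow:
    """Split one 'Protein Range AA' row into its three fields."""
    match = ROW_RE.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognised row: {line[:40]!r}")
    protein, start, end, sequence = match.groups()
    return SeqRow(protein, int(start), int(end), sequence)


def range_matches_sequence(row: SeqRow) -> bool:
    """Check that the inclusive range spans exactly len(sequence) residues."""
    return row.end - row.start + 1 == len(row.sequence)


if __name__ == "__main__":
    example = (
        "GUT_GENOME000496_02809 27-144 "
        "MGQGYLLALDRPGIPVAPAHTNILIDTPAVYRIWVRTKNWLRGHAPGRLKVHVDGNPLPG"
        "ELGTQPNGQWYWDIAGDADLGCGQHTISVEDTTGYFGRFSAVILTSDMDYTPEHQVGR"
    )
    row = parse_row(example)
    print(row.protein, row.start, row.end, range_matches_sequence(row))
```

The length check is a cheap way to catch a wrongly split row: if the identifier/range boundary is misplaced, the range span no longer equals the sequence length.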