UHGP-MC 63849


Information


Number of sequences (UHGP-50):
94
Average sequence length:
118±8 aa
Average transmembrane regions:
0.01
Low complexity (%):
4.22
Coiled coils (%):
2.15
Disordered domains (%):
1.34

Pfam dominant architecture:
PF08264 - PF06827 (architecture)
Pfam % dominant architecture:
2447
Pfam overlap:
0.46
Pfam overlap type:
reduced

Downloads

Seeds:
MC63849.fasta
Seeds (0.60 cdhit):
MC63849_cdhit.fasta
MSA:
MC63849_msa.fasta
HMM model:
MC63849.hmm

Sequences list (filtered 60 P.I.)

Protein Range AA
GUT_GENOME081295_02250805-922KWDKLLAIRSEVTKKLEEKRASKEIGKALDAQVILYAEGETLELLQQMQSELATIFIVSQVTVKAMSEAPADAYVSEDVKGLDVVIAPAAGVACERCWIHYEELDENGLCPRCAAVMA
GUT_GENOME286888_00628859-976ESAIRKWNNIFFIVNKVNIKIKKEILDKKIKNSLQAKVILGINQKTKEFIDLNHEDFLRSLNVSVLETNISDKSYIKIETADGLECKRCRNYSLDIGKDIKYRYLCPVCAKIMNEKVD
GUT_GENOME147751_000531132-1263EEFNDAFWDDVRYIKDQVNKELENQKANGIKSNLEAKVTLKYADDANGTIKKLKLLGEEVRFIFITSQFVISEQAGGIDDENIQYNAGNTTVQAVVTRAEGDKCPRCWHYTTDVGKVAEHADICGRCVSNIA
GUT_GENOME188886_01583872-990RWAKLVSLRDAVNKALENARNAGVFKKAQDTDVTLSVSESDAAFLAGVDLASLCIVSKVTVTTGAVEGEKSEDCLIPCTIAVALDESPKCPRCWNHSEHIGADGHHNQLCDRCAAVVGE
GUT_GENOME133031_00014849-953FTLSAKKKIRDAISKKIIKNSLEAALILNVLEEKREFLEENKENIKDALNISEVKINVVDDKSKRKVGVVKKDGEMCARCRHYTLDVGRDVRYVYLCENCVKILE
GUT_GENOME018404_00835818-929KWNKIRDVRDLVQKELETARQKGIIGASLEAKVIFKTSSPEMKQFLQETKALWPEIAIVSAVDVQDGADELTIEAVHAPGTKCPRCWQWKEDIGADPKHTDLCARCAAVLER
GUT_GENOME073912_00558815-943GEEELLSKWDDFMEVRSHVLKSLEEARNAKLIGKSLEAQVDLYLTDDQKQLLDSLNENIQLLLGVSALHVHPADEAPADADQYNDGVAVKVTTANGETCARCRMVKEDVGSDPAYPELCARCAAIVREN
GUT_GENOME049328_00988528-642DEANEFEGVLEASLELRSAVTKALEDARSAGTFTKSQQVRVKAVVPAEMYALLTGDKAVDLAEFYIVSDVELTQGEELSVAIEAAEGECCDRCWNYRTTVGEYNGHAHICKRCAD
GUT_GENOME141741_01062802-919DFRSEVQKALEVARDNKVIGKSMEAAVTVYPTEPVRDMLDDVEANVMQLLITSHFEVAPLEAKAPENAEQFEDMAVVVEHAKGTVCPRCRMVRTDIGTDPKLPELCSRCAAIVEANYP
GUT_GENOME257279_00203942-1051FAPALEAREVVTKALEDARAAKLINKSQEAAVVLAAPADQLAALEAFGTDALAELFIVSGVGFAEGEGLSATIRSADGEKCPRCWNFRALGGNPNHPDVCERCGDALDAI
GUT_GENOME117662_01377795-914DALKEQFDLFMNLRTDVMKSLENLRAEKVINSNMEGKLTITLKDEYKSLAALEDSMKQLFIVAKVTLDDNAEGKDEYETCFIKAEKFHGVQCPRCWNWFEEGELVDGLCPRCHEVVETLP
GUT_GENOME095591_006808-126ERIAARFERVFEVRDVVLKELEEKRATGELAKAQEAAVHIYANSAHLDALRALDGAGVEELLVVSEVHLHESADDAITVEIQHARGEKCPRCWNYRSLGTQSDYPDVCERCAQVLREIN
GUT_GENOME103664_00709810-917AKWDRIHRIREDVQKALELARKDKTIGKSLEAKVTLCAAGELHDFLQSVESQLPEIFITSAVALSDGEGDFTGEVEGLSVSVSKADGEKCERCWKYDETVGQNAEHPT
GUT_GENOME276219_01056850-969KELTEKWNKILELKELVSKNLELARADKIIGHSLNAKITLYANDKDYEFIKNNLEILKTVFIVSDLQVEENLRKEKEKIGIKVEMAEGEKCERCWMYSTTVGEDKENPTICHRCSQVIKE
GUT_GENOME204436_008931098-1222ADDAFKAKWAQLIAVRDEVKKVLEQARAEKLIGASLEASVTLYCSDAVYDLLNSIPMDELADLMIVSHVELVKGEGGAASAVEGLGVAAAHATGDKCERCWKYSASIGSHPAHPTLCARCASVVE
GUT_GENOME237239_01581803-923LESEWTSILKVRDDVLLSLERARDNSTIGKSLEAYITICTKESSTKDLLAKYEKYLNEIFIVSKVTLSDSKDDTFIEGGVSFVKTEKASHDKCVRCWGHYDSVGTDSEHKELCTRCAEAVR
GUT_GENOME171643_00619800-922WSAFMDFRGQAQKALEEARNEKVIGKSLEAHLTVYPNEVVKTLLGAVDSNVAQLLIVSELTIAEGPAPEGPAPEGAVAFEDVAFTVDRAAGEVCDRCRRIDPTTAERHYHATICDHCASIVEE
GUT_GENOME127045_00105805-909IALRSIVLKALEEKRASGVIGSSTEAKAEVFTNNETLFNVLKKLDDNEIARLFGVSEANITEGETKATIEKDDDPVCERCRNHKKDAVMRENGSVLCSRCLAALG
GUT_GENOME018491_00175802-908HVLRDAVLKALEEARQNKTIGKSLEAHVTLNVSDEDKALLEKNFKDKVNQWLIVSKVDYTSEDLPKYDDFGIKVEKVAGVVCPRCWNITDSTDLEGLCDRCKKVLAK
GUT_GENOME178939_01386193-300AGVEKILALREKANEELEKLRQAKEIGKSLEAEIEIEAGAGTPEAKALREFSANLPEIFIVSSVKVSEGDFETPRIRAAKAEGRRCSRCWRVLPDLPEDGICPRCKKS
GUT_GENOME239277_01022831-956KWKKRLAVRSVAMKALEEARQAKVIGHPLDAEVTVYADGEAYDIVKAMEKELADFLLVSQTHIVSGTAAAPENAASNEEGTVKASVAVCTLAKCERCWKRSADVDADPKHPGVCARCAHVLTEMGE
GUT_GENOME206930_01455698-816ADDAFTEKWNLIYQTRLDVNKMLEEKRNEKVIGKSLEAAVEIAVADDAAYAILSENRELLEKVLIVSSVTVTKTDAAENQYIITTAAGEKCERCWMYSTTVGEDKTHPTLCARCAHVIS
GUT_GENOME259479_00886791-906DYDVSILEEYDNFKTYRDLVLRKLEEARSNQIIGSSLEAEVNLTLDEKENALLSHFNEDELKDIFIVSRVHIKTGESDISISHHQGNKCVRCWSYHDKLIDVEGNLVCPRCKKVLT
GUT_GENOME027890_00246805-938PRYVDEALEERWNKRLALRSEVMKVLEESRQAKEIGHSLDADITIYANDDAYGTLKDMGAELADFYIVSTVTLVHGLDQAPEQAKLTEEEDMKIVASASTREKCERCWKHDPSVGSMAEEPHLCARCASVLKDR
GUT_GENOME039172_00639813-929EGVYENIERLLQIRDMVNEELEKLRKDKVIGKSLEAEIEIMVSSDSADADILEAYKDKLPELFIVSEVKIVKGSFSTIAIQAGKAEGERCVRCWRVLRDLDEDGICPRCRAALDELA
GUT_GENOME202059_001031085-1203AFWDELLKVRGEVNKVIEQARADKKVGGSLEAAVTLYAEPELAAKLTALGDELRFVLLTSGATVADYNDAPADAQQSEVLKGLKVALSKAEGEKCPRCWHYTQDVGKVAEHAEICGRCV
GUT_GENOME056587_00662729-851DESLLKEYALISKLRSEVLKVLETKRQEGVIGSAQEASVMLNIKDEATKEAFNKFSKVEQERLFIVSKVELVDEKLPNDMELSSIDVKKNNGHKCERCWNYVDHVTEVDGVHLCDRCLDAIKE
GUT_GENOME089795_01791807-919AKWSRIAALRSAVNGALEQARADKVIGKSLEAAVALTVPQEDAFLQEMDEDALADLFIVSQVQLTVGDEVKVAVKSAEGTKCGRCWKVLHSVKAVGEHEALCPRCAAVMAKLP
GUT_GENOME231436_0251781-179AVESDWNAIRALRQQVTEAIEPYRREKQVRSSLEAEVTVPDLPASAEDLAELFIVSAVTKGETVTVMPTQRHKCGRCWRHLPEVAQDGALCDRCAGVVE
GUT_GENOME059944_00843794-917ANDKLEAEFTKILKLREIVTKAIEPLRADKQVGSSLEVAVYVQGGDPELLKKYESELANIFITSQAELVDKAPEHVLNEYKEDDYTIYVTEAHGEKCERCWKYRKLIQVGNHGEICQDCINAIN
GUT_GENOME233272_00281124-233YASFVHLRNIGLKALEEGRAKGLFGSSTDAALSLTIADDNLLEILRSNSDEMLSKFFMVSKVHLATGESDSASATVGEGELCPRCRQKVSELVEHEGEHLCCRCAKALEE
GUT_GENOME006921_01717814-934DNEEVIKEYYDRFLLLRKDVLKSLEELRNAKEIKSNMEAKVTICLKDSYQDMDKLADVLKQLFIVAKVQLVNDSTNLEEYDTAYIHSEKFKGVQCPRCWNYFEESEMEGELCPRCHDVIHG
GUT_GENOME223796_00645792-921EELLGNWQEFLDFRDKILKALESAREAKLIGKSLEATVTIYPNEVVRTLLTAIDENVAQLLIVSNFVVANEPVNNAPESAMKFDDLAVLVEHAAGEVCDRCRRTDETVGHNANENLKMLCEHCAHIVETE
GUT_GENOME274349_00490856-966KWDRIFRIINVARKQINKAKNDKIIKNSLEAMLTINVKKENKEFIEENFDDIYLSLNISEMKVVESEEESVEVTKHEGVQCAMCHQYSIYIGRDLKYRYICPRCAEVMEGH
GUT_GENOME122131_01016473-581RFDSLLAARDAVTKALEDARTAKTVNKSQEAALTIAAPEAAFEVLSSADAADLEELFIVASVAVERAEGEDIVVTVEKTSADKCPRCWNYRELGGNAAHPDVCARSSRE
GUT_GENOME284170_00158878-987KWDRIFKFKTKMKRVLGSAQKKKIIGNTLQAKVIISTNKEAKKFIDDNYDDILRSINVSQMIVQESDKERVAIEKAEGVECARCKNYAIDIGKNLKYRYLCPKCAEIMEE
GUT_GENOME096462_01575827-952DALMKKWTAFMKLRDDVLKALEEARNEKVIGKSLTAKVTLYVNEESRALLNSIDEDMKQLFIVSGFEVAGTLEQAPENALQLETAAIVVTKAEGETCERCWVVSKEVGSNPEHPTLCPRCASVVKE
GUT_GENOME205883_01388858-969KTDMEELFKVRKDVFKALENARAQGLIGKSLEAHVVLHVSDAQKEIMDRVLSNAAQWFIVSKVTFTTDELEQFEVCQVQVEKATGHVCPRCWNYTESDNEDGLCDRCNHVLH
GUT_GENOME157109_00026810-936VEAKWADIIKVRKEANKSLEKARQGENRIIGNSLDAKVMLFSKNADMQNFLAENRDRLELALIVSNVEIVDSCDETFVEGEELKDLYIKVVHAEGEKCERCWKYSTEVGVDAEHPTLCPRCTSVLKN
GUT_GENOME056438_01034485-617LPQEAMEKWDVVIRLRQDVNAVLEAARAAKRIGKALEAHVSLEAQDDAAGAALEAVRELDLAEICIVSSCALEPVPEEATQGQGANFPGLRIGVTEAKGAKCPRCWMHSLQANEEGLCPRCAGVVSRLPQDAL
GUT_GENOME014048_00684816-931FAKLLAIRSHFSEVLDSLKKQKSLKSGLELCVKGDFMGAQKDSMDKALQDKKFSEMVAEWLIVSDYECGEKITDFMLGDQTFGVYKSSGHKCPRCWRYMAESENELCARCKGVIEK
GUT_GENOME172392_01565168-273MEYNVEIKGIIEKLIGKSLEAKLTVACRFDEDYNMLSGMPEALRDICIVSDLHVIKDESLPTDTPFAVTVEKAVGDKCERCWAYSETVGTVPEHPTLCARCAGVVK
GUT_GENOME031212_01192544-663AIDVDDKFMAFWDRIHELRDNVKKSLESLIKDKTIKGSLEAKVTLCAGGETLEFLKKAEPELCAAFIVSEVEIVDNGGELEIKPEKAEGEKCERCWAISKTVGQCAEHPTICFRCVQNLK
GUT_GENOME081289_00528817-925SVRGDVNGALEPARNEKLIGKSLDAEVVVYAKGELFDLLKAEENNLAEIFITSKAAVESAENAPENAFAGEAVKVKVKASKHEKCGRCWIHSETVGTIPGYDGICADCV
GUT_GENOME111752_00154811-925RYEKFFLLRDPVMKALELARADKKIGKSLDATLTLYVPGEEDYQLLSSFGEELPALFIVSDVKLVKGGIPAELKTTDEAPLGALVKNAEGEKCDRCWNYTRTPFHDADVCVRVAA
GUT_GENOME254219_00812803-926LSLQYEHFMNLRNAVLKALEEKRAEKIIGKSIDASLLLHVKDEVIKEIIEKMDEDTLQHIFIVSKVTLKDCDCDLKDYGVASLKVYENNGIICDRCWNRIDADKICEGNLCSRCYKVIKDMHHE
GUT_GENOME140365_05393850-976DEILAARWAKVRRVRRVVTGALEIERANKRIGASLEAAPRVFINDDELFAAVEGLDLAEICITSALKLERGEGPAGAFRLDDVRGVAVVYEPAEGRKCARSWKITPEVGSDPEYPDVTPRDAEALRE
GUT_GENOME018916_01306820-940WSAEEASALLSAYNQLATARDVFTKAFEEAKEAGVVTEGTSQAAFATLTLPADAATALADVDLAEVFVCAAVEVVSGKGFSCTVAPAKGEKCPRCWNVRELGGNANHPHVCERCGDVLDSI
GUT_GENOME047079_01360711-837RWEAVLAARTEVTRAIEPLRKAGTVGHALDTAVTLYASPELLEVLGGIGTDLRAVCIVSQLHLAPLADAPADLAQADIAECGKLAVSVAKAVGEKCERCWIYSDELGSDPEHPTLCPRCAAVMKELA
GUT_GENOME023640_00955801-929WNRFMNLRSGVFKALEEARNEKLIGKSFEAHVDLYVSNGVQADLDALNANVRQALIVSALDVHPLSEAHLIKRRSEAPENALKFNDEYAVVVEHAEGEVCPRCRMIKTDIGSDADLSTLCASCAEIVRE
GUT_GENOME228819_01117811-929KWNKLIAFRDDVNKALEGARNAKTIGKPLEAHVTVYTDADTAAFMNGCGQDLADLCIVSEMDVVAGEGEGLASEELPGLTISVVRAAGEKCLRCWKQAKSVGSDAAHPALCARCAKVVG
GUT_GENOME198215_00706893-1018IDPALEKKWEDFIGIRSEITRVLEVARKAKTIGHSLDAKVELYAEGEALAVLKSVEKDLPALLIVSQAELHEGVGEAGEATTREDLKVAVKAAEGHKCERCWIYSDTVGQDAEHPTVCARCAEAVK
GUT_GENOME029805_01649807-935DAELAAKWDKLLDLRSDIMKVLEGARQEKTIGHSLDAAVTVYADEDTYRFLAPMQDSLAAFLIVSEAHLVEGTDAAPEQAAAGENHPAMKVSVSASSYEKCERCWIHRESVGQDAAHPTLCSRCASVVE
GUT_GENOME215361_00560842-945AVRADVQKAIEDERTSGTIGSSLQTTGEICAASPLYEVLASLGDELRFVMIMSEVKLTKAADGAETTVSVKPSTEKKCERCWHYVPGVGSNAEHPTLCPRCVSH
GUT_GENOME186793_00727373-498ARWDTLIALRSDVNGVLEQARADKRIGKALEAHVALCARDEAAAKALETVKDMDLTALLLVSDVILTDDDPDAANVTGSGTAFPGLRIEVRNAEGVKCPRCWMHSTKANADGLCPRCAAVMAELDL
GUT_GENOME036200_02825133-249AIRSEASKALEEARRNKTIGHSLDAAVAIYAEGENKALLNEKAADLANILIVSQAYVADFADAPSDCYKNDELKLAVKVGAAKGVKCERCWIYSEKIGQDSHYPTLCPRCAKVLEEE
GUT_GENOME208127_00249839-975PEAAAEHRDAELGQRWEKRLTLRTDILKALEAARQDKVIGHPLDAKVVLHAAGETYDELMRIVGDLAALAIVSEVEVVEGTAGAAGMQAETTPDLVCEIQKSTAPKCERCWIHSETVGDSAEHPTVCARCARVLAEE
GUT_GENOME254774_00291828-948FEAKWNKIHAVRDDVLKALEEKRSAGVIGKPIDADVTLYADEANYKELSAYPELAQAFSVSNVTVKNGADGEYKGSADGVSVTVEKAASEMCERCWSHDKSVGSDEKFPHLCARCAAVMKL
GUT_GENOME147679_00041798-923DEVLANWTEFMVVRSDVLKALEVARNEKVIGKSFEAHLTLHPNKETKELLDKLDSNIRQILIVSELTLTDDELTGDNVQDLKSGQILVEHAEGEVCPRCRRITNDVGSDARFPDLCARCANIVAES
GUT_GENOME178666_00867840-947LNKWSIIRDLRSNVQMEIERQREKGLIGSSLQAEVSLKLPQEEYDLIKGLGDEAAFVMITSKVTLDGISPERVITVKPSEAKKCERCWQYKESVGEDKNYPTLCCRCV
GUT_GENOME054076_00218792-907DNALNNKYLVLNKIRDQANKALEAARNQGLVKGSNEAELKLDIKSGNYKELLDDLDKVELARLLSFSKVSYVEGTGTEVVPVKGDKCLRCWNFFNKLNDFMGQPVCPRCEAALKSF
GUT_GENOME109793_00179817-930LLALLDDVNQKLDELRSAKVIGQSLEAVVSISADPASPTFALLKKYAAQLAETFIVSEVVLKEAPAGTPVSVDAVRAESLGYVRCPRSWRWVPELADAGAFGKVSPRCLAALKE
GUT_GENOME038476_010371333-1456EEFIARWDRIHAIRDVVKKALENARAEKTIGASLDAKVTLFCTGELYAFLKSVEDELATVFIVSQVEIVNGEGGTLSDDALGLSVQVDKAEGCKCMRCWTYSSTVGKNPAYVDLCERCAKALSE