UHGP-MC 17571


Information


Number of sequences (UHGP-50): 73
Average sequence length: 128±12 aa
Average transmembrane regions: 0.01
Low complexity (%): 3.7
Coiled coils (%): 0
Disordered domains (%): 0.5

Pfam dominant architecture: PF13549
Pfam % dominant architecture: 84.93
Pfam overlap: 0.58
Pfam overlap type: reduced

Downloads

Seeds: MC17571.fasta
Seeds (0.60 cdhit): MC17571_cdhit.fasta
MSA: MC17571_msa.fasta
HMM model: MC17571.hmm

Sequences list (filtered 60 P.I.)

Protein Range AA
GUT_GENOME146002_00319 91-210 DGFIITQMVFGEELYIGSVEDSTFGNVILFGKGGIYLELYKDVCYIESNAREDEIKRALATTKIAKLFDGFRGFDYKIEWVINLVKSVQKMLQENEIKELDINPLKLTKDGLVAVDARIL
GUT_GENOME186073_00375 93-226 VAERAPGADIHGVLVAAQAESGLECIVGMTQDASFGPAFMFGLGGIFVELLKDVAFRVLPLDKAEVLRMLRETKGYALLSGARGRAPMDVDALAQLILDVARMVEENPEIKELDINPLFVYSKGVLTVDARVLL
GUT_GENOME140366_07894 75-207 IMAACTASSPDAAIDGVLVEEMVSAGLEVFIGARVDPDYGPIVLFGPGGSGVEQGVKPTAALAPIGEAEAFGLVDAAFPGRFANDRAASRAELARCLLAVAGRDGLIMQEDVSELDVNPIIVTESQAVAVDAV
GUT_GENOME037862_00149 571-710 IMTSCKSCVPDARLEGVLIQEMAPKGVEMIVGVTKDSQFGPMIMVGMGGIFVEVFKDVASALCPVSQQEAMEMIDSLKAVKLLNGYRGAPACDKSALADTIVKLSEFAWEQRDTLCETDINPIIVYQDGKGVLAVDALVV
GUT_GENOME143282_01806 595-739 IVKNARSYNPEARVHGVSVQEMLGEGVEIIIGAHNDSTFGPVIMVGLGGIFVEVFQDIAFKVAPINRQDALNLLDELKGKSILHGARGKTPIDTEAIVDVLLKVSALMTDHGEQIQELDINPLVVYEKGIRAADAMLVVKEQQVI
GUT_GENOME160450_01184 575-713 IIANAKKAHPDIVPDGVEVQKMMETGQEVIVGMIKDKQFGPMIAFGMGGIYVNLIEDVSFKLANGISSQEIDEQINDTKVSELLKGYRGEAPCDIDAVKEAIKRVARLTLDFPEISELDINPIFVYENGSSALDIKIKL
GUT_GENOME274086_01747 561-701 LLERLEAKGLLEGLEGVIIQEMVKGNREMVCGIATDPQYGPMMMFGLGGVFVEVMKDVTFRIAPLTDVDAEEMIKSVKAYKLLEGARGTKPAQLDQIKETLLRLSQLVNDFKFIDELDINPLLISEATGEGIAVDGRIKVR
GUT_GENOME210257_01663 560-684 ILKNAAAFAPQAVIHGVLVQKMLDPGMEVIVGVTCDPQLGPMVLTGLGGIFTEVFQDAALYPAPLNLWEAREMILSLKGSRLFLGYRGRPQLDIDGLAQVLVNVSSLACAYKDTLVEMDINPVFV
GUT_GENOME094792_00301 585-711 VNGVLVEKMEKAETELLIGMQTDPLFGPTIAFGVGGIFVEVLKEISLRVAPLSEDDAYDMLSELKGAKLLDGARGKPPCDKKSVCRAILCLSRMATELSGIVREVDINPLFAFTDGVRAGDALVVLE
GUT_GENOME096381_03432 777-907 LRPVVQAMAPRGVGTVIRGTLDPAIGAVLSFGLAGTPSDLLGDVAHRLVPVTDKEAAGLVRAIRSAPLLFGWRGAEPSDTAALEEVLLRVSLLMDDHPQVVAVELDPVVVAPRGVTVLGAAVRVAAPAHRP
GUT_GENOME147550_02265 761-893 NVTDFEVQSMAPTGQTVVLRAAEDPLIGPVLSFGMTGDAVNLLDDWAHRVPPLTDRDITRMVRAPKAARKLFGYQGVPPVDTTGLEQLVNRAAFLKDRFPQIAFLELNPVVLSGPVLTVLSAVVKIGDPGQRT
GUT_GENOME186073_01261 96-233 QRVALGAPRAEIAGLLVVKQALPGLECVAGMTRDPQFGTALMFGLGGVFVEALDDVALQLLPVDRDEALEMTRGIRGARLLDGYRGAGPLDRPAAADLICRLGALAEAEPELAEIDVNPFFLYPQGLLPVDVRMLLKK
GUT_GENOME000530_00579 779-907 AGQACMLTALEDPLLGPVVSFGIAGDATDLLGDWVHRVPPLTDRDVARMVRAPRAAVKLRGTGGVPPVDLAALEDLVGRVSVLKDDLPQVARLRFAPVLAAPEGVTVLQAEVHVANAARRTDSARRALR
GUT_GENOME245344_00517 85-227 IMANAAKNAPGAWVQGVLAEQMLPKGFEVLLGVSTDPQFGHVIMAGRPWQALGGVLVELLHAVSLRVLPITRQDAEDMIDETPLAKACQGLRGQQYRREEIVEALMCLSRLVEEHPEITEVDINPLMLYGDGRPATAVDAMVV
GUT_GENOME095736_03303 581-716 NVARAQPGLVLDGALVEKMGERGLELVVGASRDAQWGPILMVGLGGIWVEALGDVQLLAPDLPRAAIIERLRRLKAAKLLDGFRGAPPIDLDAVADVVALLGQLMLQYPEVTEVDINPLVAYGRGQGVVALDALIV
GUT_GENOME096290_02195 745-871 RDTQFFVQKQAPDGVAVSFSATEDPLFGPLVSFGLAGAPSELLGDRAYGIPPLTDVDAEEMISGLRSAPLLYGYRGAEPVDVNALEDIILRLAALQDDLPEVSELDVEPVHVSADGSWVLSARAKVT
GUT_GENOME258991_00877 585-685 SGVEMLIGAKNDPSFGPSVMVGLGGIFVEVFKDFSLSLAPVNRAEALEMINSLKGSKLLYGARGAKEADTEALADFIVKFSQMAAANRDTFAEIDINPVIV
GUT_GENOME000452_04797 578-704 AWLVARMQRGLHECALGARVDPVLGPVVMIGSGGKYVEALRDVAVLMPPFDAAQVVEKLRGLRIGALLQGVRGDPPADVAALAQQAVALGNYALAAGPRLASVDINPVMVGAVGQGAVAVDALVELH
GUT_GENOME143497_02672 570-700 PDARLDGILIQEMAPSGLEMIIGVTNDPQFGPVLLAGLGGIFVEIFKDVALCPCPVNTYEALGMLRQLKAYKMLEGYRGSRPCDLKALTDIMVKVSQYALEHKNTIAEMDLNPVFVYGEGKGAIAIDALIV
GUT_GENOME285944_01413 824-948 IQVQQQISGNLEMIIGASVDPALGHSILTGLGGTLVEILKDVSFGHVPLSSQDPGRMLRSLRSYRLLEGYRGSAKANIEQFRQILMQVNQLLLDFPAICEMDMNPLIFDETRGAFYAVDARIKIS
GUT_GENOME096500_03372 568-704 IMENAKKYSEDARINGVLLTPMLLDGMEMIIGALRDQQFGPCVMVGFGGIFVEVLKDVSFKIAPVSYSEAKDMVEKLQLYPMLKGIRGQAKLDVDAVVDTIVKVSRLITENEDIKEIDINPIRVYEKGIAALDGRVI
GUT_GENOME031618_03397 647-744 ETIVGVNRDGGFGSAVMVGLGGVFVELLRDVSLELAPLSPEGARAMIGRLKAAKLLTGFRGRAASDVEALSDVLVRVAAMCCALGDRLVSLDLNPVMV
GUT_GENOME029331_00322 590-718 DANIFGVMLTPMLKGGVECIIGSSWDSTFGPTVMFGLGGIFVEILKDVSFRVAPVNMPSCRRMVREINGLAMLQGARGSKPCDLEALAEAACLISHMVDELRDIAEVDLNPVFAWEKGLAVADARIVLQ
GUT_GENOME243081_01116 566-686 PTARIDGVIVQRMAAQGVELIVGAVRDPVFGPALTVGLGGTLTELYRDVTHRLLPVDATGAREMLQELKAFPLLDGYRNTPLRDLDGACEAIAAFGHAFVALGETAGEAEINPLTVRERGQ
GUT_GENOME096069_00348 557-685 ILEKARKFNAEANIEGVLVQEMVGSGLECMIGVKRDPIFGPFVAVGLGGIYVEVLRDVSLRRAPIDENTAMEMIEELKGYPLLAGARGQKRKDINALAKAVSLISRMACIESELNELDANPVFVLDEGK
GUT_GENOME000079_01706 578-708 PDARIEGVHLESMAKTQFELSAGMVRDPQFGPIVMFGLGGILIEALRDVTMRTAPLSKGEALEMIGSIRASSLLEGVRGMAPVDKEQLAQIVLGLAAMALDHREISEVDINPLAVTEQGVWAVDARIKIGE
GUT_GENOME143124_01068 591-705 GIETLVGVHRDAVFGPMLSFGAGGVAVEIVADVARCRLPADRPAIEAMVDQTRVARLLPAHRGRPAYDRNALVDAIERTAALFLSLGDGVEALEINPLLVLEQGQGAVALDGVIH
GUT_GENOME243584_01588 565-703 FERVRRAAGTARFDGALVARMVRGWGEIMVGVRRDPVFGLVALAGIGGTAVEIFRQMSFGLAPVSRERARAMLTQSRAAALCAGHRGHPPLDLDAVADVIVNVSRAADAIGERLDTLEINPFIVSADGLVAADAVITLR
GUT_GENOME285774_02509 547-676 IAQGKDFRGVIIYPMLKIGKEVILGLTNDPTFGPLVAFGMGGVYTEVLKDITFRVAPIGKDEALKMIKEIKMYPLLRGVRGEAPSDTEALAEMISQFSLLPFFYPDIHEADLNPVFLYEKGACAADVRIL
GUT_GENOME232847_00853 103-216 EMAKPGLELIVGGKIDPAFGRVLTFGLGGTMVEFHKDVGIRLLPASDDELRSLIREIKGYTLIKGYRGDAPKDEEFLFQVLKNACRFFEDNENVVEFDINPLRLYEKGGCAVDA
GUT_GENOME256102_01171 568-703 ILASCREKAPGAEVHGVLVTRMVSGQEIIAGLVHDRQFGPTLMVGLGGVFVEILKDVNFCILPGTRKELRGKIESLKGYPLLNGARGREKTDVDALVELLYNLGRLAVENPEIQEMDINPIFVSAEGAVAADARII
GUT_GENOME212052_02970 566-709 LLRRAAEKAPQARLDGVLVARQLQGGVECFMGIQRDPLFGPVALFGLGGIFVEVLQDVVFRRCPFEVDEAEAMIRSIRGAPLLLGARGRPRADVAALARLLSNLSRFAWEAGERLRSVDLNPVIALPEGQGAWAVDAVLEVEEV
GUT_GENOME084598_01435 569-711 LMDNAKAFAPNAQIDGMLIQEMAGKPVAEVILGITRDPQFGPAVVYGTGGILVEILKDSTIGLPELTKEEALKMIQGTKGYRLLTGFRGAPKADVDALANTLVKVSKLAAEGADKVAGLDINPLLVYPEGQGVLAVDALIELQ
GUT_GENOME000060_00843 573-681 ILMAPMLKTGLEMIIGVNNDPQFGPVVMVGMGGIFVEVFKDVSVYPAPLNKTEAMNMLQSLKAYKLLNGYRGGDKYDIDALCDTIVSVGNFAAANKDSLKELDINPLFV
GUT_GENOME138909_00496 585-716 MMQIKDANSVMIQKMISGMELFIGAKYEDRFGHIVLCGLGGIFVEALKDVSSGLAPLTMPEAESMIRSLRGYKILQGTRGQKGINQQRYAEIIVRLSTLLRHATEIKEMDLNPLLAEGDNVTAVDARILIEK
GUT_GENOME232061_06248 554-696 MRESLARHAPQLVFERVLLERMAGAPLAELIVGVKREEGFGLALVIGAGGILVELLRDSRSLLLPTTDAAIRDALLSLRSAPLLSGFRGRPAVCMEALVAAIRAVAEFACEHAERLLELDVNPLLADAEGALAVDALIRLANG
GUT_GENOME143497_01088 818-942 ILVQEQISGKTELILGGVKDPATGHGVLVGMGGTNVEVTKDIAFGHVPAERCYIKQMIQSLREYPVLKGYRGAEGVDLKQLEDAAARVNQLLLDFPEIEELDINPLIFDEAEKRFIGADARIKIS
GUT_GENOME096544_03583 756-895 VLELAAGHAPEPGRAPLEVQAMVPRGASCVVRTVEDPLFGPVISFGLSGDASDLLGDVSYGVPPLTDVDVAELVRSARSAPRLFGYRGLPALDVAALEDVIARVSVLADDLPELRSLELNPIVVSERGAVVLGVRAQLAA
GUT_GENOME243136_02881 607-732 ILIAQQVKADLELVVGASLDAEMGPVVLFGTGGVDIELMKDVALAGAPLDAAEAKQLIGKTKAGVKMKGYRGKPALHEPSAVKALVGLSNLMADANGRIASIDVNPFLINSKLGVAVDGLIVLNNA
GUT_GENOME061911_01884 571-710 TFDLMVQRQAPGGQTAVLRALEDPLLGPVVSFGMAGDATVLLDDWAHGIPPMTSNDIDTMMRTPKAAIKLWGSSGVPAVRVDKLQDLIAKVASLKDNHPEIAHLELNPIMVAAEELTVVACHMKLGNPQQRTDSARRTMS
GUT_GENOME140601_00672 429-543 AGTELLVGVVKDPLWGLVLAVGLGGVFVEVLKDTSLRVLPVDRAEIVAMLEELRGAPLLKGVRGGVKADLEQVVDAIYRLTLAALSVSDQLQELEINPLWVHGAEVEALDALIKW
GUT_GENOME237422_01012 569-691 GIVIQPMLQGKEIFIGAKREGDFGTVVMCGLGGIFVEVLGDVSSALAPVSPYEAEKMIKKLKGYKIIEGVRGEEGVNEVIFAEMVSRVSSLCAAAPEIAEMDINPLLGNSTSVTAVDARIRIE
GUT_GENOME262274_01059 585-718 LMQIKGAEGVQVSQMESGVEVFIGVKKEGEFGHLITCGLGGIFVEVMKDIRCALAPLGKNEALEMIRSLKSYPIIRGIRGKQGINDEVIADTLCKISRLLAAVPEIEEMDINPLMGRGERLSAVDVVVRLSDAK
GUT_GENOME096459_01897 574-697 IEGVNVQHMVEKNGQELIMGANQDATFGAVIMLAAGGVTAEITKDKALELAPVDKDLARTMLESLRIWPLLEGYRGMSGADLDSLVTMVQRFSRLVVEHPEITEIEINPVLANPDGAVALDARA
GUT_GENOME191958_01032 556-693 ILANVKKHVPDAKVDGVLVAQMLRKGTEMILGIHRDSQFGPMVLIGFGGIFVELFKDSALYPAPFGKEEALEMIRSLKTYPLLSGYRGSAPLAIDQLADMLVKMGNLAVEKKEEISELDMNPVFVYEDSVCAVDALVV
GUT_GENOME117958_01682 574-693 LIQEMIVKASETIIGVNTDKNFGRVMIFGTGGIYTEVMRDTTMRILPADDFDAMIKETKIGTILNGVRGEEPKAVGPLAETLKKVQELVLEIPEIKSIDGNPVLVTKDRAVVVDFKMLLK
GUT_GENOME057011_00849 558-693 MMERIKTAVPDARIDGVLIEEMFDGRELIVGMVRDKQFGPVITFGLGGIFVEIMKDVSRGVVPITVEMADRMIKSIKSYPILAGARGKKSADIAALKSLILKISQLSVDFPEISELEMNPVMAGEVGCCAVDALVV
GUT_GENOME096565_01163 765-879 AGASLTLRAIEDPVLGPMVSAGIAGLPTDLLGDVSWRVPPLRRTDALTMLSELRAAPLLAGYQGAKAPQLHGVEDVLMRLTALKDELVQVVDCELTPVIAGLDSTAVVGARLRVA
GUT_GENOME096291_01262 793-907 TGVAVVVRSQEDPLYGPVISFGLSGDAVDLLGDISYRVPPVTFNDVSEMVRSIKAAPKLFGYKGLPPVDTEALENLIARVSLFSDDHPQIQSLELYPVIVGPEGCAAISAKIVVA
GUT_GENOME143124_01031 566-694 GLSFDGMIVAEYCKGMRELMIGARRDPVFGAVVVIGDGGKYVEVLADNAILLAPFSADEVRERLGRLRIAPILRGVRGEPPLDTQAFSEAVLAVGNLMLGDERIESIDINPVIVSAEGLGCRAVDAVVY
GUT_GENOME243196_02972 576-708 APAARIDGILVQRMEDRLLELMLGFRDDPLVGPMVMLSAGGITAELHRDVSLRPAPVTPAEAMQMIEEVRTTRLIRGFRGLPRGDVDALADAIVRFSRLAAVQGVTVAEAEINPLFVRANGVVGVDCVLRLAP
GUT_GENOME124925_00897 554-661 VVAQKMITDGIECMIGIKRDPLFGPVVAVALGGAYYGLMKDISLRVAPVSLETAMDMIASLKGYPLISGEWFGTPMDVEALAQQIVTLSQMGCAEPDIELLDINPIFI
GUT_GENOME186073_00654 575-686 TELIFGMKIDPNFGPLVIVGAGGIYTEMLRDRIILLPCATEEEIAEKLKTLKTYKLLTGFRGSKPADLDKLVGQIKQFCSLAAYLSQWAKEIDVNPVAVSGDKIVALDALIV
GUT_GENOME000612_00879 565-701 LIAHVLSLKPDAEMKGVILAPMVNGGVEVIAGMVKDPQFGPSIMFGSGGILVELLKDVSFRIAPLREQDAEDMIRETKAYKMLCGMRGDSPKDIGAMVNLLVSLGRLAAENPMIQEIDLNPVKAQEQGINILDARII
GUT_GENOME096283_00736 113-248 VLVQELAPKGTELLLGIKKDPTFGHQLIVGLGGTLVEIMKDFSMRMMPVTVQDIKEMLKELKGYPILDGYRGKRGINKQLLYSICLSLNDLVESTPEIEELDLNPVIFNGDQATICDVRILMGTPSMGEAKERDLS
GUT_GENOME020734_00683 568-698 PNARLDGVSIQQMVSGREVIVSIMRDVQFGPVLSFGLGGIFVEILREISQAHVPLSEEQLDEMITSTKAFKLLNGARGTAKSDIAAMKDAMRKLVLIAEENPEIREIEINPIIVGEEGKGCWAVDALVTLE
GUT_GENOME096381_05680 597-754 QVRDTYRELTDIARAEEIELDGVLVCQMVDPGVEMIVGLSQDPLFGPVVTVGLGGVLVEALRDTAVRVPPFGEEQARAMLGELRGAGLLEGVRGRPPADTDALVEVVLRVQRMALELGGGPAGPDGGGPREGSGLAELEINPLVVLPRGQGAVALDAL
GUT_GENOME243549_01946 126-243 IQVQQMLLGGTEVIIGSITDGSFGKLVAFGLGGVLVEVLKDVTFRLAPATNADALSMLDGIQAHEMLEGVRGGDPVNREALATIIVNVSQLVSDFPEIVELDLNPVFATKKDAIAADV
GUT_GENOME194757_03077 599-715 AGCEMIVGARHDPQWGPVVLVGFGGVTAEVLRDVRLLPADLTPAAVRRELDKLKGAALLHGHRGKPACDVAALAELVAALGALMQAEPRLVEIDLNPVMVYPEGQGVLALDALMLIA
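
Each row of the sequence list carries a protein identifier, a residue range, and the amino-acid segment. The following is a minimal Python sketch for splitting such rows into fields, whether or not whitespace separates them. It assumes, based only on the identifiers visible in this list, that every protein ID has the form `GUT_GENOME` + 6 digits + `_` + 5 digits. The names `parse_row`, `Row`, and `range_matches_sequence` are hypothetical helpers, not part of any UHGP tooling.

```python
import re
from typing import NamedTuple, Optional

# Assumed ID layout: GUT_GENOME + 6 digits + "_" + 5 digits (inferred from this list).
# \s* lets the same pattern read rows with or without spaces between the fields.
ROW_PATTERN = re.compile(r"^(GUT_GENOME\d{6}_\d{5})\s*(\d+)-(\d+)\s*([A-Z]+)$")


class Row(NamedTuple):
    protein: str  # protein identifier
    start: int    # first residue of the range (1-based)
    end: int      # last residue of the range (inclusive)
    seq: str      # amino-acid segment


def parse_row(line: str) -> Optional[Row]:
    """Split one row into (protein, start, end, seq); return None if it does not match."""
    match = ROW_PATTERN.match(line.strip())
    if match is None:
        return None
    protein, start, end, seq = match.groups()
    return Row(protein, int(start), int(end), seq)


def range_matches_sequence(row: Row) -> bool:
    """Check that the inclusive range spans exactly as many residues as the segment."""
    return row.end - row.start + 1 == len(row.seq)
```

Checking `range_matches_sequence` on every parsed row is a cheap way to confirm that the identifier and range were split at the right place, since a wrong split changes the implied range length.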