UHGP-MC 4663


Information


Number of sequences (UHGP-50):
163
Average sequence length:
135±15 aa
Average transmembrane regions:
0.01
Low complexity (%):
10.43
Coiled coils (%):
0
Disordered domains (%):
0.73

Pfam dominant architecture:
PF00731
Pfam % dominant architecture:
7485
Pfam overlap:
0.83
Pfam overlap type:
equivalent

Downloads

Seeds:
MC4663.fasta
Seeds (0.60 cdhit):
MC4663_cdhit.fasta
MSA:
MC4663_msa.fasta
HMM model:
MC4663.hmm

Sequences list (filtered 60 P.I.)

Protein Range AA
GUT_GENOME030478_014914-150VVIMMGSKSDEEKVSPCVDVLRSLGISYLFTVSSAHRTPERTEKILREQEAAGAKVFICAAGMAAHLAGAVAARTTRPVIGIPVSGGLMGGMDAMLATVQMPPGFPVATVAMDKAGARNAAWLAAQILAVADPALDEKIRAARAKMQ
GUT_GENOME238265_000534-126VAVIAGSASDQAIIDKAVSALESYKISYEVKILSAHRDAEALDEYVKASDAEIFICIAGMSAALPGVVAARTKKPVIGVPVSGKIAGGLDALLSIAQMPKGVPVACMAVDGGENAGHFAARIL
GUT_GENOME213102_016822-149KVGLVMGSDSDIPVMKKALEVFHDFDVECEVLLASAHRTPAAVQEFASKARDRGLSCIVAGAGMAAHLPGVIASFTTLPVIGVPLESGSLKGMDALLAIVQMPSGMPVATVAINGAKNAALLAVQIGATSDPELAAKYKAYREKMSRD
GUT_GENOME044882_00538102-247IFLEGQDIYINYSNDQTHVFMKDLNLPLPPVQDPRRALDGELYQAITFLAKENEHLLLDRAPHLKTTRWHPTFLDVIPATGGKDKGMDAMLATVQMPPGFPVATVAMDKAGARNAAWLAAQILAVADPALDEKIRAARAKMQAGVE
GUT_GENOME212832_011104-145IRVIMGSASDVEIAKKVTKILKKFDIDYEVSVISAHRALNVLEDTMAKDDAKVYIGIAGMAAHLAGVMAGMTIKPVIGLPVGGTNTAGLDALLSMVQMPKGVPVATVAINGGDNAALLAIQILALADENLAQKLKDYKKEML
GUT_GENOME259564_0211647-195AKVGIVMGSDSDMPVMAKAADILEKLGIDYEMTIISAHREPDVFFEYAKSAEAKGFKVIIAGAGMAAHLPGMCAAIFPMPVIGIPMHTTSLGGRDSLYSIVQMPSGIPVATVAINGGANAGLLAAKILATSDDAILQKLKEYSKELKEQ
GUT_GENOME204943_007112-145GSASDWETMKHACEMLDQFEVPYMKQVISAHRTPELMGEFAHNARANGLKIIIAGAGGAAHLPGMVAAQTTLPVIGVPVRSHALSGWDSLLSIVQMPGGIPVATTAVGNSGATNAGLLAVSILSTTDERLAKALQDYRDSLKEK
GUT_GENOME129035_0004146-192PIVGIVMGSNSDWEVMKNAAEMLKAFDIPFEARVVSAHRMPDEMFDYAETARDRGIRVIIAGAGGAAHLPGMIAAKCTIPVCGVPVPSKYLRGQDSLYSIVQMPKGVPVATFAIGEAGAANAALYAAQILSVTDAGLAQKLADFRAA
GUT_GENOME103718_01819153-258REIGATVDRVDDVGVANLDRILDQRDRIREADAVVVAAGREGALPTVVAGLVAAPVIALPVSTGYGVGGEGVAALEGALQSCSVLTTVNVDAGFVAGAQAGLIARA
GUT_GENOME147550_02075443-586VGLVMGSDSDWATMSAAAQALEELGIPYEADVVSAHRMPTEMLEYGRTAHERGLRVVIAGAGGAAHLPGMLASVTPLPVIGVPVSLKNLDGMDSLLSIVQMPAGVPVATVSVDGARNAGLLAARVLASSPDLSGTELRGRLQEF
GUT_GENOME105688_0055651-184VASAHRTHNRIKDIMTNYVDGIEVFIGIAGLSAHLPGVIASYTTKPVIAVPVNGKIEGLDALLSCTEMQLGTPVATMGIDRGENAAWLACQIIACNDEKMREALVDKRNSYNHKMETSEKELIEKIAGKYYTRT
GUT_GENOME236868_022909-148VGIITGSASDKPIVDKVTGILDEFGVAWEYNVLSAHRTPNKTAKYAQEAESRGVKVLIGIAGLAAALPGTLAAHTTLPVIGVPGDGGPLSGVDALHSIVQMPSGIPVATVGIGNGKNAAYLAVSILALLNSEIRQKLADY
GUT_GENOME030718_00009118-244RVAVISAGTADGFVAWEAARTLAYLGIGHKLFEDCGVAGLWRLAERLEEINTFDAVIVVAGLDAALVSVMGGLTPKPIFGVPTSVGYGAAQGGRAALASMLSSCAPGVGIMNIDNGYGAACAAARVK
GUT_GENOME195308_0137843-192KVAVIMGSDSDWPVVKGACQQLETFGIPYEAHILSAHRTPAAAADFARSARANGFGVLICAAGMAAHLAGAFAGNSTLPVIGIPMKGGAADGLDALLATVQMPSGIPVATVAINGAKNAAVLAAQILAVSDDALAAKLDAQRSEMAEQIA
GUT_GENOME096459_01005385-506LPVAREALLTARYLGRRAELVADVGIAGVHRTLDRLDTFRAAGVVVVVAGMDGALPGLIAGLVSAPVVAVPTSVGYGAAFGGLAPLLTMLNACAPGIGVVNIDNGYGGGHLAAQITAPREGD
GUT_GENOME242874_0006152-197MQKVAVVMGSTSDWPTMQQVTAQLTSLAIPYEKRVISAHRMPDELADFGKQAVALGFGAIIAGAGGAAHLPGMLAANTLLPVIGVPIKTRTLNGVDSLLSIVQMPGGVPVGTMAIGDAGAVNAALFAAAILALNDSELADRLQAFR
GUT_GENOME060185_00787137-266KVGIITAGTSDINIAEEARVIVEEGGCEAITSYDIGVAGIHRLFPQIAHMIKEGVEVLIVCAGMEGALPSVVAGLVDIPVIGVPTSVGYGVGEGGVVALNAMLQSCAPGIAVVNIDNGFGAGVFALTIIK
GUT_GENOME096388_0148343-191LVVSIVMGSSSDWPTMKKAKDMLDYFGIETDVEVVSAHRTPKNMYEFATNAKEKGTKVIIAGAGGAAHLPGMIASMTNVPVIGVPIQTKALSGIDSLYSIVQMPGGVPVATMAIGEAGATNAGIYAAKMIAIEDESVYDKLEAYTKSLE
GUT_GENOME176881_003313-145VSIIMGSLSDSPIADKVVSKLMEFGIDYEVKVISAHRALKSLEKYVKESEDESEVYIGIAGKAAHLSGVIAALTTRPVIGIPAKSSHLGGLEALLSTVEMPSGVPVATVAIGGGENAAILATEILAIKYDSLREKLKDMRIKM
GUT_GENOME007461_01345121-247VAIVTAGSSDLGVALEALYTLEACGVAGELVTDVGVAGLERLLARIGRLRSADLCIVVAGMEGALPSVVGGLLPTPVIAVPTSVGYGAAKEGFTALCGMLSSCASGVTVVNIDNGFGAACAAARILY
GUT_GENOME233284_008106-156VAILMGSDSDLPLVENTIKVLKDFNIKVEVKVTSAHRTPEVTAQFVKDADKRGCKVFICAAGLAAHLAGAVAANTIKPVIGIPVDCGPLKGVDALYSTVMMPGGIPVATVAVGSAGAKNAGFLAAQIMALSDEELAKAVYENRQKAAEGVI
GUT_GENOME177425_01010121-248VAVVTAGAADESVAREAVATLEFLGIPVKAYYDRGVAGIHRLLAVCKELKDAQVCIAIAGMEGALPTVISGLISAPVIGVPTSVGYGVAQNGMTALFGMLTSCADKVAVVNIDNGYGAARVAAAICRE
GUT_GENOME282471_0123645-194KKIGIVMGSDSDLAIAQKAADTLTELGVPFEVHVYSAHRTPEAAAAFAKTAREKEFGAIIAFAGKAAHLAGALAANTTLPVIGVPVKSTALDGIDALLSTVQMPSGIPVATVAIDGAVNAALLSVQMLAIEDKEAAAKLDEKRAKDAAKV
GUT_GENOME215114_0007519-172MPEIAIIMGSKSDLISLKGAFDTLNSFNVGFTARVMSAHRTPDAAADFAKNAEAEGYKVIICAAGMAAHLAGVIAAHTTLPVIGIPMACEPFNGFDALLAMVQMPPGIPVATVTAGKAGAKNAALLAISMLALSDSGLAEKLKAFREEQSRKVE
GUT_GENOME283003_013054-143VFIIMGSKSDFPIAQKAISVLNKYGVAYDIAVASAHRTPKRVEDLVSGSDADVFISIAGLSAALPGVIASFTTKPVIGVPASGSLNYDALLSIVQMPPGIPVAAVGMDRGDNAATLAVEMLALSDKGLAEKLCEDRKAMA
GUT_GENOME213442_0105758-200EEKALVGIIMGSESDMPNMEPCMKQLEEFGIPYEVKVASAHRKPAEVHEWASGAEERGMRVIVAAAGKAAHLGGVVAAYTPLPVIAVPMKTSDLGGLDSLLSMVQMPSGVPVACVAINGAKNAAILAVQMLGTGCDECRKKIS
GUT_GENOME232032_0357386-213AVIGVISGGASDRRVCEEIQLTLNFHGIPCQLHQDLGVAGLWRLQQHLAQLQRYKIIIAVAGMEAALPTVVAGLVRAPVIAVPTSVGYGISAGGQVALNACLASCAGGVMTMNIDNGYGAATAAIRIY
GUT_GENOME132203_01788120-249RIGILAAGTSDARVADEARVVAESMGVEVMAAYDVGIAAFHRFLDPLVGMLDSGVDALVVAAGMEGALASVVSSLSDVPVIGVPTSVGYGAGAEGQAALLSMLQSCSPGLAVVNIDNGVGAGATAALISI
GUT_GENOME133804_002632-126KAYILMGSESDMPHAEKIARGLEAEGVAFEYFVRSAHKDPVGVFNFVKARDGESDVLFITIAGRSNALSGVVAANTNKPVLACPPFKDKLDFLVNINSTLQMPSKTPVLTVLDPGNCALAAARIL
GUT_GENOME051415_010201-157MKKIGIVMGSDSDLPVVEKAMATLEKFGVPFEAHVYSAHRTPVEASEFAKGARANGFGAIIAAAGKAAHPGSFVALFGENPDQKDLDYMSLEKHVTADTPPCFLWQTATDESVPVENSYLFAKACKEAGVPYAHVFSDGVHGMSVATEDWLEERGRE
GUT_GENOME246899_007165-154VGIIIGSDSDLGVMSKTAEILDKLEIPYEFSIISAHRTPDRMYEYARQARKRGLKAIIAGAGGAAHVPGMVAAMTTVPVIGVPIQAKALDGMDALFSICQMPPGIPVATVAINGALNAGILAAKIIATNDEALADRLEAYATNMKETVEK
GUT_GENOME176226_012801-122MKVIVIYGSPNDKPFMEPAREYFAQENVPYEETVLSAHRDLPELIRFLGDLEASGEKAVILAVAGLSAALPGVVVMSCSLPVIGVPDPGGPLSGLAALLAIAQCPGGVPCTTVGLLLHWNSY
GUT_GENOME030135_0179770-200MKDQVDALKTMGVKARLINSSLSNSEYSEVLEEIENDTDVIIGAAGKAAHLPGVIASHTLIPVVGLPIKSSTMDGLDSLLSIVQMPPGIPVATVTIDSGINAALMALQIISIKYPKVKEDLVLYRKEMEEK
GUT_GENOME096430_01661812-909GTHVVRHDDVAGAGFRALLAEPSQLAEADCLVVVAGLDAGVAATLAERTDVPVVAVPTSTGQPQALGGLGALMTMLHSTVPGVAVSTIDSGYSAGVLA
GUT_GENOME046150_007425-149KIGIILGSASDAPHAQKIGATLRELGIPFEVTVASAHRTPEDAAHYAQNARPRGLEAIVAVAGLSAALPGAVAAQTTLPVIGVPVESGTLLGMDALLSTAMMPPGVPVASVGINGAANAALLAARIVSTHDSAVAEKLQSYADEK
GUT_GENOME201212_0001097-245RSQAGDLNGNENSWLYTPEQIGVHSQVYRFAFAKQYGIYEKNKIYFHEGLSLQENIVPILTVQLTEERKKDAFRLELTYKGKNSGIVRVNRPLIELNFSGVDSLYSIVQMPGGVPVATMAIGEAGATNAALFALRLLSVEDKAIAEALA
GUT_GENOME118381_00385116-256ESVGKIVVLAAGTSDLPVAEEAAVTAELYGNCVERVYDVGVAGLHRLLGRLPVLEDANVVVAVAGMEGAIVSVVGGLVACPVIGVPTSVGYGANFGGAAALLSMLNSCASGVSVVNIDNGFGAGVLASKINRQSKDRRERM
GUT_GENOME039190_0147547-162AMPWDDMTVAGERLQELRRPFAAAGMAAHLCGVIASMTTVPVIGVPINSTLDGLDALLAIVQMPPGIPVATVGINGALNAGILAVQMLAVGDEQLQEKLSAYKEDLKKKIVKANEE
GUT_GENOME239432_0133685-235PLVSIIMGSTSDLPVMEKACKWLEEQEIPFEVNALSAHRTPEAVEKFAREAKGRGLRVIIAGAGMAAALPGVIAASTTLPVIGVPIKGMLDGLDAMLSIIQMPPGIPVATVGVNGAQNAAVLAAEMMALADEEIARKVENWKAGLGKKIEK
GUT_GENOME206139_01165131-242VAEEAALTAEFLGSRVVRYRDCGVAGLHRLVSHLDSIRQATALVAVAGMEGALPSVLAGLVKAPVIAVPTSVGYGANFRGVTTPSLSRNKALPAIYLPFNQFANQMPHCNVH
GUT_GENOME283701_012819-192LGSASDFKIAEKATTILEKLEIYYDLRVASAHRTHEKVKQIVTSAAENGVEVFIGIAGLSAHLPGIIAGITHKPVIGVPVDVKVAGLDALFASVQMPLGAPVATVGVDRGENAAILAAQIIGIHDAGVRAKLASFRQEFYSKIADDEEKLFLQMKGKYYSKIETDLVEAPNEDSPVENGLNIHP
GUT_GENOME178185_010741-190MKKVGVLMGSDSDLPIVRKALDTLRAFGVPYEVHVYSAHRTPAQAQEFCVSARDQGFGVLICAAGMAAHLAGAAAANTILCGLNLLCLTRGQKVLGHAPQFTAYIDHVNCSGSVYECYYLNRERGYRFEVEGYLERMGPQYDLFIVENPNNPTGQILALEEPGLADKLAARRTADREKVLEKNQAVEQEF
GUT_GENOME127584_00424305-409IPVGAEAYGALRFWGHDCGFLTDVGVAGLHRLAPHIRDLRAARLIIAVAGMEGALPGVLAGLVPCPVLAVPTSVGYGVGAGGMAALHSMLCSCVPGLAVLNIDNG
GUT_GENOME239364_0028230-179KVSIIMGSDSDWPILKPAYELLKKFDIEAEVIVASAHRTPARVKEIVTTAPERGVCAFISAAGAAAHLGGVIASYTTLPVIGVPINATALNGMDALLSTVQMPSGIPVATMAINGAKNAAIFVVEMFAISNPELAQKLKDYRQNMVEEVE
GUT_GENOME103518_011741-167MGSDSDWPTVEPAVEALDAFGIPWEVGVYSAHRTPQRMLDYANNTSSQQASILRAINRHSCYRHSRRHLHDRQQRIHSLQVLQSTRDTNNRQRRGCRHHARQMCSTTGTSNNALQPPPRGLLAVIQHALRSTVASIIGAADAEVRAKMEAFMAGMEEEVVGKHAALQ
GUT_GENOME198457_00202109-244GLITIAVSPVAAMQPAQEAAETCQMFGAVTECIYNLDASCIGRIYADEKIMQSSAIIAVIGTGDALASILANITQKPVIAVPTDTTSDSCVNGMAALLASAQAQTPGIVTVGINYGFAAGYFASLINTAIEDGRKT
GUT_GENOME157468_0180921-170KVAIFFGSKSDTEKMKKAAEVLKEFGVEYKAFVISAHRAGELLHKTVKQCEDDGFELIIAGAGLAAALPGVIAGITTLPVIGVPLECISAGSNGLGGMDALLSIVQMPPQIPVATVGIGNAKNAAYLAVQILSIKYPELKEKLLAFRKKL
GUT_GENOME206907_006034-153KVMIILGSGSDIAIAEKSMKILEKLEIPYSLKIASAHRTPDLVRELVVQGTNAGIKVFIGIAGLAAHLPGAIAAYTHKPVIGVPVNVNVDGLDALYSSVQMPYPSPVATVGIDRGDNGAILAAQILGLYDEEIRKKVLELKEEYKQKVIK
GUT_GENOME082152_0038055-207KKPVVSILMGSRSDLPTMEACFAQLKEFDIPFEAHALSAHRTPNEVISLAESAKDRGIKVIIAAAGGAAHLGGVIASSTTLPVIGVPIQTSALGGMDSLLSTVQMPGGIPVATVAIGKAGAKNAAILAAQMIALQDEKLAEKLVEFKKAMAEK
GUT_GENOME103718_0107826-181DVGIIMGSDSDLPVMEGAYDALDELGFAEQTDFDEAPDARFTYESYVVSAHRTPELMYAYGETAAARGLDVVIAGAGGKSADLPNMTASLAYPIPVIGVPVQEKSVNSVIGMPTGAPIVAVDAGKSFNAALSAAQVLAREHDAIEERLVEYHEGLK
GUT_GENOME106866_0081517-162KVAIIMGSTSDLSKVEPAITILKDYGVEVNVRCLSAHRAHLGLSSFIKETETDGTEVIITAAGMAAALPGVVASQTVLPVIGVPISGATLDGMDALLSIVQMPSGIPVATVAINGSKNAAYLALQIMAIKHDEIKEKLIAYRKDME