UHGP-MC 107471


Information


Number of sequences (UHGP-50):
192
Average sequence length:
94±8 aa
Average transmembrane regions:
0
Low complexity (%):
2.95
Coiled coils (%):
0
Disordered domains (%):
0.16

Pfam dominant architecture:
PF02803
Pfam % dominant architecture:
84.90
Pfam overlap:
0.75
Pfam overlap type:
reduced

Downloads

Seeds:
MC107471.fasta
Seeds (0.60 cdhit):
MC107471_cdhit.fasta
MSA:
MC107471_msa.fasta
HMM model:
MC107471.hmm

Sequences list (filtered 60 P.I.)

Protein Range AA
GUT_GENOME002254_03146 303-400 LNKCDMKLDDIDCLEINEAFAAQALGCYKLLANKYNTTIEKIVNKTNLKGSGIGLGHPLGSTGARIVGTLSHTMKELGSEYGVASLCIGGGMGAAVLV
GUT_GENOME174497_01943 295-393 AVVDAYKRSGLTVEDIDVFETHDCFTSSEYAAISAFGITEPGKEYEAVEEGRIAFDGDKPINPSGGLIGCGHPVGASGVRMFLDIYKQVSGTAGTYQIK
GUT_GENOME080434_02236 294-374 LKKQALTIEDIGIFEINEAFAASSIVVERELGLDPKKVNRYGGGISLGHAIGATGARIATTVAYQLKDTQERYGIASLCVG
GUT_GENOME253011_02207 281-376 LDKLFEKTGRKPEEIAAYECNEAFAVIDELFAKKYPACVPAYNACGGALAYGHPYGATGGILTLHLAKRLQQCAEDNVYGVSAIAAAGGQGTAILW
GUT_GENOME000530_00809 301-386 RALDQAGLDITDIDLVEVNEAFAPVPLIFMDEFGYPEDQLNVNGGSIAIGHPLGSTGARLLTQLTDELHRRQARYGLLTVCEGGGM
GUT_GENOME096560_02390 290-376 LKQQKLTVEDIDLFEINEAFASVVIASANQLSISEDKLNVNGGAIAIGHPLGGTGFRLVLTLAYELRRRGGGKGIAALCGGGGQGIA
GUT_GENOME077876_00638 318-412 VKKLLTQQKMDITDIDIFELNEAFAAQSIACIRQLGINDMDKVNPKGGAIALGHPVGASGSRILTTLIYELIENKGNYGIASLCIGGGMGAATLI
GUT_GENOME133148_00173 278-389 MGMGAYLAIKKLIDKNNISFSDIDLFEINEAFAVQTIAVINELSQNYKYKREELISKVNISGGAIALGHPLGASGARVITTLITNLKRRNMKYGVASLCIGGGMGIAILIKN
GUT_GENOME215875_02654 291-379 KALQRAGLTVKDLDVVELNEAFAPQVVPCVRDLGLDPERTNPNGGAIAFGHPLGGTGVILTVKLLYEMLRRDYELGLVTMCIGGGQGLA
GUT_GENOME207996_03262 314-403 LKKAGLTLADIDVIELNEAFAAQALACLKGMGLSDCYDDKVNLHGGAIALGHPLGCSGTRIINSLLTVMEQRDAQFGLATMCIGFGQGIA
GUT_GENOME145053_02257 370-479 MLLGPAWSTPLALERAGLTMSDLTLIDMHEAFAAQTLANIQLLGSERFAREVLGRAHATGEVDDSKFNVLGGSIAYGHPFAATGARMITQTLHELHRRGGGFGLVTACAA
GUT_GENOME021289_04129 298-395 MGIGPVYAIPKLLKRFGLTVADIDLWEINEAFACQVVHCRDFLQIPNDRLNVNGGAIAIGHPFGMSGARMVGHSLLEGRRRGARFVVVAMCIGGGMGA
GUT_GENOME096513_03440 286-368 IGPVPAVNRLLARSRLTAEDIDIVEFNEAFASQVLASLQQLSIPEEIVNIGGGAIALGHPYGASGAILVTRLFSEMTRMHGGR
GUT_GENOME087805_02249 285-379 ALDDAGLGLDQIDALEFNEAFAAQVLACADALDLDEHRLCPQGGAIALGHPWGASGAILLVRLFSRLVREGAGRYGLAAISIGGGQGSAMVVEAV
GUT_GENOME243327_02685 311-423 YAIPRLLARHSLTYADIQLWEIHEAFAAQVLAHIKALESPEFIRDKAGVKAEFGKFPRERMNPNGGSTALGHPFGATGARILSQAVKELAAMPKGSRAIVSICADGGQGTVAL
GUT_GENOME096469_01624 306-393 EACRKEGIEPGDLDLVEINEAFAAVGVASARALGLSDDKVNVNGGAIAVGHPIGMSGARITLHVAKELARRGGGVGAASLCGGGGQGD
GUT_GENOME212019_02316 291-382 KVLQRAGLGIADIDLVEINEAFASQSIACIRELGLDMDRINLDGGALAIGHPLGATGARITGKAAALLRRTGGRYAIATQCIAGGQGVATLL
GUT_GENOME176457_03203 289-395 MGLGPLPAMTCALRRGRFQLSDMARLEINEAFAAQVLGVVKGLAREHQMTAEEIAARLNVNGGAIALGHPLGSSGTRIVVSLLHALRRENKPTGLASLCIGGGMGIA
GUT_GENOME224777_00781 290-394 MGYAPTYAIAKVLERANLTLADIGWVELNEAFAAQAVAVIRDAGLDPERTNPLGGAIALGHPVGASGAIISLRALKNQQEKGIEHSLVTLCIGGGQAIAAIFRAI
GUT_GENOME096270_01775 357-455 MGIGPVAAIPKALKMAGLELSDIGLFELNEAFASQSLQVIRELGLDEEKVNVNGGAIALGHPLGCTGAKLTLTLMHEMKRRNVQFGVVTMCIGGGMGAA
GUT_GENOME098572_00438 288-392 MGIGPVGAVAKALGKAGLTLQDMERVEINEAFAAQYLACEKELKLNRNITNVNGSGISLGHPVGATGTRIITSLVYELLREKKRYGLASACAGGGMGTAVVLEAL
GUT_GENOME145858_01719 407-497 KVLERAGLNINDMDLIELNEAFAAQALGVLRQLGVPDDAAHVNPNGGAIALGHPLGMSGARLALSASLELQRRGGRYALCTMCIGVGQGIA
GUT_GENOME171681_00093 300-398 MERNGLTLDDIDLMEINEAFAAVPLVSCLGILGMDEAAMDAKVNVNGGAVAVGHPIGATGGRILMTLLYELRRSGKRSGVAAICSGMAQGDAVWIELEG
GUT_GENOME192650_03190 291-389 MGYSPKFSSEKLAKKLNLNLSDIDMFEINEAFASQAYAVARDLGLNQEKVNIYGGGISIGHPIGATGCILTVKVLYELMRSDKKDAMISMCIGGGQGIS
GUT_GENOME169434_00916 290-393 MGYAPVYAVQKLLKKVNMTVDDIDLFEINEAFAAQAFACVRDLGLSMDKVNVNGGAIALGHPVGATGARLTINMMNELRRRNCRYGVVTLCIGGGQAIAALVEN
GUT_GENOME151430_00551 295-388 KLMAQTGLSLKDFDLIEINEAFAAQSLACLKVLGLDNEADMARINVNGGAISIGHPNAASGGILVARILHEMKRRNVKRGLVTFCIGGGQGMSL
GUT_GENOME181068_01107 297-400 MGIGPAYAIPKCLKGADMDFNDVDYWEINEAFAAQFLGVGVKLKEEHGWTVDMDKTNANGSGISLGHPIGCTGLRIIVSMLYEMERRGATIGGASLCVGGGQGI
GUT_GENOME140152_00825 269-367 ATEKLLTETHTKISDYDAIEWNEPFAAIDALFNHYYPEEREKFNIFGGALAYGHPYACSGIINILHLMQALKYKNKPMGLTAIAGAGGVGMAISIEYLG
GUT_GENOME096406_01425 308-411 MGIGPAPAIRLLLERSGLTLAEIARFEINEAQGAQVLAVQRELDIDPERLNTCGGAIALGHPLAATGLRLTLTLARQLQANHERYGIAAACVGGGQGMAVLIEN
GUT_GENOME232298_01465 276-378 LGPIAATQKLLDKYALTMSDIAVVELNEAFAVQSILCQAALKITDEQLNPLGGALAYGHPYGATGGIMIARLLNSLNRITQPALGIATLCVAGGMGMSMLIGN
GUT_GENOME045582_01286 306-404 AMERAGLTLQDLDVIELNEAFAAQSIAVIKEWEKLGISEEELLPKINPNGGAIAHGHPLGNTGAALTVKCMYEMQRRERARYGMITMCCAGGVGVAAII
GUT_GENOME004249_00698 313-429 MFAGATIASSKALTGAGLTLKDMDLIDIHETSASQMLMNIRLFEDNNFAKKHLNCTACLGEIDQTKLNHLGGSIAFGNPRAVTSLRTVIQSLHALKRQGGGISLVASSGLGGQGAAM
GUT_GENOME278758_01107 335-414 GLSPADIDYWEINEAFSVVGIVNTRLLDIDKDRLNIFGGGVSLGHPLGASGARILVTLLTVLQRKGARTGCAAVCNGGGG
GUT_GENOME098255_00079 282-380 MGYTPYYAVKKLLAKTGRKIDDYDIIELNEAFAAQGVAVARDLHIPMDKLNIMGGAIALGHPLGATGTRLVTSAISGLKNRNGKRALVTLCIGGGQAVA
GUT_GENOME096547_00443 310-399 VTSALTRAGWTVDDLDFLEINEAFAAVVVQSLRDLDYPLDSTNIHGGGLSLGHPIGASGARLVVTAAHELLRRGTGRAAVSLCGGGGQGD
GUT_GENOME178919_02235 281-376 LSPVAAIEKLLRRAALPLTAIDRMEINEAFAVKILACCRQLGYSLEQTNSLGGALAYGHPYGASGAIILLHLIKALQHGGGRFGIAAIGAVGGMGT
GUT_GENOME096371_00270 317-408 EALAKANLIVDDIDVWEINEAFAAVVKHAQQNLGIDYDKLNINGGAMAMGHPLGATGAMITGAAIDELHRTGGRYALISLCVAGGMGVATII
GUT_GENOME096480_01297 309-400 KLLKRFGLSIGEMKRVELNEAFAAQALACAKELGITEEQLNPNGGAIALGHPLGASGARILTTLIHQLWRDGGGWGLATMCVGVGQGISLLV
GUT_GENOME228853_02082 273-372 AAENALDKAGKSIEDMDFIEVNEVTASQVLAAAKNLGISETDVKTKLNVHGGALATGNPWGAAGVILLERLICILKKADKEWGLVLCGAIGGQAIAAVVK
GUT_GENOME147550_02062 313-400 KVLERAGWSVGDLGSVEINEAFATQSLGSIRRLGLDPEIVNRDGGAIALGHPLGSSGSRIVITLMGRMERERTAKGLATMCIGVGQGA
GUT_GENOME186788_01820 418-520 MGIGPIPASRKALSKAGLEVKDLDLIEANEAFAAQFVEVGRELNFDPDKVNVNGGAIALGHPIGASGARILVTLLYALKNRDKKLGLATLCIGGGMGTSAVVE
GUT_GENOME141362_02384 408-519 YAVPRMLKRANLKLQDFDFYEIHEAFASQVLSTLKAWEDEKFCKERLGLDAPLGSIDRSKLNVNGSSLGAGHPFAATGGRILATAAKLINEKGSGRALISICTAGGEGVVAI
GUT_GENOME141728_05738 335-420 LEKTGKKIEDIDLFEINEAFAAVAIASTEIAGIDPEKLNVNGGAVAMGHPIGASGARIIVTLIHALKQRGGGIGIASICSGGGQGD
GUT_GENOME096467_02572 297-400 GLGPVSAVTRVLERAGRPLTSVDVLEVTEAFAAQVLGCTDRLGVAVEPSGAGPVVCPDGGALALGHPWGASAAALLVRLFDRLVRRGLGTVGVATCGIGGGQGL
GUT_GENOME255009_01648 261-348 KLLGQASLSMADVDLYEINESFAAQGLATIAALGIPPERVNVNGGNLALGYPVGATGLRMDVTLLWEMSRRNARYGISVICAGGHMAQ
GUT_GENOME096430_03686 296-399 MGIGPVPATAKALALAGVELADVDLIELNEAFASQVLAVAREWKFTDADWERTNVNGSGISLGHPVGATGGRILATLLREMDRREARYGLETMCIGGGQGLAAL
GUT_GENOME257704_02092 288-381 LALERADLRLDEIDRIEFMEAFAVTIAKFVRDYRPDLERLNVGGGHLAKGHPLGASGAILLSALLDALDACRGRFGLVVATAASGIGCAMVVER
GUT_GENOME141726_04645 297-385 KGLEKVDWSLEDADLLEINEAFAAQYLAVEKELGLDREKVNVNGSGVGLGHPIGCTGARITVSLIHELKRRGLEKGIASLCVGGGIGVA
GUT_GENOME143421_00138 283-380 MGVSPVNAVRQLLEDNQMQLNDIDLFEINEAFAATSLAVANELALPEEKVNIKGGGIALGHPIGASGARIITTLAHSLKAEGKQYGIASLCVGGGLGV
GUT_GENOME046757_00741 282-376 GAMRTADALLLRQGLSYENLAAIEFNEAFAVIDVLFERAHPGCLDRYNRLGGALAYGHPYGASGAILALHLLQSLKLAGGGLGLLSIAGAGGMGE
GUT_GENOME096544_03548 291-391 MGIGPVPATHKALERAGLSIDDIGLFEVNEAFAVQVLAFLDAFDIKDDDPRVNPYGGAIAVGHPLASSGVRLMTQLARQFAERPDVRYGLTTMCVGLGQGG
GUT_GENOME095952_01683 276-374 IGVAPVQAIEKLLKEVSLSYQEIDRFEINEAFSAQILACIQLGSLPLEKINVSGGALAFGHPFGATVAILVRRLMTELTECGGTYGIVSLCVGGGQGTA
GUT_GENOME157750_01519 308-407 MGLGPVYAVPKALDNAGLTMDDIQLIELNEAFAAQSLGCIKLLGWEDKMDIINVNGGAIALGHPVGSSGCRIIVSLVHEMKRRGLKYGLATLCIAGGMGQ
GUT_GENOME000530_01814 329-421 ALSRAGIGWKDVAAVELNEAFAAQSLACTDAWGVDPEIVNAWGGAIAIGHPLGASGTRVLGTLARRLEASGERWGVAALCIGVGQGIAVVLEN
GUT_GENOME216940_00945 297-394 IANALKNAGIKLSDIDVIELNEAFAAQSLGVIHELIAQHGVTREWINERTNINGGAIALGHPIGASGNRIVVSLVHEMINNEAKLGLASLCIGGGMGT
GUT_GENOME243472_01032 293-383 VQQVLAQAHLTMDHIDVVELNEAFAAQALIVQQTLQIPEEKLNPFGGATALGHPLAATGTRMITTLLNIMQQQQLNNGIATLCIGGGQGIA
GUT_GENOME103718_03323 289-374 LDRAGRTIDDYDLVELNEAFASQCEYSRRELGVDEESYNVNGGAIAIGHPLGASGARLPVTLIHEMQKRNADRGLATLCVGFGQGA
GUT_GENOME186229_00712 299-389 MQKALDKAGMGIRGMDLIELNEAFAAQVIACHRKMPFEMERLNIHGGAISLGHPIGASGAKILTTLIYSLIHQNKEVGMASACIGGGQGVA
GUT_GENOME285947_01217 306-392 LKDAGIGAGELKQVKTHNPFIVNDIYLGKEMGIDQKHINNYGSPMIFGHPQAPTAGRAIIELIEALAVQGGGYGAFTGCAAGDLGAS
GUT_GENOME103703_00998 276-368 KALEFAGKAACDVDIIQMNELTAAQSIALIKGLGITAEEAAKKVNPYGGALATGNAWGASGCVEFHRLLCALRMQHREWGLALCGAEGGQALA
GUT_GENOME236429_01722 313-386 AIRKLLERNALTAERITAWEYNEAFAVIDEIMVRSFGEHSDRYNIYGGALAYGHPYGASGAIITLHLLQALRQL
GUT_GENOME010345_01275 286-381 LTAITAIERLLAKADICPEQIAAWECNEAFAVIDVLFERHFPKQVKDYNIFGGALAYGHPYGASGGMILLHLLKALQNRNGKYGICSIAAAGGVGT
GUT_GENOME096469_02951 289-389 MGYTPTFALKKLFEQTGLTPATVDCIELNEAFAAQACAVVRDTELDMDKVNPYGGAIALGHPVGATGAILTVRLAKHLVRNDLETGIVTMCIGGGQALAAL
GUT_GENOME096544_02767 317-405 QALARARLTIDDMDLVEINEAFAAQVLPSARQLGVDPERLNAHGGAIAVGHPFGMTGARITGTLLNGLAAVDGRYGLETMCVGGGQGMA
GUT_GENOME250014_01895 301-388 IPKALEKAGLTADAIDYYEINEAFAAQFLACNRELKLSMDKVNRNGSGIGLGHPVGMTGARIITACISEMIASGEQYGVASLCVGGGP
GUT_GENOME147382_02515 299-399 IEQLLSQLDWSIDEIDLWEINEAFAVVTQIAVAELGLDSSKVNIKGGACALGHPIGASGARILVTLIHSLRQLQALGIDGDASNKKVMRGVASLCIGGGEA
GUT_GENOME096509_00393 289-387 MGLGPVSAIQQALEKANMRISDLDLIEINEAFAAQYLGCQKLLDFDPKLGNVNGGAVALGHPLGASGTRISLSLLYELRRRGKKYGASSLCIGGGQGIA
GUT_GENOME041001_00375 300-388 KLLQQAGLHFSDMDSIEYNEAFAVISALFARQHPEALDAYLPLGGALSYGHPYGASGAILTLHALAHLTRKKGRFGLIGAGVMMNYMDL
GUT_GENOME038215_00960 298-396 MGLGPAYAIEKLFNETGISREDVDLYEINEAFASQSVACLRRLKLDERRVNPRGGALALGHPVGCSGARILVTLIHEMQDLDAQTGIASLCIGGGMGVA
GUT_GENOME000598_03067 254-352 LGAIAASQKLLQSRNLNINQADLVEINEAFSVKVLAFLKYFNCQKEKVNIYGGALAYGHPYGASGAIIMLHLMEALKDQNKKIGLTTLGVAGGLGEAAI
GUT_GENOME222909_00382 297-394 KAMKLAGLQLSDLEVIECNEAFAAQNLSVIKEMESRTGEKIDQSKWNPNGGAVAIGHPNGASGARVAMFAMRQLEQTGGRYGLFSSCCGGGHGTTTII
GUT_GENOME262669_01048 302-403 ALKRAGLRIEDIGLIEINEAFAAQSLAVIQELEIVASHEFEVLADSGEADIVYLNVNGGAIALGHPIGCSGARILVTLLHEMQKRQVRYGLATLCIAGGLGS
GUT_GENOME286415_00376 301-396 KATKRAGIGLADLKVIEINEAFAAMPLVSTKILADGDEEKMEKLREITNVNGGAIALGHPVGASGARVIMTAMYELIRRGGGYGVAAICGGLAQGN
GUT_GENOME141762_00035 297-381 KVLEKAGMKIGDIDIVEINEAFASVVLSWARVHEPDMDRVNVNGGAIALGHPVGCTGSRLITTALHELERTDQSLALITMCAGGA
GUT_GENOME000445_04389 318-419 LQRHGLGLNDLDLWEINEAFAAQVLGCLAAWQDEAYCQEHFGTPAWGALDTARLNVDGGAIAIGHPVGASGARIVLHLLEALKRRGARRGMAAICIGGGLGG
GUT_GENOME186905_01575 285-382 MGIGPAKATKKLLERNKMTLNDIDLIELNEAFASQSLACLKELDLNMDKVNVNGGAIALGHPFGATGALLLTKLVYEMQRTDAENGIVTFCIGGGQGY
GUT_GENOME096235_00271 309-428 IKKCLDEAKLTMDDLKVIEINEAFACVPLVSLKLLANERFLTGNYKAMVREASAQPILDHDPDRYLKLKEKLNPNGSAIAVGHPNTASGARLMMTAAYQLREAGGGYAACAICGGLTQGA
GUT_GENOME000931_04824 312-409 EVLKNHRMTLEDIDILEINEAFAGQTLGCLRELGNEPGTELYNRLNPNGGAVALGHPLGMSGARIITTICYEFKNHPEKRYAIASACIGGGQGIAMLL
GUT_GENOME182582_02343 291-375 INKLLKKANRTIDEIDTFQINEAFAAASVAVQKELKLSDQKINSYGGAIALGHPIGASGTRIVTTLISELIQEEKQRGVASLCIG
GUT_GENOME091039_01424 317-417 MGLGPIYAVPKALKRAGKTIDDMDVIELNEAFAAQAIPCIKVLKMNPEKVNPYGGAMALGHPMGATGAFLTCKALDYLQDNKKTTALVTMCIGGGMGAAAV
GUT_GENOME034618_00530 289-386 MGTGPIPAIRKLLQKTGLSAADVGLYELNEAFAAQALVCIRELGLDMDTVNVNGSGISIGHPIGCTGAMITIKLMNEMMRRHVRYGIASLCIGGGQGL
GUT_GENOME000159_00650 296-387 KLLDKTGVSLDQVELIELNEAFASQSIACIRALGLDPARVNPNGGAIALGHPLGATGAVLTTKAAYAMQNAEREYCIVSMCVGGGEGAAALF
GUT_GENOME193126_01206 273-373 FGLGPVYATEKLLAKQGLAISEIDRIELNEAFASQALACIQTAGWQEARVNRSGGALAFGHPFGATGTILVRRLMTELEKRPTLKTGLVTMCVGGGQGTSL
GUT_GENOME141959_00240 553-644 RLLKRAGMQIGDIDLFEVNEAFASVVMSYMRHFDLPHDKVNVAGGAIALGHPLGATGAMLLGTVLDEMERRDLNTAVITLCAAAGMATATLI
GUT_GENOME243389_00670 293-384 IPATQRVLAKTDMTLEQIAHYEINEAFASVPLAWQKTMKADGARLNPRGGAIALGHPLGASGVRLMTTMLHALEDSGERFGLQSMCEAGGMA
GUT_GENOME029599_00550 349-448 AWQSTEAILKRNHLTMDDMDAVEWNEAFAVIDVLFERAYKEHVHKYNQLGGALAYGHAYGASGAVNLLHLMAALKHCDGHYGVTAIAGAGGTGVAMLIER
GUT_GENOME243573_00712 287-387 MGIGPVPAIRKLLFDNKLSINDVDYFEINEAFSTQALYCIKELGIDNKIVNINGSGVSLGHPVSMTGVRIIMEVLYELERQNKEIGIASLCAGGGPAIAAL
GUT_GENOME096430_02769 286-377 IPATAKVLERSGLKIDDIGVYEVNEAFAPVPMAWAVESGADTGRLNPLGGAIALGHPLGGSGARLMTTLIHRMRNTGSRYGLQTMCEGGGMA
GUT_GENOME095247_02004 323-408 KVLAKAGLTKDDIDLWEINEAFAVVAEKFIRDLDLDRSKVNVNGGSIALGHPIGATGAILIGTVVDELERQGKRYGLVTMCAAGGM
GUT_GENOME095456_00563 288-386 MGLGPIEAVPIALQRASLTLDQIDVIESNEAFAAQACAVSQALGFDPAKVNPNGSGISLGHPVGATGTILTVKALYELTRIDGGYALITMCIGGGQGIA
GUT_GENOME116649_01057 289-384 VGAVQAAEAVLTRCGLHREDIDIFEVNEAFALIGELFARQFPRCVAAYNPFGGALAYGHPYGATGAVLLLHAIEGLRARDGALGCCSIAGAGGLGK
GUT_GENOME140359_00968 289-380 GLGAVKVIGKVLQRAGLSPGDVALWEINEAFASVPIAACREYGLDEEKVNFSGSGCSLGHPIAASGARMVTTLTYELARRGGGIGVAAMCAG
GUT_GENOME000469_03244 290-375 RALERAGLRLRDIDWIEINEAFASVVLAWLKTLDADPAKVNPWGGAIAHGHPLGATGAALMAKMLAGLRASGGQFGLQVMCIGHGM
GUT_GENOME171681_00089 292-394 IGPAWAIRKALDRAGMTLEQMQLVEINEAFAAQVIACERELGLSHDIVNVNGGAVALGHALGNSGLRISITLLAEMKRRGLRYGVSSLCIGAGMGIATVFEAL
GUT_GENOME243887_00358 310-408 MGYAPKHVMERMLNATGSNVDNIDLFEINEAFAAQSIAVSEQLHIDESKLNVRGGAIALGHPLGASGARILTTLIYALRDSNKLNGVAALCVGGGIGVS
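
Each row of the sequence list carries three fields: a protein identifier, a residue range, and the amino-acid segment covering that range. The following is a minimal Python sketch for splitting a row into those fields and checking that the range length equals the segment length. It rests on two assumptions inferred from the rows themselves, not from any database documentation: identifiers follow the pattern GUT_GENOME + 6 digits + "_" + 5 digits, and ranges are 1-based and inclusive. The names `parse_row` and `range_matches_sequence` are hypothetical helpers. The pattern accepts rows with or without whitespace between the columns.

```python
import re
from typing import NamedTuple

# Assumed row layout: <protein id><start>-<end><sequence>, with optional
# whitespace between the fields. The id pattern is inferred from the list.
ROW_RE = re.compile(r"^(GUT_GENOME\d{6}_\d{5})\s*(\d+)-(\d+)\s*([A-Z]+)$")


class Row(NamedTuple):
    protein: str
    start: int
    end: int
    seq: str


def parse_row(line: str) -> Row:
    """Split one row of the sequence list into its three columns."""
    match = ROW_RE.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognised row: {line[:40]!r}")
    protein, start, end, seq = match.groups()
    return Row(protein, int(start), int(end), seq)


def range_matches_sequence(row: Row) -> bool:
    """True if the range (assumed 1-based, inclusive) spans len(seq) residues."""
    return row.end - row.start + 1 == len(row.seq)


if __name__ == "__main__":
    example = (
        "GUT_GENOME080434_02236 294-374 "
        "LKKQALTIEDIGIFEINEAFAASSIVVERELGLDPKKVNRYGGGISLGHAIGATGARIATTVAYQLKDTQERYGIASLCVG"
    )
    row = parse_row(example)
    print(row.protein, row.start, row.end, len(row.seq))
    print(range_matches_sequence(row))
```

A check of this kind is a cheap way to catch rows whose columns were split at the wrong position, because a misplaced boundary changes either the range or the segment length.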