UHGP-MC 116300


Information


Number of sequences (UHGP-50): 149
Average sequence length: 80±7 aa
Average transmembrane regions: 0.01
Low complexity (%): 1.19
Coiled coils (%): 0
Disordered domains (%): 0.53

Pfam dominant architecture: PF00884
Pfam % dominant architecture: 5906
Pfam overlap: 0.1
Pfam overlap type: shifted

Downloads

Seeds: MC116300.fasta
Seeds (0.60 cdhit): MC116300_cdhit.fasta
MSA: MC116300_msa.fasta
HMM model: MC116300.hmm

Sequences list (filtered 60 P.I.)

Protein Range AA
GUT_GENOME010533_01394 386-456 IINPAITKKTNNQAWTTVDLPATILEALGIQFGGQFGLGYSLFADKPTLYAKYGNNFDTELKKMAAEYNEF
GUT_GENOME285906_00048 408-492 RTPYNCIINSPISSEYCKNRKFSVIDLYPTMLAALGAKIDGDQLGLGVNLFSGKKTLIERFGFKKINQEVKKRSELYRCYLAGEQ
GUT_GENOME014276_00728 448-527 ERHIRNIFIGNVPKIPEEKRNQYISAVDMAPTILQAAGAYWGSSKFGLGTSIFSKDKSLIQRLGQKKYNRYMSAPSKMYQ
GUT_GENOME111204_01183 424-498 AFFNTAVAPVEGTTAGRVWTSYDMFPTTLAAMGFEIEGDRLGFGVNMFSGKQTLAEQLGYDYLNAEVNKFSDYYI
GUT_GENOME072096_00128 432-515 NRKCYYAIINSAATPTTQNSRTISTYDLFPTTLASLGVTFNSNRLGLGTNLYTETPTLVEKLGFKKFDKELTRHSNYYDMYILY
GUT_GENOME114931_00004 927-991 NRSFAAMDFYPTVLAACGFTFGGDRLALGTNLFSETETLYEQYGIEEVKAELRKRSDFVIKRIFR
GUT_GENOME113153_01284 396-495 TDFYADITYEKHHGETTRKVYNAIINPAAEPVKEKNRKFTTLDMFPTTLAGLGAQIEGDRLGLGTNLFSERETLAEEYGYETMFAELDKKSVFYNNEILY
GUT_GENOME234170_01226 404-506 FDNDFTQFTRKDPERGIYSVFLNPVRTDLKTRPCGFSAMDVAPTILNAMGVTFSSDDHGKVSHARLGLGRSLFDRGENLVCRFGAEGLTRRLGQYSAFYNTLQ
GUT_GENOME064784_00121 451-541 MDKILVKDVDKVNREAIDVFINSASPYQKINQKRTAATMDLFPTILSSMGYSIKGNRLGLGTNLFSSKKTLSEEMGYDEFNLELGKNSRFY
GUT_GENOME234465_00808 435-519 RTVFSMFVNAQVEKKDYVKTFTTLDMLPSTLAAMGVKIEGDRLGLGTNIFSDRQTLSEELGLGELDNMISQNSDYYTNNILQGSD
GUT_GENOME257394_01389 14-93 ERRVYTAYINAGKAKSDKERKYTTFDDFPTTLAALGVQIEGNRLGLGTNLFSTEKTLTEIFGIEKENSELKHRSKLLDQL
GUT_GENOME193400_00248 438-518 YNTFINPTISTSHSKNRQFTTFDMYPSTLVALGVKIDGDRLGLGTNLFSGKKTLVEQYGGIENLNSELSKRSAYYENKIFT
GUT_GENOME175214_01587 526-599 IINSPIQPQQEKNRSFTTMDMFPTTIASLGATIEGDRLGLGTNLFSGEQTLAEKLTFDQLNDDLSQKSKFFEKM
GUT_GENOME130633_01256 396-473 IINSSATTKYTTNRKFSTLDMFPTTLAALGVNIENDRLALGTNLYSKEKTLIEQKGYKYVLNELDKRSTFYNNNVLAY
GUT_GENOME200682_01506 356-417 IITHFDLFPSLLAALGFTVEGDRLGFGYNVFSQEVQPEENYRDRLRKRVLSHSKFYESLWLP
GUT_GENOME051112_00907 156-238 INMDSTFIEDMDNRKIYNVIINGKTPIYKQNRLASSFDMFPTTLYSLGVEIEGNRLGLGTNLYSSEETILEKYGISKVNEEIK
GUT_GENOME019239_00233 402-488 NRKPINIFINTPVAAVHTQNRTFTPFDIYPTLIESLGAKIDGHRLGLGTSLFSDMPTLTESKMSVKDMDINVRKKSKLYDWMLYGKE
GUT_GENOME051351_01107 357-426 INLPVSTERSFSMLDLGATLLEMLGFVLPQKGLGLGRSLLSSQEPTLIEKYGIEHIHGEILKNSAFYKKL
GUT_GENOME032719_02438 696-777 RTTYNCFINSKVTTDQIKNRQFTHMDMYPTTLAAMGFNIEGNKLALGTNLFSELPTIIEKYGQDYINEEVQKSSEYLDKNIY
GUT_GENOME234653_00615 397-482 RRVFNLFLNAAKQPVRRARRKFATMDIYPTIMDALGVDVEGDRLGLGTSLFSDKPTVLEQIGDTERFNEQMMKRSAVYDWLLYGRE
GUT_GENOME063399_01134 518-605 RNQYNLILNPVQGADSVSESVCRNRQYANFDMFPTILSSMGVQIEGNRLGVGTDLFSGDKTIIEEYGLDTVNLELQKKSDYYNAKILN
GUT_GENOME013445_00691 450-535 DRTIFNVIINGVIEAKNNKNRIFTSMDMYPTILASIGIKIEGERLGIGTNLYSGEPTLAELKGLEAFNDEISRKSDWYNKHILGDD
GUT_GENOME080627_00449 624-706 RVYVTIINAADGCKEKERERGYSTFDLYPTTLAALGAQIEGDQLGLGVNLFSEKETLYEKYGKEYLDGELLKSSQYYEKHFMR
GUT_GENOME102730_01753 385-456 INSNYTENINTNRHFSGVDIFPTVLEGIGADIPNRRLGLGVSLYDNSQQTLIEVLGKKRLDEELYKKSDRYF
GUT_GENOME247483_00481 413-489 CIINAPIQTDHDKKREFNTFDFYPTVLSAMGVYIEGDRLGLGTNLFSGKKTLAEQYGYEKIDTEFSKTSKFYNNYIW
GUT_GENOME247103_00766 68-151 RLVYNCFINPAENVSGLDFSDRRFSQVDMFPTILAAMGFDIRGDRLGLGANLLSDQPTLIEQFGFDWVNTELQKTSVFYERNFY
GUT_GENOME180358_01740 423-498 ANRRVYNCIINSPVSAANTGGREITPFDMFPTTLAAMGCQIQGDRLGLGVNLFSDQPTLAEKMGVEALNREIDRSM
GUT_GENOME194201_01102 393-468 FINSPIKPLKEKNRQFSSFDIFPTLLASMGVKIKGERLGFGTNLFSNKKTILEKMDQKTFDLELRKQSQYYKEHIL
GUT_GENOME074589_00459 423-505 ERGVYYCFLNPAAVPADRTHSACTMDLFPTTLAAMGVEIEGDRLGLGTNLFSDRDTILDEFGKEEVISQLNMKSDFYDNELLY
GUT_GENOME237527_00010 478-567 PAAAGKPPQRTVFSCLVNDAPPSAPPSRERLFASFDWAPTLLEAVGVRWPSRRFGVGVSLYSGEPTLLERVGGATYERESRRVSAVYRRL
GUT_GENOME112022_00092 423-503 RHPYNCFINTVYTEKESLNLKNRVFTSLDMFPTTLSALGYEISGDQLGLGVDLFSSRSTLSEALSFDTLNEQLAKKSDFYI
GUT_GENOME237517_00426 207-291 IKSERTIFNAIYNSQRKVPKNKLKETVSALDIAPTILDLCGASWNREQFGLGISLFSDKKSLMEQYGEEEFNNKIKQNSKFYDKL
GUT_GENOME104352_00296 447-531 RSLYNVFLNSSVDTDCNKNRTFSTFDFFPTTLASLGAEIDGERLGLGTNLFSCKKTLGEIYGNQYINDNLNKTSSFYDDYLLNQK
GUT_GENOME222503_00558 402-478 VYTTFVNAAEQPEIQTTRIYTTLDNFPTTLAAMGVKIEGNRLGLGTNLFSDTPTLAEQYGVKELGQEIGRKSEFMDT
GUT_GENOME007325_00984 518-600 VVNLIVNPAPGVTAEGPVTNRSFTSVDIFPTTLAAMGVGIKGDRLGLGTNMFSLSQTLAEELGVDTLSERLNDSSEFYNAHIK
GUT_GENOME284015_00753 529-619 VYTCYLNSAVKPARDDYREFATIDNFPTTLAALGVKIDGDRLGMGTNLFSDKDTLIEKDGLDYVNTELMKSSDWMDEQSNLKKVTADIHYG
GUT_GENOME271430_00476 395-478 ERNIYNVFINAKTSPSKQYNRLFTNIDMFPTILGSLGVKIECDTLGLGTNLFSDKETLVERYGYLDYYNKISYYSEFYDKEILK
GUT_GENOME118227_00878 415-500 NRTIYNTFINSKVKTKNNKNRLFSSFDMYPTTLASLGVKIDGDKLGLGVNLFSDQKTILEEYGLTYVNNEIKKKSFFYDNVLLGDS
GUT_GENOME022251_00701 514-587 RVFTTIINSPVHPSRNVERRYSTLDLFPTILAALGASIDGNRLGLGVNLYSEQETLVEQLGEPALNHELSQGSI
GUT_GENOME113564_00537 410-492 DKKVVNVIINPAIEAENTKRVYSTMDLYPTTLGALGAKIEGNRLGLGTNLFSNEKTLIEKYGVEYVNNELKKVSRFYDNSILV
GUT_GENOME046337_00117 440-521 GIYTCIINPRDGLAPASGETRQASSFDIFPTTLAALGVQWDRDQLAFGVNLFSGAPTLVEQLGREAFDLQLQMPSGTYLKKF
GUT_GENOME158781_02102 424-511 GEYNRTIFNGVIHAAVEPVQAKNRQFSVMDMFPTTLASLGVTIEGDLLGLGTNLFSDRETLMESMGEEAFSDELERVSNYYNNQFLYP
GUT_GENOME005382_01846 344-423 YNTFINPRVQAEKMTNREFSVLDMFPSTLAALGVEIEGDKLGLGVNLFSNQLTLVEFLGADQLNLEIGMKSNFYEKKFLQ
GUT_GENOME245233_00041 418-493 INAPIVEKNTKERQFTAFDMFPTTLAALGVEIQGDRLGLGVNLYSKNKTLCEKYGLEKFNNELDSRSLFYYRRFIG
GUT_GENOME237957_00596 447-525 RKTYVNFINSAKKRSSETERTFSHFDIFPTILSSLNVSLSSPYLALGVDLYSDEATYLETYGKDYINRQLQGKSQVING
GUT_GENOME232485_00271 428-516 GISMEDRKVYNCFLNAAKAPSISTEGRIFTQLDMFPTILSAMGFEIDGNRLGLGTDLFSETPTLSEELGFDYLSSQLQMQSKFYKENFY
GUT_GENOME203676_00765 1259-1338 YNCFINSAVEPIQPKNRKFAIFDFYPTILASLGVQIKGEHLGLGTNLFSNEKTLLEKYGVKKINSEVTKYSNFYRKVLIN
GUT_GENOME258075_00899 368-445 QKRRIFNMIINPQPGLTAQKHRWTTLDLAPTILEAAGFDCGKLALGRSLWQKEPTLEEKYGLSLDLEFNKSSAFYRQL
GUT_GENOME110428_00501 423-502 ERTIFNTFINSTISPVETQNRVFTQLDLFPTILASMGAEIDGERLGLGTNLFSDTPTLAEELGFQYFYDQLSMNSRYYNN
GUT_GENOME230853_00769 435-512 VVVINSGETYTLDYDRQFTVFDMYPTTLAAIGAEIEGDRLGLGVNLFSDTPTLLERYGYDKLNSELEKKSVYYNTKLK
GUT_GENOME237118_01491 419-495 RKTYNIVIGKNIKEQIIYKPFNQFDWAPTILELAGVKWQNHSFGLGTSLLSSEPTLVEKYGAKILEQELAKNSKVYE
GUT_GENOME244214_01641 518-599 RHTFNLILNAPTAVTANVKTKNRQFGTTDMFPTTLAAMGVQIKGNRLALGTNLLSSEKTLVERDGLKKVNDELSATSKFYNN
GUT_GENOME124925_00957 398-476 AVLNLFINAIPRPEREGERLFSAPDFAPTILEAIGANLPGRRFGIGTSLFSACPTMIEELGEKAYVAELGKTSARYNRF
GUT_GENOME204174_02049 672-757 SNLNDNYKRTVYNTIINADCTYKENVTGNRDFSTMDMFPTTLAALGVQIDGNRLGLGTNLFSGQKTLPEKLGRGYINQELKKNPKK
GUT_GENOME008321_00420 401-478 AFINSKAHTDGNTYRTFNNFDMYPTILASIGASVEGDKLGLGTNLFGTHKTLSEEIGKDKLEKELLKSSKYYEKYILK
GUT_GENOME115568_00740 395-466 NVFIVVNGSQSGSIHKPHSALDIAPTVLELSGAKIDKHGFGLGTSLLSTTPTLTERFGAEKLEIELNKRFYE
GUT_GENOME233291_00662 434-512 RYIYNAFINPRFSRKPTRNYVKNRELTHYDFTALMLDSLGFKVEAFGLGRNPLYGQTLIEKYGLDELNRQLKQKTKFWE
GUT_GENOME063798_01124 422-500 RTYTAYINSAVNPVNNKKRTYTTMDNFPTTLAAMGVKIEGNRLGLGTNLFSEELTLMEDIGESTLKAELKKKSEFLEKI
GUT_GENOME277693_01788 597-683 NSKERRVYNCFINSAVDTSNNKNRKFTSMDLFPTTLAAIGTKIDGDRLGLGTNLYSDKETLAEQYGYEHIYKELSKNSKFYNKDILN
GUT_GENOME066225_02065 434-513 RVYTTYINSAVQADDAAMTRNYTTFDNFPTTLASLGVEIAGDRLGLGTNLFSDTLTLTETYGIQRMSDELQKESKLMQEL
GUT_GENOME265610_01018 419-496 IINPKNKIDELNVNRVCSVVDMFPTTLASLGAEIEGNRLGLGVNLFSNEKTILEKYDYKYVDEELRKTSIFYNNRFIL
GUT_GENOME122055_01622 424-506 RRNFNVLINPARPLPDAERRMSHFDWAPTLLESMGAALPEGKMGFGVSLYAGLPTLVEEMGKDELEAKLTPFSPFYLSELMGA
GUT_GENOME237875_00744 761-844 RKTYVCIINSQKEETGKRRTFATIDLFPTTLAALGAEIEGDRLGLGTDLYSDTPTLVELRGEDWMNSQFAMNSDFYKQKILYGN
GUT_GENOME266286_01774 444-527 DRSIYNCIINPACEPSNDHERIFTTMDMFPTTLAAMGVTIDGDRLGLGTNLFGDQQTLTEEFGLSFEDDELMKNSDFYNKVLLY
GUT_GENOME000001_00443 529-607 ILNAPITTDNVKNREFAPFDMFPTILASMDVQFEGDKLGLGTNLFSDKKTLIEEEGLESLQDGLERNSNYFNDKFISEK
GUT_GENOME256138_01189 409-488 CIINPAVENPTAYTKNRLFTTLDMFPTTLAALGVTFAGDRLALGTNLFSGVPTLAEEMGLDELSEQLSRNSNYYNYHFLY
GUT_GENOME178967_01169 530-590 NREFSAFDMFPTILTAIGAEVKDDRLGLGVNMFSGKQTLVERDGAELMNEEFAKRSPFYDS
GUT_GENOME051351_01105 371-454 LNKSNDRMIFSALVNAAPPPVEKRPFLAWDMGASILSLLGFGDNAKIGLGVSLYSPEQNLLEKSGAEKLNKELMKNSEKYNELL
GUT_GENOME236134_00600 444-527 RYNVILNSAVTAENTKKRQFTAFDFYPTVLAATGFRLATDYLALGVNLFSGLPTYAEKYGMEKLNAELEKYSDFYIDKIMGRGD
GUT_GENOME250710_01686 421-503 KNREIYNVFINTVFPPDHVNTRRKFATFDLAPTILESMGATLEGHRFGLGTSLFSEEKTLCETYGTAFVNEEFQKHHQFYQKL
GUT_GENOME065718_00823 371-445 DRRIFNLFLNSGFKSVNTNRKVNHFDMLPAILESINISVEAFGLGRNPTKNTPLLLEKYDFDDLNKEISKRSKMY
GUT_GENOME087909_02009 754-824 KKYTGDARTYTTMDMFPTTLSALGCGIEGDRLGLGTDLFSNTKTLAEEFGCSALDYQLKLNSKFYNKNILM
GUT_GENOME244289_02430 466-551 DRTVFNLFMNTGLIAETNKNRQFSNMDMFPTTLVALGAQIENEKQQLGLGVNLFSGEQTMIEKYGYQKTKDQIEQRSKFYIKNLIE
GUT_GENOME157254_00493 506-609 MDQTFFKDFDPDYRRTCFNLVLNPSPNCSSAKADRFNNREWATFDMFPTMLASIGVDIPGDKLGIGTNLFSETPTLFERDGTDFVNTELEKRSNFYNNHVLVDW
GUT_GENOME024450_00175 406-480 RVFTAYINSARTYNGEKRQYSSFDTFPTLLASLGANIEGDKLGLGVNLYSSSSTLTEEMGVEGMNDKLIAKSEFM
GUT_GENOME134013_00359 422-515 MDPNFFTNISGYERTVFDLILNSDRESENTKDRLFSTMDLFPTTLYALDVSISSNRIGLGTNLYSGDKTIFEEYGVDYVNNELNIRSKFYDDVF
GUT_GENOME235146_01856 418-494 RKFYNVFINSAKQIATDKTKNRVFSSFDMYPTILEALGFEINAEGLALGRSLFNDSKTLLEKYGIETVTSETMKRTI
GUT_GENOME188428_01123 522-598 FLNSGMSNYDNKNRLFSAVDMYPTTLAALGVEIPNNRLGLGTNLFSERRTLIEEIGYEDFYNELSKRSKYYEDNLMQ
GUT_GENOME139070_00354 532-613 IINPAVTPVAGSTNNRQWAPLDYFPTILSAMGYDYGGNRAGLGTNLFSGEPTLMEQMGFQRFDDELRAYSKFYENVFLQNDP
GUT_GENOME202325_01214 390-472 KKYRRGIYNVFLNLPDNMSYNPRKEFTALDLAPTYLELLGIKLPEHAFGLGRSLFSDVPSLISLPEAKLETAIRQKSKVYDKF
GUT_GENOME130348_01003 578-648 INPARPLPDAERRVSHLDMLPTMLEALGGTVPGGRMGLGVSLYSGEPTLLELMDAETLDGKLRGHSEIYNA
GUT_GENOME017809_00505 449-514 NRHFTAMDIFPTLVESLGAQIEGRRLGLGTSLYSDEKTLLEQGYSVEELSNELALPSRLYNYFLLG
GUT_GENOME024029_00277 416-503 GVAQSDRRVYNCFINPAMSVDNDAKYNRIFTSLDLFPTILASMGYKIEGNRLGLGVNLFSGEQTLAEKMTFDALNKETEKRSEYFIKT
GUT_GENOME238386_00112 523-595 LNPSVEPNTGSTLNRSYTELDYFPTILASIGFGVEGNKLGLGTNLFSGQPTLAEEMGITKLSQALGRFSEFYL
GUT_GENOME259545_00992 395-471 TIINSKRERNNISDREYANIDMFPTILYALGADIKEDKLGLGVNLYSDNKTLIEEYGYEKVNNKLSSYSKFYNENIA
GUT_GENOME140104_02456 433-516 RTVYTTYINSATEVQDDVYREYSTFDNFPTTLASLGVEIPDNRLGLGVNLFSDEKTIIEEYGLEAVNQGLSQKSALMEGLIAGL
GUT_GENOME065695_00586 405-492 SDLGKKEISESRRFLDIIINPVPELNGVNQIRKFSSFDVMPTVLEALGNKIEGKGIYLGRSLFSEEPTLVEKYDALFVENETMTKNTV
GUT_GENOME268448_00634 382-458 REVFNLFINSSLSKPSDRPFTSFDLAPTILEAVGFPLASNQFGLGISLFSDKKTLLQKFSYQEIDAELKKPSIFYTS
GUT_GENOME153817_00401 411-511 KLAKSGSTREIYNVFLNSPYLPAGKVLQPAAGYTPMDVAPTILASLGIAFTSTMPDRSVSHSRLGLGTNLLSGEPTLFSTVGMKPYEEELLNKRSKLYEKL
GUT_GENOME279959_00642 419-502 RRRYNVIINPVVTTTHTKNRTFTAVDMYPTILASIGAEIDEDQLALGVNLFSDKKTLSEKYGLDFMNSELTKKSLFYDDKILGS
GUT_GENOME259441_00247 434-500 NRYFTTMDYFPTTLASLGVQIDGNKLALGTNLFSNEETLVEKYGIDYVRKELAKRSNFYNKKFIYND
GUT_GENOME028063_00225 387-468 RVWNCFIHSQKNPVHSKKREFTTMDLFPTILSGMGYALSKNGLALGRDLFSDEKTVLEKKGEKVLNLKIQKQSSFYKKKILG
GUT_GENOME135756_00207 432-517 DRNVYNVYINSAIESKNTNNRKFTTFDYYPTTLAALGFEIDGNRLGLGTNLFSERQTLAEEIGIDKLDDELSKNSKYYNNRLLGDS
GUT_GENOME272660_00026 380-458 ENRTIFNLFIDGGRKADMKRDFSGLDILPTLLKSLGADWQGNRLGLGTVLFSEQKTLLEIYGHEGLNRQLSAPSERYNA
GUT_GENOME237226_00643 384-458 RTIYNNFVSGADALSNLDRTFTALDIGPTVLELLGFELQDGALGLGRSLLRPEPTVIEKYGKRVLEQELLKKSKK
GUT_GENOME105228_01407 424-506 IINPHFEAGSNILENLNTERVFNTMDMFPTTLAALNVKIAGDRLGLGTNLFSGRQTLSEMLGTEYLNNEILLRSPYYEKYFLN
GUT_GENOME147168_02677 420-496 RKTYTSIINSVVTPQLDTCREFATIDLFPTILAALGAEMSSDRLGCGTNLYGTQQTILEEYGTDYCKSEFSLGSPFV
GUT_GENOME027250_00515 415-496 ERKVFTTYINAPVQPTDTTKYREYSTFDQFPTTLAALGVSIEGNHLGLGTNLFSSEITLIEKYDKNVVDDELEKQSDFMDEM
GUT_GENOME192597_01609 382-467 PDFMKTVQQRYIYNMFYGPVPILPEQKKNAVISALDIAPTLLQCAGARWENGRFGLGVSFFSEEPSLVEQYGLEKLNDLLSNTSKK
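
The Protein, Range and AA columns above can be split apart and cross-checked with a short script. The following is a minimal sketch, not part of the UHGP tooling. It assumes that every protein identifier has the form `GUT_GENOME` + six digits + `_` + five digits (a pattern inferred only from the rows listed here) and that Range is 1-based and inclusive, so that `end - start + 1` equals the sequence length. The names `ROW`, `parse_row` and `range_matches_length` are hypothetical helpers introduced for illustration.

```python
import re

# Assumed row layout (inferred from this listing, not from UHGP documentation):
# "GUT_GENOME" + 6 digits + "_" + 5 digits, then start-end, then residues.
# Whitespace between the three fields is optional, so fused rows also parse.
ROW = re.compile(r"^(GUT_GENOME\d{6}_\d{5})\s*(\d+)-(\d+)\s*([A-Z]+)$")


def parse_row(line):
    """Split one table row into (protein_id, start, end, sequence)."""
    match = ROW.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognized row: {line!r}")
    protein_id, start, end, sequence = match.groups()
    return protein_id, int(start), int(end), sequence


def range_matches_length(start, end, sequence):
    """Return True when the inclusive range spans exactly len(sequence) residues."""
    return end - start + 1 == len(sequence)


# Two rows copied from the list above: one space-separated, one fused.
rows = [
    "GUT_GENOME200682_01506 356-417 "
    "IITHFDLFPSLLAALGFTVEGDRLGFGYNVFSQEVQPEENYRDRLRKRVLSHSKFYESLWLP",
    "GUT_GENOME178967_01169530-590"
    "NREFSAFDMFPTILTAIGAEVKDDRLGLGVNMFSGKQTLVERDGAELMNEEFAKRSPFYDS",
]

for line in rows:
    protein_id, start, end, sequence = parse_row(line)
    print(protein_id, start, end, len(sequence),
          range_matches_length(start, end, sequence))
```

Because the identifier pattern fixes the number of digits after the underscore, the same expression separates the range from the identifier even when the columns are run together. Rows that fail the length check may point to a copy error rather than to a problem with the family itself.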