UHGP-MC 78051


Information


Number of sequences (UHGP-50):
52
Average sequence length:
87±7 aa
Average transmembrane regions:
0.03
Low complexity (%):
0.3
Coiled coils (%):
0
Disordered domains (%):
3.33

Pfam dominant architecture:
PF07940
Pfam % dominant architecture:
8269
Pfam overlap:
0.43
Pfam overlap type:
reduced

Downloads

Seeds:
MC78051.fasta
Seeds (0.60 cdhit):
MC78051_cdhit.fasta
MSA:
MC78051_msa.fasta
HMM model:
MC78051.hmm

Sequences list (filtered 60 P.I.)

Protein Range AA
GUT_GENOME115981_01508375-468PARLTFPAGGYSFLRCGRMYAGIDHAPLGFGSIAAHGHNDILSFQLFLDGRPLLVDSGTYLYHADRKRRDQLRSTRMHNTVTVNGREQSQMLGA
GUT_GENOME185158_04468407-495SLEAADAGHIFFRSSWEEDSHFTYLKCGPLGSGHGHADLTHISLYYRGRPVLADSGRYSYVEEEPLRPFLKSAQAHNVCVIDGESHGRP
GUT_GENOME096499_01367318-399DSGYRKLASENIELFFDIGEVGPTYQPGHAHADTLSFELYIKGRPVIVDTGTSTYEMNETRFYERSTAAHNTVTINNENSSQ
GUT_GENOME142392_01074350-438RIPTEINKSFVDSGNIYIRSDFKEDASFTYIQNGTLGSGHGHADLGHISLYYKGEPFLVDSGRYTYVEEDVMREYLKSAKAHNVSVIDK
GUT_GENOME037671_00575397-489SRYLPWGGFAVMRSDWGPEAAYLCFDVGPLGRAHEHQDKLNINLYKGSEELIFDDGGGQYDRSPHYFHARSGAGHNTLLVDGLAQFRREPKTA
GUT_GENOME200288_00632348-430SSRAFPQTGYYVMRDKYQYLFFDAAAMGGAHGHADALNVEWMWKNQLFFTDTGRYTYEEGKWRRYFKSTRAHNTVTVDGLDQT
GUT_GENOME237527_00642537-623TSTFLPWAGYAALRSDWSPNATYIGFDAGPLGYGHYHQDKLNLVLYAGKDELIFDDGGGCYENSPFRRYATNAFGHNTILVDGLPQQ
GUT_GENOME147440_00596312-394FSESGYSIVKSDWQGDVDTSHMLFLTGMYHTKSHKHRDCLSFELFNNGTRLLCDSGKYGYRSDKYRNYFLSHRAHNTVEIEGF
GUT_GENOME260001_01981651-749VTFGEKGKKPPFTSCLFPDSTVSQMRNDWSKTSPYLFTNVRGLGGHGHADDNAIIFYAYDRILLNDSGTLTYTNNKYRQYAISTEAHNTVMVNDKSQNM
GUT_GENOME126947_00835374-450GNSILRGKDERYLLFKHDCYGGEHDHYDRLDISYEAFGKRITPDLGTTGYGALMHYDYYKNTGSHNTVNIEGNNQAP
GUT_GENOME234270_01337368-447GMAVFRTDWTENAMWAFFDGGPFGKAHQHEDKLSFDLYAYGKNLLSDVGCYDYDTSLMRAYVLSTRAHNTGMVDREGQNR
GUT_GENOME142405_00110590-672DAGVIIYRNDIEKDRKKLYWIFTSAFHSLIHKHSDDLSITLLYDGVDYLVDGGKYNYQEKNNYRKYFRSTRAHNTITVDGKSY
GUT_GENOME171377_01798335-428SNVGIFKRSGYYIYREHWCNERMEESTWLLFKSGYSSRSHKHSDDLSFMLYTKGYDVFIDPGWFNYMWGNKYREYLTSSRAHNTIVVDGQSYST
GUT_GENOME224423_03081238-330SVALKDTGFYVMKNGWDEHSTYLLFNCSDISPRHCPGHGHADALSFELYAKGQTIMMDPGVYSYHDKKYRYYFKSTKNHNTVIVDGKDQSEIL
GUT_GENOME095216_00006383-482EGRKPQGLPSRFFDYAGVAVMQSDFSPGRQWAAFDVGPFGIGHYHPDKLSLVLFDGRELLCDPGRFSYNWNGGWSPLYFNSTAGHNTIRIDGCDQYMPRW
GUT_GENOME026458_003831892-1969YALMRSDWTDDAMFTLMDSSGFGGHGHADTNSMTVNAFRRSFLIDPGYYNYDNASPIRRYMISTRAHNTVEVDNTSQN
GUT_GENOME033824_00814355-431FEDSGFYIFKQGAWKLIVDAGQPGPSYIPGHAHCDAGSFELFKDGKPVIVNCGTYAYQCKERRFFRSTAAHNTVMIN
GUT_GENOME096533_005551028-1128NPVIPGDASLPSSSMNFTGIGHSVLRAGEGDDQLYALMDYGLHGGYHGHPDKLHLEIFGKGERLAPDLGIPPYSNSMYESYYKKSFAHNTVLIDGDTQQIP
GUT_GENOME037220_02092422-498HSGNWCLRSGWRRDSDYLHFKCGSLGGGHGHFDKLHIDLSIHGEDVLIDSGRYTYVDGSLRRHFKSACAHNVPVVDW
GUT_GENOME036619_02591283-378KAPQLPEIKKSQLFADFGWATMRTSWEKDATMLAVKSGHTWNHSHADANSFIIFHKGVDIIKDAGNCWYPNPSYRNYFFQSEAHNVVLFNGKGQSR
GUT_GENOME096573_00155692-785FPDGGYYVMRTGWTTADMMMVLQNTPDNPSGGHNGAQWHRQYDNNTFELWIKGRNFFPDSGCFSYGGTSESNASRRKYAASTAHNTVTLDGKNV
GUT_GENOME007321_00295377-463SIALPYSGFFVMRNGWESGSVWGLLDAAPFGRAHQHEDKLNLLIYAQGKYLLTEGGNYAYDDSEMRRYVLSSRAHNTVLVDQKGQNR
GUT_GENOME018315_02819327-415RPVSVNRVYPKSGYAIFRNKWPEHSQYDKAFHAIVKIGCSSRYHHQQDEGHISVFAGGEDWLIDSGLYNYINADPVRKYMRSRIAHNVP
GUT_GENOME000271_01147394-480EEHRIYPQSGYYFYRSNRGDAPQQDTWKLLKAGYVQTTHKHADDCSFLLYSKGHEIFVDCGMYGYANDAFRAYFLSAKAHNTVVVDD
GUT_GENOME247217_00833369-458EGTPPKETSLMTPYSGYAIMRSDWSENAFWCLFEDAPLGFGHQHEDKLEVLLYAYGKYLMPEAGRNAYDASERHWYSLSSQGHNTVVVDE
GUT_GENOME191956_03233355-461RPRSPQELGLPAASYYPDTGFVYMRDSWERCGELWGDTVLTLESGGGGRCLSHQHYDKNSITLFAKGEYFLVDPGHSCYRGESYQRYDCKTAAHNTLTIDGQDQCLG
GUT_GENOME081289_00233390-481SVDFPYGGYSVLRDGWTKDSMYAFMKYSRYAPGHKHEDSNAIVITAFGKRLLIDSGNYNYSDDTLSTEINNYMQSSAAHNTMDIDGLSQARL
GUT_GENOME158306_00738611-690MRSGWTQDDLVMVINAASADGRSHTHPDNLSLVAYAYGQRLLVDPGRNNYNDTEVSNWLRLSTESHNTITIDGKSQAIAS
GUT_GENOME036207_00979844-936EGTPPENFRFFPDAGFYSVRSGWEADAVHLLAKCGPDGGWHNHFDNGTFELFAYGRNLTPDTGNFTYNHLENRAWYTATARHQTLTLNGENSV
GUT_GENOME142591_02962409-493ETAACFPVGGYVVSRQSWERGAMYMAMRAGVGINGHAHSDALGLVLYAGGKELVADSGMGLYEWNKERKYVVSTRAHNTVVVDGQ
GUT_GENOME147550_00454258-347RPPGSTRAVFPDAGYAIARSSWDENAEWVLFKAGYRSNYHHHCDDLSLLFYAHGRLVLAEAGPYGYDYKDALTKYAFSQKAHNVVLVDGT
GUT_GENOME255709_010521347-1443SKLYPTSSYAFMRSDWSNKAQYLFTNVRGGGTHGHYDDNSIIMFAYGRNLLCDAGYVTYSAGEDRALAMSTLNHNTIEINEENQQSPRAYTLKFDNI
GUT_GENOME158229_00697501-588PLSHLFKDIGIAVMTDSLTNPDRVQLTFRSSEYGSYNHAHPDQNAFYIQAFGEKLAVQGGHYDYFGSTHHKNYARKTYAHNTITVDGG
GUT_GENOME158306_00375491-575QQPEEKSIFLPYAGYAIMRTGYSDADYYTAFDVGPFGFSHQHEDKLNFVISFGKTVLLSEAGVYNYDYSPRHLYALSSRAHNVIL
GUT_GENOME000203_01740346-430SYGFEDSGNYYMRSGWSENDNYMYFHCGTLGSGHGHADLLHISLFANGEDYLIDPGRYTYIEGNEEREYLKSCKAHNTTIVDNDE
GUT_GENOME063190_01170696-772AVMRTGWDDKAWYLFTDADGGYGNHAHPDDNSITVMADGQYLLVDPLYGSYSGSAAKTWLTSTIAHNTVTMNGKSQY
GUT_GENOME041467_01156379-476QTPVRCSWNCPQSGTVTLRSDWTARANFTALKNSPLGSSHGHADQTHLTLYCQGKPFLVDSGRYTYREDDPLRTQLKAPAAHNVCVIDGQSGGTPDGS
GUT_GENOME208951_0071099-182VVFPSGCAVLRDSWEPDAVMWAVRCGYTWNHAHEDAGSFLLYDRGVPLLIDSSNCTYDNPIYSSYYRRAAAHSLVLMDGAGPAG
GUT_GENOME104878_01702550-633SSYYANSGEAFMRNSWDPQNAVYVNYQNNKNNGHAHPDLNQVLLYAYGSPLLVDSGRYTYSTENDIYTRLRTPEFHNTISVDGL
GUT_GENOME231492_00778341-421SAYLFPDSGHVCLRDDRRYIFFKNGPFGSAHTHSDNNSVCLYDKKKPIFIDAGRYTYKEEQLRYDFKRSTSHSTCTLDGQP
GUT_GENOME221909_01231245-322FAMSSRGKKDESTELFVQSCHYGTVSHSHGDKLSIILRIRGRQALADWGTYGYRTPARRQYSKFTAAHNTVLVDRRSQ
GUT_GENOME226761_00295331-417RSLTLDAYDSGLFVTRSSWNSDANVTVFLDGPLGSGHGHCDELHLSVWSNGRPILVDSGRYSYREDSSDRPRLKGVSAHNVPTIDGH