UHGP-MC 15858


Information


Number of sequences (UHGP-50):
76
Average sequence length:
77±7 aa
Average transmembrane regions:
0.03
Low complexity (%):
1.15
Coiled coils (%):
0
Disordered domains (%):
2.57

Pfam dominant architecture:
PF07228
Pfam % dominant architecture:
8158
Pfam overlap:
0.27
Pfam overlap type:
shifted

Downloads

Seeds:
MC15858.fasta
Seeds (0.60 cdhit):
MC15858_cdhit.fasta
MSA:
MC15858_msa.fasta
HMM model:
MC15858.hmm

Sequences list (filtered 60 P.I.)

Protein Range AA
GUT_GENOME142591_00027603-687GSAKAFEVTTGVAAAAKDGDLLSGDSFSMTELGSGKFAVSISDGMGNGARARLESSAALSMLEQLLQSGINERVAIKSVNSILLL
GUT_GENOME186679_01054414-482AASGDINGDCCGWQDIGDGKIAMIVSDGMGKGKKAAAESLMVTKTIISLLKSGVTTDLALKMINAVMLM
GUT_GENOME180136_01916562-645LRASAGSACRALEADMPSGDSHIVRTLDGDRLMLMLSDGMGSGAAAARESAQTLRLLSRFLAADVARPLALETVNELMLARSET
GUT_GENOME011004_00962552-626DNSKENGDTSMIFSDGTGVSYVILSDGMGCGKNAAVESRMVVRMFRRLISSGVNYSSAIKLINSIMLTKSREESF
GUT_GENOME064849_00684563-635RDFSEVSGDSIDTFVTEDYKQYVILSDGMGSGKRAMFESHITLKLLREFLQSGFGVKTSIDMINSALCLKLDY
GUT_GENOME188395_00272565-645LTGVAKVPRVGESVSGDNFSIMNLGTGDMIMTLSDGMGSGKLACEESESVVELLEQLIDAGFKEESAIKLINSILVSRSES
GUT_GENOME260354_01324163-237VVALPKSGAKVSGDAHSVAELSDGRILMLLSDGMGSGERARKESNSAVELIEDLYRAGFEEKDILPAINKLMLLC
GUT_GENOME033595_00419421-504EEGKYRMVYGMACSPREGEPISGDTCTVQTGLPGQVIMSLSDGMGSGRSASEDSEKVIELTQRLLETGFSARSALKLVNTVLLL
GUT_GENOME125542_00859258-338LYGIKKVTKPGEAVSGDNFSVFWLPEGRFVAGLSDGMGSGLQACGQSETVLDLMEQFLEAGFSRETAVRMINSSIVLQPEP
GUT_GENOME091122_01777561-638ATKSGETVSGDNFSFMELRTGELIMMLSDGMGSGSRACRDSENLVDVLESLVEAGFRKESAIRLINTLFVMSYEGKTF
GUT_GENOME225912_03614589-667VARACKDGQSIYGDNFDFGECIDGSYMMVLSDGMGSGPQAGRESKAVVDLIGNFTSAGFSRTTAINTVNSIMSLKFSEE
GUT_GENOME090342_01315295-379EDKYTITVGKARIAKERLNINGDSMLEQKLEDGKIALAISDGMGTGPKAQKYSGLVIHMLQKLLGAGFQKKESLELVNSTVLTLA
GUT_GENOME081956_00652588-663ASQETISGDTALLFTDNAGNPYLVLSDGMGTGKNAAIESRMTTELFRKFISGGICGTAAIRMMNGLLLTKSPQETF
GUT_GENOME015268_01710590-656EDISGDSSCRFYDGFGNVYYIISDGMGSGKRAALESSMTVSMLIRLIKAGVGLESAVKKVNLMLLAK
GUT_GENOME219477_00992424-494GYARLGDVSGDSYICKTLSDGRYIIILSDGMGKGEAAAMESSLAVKTLANLIEAGFDVEIALRTLNSILLL
GUT_GENOME181208_01687560-643EAEPLAVSVGVAAVKKEGEPVSGDRGTYFKTEQGVLCVLLSDGMGSGEQAAKESISAVRILERFLRSGVEPATAMKILNSVMLL
GUT_GENOME282114_02188583-645VSGDSHAVFHIHPGLVAVMLSDGMGQNREAQRESRRLVRLMRECLSYNMNPETAMHTLHYVMS
GUT_GENOME199712_01418613-691AGEDVSGDTVNLFANRKDYFYALINDGMGAGKEAALTSGLCSVFLEKMLRAGNRAATALRMLNNLILSRCPDSERECSS
GUT_GENOME257768_00364475-560QAPAMTMEIGAATRSRSGEEVAGDSYLSRALAGGRHVLALSDGMGSGVSARQESHAALSLLVESLRAGYTRAQALDVVNALMLMCT
GUT_GENOME236238_01612148-226EDKFAVQVGLAKITKEESNFSGDSNLELKLDDGKYLLAISDGMGSGINARETSSTVLKFLKNFLVVGFEKEKTLELINS
GUT_GENOME256138_01317150-230DSADTHRERLLVGHASSKKEQVSGDSWGVQDLQDGRMVQILCDGMGSGQLAMEQSKLAVRLLQTLLFGGLSIRLSLSMINT
GUT_GENOME211787_00666556-637ISVGYACLGKNPENVSGDTFSLTKISETKTLFALTDGMGHGKRAREIASVALALVENFYKSGFKSETIISSVNNILLSANEE
GUT_GENOME097047_00307574-650STAAAMAIKDDEHISGDSYSFMELPKGGYMIALSDGMGSGRAAGEESRTVIELLEQFAEAGFKRELALKMINSVLVM
GUT_GENOME257235_00046550-636AVKSSIVGAAAEGEDVSGDSAVSCDMPDGNRLILLCDGKGVGKNANMESSEAVRLISKLIFAGFTPDSAVRLANSAMAECCEESYTT
GUT_GENOME158306_02717188-267AGIAKIARERETQSGDNHTFQPLADGKYILALSDGMGSGSRANVQSGITIELIKRLMNAGFDKETAIRLINSVLMVNTDR
GUT_GENOME096493_00058590-655GEQCNGDSYTFGKLKDGNYITIISDGMGSGPQAGQESKVVVELIERFVQSGFSKITAINTINSIMN
GUT_GENOME176153_01057596-679EDKYKLQVGIATSTKEGSKVSGDSNIATKLEDGKYLLAISDGMGSGEEANQSSKIAINMLAKLLKAGFDKETSIKLINSSLIAN
GUT_GENOME194352_02373575-659VVGIGQRQKPGEKVSGDTARSFVTESGHACLLLADGMGSGPAAAQDSRSILSLMERFLRAGIPPADALRTVAPAFRLRVDGTRGV
GUT_GENOME105241_01086325-404GIARAVKNEENLSGDTFAFTNLPMGRVLLCLSDGMGSGQAAFAESETVIDLAEQLLEAGFSAEAAVKLINSVLLLRGEEQ
GUT_GENOME025296_01703568-654QFSISAAVASAIKTNRRVSGDYALYALVDRTTYAIILCDGMGSGESAREESRTCASMLMRMLETGMEPQDAMNMVNSMLLCASTGTL
GUT_GENOME143400_02873550-633LTGVARATKEGEEISGDNFSILSLDNGEEVLLLSDGMGSGEQANRESCLVVELLEHFLEAGFDKEAAIRMINSTYVLQSDSQSF
GUT_GENOME222837_01661587-667GVVRLAKDSEEVSGDSYSSISLGDGKYMVALSDGMGSGKRAAKESRTTIDLLENFFEAGFNREIALKAINSILMLRSNDEM
GUT_GENOME026458_01504567-644MDTGFAQTAGESGGVCGDHYICAPSSDGKYILALSDGMGQGAEAEAQSRMTVQLLRRLLAAGFDKETALRLMNSILMV
GUT_GENOME157115_01556576-642EKYSGDTYSVFRDGDGNFYAVICDGMGTGMRAAVNSNLAVSLFEKLIKAGFGVQAAISTVNTCLISK
GUT_GENOME239069_00156548-613CGDYAAEFSLPGGKSCLLLCDGMGSGADAYGESRMAAELLTEFMSAGFSAQTAVSLVNSTLILHSG
GUT_GENOME115105_01664471-543KKQGSVISGDSFSVRENDDGKLVLMLADGMGSGSIAASESEILINTMEEMLDAGFDPLYAIAFANKCLSEKNR
GUT_GENOME009869_00519563-643LTGVARVPKEGEELSGDNYSVLSLPTGKVVMTITDGMGTGSAAYDESETVIEMIEQLAGAGFSESMALRLVNSSLVFSREG
GUT_GENOME173872_00863475-558EAEAFEVHAGYASLAPDGGQDCGDTLTTVQLPDGQYMAALADGMGHGLRAQAESRAAIDLMEDFLLARFEPEAALSGVNDLLLH
GUT_GENOME037638_0076930-109GAARAPKTGEMVSGDNYTFCQCGPGQVVMSLSDGMGTGMQAERESRQVVELAEQLIGAGYSPRSALKLINTVLLLAGEEQ
GUT_GENOME279992_01224323-403GIARVTKFGEEVSGDSFSIKTLTNGRVLFCLSDGMGSGRKAYLESQMTLDLLEQLLETGFELEAATDFINHVLLFSRKDQH
GUT_GENOME273725_00307415-505EETKYKVLTGVARLTKDGQSVSGDTFSMINLPSGELLMALSDGMGSGTSAFDDSKNVIELLEQMSEAGFSQTSAIRLINSLYVSDQEFENY
GUT_GENOME122716_0055976-153IRVRIGYAAAAAQGVSGDQAVSCELPDGRFALILSDGMGKGAQAAAESRRLVRRLRGNLKKGMSPARAIKEVNRFMIR
GUT_GENOME255980_00487573-648FKIVSGIATVKKDGEEQNGDTCCALSLSDNRYALAISDGMGSGAAAANESKTTIQLLKKLLLAGFDRAAAIQLINS
GUT_GENOME071708_00072482-572EKLTQNMLGAGYKLDIGIAKVKKQGSSVSGDTAEVFKLKNGQLLLGVSDGMGSGEKALKSSKKTFELLEKYINADLDKKMAIELINSYMML
GUT_GENOME222296_00800563-643AFSVQIGVASATKEGSDISGDAYSHEAFADGRYMLLLCDGMGSGRRAARESRLAVSLMEDFYKAGFAEADIIEMLNKLLIL
GUT_GENOME183568_02098600-668VSGDNYALFEAGPGKYALAISDGMGNGERADEESRATLQLLMNVLKSGIHEKLAIKSINSIMALRSTEE
GUT_GENOME000079_02979444-514EGISGDSTLCNHIRKGEYLLALADGMGKGEKAAQESNLTLNTLYNLMRAGFEPELALRMINSLLLMKSTEE
GUT_GENOME000259_04800402-489LYGVARSTREDEKISGDNFAFTQTENGQLVVSLSDGMGSGIPACQESEQVIELLEQFLEAGFCKETAIKLINSALVIRTDAQTFSTVD
GUT_GENOME039215_00304149-225IVGKCFVTSDMDGVSGDNYSVEYLSGGKLAVTIADGMGSGVRARSESGVVIELLEQSLDAGFDARAAIELINAAISA
GUT_GENOME210309_01589637-706TAADERLCGDSYESFYDGRGNYIVILSDGMGQGARAALDSTMAVTMTAKLLKAGIGYHSALKMVNTALML
GUT_GENOME044174_00846575-656QVEGFDTVVGIASRSRSAESGDKHYTSYLNDGKFVVTISDGMGTGHKAAVESDAIVNLLGNFLEAGFDKTIAVKLVNSVMVM
GUT_GENOME071234_00957182-263MTGAARAVKEMEPKSGDNYTVLESEKGKITLLLSDGMGSGEKACEDSEGVLDLMEKLIEAGYRPTAAANLVNTALLARGEEQ
GUT_GENOME110965_00947516-598EKPNYRLSIGFAQHSAEGTLCGDTVKIINDNKGHSILIISDGMGKGSRAALDGAMGAGLISKLLNVGFGFDSALKVVNSALLV
GUT_GENOME246434_01403558-641MEGTACVPKAQENVSGDAYTVIHLDSGQVVLSLADGMGSGAAAGEESEYIIHLMEQLIETGFGRASAVRLINSLMFLKSENQAF