UHGP-MC 4501


Information


Number of sequences (UHGP-50):
75
Average sequence length:
150±7 aa
Average transmembrane regions:
0.06
Low complexity (%):
3.58
Coiled coils (%):
56.75
Disordered domains (%):
6.46

Pfam dominant architecture:
PF01895
Pfam % dominant architecture:
133
Pfam overlap:
0.34
Pfam overlap type:
shifted

Downloads

Seeds:
MC4501.fasta
Seeds (0.60 cdhit):
MC4501_cdhit.fasta
MSA:
MC4501_msa.fasta
HMM model:
MC4501.hmm

Sequences list (filtered 60 P.I.)

Protein Range AA
GUT_GENOME229354_0105253-167LFDPAALLEELESSFTRLEWLIARINLTNCAVKVEGRSLTELIARRDVLSLRAEAYRRLVEEASQNTHRATRTEIKILSAVDVPALQRQADDASRELRLLDNTLQATNWTADLME
GUT_GENOME252005_0149061-209MKLAEALQERADINSRLAELYPRLSNNAIVQEGEKPAEDPQSLIDEIESCTARLAELIARINLTNCSIRVDGKTLTEMIAEKDALNAKINLYRNLIGAASNVAHRATRTEIKIKSTVDVGKLQKQADAYAKELRLLDNRLQEANWLNEL
GUT_GENOME027336_01174158-308DMKLAEALSIRKDLTKKIEIIKSRLISNVRLQEGDEPSEQPAELFKELDSCLVQLESLIARINKTNMHTKADGRTLTEMMAEKEVLTKRIAIIADVADKANETQDRYSRSEIKMVTTIDVKALNKQMDKLSERLRKLDISIQGLNFLTDLE
GUT_GENOME046155_015181-150MKLAEALRERKQLQIKMGTLRQKLIDNAIYQEGSKPVEDPAELIKALEATRGELAALIKRINATNSSVKVGDFLLGDLVVDRDTRIMEVTALNNLIDQASQVSNRYSRSEILVLPSINVKETQKKVDQLAKDIRELDNLIQATNWNTDLI
GUT_GENOME179017_004551-151MKLAEALITRKEMYRKLSQLAERIQKNIIIQEGDEPVEDPNALMPELETLSREITELVQRINATNASTRLENGMTIAQALAKRDELIRLAGFFRSFADMGREGQVERYSKSEIKRVCTIDVAATEHKADALAKEARELDMAIQSLNWQVEL
GUT_GENOME200288_053951-151MRLAEALVLRADEQKKIAQLKQRLERVVKVQEGEQPAEDPQLLMIDLENTIRSLTVLVKKINKTNAQTDFTAGVTLADALAERDGIMQERASFNEVLQHASIRQDRYSRSEVKYERTVNIADIQHKVDSLSKSYRELDFKIQEKNWTIDLI
GUT_GENOME194674_001471-150MKIAEALILRADIQKRIAQLRTRLNNNAKVQENEEPAENPELLLTELENLILQLNDLIVKINRTNTLSKIDGISLVELIAKKDTLSQKAGILREFIEIASQKVDLYSTTEIKVFSTVNVSEQQKKLDKLSKEIRETDTKLQQANWTIDLV
GUT_GENOME147629_016751-151MKLAQALIERADLQRKLAQLGARLQQNAQYQEGEAPAEDPQDLLTDYRRSAAALTRLIVAINRANHNVTLADGTTMLAALAERDRLKAEHAMLGKVADAAMTDQSRYSRSEIRTLAAVDIRALRVEADNVAKRCRELDIQIQQANWENDIA
GUT_GENOME167257_018651-149MKLAEALSIRADLQRRINQLRTRLKDSSKVQEGDLPAEQLTDLFQELDACLVQWEDMVFRINQTNIKTLYEGKSITRMIARKDRLAQRVAINQELLKHVMETERYGRNEIRYVRQVDVTALRKETDCFAKELRELDLKLQELNWAVDLL
GUT_GENOME115357_008401-150MKLAEALRERKELTSTIEVLRTRLLQNATIQEGTKPTEDPIEIMQALNNAVSRSIELICRINRTNSESVLDGRPLGDWVVERDQLMKKASVYREFANKAGSVVDRYSRSEILIIPTVNVKEYQKKADELAKEVRELDNKIQQANWNIDLL
GUT_GENOME237868_026021-151MKLAQALQTRADLNRHLSQLKDRLIDSSLVQDGEKPAEDPIKLIHEMEEDIKTLESLVTKINLTNSRTMVEGTSLTALLSKRDALRQKIQITRAFVEAAREKVDRYTAKEIIVHSAVDVASLQDGLDYLEKKLRLTTDLIEETNWTTELLE
GUT_GENOME142592_029111-163MKITLAQAINLLSSLHKKVNELQGEFFTVHVIEVPKGESYTPYERTVEAVLQELSEIQHDILELKEIIQQGNLSNQVEWDGHSISMIRAIETAKQLRDRLNLLKTLATTKKREYNVHHHSGAIMEQIALFDPAEFKKQVDKLTRQAELLSSRIDKVNYTVEIE
GUT_GENOME027343_010866-157IMKLAEALSIRADLQKKVAQLKERIKESAKLQEGDEPCDNVEELYKELDEVLVQLEDLIYRINITNVQIVQDGDSLTRLIAKRDVLSMRVKALKEVVNYVAANDTRFGRNELKYVRTIDIKALRKETDTYAKQYRELDLKIQSLNWTVDLMD
GUT_GENOME080367_006111-151MKLAEALLLRSDLQKKLLSLQQRIHKNVLVQDGDTPSEDPEQLIDEAVLVNKQLFQLIQKIHQTNAQAQTNNGKALLDILNQRDQLTAEHRIIQQAIDNTQKDTDRYSVREIKWIKAVSVSKLQKQADEISQSLRLINLEIQASNWQIDLR
GUT_GENOME025365_014571-151MKLANALAQRADLQRRMAQLASRLMNNAKVQEGERPAEEPAELLAQLEEISRQLEELIRRINLTNTAVRSETGESLTALLARRDCLKMKLGIYREFLQNASDVVPRGLRTEIRIVSTVKVSQMQKQVDDMSRDLRLLEETIQSLNWTTELQ
GUT_GENOME047838_0018117-166MKLAEALQERADLNRRIEQLNSRLYRNSKVQQGRNPEEDPAELISELDGCIARLEYLIAKINICNCNTKTPDGKSITELIAKRDCLKIKIDAYRDLAGNASSLCDRVTRSEIVILSTVDVRALQKQIDAMSRELRCTDNMIQQLNWLTEI
GUT_GENOME098425_01633121-280ARKHEADALAARIRGIDARKELEARLTRLEERLHASARVQEGLEPDESPDALYAMLEKAAAELVEIVSRISRTNVETVVEGRILADWIAERDVSWRRLRVLQSALRAAAPSYDRHGGPGSVRSVTTFSTARRHAELDALALRINAIDARIQQANWETELL
GUT_GENOME260923_002661-150MKLSEALILRKDIQSRLDEMENRLCACALTQEGESPAEDPAALLAEAESLTSQLSELCTRINLTNSRTMTEGQSLTELLARRDALGRRQSLLSSLLSSASQAPRRASGREIRILPTVDIPALRKRLDASAKELRELDMRIQQINWTVDLQ
GUT_GENOME036189_015451-138MKLAEALQERADLNRQIEHLRSRLSHNAIVQEGEAPAEDPGDLLAQLDRATARLEELMASINLANSRTVVNDKTLTQIIARKDCLRLRLEAYRELAETASQTARRATRSEIRILSTVDVKTIQAQVDTMAKELRMDTV
GUT_GENOME095002_009301-151MKLAEALSRRAALMDKVRQLKVRLDDCIKIQEGDTPIETPEEVIAELDKTLDSLRRLIYCINITNTRTEVDGRNITLLLAERDTLKLRVTTLADSLKHLTVREDRYNRSEIRYVRTVDAGEFRKLYDRCASQLRQLDLKIQSIGWMTDLIE
GUT_GENOME000287_020681-150MKLAEALLLKSDYQRDIYELKTKIIHCSKIQEGEESLVNPDDLLSKLDEVFNKLELITKRINYTNSQIIINGQTLVDLILTRDAIKMKRKILTTLLDEATIKQDRYSQSEIKFVTIIDVLNIQRQIDDLSKRFRELDTQIQQLNWTHDLM
GUT_GENOME273588_0095626-178NMKLAVALQERADLNKKIEQLNYRLGNNAVVQDGEKTQETPEELLNELNACLDNLEKLICRINLTNCQTKTNYKDMTLTELIARKDILTIKINSYRNLVNTASNLIPRVSRSEIKIKSNIDIKATQKEIDKLSKELRLVDNSIQEINWTVDLL
GUT_GENOME237714_013971-150MKLAEALQERADLRRVIGQLEARIVNNARIAEGTTPTEDPAELIRTLDGAMDRLAALITAINGTNSVTVVDGKSLTEWIAQRDTLKEKIEIYWRLVNEASEVTRRMTHSEIRIVPAVDVPTLNRQTDKMAKELRLMDNRIQAANWTTELI
GUT_GENOME091147_0183716-166SMKLAEALQERADLNKKIEQLRSRITSNALMQEGVLPVEDPEQLLKELDECLNQLEKLIIVINKTNMAVVSDGELLSDLLAKRDVLKLRIASFQNTISIASNLCFRSRGDEIRQLSAVDVKALQKKVDALSRDYRILDNRIQAANWTADLI
GUT_GENOME219931_017321-150MKLAEALQERADLQKRIDQLEARIQSSALVQEGETPPEDPLALMEETESCLLHLEALAERINATNCAVTTPEGTLTALLSKRDRLTRQVRLYRNVLETARSTAHRATRTEIKILSAIPVQAYQKKCDQICAELRRVDTLIQSANWTEELL
GUT_GENOME137589_015241-150MKLAEALILRKDLQTRLVRIQERLNANVLVQEGDRPSEDPAELIKRLNETCAELNTLICRINKTNSSTLMDGRPLADAITERDLNLRKISILRSALKKAADRPSRYSQKEIRLLTPLDVQKEEKIVDRLCYDTRALDAKIQGKNWEVELL
GUT_GENOME028746_011341-149MKLAEALCVRADLQKRIMQIEDRLKSVVKIQEGDEPDEDAENLFSELQQATQQLEEFVYRINRTNLHAERDDVPITRMIARKDALTLEVGTLRNVLKVAAEKESRYSRNEIKYVRTVDTVKLRKKIDSLSAELRKVDLKIQESNWMFDL
GUT_GENOME050176_018501-150MKLAEALTERKALNTKIGELRTRLEQNALVQEGDKPSEDPQELMTELNQNVEAFVSIVSRINRTNERTMVDGRPLGELVTERDARMRQAGILRSFLDQVSGRVDRYSRKEIKVLPTVDVITQQKILDAMQKRVRELDTLVQSANWTTDLL
GUT_GENOME242999_013511-151MKLAEALQMRSDMQKKIEDLRDRLFLSAKYQEGEKPLEDPSKLVKELDKVTKDLEDIIYKINYTNSKTETDQGSLTKILAKRDALKLKRSVYSSFADEASNMGPRYTKMEIKLYPSMDVNVLRKIVDDLSKELRQIELLVQQTNWTTDLIE
GUT_GENOME250230_0148115-165PMKLAEALNNRADLKKRLFQLKERLLRNSKVQEGEEPSENPEDLLIEVNSCLVELELLIKKINKTNSCTFYGDKTITDLIAERDVLSLNLSIKREFLKDASEKIDRYSNTEVKILSTVNIKEKQKEIDKLSKILRETDMKIQELNWTTELQ
GUT_GENOME003350_006461-150MKLAEALNERKELQTRLDRLHDRLTANVRVQEGDTPSEDPVKLFALLDHTAAALQALIYRIGRTNIETIVEGRSLADWVAEREVAQRKFNIMRDVLDTAARRTERHSVAEIRILATIDVAAYQKSLDQLAVRIREIDATVQAANWTTELL
GUT_GENOME271834_013351-150MKLAEALSIRADLQRKLSQLDNRLLNNSKVQEGETPSENPIELLAELDDCTSKLEFYIQSINYTNSVTIIDGLSIADLIAKKDTLNKKATIIRNFLNSASEVINRYSNKEIKIHSTVDVTELQKSLDTLSKDLRELNIKIQSANWTTDLI
GUT_GENOME026773_009491-149MKLAEALIERANLKNDIESLRQRMIANAKAQEGVEPSEKPSDLLKELETKLERYEYLVVHINKTNSETVFEEGSIADMIAKRDCLKMKVSVIGNLSESGRALVDRYSRTEIAIKPTFDTNAIQKKCDTLAKECRELDAKIQGYNWITEL
GUT_GENOME242757_006891-150MKLAEALLLRRDLNNRLFQLRNEISSSVLVQEGDTLDRSITELFKEYDEINQQYSELVVAINRKNATASLADSALLLMEALEQREQLRRKHALLTQALDETKAAPRIGRNEIRLVRTIDTKTLTEQLNATAKQLRELDGKIQQTNWLVDL
GUT_GENOME035963_017891-155MKLAEALIKRKALEELLVILKRRLLANALVQEGHYPAEDPIKLTEELRRAVDDLMVLISRIETTNLSVSVDGISLSEMITKRDLILREVFILSEFLDKTKYRYCSSELLHGRTVRVRRIFSVAVTPCQKKLDELSHQARLLDAKIRGANWNTELL
GUT_GENOME029667_00863145-291KTAEAIILADDKLDNLHQRILANLKVQEGDAPHEDPKALLEESMEFHRTMCALVQRIDRANGTICLPGGETLSQALARRDMLRKQRGLLADIAEAASQRDYRLTHSELRMQTTLDLGALQKEMDRLSKAYRELDVAIQGVNWTTELP
GUT_GENOME267195_008571-151MKLSQALIIRSDYQNKIYELKKRIINNSKVQEGENVSEDPMKLLKELNRVIDELDIITKRINKTNNESLFEDNITIADAICTRDTIKKKRNAIVAIIEEATIKVDRYSQSEVKFISTISIEQLQKQSDLLAKEFREIDMKIQEKNWTTELL
GUT_GENOME271787_018641-151MTLAEALMERAELKAKISAVSDRIEDNILVQDNEAPGEDPKELLPELSASIQRLRVLISQINKTNCSTLIDGESLTDLIARRDCLLLEISSYRDFTETSRRSTDRARGSEIKIRPCIEVKELQKRLDDLSKELRELEVKLQKANWNTELIE
GUT_GENOME159267_024521-151MKLAEALQERADLNRNIEQLKNRLSSNALVQEGEQPVECPENLKQELDASISRLSYLIARINRTNCQTVVEGQTLTELIAQKDALSLKLHVYRDVVYAGSQISYRARNTEIKIRSAFSVADWQKEIDRMSKELRRLDNRLQESNWKTDLVE
GUT_GENOME188062_010611-150MKLAEALILKEDYKEKIEELKYRLNDQVLVREGDNTELGTIELLDRIEEVMSKLHHLERRIHEACVNVAIDDEHTLNDMVQERNYKRKQYRFLYQLLMNISIRNPKFTRPELRFVPTISVKELEEKVDAAKKELNELEGFLQRINWEVEV
GUT_GENOME244378_0081937-189NAMLLSEALASRAETADRLAEVRRRLAIVSLVQEGDTPDEDPKSLIAEAERLMTRFEWLVRSINATNSATPFDDGTLSDALAKRDHQLTLREFYTSAADKASGRRDRYSASEIRYVATVDIQSLRKKADRAAKEYRALDTRIQQVNWATELIE
GUT_GENOME212246_015531-150MKLAEALNLRADLQKRIANLKERLIKNAKVQEGDIPSENPHTLINELNDNIIELENIIKLINKTNSSTYVDNESISDIIAKRDALGLKISILRSFVSESADRIDRYSNKEIKILSTVNVAEMQKQVDNLSKEYRLIDTKLQGLNWTTDII
GUT_GENOME200705_01488791-941MKLANALSLRSELQTRIRQLESRLDNNALVQEGESPAEDPMELLRELEEDYAQLEELIARINRTNSTTEAGEGRTLSDLLARRECMTGRLRILRDFLNSASAVVNRRTVGEIRIRSAVNVRELQKQVDRDAKALRELDETIQEKNWTTELV
GUT_GENOME026455_001353-152MKLAEALMERSDLKTRIEQLAARLNENAKVQEGDEPAENPAELLDELNRLYARLERLMTLVNLTNARTLSDGEPLTALLARRDCLSGRIRQLRDFLACASANVSRGTRAEIRVVSTVNVKEYQKRADDLARDLRELDVKIQSLNWTTDLM
GUT_GENOME191723_0155217-168EEMSLAEALQMRADIKTRISQLHSRLNDNAKVQEGETPAEDPLRLIELLGAECEAYETLIRRINLTNAATVWEGKTLTALLARRDTLSLELSIFRDFLQQASQRIDRYSKTEIKILPTVDVTAMQKTVDAKSKELRQLDAAIQKLNWNTMLI
GUT_GENOME158225_035761-151MKLAEVLSLRADLQKRLAQLKVRIKNAAKIQEGDTPAEDPKEMMQEYLDGLSQLEGLIVRINKTNQETTMADGHTLMEKIVHRDILTKQVSMQTETCNYLTEMTNRYSRQEIKYVNVLDAAAIRKQADECSKRLCQVDVEIQAANWTIDLC
GUT_GENOME207613_0219837-189KMKLAEALILRADLQKELMSLKNRINQNLLVQEGEKPSENPYTLYDEILLKVSELEGLILKINKTNSNYITSNNESISDLIVKRDMLGKKISILKEIAEEATVKVNRYSQTEIKIFSAMKVDQLQREINKYSKEYRELDTKLQGLNWTIDLME
GUT_GENOME031057_01958343-493IMKLAEALSLRADLQKRISQLEVRLKNNARIQEGEEPAEDPRGLMDELNNNLNDLETLIFRINRTNMATLAEGTSLTEMIAKKDILALRISVLRSVTQSATGSLERYSANEIRYVRTINVADLQKEIDSYSKQLRELDVKIQQLNWLTELI
GUT_GENOME282473_0397040-190NRMKLAEALSIRADLQNRIDQLKSRLKYSAKVQEGDQPSENVNELYKELNECLSQFEVIVYRINRTNMQTVHEGETLTQMIARKDTLKLRLTVMRDLLKHVVENDRYGRHEIKYVCTVDVGELRKVTDNYSKQLRNIDMKLQRLNWSIDLI
GUT_GENOME083979_000961-151MKLAEALSMRSDLNTRLYQLQVRLNNNSRIQEGEEPSEDPQALLVELDEVASRLEDLIARINLTNSRTVSSDGSTLTQLIAKRDVLTRKSEILRSFLDEASCKVGRGSASEIRVISTVDVRALRKKADSLSEEVRITDTRIQELNWTTELI
GUT_GENOME095452_0081615-165MKLAEALSERAQIQARLTHLSQRAQNVIRVQEGEDPAEDPAQLIEQAEALHQKLQNLIMRINRTNSATTFSGEQTISDAIAARDNAQRRHIFFSNLADKAAARQDRFTRMEVKFMPAYPVSQLRDRADDAAKEYRHLDLQLQELNWTTELV
GUT_GENOME141247_032411-151MKLAEALLLRSDQQKKIISLKQRINANVLVQDGDQPSEDPNELLKQVFSLIQEFQKLSYAIHETNALTKLNDGRSLLALLTLRDEFVEQHKTLTAAISNTSRESDRYSTREIKWHKVIPVSSLQKQADDISLKLRDLNVLIQSTNWKIDLI
GUT_GENOME000812_009351-152MKLAEALIERRDLQNKLLELKNLMLANTLVEEGSEPDIEVAVLLRQYDQASKQLEELIVAITKRNQEVVVTVNGTTCSLQSLLAKRARLRQKYDLYTSVLEATRENRRFGRNEIRMVKTVSVKDFTEKLDELAKALRQTDGVIQQTNWLEEL
GUT_GENOME212323_01079264-415IMLLAEALAERAQAQERLNSLHERLLAVVRVQEGDTPEEDPQELLRELDGVTGRIDELVQAINATNIATAFDEKMNLMEALALRDGLLRKRRIYHDLAQRAGARSDRYSRNEIKFVSTIPVADMRKRVDDLSKQYRELDTRIQQLNWNTELK
GUT_GENOME041198_009251-151MKLAEALAQRADLQKRIAQMDGRLKAACKVQEGDEPAERPEALFEEFNRCIDELEQLIRKINRTNQSVVLADGATIADKIALRDVLKLKVQELQNLLSFLSERPDRYSRQEIRYVNTVDVKEVRHRADECSRRLRETDCEIQSANWTYDLM
GUT_GENOME046050_009001-152MKLAEAVLRRNQHRENVRELHMKLESQLLIPEEEVLEEDSKVLLDRLEEEMNRYHEMNQCIERAYNNYYVEEGLTLKDLILERNYWKMRWQKLDSIVAQLTGRTCAIRDLRYRKEGQSLKKTLQITMVENQREACKEMYNALDVRIQEKSWE
GUT_GENOME233268_002331-151MKLAEALQERADLNSRIYQLRQRIEENLKVQEGDEPDMDVNELMKEHDDVVNQLTTLVTKINLTNCSVKDNEGRTITELLSKRDSLKLMLSTYQGIIESSGSRNHRISRSEIKLLSTIKVADVKTKLDAISKQFRLIDSSIQQLNWQVDLI
GUT_GENOME029599_013441-156MKLAEAIMERSTLLGIVNNLHDRIERNLQIQEGEVPQEDPQILIKVKHQKSEELLRLITNIQRINASTYVLDEKDRATTETLQAALVRRDYLFRLSSAYSEYARYSVPNVRGRQSEIKVLPVLDASSLQKEADKYAKEARELDRLIQKTNWYVEYS
GUT_GENOME096370_0210315-150DRLNKLQDRLVQGAMIQEGDTPIEDPAAMLEEAAGLLQEIETLVRRINHTNAQTAFEGATLTDAIARRDALLRSRRLYAAIADAGTMTGSRYSRSEVRFVPAVDVKALREMADSAAKAYRELDTKIQQLNWTTQLQ
GUT_GENOME259304_006451-113MKLAEALILRADIQKNIAQLRERLMLNAKVQEGDSPAENPEELMTALAARTTAFIDLVRRINRTNAMPLADGVRIADLLAERDALKNSARIPQGGVRQDRSLFRQGNSNHEYS
GUT_GENOME047708_004541-150MKLAEALIERKELQSRLSRLNQRLQANALVQEGDVPSEDPKVLIKDVETTIDELTTLVDRINKTNGVVIVDGKTLSEMITKRDFMVKRISIMREFLQKASNKIERYSRNEIMITATVDVKPYQKIVDKLSYEARMLDAKIQYTNWNTELI
GUT_GENOME096502_013451-155MKIAEGLITKADLEEKIYNFYNRARNNLLVQEGEVAQEDPEKLIKSLEEANKSLVELTAKIHKANSQSQLVDEDGNPLDITIQEALAKKEGLVSLASKLRDLAESATPQNRYSKSEIKFIATVDSKVLQTKADKLSKEARNLDIAIQRTNWLVDL
GUT_GENOME243849_009274-154MKLAAALVMRKEWRNKMNELATRIENNVVSQEGDEPLEDPNELLKQLQATHQDTVALVQAINRTNATVRMADGRTIADALAERDGLFRMVRHLRDIADKAAVEQVRYSLTEIRQVSHIDVTRVQAEIDRMAEKARQLDLEIQELNWTTDLM
GUT_GENOME217902_011421-151MKLAEALLLRSEYQQRIESLKQRISDNVRIQENDTPHEKPDLLLSEIVNVSGQLCEIIKRINRSNSKIILPDGRTMAEALVDRDMLLKERSIFLGIVSKANERDYRLTHSEVRTYLTVDVAKLQKQIDELSRRYRELDTMIQSTNWTTELA
GUT_GENOME041325_001561-151MKLAEALQIRADLQRRLAQMPQRLENNATVQEGTEPAENPESLLRELDTLTGELERLMTRINLTNARAAGEDGVTITALMARRDILRSKADTLRRFLNAASSIAQRRQSTDIRILPTVDVSRLRKRCDGLSKELRETDLAIQKLNWTTDLI