UHGP-MC 29569


Information


Number of sequences (UHGP-50): 118
Average sequence length: 127±15 aa
Average transmembrane regions: 0.14
Low complexity (%): 26.98
Coiled coils (%): 0
Disordered domains (%): 14.44

Pfam dominant architecture: PF18676
Pfam % dominant architecture: 508
Pfam overlap: 0.37
Pfam overlap type: shifted

Downloads

Seeds: MC29569.fasta
Seeds (0.60 cdhit): MC29569_cdhit.fasta
MSA: MC29569_msa.fasta
HMM model: MC29569.hmm

Sequences list (filtered at 60% identity)

Protein Range AA
GUT_GENOME001544_02421 420-534 AGEATTINDNETPLANAEQTNAGSFALINILLAVLSIISAMVMGLKGMRREGKTSMTAAGIVLAIVSAAVFFLTQDLSGNMVIADMFTVVMAVIAGVQIAVGCMINREEAREENR
GUT_GENOME142595_02032 1581-1711 EVNAPTPEGQAQEEAFIVSEDSKIPMAGAEASGQREMENWSLADLLLMLAGAALAVLNIVRLCVKKKRRQPQKHKALLITGILAGAVAPVLFLVTQDMTSQMTIFDRWTWLFGLLAILPGITLWLRRKKEA
GUT_GENOME152603_00318 1589-1737 TQPTESFPDDGGEEDIADEPTPLDMGGAWALVNLILTILTVLGSILLLIGYIGKKQKEREDENGNVILNAEGEAETDDIKKKGGWRLASIIPAVAAVIAFILTENMRLPMVLVDKWTLLMVVIALVQLLVAYFSKKKTQEPEQPEQMAA
GUT_GENOME018931_01492 3084-3234 PATTTPAPEATVEPAKVEPTPAPTAKPEKIKEEATPQASPKGHWALINLIAAMLSVVLAVAALLAKHAKEDDEEEKDDQVVESENQDDETSASKRHRTWKIVAVIDAIAAVVVFILTENLAHDMVLVDKWTVLMVLFGLISIVSTYFARKW
GUT_GENOME164721_01054 383-487 VNDNKTPLAGIENSWALINLIAAMTTVILGIVIIFLKKNMESNQEEGPTYQKRYTWMKLLGIAIAIISVVIFILTEDIALPMVMLDKYTVVMVVIAIIQFVVFFT
GUT_GENOME220946_01629 826-941 ESEPEVIEDEETPLAPMANGKWALVNLVLMILTVLASLLMLLGVIGKKRGRNNKSFWRIASLIPAIGALAAFVLTENMKLPMAMVDRWTLLMVIIAVLQLIVAAMSKKQNESEDDG
GUT_GENOME282983_00058 802-918 DADKEEDKTENIEDEETAQAGAPGEGNWALLNLILMALTVVAGLAMLALRFARKTGMAHLLGIVPAIVALVAFFVTEDMSLPMAMTDKWTLIMALIAVVQVVLLVLGRNGAQEDDQP
GUT_GENOME093619_00145 358-516 EIIPDEEVPTAPGVDEPAQPEELEDIADDATPLAGGRGGHWALINLILAILTVLASLLLLIGYIGKKKKALEDEDGNEVKDEDGNTVWEYEKKKKGFWRVFSLIPAIVSVIAFILTENMRLPMRLVDRWTLLMVIIAIVQIVVAVLCKKRKNKDDEDRD
GUT_GENOME173048_01155 2759-2850 KDSASALSLLDLFATSGVIVLTLASFLFKKQVRGYGVIASLTAVILFFATQPLILKFTFFDKWSPLFLILLLMEALMVMMLGEKKEKEDKTQ
GUT_GENOME260074_00850 2306-2461 PAPTPQEEVKKEKTPKAEPKEEKVEKEKTPKARPEKFWALINLICAIVTVLFGLLLLISKRHKNKDDEEEDETKDQTTTNEDEEKEQEKKRGAFTRVLAVLIAILSVVFFLVTEDLSLPWTWTDQWTIWMVVIGLVQIVVFFVGRKWKNVDDDEDE
GUT_GENOME252297_01627 564-679 PVQEDARALTLADLACTLLSILFAALVWLRGKKDGGEDADNENEQAERMDKEAEEDEPRGHAVQKTVNAVLAVLSVVLFFLTQPLVWRFRLVDWWTVLFVLLCGTALAMLIWKRKE
GUT_GENOME257282_00601 1674-1801 IKENVTPKAKGTEANWAIMNFICMIITVLLAILLLLSKSKNEEKLENDKTYSYDEEEDYQEEKRSLFIRSLALLLAIISIIVFFLTEDMSLPMVIIDNWTIYMLCFAIVQLIVFILGRRWKEVEQEEK
GUT_GENOME173036_00341 2161-2253 PLRSDRMALADLILTALSSAMAAEVLVFRRKRKAGMVAVIAAVASLLLFLLTQPLILRFVIFDGWSILFLALIVCQLAARRHSKEKENDPEEE
GUT_GENOME097725_00928 2368-2519 TPLSDGGNASSPTVNIGDNKVPLMSNGTTPSWALLNLILAFLTGIIMFILLFTYFFSKDKEKEDDDEQKATRLQRTEDESSDRRIKRRGLVRILSIVVTLAAVLVFIFTEDITLPMVITDQWTWVMAAISVAQVVLAVFARKGHKENDGEDK
GUT_GENOME110766_00676 211-328 DLEQLGDEAVPRGLRNTDAWAFANLILAVVTFLGTIILWIAYYIKKKKIEKSHEQEEIGTTEESNFYKSFRILSIVVGAASVVFFFLTEDVTLPIRMTDQYTWIMTVAFFVQLIDMLY
GUT_GENOME257519_00849 1003-1123 APTVNIDDDPVPLADGGAIWSLFNLIVMILGILGAIVGLVCMLRRHSEDDDEEEDPEAELSAAADDEETEENEKRTKTRKILRISGMAVSVVSLILFFILENMKLPMAFFDKWSWLFAVLG
GUT_GENOME089754_01038 394-516 GHKGSWALINLIASLLGLLAALFLIFAKHSKEDDDEEDQNQTVAMNEEDPEQLKRKRIYKWISGLTAVISVIVFVLTENMRLPMVLVDKWTLLMVVFFLINAVTLYLGRKWHEDEDDEEQTQA
GUT_GENOME001046_00777 1865-1992 ENEKPDQLATAVPEPSPETRPDISTPEETPKSEAAEKTTGALAIVNLVMMILTVLAAVLVYLKRKPKTAAGYRWAGLAIAGLSIAIFFITERFGSWTFVDSWTWLMLLPAVLSAGLLWSVRSNGRDRE
GUT_GENOME064925_00009 2889-3002 TPLASGAAWALLNLILMLCTALVSILLLIGFLGKKKKEDENENVEYTVKRKGVTRVLGLIPAIGSIIAFLLTENMRNPMVFVDRWTWLMVLIALVQVIVCIFARKEKEEPDDNA
GUT_GENOME122905_00799 392-518 LEDIEDPGTPTTGYTRAWALVNLICAILTVIFSVVLLIGLVGKKTKTEEAENGEEVEKEIKKKKFWRFFSIVPALAAVIVFLLTEDMRLPMVFVDRWTLLMVIIGLVQVLVMVFAKKTKKDPDEEEE
GUT_GENOME092621_00409 1381-1488 TDETQNIEDNQTPLAGKTDDSSWSIVDLILTLLSIVFMILTFIKKRETIIKAISGAAAVASTIVFILTQDLTAKMIMFDTWTILFVIAIIVQAASYILSRKEKEKDEE
GUT_GENOME147108_01616 965-1137 PAAEDTVTVPDNQTPLAGSEDGNNGDGGSAEQEPVTIDDNQTPLAGGIGQASWALLNLLLAIATGIIMIVLLAGYFIGRKKNEAEEGEGAQLHTARGEEGEEQGKLKRKGIVRLLSIIPAVAAIIAFILTENIWNPMVWTDKWTLLMAVIAVVQVVVAVFTKKSRKDKEDEEP
GUT_GENOME011000_02941 622-775 PEITVPAPTLPTPPDRMIVTGDTNQQTSLIGGEVSNNQNDTEQEEIAQITDQQTPKSHGAVEKQWAVLNLIAVMLSLILAIALLLSKRNKEDEDMRTRRRWTKVVAIIIAVVSIVIFVVTENIFLPMKLTDQYTIIMLGLSLIQIVMTIIGRKY
GUT_GENOME096399_00279 579-745 APVTPIAPIAPETPEVEIPDPETPEAQTPAVDVEIPDPETPQSVFPSWALLNLILAISTAVASLLLLVFYFIGKKKREEEREEGAPAREDEAEKEQLKRKGFWRVFSLLPAIGSIVAFLLTENIHNPMVFTDRWTLLMALIAAIQLVVTVLSIKRRKDSGEREDPDA
GUT_GENOME000603_00556 411-521 DSQDKTAPAAVSIQDSQVPLAAGISKESGWAFLNLIFAVLAALMSVYVLAGKKQSKGMFKAGSAFAAICAIMIFILTAEMNLPMILADPWTAPIGLLTLLEAVITRLAIKD
GUT_GENOME134587_00098 815-948 GDVEDVEDNATPKGNNGIWALINLIAAIVTVILGLILLLSKRHRNDEEEDEEERQARIERGEEKEQEQKRGWICKVLGVIVAIVSVVFFILTEDMSLPMALTDKWTIWMVVIAIVELVLVLVGRHWKDVDDEDQ
GUT_GENOME245840_00322 901-1033 TPEAEPEPSAAPAEEEIADDEVPLAAGAGGAWALLNLLLAVFTVLGSALLLITWATGKKRRELESETEQQRRRHGLMRLVSLVPAVASVAAFLLTENMRLPMVFADRWTILMAILAAAQLAVMFLARKNWEDQ
GUT_GENOME033115_00267 1087-1226 DNGGTVNIPDDKIPLAEGNSSWALINLLCTIFSVLSGLMLLIFARKKKEEYYAEEYYEAEYADADEYLQENKKRRFFTKILAAIDGVIALIVFILTEDMSLPMALIDKWTIFMVVFALISSVTLLMGYKNANDEAEEETV
GUT_GENOME000603_02322 748-861 VTIKDQPTPLAAGAGAHWALLNLILTIATALLSILLLILYVTRRKQEADEENDQPETEIKRRGILRLFSAVVAIGSIIVFVLTEDMSLPMQWTDKWTVVMAVLAVIQVLTVIFA
GUT_GENOME250159_00162 319-481 APVTPTAPDSTPESPATPVSPADTIQPATEEIIDDETPLAGPAGGAWALLNLILTAVTVALSAALLVGYLGKSKQAREDENGNTMYDENGNEVLEYIRKKRGFWRVVSLIPAIGAVIAFILTENMLLPMVIVDRWTLLMVAIALVQVVITALSKKRDEEQEEN
GUT_GENOME147776_01671 2178-2326 PVVEEPATPEVDTPEVEIQDPETPLAQSQSSWALLNLLLAIATALVSVLLIVGYILGRNHKKDEDENSQYARQDDEDADILKRKGIWRLLSLVPGIGSIIVFFLTEDMRNPMIFTDKWTLLMLIIAVVQLIISILCIKRRKDNDQKDDP
GUT_GENOME099523_00277 4963-5064 QEDSQQQAAQKTQENTIAIFNIVATILTIAMAIIAFMKKHQYAFVAIIDALVSTLLFVLIEGLQWTSFTFLNMWSILFAILLIVSFGTLFIVKKQKEDKTDE
GUT_GENOME058464_03277 1724-1858 GGESTEVQENPTPKGGGSQNEWALINLICAGGTVLLGLFLLLSKLKKEEEEEDEKTSAMRNDEEQSKGYKRRKWLRVASTITAVVSVIAFFLTEDITLPMVLVDKWTLLMAAFLIVQVIFVLFGRKWKELDDNKN
GUT_GENOME096561_00643 735-866 GQEGQDDVQIKDPQVPLASGNGSLGDAHWALLNLILMLATAVIMIVLLVGYFTGKKHKDKQTQQKGELKRRGVARLLSILPGVGAVIAFLLTENMRLSMQFTDEWTLLMAVIAIVQAVIIFFTFKKHKDGET