UHGP-MC 8803


Information


Number of sequences (UHGP-50): 54
Average sequence length: 228±19 aa
Average transmembrane regions: 0.1
Low complexity (%): 1.31
Coiled coils (%): 0
Disordered domains (%): 9.5

Pfam dominant architecture: PF18814
Pfam % dominant architecture: 370
Pfam overlap: 0.64
Pfam overlap type: reduced

Downloads

Seeds: MC8803.fasta
Seeds (0.60 cdhit): MC8803_cdhit.fasta
MSA: MC8803_msa.fasta
HMM model: MC8803.hmm

Sequence list (filtered at 60% identity)

Protein Range AA
GUT_GENOME013306_00111 1784-2018 YSDRDAGSAGPYAYDSLVSKPDMAVTRLSGDVPGNRADVIYQAKQNAAKIGRFNPKDGSVSVYVKDMDSDVIIGRDGLKHSIDRRLDVNAPVILKAGEILSNSIRINEMTPKLDNASNSYVLIGAAENGDGEIYVVRSVVNRFKNELVSMDVLYAINAKKEPAALLPLSTEKSALGTGSTISISELLDYVNDYFPDILPEEVLKHYGHESRPEGKLGESVLYSDRDPEAVKRNQI
GUT_GENOME198844_00889 529-768 EVKFSREKAKAEPKRYSYEWFIQKPDMVVTTIGDPVTMSRADIINEAKKNAVTVGKGDPNGTVTVHVRDTDMDVIIGRDGLKHSIDRRLKELGPIILKAGEILQNSIKINELLPKKADAVGSYVLIGAARNPNGDFYVVRSVVNRFQSKLVSMDVLYAINAKKEAAAQNVQTASEDAAALNAPIFTGNPLYVTAPTISISELLENVNAHFPDVLPEDVLKHFGHDRRPDGVLGESALYSR
GUT_GENOME263969_00026 705-927 RDYSYDNLVSKPDMKVTQIGTEAPSNRADIVFEAKRNASKIGKFNVKDGSVSVYVDDIGTDVILGTDGLKHGLRRSKNISTDINSLAILNAGEIIKHSVKINEIIPKDKNATSSYVLIGCAENSSSRFIVRSIINTFDNSLSKMDVLYAVKVKESAAQPAPGSAAKAVSSVTDSKISITDLLEIVNRHYPDILPESVLKHFGYSERPSGDLAVSALYSFRDTK
GUT_GENOME245503_01306 131-322 KNAAGRHYVYVNDIGEYASVNRKAFEHGLSRQANATAYITSNLGSVLENSIKINELEPRNETVLSSYILLGAARDENGTLYPVRFVVNEYPDGLSEITGMDVIHALYAAKAKKIEPAANAGATTSQKAGVPSGSVISIRDLLENIKGDELAASVLSNDVLAHLGIDRPESPFTGSLRYSYGEGSFTDTYREY
GUT_GENOME103072_00861 2008-2228 YQDRTYSYDELVKKPDIKVPVLSAIKTSQYNNRADILNAAMNNLKNDTEVFINGEIAVIYNSDDDKYIEVSKRRLRHGIARRANEASIFVTLNIGDAIKYGIRVNEATGERNNADGAFVLLGKLTNTQGEDYYYRLIINTTNEGEYEVNRLYAAKAKKKVLGGNAPTPAGIKPAKLNTFFKLKVSDFLNEVKDYYVDSLSEDVNNHFGRVRGKSDIEGLLY
GUT_GENOME239142_01814 1547-1772 KYSSRTGKKFSYQYFSQKQDVTITDIDADVGIDRKTIRNAGITSALNVGSRNKYGAVDVYVNDIGSNVVVGKDGISHGLRRENKGTPSENYIVAANAGPILKNSILINELTPKNENADGSYVLVGIAKDQYNTGYVVESIVNKFTNKLESMDVLYSMNAKKELAALNAPRATPKALPVTSSDYTVPSVLNLVKEHFPDILPEDVLKHYGYESRPDGKLGESVLYSA
GUT_GENOME104366_00163 1632-1875 PQRRKTGEEQVSFETLRELPDMEITEVEENDLADKPVKEILDAGLASTQRIPGKNSGSVVNRYTGDTIVVTRAGLRHGLLNAAQIQKNAPYVGQIGPILENAVKVNELEARGKEVRSDLYLGAARMSDGSLMGIRFVVNMYEDGQKVLDADSMEPLRGSLYAHTGRQIKNGTESLKGREVSTNAVALYGSNISIEDFLGSVKEELGENLSANVKYAFGMDTETTAGFGEGLRYSAPTEADYDDL
GUT_GENOME252152_00800 2230-2469 MQEPEKDSQSQQGKSYTYKALTQKSDMKVTLLDSHNIYDETGKMDRHLLLDKAIENVKSKNNPKNTAEKNFVYVPDIDRDVLVGRKGLSHGLSRNANITALVTTKIGDIVENSIKVNELKPRNNTAGGYVLLGIAKDNKNNYYPVRIIVNNYAVDNVEILDVLYAVNAKKKGQFSNETELPANAVPPIKDPSTISVADLIEAVKDNFSDVLTDDVLNNVGAKRRKSLLSESMVYKDVAND
GUT_GENOME117305_00391 89-340 TKASIDQYSYQGKSMTENSEIYSYDFLTHQKDMEITELPPLSEVKSDGKIDRDLVISLGLENAKKIGRKLDNTVVAVKNKYSHREIIISKPAIQHSMDGGNPARLRTNARIGAICGEVVANAIPVNALNNKNPEAIGTYAMVALLKSGDGNIAAIITVEQHTNKVERIDTIDLAHAFNGRLAKKEGSESSTWEPGYANSVLPSNTTFTLSIADVIQIVNTTHQSILSDDVLKALGEERNPEGYYAQRAKFSL
GUT_GENOME000870_02424 225-421 ITSVNKSKFRDINTVVQKAVDILTKYPGVEYRGKNPVITNLDTGDKVQVTKKSLKHGMRMGHSEATIFVSLNIGDDIKYGIKVNEAIGNRNNADSSYVLMGCMMDEKNNNYYYRIVVNRYDSNRIGDYYIDDMYAVKAKKEETFTAVMPTRVTANADASNISSNIKVSDFLNEVKEYYGDSLSEDVNNHLGRLREKS
GUT_GENOME012071_00435 2851-3098 SYNRLTALDDMSVTRIVSSIPKKEDGKIDREAIISSGLNNAFKVGRRVNSDTAVVKNKYSGEEIAISKRGLKHSLDRRSAVALATLNIGEIVQNALRVNEMSPRNEHVAVSYPMVGYASDANGNKYAVLLVVNRNNDSYPTIDGVSVYETLYSHNAKKTESGVHSTRSYDDNSPLFPNSTISIAELMNVVKDQFPNVLSDDVLSHYGTSKTESNYDGILYSFNGRNSGETEQAGQSEQTDKGKYIPKG
GUT_GENOME125440_01228 104-316 KSGNKFALSENETKYDYKSLIAKDVIVIASDVNAAVPYADGRINRKEIVKQALNNVSTNNENRFIYNKDLNKYVKITRKGIEHGLKRKGENNALVAILIKNYLPNAICVNELRAQENLTNSYVLLGAFQNNNKFYCVRIVVNENKNAYAVKDIDVLYAINAKEIEPVATKRRGLGVETYSTSTGSKISIADFLNNVKNYYSDVLSDNVISHLQ
GUT_GENOME245166_01603 779-1010 SNIDNIKYSQNKDSGITYTKLISKPDMPISMVTTTLKDITDKSRKEIVDAARKKLPKQNNNGQVTIHNKDTGMDIVIGKPSLEHGLGRNYEYTAMVSMHLEDYLRNAIKINEAVADGNRKHDSDILLGYGQTESGEKIPAYFVVSKLTTGQKELVEFGSLYSIKAKKIAEDSAQSSQAFQGPTSAKISISALLDIVNETYSDILPKTVAEHYGNERRKTKLGESVRFSLSAP
GUT_GENOME011747_01974 1139-1364 FSYEALTAKLDMPLTDINTSIPRDANGKVIRKDVADRALKAVRDSGNKKNTDVDSYVHVDDTNSDLKVSIESIRHGLTRKPDLNAIAALNIVGLAKNAVLINELSPREGAVGGNVYLSAGKDSDNNLYFARLVTDKKNLAISEIETVYAINAKKESVAHYRQGYENKIPFAFTDSTISIADLLDFVKNHFSDILPESVLQHYGISRKESAVSDSVMYSINAEKDSV
GUT_GENOME112664_01422 1588-1856 DYSYESLIAKPDMKVTEVDDTIRYTPNKESRKQIVDQAIKNAASVGHINENGNAVVHVDDIDTDVVVSKNSLIHGLDRRLNNFAPVTNKVGEILQRSICINELIPKKISAAKSYTLIGTAKNKKGDISVVLFIVNRFTNEILSIDVLYAINTKKEPASLKDTWLTDQTAAHTGSNTSITNPPDSVNTKKEPAALNAPRFTAKPLSVTGSTISITDLQDFVNTDSTIRIADLLDYVNRYFPDILPESVLRHYGYEARPAGKLGESALYAL
GUT_GENOME237494_00135 2129-2344 KKTGYSYEELISKPDMKVTVLGDTAPANRADVVALARKNAAKIGKFNAKDGSVSVYVKDAGADVILSKAGLIHSLDRRFSQNAPVTVNAGEILANSVVINELVPKNEKASACYVLIGVAGREGGSVQIVESVVNRFSNELLSMDVLYSINAKQKGTAALDAPPVFHPDYRSTISIAELLEFVNRYFPDILPEDALRHYGYTERPGGVLGESALFSD
GUT_GENOME105411_00191 798-1027 SKQARDYSYAALAAKPDMQVTTVDDVVRYDADSKMRKDIIARAIAAAKKVGSTNENGNAVIHVDDTDTDVILSAKGLRHGLDRRFSTNAPVTLKAGEILKNAVRINELTPKKDTVDASYVLIGAAKNAKNEPYIVQFVVNRASNEVTSVDVLYSIGTKTEPAGSLSPEVTGVPATLTGSIISISSLLDYVNRYFPDILPESVLKHYGYTERPSGELGESALFQQRGRSDR
GUT_GENOME202265_01034 433-670 RTFTYEEFMERYRDGVEVVRVRELNTFPSPADRRSLPSSALREARRNGNPKNTAGRAYIYFPDTGTNAYVDENSFRHGNATYDTQYALACMNATKIAQSALAVNTLEQDNNHRTDATVYLAIAEDKTSRFIVRFIVAAPSKKASMDVLYAVKKEGTFSPSADSYFTAPTSQGDAAKDSLRKANVPRRASSYKINIADFLQAVNGTAAGNAVLPPDVLRRLRTGRGNEPRVSPNLRFSE
GUT_GENOME123477_01256 1441-1613 YIYCKDIEKDIQVGKKGIAHALQRKWVESGIAMTKLGDLLENAILVNELEPREGVKRAYVLLGYGKDENGADYPTYFVVNEYPTGNVTVEDLAVLYSSKTQKMEPYANNASMIAGKNPLLTPGSTISIEELLDKVNAIYSDILSQDVADHFSNTRRQTSLSGNVRFSIREEDI
GUT_GENOME271236_00700 1679-1909 RGIINRYTYNKLAQKADMPVTKLAEDLPINSDGKINRADILAKAMNNVRDKNNLHNTENTSFIYVKDIDKNILVGKDALRHGLMRNSKDTAKVTTKIGDVLENAIKVNELKPRNNTLGGYVLMGIGIDDKGNYYPTRIIVNNYEVQKIEPLDVVYAVRTKKKNQSPNGAGFASKETPLSKGSSKISVSDFLDIVKDNFADTLPQDVLEKYNTKRPKSALSESVKYSIKESA
GUT_GENOME273851_01592 324-558 AKNGTIYTYDFLVKQKPIQIVTVADTAQTQGENNHVDVKAVVETGMENALENGRKYKDVCLVRNNYTGREISVSANTLRHGLGGNRNRVITNAKATEKIGEILKNSIPVNGLKNKADVDSTYAMAGILMDTKGNQFIAISTIEQKSDKVSDIAVYDIAHAMNARMKKGSTAGMFHPQGVTPSTASYAYKIQDFLRIVNSTHKSILSESVLAALGETRPENGTYSDQVLYSLDTEQ
GUT_GENOME189583_01064 2052-2282 STVLFQNRNYSYDELVKKPPMNIPVVKAKAISEINRKSIVEMAMNNIKSHKGIEFRGKNPVVTNIDTGDKIQVTTGGIRHGLAGRTNEAGIFVAMNLTEAIENGIKVNESSIGRKNADASYILMGAMDNEAKERYYYRLVVNRYESNNMGTYYVDDLYAVRAKKEETFTAVMPTRVTANADASNISSKLNVSDFLKAVKDYYGFELSKDVNEKLGLDRGKSDIEGLMYQKR
GUT_GENOME009864_00045 1895-2131 VRFSIKDFDNPEYYTYDNLISLPDMQIENSKMDIVSKDTSRADIRERAFENIQNSKYGRRDGNKCFIKNKYLGDVMITKDSIRHIIVKINENRMIAALNIAKYLPESIVINQTDGARKGANKSYVLLGMMEDGTNQYILRTVLNSYENEYEVSDANVVYSVASKKNQPAYFSQGLGNKSMPSTGSHALSIKDFLKIVKNYFPNELSKDIHEHFETNRKKSEIEGLRYSLKDEEGTSL
GUT_GENOME046771_01103 1140-1396 SEVYDYDFLTSLPDMRITRLPEVPMIRDAGGKVNSAVVITEGMKNARSVGGERDGKIYVQNAYTGRQLRIDTSSIRHGLNGGVNRLITNARLGAVIGDVVQNAVPVNALYNKAKDVMGTYAMAAYANDSRNREFVAIVTVEQRSGNISGLESYDVTHAVSGRQKNSSQADTKSQGVYPIKAAEISIADFLSIVNSTHQSILSDDVLQHFGETRNLNGDYSGQVKFSLREYSEDEKKQHIADAKKYFGKTYKWAETGY
GUT_GENOME190474_00769 673-934 YSYEALVAKPDMKVTVLDKDIPDSRAAVVAEAKKNAASVGRQNTDGSVSVFVEDIRRTVVVSTYSIRHSLDRRMSVNAPVMMKIGDILNKSIKINELNPREYTVNNTYVLIGMAKNRNNEPYVVLFLVNSYKNEINSVDVLYSANAKREPAALLPRLTDASAAFTDSVISIAELLEYVNRYFPDILPEDVLRHFGHSERPAGKLGGSALYSLDYGRAPVQSRPADGADFAKSAFSLPESGNEYAYYDAAPNYKGESGGLSGA
GUT_GENOME259522_01687 451-674 NEYQQFKGSSREYNYDYLASKDDMQLTEVVPKERYSRSDIVAEGMKNAAMFGYTNNSGNAVVYIKDTDTDVIVTEKSIRHGLDRRINNQAPVIEKLGEVLSNSVKVNELIPRTENTEKTFVYVGAAKDTDNRLYIATFIVNRNTNELESYDVLYAANAKKEPAALLPTSTDKSAMFTGSVISISDLLDKVNKLFPDILSEDVLRHFGRTERPQGKIGENVLYSS
GUT_GENOME023233_00068 2092-2335 YEGKSMTEDGEIYSYDFLTSLPDMNVLSMPTLDSVKTGNRIDREKAVNAGLINAKKAGRAISDTVASVKNRYTGREIRISKSGLEHSLDGGNIGRLRTNARLSSIGGDIVKNAVPINALHSTNRQATGTYAMAAVLNSGNMKVVAIVTVEQHKNNVVDIDYVDITHAINGRLGAKKEADGLPQGQQDTALRRLLTTSTSTIKVSDVLEIVNGTYQSILSENVLERLAEAQKSTAREDGVQYSLK
GUT_GENOME034912_00581 1517-1744 DIRFSRDVDYSYDELTKKPDMKITRIDDSVDYKANSISRKNIIERAISNAKKIGRVNENGNAVIYVNDIDTDIIVSKSAIRHSLDRRLGVNAPVVVNIGDILGNSIRINELIPRSEYIENSYALIGIAKNNNNEPYIVSFVVNKHTNEVQSIDVLYAVNAKKEVAALDEPEFRPINGTALTTSTISISNLLDYVNNYFPDILPESVLKHYGYNSRPEGNIGESALFSR
GUT_GENOME261105_02096 118-352 NTYAQANNGGAEAVSYEALTALPDMQLTQISTALPEGMSIGDVARLGVENSRLPGMKNGRVTNRYTGQEIQITADGLRHGMTRKTRLQKNGPYAIQAGEILKNAVKVNELSSRGNEVRTDIYLGGAIDESGIVHGVRFLVKLYENGNQAADLGDMEIYNGSLYAHTGTKIGTAALRAPEVSNQVAAPTVPTLTIADLLGTVKGKMDTFLSENVKENIGTAQGKNGEFGDSQLYSV
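
A row of the sequence list can be split into its Protein, Range and AA columns programmatically. The following is a minimal Python sketch, not part of the UHGP-MC resource. It assumes that every protein identifier has the form GUT_GENOME, six digits, an underscore and five digits (the pattern seen in the rows above), and that the range is written as start-end. The names `parse_row` and `Row` and the short sample row are hypothetical and used only for illustration.

```python
import re
from typing import NamedTuple, Optional


class Row(NamedTuple):
    protein: str   # e.g. GUT_GENOME000001_00001
    start: int     # first residue of the range
    end: int       # last residue of the range
    sequence: str  # amino-acid sequence of the listed region


# Assumption: identifiers are GUT_GENOME + 6 digits + "_" + 5 digits.
# Whitespace between columns is optional, so rows whose columns
# run together are parsed as well.
ROW_RE = re.compile(r"^(GUT_GENOME\d{6}_\d{5})\s*(\d+)-(\d+)\s*([A-Z]+)$")


def parse_row(line: str) -> Optional[Row]:
    """Split one listing row into its columns; return None if it does not match."""
    match = ROW_RE.match(line.strip())
    if match is None:
        return None
    protein, start, end, sequence = match.groups()
    return Row(protein, int(start), int(end), sequence)


if __name__ == "__main__":
    # Hypothetical short row, not taken from the listing.
    row = parse_row("GUT_GENOME000001_00001 10-14 ACDEF")
    print(row)
    # Length implied by the range versus length of the sequence string.
    print(row.end - row.start + 1, len(row.sequence))
```

Comparing the length implied by the range with the length of the sequence string is a quick consistency check when loading the list into another tool.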