UHGP-MC 25843


Information


Number of sequences (UHGP-50):
150
Average sequence length:
147±17 aa
Average transmembrane regions:
0.03
Low complexity (%):
5.29
Coiled coils (%):
0
Disordered domains (%):
2.93

Pfam dominant architecture:
PF13884
Pfam % dominant architecture:
467
Pfam overlap:
0.17
Pfam overlap type:
extended

Downloads

Seeds:
MC25843.fasta
Seeds (0.60 cdhit):
MC25843_cdhit.fasta
MSA:
MC25843_msa.fasta
HMM model:
MC25843.hmm

Sequences list (filtered 60 P.I.)

Protein Range AA
GUT_GENOME144615_028331-138MWYREGTITFTQGSNTLVGAGTAWNVTANGVLPGMIVIGPDNKLYEIKRVISDTNIVLSEPYTGETQSEVPCRIITTYEGDLTQFSARFTALMSRMSADSKSMRSWLTALDEVTIEREDGTEVTVKPLMQIVNEHNEN
GUT_GENOME216800_031751-154MSAGTITVTNGSTTVTGSGTSFTTELKVGDFICVIAGGTPYTLIVASIASNTQLTIGGAFAGPTASGLAWNAVPASLLYAVTQQIMNDMGTALRGMNSQLVNWQKIYSDASSVTVERSDRTTFTGPSWGYMAQQYASKANTADVLAKADNLNGL
GUT_GENOME143367_037941-142MAKGTISIANGATAITGTGTPFTTELAAGDYIVFTAGQVVYTLAIKSVDSDTALTLTKPYTGPDADGLAWSAVPRSTMSQITMEVVNQVTEALRGLNHDKANWQQVFSERNNVTVTLPDGSEFKGPSWPFIVNLLEDLDPGR
GUT_GENOME147131_046044-144YEAGTVTSVAGTNVITGTGTLWNNPIFGIAAGQMIFVPGAGQVVIYEILSVDSDTKIRITRNIASPIDNSEYAIVTTVSNSMSDLARRTAVQLALYQKLLEDWQDITTGTGNVTIIAPDGSTVVIPSLSDLTAWVNDSKTW
GUT_GENOME213236_020191-172MIYSTGTIAISGNTLTGTGTNFTAAGSLIRNGCTVIAMTSPVQVFQITTIGSATSLTVTPAANPTVPAGTRFAILLSDSLSVDGLAQDIAETFTMYQRYMSGFADVMNGTTDVTITINGVAVTVPGQKSLAKKGANSDITSLSGLTTALSISQGGTGATNAADARTNLGLGS
GUT_GENOME141675_024721-151MYTAGTISGTKGSTKITGKSTQWLSNLYNILEGQVIIIYTSTSSNAYSIKNIKSNTEIELSRPLYDAVTNATYEIFTTIDNTTSDAAVKIATSIARNNEFMGLMNQWTTGTGTVTVNWGGEQVKLTTVDQMNKNIDGKFDKSGGVIEGDVS
GUT_GENOME096468_041911-113MAWYRAGTVSCTQNSTTVTGTGTAFAANARVGDAFMGPDGRWYEVVNVASDTVLSIQPAYLGATVAVGTYALAPMQGYVKDSADALRTLVNKFGALAANLADYIPDATAISGL
GUT_GENOME143526_028721-155MSAGNLTLTNKSAVVTGTGTTFTTELKVGDFIVFNVGGAPYTIPVKSITSNTQLILISNYTGPTQSGVAWFSVPQEAQSLISSALATQSAEALRGLNYDKMNWQQVFSSNDNITVTLPDGSTFTGPAWLKLVSMLNNGLAKDKNLSDLPDKSAAL
GUT_GENOME143385_038201-153MAWYSTGTVAVTENSPTVTGTGTQFSSNVRVGDAFIAPDGRLYEVSNVASSTVMSIKPNYRGSTASGQPYAVAPILGYDKELSDRFNLIANQWGGTLAGIQPWATAPTPAQARSSLELRSAAQADIGTTLGNAMPVGAFGIGSERPDRAPSIH
GUT_GENOME147137_021699-146GTVAVTSGSKKVTGTGTTFADAKNGVAKGHLFCMTTGATVDLYEVDYVVSNTELFLVQAFRGVTGTGKAYEVITTFSDSIPEFARKLNASLSYYQGQSDMVQQLFTSDAAEITVTAPDGTTHKLVPWKRVTSDGEGQA
GUT_GENOME147146_032316-137YRAGTVSVTNGSKKITGFGTLWKTTALKPDKGHPVHGPDGRIYELDYVESDTVMYIVTAYAGVTAAGQAYAIDIPRTSGIPAFSRDLAAFMGYHQTQMDGWQQLLTGTGDATLTAPDGTKLTLPTWDKVMNA
GUT_GENOME264766_030572-142IYEIGTISTKANDPKITGKNTLWKNSVTKVSVGQIILIQSGNTIYQNSIRSVESDTQMTLSFPVPVALTDVKYVISITMIGSVSDGVNKATAMVTESYQLMMILNRLMTESGVIKVVLPDGTEMQLRTEKERDRLLDGKFD
GUT_GENOME147143_005505-139WQRTGNVTVTNGSKTLTGFGTKWKTGTLPIQKGHTFYGPDNAAYEVDTVVSDENILLVDAYRGGTMANQPYRIDITRTSTISQFAADLASLVAKYRSWFDGMMTWLTGSGDVAILNPDTGANVTIPSWKKVASEG
GUT_GENOME224857_036901-169MSAGTLTLTNNSAAVSGSGTAFTTELAAGDFIVVTVGGIPYTLPVKTVNSNTSLTLVSNFTGPTQSGAAWSAVPRVVLNMVTAAMIVQNTEALRGLNYDKQNWQQVFSGTGNITVKLPDGSSYTGPAWGGITSALNDKANKTDLGSYAKKGANSDITSLSGLTTPLSLE
GUT_GENOME143489_028683-143WYRSGTVTAAAGQNVITGTGTQWANNVMGVAPGQALVVQRSDGNTLIYEILAVDSNTKIRINGNVVDALSASNYGIQTSVANSYSALARETSAQLALYQQLLQDWQQITTGTGNITIIAPDGTEVVIPSLAQLSQDISNKI
GUT_GENOME231920_023991-173MPAGTIALTNNSTAVTGSGTNFSSELKANDFLVAIVGGVTYTLGVQSVNSATSVTLTTAYNGPTTSGLAWTAIPNAALVGITAQVAADVARAIRGLNLDKANWQQIFNGTGNVTVTLPDGTSWTGPAWNGIATTVSGKLDKSQNLNDLTDKAAARTNLGFVDGKLPISLGGTG
GUT_GENOME111027_016321-95MAWYKSGTCSVTSGSPTVTGAGTAWVDNVRIGSDGFVGPDGLLYEISKVVSATEITLAAAYKGTTASSGGYAIAPIQGYTKELADKAAALIDQFS
GUT_GENOME231559_005098-149MSAGTIRLTNGSTAVVGTGTTFTSDLKSGDVITTTIGGVFYTLFVDTVTSNTAATLTDPFTGPTTTGAAWVAVPQLSLNRITAALATQTAEAVRRILQDNANWQAFYSGTGDITVTLPDGTPTGRQVSGPSWAKLKTDAGSA
GUT_GENOME019210_007351-82MAWYRVGTVNVTNNSNTVVGIGTAWVQAAAIGETFLGPDGGLYEITGINSNTSLSINPAYKGPTSTAQNYALMPTQGYLRDL
GUT_GENOME145986_022511-140MYSTGTVTTTANSTKLVGTGTKWLNNINRVSAEQAIQIQINSTVYNNSIQSIQSDTELTLNFPLPAAATGAKYVILTTMVHSVSDAMNKIVSMNGANVQFSDILTRWMTEQGIITVTLPDQTTQQLRTTKEMDKQLDGKF
GUT_GENOME211879_005413-148MYEVGTVTGAASQARVTGATTKWSQEALGIQPGSILVVYRSGSADLYAIKSVDSDTQLTLTRNITTAFSGASYGIITAETASTSSFANQLASAFAFWRSVVEGWSMALTGSGDITLTDPITGKQVTVPAIAGMAKASDLNALAKLT
GUT_GENOME146005_044841-173MPAGILTLTNNSAVVKGTGTAFNTELKSGDFIVSVVGGVTYTLPVKTVDSATQATLIKAYDGPTQAGAAWYAVPRDAMNTITAQLAAETAKALRGLNLDKANWQQVFSGTGNITVTLPDGSTYTGPAWNSFTAALENKAAKGVNNDITQLKALSTAITIAQGGTGAKDAATAR
GUT_GENOME124925_0128411-139YQTGTVSVTKNSAVVTGTGTVWNGTQTGATVVNAGDIFTVDDSRLYFIKSVDSATQITLDKPYAGTTGSGKSYRIIYLAAAHFPTDTATKVSRTVERYERIASTAESALQPENLIAGDDIAIVDSSGNK
GUT_GENOME096244_039823-173LYETGTITGALNATTISGTGTKWSDAKIGITNGSVLFVSSNAGIDGVYQVKRVISDTSIELAQPIFKAFNASKYSIMVAESASTAAWSNQLAATLGYYQAQMDGWQQIMTGTGDIALTAPDGTKVTIKSFTKLSNDLDKKANAGANSDITSISGLKTALSIEQGGTGASNA
GUT_GENOME244019_025513-134WYSTGTVSVVLNSDTVTGSGTAFSANARAGDAFRGPDGRWYEIGNVTSATVLTIKPAYQGATANGQAYSITPVQGYSKTLADQFRDLSNQWGSLLAAVKPWAIASTGSQAQADMGITEVGRAINGASTVGNA
GUT_GENOME207082_005201-170MLYNTGTIAINGNTATGTGTNWTAPASQIRVSQTIIVLSNPVQMFQITAINSGTSLTVTPAASPALSGQKYGILVTDSLSVDGLAQSMSQLINEYDENIGAWETFASTSANQLVTVTINGTSLSIPAIGGLARKGANSDITELKGLTTALSIAQGGTGAKTAAEARKNLE
GUT_GENOME232340_017771-140MWYKAGKINVVANSASVSATGTQWGDAKYGVMPGMMLLAPDNKLYEIKSVNSNASLTLNSSYSGETANGQPYAIITTYEGDISQFSARFSALLTYFQGSRSELIDVLTGTGDVKLTKEDGSLITVPSYSKLMADMQDKAL
GUT_GENOME147488_006474-153SWYRRGTVTVTDGSKEVVGVGTLWVDAGNKPLAGDIMFIAGSIYEVESITDNEHLSLFRPFTGVPPANGEYAIIRNTSVTIATRVAAMVAAAINSKQVLLDDLRGYYTSTADKVLIHTDDGSTIEVVPLQTLVSDITNLIAQSESIKNDL
GUT_GENOME282465_001923-115WYKTGTVTVTAGSTEVVGSGTTWGDGTITPGDSFSLVDSSGAAVAPFYEVASVTDDIHLTLTVPYGGASASGVNYALWNLAAEHTTPYLSSLVAALVQKFKSVSTDILNYVTR
GUT_GENOME096425_026742-147IYQAGTIATTAGQTKIKGTGTRWKDNLAGISEGCPISYLINNVVYMNTVLSVNSDTEINLTYPVPVAASAAKYQIATFVLDSMSDGVRKMLANQQYIQYFLRNMDAWMTQDGIVEVKTPTGETVRLESLVELKKLIDGKLSLTGGV
GUT_GENOME177579_032441-151MIYTTGTVSTVSGSTIVSGTGTKWTVNNPAIRSGTIILIKNGNANFIYMVDRVNSDTELVISQPATFTVKNTSYSINLTEPNSYSDANNRMTAIASDTTYFLRAMDQWMMNNGVVTVELSNGQKVTLDSIKKMQGDIANKFDKSGGEVTGA
GUT_GENOME171569_000843-147YTDGTIAIKAGSPIVTGTGTQWKKNIHGVAPGQLISIENGTAPVSMMIRAVNSDTELVLSFNAPVTLSGAKYSIATTVPDTISDAARTMSANQGYIVYFLQAMQQWMTDTGQVEIELPNGQKVTLDSIKALNDAISKIPKVVQEP