UHGP-MC 19206
Information
- Number of sequences (UHGP-50): 80
- Average sequence length: 72±8 aa
- Average transmembrane regions: 0
- Low complexity (%): 0.69
- Coiled coils (%): 0
- Disordered domains (%): 0.09
- Pfam dominant architecture: PF08241
- Pfam % dominant architecture: 125
- Pfam overlap: 0.14
- Pfam overlap type: shifted
Downloads
- Seeds: MC19206.fasta
- Seeds (0.60 cdhit): MC19206_cdhit.fasta
- MSA: MC19206_msa.fasta
- HMM model: MC19206.hmm
Sequence list (filtered at 60% sequence identity)
Protein | Range | AA |
---|---|---|
GUT_GENOME243831_00157 | 182-257 | QSFFREGRYEEKIFANDLHLNLEEFLGRYLSASYAPSKEDSRYFEFTKALTGLFGSYQKGGIITLPNKTRCYIGEV |
GUT_GENOME114591_00795 | 175-250 | VSNFFETKYEKKVFQNDLKFTKEKFIQRLKSASYMIPEDSPNYEAYISALENLFDKYQKNGFLTMPNETKSYAGEI |
GUT_GENOME168217_00650 | 188-250 | FRNDVFYDRDGFIGRNLSASYAPREGEKSYGALTDALSEIFDAFSSDGRLLLPHITAVYWGRL |
GUT_GENOME000279_01334 | 191-249 | DQIYTAEQFIARGLSSSYAPAPGEDGYEGYRKGLQRIFDRFEEGGRVTMRNRTAIWCGT |
GUT_GENOME268448_00122 | 178-257 | PIKEKIHLLLPVCEERSYPNEYSLTKEQYLANCLSKSWTLTPQDNEFQNYMEALEDVFEKFASKNKIPRPMQTILYIGKL |
GUT_GENOME096273_01313 | 198-274 | ELAQFYGGSHYQRLEYDNDELITWDTFRERHLSLSFAPKEGDPAREPLVEDLQRIFDRYAVDGLYRFPNSTHVLVGD |
GUT_GENOME009818_01280 | 4-79 | IQSVFRDDCEVVRFPNDAVYDRERFFERMDSSSYASKASDSSRDDFRKAVDELFDRFGIDGKIVFPSYNVAFIGKC |
GUT_GENOME268089_00437 | 177-254 | KFFKEVYLGDCKVFTYDNPFLMTKEQFIGDTLSRSHAPKSSDGNYREFLSELEKAFDEFVTDGHVTVLYQTVCYLGTL |
GUT_GENOME179657_00042 | 172-241 | DFFKGEYTHLSFENPLSYTEEMFLRRCLSSSYAPKKGKVGYASYASALKELFKRHEKDGTADYSYTTNVY |
GUT_GENOME235215_01421 | 185-263 | ERIGHFFNGRFEKAVCPNDLCYTRDLFIARQCSSSYAPKPQDPAYEPFTRALGDCFDRFAADGFLTVPNVTECCWGRPK |
GUT_GENOME010010_00280 | 173-247 | VTGFFQAAPQIHRLQNPQTRSREQFFHWCLSSSYSPRPGEPVYNDFLRCVGDLFDQYQCAGQIILPSQTLLYIGD |
GUT_GENOME127143_01840 | 175-249 | RSFFCGDYQLISLDYPQLYNREAFVLRSLSSSFAPAPQEDGYAAYRAALEKLFDAYCQNGTVAYPYITRCYAGRL |
GUT_GENOME112069_01150 | 183-254 | FLDTFERAEFPNDLALDRDTFVFKSLSSSAAPAAGDERYAAFTDELEQVFDRHQKNGVLTYPNKTVLYFGEI |
GUT_GENOME018196_01355 | 169-256 | NEAELPAFFDGTCDVRRWDNPQLLTREAFVARALSSSRAPREGDASYPRYLEELDALFDRFAEERETEDGGTRKLLSFSLRSTLHVGR |
GUT_GENOME239386_01595 | 189-248 | NDLIYDEKAFVSRCLSSSYAPKPGEEKYDGYVAELQELFKKFSKNDTVPYPYITRCYIGR |
GUT_GENOME131740_00692 | 6-82 | QVASFFCGACPSCRTWANDLHYDWDAFLGRMLSSSYAPQKQSPAYAPLVAALRQIFQEESENGILRFPMQTISFLGM |
GUT_GENOME139120_00534 | 169-249 | NDVEQIKRFFNGKLEYLEFYNSLDYTKEKYVERCLSSSYSLNENDENYSLYIKEIEKIFDEFSKDGIYKMPNISALYIGKI |
GUT_GENOME127691_00948 | 190-247 | VMMTREEFIGNSLSKSYSLVETDPNYKAFVTELRELFDKYHRYGKVAASYTTNCYAGK |
GUT_GENOME083723_00803 | 193-273 | LKNYDGFGKFFEGEYKLLRFENNLQFDREKFVGRSLSASYSLTANDEKFDEYVKELNELFDKYEKNGIATVPNYTEVYIGE |
GUT_GENOME096505_02163 | 168-250 | KNISQTKLLSFFKEGTMQEVRFRMSQEFDFEGLKGRLLSSSYSPVPGHPNYDPMMTELGKLFDRNNQDGVVPFDYETEIYWGE |
GUT_GENOME121695_00683 | 172-247 | LSSFFGGEYQTLSFPNPSFYSLEQFIARNLSSSFAPFPEDTGFSAYVAALTALFEQHCQDGILRYPYTTHCYFGKL |
GUT_GENOME096445_00570 | 192-253 | PHDLVYTNEQFLSRCISSSFSPNPGDAGYPAYCDELKELISRFSADGKILVHNQTAVWFGRI |
GUT_GENOME207750_03182 | 190-251 | SNPLKMSKEYFIGRNLSASYAPKKDDFLYDEFVEKLGYLFEKYSKDGTILVPNDLHVYVGVI |
GUT_GENOME219382_01310 | 184-249 | EIYKFDNPIARNKEMFLQMALSSSYALRPEDAEYDNFVRELSEYFDRRNKNGYIISKNVSVLYVGQ |
GUT_GENOME028044_00984 | 131-196 | EKKTYDNDLMFTKEKFIQRCLSSSYALCQGDADYNEYLSAPTRIFDDFSVDEHLCMKNKTITYIGR |
GUT_GENOME018922_01083 | 33-108 | RIGAFFTDGYTRKEYPNDITYTKESFIDRSLSASYSLRPQDELFQEYKAELEKLFDSYADDGVAVMSNKTVVYSGK |
GUT_GENOME259921_01092 | 194-252 | NNLAYDLDTFIMRNFSSSYAPTGGAEKAGYEKALKELFESHAESGMVQYPYITECFYGK |
GUT_GENOME058782_01456 | 189-252 | FDHPLSYTEETFIRRSLSGSYSLKEGDVGFEEYLEALKNLYRKYNRDGILTMGNHTAVYYGRLQ |
GUT_GENOME245503_02283 | 187-251 | QVFENPVIYDCQGLIGRYLSSSFAPKLGDVHYLDYIAAFGALFDRYSVQGRLCFPYKTRCYLGSV |
GUT_GENOME235447_01282 | 166-251 | IHSFSFEKDSFNNFFKEQPRFYKFIDDGLNYLTEEEYIGRALSASYSLTNKDSKYNLYISDLRKVFHKYAQGNLVYFPLSTIVYIG |
GUT_GENOME096494_03359 | 74-147 | AFFKHGHYSVRSYDSPLVYDRNHFIGRQLSASYAPKKGDSSYEPFIQAFSALFDQYENEGTITVPNVTSVYLGS |
GUT_GENOME073766_00186 | 200-282 | DMGECEADFRNVCEVMSFDHPLYFTKEKFIKRSRSGSYSLKETDKGYKEYILELEKAFDRYAVSDILKMPNHTMVYAGPLQKH |
GUT_GENOME057822_01064 | 188-261 | NFFRGGGYEEKQFRHDLAMDETGFVGRNLSASYAPRPGDPSYGEFVAAVRRLFADHARNGRIVMPNLTRSYLGY |
GUT_GENOME095952_00509 | 173-249 | KIHTFFGHENYKVLQFDNPLFYSLETFIGRNLSASYAPTENDSSYPAFVDALIKLFRRHETSGKLCLANQTVVYLGT |
GUT_GENOME165449_00976 | 463-540 | EETAAFFPGGAEEISVPNDMTYDRAGFVRRCLSASYAPAEGEPGYEALSGALGTLFDHFAEGGRLTLAARATLWLGRP |
GUT_GENOME256340_01424 | 174-249 | KEVLSFFHSHEHTLTFSNDQFVSMEKFLHWCLSSSYSPQKDDPCYLYFLDECQKLFVRYCVSDKILLPSNTIVHIG |
GUT_GENOME104878_00370 | 177-242 | RTFFRDGKYEMKKFANDLYFDKKGFIGRNLSASYALKEGDENYEAFIIALKALFNKYQEDNKILLE |
GUT_GENOME096523_01369 | 172-247 | DCRQFFTGNCDTFQTENDQEYTRQGFVNRTLSSSYSLKAGDKNYEQFVAAVGKLFDQLAQDGKLLVPMKNKLVDGL |
GUT_GENOME074527_00344 | 174-248 | IRHFFHDHYTRITYVHPIIFTQETFIQRCLSASYTLKPQDPHYDEFLSALKQVWDDHAQDGKLIQGNQSVLYAGT |
GUT_GENOME260281_00811 | 186-247 | FDNRLFYDKQTFLQRILSSSFALTGKDENFAPYIAEFEELFDTKAQAGRLIMPYQTICFIGN |
GUT_GENOME110634_00101 | 254-318 | YPNDLIFTKEQFIGNRISRSYAPKSGDPEFEFDNYCTALEEFFEQHQQNGVILIPNNTVCFIGMV |
GUT_GENOME236157_01159 | 173-249 | SVSRFFDGNFDLLTFDNGFLYDENIFLSRSLSSSYAPRPEDELFEPYVAALREVFRRNSIDGKVLYRYITRCYIGNL |
GUT_GENOME126104_00732 | 351-415 | LRLENERFVTKEQFIGDNLSRSYAPRRDDSGYDGFVCELENAFSKHEKSGKVSQNYITECYLGSF |
GUT_GENOME096270_00079 | 183-244 | FRNDLQHNLDEFLGRYLSASYSPKVTDMEYTPFITALTKLFEKYSNKDTIIIPNRTRSYLGK |
GUT_GENOME284947_01757 | 175-246 | YLPNCTYYEFDNSEVRDKESFIGEVLSRSYAPCEGTELYDDFKAALSELFDRYSVNGVVTIPRIACCYLYKN |
GUT_GENOME109331_03421 | 173-263 | FSGSSRGMRGAISADDYKNFFTGNYITKSFSNPLIFNLENFLGLHQSASYCPTRSEENYSKYMESLTNFFESHCRNGLLTLENNTHCYIGY |
GUT_GENOME216297_01262 | 195-258 | RFPHDLSYTLDGFVGRNLSSSYAPRPDDAGYQPFVEALQALFHRYEQDGRLRQPNVTCCYIGRV |
GUT_GENOME096273_01312 | 196-259 | FAHDQMLTVDEFLGRNLSATYAPREGDPAYAPLVADLRRLFDEHREVDEVRFPLTTRLFLGGVG |
GUT_GENOME224740_01934 | 184-250 | HSKTFENPLIYDLDAFIGRHLSSSFALKPDDAEYKEFIQTLEEVFTRYETDKTVEYPYLTRCYWGKV |
GUT_GENOME276546_01562 | 187-259 | FFDNFETKIFRNDLHFDKQAFIGNRLSRSYSLKEGDEHFDEYKAELAEMFERYAENGVITIPNVTECYLGEVI |
GUT_GENOME095251_00023 | 190-256 | MARLPNVQRMDFDGLRGRLLSSSYAPQAGHPRHAPMLDALQQLFDAHAVDGQIAFEYQTRAFVGTLD |
GUT_GENOME185607_03152 | 173-249 | EQFSNFFANGCEERRYQNPVKFDEQRFIGYELSTSYAPRFNDKNYDDFIHSLKDIFAKYSEHDILEIPFITKSYIGT |
GUT_GENOME142025_00582 | 168-244 | NLDDRVKAFFQERYDYHKAANIIPINENEMIGNLLSLSYAPPSATTEQDLFIQQARNIFARHQSDGKVLFDLTTHLY |
GUT_GENOME104808_00149 | 178-249 | FFKGACEVKSWNNDQLMTKKSFIERVLSSSYSPREGDANYERFLEGIEQLFLQYAQDDRVRYPHNTIVYAGK |
GUT_GENOME007685_01196 | 186-255 | QYHYFQNDNTEYLDSETFINRTLSASYAIQKDNPCFSDFTDELYSVFEEFSVNHKVKMELSTVIYSGHLR |
GUT_GENOME006582_00102 | 171-247 | NNVESFFGEKCEVFTFKNNLCYNRDTFVKRMLSSSYAPREGDKNYQSFINALYALFDVLADHDTLLMPNHTALYIGN |
GUT_GENOME007925_00203 | 183-250 | CRIREFENPQYFDRESFIGRNLSSSYAPVAGSEGRDAFTADLAELFDRMQDGNGKILIRHVTRCYFGM |
GUT_GENOME111239_01171 | 178-251 | AFFRGGRYEELEFPHDLPLDLDAFIGRNLSASYAPLRGSEQYPLFIEALVELFRTYEREGRVLMPNVTRCYAGQ |
GUT_GENOME229516_01910 | 177-252 | EAVGNFFKTYEEFCFENVLKRTKEQYIAACLSVFCALRKGEEGFDAFVHELERFFDRRSDGGILVSQARSVLYAGY |
GUT_GENOME015871_02091 | 187-266 | HLDKIYYCIPNAQTSIYPYNLTYNRKRFCQRWLSSSFSPNKDMDYYKAFCCDINSIFNKYADNGLVTVENQTLVYVGSPF |
GUT_GENOME249702_01317 | 185-259 | SEERRVGNEKRRFPNPLTLDRDQFLRRCFSSSYALREGDADYEAFRAALEALFDTFASGGQLIQPNETVAYVGIP |
GUT_GENOME057820_00725 | 180-252 | FRGRCDTVSFPNDAVFSSAGEFVGRSMSSSYAPQPGEGAYEAFRDALTDFFVRRSSGGKLTIHTNATAFYGEV |
GUT_GENOME044179_00107 | 184-245 | FSGERVFDRDSFIGETLSRSYAPHAGEPNYEPLIDELNVLFSEFESENKAAVKTNTVCYLGS |
GUT_GENOME210309_01476 | 194-252 | LYFDMQGFVGRNLSASYAPKPTDAAYNDYVAALQAVFGKYAENNRLCMPNFTQCYIGQV |
GUT_GENOME207955_01117 | 176-248 | AFFEGEYRLDRFENRQLFDLGGVLGRLNSSSYAPAVGTPAHEQMTALIRREFGRNAVDGRVSFNYCTLVYSGR |
GUT_GENOME222313_00101 | 1-57 | MGRDGFTGRALSSSFAPRQGDGTYEEYIAAFASLFDKYAENGRLLYPYITRLFYGRV |
GUT_GENOME062907_01086 | 188-249 | FPNPLFFDQKSFIKRCISGSYSLKKTDQNYFEYIAALENVFDKYSNNGQLVMENKTVVYIGR |
GUT_GENOME108999_02205 | 185-254 | NLHTKSVENDLFFTWEGFLQRALSSSYAPDQSDGNYLPLVEDLRSLFDRYAEDGQLRMPNYTAFYWGVLE |
GUT_GENOME123310_02006 | 170-249 | DEKCIAFFAGKCVVFRTDNTQIYDRQGYINRVLSSSYSLKAEDDRYAAYLKEINGIFDRFSADGRIAVPTETVAYIARFD |
GUT_GENOME238933_02178 | 169-249 | MHDDSRIQTFFGGSYDYVSFANPLYYTVDTFLQRHLFNSYSLRPGDAHYDEFIKALHHLFTKYAIDDTVTMPCQTVAYIGS |
GUT_GENOME111115_01711 | 175-253 | ESLPRLSRFFASQEARAFDADAVYDRATFVRRSLSSSYALRPEDARYADFVAALGALFDAWAEDGRVCVRTRAVCYLGA |
GUT_GENOME222296_00767 | 183-251 | HEERFPFPLTYTREAFVARSLSSSYAPPKDDPRAKAYSRALLDLLDTYFPDRESFTLPNDTVLFWGRLT |
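Each row of the sequence list gives a protein identifier, a residue range, and the matching amino-acid fragment. The sketch below shows one way to parse such rows and check that the range agrees with the fragment length. It assumes the ranges are 1-based and inclusive, which is consistent with the rows shown here but is not stated on the page. The helper names `parse_row` and `range_matches_sequence` are hypothetical, chosen only for this illustration.

```python
def parse_row(row: str) -> tuple[str, int, int, str]:
    """Split one pipe-delimited row into (protein_id, start, end, sequence)."""
    # Drop the trailing pipe, then split the three cells and trim whitespace.
    cells = [cell.strip() for cell in row.strip().strip("|").split("|")]
    protein_id, span, sequence = cells
    start, end = (int(part) for part in span.split("-"))
    return protein_id, start, end, sequence


def range_matches_sequence(row: str) -> bool:
    """True if the range (assumed 1-based, inclusive) spans len(sequence) residues."""
    _, start, end, sequence = parse_row(row)
    return end - start + 1 == len(sequence)


# Two rows copied from the list above.
rows = [
    "GUT_GENOME243831_00157 | 182-257 | QSFFREGRYEEKIFANDLHLNLEEFLGRYLSASYAPSKEDSRYFEFTKALTGLFGSYQKGGIITLPNKTRCYIGEV |",
    "GUT_GENOME168217_00650 | 188-250 | FRNDVFYDRDGFIGRNLSASYAPREGEKSYGALTDALSEIFDAFSSDGRLLLPHITAVYWGRL |",
]

for row in rows:
    protein_id, start, end, sequence = parse_row(row)
    print(protein_id, start, end, len(sequence), range_matches_sequence(row))
```

A check like this can flag rows where the fragment was truncated or the range was mis-transcribed before the sequences are used downstream.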