UHGP-MC 38459
Information
- Number of sequences (UHGP-50): 68
- Average sequence length: 72±5 aa
- Average transmembrane regions: 0.03
- Low complexity (%): 3.89
- Coiled coils (%): 0
- Disordered domains (%): 2.91
- Pfam dominant architecture: PF00232
- Pfam % dominant architecture: 147
- Pfam overlap: 0.19
- Pfam overlap type: reduced
Downloads
- Seeds: MC38459.fasta
- Seeds (0.60 cdhit): MC38459_cdhit.fasta
- MSA: MC38459_msa.fasta
- HMM model: MC38459.hmm
Sequence list (filtered at 60% identity)
| Protein | Range | AA |
|---|---|---|
| GUT_GENOME035939_02350 | 86-160 | MAQSYQWFNPVFLTFFKNIFIEDKPLFVRLFFHSCWINPRPVDRCSENSESHLRKKGNILFVMMIKINCLMTWIK |
| GUT_GENOME079504_00422 | 1-77 | MRDRDKWLNVVALEFSNQRRVKGKPFFAWAVACRTGWVNPRPSDGEAVDFKAHFRKQRNVFLITMKMINGFMTGVKR |
| GUT_GENOME068385_00684 | 83-143 | IFQKLVYKVIVEVDAPLIYLTRSVRKYPAPCNGKSICFQSKLRHKADIFFISVIMVTGNIT |
| GUT_GENOME167827_01969 | 78-152 | MNDRDQRFYMMTKTFVDHRVVIGERLGIGLRIVQMRKNPGQCNGEAENLKSHFRKQSDILFICMIKINSPAFRVV |
| GUT_GENOME217024_01907 | 246-312 | MKERDEWADAVFEQSIDELIVEGDAFFIDPACAVGEKARPREGKTVILDAELRHQLNIVAEAVVMVA |
| GUT_GENOME176629_04133 | 276-356 | DNRLDAIFKQFVEQVIVKLQARLVRRLLVAIGEDARPGDRGTKTLKAHLGKQRNILFKARVKLHALMVGIVVARLHAVGDN |
| GUT_GENOME223640_01162 | 186-259 | DGGKNLHAMLFALAEQIFIVADAFGVRLSVIAVGEQPGPADRGAKDFHAHFCEQRKVLLVGVIEVDAAAIGIIR |
| GUT_GENOME002834_01812 | 152-222 | MEHCYKRLDSVLEALIYHIVVEVNSLLVDSAGSLWEDSAPRDGESECLEAHLSHEGYILCISVVKISAHVQ |
| GUT_GENOME185019_01142 | 208-283 | VRERNKRLDAVSAQFAEHVFVIRDAFLIGLGIARVGVDAAPANRNAQDLVAHLGKRSDVLFVMMVEIDSAALRIIC |
| GUT_GENOME043642_02185 | 135-213 | MTQRDHRLHAAPAHLREQLVIIPQTGFVGLDFVAVGKNPRPGNTRAECLEAHPFHQGQILVEAMIKIDRLMTEIYLVGF |
| GUT_GENOME193330_01362 | 276-349 | MGQGDQRLNAVLVQLVEHCIVELQTLFVGDGIITVGEDAAPADAHAEHLEAHLGEQLDIILIGMIEIDTAALGQ |
| GUT_GENOME031701_00053 | 164-236 | MAKRHHWLHAIFMALVKDAIVKCKAFLVRILIITVRENAAPANGHTENFEAHFRYLGKLLLVSMIEINAATLG |
| GUT_GENOME158817_02004 | 36-105 | MRHRDERFHIPFAAFTDHAVVISQRFRVRFTVADMREDAGNRDREAEYLKSHLRHQCDILGIGVIEIDAP |
| GUT_GENOME243286_00486 | 145-219 | VRERDDRLHAVGMHRVEEIVVELQALFIRLRLVAVREDARPGNARAEALEAHLSEQGNVLFVVMVEINGIVVRID |
| GUT_GENOME242744_01230 | 1-70 | MRQRYDRFDPQALHFLKDIAVKSDPFLIGCRFFPGWEQTGPRNRQSQHFKAHFRKQSQIFWIAVIEIDSS |
| GUT_GENOME218411_00237 | 90-163 | KRHERFHAVLMAKIKNLVVKCKPFLIRFRIITVRENARPIERKSEALEAHLGHQFDVFFIVMIEIGCRMTRIIE |
| GUT_GENOME002833_01865 | 78-146 | VQHGNKYFYMMLPALCENIFVIFDSGQIRLFIVAIGENPGPADRKPVNLHSHFCKEGDIFFISVVFVTP |
| GUT_GENOME207755_00619 | 1-68 | MSYSYKWLNIIFMTFIYQIFIKLKSSFIRLTLISIRKNPCPSYRKSINFKTHFCKKSYIFLISVIHIY |
| GUT_GENOME231501_00268 | 78-146 | MGNRHQWLNTILFALIQYIIIEVKFFFIWLFVISIWENSGPCNRKSKHLKSHLCKHGNIFFVTMVKIDC |
| GUT_GENOME090892_00311 | 88-161 | DRDQRLHSMLFQFGKNAIVEAESCFIGFQFIASREDPAPADGEPINIHAHACQQCNIFFVTVVMVNRFVAGIVT |
| GUT_GENOME167048_00747 | 36-103 | MRKRYQRLNSIFMQFIKNILIKSQSCFIGHSVISIREDTCPTDAHPKAFKAHFSHQCNIFLIVMIEIR |
| GUT_GENOME039845_02054 | 266-333 | DHRLNAVLQQLIKQVVVKLQPRLVGRSLIPAREDARPGNRGAKAFEAHLGEERHILFVVMVKVDRLVA |
| GUT_GENOME015840_03376 | 108-183 | EGNHRLNIMRQQFIDQIAIKLHACFINLAAPGGQQARPGDGKAIALQAHLGHQRHVFAKTVIVIDGDIAGGPFKGG |
| GUT_GENOME023347_00237 | 164-237 | VAQRHHWLHTELVAFIKHAVIKSQTDLVWLSIVTVGENAAPADRHAENLKSHTRNLGQLLLIAMIKIDSPTLGK |
| GUT_GENOME236035_00740 | 265-336 | VRDCDHRFNAPAVQFPEHILVVGQTLLVGLGIVAVRENATPCNGGAQALHAHLAEHGNVFFVVMVEVDGFMG |
| GUT_GENOME027520_00422 | 36-113 | MRNGHDRLNPVFQHFIKECIVIFQSFFIRQILIPVWENAGPVEGCTKAAESHFLHQCKIFLIGMIEINAVPSWIIAIL |
| GUT_GENOME064333_00650 | 208-276 | VRHRNQRFNVVFLEFVEHLVVELQAGLVGLFVIAVRIDAAPGDRHAVDLKAHLGHQRDVFFIMMVEIAA |
| GUT_GENOME223346_01546 | 170-244 | VTDGHQRLNAVFVTRPEHLLIEPQSRLVWLLFLPSGKYPRPCNAHAVDFKAHLAEEAYVLPEMVKHINGLVRGIP |
| GUT_GENOME231694_03165 | 202-270 | VRQRDQRLNAVLRHLVEQPIVERQPGFVRRLFVALRENARPGDRGAQALEAHFGEQRDILRVTMIEIDR |
| GUT_GENOME036666_01227 | 97-164 | VAQGDDRFNAVLVHFVKKVIVELQACFIRFSVVTIREDSGPGNGGSENLEAHPGQEFDVFFIVVVEID |
| GUT_GENOME070071_01775 | 74-140 | MVQRRVGRYPVFKKFVDHPIVKIEPFSVYSARTVRKYARPRKRKTERVDSKTVRQRNIAFIPMIKIA |
| GUT_GENOME081757_00918 | 278-360 | MGQGDDRLDALGQHVVEQIVVECQTLFVGLGFIALRENARPGNRGAQAAETHLGKQIDVFLVVVIEVDGVMVGVVFFRVDAGG |
| GUT_GENOME158329_01682 | 257-320 | VADGHQRFDAVLVALVEEGIVEGEALLVGFGVVAVRQDAGPRDRQPVAPEAHLGEESDVLLEMV |
| GUT_GENOME136783_01767 | 92-164 | MRNRHQRLNLIFPAFLEDLFVELKSLSVRLLLVALWKNPGPCDTQPIDFKSHLSKQFYIFPKMVVHIDGLRGR |
| GUT_GENOME177916_01143 | 267-334 | DQRLNAVFFQLCKYLIVKLQTRFVGLRLGSFWEDARPVDGHPIHLEAHFCKQRNVLFIMVIEIGAAVA |
| GUT_GENOME092680_01443 | 258-325 | HRDQRLDAIFVQLTENIVVELYTLSQGLFLSSSGEKAGPADTHAEGLEAHLAKERDVFFVVVIEIDTS |
| GUT_GENOME061636_02322 | 321-394 | DHRFDVVFQQFIDHIIVELQPFFVWLSFIAFWKNARPGDGSTKTFEAHFGKKLDVLLVVTIKINGFMVWIVFAR |
| GUT_GENOME087009_01578 | 1-72 | MRHRDHRLHPIRQNFVNQAVVKLQPFFIRLGVIALRKNPRPVDRRPKRRQPQSVLSQKFQVFFVSMIKVDSA |
| GUT_GENOME119122_00406 | 191-261 | DQRLNPVFQQLVKQIVIKLQPFLVGRLVIAVGKYSGPVDGGAEHLKSHLRKHGHVFLISMVEINGFFLGVI |
| GUT_GENOME160349_01071 | 72-160 | VADGNERLDVMGQQLVDHVLIELEAGLVGLGFVSCGEDAAPGDGKTEHAKAHLGHERDVLAPVMVKIDGLMAGIVLVVVEAGHSSLGEG |
| GUT_GENOME169846_00779 | 36-111 | MRQRDQRFDAIFQTFIDHPVIERQSYFVGFGLIPARTNPAPRDGETKYLEAHFRHQRDIFPIAVIKIRADQFQIIR |
| GUT_GENOME123768_02323 | 1-75 | MRQRNDWLNAVFQAFVEQIIIKLQASLVGLQLIALRENTRPGNRGAKTFEAHLGKQRQVLWITMIKIDRLMVRVV |
| GUT_GENOME157767_01204 | 1-67 | MKHRDDRLHAGVNHLADHPVVKLQALFIRHPLIPAGKNPRPCDRKPEAVHAHFLHQRNVFLIAVIEA |
| GUT_GENOME237516_01018 | 92-162 | MIQRHQRFNAIGNQFIDDILIELQTFLIDFSIPIRNDSRPGKREAVSLHAKLLHQSNIFPPVIIEIACHLG |
| GUT_GENOME098675_04277 | 272-337 | DHHGHPVLFCLAEQVLIVLQPLLVGNVLFPCGENAGPVDRGPENVKTGFAKQRQILFVAVVKVNSP |
| GUT_GENOME221775_00846 | 192-261 | DSDHWLNTVLVQLVKDPVIECKAGFIGSLFGTIGKNACPRNGHPVSLQPHLRKERNIFFVVVVEICCFVG |
| GUT_GENOME149360_00625 | 321-392 | DHRLDVVFQQLVEHVVVELQSLFIRLGFVTLREDAGPGDRGAEAFEAHLGEQFDVFFVATVEVDGFVVRVVF |
| GUT_GENOME171404_00876 | 263-341 | MRQRHQRGDVVLRHLVEQLVVERETGFVGLGVIAVGIDAGPGDGHPQAVKAHLGEQSDIFRVAMVKIDRHILNTAVAGY |
| GUT_GENOME247430_02617 | 159-218 | LNSIGNQLVYQVIVELKSFWIHLSRSIREDSGPGERKAICCKSTFLHQSNIFFIAMIMIA |
| GUT_GENOME171711_01385 | 275-345 | DERLHAVLFELVEHAVVEGEARLVRGFFVAAGEDARPGDGEAEDLKTHLREQGDVFTVAVVEVDAFQLKVV |
| GUT_GENOME099864_01606 | 316-385 | QRDQWLHSTAKHLVKQIIVILEPFLIGHRIVTVREDARPVNGGAEHFEAHLLHQRKIFLVAVVEVDCFMS |
| GUT_GENOME249851_01372 | 237-303 | KRHKWLNARILHGAEQVAVVVDAGLERLFLCTRGEQASPLDGYAQRLEAHLLEKRNVLLVAMKEVDA |
| GUT_GENOME027878_01847 | 80-146 | DRHHWFDPVFQTFINDILVKGNAFLINFPFSFRVDSCPGDGKAVGLQSHFCHQCNILFVSVIMINRH |
| GUT_GENOME162746_00535 | 341-405 | GDEGFNAVLFALTQHASIPLDAFAVRLKIISVRIDSAPANRGAEQLEAHLRKKRDVLTVVMIEVV |
| GUT_GENOME232292_00263 | 97-177 | MRQSNNWLNPVFETFIKKVIIKLQTSFIGLLFVSLRKNPRPSNRSSEALETHFRKQSNVPLIKMIKINCFMIWVVFAWQHS |
| GUT_GENOME164252_00526 | 95-162 | RNKRLNSVFVAFFKYIFIKLKSCFVWFCVITVWKDSRPSNRQTEGLESHFTKKCDIFFIMMIEVDSFH |
| GUT_GENOME180111_00453 | 198-262 | LGKDGVVEVQALLVRLGLHARGEDAAPGDGHAEQLDAHLAQKRDVLGVAVVEVDGIVIGVVLALD |
| GUT_GENOME124730_02032 | 268-343 | MAQRHHRLYAIAAAFLKQPVVKPKPALVGLCFIPLRKDPGPADGGPESPDSHIRQQANVLLVPAVKIDGLMTRIIR |
| GUT_GENOME125980_01965 | 242-308 | MENRHQRLDTVCQQLVHQIGIELQSFRIYLTLLGNHPGPTDGKAVGFESHFFHQRHVFFPAMIVVAG |
| GUT_GENOME106428_00166 | 1-69 | MEHRNEWFNVVFQTFVDDIVVEIDTFLIDRTGSFRKNTGPCNRKTECLESHFCHQSNIFFVVMIEISTY |
| GUT_GENOME167775_01737 | 92-166 | MRNRHKWLNSVLLAFFKGILIKFQASFIRLFLIPHWKNTTPGDGHSISLEPHFCKQSNIFFITMIHIDCNFCWIK |
| GUT_GENOME009799_01850 | 92-162 | MCNRNQRLNSIFMTFFKKIFIKLQSFFIGFFLITIWKNPGPGNRKPVCLKPHFTKHGNIFSEMMIHIYGFH |
| GUT_GENOME231594_03003 | 92-167 | VTERDQRFDIIFAALLKDRPVKGDALRIGRQFIAIGIETAPGNRRAEHAKPHLRHQRNVLRIAVIEIDGFMAGIVF |
| GUT_GENOME169846_01486 | 80-155 | MGNRDHDLHALGDQLVKHRVIKLQAGFIRHFFLAGWINSRPGNRSAVDVPAHFRKQRDIFFIMMVEINSLMRGVEW |
| GUT_GENOME153029_03308 | 79-147 | MCQCDDWLDALSVQFIKHLIIKPKSLFIGIFIIRIRKNPCPADTHAKCLKSTLLHPCDVFFISVIEIHR |
| GUT_GENOME103593_00480 | 313-387 | DRDERLDAVGEQLVHDVDVELDTLLVGGLLIAGGEDAAPGDGEAVHAKAHLCHEGDVFFPVVVEVDALVARIVLV |