UHGP-MC 102799
Information
- Number of sequences (UHGP-50): 158
- Average sequence length: 79±6 aa
- Average transmembrane regions: 0.15
- Low complexity (%): 6.96
- Coiled coils (%): 0
- Disordered domains (%): 14.94
- Pfam dominant architecture: PF18668
- Pfam % dominant architecture: 8165
- Pfam overlap: 0.76
- Pfam overlap type: extended
Downloads
- Seeds: MC102799.fasta
- Seeds (0.60 cdhit): MC102799_cdhit.fasta
- MSA: MC102799_msa.fasta
- HMM model: MC102799.hmm
Sequence list (filtered at 60% identity)
Protein | Range | AA |
---|---|---|
GUT_GENOME145671_02182 | 36-144 | DRFGVLRKTWHGMEMIFSRFIDYITGRGEQAVAAIGWQELGNWAVGLAVDNRQQIVYYNGSWYKYLGELEHVIAGDSPENDGGVWSAENPTGKWSNIGDAALRSNLGSG |
GUT_GENOME224857_00722 | 212-297 | WGYVLIDSFQLGANITTRFQALHWSLPDGNGEYYRWDGALPKIVDAGSIPTTTGGVGIGAWVSVGDASLRVELSSSADGKGSSLIA |
GUT_GENOME227607_01065 | 74-145 | GYTRMDSFQAGATLTLPNQILRETTTGLYYRWDGAFPKSVPAGSTPENSGGIGIGAWQLISTLGFEVPVDTD |
GUT_GENOME272778_02755 | 58-144 | ISQYGWIPVGTFQAGATLTLPNQILKDTTDGEYYRWDGSFQPSGKVVPNGSTPGATGGVGVGAWISVGDSALRSLLSSTSGAANVGT |
GUT_GENOME232182_02273 | 62-138 | FGYINLNGQNFTTGATVNLNELLLNPADNNYYHWTGTFPAGGKIVPPNSTPQSTGGTGPGKWLNVGDTALRGDLARP |
GUT_GENOME096419_02899 | 72-146 | AAGYVPAGAFQEGAGVVSRNGTVLWKLPDGDGDYYRWDGDLPKQVPAGSTPQSTGGIGKGAWVSVGDASLRSDII |
GUT_GENOME171625_03265 | 204-281 | ATGLWGYTLLDSFELGATLNYRYQALKKTSTGEYFRWDGALPKTVPAGSTPESTGGVGLGAWVSIGDAGLRSQLGTIF |
GUT_GENOME131092_01123 | 73-156 | GYITVDSFQQGAQLPDNEITQRNQILRDETTGEYYRWDGDLPKVVPAGSTPESTGGIGKGAWIDIGGANLRTDLKADNGFTFVG |
GUT_GENOME127966_01349 | 63-131 | KDSFQKGATLNSVRDEITYEGYRLVWTGDFPKTVAPFSTPQNTGGVGPGAWAYTSDAMLRGNLKSSAGY |
GUT_GENOME095444_04134 | 145-215 | TFQDGGELHSVLDRVSDGTYLYYWTGTYPVTVSPGSNINTLGGLGVGFWSVDGDALLRSALAQYFGFGNVG |
GUT_GENOME144766_01729 | 135-199 | SFESGATITTRNQALKFNTEDSYYMWNGALPKVIPTASTPSGTGGVGISAWVKLDDATVRFSTGL |
GUT_GENOME143501_04622 | 62-146 | YGFTILSGQTFATGATVNVNDILLDESTGEYFQWTGSFPEGGLIVPPGSTPVTSGGEGPGAWLSVGNAALRSMLSGADGRKYIGK |
GUT_GENOME231635_03086 | 44-131 | WLKQRYEEAFSGLGWAELGEWAVGLEVTTPSQIVHYQGYWYRYGGSLPHTITGASPSLDDNDNWFNLGNDVSLRANLGSGDGLKWVGK |
GUT_GENOME232032_01682 | 62-135 | GFKPVSGSFEKGGLIEEIWQTLLYEANGMFYQWTKELPKRVNQGSTPFKTDGTLVDGWVDRTDLTTRAELVKGQ |
GUT_GENOME183862_03985 | 73-148 | VGYITMDSFQDGKTLTLTNQALRWKLPDGDGDYYRWDGALPKVVPPGSTPTTSGGVDKGKWVNVSDGTARSDLAKP |
GUT_GENOME147143_04398 | 317-397 | GYSLRDRPESFGKGGTLTFASDVLLDEGSGKAYSSAGPYPRTVEKGTNPSSGGFTDRSKERLREMLASTGGAAMIGSQSGP |
GUT_GENOME001883_00528 | 61-148 | YGYVILTGKTFTTGATINNPNEVLLNTTDGEYYKWTGSFASGGKVVPANSTPAGTGGIGPGAWIGVGDASLRAALSSTEDGNGDELIG |
GUT_GENOME231992_00443 | 50-138 | DAANKSIAEKVSAVKTFAEGATLESPREEILFDGYRLVWTGEFPKTVLAGSTPQGAGGIGAGRWAYTSDAVIREDIASNEETLGAGMMG |
GUT_GENOME231465_02133 | 58-145 | ISSLGYITLDSFEDGNNLTLPNQVLRYEATGEYYRWDGELPKSVAPGSTPETSGGIGPGAWLSVGDASLRTDLASGEKASLVGYGSST |
GUT_GENOME096244_03219 | 78-152 | KSTFLNGATLESERDFIWDDNSKSWYYWTGAFSKEVPAASTPESTGGIGVGKWLSVGDASLRSELAQPDGSSMIG |
GUT_GENOME064775_01816 | 75-152 | IGDYTSGPLKIEEYNQLIRYNNELYKLTAATDIPFTTAGNTDETWTGTDAAHFVSVGDAALRQNLGSSEEGLGLSLVR |
GUT_GENOME231951_02494 | 63-145 | GLNPVGTFQGGAVINSAGDIIQDETTGAWYRWDDLTTLPKTVPPGSTPDSSGGTGVGKWLAVDVSDVLRRELALPTGADQIGY |
GUT_GENOME002976_01052 | 203-276 | SFAAGATLTSTKDFIYDQASNAYYYWTGTYGKVVPANSSPEDTGGIGSGAWSVVGDVVLRNSLASTSGASMIGV |
GUT_GENOME211898_03243 | 61-144 | FGWILKESFEDGATLTLPNDALLWRSNGEYYRWSGALPKTVPAGSTPESTGGIGQGAWVGISDAALKTMLASSAGASMIGMGAG |
GUT_GENOME103766_03612 | 134-208 | TFDTGATITERKQVLLNSDDGYWYRYNGTLPVSVPPNSSPDSSWSAVGYLNGYDIGDVRNWSNNLSDIALNDVNL |
GUT_GENOME145119_02233 | 183-265 | YGYNTKRSFEAGNTINYPNDVLLLESEGEYYRWDGPLPKEVPPGSTPDSTGGVAPGAWRGVGDATLRSELASDSGAEIVGTNS |
GUT_GENOME231519_02285 | 178-261 | WGYVPAIGSFEKGSLLTQRFEVLLWESTDEYWRWDGAMPKIVLPGSTPDTAGGRGKGKWLDVTDATLRSNLGSSEPGMGASLIA |
GUT_GENOME145815_04842 | 50-126 | LKSGYTFRDGAILETENDLIKYGDVLYAWTGAFNKEVPAGSSPESTGGIGEGAWKQVIDSNLRRDIASHDGFGNIGK |
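
The rows above can be read programmatically. The following is a minimal sketch, assuming each row keeps the `Protein | Range | AA |` layout and that the range is 1-based and inclusive (an assumption, though it is consistent with the row used below, where 135-199 spans 65 residues). The function names `parse_row`, `span_matches` and `to_fasta`, and the `>protein/start-end` header style, are illustrative choices and not part of the UHGP resource.

```python
# Hypothetical helpers for the sequence table; not an official UHGP tool.

EXAMPLE_ROW = (
    "GUT_GENOME144766_01729 | 135-199 | "
    "SFESGATITTRNQALKFNTEDSYYMWNGALPKVIPTASTPSGTGGVGISAWVKLDDATVRFSTGL |"
)


def parse_row(line):
    """Split one table row into (protein, start, end, sequence)."""
    # The trailing '|' yields an empty fourth cell, so keep only the first three.
    protein, span, seq = [cell.strip() for cell in line.split("|")[:3]]
    start, end = (int(x) for x in span.split("-"))
    return protein, start, end, seq


def span_matches(start, end, seq):
    """Check that an inclusive start-end range agrees with the sequence length."""
    return end - start + 1 == len(seq)


def to_fasta(rows):
    """Render table rows as FASTA text with 'protein/start-end' headers."""
    records = []
    for line in rows:
        protein, start, end, seq = parse_row(line)
        records.append(f">{protein}/{start}-{end}\n{seq}")
    return "\n".join(records) + "\n"


if __name__ == "__main__":
    protein, start, end, seq = parse_row(EXAMPLE_ROW)
    print(protein, span_matches(start, end, seq))
    print(to_fasta([EXAMPLE_ROW]), end="")
```

Checking the range against the sequence length is a cheap way to catch rows that were truncated or shifted during extraction before the sequences are passed to other tools.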