UHGP-MC 5954
Information
- Number of sequences (UHGP-50): 53
- Average sequence length: 65±4 aa
- Average transmembrane regions: 0.02
- Low complexity (%): 3.83
- Coiled coils (%): 0
- Disordered domains (%): 1.85
- Pfam dominant architecture: PF02894
- Pfam % dominant architecture: 189
- Pfam overlap: 0.38
- Pfam overlap type: shifted
Downloads
- Seeds: MC5954.fasta
- Seeds (0.60 cdhit): MC5954_cdhit.fasta
- MSA: MC5954_msa.fasta
- HMM model: MC5954.hmm
Sequences list (filtered 60 P.I.)
| Protein | Range | Sequence (AA) |
|---|---|---|
| GUT_GENOME219870_01556 | 298-363 | GAQHAGVMQNFADALTGRDMLRFDVSEAMGSLGLANGMLMSAWCGCEVDFPLDGAKYKRLLDEKIA |
| GUT_GENOME143137_01542 | 302-370 | MDEEKPYILMLRNFTQAVLGKESLIAQGCEGERAMELANAAYLSAWKGQKVTLPLDAEEYERYLDQAKR |
| GUT_GENOME116069_00927 | 300-365 | GAGHTGILENFARGVLYGEPLLAPGADGLNELALSNAAYLSSWLDREIKLPPDSELFDAELAARIA |
| GUT_GENOME193968_00428 | 583-651 | ETDGRNKQHPEVTSKFAAAILRGEPLVARGEEGIRGLSISNAAHLSSWLGKPVELPVDEDVYYAELQKK |
| GUT_GENOME237527_01938 | 298-368 | NGEQHIGIKKNFVNAILRGEKLIAPAEEGIHSVELANAMIYSFMTGKRVELPLDAAVYEEHLKRLIADSRF |
| GUT_GENOME136650_01557 | 303-372 | VEGYQKIFQNFTDHLLKGEPLLATGEDGLRVLMIANGAYLSSWQGQRVEFPIDDEKYAVMLEEKVRDEQK |
| GUT_GENOME222576_01100 | 12-73 | QQTAIVNHFIQAVQGREKILCPVEEAVQSLNIINGAYLSSWNNKVVSFPLDMALYRKEWEKA |
| GUT_GENOME017739_01840 | 304-366 | HKDIVENVANAILHGTPLLAPMEEGINGLELGNAMLLSGWKEETVNLPIDSAEYAARLERLIA |
| GUT_GENOME078784_00011 | 309-373 | HKGIIQNVINAILNGEKLIAPGPEGINALSLINAVNLSSWTDSTIEFPIDADLYWKLLQEKIEAS |
| GUT_GENOME051831_01714 | 380-445 | YPEMLANFADAILHGTPLTAPGVEGVRALELTDAAYLSAWLGEKLTLPLDADRFEQELQKHIQEEQ |
| GUT_GENOME219682_00879 | 302-373 | GFSHPDVTQNFTDAILYGAENKWNGLDGINSLELVNAMMLSGWQNGAMIHLPLDADLYHQTLLEKIEQNKKN |
| GUT_GENOME283047_00951 | 315-376 | AALRNFAEAVRRRDPDLPLAPWAEGRKSLLLTNAAYLSSWRGETVAIPAPGAADEAAFEQAF |
| GUT_GENOME253443_00880 | 319-382 | ILNNFANAVLGLEPLYAPASDGIKGVTLANAMLLSAWTNSTVELPIDGKKFYELLQEHIRDSKV |
| GUT_GENOME001177_01971 | 304-365 | EYAAILQNFADHLLKGTALTAPASAGLAALEMANGAYLAAWLGRRVEYPVDRELYERLLKQK |
| GUT_GENOME251942_01223 | 307-369 | HEAIIDNMVQTILGKSELIAPLEEGIRGLELGNAMLYSGLNETTVDLPLNSQAYSSMLDRLIG |
| GUT_GENOME006638_00568 | 296-362 | DGENPQHAGVLNAFAAHILHGTPLVADGREGIRGLMLSNAMHLSSWTNQTVSLPIDEELFLKLLNEK |
| GUT_GENOME050070_01400 | 616-693 | QHINVMINFTEAILEGKDLLAPGTDGIRGVTLANAMHLSSWLGKDIELPFDEDLFLSELEKRKAEELESKKEVELETV |
| GUT_GENOME026861_00291 | 310-379 | YVMMFRNFSDHIRKGTPLIAPGEEGVGGLLLANAAYLSAWTGERVSFPLDEKRYGELLEEHRRAEREKNS |
| GUT_GENOME023314_01545 | 310-379 | VLANVCDAILHGTALRAPVEEGIKGLELANAMMLSGFLKKPVTLPLDSSIYAEKLQELIATSRYPKKVVA |
| GUT_GENOME127900_00207 | 307-373 | EAHVGIFKNFTRAALTGSALLAPGEEGILGLTISNAIHYSAWTGKTVDVNNFPHDEFYDLLQDKIKH |
| GUT_GENOME095922_00159 | 315-385 | QHINVIRNFSLVVLGKEDKLIAPYADGLNGLTFSNAIHLSGWTGKQVVYPIDEEAYIAELNKRVDEEKNGG |
| GUT_GENOME052378_01646 | 321-388 | DGWGVQHTTVMEDFAKHIIEGTPLLAPGADGINGVNLANATLLSSWLGKEVELPVDEDVYLAELNKKI |
| GUT_GENOME257502_01461 | 314-371 | EHLQILQNFTNHILYGETLIAPGYDGIFSLSLSNAAYLSAWQSRTVSLPLSKEDLAAF |
| GUT_GENOME133037_00942 | 593-656 | ICRNFTNAILGIEPLFVDGKEGLKSVELMDAMLMSTWLNKMIELPIDDEKYYELLLKQIKNSKI |
| GUT_GENOME263628_02013 | 260-331 | YSEFTAEKEEAHAGILKNFAGAILYGEELLSPGYDGINELTISNAAYLSEWTGNKKVTLPFDTAEFDRLLKE |
| GUT_GENOME251867_00587 | 301-364 | VLKNFVGAIEGREKLDYAAEEGKKSLEIANLMLLSAWKNSEVSSPIDSAEYKKILEEKSAASKL |
| GUT_GENOME141041_03564 | 2-62 | ITQNFIDAIRLNTPLLAPGEEGMKGLTISNAIQLSTWLNDAVEFPLDENLYYEQLQKRIQQ |
| GUT_GENOME000496_00486 | 317-385 | YRLMLQNFADHIRYGNVLAVPGEEGLNALELANAAYLSSWFMEEVHLPISDKVYQNALAAQMRLEKPLP |
| GUT_GENOME002048_00068 | 315-385 | QHCEMLADFVNAAQTGTPCTAPGAEGLNEVQLANAMYVSAWQGRAVTLPVDEAAFSDMLAERARSEAKKNN |
| GUT_GENOME040401_00677 | 307-370 | QQKNILVNFAAAIAGRQKLISPIEDGLASVEIINAVYAGGWTGKKAKIPFDAEEYKALLNEKRE |
| GUT_GENOME075922_02244 | 291-350 | LMLSEFIKAISEGKSKCVMGGEAINSLMISNAMYLSSWRECVIQLPIGGQTFRTELQRRK |
| GUT_GENOME250540_00354 | 582-655 | GQGDQHVGILNNYAEALLNGEALLAPGEEGIFGVTVADAMYLSDYKKAFVDTKNFPHDEYVAFLKEKISESKKE |
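
The Range column appears to give 1-based, inclusive residue coordinates within the parent protein, so `end - start + 1` should equal the length of the sequence in the last column. The following is a minimal Python sketch of that consistency check, assuming this reading of the column; the function names `parse_row` and `range_matches_length` are hypothetical and are not part of any UHGP tooling.

```python
def parse_row(row: str):
    """Split one markdown table row into (protein_id, start, end, sequence)."""
    cells = [c.strip() for c in row.strip().strip("|").split("|")]
    protein_id, rng, seq = cells
    # Assumption: the range is written as "start-end" with 1-based inclusive ends.
    start, end = (int(x) for x in rng.split("-"))
    return protein_id, start, end, seq


def range_matches_length(row: str) -> bool:
    """Return True if the range spans exactly as many residues as the sequence."""
    _, start, end, seq = parse_row(row)
    return end - start + 1 == len(seq)


# Two rows copied from the table above.
rows = [
    "| GUT_GENOME222576_01100 | 12-73 | QQTAIVNHFIQAVQGREKILCPVEEAVQSLNIINGAYLSSWNNKVVSFPLDMALYRKEWEKA |",
    "| GUT_GENOME141041_03564 | 2-62 | ITQNFIDAIRLNTPLLAPGEEGMKGLTISNAIQLSTWLNDAVEFPLDENLYYEQLQKRIQQ |",
]

for row in rows:
    print(parse_row(row)[0], range_matches_length(row))
```

Running the same check over every row is a quick way to spot truncated sequences or shifted coordinates after copying the table.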