UHGP-MC 2614
Information
- Number of sequences (UHGP-50):
- 61
- Average sequence length:
- 78±9 aa
- Average transmembrane regions:
- 0.05
- Low complexity (%):
- 4.35
- Coiled coils (%):
- 0
- Disordered domains (%):
- 5.32
- Pfam dominant architecture:
- PF12571
- Pfam % dominant architecture:
- 820
- Pfam overlap:
- 0.36
- Pfam overlap type:
- shifted
Downloads
- Seeds:
- MC2614.fasta
- Seeds (0.60 cdhit):
- MC2614_cdhit.fasta
- MSA:
- MC2614_msa.fasta
- HMM model:
- MC2614.hmm
Sequences list (filtered 60 P.I.)
| Protein | Range | AA |
|---|---|---|
| GUT_GENOME147133_02283 | 294-412 | LAGKMDKAKNGSDIANVAAFLANLGLGDAAKLGVATNSQMAAGTSTSLLPTVAAVMSLFAKRSFAAADYIRIPDVPGGLIIQWVRGATAGFNEGAFPAITFPIPFPTACVFVSAATQGN |
| GUT_GENOME250613_02959 | 88-157 | NLGLGTAATKNVGTGANQIPDMSSFSYSNGVYRFPNGMIIQLFAVTSLTTTAVTPFTFPVVFPSSCQGIF |
| GUT_GENOME030447_01473 | 134-216 | AFIQAAITALSLGTASQKNVGTGTGQIPDMSSFVFNLGGSGAAYNWLPGGNLIQMGRATTSTTGQQFVNIPITYPNGNFNVVA |
| GUT_GENOME144123_03747 | 341-425 | KDVAGLLAYLGLGEAAKRNVGTGENQIPDMASFASGDGWMKLPNGKILQYGRGAVTPTLSTQTMRITFSIPFPKKVDCAMLTHSG |
| GUT_GENOME141376_02157 | 90-163 | VKTALENLGLGEAAKRDVGTGENQIPDISNFPFTKNNPLNFELPGGLILKAGYGKLTTQSSIWINFPAAFPNDC |
| GUT_GENOME232319_00848 | 173-249 | ILSYLGLGEAAKRAVGTGENQIPDMSSFPLTVTGDLSSRKLTTKLPNGLILKAVNGNADSNGWFSLNFDTPFPSEIL |
| GUT_GENOME199027_02703 | 327-413 | GKQPLDSTLTTLSGKTADGIIEYLGLGNAASRNGITGLLSGSGYLKIPFLNGATERTFILQWGSVSNSGSGTTGSATFPIVFPSAII |
| GUT_GENOME096246_00895 | 282-357 | LQNLGLKEAAKRDVGTGVNQIPDMSAFDAQLALSGYAKEPSGLIRQWVGGNTGGMVAGEIKVIELPFPFPSGILAV |
| GUT_GENOME171640_00054 | 371-443 | LGLKSAASREVGTGANQIPDMNSFTFSNATQGFMKYPGGFIEQWFVATIPQAPSSTVNGTIDIIFPTPFPTKS |
| GUT_GENOME251068_02097 | 133-210 | GKTSVADVLTYLGLKEAAKRDVGTGANQIPDMSSFAMSMGQTGRQLLPSGMLIQWGVIALGNDGNAFGTLPVAFPNSI |
| GUT_GENOME145558_03885 | 231-298 | NLGLGEAAKRDVGTGENQIPDMVSFSGVRDFYGKQLLPGGLILQWLTIPSSAAVKAVTLDNGNYQLSG |
| GUT_GENOME094301_01946 | 199-277 | LKSAAKKDVGNSAGQIPDMSYFELSGIGTDNMIAKLPNGLIIQVFRRNISNSASVGVATINPVTFPTPFPGAVWGVFCT |
| GUT_GENOME147130_00040 | 105-176 | LGLGDAAKRSVGTGENQLPDMNFFTSGSNWQKLPSGKIIQYGIVSIPQAQSAAVFNLTFPIAFPNAASIVVG |
| GUT_GENOME278390_00034 | 197-290 | SAANPHPGKFAAADHTHDLSGKANITLNNLNLSTALANLGFSGSLVENGYQKLPGGLIIQWGNLTGIATETLWEQRYLFPIAFPHQCLTVLAIP |
| GUT_GENOME144670_03937 | 91-167 | AVSEALENLGLGEAAKRDVGTGANQLPDMSSFTFTGDANSFVARLPGGIVIQGGKASTDVNGFATVTLGLAMSTYAV |
| GUT_GENOME144998_00936 | 196-282 | AGDVLKNLGLGEAAKRDVGTGENQIPDINSFLSVTGAAGYQKLPSGLIIQWGSATAGVGATGNTGNAINFSVSFPNICHQVVVSYDN |
| GUT_GENOME141887_01105 | 343-421 | VAGLLTYLGLGEAAKRNVGTGANQIPDMGSFTLSVSGTGYQKLPSGFILQWGSIGAPGIAQDVVTHFPIAFPNRCLRVL |
| GUT_GENOME141871_00226 | 406-491 | FRNALQLGTAALASMGVGKNQVLSASDFSFFAGGNGYLYIPCLATTDNPVKIMLQWGTIRTRGGENNIYNLPYAFPTAGLWAMGCR |
| GUT_GENOME249421_01236 | 91-183 | KTALENLGLGEAAKRDVGNGQNQIPIVTVLLFHRPNLYLRSTMSAWECGGDTTTGWRRSPDGYIEQWGLTGATINEVLINFPIPFPTGVISIN |
| GUT_GENOME145578_01089 | 278-340 | ISGLGTAATKNVGTGSGQIPDMSSFTSGTGWASLPGGTIIQWGIAASGQIGDPQTINFPVPFT |
| GUT_GENOME145020_01420 | 419-490 | KSDARANLGLGTVATKNVGNSAGQIPDMSYWSSPAGGINFPNGFQMRFGTIAGTGGKLFSTPFTNQCYGIVF |
| GUT_GENOME212260_01587 | 77-134 | RTNIGVGAGQVPDMSAFASSLAVNGYQKLPGGLIIQQGMATPTVSGVFVSFPIPFPQG |
| GUT_GENOME171578_04623 | 227-316 | FGTAAMRNIGTATSRDVPDMSNFPRDDNPQTGYFYLPNGHLSQYGTVNLPAVGTFNPASFGGVTYYTRYYIVDFPVAYPNAQISTVVSLA |
| GUT_GENOME144704_02367 | 158-237 | ENFREALNLGNAALKNTGTGQGQLPDMSAFLSNQEVNGYQYLPGGLILQWGNTPCGDRRTTVSKFPIPFPNECLMVIVCG |
| GUT_GENOME144715_02894 | 12-78 | NVGLKEAAKRAVGTGAGQIPDMSAFEYVGNASAGYLKLPNGFKLQWLETPAQVPASSTGVGYWTYPF |
| GUT_GENOME231253_03904 | 73-141 | MITNLALGTASKRNVGTGTNQLPDMSSFTSTLNTPGVMRFPGGFTIMWALGGTDSAGVANTAMPASFPN |
| GUT_GENOME147525_02376 | 93-165 | SNLGLGEAAKRDVGTGANQIPDMSKWKSLKAPMGWRETPDGFIEQWGKSTYGNGDADFVIPFPHECFYVVISS |
| GUT_GENOME079047_00674 | 224-303 | KTVSGILAYFGLKTAAQKDVGTGPNQIPDMGGFTALLGDNGWAKLPSGLIFQWGIAGPLTPQNPDAVANFNITFPNKCLF |
| GUT_GENOME031700_04399 | 179-247 | DSRLVNLKSAANRSVGNSANQIPDMSYFTTGETSSGRWAKLPSGLIIQTGFGLTASNGTATVSLPIPFT |
| GUT_GENOME047373_00246 | 13-93 | ILPKVGLKEAAKRAVGTAASQIPDMSFFIQGFAMASGNQFAGYFGFPGGMLVQYGRVALYSTANVATINLPLTYADANFAV |
| GUT_GENOME140833_00754 | 122-199 | EGVLSDLGLGEAAKRNVGTGENQVPDMNSFGNSLTANGYQKLPGGMIIQWGSFYVSPTGGSVGTVDITLPVAFPAACR |
| GUT_GENOME227607_04632 | 424-514 | GKADAATAKGHLGVGSAGGRNVGTGFSAGGSEIPDMTFFAGLKGSAGYQYFPTGMLLQWGTVGLKSAPSGSSIGTFPVAFPAAGQQIVVTH |
| GUT_GENOME146007_04527 | 339-425 | RAAITALSLGTASTRNVGNGGFGTPAPQIPDMSFFPNSYGSGTGWFKLPGGDILQFGSTNLSGGVSGTRINLPIPFPSSGYSISLTY |
| GUT_GENOME144139_00363 | 86-157 | ITTNLGLGEAAKRNVGNGENQIPDMSFWTVTGGNGNFVIRQPDGLITQMVTVSISGPVAMNGMTDNAYAITG |
| GUT_GENOME148074_02928 | 351-433 | TVADMRTYLQLFSAAQRDVGTGANQIPDMNAFSGSIVSKGYQKFPGGLIIQWGINNASVGGTSGNGEDVSYAIPFPNGCLSLT |
| GUT_GENOME143924_04871 | 117-193 | DIENVGLGEAATRNVGTGTGQVPDMSSFTTGHSGAADWPIPKSGWSKGPDGVITQWGIFGFPVGQTGTNVVFPLPFP |
| GUT_GENOME232008_02693 | 236-317 | GATTAADARNNLGLGNAATFTVGSGANQIPDMNSFTKGSGWQKLPGGKILQWFQVTTSTSAAVQVSFPIPFMSGANMIVASP |
| GUT_GENOME143950_03971 | 343-426 | VAGLLTYLGLGTAAKKDVGTGTGQIPDMSNFTYAGASNVFTSIVIAGNRRIMEGSGSGNFVNGVLSFTLPLAYPSTGFTFIPTD |
| GUT_GENOME284002_02233 | 224-284 | EFGPAAYRNVGNGNNQIPDMSFFESGPGWIKFPDGTIIQTGISISGNIGFPATVSLHIPFR |
| GUT_GENOME141377_02881 | 198-271 | TNLGLKSAALRDVGVGANQIPDMNSFSAVLSIIGSEKAPGLAIKQWVGGGLGLLPAGGSRTITLPFPFPSGILI |
| GUT_GENOME143746_02521 | 181-259 | RAALGLGTAATANIGTAANQIPDMNSFALAPGGGGASFRRLPGGDIIQMGRLQTSASNTVTGTLNVPFPNANYNIIAVA |
| GUT_GENOME143408_03425 | 99-172 | LELGNAALKTVGNAAGNIPDMSFFTASNSTNGWQKLPSGLIIQWATSTGTNASGVANTTLPMQFPTAGLCALVT |
| GUT_GENOME144315_00269 | 247-336 | INNALAGKQPLDNTLTNLSGKDVAGLLTYLGLGDISATGVATGSMGETGYAIFPMMIGGARKTFIMQWGLLTPLKSGGYVNINLPVAFPT |
| GUT_GENOME143743_01903 | 84-165 | GSAAVAEALTNLGLGTAATKNVGTGSNQLPDMSSFAASLSVSGWQKLPGGLIIQWSQVLSNSGGFAPWTYPIAFSTACFHVY |
| GUT_GENOME231626_02436 | 101-182 | IEFLNAGTQAQSDARFNIGCGSAATRAVGTASGNIPDMSSFASQQASSGYQQMPGGLVLQWINTTAPDGVTSGSVALPIAFN |
| GUT_GENOME143488_00771 | 226-309 | AEALTNLGLGDAATKDVGTTPGTVAAGDDPRFLTGWTYGGNKARGWRKDPQGNIEQWGTDPATNGIVAVVTMPIPFPSASFIVN |
| GUT_GENOME096099_02421 | 161-235 | SRLNLGLGSASTRDIGTLGDNIPDMYSFNSDLRPVGIQYLPGGYVRQWGVGVGNSSGDATITFPVAFSETPFSIA |
| GUT_GENOME096156_00512 | 230-310 | AGLRSVLGLGTTATRNVGTGASQIPDMSSFLSGSTASAFWYKLPGGMVVMGGNASGIASGTAGNEVFFPIPFPNACSSVVV |
| GUT_GENOME096384_03241 | 91-167 | LSRQNPFADIKADGAAAVSSALANLGLGATSASLAQNGWQRFPSGLILQWGYNVVLNGVVSVTFPIPFPTALFCVSG |
| GUT_GENOME145300_02971 | 260-329 | TAADARANLGLGSAATANLGNGANQVPTMANFTSGPGWVKFPDGTIMQFGTNISGSAGYPTAVNFPIPFT |
| GUT_GENOME145318_03176 | 117-204 | ASDMRTTLGLGSASTRNVGMGSASNLIDMDSVRSMMSGNGYIQLPSVATTGSNLKLYLQWGNTSTPPNSNNTYNVNIAFPNTILFAMG |
| GUT_GENOME096246_00855 | 413-485 | AQLRDYLQLGSAATKAVGNGPGQVPDMSFFPLSGGVSGYVKLPNGFIFQWGQGNAGAGSANIPFPLMFPGGCV |