UHGP-MC 101831


Information


Number of sequences (UHGP-50):
173
Average sequence length:
51±5 aa
Average transmembrane regions:
0.43
Low complexity (%):
12.08
Coiled coils (%):
0
Disordered domains (%):
43.4

Pfam dominant architecture:
PF04892
Pfam % dominant architecture:
116
Pfam overlap:
0.31
Pfam overlap type:
reduced

Downloads

Seeds:
MC101831.fasta
Seeds (0.60 cdhit):
MC101831_cdhit.fasta
MSA:
MC101831_msa.fasta
HMM model:
MC101831.hmm

Sequences list (filtered 60 P.I.)

Protein Range AA
GUT_GENOME220305_00184226-281SAYDVTMPKNAATSIQNTAPGPPAATAVATPTIFPVPIIAASEVESAPNALTAPFA
GUT_GENOME152327_023671-47MPAKADIHIQKIAPGPPIVTAPVTPTILPAPTLPAILIRKAASWVVP
GUT_GENOME284689_02008126-193VAMPNRPVTHIHKTAPGPPARMAVATPTMEPVPMVAASAVASAPKWLMSPLPSGSLESDSLMAFGSLR
GUT_GENOME080637_007996-61SANFVLQPMTPATHIQNTAPGPAMKMIAGATTIFAAPIDEARAVEAAWNGETVPSP
GUT_GENOME157418_00252103-159SKNFVVIPTRALTHIQNTTPGPPIATAIATPAMLPKPTVAAMALINAWNELICPSCL
GUT_GENOME244846_0076395-149LSENFKASPTTAASIIHPSAPGPPSAIAEATPTILPVPISAASSVISDANGETPA
GUT_GENOME109256_008991-58MAPKAETHIQNKTPAPPFCRASATPVILPSPVGRGFFVGSRSLSETVQKLPKASEGFS
GUT_GENOME239341_001065-54KNPATTIQNNAPGPPSEMAVVTPIILPVPTVAPIITAREPKLLTMPSEST
GUT_GENOME239440_0084213-65VAAPNNAKIHIQKMAPAPPMMMADVTPVMFPTPTRAPIPTQNAWNEEIMLFLS
GUT_GENOME284296_0101211-66SENLVAMPNKAASHIQNRAPGPPATTAVAIPTMLPVPTVADIAVHRAPKLVTSPLP
GUT_GENOME212162_035276-54LLDMPTSAVTHIQNTAPGPPRVTAMPTPAMLPAPTRPARLSIRAWKELS
GUT_GENOME229057_011439-63LMTQQKNPDSHSQNTPPGPPKATAMAVPMMLPAPKVEARAPVTACQPENLPSSPF
GUT_GENOME169843_001494-54VAPPMMAMAHIQNTEPKPPRHRAVEIPMMFPVPTREAVETIRAWNEETEPS
GUT_GENOME280995_023903-48PMRATAHIQNTAPGPPVAMATDTPAMFPTPSVEASAVQQAWNGEIS
GUT_GENOME266092_018951-46MPSTADIHIQNIDPAPPEYIAVATPTMLPVPRQAASAEKSAPLWLT
GUT_GENOME111868_0018898-155SAYLVDRPKSPVSQHHNTAPGPPRAIAVATPMILPVPIVAAKAVANAPNCDTSPLAFL
GUT_GENOME226275_0113219-70KAEAEPKSAISHIQKTAPGPPAATARATPAKFPVPTLEAVLIQNAWNEDIPC
GUT_GENOME252160_0144214-70ISAYFTSIAAIALHHIQNIAPHPPVIIAAATPVMFPTPSVPASASMSALKGDIFLSA
GUT_GENOME079221_010311-53MPRIPVIQHQNNAPGPPMAIAVETPTILPVPSVAASVVASAAKPDRLAPFPSF
GUT_GENOME159640_0145414-72ARATSAYLVIIPNRALTHIQKIAPGPPAVMAVATPAMLPVPIVPAKAVDTAWKGLIPPL
GUT_GENOME112720_0110948-103TTSAYFMPPQSRPQSQNQSILPAPPTESAIPTPVMLPMPTVPPSAVASDENGDIPS
GUT_GENOME183038_00848103-157SEYRVAMPRTPVSQHQKMAPGPPAAIATPTPMMLPAPMLAARATIRELNGEMSPL
GUT_GENOME068385_001026-58SVKAKAITTNADKIIQKTAPHPPTEIAVATPIIFPVPIEAARAVDNACILEIP
GUT_GENOME223785_0014114-70VDSTKLVDIPIRATSHIQNRAPGPPATTASATPATLPMPTRAASPTAKAWKEEMPPS
GUT_GENOME239143_002876-60STNLSAIPKSAPIKIQNTAPGPPIEIDNATPAMLPSPTVPDKAVINDCNGETSPP
GUT_GENOME097205_025481-49MPKKAATHIQNSAPGPPVEMAPVTPTMLPAPTRMAVLSMKEAMGLIPVC
GUT_GENOME243890_0015325-84FVAIPTRAVTHIQNTAPGPPKAMATGTPTIFAPPIQEARAVAAAAKADTEPSPSDFLNIP
GUT_GENOME255948_0088282-134ISAYFVIMPINANRSIQKTAPGPPTHIADATPTMLPIPIVLPKAMDIALKPSG
GUT_GENOME272443_003057-57FVAMPRKAAAHIQNNAPGPPYKMAVETPTMLPLPTQLAREAHSDWTDVQPF
GUT_GENOME068379_004751-42MIHIQNTAPGPPVTIAEATPAILPVPTVEASAAHTLWNWDIC
GUT_GENOME256420_0001512-62STNFKDIPKIAPTKIQKAAPGPPNEIAIAAPAIEPIPIVPDKAVARAWKDE
GUT_GENOME204230_0167916-70SANFRAIEKNPAKIIQKITPGPPIDTAAAEPATFPIPADAEIVAVSAWKAETSPF
GUT_GENOME069123_0115427-75METSALIHIQKTEPIPPSDMATATPTIFPTPSVPASETLSTRSGEFSPA
GUT_GENOME077934_0024611-60KEVEAPSKAVTHIQNTAPAPPTDSAATTPTRLPMPTRVAVDTTSVWKDER
GUT_GENOME245172_00945105-158SAYLTAMPNKAHTHIQNKDPGPPSLSAYATPIMLPMPTVPPSAVDIARSCESIP
GUT_GENOME158406_01835103-159SAALVDMPTSPATHIQTMAPGPPIVTAVATPAMFPTPTVLASAVHPAAKLEMVPSPF
GUT_GENOME090141_0018422-77SEYLVAMPMRAVTHSQNSVPGPPKVTAVVMPAMFPVPTVPARAVETAWKGVISPSP
GUT_GENOME079120_0145489-157PASLVVRYLMASIHSENLLVRPKIAEISIQTSAAGPPLTNAVATPTMLPVPMVAASAVISAANGEISPV
GUT_GENOME138352_015401-44MAVIHIQKIAPGPPRTIAEATPMMLPVPTRDAVETMSAPNEDTL
GUT_GENOME270652_006444-57SRKLVAMPIPAKTHIQRMAPGPPTRRAMIGPVMLPTPMREPMLMQNTWKDESLF
GUT_GENOME104702_005513-51AHPAKASIHIHTRAPGPPKAMATGTPVILDAPTHAAREAVHAENDDIPP
GUT_GENOME225761_0089824-77ISAYFKPIPSSALTQSHSRAPGPPSSRAPATPAMFPVPIVAAREVEAASKGETP
GUT_GENOME125003_013322-47VTIPKKAEIHIHSTAPGPPVKIAEVTPIMLPVPTVPEMAAENASNC
GUT_GENOME280790_0077010-59MISTEKKAAIHIQKMAPGPPMAMAVATPARLPVPTWAEMAVASDWKEDMP
GUT_GENOME273104_008201-45MPINAVTHIQNTAPSPPSAIAVATPLILPTPTRLANPVANAWNGV
GUT_GENOME157499_0081969-122FVEMPNAAAIHIQTSAPGPPMIMAVATPTIFPVPIVPANAVVSAEKGEISPSPL
GUT_GENOME231549_0060013-67STNAVAPPTMATNHIQNTAPGPPMVMATATPAILPVPTRDAAEMVNALNAEIPFL
GUT_GENOME170658_019719-63RYSSDKRISATFTIIPAKALTHIQKTAPAPPIATAPATPAIFPVPTVAESAVHAA
GUT_GENOME238385_009191-47MPRTATIHIQKIAPGPPARIAPEAPTIFPVPTCAATAAQRAWKELMP
GUT_GENOME011841_0124015-69SANMVAIPKMLEIHIQNIAPAPPSEMALATPTILPVPIAPARAVAIARNGVILPF