UHGP-MC 20006


Information


Number of sequences (UHGP-50): 221
Average sequence length: 81±7 aa
Average transmembrane regions: 0.02
Low complexity (%): 1.08
Coiled coils (%): 0
Disordered domains (%): 2.15

Pfam dominant architecture: PF12224
Pfam % dominant architecture: 1222
Pfam overlap: 0.08
Pfam overlap type: shifted

Downloads

Seeds: MC20006.fasta
Seeds (0.60 cdhit): MC20006_cdhit.fasta
MSA: MC20006_msa.fasta
HMM model: MC20006.hmm

Sequence list (filtered at 60% identity)

Protein Range AA
GUT_GENOME238261_01472 207-291 VEFRLFNGTTHAGQLKANIQLALCLVTFAKEARAASAKHQRTLNTNTSKYDVRVFLLRLGMIGEAFKTARHLLTKHLGGSTAWRG
GUT_GENOME193244_01617 77-168 EDSILIVEENSFISSEKEAAYRQFLKKLLQTAHARKWVVPNRKNTSSGASEKYRFRIWLNQLGLKGAEHASTRKLLTGNLSGSSAYSSEEKM
GUT_GENOME056514_01277 163-251 FPDTGSTDKNLALTSLFAMMVRAAGEAKRVNPKNLIQENEKYYLRIWLLRLGFTGKGGQGIRRALLTGLKGHTAFRTPEEAERFSARQK
GUT_GENOME033880_00266 215-297 KGTVEFRLFNSTLHAGKIKAYIQFCLAVSAWAITSNDKLVFRSMDNYTAEQKVTIMRNILTHRLGLYGDEFKTCRLHMMNPLK
GUT_GENOME253607_02578 196-272 RWTAYADLLKGILKTAETASRISIKRVDDSENEKYHANSWLMRMGFGGADYKETRRILMGHLTGFAAFKSAADMEAH
GUT_GENOME008388_01737 212-291 NGSIEYRMFNGTCEPERISAMIILSLAMSAYALNREQILYRKTEPVQEKRAFSAWLSQLGLVGKEFQKTRRLLVENLGGK
GUT_GENOME120905_00066 157-237 TFTFPLSEDAAKNRAYAELCAMMAARAKDAIRVSAEPVIQENEKYYLRIWLVQLGLSGKGGKESRKALLSGLKGHTAFRTK
GUT_GENOME009300_00148 255-336 FNATLHAGVIRSFVLLVLSMNAKALISDRIKAEKNPIMIAGNEKFAMRTWLVSMGWTGDMFKNPKNHMLKNLTGNSAWRFGK
GUT_GENOME089520_01996 269-352 EKISLAFAGPLTQDKVSAYTELCSAMNRMAVTQKRIQAKTVNDANEKYALRIWLIRLGLNGDEHKTLRKILMQNLSGHAAFRTE
GUT_GENOME242984_00208 136-208 NQPGLASYAKFINSLCKMSVEIKRVNNSKHKQVNDKYAFRCFLLRLGFIGDEFKQDRKIMLSRLEGSCAFRNG
GUT_GENOME236864_00944 227-301 NPTGWNSFAELLTTVAARAMKAHHVSSKRMEPLDSEMKYFCRSWLMQLGMGGAEYKATRAALLKHLHGYAAFRTA
GUT_GENOME092618_00818 188-274 KVDEERIRFPWFTLTGEDGEAEAYMQLISALADMARNSKRITAKEKDVENEKYAFRCFLLRLGFIGQEYKTTRKVLMKNLEGNAAWK
GUT_GENOME031909_00099 194-283 ATGNALKNRALIELAAFMVGAAKKAKRVQPATRKPENEKYYLRMWLLRIGMGTRASHESRMALLKGLNGWSAFRTEGEAKAHARKQKERR
GUT_GENOME003584_00878 242-317 LHAGQLKAYVQLCLAMNYRALHTRAAKYQPLQSDNQRYTMRCWLLRLGFIGDEFATARGVFTNRLPGYTAWRNGRP
GUT_GENOME006709_01922 234-312 CFESTLHAGVARANITLALAISAQAINQTRTLAKKTPVTENPAFTFRTFLLRLGLVGEEYKNVRMHLLKDLPGDPAWRY
GUT_GENOME080335_00665 54-140 SEGLSFPWWDELPEFEKITAYTEFLSKMVAYAKRIGLTTHRAASDKVVNEKYELRSLFYRIGLSGKENKEVRKILLAPLSGNSAWKT
GUT_GENOME258557_00078 222-302 SELHAGKIRAYIVFALAMNHQALTQKSASYRKVQEENEKFAMRVYLNRIGFIGDEFKSCRKHLYEHLDGNAAWRYGSKENC
GUT_GENOME077171_01974 127-207 VKFPWFTTGEPEDAEAYSQFVTALCKMAKEQKRINHKPCTTDNEKFSFRCFLIRLGFVGKEFWQTRKVLLRHLTGSSAYRF
GUT_GENOME093517_01255 205-288 IEYRMFNGTMNDLEVKAYIQFALALTQSALSLDRCSMNKPTNITNDKYAMRLWLNRMNLIGDEFKDCRRYMTKALSGDGAFADR
GUT_GENOME070475_00556 227-310 IEFRLFNSTLHAGKVKTAIQFALAIMELAKSKKRADPTPAQTDNQAYLMHCFINQLKLKGKEFETCRKFLLENLSGNVAWRTSS
GUT_GENOME194868_00203 170-257 DETISFPWFQGELTSDEVKAYTHFVTALCETAKTQQRVNATEKQVENEKYAFRCFLIRLGFVGSEYKADRKILLKNLSGNSAFKNGAP
GUT_GENOME282369_01402 250-330 CFNSTLHAGRAKAYVNLCLAMSAQAIAQRTSVMRKTVSDNELFTFRVWLVRMGLNGDEFKNTRDHLLANLQGNRAWRYDKD
GUT_GENOME236222_00653 227-313 IEFRLFNGTLHAGKVKTAVQLCLALVGCAKEKTKATAKVPDVVRQNPAFAMRTWLNQLGLMGEEYKTLRHFMYLELPGNAAWRYGSP
GUT_GENOME066840_02472 241-319 TDHGIDGEMNAYMQLVSGIAKRAKMLTRVTATESPSDNDKFTMRLFLVSLNFKGAEYAFARKFFLRNLSGNSGWRTEEA
GUT_GENOME039536_02011 197-280 LETVLDEHDPTRWQVHATLLNAMLKHAKAAKRVFLKADANPENEKYRANSLLTRLGLGGPEHKELRRVLMAHLNGYAAFKSAAD
GUT_GENOME089520_02382 141-227 TLKFPWFTLHNLDGEADAYNHLITAICKMAKEQRRVTAVERPTDNEKFTMRIFLIRLGFVGDDYKKARKILLRNLSGNSSWKSGHRP
GUT_GENOME083382_02312 141-218 FPADSDETSRNAYALFIEKLIAFAEQRTRVTAEDKPCDNPKYAFRCFLLRLGFIGAEYREERAVLLRNLQGCSAFRSG
GUT_GENOME000598_01080 153-238 KETLEFKFIKGFENAEIASQFAEALNESSKKFKHSSPKERQTDNEKYTFRTWLLRLGFIGDGYKEARKELLKNLNGNGSFRNEKEG
GUT_GENOME184139_00009 134-204 DDALAYQQFVDKLVQYAISHQRIMSEPHEESNEKYAFRCFLLRLGFIGPKFKDQRKVLLRNLTGSAAFKNQ
GUT_GENOME286223_00163 206-287 FPFDESQPDRWTTYAGLLNRIFDAAMKATRVRPDRVEPDTENEKYLAHVWLQRLVYSGVDSKAERKILLGHLKGYCAFKNGM
GUT_GENOME116636_00036 190-272 VAFPWFQEQPPEDERKAAIALISHLGAFSKQAKRVTAKAKPVANEKYAFRCFLLRLGFIGKEWKDERKTLLQNLSGSSAFRDG
GUT_GENOME087594_02890 253-336 ETVSFTGFGEAADVDHLRAFGHLAVMMNNQALHQKRIQAKAVDAANEKYAMRIWLVRIGMDGEEFKQTRKILMENLTGHSAFRT
GUT_GENOME112248_02004 214-297 VEFRCFNSCKHAGKVKAYIQFCLAISAQAINQRSSRYQPMDRANDKYTFRTWMLRMQLIGDEFKTCRLHMLANLEGTSDYKDRE
GUT_GENOME106415_00252 252-328 FNATLNTAEVLADIQLCMALSARAKAIKTASPARPETDNPKYTFRCLLLRIGMSGDAFKTAREQLICRLPGNSAWRM
GUT_GENOME089924_01073 149-259 NMCNILQSRSELIAQALRLQEELRFCIAEDLAFTVMLDAFAIPETEACLYLLRQCYGMAAATGKARMKPCDGSNPKFQMRSWLLRLGFIGEEYERPRHTLLSGLEGDSAFF
GUT_GENOME177412_00335 127-211 TVSFPWCESITPETAREAVIPLVARLCQRAQEATRIRSTPPAPGNDKYTMRCFLLSLGFIGPEHKQARRILLAGLEGDAAWRTPT
GUT_GENOME219371_01520 207-294 GTIEFRMFNSTLHAGEVKSYIQLCLAISHQALVQRGASRIKTQPENEKYTFRTWLLRLGLIGDEFKTARQHLLKNLEGNIAWRDPAQA
GUT_GENOME211785_00332 134-220 DHGYLAFDLYDSQPEQKELDACTDLLNALYTTAREQVRVTAKAQADIDNPKYAMRCFLLKLGFIGPEYKETRSLLLSRLPGNASFKS
GUT_GENOME208963_01461 130-211 FPWFTLHDPSDGDAYIKFISMLCAFVKERKRVNDKPDTSDNEKYAFRCFLLRIGMIGAEYKAARKVLLRNLTGSSAFRHGKP
GUT_GENOME238320_00231 162-247 IKFKGFPFNENQDEVKAYMELAALINKQSLKQNRVKHEMINVDNEKYAFRVWLIRIGMKGKEYKNIRKILLENLPGNSAFKTEEQA
GUT_GENOME031344_01166 156-249 DTSQGKENGVISFPYFKSTLNKKELLSDIQFAQIVSSFAENNRTVSQKKSENQNDKFMMRTWLVRAGMVGEEYKFARKMLTKNLEGNSAWQKMM
GUT_GENOME267276_01082 157-243 FEEDGISFCGFPVSTDSETVKAYMDLASLANKMALAQKRVQLDTSDDENEKYAFRGWLLRLGMKGDDYKATRKILLENLDGNAAFRT
GUT_GENOME171359_02530 177-266 DSSTIIFKFFQAERSPEEIHAYTQLVGLLNQTAMKLKYTSPKGKDTDNDKFTFRLFLIRLGMVGEEYKTTRKILLAKLDGNSAFRDGKKP
GUT_GENOME099640_00985 124-208 KTISFPWFETVPTAEEISAYTTFFTLLMKHAKEAKRITGKDHAVENEKYYFRCFLLRLGLIGAEYKDARRILLQHLEGSSAFRFL
GUT_GENOME264154_03625 181-271 FSLPLDAFSIPAIEACACLLVQVCAMAASTRKARMKPCNMSNPKYQMRSWLLRLGFIGDQFARPRQTLLEHLDGNAAFFDDAGRQKAKEKR
GUT_GENOME205235_01645 153-236 LRFAWFQRELIPEESAAYTLFFEALCKTAKAQNRVSMKSSEVVNPKYEMRCFLLKLGFIGKEYAPARKLFTVGLPGNAAFKHGP
GUT_GENOME045301_01097 203-285 TVEFRLFNSTLEPDRVQAYLQFTLALCKQAMIHKKAVMKKTRIENEKYAFRCFLIRLGLNGDEFKNCRKVMLENLTGDSAWKG
GUT_GENOME081981_01803 149-233 IAFPWFNELPDAAEAKAYTEFISLLCKLSKELKRTSSKETPVTNEKYAFRCFLLRLGFIGSEYKESRRLLLKDLSGNSSWKNGAP
GUT_GENOME193555_00703 34-140 LFAVLEAKHSLLSKAIPGFSIEKDGILILRSVFSEEEQSAYYLFSEKLYQMIGNRKWVVPKRKKPELAYSEKYLFRIWLNQLGMRGRAFAVTRKLLLENLEGCSAYS
GUT_GENOME054703_00272 155-230 AAPQDSKTIDTYSRFVCALCKLAREQKRVTSKAKEVANEKYAFRCFLLRLGFIGTEFKETRKVLLKNFKGSSAFKG
GUT_GENOME099534_01632 160-245 EDAILFTGFPYTEEPTAVKAYTVLASKMVQSAGMQKRISSKPTIEENEKYYMRSWMVRIGLGGKDTKEERAFLLGGLKGHTAFRTP
GUT_GENOME257457_01962 241-314 SLHAGEVRSQIVLALAISNAAMTKKYCSPHVSQSDNMRYSFRVWLLNLGLIGEEFKNCRSYLLKHLEGDIAWRH
GUT_GENOME103528_00187 307-389 IVFDGFPDRVDADHVKTFTQLVSCMARTAKKQKRVVARETEAENERYSMRTWLLRIGMNGPEFKEARKHLMENLTGHTAFRTA
GUT_GENOME069554_01754 216-295 TTHAGEGKSHIVLCLAIAALAKNAKCASTKNQRTFCAESAKYDMRVFLLRLGLIGPEFKNVRMHLLKHLPGNTAWKCPQD
GUT_GENOME105360_01989 220-303 TVEFRCFNSTTNAGKVRTYVRFAQAVTAQALNQRSAAVHKTQSSNECYTFRTWLIRIGLNGDEFKVPRKYLLENLEGAKDWKDK
GUT_GENOME274355_01867 173-241 MLKAYMTLMCMAFAKAKESTRVQAKLIHPDNEKYYMRSWLIRLGLGGKGRAETRKALLKNLKGHSAFRT
GUT_GENOME246010_00484 161-245 EEGIVGFPIYKSTLNFSELEGNIQLAEGISGYSENTKRISGNENTSANEKFVMRTWLVRIGMVGEEYRATRKQFTKELRGNSAWL
GUT_GENOME011046_01442 173-236 WMLLMTKMVAFAKAAHRVCPQRQQPEAEKYFMRGWLLRMGFGGSDFKAARQALLKNLKGCSAFP
GUT_GENOME283020_01486 140-219 FTWFLYTADGDEIAAYTQFISGLCDMARKAKRVSSKSTETDNDKYTFRCFLFRLGFIGKEYKTARKILIRNLMVNSVLRY
GUT_GENOME237197_01288 166-251 SMKFPWFKKMPDSDECQAFTDFIGKLAEHSKKHKRISSKVKKTENEKYAFRCFLLRLGFIGTEYKKQRKILLSRLNGSSAFKIGKP
GUT_GENOME182432_02265 129-217 ICFPWFPLTEDSERTSAYAGLIAALCTTAKEKKRVTAKAQDGFENEKFAMRVWLIGLGCVGVEYKYLRKIMGEFLGGNSAWRHGKPEKD
GUT_GENOME237512_02103 140-236 RSGGARATRGIEFLSDKVLFTGFPEGRTEAEFAAFQDLANGMAASCETAAWVKADPVQTINERYTFRGWMNSIGMGGSEHRETRRILMQHLNGNAAF
GUT_GENOME236864_02506 340-418 PYEQDSIRWVFYSQLISACIKAAKAAKRVLPRRLDSATDKYHANAWLNRLGFGGPEYKELRRTLMGHLNGYAAFSAGRR
GUT_GENOME020004_01741 152-238 TIWFDWFDEMQDAEKAEYYFMFFKALYQMAEKAVRVTAKEKPIENEKFAMRTFLNRIGLSGKEYKPLRKELMKNLSGDGAFRYGRPE
GUT_GENOME280216_00701 191-271 SIPWFKFDSNDTNCMAYAFFIEKMISLARSLKRVNAREKPVENEKYAFRCFLLRLGMVGKEWKDARKVLLKNLSGSSAFKG
GUT_GENOME183613_00020 204-295 KRTIEYRFFNGSTHAGEVKTAVQLAILIALRAKSAKASSARNPRPYNEASAKYDLRVFLLRLGANGDLFKTMRFHLCKNLPGSAAWKDGRHD
GUT_GENOME023924_00235 156-230 GFPMEAESTISFAELTCMMAERAKEMKWINPAETIEANEKYYMRIWLIRLGLGGKGGKKTRDLLLKNLKGNTAFR
GUT_GENOME000598_01013 154-241 EKGTFTFKLLGENLSHEKMSAFMELASLINENAKRLKHTSFKQSQEDNPKYAFRTWLTRLGMNGSHYKDIRKTLLSNLEGSGAFRKVP
GUT_GENOME142488_00302 119-203 ITEETVSFPWFGQLEPEEATAYTHFIDKLVKLAMKLQRVTCYSREYTNERYSFRGFLLKLGFIGKEYKTDRRILLQNLSGNASYL
GUT_GENOME096876_00034 158-246 TEDEIVFEAFPFLENGEDMGAYAKLASSMCATALDSKRVNPAETIAENEKYYMRIWLLRLGLSGAEGKKTRKVMLTNLKGHSAFRTPED
GUT_GENOME173984_00343 217-296 FSTLPETDDPAVLRTFTTLCAMMNKQVLSQQRIQAKEITEQNEKYAMRIWLLRLGMNGPEYKEERRILMRNLSGHCAFRT
GUT_GENOME260696_01053 190-274 FDGFGVAQDAETVQTFTKLAAAMNKMAITQKRVQAKDVDDSNEKYALRIWLIRLGLNGADFKADRKRLMTPLSGHTAFRNDEERE
GUT_GENOME014928_02976 152-234 IEFPWFERTDDEEERAAYTLFIERLAELSNRLKWAASTEKDAPNEKYAMRCFLLRLGFIGAEYKRARAVLLRNLQGSSAFRDG
GUT_GENOME096500_00336 151-225 GDIQTSSEFLSLLIKKAKELQYTSSKPIETDNDKYTFRTWLIRLGMIGPEYKAHRKTLLSSLTGSSAFRNGLPAN
GUT_GENOME000255_01896 408-495 ITFRFPYTEDAVKVKAWTDLATAMVKQAREQKRIDPEERIEENEKYYMRIWLLRIGFGGKDMKDSRNALMVNLKGHYAFRTQADIDRA
GUT_GENOME018613_00235 130-214 GKVCFPWLRSDASPEEIRAFDRFICALCEMARNAKRVTAKVHPNDNAKYAMRCFLLRLGFIGEEYKETRKILLRNLAGSSAFRHG
GUT_GENOME239875_00034 160-243 TITFKLGQDGDDPDKVEAATQLLALINKSARGLKRNASDKVKSTDNEKYTFRTWLLRLGMIGNEYKTARKVLLKHLSGNSAFRK
GUT_GENOME158031_01167 196-284 GMIEFKLFRATLQSETVEAYIDLSLGISALSMTQRSAMFQEAKPVENEKFAMRTLLVRLGMIGEEFEESRMILTKPLEGNGAWRYGKKR
GUT_GENOME220580_02156 125-216 FSRKCMTFPLFYRSAEERKAYQDLSKNLINTAKKRKWFQQKKSSSEKEELPKSDKYAFRVWLNQLGMKGKNYAASRKLLTQNLSGNTVYSNE
GUT_GENOME096236_02227 129-211 TIRFPWFEHTPTPEITNAATRLICRMIEMAKKQKRVTAREKQTDNEKYAFRCFLLRLGFIGAEYKEVRKTLLSNFSGSSAFKS
GUT_GENOME032997_00276 206-295 TVEFRWFEATLHAGRIKAYLQFCLAVAAKALNGRAASSRKRDFDPQSAKYDFRVFLLHLGLIGDEFKTARKHLMANMPGDAAFKNGRPKP
GUT_GENOME068621_00289 208-278 AYSKVIIAMADRAKTQRNASATPLSPEDSEMKYFCRSWLIQLGFGGPEHKEERRVLLGHLHGFAAFRTADK
GUT_GENOME255728_01724 162-234 SSLDPDEARATLQLALAVAAQAVNQRRINAKVVYPENERYAFRCWMLKMGLAGDEYKPMRRRFLKRLPEDNAT
GUT_GENOME192643_00468 220-305 EEISFPWFTLSGIDGEAEAYAQFITYLCKMAVEQKRVQDKPYDGDNDRFAMRIFMVRMGMKGKQFDLARKLMRKHLTGNSGWRYED
GUT_GENOME276006_02404 126-212 VVTDTTVSFPWFPFTAEPDEVNAYSDFVTKLCEMARRQKRVVAVVVETGNDKYAFRCFLLRLGFIGDEYKIARKVLLKNLTGNSAFR
GUT_GENOME096572_00224 163-242 IPADAGAERCQATTHLIEKLCHMAKTVKRVNATEHAVTNEKYSFRCFLLRLGFIGPQYKVDRKVLLHNFTGSAAFKNGKA
GUT_GENOME103689_00156 117-204 LEVTDEKVSLDWFKEIDPKESEAYQLFLNHLVDYAKKRTRVITQPREYENPRYAFRCFLLQLGFIGPEYKEARKILLSKLHGSCAFRK
GUT_GENOME009414_02274 169-250 FPQTENIMEYCRLASAMVKRASEQKRVNPKQTIEENEKYYMRAWLVSLGFSGNEGKETRAFFLKGLKGHTAFRTPEDAENGK
GUT_GENOME178919_01096 161-247 EKHTLAIKLLKDNPTSDEMAAFLDLAACMNENARKLKYSSFKPAQEDNPKYVMRTWLLRLGMSGDAFKKTRKVLLARLSGSAAFRTP
GUT_GENOME176797_01263 132-214 VEFPWFDVAPDPQVVEATTVLIARIIDHAKTATRASAKPAETGGNDKYAMRCWLLRLGLIGDDTKPVRRTLLKHLDGNAAWRT
GUT_GENOME122111_00366 148-239 VSFTEDKVLFRFFFEGRSEKVRGIIEMLGLAFQSAKKSKRISLRKKEPENEKYYMRMWLLRIGMGEKRYHESRMELLKGLKGYSAFPNEEKA
GUT_GENOME074774_00585 181-264 ITFAWLHGTITDETAKAYAEFISKLCTMARTQKRVTAKEKIVDNEKYAFRCFLLRLGMIGNAYKQSRKILLQNLTGSSAFKSGH
GUT_GENOME286597_00244 128-206 FTLTKGNQADETAAYTAFITKMTERVRKQKRITAKEHPVESEKYAMRCFLLRLGFIGDEYRTARKILLRNLTGSAAWRG
GUT_GENOME025984_00201 175-241 MTYADLLNRMITAAQEALRIAPTLLTPENEKYAMRGWLLRLGYGGSDLKARRRILLENLKGYAAFRD
GUT_GENOME126431_00587 158-247 EADKITFTGFPLTDDSDLVTAFTTLAGKINILALESKYIRVKKTPVDNEKYTFRIWLVRLGLDGTEYKTTRKLLLSHLSGHSAFRTEEQK
GUT_GENOME096866_01153 130-207 CFYATLDADTILAYIALTVKLCDLAKTLRYASPVERPVENQKYAFRCFLLRLGFIGKKYKTERKILLAPLEGNTAFKN
GUT_GENOME008726_00604 164-236 DPDCNKAYAQLASAMIAKAKEASRIKSDEQKPENEKYYFRSWIIQLGFGGADFKTTRKVLLQNLKGYSAFRTD
GUT_GENOME048295_00180 146-231 TIQFPWFKLDLSAKEDEKLACILFLNKLMAFAKGAKRVTAKEKDTDNDKYAFRCFLLRLGFIGDEFKGARKILLSRLSGNSAFKSK
GUT_GENOME227393_00087 300-377 KNGLHAGQLKSYIQLCLALSEMAKGLRTASPKPQQTENPKFAMRTWLIRLGLVGEEFATARTFLTRNLDGDAAFRFGR
GUT_GENOME081723_00101 137-210 YIPEPEETKAITVFIAALCRLAKRRKRILAKDTPIINEKYAFRCFLLQLEFIGKGYKDIRHELLKRLSGSSAFL
GUT_GENOME244790_00926 140-227 EKDKITFPWFSKIPEPDEVTVYTQFIAALCQMSINQKRINSTEKATDNEKYTFRCFLLRLGFIGDEYKQSRKILLRNLSGSSAFKSGA
GUT_GENOME126721_02151 1-56 MKSYVQLCLVVSNQALAQSGASFKHANTDNDKYAMRCWLLKMGLIVDSAVRANGLI
GUT_GENOME024417_00364 366-441 VEFRCFNSTVDTQMINTYILMTLAICNQAAHQTRASCRESKSNDRVAMYNWLAQLGLVGPQFEALRHHFLGNLSAG
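
For downstream use, each row of the list above can be split into its three columns (Protein, Range, AA). The following is a minimal Python sketch, not part of the database itself. It assumes every protein identifier has the form `GUT_GENOME` + 6 digits + `_` + 5 digits, which is inferred from the rows shown here; the names `ROW`, `Hit`, `parse_row`, and `is_consistent` are hypothetical. The pattern tolerates rows with or without whitespace between the columns.

```python
import re
from typing import NamedTuple, Optional

# Assumption: IDs look like GUT_GENOME<6 digits>_<5 digits>, as in the rows above.
# Whitespace between columns is optional, so fused rows also parse.
ROW = re.compile(r"^(GUT_GENOME\d{6}_\d{5})\s*(\d+)-(\d+)\s*([A-Z]+)$")


class Hit(NamedTuple):
    protein: str  # protein identifier
    start: int    # first residue of the matched range (1-based)
    end: int      # last residue of the matched range (inclusive)
    seq: str      # amino-acid sequence of the matched range


def parse_row(line: str) -> Optional[Hit]:
    """Split one row into (protein, start, end, seq); return None if it does not match."""
    m = ROW.match(line.strip())
    if m is None:
        return None
    protein, start, end, seq = m.groups()
    return Hit(protein, int(start), int(end), seq)


def is_consistent(hit: Hit) -> bool:
    """Check that the sequence length equals the inclusive range length."""
    return hit.end - hit.start + 1 == len(hit.seq)


example = parse_row(
    "GUT_GENOME126721_02151 1-56 "
    "MKSYVQLCLVVSNQALAQSGASFKHANTDNDKYAMRCWLLKMGLIVDSAVRANGLI"
)
```

The range-versus-length check in `is_consistent` is a cheap way to catch rows where the identifier and the range were split at the wrong digit.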