UHGP-MC 55945


Information


Number of sequences (UHGP-50):
78
Average sequence length:
95±13 aa
Average transmembrane regions:
0.04
Low complexity (%):
1.72
Coiled coils (%):
34.75
Disordered domains (%):
4.05

Pfam dominant architecture:
PF03432
Pfam % dominant architecture:
897
Pfam overlap:
0.09
Pfam overlap type:
shifted

Downloads

Seeds:
MC55945.fasta
Seeds (0.60 cdhit):
MC55945_cdhit.fasta
MSA:
MC55945_msa.fasta
HMM model:
MC55945.hmm

Sequences list (filtered 60 P.I.)

Protein Range AA
GUT_GENOME129509_00961249-332DGRISLLIDIQNNLKAQQSAGFTHWAKLNNLKQAARTINFLTEHGIESYARLTEKLSAITDTRNCVHAATKAIEARQADLALVM
GUT_GENOME082770_02550286-373DSQVKRLIDIESKLREGKGRGFVVWAERNNIDAKAQSVIYLKENHIHSYAELKERIQTLRSARNAINASIREKQNRMKEINRQRQAIR
GUT_GENOME249465_00329284-370LIDIRAKMQAGKGAGYARWAKVFNLKQMAKAMMFMEEHGIKSYAELKEKADGISEKCDALLESVKADEARMSEVSVLRKHLINYAKT
GUT_GENOME141007_01026264-363VRKNKKINPTNQPLSVMVKLDKNKKVIENKGYEQWAKIHNLKHGAKTLNLLKEYGITSIAEWEEKKISHQEAMNICTSELKAIEKELMHTQSMLKQLVIY
GUT_GENOME002968_01581279-373KRNVEFLIDIQKKLAEGKGGGYAQWAKVFNVKAMAEALMVYEREGFQSMEELDAKVYAVTAEFDATADLLKNTEAQLRETKAMKQHILNYGRTRE
GUT_GENOME166268_00357283-383TAPKRTGRKVNLLIDIQAKLAAGKGAGYERWAKIFNLKEAAKTLNFLMENGLADYDELALRAAQAEDGFNAASQKIKQLEARMAGVAKLKTHIIQYSKTRE
GUT_GENOME053505_00942463-557VDIEQKMAEGKGRGYERWAKIHNLKQAAKTLSVYQQYGFTSPEQLEAAVDTAYQKMRQTSGELKALETKLQGKKKLQRQVLAYAQTKAARDGLRA
GUT_GENOME210185_01433332-410IGRVVDMEHNEKVKTSAGYEQWAKIHNLQEQAKTFNFLTENGLLDAEKLDDILAEVTADFKQKKEEMKVTETRLKEVNR
GUT_GENOME082723_00185414-539GQREHKPPAKQRSRTNARPFNLVIDIQSKLQSKGVGYQRWASVYNLKQMSKTLLFLRDHKIESMEQLDQMVMQQVEKRDVLLTSIQQSEKRLAEIGTLKKHIINYSKTRSTYEEYRKAGYSKKFLE
GUT_GENOME142203_02817247-357LKERITENQTIKTPPVKKRIGNVIDMNTNVKVKESKGYEYWATKHNLNTMAESVIYIREHGIKSVKQLDEYIQKVADERQNLQDKIKIIDKEMQELSTTMEKVHTVKKYKQ
GUT_GENOME249253_01352271-357KQFQMLIDIQAKMAEGKTVGYEKWAKKFNRKEAARTVILLKEKGLGNYDDLTAHIENLSARFDALSDSIKVAEKRMVEVQALQQHIK
GUT_GENOME089192_00110276-363QVSLLVDVQAKLQAGKSTGYANWAKKFNLKQMAQTLNWLSDHGIQDFAELSAKVDAASAERRDLLKQARAAEKRLNEISTLRKQVVTY
GUT_GENOME116322_01903294-393QKHHPKTKPIQQNELISKAIDAQRMEAKGKGYTYWAKNFNSKQYAKSILFLTENHLSIDDLLCAVEEACTKADALTAELKKVEAQIASTSQLIDQVRVYL
GUT_GENOME001479_03178113-190ERKISLAVDIQAKLAAGKGPGYERWAKVFNIKQMAAALAYIQDNGLTDYEQLAQKATESADRFHAISEQIKQTEQAMK
GUT_GENOME114765_01135308-420LSQNIQLTAIIQPSSEKKPDKIRKLVDIQANVAAGKGIGYERWAKKFNLKRWSQTLCLLQEKKLLSEDALNQRIVELKTQHDDALAVVKDMDARMVSLKELRGHLVVYRQYKP
GUT_GENOME083293_02302364-462LRLVVDLQNNVKASQNRAYAQKVKISNLQQMAKTIAYVQEHHYDTRESLQMSFEEISEKWKESRKAVKSTEEKIKEINEQIHYAGQYLANKPVFAQMLK
GUT_GENOME054087_02275272-360RQIALLIDIEKKMREGKGRGYQVWAERHNLDAVSQSIIYLKENGINSYEELMSRIADGTKRRNQLKDSMKTCQTRMKAVSEQRKAVLTY
GUT_GENOME221887_02412506-593VQRMVDIAAKKAEGKGQGYEKWATMHNLKQMAATLAAYHQYGFSSPEELDEALTAAHADMQESLAGLKALEATIADKKELRRNLLSYI
GUT_GENOME096561_00112235-343IRERIAGGASRKPQRDDKGVNLVIDIQNSIKAQQNEGYKRWAKVFNLKETSKAVNYLTEHNITDREQLARIVKDAATKFDKTAADLRGIEKRLSDVLLTMKHTENYRRM
GUT_GENOME130400_00460132-226VYIAKSKPLREDKKIRLVVDIENSIKAQQSAGYERWAKIHNLKQAAKSMNFLTENKIEYYSDLESKIADIMTAHDAAARAVKEVEQRMSDLSLLI
GUT_GENOME049215_00568280-357LLIDIQNNIKAQQSAGYKHWAAIENLKRAAETLNFLTEHGIGSMEDLSERCDGAAAATARVKADLRATEKEMERLTLT
GUT_GENOME225781_01301270-358KIIDTSSDKIQSSKGLERWANIQNMKEASRVINILTSQGLSSTDEIENKAISNFTERVNLVNNLNSLKNSIDDIDEQIKNLRLFRKYKP
GUT_GENOME022773_02509371-467KTEKKVDTVLDIQSIIAKNKGPGYERWAKLHNIKAISKTLIFLSEKGLGDYEKLSEAAKEATDKFDYLSARQKEIEARLAEIKALRQHIFNYSKSRK
GUT_GENOME153261_01038244-338KRKPIIRDNRISLIIDIQNCIKAQESKGYEHWAKINNLKQASKTLNYLTEHNINSYSELESRIDETCKQFDDTADKFKSVERNLNDINILRKHIS
GUT_GENOME029618_02711263-359KKRTQATRKKNTLLIDIEAKLQAGKGGGYERWAKVFNVKQMAQTYNYLREHGLLDYAELEEKASAATEQFHALSAQIKAAETRMAEIAVLKTHIINY
GUT_GENOME244353_00340243-353IKERINEAVKNRNNPNKKRVNNVINLSTSKKVKTSKGYEFWATKHNLKAMAETIVELRNLGINSQAQLEKLIQETANDRQNIMDNIKNIEKKMSELSNDMENVHIINKYRE
GUT_GENOME211446_003331442-1555LRERIKGTRTRTVKAPKQQKNGISLLIDIQNSIKAQESRGYEQWAKIHNLKQAAKTLNFLTERKIEQYADLTAKIAEITAANEQASDSIKAVEKRLADMAVLIKNVTTYQKTKP
GUT_GENOME004611_02910302-417IKERIAGTRTVKFRHASSKKPVSKVGLLVDIEAAIRSGKGPGYERWAKVFNLKQLSQAVLYLKEHGDMGYEDLLEKANAATTNFNTLSVQIKDLESRMNANAELQKQIVNYAKTRA
GUT_GENOME273345_01244138-216LQLVVKLQDCVKAQQNKTYTQKVKLTNLQQMANTLIFLQENNYDSIEQLKTNYANALSSFNNSQIKRNTLNQEIVKLNE
GUT_GENOME177963_00573134-264IKAVLAGKVEHHPRQKQLPKEQPFQLLVDIQAKLAEGKGGGYERWAKKYNLKEMSKTLIFLQEQKIGSADELKELSDAALSHYHELGDSIKAAEQRMTEIAVLRAHIVNYAKTRPVYDAYRKSGYSPKFLE
GUT_GENOME172983_0078229-120IRQLIDLQNDSRVQESGGFAHWAKLNNLKQAAKTLNFLMDNNLTSHAELIAKIKELSAASEQAAGSIKELEKKLNHMAVILKQLNAYRQTRP
GUT_GENOME157111_02841111-195EKRKFDLVVDIQEKMAQGKNGGYVRWAKKYNVKQFAESILFLQQHDIHDKETLDALVDGSAARYHELMKIIKDAETKMAENKVLK
GUT_GENOME207233_04047279-367LIDIEEKLQQGKGAGYEKWARTFNLKESAKTLIYLQERGFDDYDVLSAKAEEAGQRFDMLSDKIKSSERRMSEISSLQKQIGTYRKTRD
GUT_GENOME131383_01610269-363RRTLDLPINIQEKLAAGKGAGYERWAKVYNLKAAAKAILFLQEKDIHTMEQLRKTTEDITHHSHELLDSVKQSEKRLTEIAILRKHIVNYAKTRE
GUT_GENOME011047_00855302-433IRAVIAGERPIPTLTDEAPAPARRVNLLIDIQERMAQGKGPAYERWAKVYNLKQMAAALQYLREHDLMDYEAMVASTEAAVERFHALAGELRETEAALEKTAGLMAATVSYAKTRPVFDGYKAARYSKKYLA
GUT_GENOME063323_00352290-386KPARKVNLLIDIQSKLQAGKGAGYAQWAKVFNLKQMAQTISFLEENGLLEYDALAARAAEGTARFNELSGTIKRTEGRMAEIAALQKQIVNYSKTRD
GUT_GENOME257493_00918274-365KKPDTIVDMKNNEKVKSSKGYEVWAGKHNMKTMASALNEMRKSGVNSYEELDLKLKKVASDRQQLLDKIKHIEKEMKSIYSVIENKNTISKN
GUT_GENOME174044_00734403-494EQRPSLLIDIQAKLSEGKGAGYERWAKVFNLKQMAQTINFLKENGDMDFDELSRRAEAASEKCSALTEKMKATEARMSEVEELKKQVINYLR
GUT_GENOME252709_01135283-381HPEPKQTVSLMVDIQEKLRAGKGAGYQRWATGFNLKQMAQTMNYLSEHKLFDYKVLSEKTAAATARYNELSSQIKAAEKRMTEIAVLRTHIINYAKTRE