UHGP-MC 5415


Information


Number of sequences (UHGP-50): 179
Average sequence length: 106±10 aa
Average transmembrane regions: 0
Low complexity (%): 2.57
Coiled coils (%): 0
Disordered domains (%): 0.2

Pfam dominant architecture: PF01134
Pfam % dominant architecture: 2905
Pfam overlap: 0.26
Pfam overlap type: reduced

Downloads

Seeds: MC5415.fasta
Seeds (0.60 cdhit): MC5415_cdhit.fasta
MSA: MC5415_msa.fasta
HMM model: MC5415.hmm

Sequences list (filtered at 60% identity)

Protein Range AA
GUT_GENOME189838_00080 331-455 QLRAEKANSETCNLVGCQTKLTQGEQARVFRLVPGMENAEFARFGSMHRNTYVNAPDVLAPDLSLRARPGVYLAGQITGVEGYVESAASGLWLALLLNARARGLELPHPPVESALGGLLNHLRTP
GUT_GENOME134442_00025 295-412 KPRHLLFLEPESKYNDSIYLQGFSTSMPIDVQEEMVHSLPGLEHAKIMKYAYAIEYNAIRPLEFSASLMSYKVKGLFAAGQVIGTSGYEEAGALGLMAAINCVRFLRNEDPFILRRDE
GUT_GENOME181191_01103 323-424 LSGMSSSMPEDVQYKMYHSMKGFEHVHIVRNAYAIEYDCIDATQLRSNLEFMNIDGLFAGGQFNGSSGYEEAAVQGFMAGVNAAMRILHRDPVVLDRSEAYI
GUT_GENOME170543_00301 304-403 GISTSFGADVQDVWIRTIPGLENARIARYGYAIEYDAIDARALNATLESRDRPGIFFAGQINGTSGYEEAAAQGIVAGANAAARALDLQYLMLNRTNAMI
GUT_GENOME231660_01055 315-428 EGKLYNIVGFQTNLKFGEQKRVFSMIPGLENAEFIRYGVMHRNTFINSTKLLDKTLRLKNRDNIYFAGQITGGEGYVTAIATGMYAAINVANRLENKKEFILEDISEIGAIVNY
GUT_GENOME040305_00334 207-319 NKQNKKFVLNNAQTFLNETQQAQLISFIPALKNATISSFVEAISCTRILPICVNNHLQCTRNENIYFAGAVLGFDGELEAIASGHLAAINLACKMFGLKEINYPNDTLCQKLC
GUT_GENOME237872_01494 307-413 ANEYYINGLTTSLPFSVQEELIRSIDGLENAEIVRYGYAIEYDYVNPTELKHTLETKKIKNLYCAGQINGTTGYEEAGAQGIFAGINAALSAKGQDEITLRRDEAYI
GUT_GENOME064579_00818 272-380 GYAKIMTPFFDLETLRQVPGFEDARYVDPYAGGKGNSIRYLSVAERENTMKAAGIENLFCGGEKSGLFVGHTEAISTGSLAGHNAVRYLKGMKMLELPIGIAIGDLISF
GUT_GENOME231436_02161 296-404 GLDTHLIYPNGMSTSLPVDVQMAMVNSIPGLERTVIVTPGYAVEYDHIDPRALDATLALPAIPGLWCAGQINGTTGYEEAAGQGLVAGLNAAAHAQELAPLILDRATSY
GUT_GENOME012605_00416 270-393 ENTEGSLYNLVGFQTNLTFGEQKRVFSLIPALHQAEFIKYGVMHRNSFINAPKFLNKDFSVKGYDTLFIAGQLSGVEGYIESAMSGLVAGISLANKLNGAAPLNLSTKTMIGAITTYLVTPNVN
GUT_GENOME021751_00204 282-394 LVGFQTKMKYGEQKRIFQMIPGLEKAEFARFGGIHKNTFIKSPLLLDAYLRLKSKPHLRFAGQITGCEGYIESAAVGLMAGYFAAGQQLGTDPLPPPRTTALGSMLSHLIDDT
GUT_GENOME050739_00864 316-430 RESHRTKEVYINGFSSSLPEEVQIKMIRSLKGLENVRILKPAYAIEYDYINPIELKPTLETKKIEGLFLAGQINGTSGYEEAACQGLIAGINASLKIQKREPFILKRSDGYIGVL
GUT_GENOME051371_01354 283-391 DPQGSMYNIVGFQTNLRFGEQKRVFSMIPGLENAEFLRYGVMHRNTFIDSPKLLSEQFFMRTRPEIYFAGQMTGVEGYIESAASGIIAGLNMARAVQGKPYVTLPATSM
GUT_GENOME243297_00884 260-402 VVQLRAETADQRSYNLVGFQTNLKWKEQARVFSMIPGLGHAEFLRYGVMHRNSYVDAPRCLNKLRLNGRPDVWIAGQLSGVEGYVESCAMGLAAALELDALTGGRESVPWPEETATGALLARLADQTGKRFQPVNVNRGIFPP
GUT_GENOME236868_01096 261-410 VIQLRAENKEKTLFNMVGFQTRLHYGTQKEIFTLVPALRNARFARLGAMHRNTFIESPKLLDETLRLKPEVEKENGLPPTWFAGQITGAEGYTEAIATGWYSAWNMANTLLHDKTDELPEESCIRSLLHHLVSPNENFQPMNFNFGLLPH
GUT_GENOME000604_04378 315-410 QKLRKVPGFENVRYEDPYAGGRGNSIRLTAMVHRDNGLKVDGLANVFCAGEKAGIFVGHTEALATGTLAGCNAVRYAMGQKPLVLPEETAVGFAIA
GUT_GENOME232348_00139 180-287 ETARGDIMYINGLSTSMPEDVQDEMIASIPGLEHARVQKYGYAIEYDAINPLNLYKSLESKILKKFCSAGQPNGTSGYEEAAAQGLIAGINAANKLDNLEPLIIKRND
GUT_GENOME190496_00446 473-582 LYIQGFSSSMPEEVQIRMLRSVPGLRHAVMTRPAYAIEYDCIDPLALLPTLEAKQVSGLYGAGQFNGSSGYEEAAVQGFVAGVNAARKLKGESPFILSRAESYIGTLIDD
GUT_GENOME236868_01693 309-404 LNGFSSSLPADVQLKALHTIPGLSEVRVMQIGYAVEYDSIDATQLYPTFECKRLSGLYFAGQVCGTSGYEEAAGQGIVAGINAALKCQNAEPFILG
GUT_GENOME205886_00695 760-883 AENKERTLYNIVGFQTNLKWGEQKRVFSMIPGLEHAEFVRYGVMHRNTFLDAPRVLDAGLFLKEHPNVFFAGQITGFEGYMESAACGLLAARSIYARLEGRQLPPPPVDTMCGALIQYLTTENK
GUT_GENOME167470_01886 337-444 MYLQGMSSSLPEAVQNAMYRTIKGFENLEIMRPAYAIEYDCVDPTSMLPTLESKVIGGMYGAGQFNGTSGYEEAAAQGLLAGLNAARQALGKSELVLARHTSYLGTLV
GUT_GENOME269730_01279 314-412 VDGLSTSMPTYVQEEIVHSIVGLEHARFLKYAYAIEYDAIEPSQLEHTLQVRGYPGLYVCGQIASTSGYEEAAALGLMAGINAGLYLQKKEPLILKRDE
GUT_GENOME052351_01530 356-463 YNIVGFQTHLKWGEQERIIHMIPGLENAKIVRYGVMHRNCFICSPRYLRETYQFINRDDLFFAGQMTGVEGYVESAQSGMVAGMNMARYLQKQPLLIFPRETVMGSLA
GUT_GENOME250710_01557 344-444 RQEYYANGISTSLPLDIQEKMVHSIRGLEEAKIVRPGYAIEYDYVNPVQLRPTFETRRVDGLWLAGQINGTSGYEEAAAQGLWAALNMDAKIRGREPFLPS
GUT_GENOME190433_00213 293-403 LEQESVESDEIYINGFTTAMPPFAQEAMLKTISGLENAKIVRYGYAVEYNFVPAHQLKLTLETKVLDGLYTAGTINGTSGYEEAACQGFIAGVNAARKILGKKEIIIDRSE
GUT_GENOME051888_01826 292-405 LEPEGMQSREWYVQGMSTSLPEDVQWEMYRSVPGLRRCELTRLAYAIEYDCIDSRQLRPTLESLDVRGLFFAGQINGTSGYEEAAAQGLLAGMNAALTAQGKPPIVLTRDQAYI
GUT_GENOME051117_00301 254-371 IEEISSKFDLINQMQVFSSLNGFENSIIVKKADALDICFLNSRYVVNQFHQCFQNENIFFAGSILGITGYVDCIASGLYTALNVNKYFSDKQMIPLPRDSCIGMLAKRIVSSASIKPQ
GUT_GENOME005380_01435 284-409 LRAEDAHGSSYNLVGFQTNLTFPEQRRVFRLIPGLENAEFARYGVMHRNTFVDAPRLLDESLRLRSPEAERLGVPVHLAGQIAGTEGYCEAIRSGLHVAFAVAAELRGAQAPALPRETAFGALLAY
GUT_GENOME274062_01542 406-523 LEPEGRNTDEWYINGLSTSMPLDVQLEIIHSVAGLENAKVLRPAYAVEYDFAPPTQLFPSLESRIVAGLFCAGQINGTSGYEEAAGQGLVAGANAAAKVFGREPIVLKRHESYIGVLI
GUT_GENOME034498_00047 192-307 LEREGSETNEIYLGGLSSSLPTDVQEGMLKNISGLEDAHIMRYAYAIEYDYIPPQEIKYSLESRTVENLFLAGQINGTSGYEEAGAQGLMAGINAVRKLQGKEPVILDRADSYIGT
GUT_GENOME095735_00931 285-398 LRSENSQRTAYNLVGFQTNLLWGEQKRVFRMIPGLEEAEFFRYGVMHRNSFVDAPKCLDTTFKIPHTKIRLAGQITGTEGYTEAIASGLLAALNTYCDLMDHPSIQLPVTGAFG
GUT_GENOME238261_01853 538-631 GLSNSLPEEVQRELVRAIPGLEQADFAAYAYAIEYDCIDPCALDERLSLPEVPNLYFAGQINGTTGYEEAAAQGFYAGVNAALWQQGRGPFVLS
GUT_GENOME234136_00648 309-424 VMLEPEDSSCSVIYPNGLSCSLSRDVQERMVRSVPGLENAEFLAYAYAIEYDAIDSRELKHTLESKRINGLFFAGQINGTTGYEEAAAQGIMAGINASFLVNGNDPLVLSRQDAYI
GUT_GENOME022541_00493 277-390 EGTMLNMVGFQTNLKFGEQKRVFGLIPALKNAEFLQYGVMHRNSYVFAPLVLNEYLQIKNAPNVFVGGQLSGVEGYVESIASGLLAAINIFKYIKGENLLKLPQTTCLGGIINY
GUT_GENOME120890_01055 458-574 LEPEGADTHEYYLNGFSSSLPWDIQVAALQKIKGFENVQLYRPGYAIEYDYFPPTQLYHSLETKLIGGLFFAGQVNGTTGYEEAAAQGLMAGINAVLKLRDDEPLVLRRDESYIGVL
GUT_GENOME206077_00727 287-396 LRKENKNGTMYNLVGFQTHLKYKAQEELLKYIPGLENAKILRYGVMHRNTYINGPKVLNEYFQMKNNKDIFFAGQMTGVEGYVESAASGLVAALNMDASINEKAMIDFKT
GUT_GENOME018614_00701 335-445 EPEGEYTDEYYINGISTSLPPEIQKKMIQSVPGMENAVISRYAYAIEYDFLYPDQLNKSLQCKHCKNYFSAGQINGTSGYEEAAGQGLVAGANAALYAAGKPLFELSRTNS
GUT_GENOME018312_00907 280-388 GFQTNLTFPEQKRVFGLIPGLENAKYARFGLMHRNTYVNAPSCLDRTLRLKNDPNIYIAGQLSGVEGYCESAAMGLLAACYVYQRLSNDKVEYIPVNTLMGSLVNYLVM
GUT_GENOME116301_00422 1081-1201 DNAQGSVYNLVGFQTHLRFPEQKRVFSMIPALKNADFARYGVMHRNTYLRSPGMLDRYYRLIADDRIAFAGQMTGVEGYIESAASGFLAAVEMARRLEGETPVDFPRETAIGALGLYISDT