UHGP-MC 106867


Information


Number of sequences (UHGP-50):
97
Average sequence length:
78±12 aa
Average transmembrane regions:
0
Low complexity (%):
1.21
Coiled coils (%):
0
Disordered domains (%):
0

Pfam dominant architecture:
PF01074
Pfam % dominant architecture:
8660
Pfam overlap:
0.3
Pfam overlap type:
reduced

Downloads

Seeds:
MC106867.fasta
Seeds (0.60 cdhit):
MC106867_cdhit.fasta
MSA:
MC106867_msa.fasta
HMM model:
MC106867.hmm

Sequences list (filtered 60 P.I.)

Protein Range AA
GUT_GENOME008072_03388246-313MTVTGVGHAHIDTGYLWPVRETIRKCARTFANQLDLMEKYPDYVFGASSALHYDFVRQRYPELYQKVK
GUT_GENOME097742_022987-71VGNAHLDPVWLWRWPQGAAEIKATFLSALERMEETPGFIFTAGAVSYYEWVMENEPELFSRIEQR
GUT_GENOME096448_0239251-126MYLVPNAHIDTAWQWPFEETARDVISATFSRAVNALKSNPEYKFTMSASKHYEWAKEYYPEMYEDIKELIENGQWD
GUT_GENOME158229_011574-100VYLIGNAHLDPVWLWQWQEGFAEIKATFRSALDRMKEFSDFKFTSACGAYYMWIEKSDKKMFDEIVMRVKEGRWNLVGGWFIQPDCNIPSGESFARH
GUT_GENOME233832_020046-88IFTIATAHLDTVWRWELPKTIEEFIPDTFEKNFELIEKYPHYRFNFEGSFRYELIEEYYPEAFEKIKEYVAAGKWCVSGSAYE
GUT_GENOME017737_0024680-168AAAKSVRVSCIGHAHIDMNWLWAFDETVMVTIETFRTMLQLMKEYPDFIFMQSQASVYKIVEQFAPELLPEIRERIRLGQWEVTASTWV
GUT_GENOME000729_000264-98LHMIGNAHLDPVWLWDWREGSHENMATLYSAIERLEEFEDVVFTSSSAQFYEWVEEMEPEMFEKIREYVKQGRWIICGGWWVQPDCNLPDGESYV
GUT_GENOME011266_019555-81PTLFMIGNTHFDPVWLWTWEEAMASIRATFRSALARMEEEPDFRYSFATPPVFRWIEDTDPALFDEIRSRVADGHWE
GUT_GENOME011797_00659243-315DFTITCLGHSHLDLAWLWSFKESKKKALRTILNAIYLLERYPSFHYSISQPWQLEQIKELSPETFSLIQKYEK
GUT_GENOME208002_01163242-320RERLAPALSRPALPGAHRLTAVGHAHIDSAWLWPLRETRRKVTRTIANVLRLLDDGADMVFALPAAQHVAWLEEDAPDV
GUT_GENOME106476_0060986-149CLGHAHIDVNWMWGIDETVNITMNTWETMLNLMDMYPEFTFAQSQAYLYDLMAKYRPDLIKRIK
GUT_GENOME208530_00321227-307SAPANSVKVCAIGHAHMDLAWLWPVRETKRKLARTFSTALALGAKYPWYVYGASQPQAYAWMKEGYPELYSRIKKAVADGS
GUT_GENOME253394_0157671-156MAKEAKSYTLLFMAHAHIDMNWMWGMQETAVVATDTMRTMLDLMNEFPDFTFSQSQASVYRIVAETSPDMLEEIKSRVKEGRWEIT
GUT_GENOME063438_01030253-320VGHAHLDVAWLWTVQDGARKAVRTFASQLEMIRRYPGYVFGASQPQLYEFVRQRSPELFRKVQQAVAE
GUT_GENOME187564_0035349-129YRFHLIGHGHIDPVWLWNWREGLSVVMSTFQAALDRMEENPEFKVTTSSVLFYRWVAENDPEMFRKIRKRIAEGRWDVVGG
GUT_GENOME000609_0084419-122LVPALGINASATAELDSSFQLYMVPNTHLDTAWQWPYQHTADNYLRVMYKNQLSALESNPDYKFTTSASAHYQWIKDYYNEDNPIESQRYWERLKTLIENGQWD
GUT_GENOME096390_03917248-316LACVAHSHLDLAYHWTMGQTIQKNARTVLIQLRLMDRYPEFKYSHTQAWAYEMLQKYYPPLFEEVKRRV
GUT_GENOME140247_037979-88IGNSHIDPVWFWNWEEGMQEVKATYASALERMKEFEDFKFTSTSTLFFEWIEGILPEMFEEIKQRVAEGRWEITGGWFLE
GUT_GENOME000609_01006176-278ASAFTLPEGEEPMPEASRGKIYTIATAHLDTIWNWSYETTIREYIPKTMRENFQLFEDNPNYNFNFEGARRYDLMKEYYPEEYEKVKEYIADGRWYTAGSAWE
GUT_GENOME000609_0080338-124GSSAPYAGEDRMYAIATSHLDTVWVWDLETTIKQYIPDTLEDNFSLFERYPDYVFNFEGAYRYQLMEEYYPEEFETLKKYMDSGNWN
GUT_GENOME232724_00298219-299KAHTVHLVAHSHIDIDWQWDREDTYNVARGNFATMTAIMDENPDFRFSQSQPALYRFVEERWPRLFAKMQEAAARGSWDCS
GUT_GENOME022138_002166-99VHLICNAHIDPVWQWEWQEGISAAISTFQSAVDLSKEYDYIFCHNEVTLYKYIEEYAPLLFNEIGELIREGKWCIMGGWYLQPDCTMPSGESFV
GUT_GENOME096546_01445111-210LEGEKLLLPLQKEAKQYALILAAHAHIDMNWMWSWHETVASTVATFQTMLKLMEEYPDFCFSQSQASVYQIIEEYAPELKESIQKRIQEGRWEVTASAWV
GUT_GENOME157990_0027966-148AQAAEKSLMHLSAETKKLEVIMAAHAHSDMNWMWGYNETVSIVLSTFRTMLALLKEYPKFTFSQSMASTYRIVEKHEPEMLKE
GUT_GENOME018611_00532177-251DGLPVYATGHSHLDLAWLWPLRESHRKGVRTFATALANQKKYPWYCFGASQPQLYCWVAEQAPELFEALRRREAE
GUT_GENOME159434_002884-100VYLLCNAHLDPVWLWQRKEGMAEALSTFRVAADFCEKYDGFVFNHNESVLYEWVEEHEPALFERIKKLVQSGKWVIVGGWYLQPDCNMPSGESFVRQ
GUT_GENOME009236_0116343-139AITASAQADRKTLLFISDSHLDTQWNWDVTTTINQYVKNTLNENLALLDKYPNFQFNFEGAIRYKWMKEYYPSQYARMQQYIASGRWHVSGASVDAN
GUT_GENOME075528_01072248-334KEFKVHMIAHAHIDMNWLWDYQDTEDICIRDFRTICDIMDENPDLRFSQSQSCVYDIVQKKDPETFERVLSKIQEGTWEVTAATWSE
GUT_GENOME013055_000465-65IFCVGNAHLDVVWMWRWQEGSCEAKATIRSALDRMKEYPDFRFVCSSAAVYRWIEEFDPEM
GUT_GENOME128575_013457-85LYVVSNAHLDTQWNWTVQDTIRDCVKSTLEKNFELIEKYPHYKMNFEGAFRYKLAKEYYPDLYEKLKGYIAEGRWTVSG
GUT_GENOME199331_00090220-302ECNKTNSLVSCIGHTHIDVAWLWTYAQTKEKAQRSFATVLSLMEQYPEYKFMSSQPQLYKYLKQEAPEVYERVKQAVAQGRWE
GUT_GENOME201020_0125937-110AVCAAHLDTQWNWDLTHTIREYIPRILFQNLWMLERYPDYKFNFEGGIIYRWMKEYYPLHYARLQKYIDEGRWH
GUT_GENOME235153_011003-88NTEKKIYTVATAHLDTCWLWTYEKTISAFVKGTLKENFALFEKFPDYKFNFEGSKRYELMEEYYPEDFEKLREYVKEGRWNPCGSC
GUT_GENOME119486_0528211-82VGYTHIDPVWLWNRAEGMQEVKSSFASALDRLEEFPDFKFMHTSISYLAWLKENCPQQYARIHKYVEEGRWE
GUT_GENOME236883_001497-88TIYILPNAHLDPVWLWDWKEGCSEAIRTCRAMVRLLDDYPELKFNRGEAFIYEYIAEFDPQLFARIRDLIREGRWGVIGGNA
GUT_GENOME247693_0004268-168LEAEAMLAEIVNDAKSYHVLCVGHSHMDMNWEWNFSETVSITLSTMRTMLDLMNDYPEFKYSQPQASIYRILEEYDPEMLDEIKHRVQEGRWELNVGSWCE
GUT_GENOME237320_002258-72IGNAHIDPVWQWCVPEGLSVVKSTFRSALDRMIEYPNYVFTSACASYYKWIKLSEPEMFEEIKQR
GUT_GENOME143495_002086-85IYAIGNSHLDPIWLWPRASGRSSMLNTVRSVVKLMEKFPDFKFSCSSADLYRWCEERDPGLFRRVVELVREGRWEPVGGW