UHGP-MC 53047


Information


Number of sequences (UHGP-50):
153
Average sequence length:
67±3 aa
Average transmembrane regions:
0.02
Low complexity (%):
5.39
Coiled coils (%):
0
Disordered domains (%):
1.81

Pfam dominant architecture:
PF03313
Pfam % dominant architecture:
7059
Pfam overlap:
0.22
Pfam overlap type:
reduced

Downloads

Seeds:
MC53047.fasta
Seeds (0.60 cdhit):
MC53047_cdhit.fasta
MSA:
MC53047_msa.fasta
HMM model:
MC53047.hmm

Sequences list (filtered 60 P.I.)

Protein Range AA
GUT_GENOME208002_01579404-474GGLVQVPCIERNAIASVTAVNAARMALFDKEQTPLVSLDAVVQTMKKTGADMSEKYKETAQGGLAVTIVEC
GUT_GENOME212067_00855398-467GGLVQVPCIERNAMGAIKSITAVRFAMRGNGEHFVSFDTVLETMRRTGKDMHEHYKETAMGGLAVTVPVS
GUT_GENOME231001_00824633-704GGYVQIPCIERNGFGALRAIDAASYAKQLGYLRKNKVSFDSIVNVMKETGKDLNSAYKETSLGGLAKEFGLK
GUT_GENOME254361_02574335-403VQIPCIERNAVAAKRAIDSANLAHMLVGTRTISFDMVVRTMYETGISMNRAFRETSEGGLAKLYSRRAG
GUT_GENOME011045_01175401-471AGLVQVPCIERNSMGAIKAITAANWAINNDPSVPVVSLDKVIKTMWETALDMNFKYKETAQGGLASQIPVS
GUT_GENOME282480_01114216-282VQVPCAQRNASQTANALLSADLALAGMTSVIPADQVVEAMYRVGRQLPMELRETAMGGIAATEAGRE
GUT_GENOME009245_00225216-279AGLVEVPCVKRNAFLAINAFTGAQLALANIKSIIPPDEVIDAMKQIGELMSPTLKESSEGGLSV
GUT_GENOME018465_00275323-386GYVQIPCINRNAMGVVKSLMAYEYAKHQPPMVLDLDDMIQVLYETGKDLKTEYRETSLGGLAKL
GUT_GENOME110741_00799217-283VEIPCIKRNASGAVNALLSADLALAGVKSYIPFDEVVDAMYAIGKAMPSEVRETAKGGLAVSPTGQR
GUT_GENOME067122_00498468-540VEVPCQKRNAAGAAVALVSAEIALAGIGNLVDFDQTVDAMFAVGKSLPFELRESALGGLAATPAACAYCESCM
GUT_GENOME117955_00760687-754GGYVAIPCIERNSISAIKAFNSYVFAKHLVPYRHNAISLDEVIAVMKETGKDLSADYKETAKGGLAKI
GUT_GENOME244327_00233218-284VECPCIKRNAIASANAILSADLVLSGISSLIPFDEVVQAMANVGKSMHQDLRETARGGLAATNTAKE
GUT_GENOME018030_00145694-761AGLVEVPCIKRNAMFSNFAITAADMALAGIRSQIPPDQVVLAVKEVGEHMSTDYKETAHGGLAATLNG
GUT_GENOME187921_01776241-310GGLVEVPCVYRNVGASGIAFTAADMALSGIRCPIEPDEVILAMKEVGDALPTSLRETGEGGCAACESICK
GUT_GENOME078379_02380456-529VEIPCISRNAMSVSNALTSAEMTLGGYDSQIPLDQTIETMYRVGRQLPSELRCTGNGGLCLTPAALAMAKNAGQ
GUT_GENOME275787_00241332-399GGYVQIPCIERNGMAAVHSYTSYIYAKDIAPSRKNRVSFDDVIAVMRETGQAIPTDFKETSLGGLAKI
GUT_GENOME012435_01152214-276GFVEVPCMSKNVLGAVHAIVCAEMACAGFKSVIPADEVVIAMKEVADGMDERFRETAQGGLAN
GUT_GENOME225779_00356387-455GGYVQIPCIERNGFGIIKAWTASSIALDEAASSHMVDLDSCIEAMNLTAKEMSSKYKETAEGGLAKVLC
GUT_GENOME202194_01240537-604GLVQIPCIERNAMAAIKAVNASRLAVRGSGQYKVSLDSIIKTMYETGLNLNSDYKETSLGGLAKNVVC
GUT_GENOME231418_00744213-276GGMVEFPCNLRNANGIMNALASADMAMAGVAMFVSFDEAVEAMKRVGDSLPERLRETGEGGIAV
GUT_GENOME250990_02031423-488GGYVQIPCISRNAMSSVKSWNAWMMARTGQGTKHPVGLDLCIRTMNQTGRDMKDDYRETARGGLAL
GUT_GENOME282119_00297389-450VEAPCLGKNIMCAMNAVAAADYVMAGADALLPLGEVIKAMDEVGRSIPVEYRCTGLGGLSKC
GUT_GENOME099285_01139218-284VEVPCVKRNGFAAVTAMLAADMTMAGVTSVIPVDEVIAAMNEIGRALPKSLRETSEAGLAVTPTAKK
GUT_GENOME201656_00291437-501GGLVEVPCIKRNVIGSVNAITASDMAMAGIESKVPLDEVIDAMAEVGDLLPCSLKETSQAGLAQT
GUT_GENOME129657_00312417-487GGYVQIPCIERNAMGALKAYNAYLIASTENPWHHRVTLDSVIAAMAETGREMNAKFKETSLGGLAVSMVNC
GUT_GENOME133372_02209348-414GLVQIPCIERNAFAATRALDSNLYAAFSDGKHRVSFDQVVRVMKQTGHDIPSLYKETSEGGLAKNVF
GUT_GENOME231837_01133380-445GGQVQIPCVERNAISSVKAINAATMAMSRISEPCVSLDEVIAAMYETGKDMSAKYRETYHGCLGKI
GUT_GENOME026458_00454215-281VEVPCVKRNASGAVIALNSAELALAGITSVICADEVIEAMQSVGCMMHESLKETAQGGIAATQTARK
GUT_GENOME260147_00192373-435VQIPCIERNAMGAQRAVDAANYALLTDGEHHVTFDQIVGIMDETGRDMMDKYRETSKGGIAKL
GUT_GENOME103871_02913331-402GGYVIIPCIERNAVAVLRALDAMSLARTLSRIKKNRVSFDVVVKTMNYTGKHLPIELKETSLGGLAKEVQIV
GUT_GENOME248955_00968356-422VEVPCVNRNVMGAVNALSCAEMALSGVESAIPCDEVIDAMRAVGDALPASLRETGSGGLAATPTGRR
GUT_GENOME134442_00606336-401GYVAIPCIERNGFGVLRAYDGYLYSKNMSNIREAYVKLDDVISVMYETGIRMHQDLKETSRGGLAT
GUT_GENOME032462_01248460-532GGLVEVPCQARNAQGVAAAFTGAQLALSGVRSLVPFDEMAAVMLQVGHALPTTLRETAKGGIATAPSALSACA
GUT_GENOME140366_02696504-574AGYVQVPCIERCAFGAVKAWTAFMVATNEIASRHRVDLDTTIKTLADTGRDMNAKYKETSEAGLAQNLTLC
GUT_GENOME243448_01153213-280AGLVEFPCTFRNSSGVINALICADIALADVKSFIPYEEVLTAANNVGKALPYTLRETSLGGVAMTASG
GUT_GENOME014557_01653216-278GLVEIPCAKRNVSGAINALCTADMVMAGVNSKIPFDDAVAAMYKVGTGLPEELRETALGGCAI
GUT_GENOME096540_01292395-464GGLVQIPCIERNAIGAVKSINSARLARMGEGTHHVTLDNAVQTMAETGRDMHTKYKETSLGGLAKTLGIS
GUT_GENOME098201_01626446-508GECAEIPCISRNAATVSNAIISAQLAVGGFDGVVPLDETIQAMMNCGELMARELRATMLGGLC
GUT_GENOME041364_00774213-280AGLVQVPCSFRNASQAANAVASADLALAGQVSIIPPDEVIEAMYKVGKKMAPEFKETSQGGVAATPSW
GUT_GENOME264140_01017446-515EIPCISRNVSAVVNAVMAANMVSWGFDPVIPLDETIASMYQVGQQLPASLRCTCQGGLCQTKTAQRIACD
GUT_GENOME143695_00853403-471GGLVQIPCIERNGIAAEKALKLGTLALLEDGTDKKVSLDEVIRTMLQTGRDMKATYKETSLAGLAATLR
GUT_GENOME231555_00317333-398GGYVIVPCIERNAIFAVKAFNVANYVMSIEPDHIISFDDVIKTMAETGKDLRAGYRETSTGGLAKY
GUT_GENOME242954_01110214-282GLVEYPCNFRNASGVMNALISADMAMAGIKSIVPFDEVASAMGEVGKLLNVSIRETGLGGLAGTKTGCA
GUT_GENOME000338_02772217-283VEIPCIIRNGLHAITAQAAADMALAGIASVIPPDEVIHVMHEVGQQMPESLRETGIGGLAGTPTGQK
GUT_GENOME121399_01286606-676VQIPCIERNAVAAMRSINAINLANFLTATRKISLDLIIETMYETGRDLSAKYRETSTGGMAKLYHPNIKCD