UHGP-MC 50089


Information


Number of sequences (UHGP-50): 74
Average sequence length: 79±11 aa
Average transmembrane regions: 0
Low complexity (%): 0.55
Coiled coils (%): 0
Disordered domains (%): 0.15

Pfam dominant architecture: PF01546
Pfam % dominant architecture: 135
Pfam overlap: 0.03
Pfam overlap type: shifted

Downloads

Seeds: MC50089.fasta
Seeds (0.60 cdhit): MC50089_cdhit.fasta
MSA: MC50089_msa.fasta
HMM model: MC50089.hmm

Sequences list (filtered at 60% identity)

Protein Range AA
GUT_GENOME046406_00132 360-427 GLFVKKMGGGEVDPTSGDFVFYVSEAYLIEKGKLGAPVRGAILTGNGPEALRNISALGNNLVMDPGMC
GUT_GENOME155724_04017 354-429 LCAGIKEGVLVNEIKGLHAGTNEVTGDFSIESAGFFIRDGKVSHPIKSFTVAGNFFDLLKDILAVGSDLHFGMPKG
GUT_GENOME028642_00772 343-439 KLSKEGLFNKIKNGIYVTEINGLHAGLNPTSGDFSLQAEGFHIVDGKKDKPITLFTLSGNLFDLFNNIIAVGNDSELLLSSFTVPSIAFKNLKVSAE
GUT_GENOME233284_00529 354-448 NEHNVPDLKSLCQMMHNGVIIGDLMGSGFNYTTGDFSFGARGFMVENGVITYPIDGITIAGNIKKILKEKLVAIANDADPRYSIHMGSMLLTDVM
GUT_GENOME025984_00595 353-425 DLDALCAKMGNGIVITGVSGLHAGANAVSGDFSLLAEGYLVEDGKKGQPVEQVTVAGNFFQVLKDVVAVGSDL
GUT_GENOME286625_00434 372-438 MLQSIPRGVLVTELIGAGLNAVTGDYSQGAAGFWVEGGRIVHPVDGITLSGNMLSILNGLAAVGADE
GUT_GENOME274919_01466 349-416 LSEKELLAAVSNGIYIDRLNGLHAGFNKVSGDFSFGAKGYLIEGGKIGRPLEQFTIAGNYYQLWKDIK
GUT_GENOME090341_00906 356-448 EVLAQAGEGVVVTQLQGLHAGANAISGDFSLGAKGYRIRGGKLDRAVKQLTVAGNFYQLLKDVETVGADLEFGMPGAARIGAPSLLVKGLTIA
GUT_GENOME007500_01521 188-276 TKEELISSIEEGLLITDLSGLHAGLNAISGDFSAQSSGFYIKNGKIEKPVTLIVVSGNFIKMMNEIDEIGSDLFLGYSGVGAPSIKFKG
GUT_GENOME092011_00697 337-424 SNFFIKPGETSFDDLVANVGDGLLITNVMGLHSGANAVSGDFSLGASGFMIEGGKIGCPVKGITVAGNFLKLLCDVTELGSDLWFGMP
GUT_GENOME070905_00130 350-420 EEIFEKIGNGVYITGVQGLHAGLNPISGSFNLQSNGFLIENGKKTKPVTLIIVSGTLQDMLNNVSFVGNDF
GUT_GENOME277665_01321 347-428 ETSEEELMSLAGSGIYIKDFSGLHAGADAVTGDFSLQSEGFTLENGKKAKAVKTFTVAGNFYTLLKDIAAVGNKTGHGIPGG
GUT_GENOME122826_01234 396-480 MLARLGTGLLVTEVFGHGVNPVTGDYSRGASGFWVENGRLAYPVAGVTLAGNLLDMMTSVEAVGADEHPEGARRTGSLLLPKMTV
GUT_GENOME049589_00417 340-430 PFTFYVAAGDCTEEELLRKADRGVYINFVGGLHAGANAITGDFSLQSAGFLIENGQKTAPVKSFTVAGNFFALLKQITAAADNVKVPMPGG
GUT_GENOME246218_01969 355-419 LGDGLVVTELSGLHSGVNTISGDFSLLASGYLVEGGKRSRPVERITIGGSFLELLNNVAAVGRDL
GUT_GENOME032086_00840 351-445 EKSFEELLKEIGNGILITDFAGLHSGLNSISGDFSLAAEGFLIKNGKKSAPLNQMTAAGNFFELLKDIDEVGNDLKFSLSSTGSPSVLIKKLHFS
GUT_GENOME206088_00419 347-437 KISEEEMIKKCNNGIYIDSITGLHAGLNAQSGNFSLQATGFMIKDGKIDKPVNLITIAGNLLKVFQDVTLVANNAELLTNSFTIPSIMIKG
GUT_GENOME237439_01174 345-433 SKEEIIASMKKGVYITSLQGMHAGMNAQSGNYSLQAAGFYIEDGKIVKPVSLITVSGNLIEDFNKVIAVASDAKLTYQAIEAPSILIRA
GUT_GENOME112386_00483 351-425 ELCRRMGDGLYITEMKGFHAGASAVTGDFSIESAGFLVKNGEIAAPVHSFTVAGNFFDFLKAVDGVSSTAEPGFP
GUT_GENOME277093_01254 357-441 DLTGSVKRGLFVTDVIGNGLNAVTGDYSQGVSGRLIENGRLTVPVREMTIAGNLKDLFAQMILADDAQDGYAVSVPTALLPDVSV
GUT_GENOME198857_01352 378-448 DMISSIKLGVYCKKFSGGTVNPATGDFNFAVDKAYLIEKGKITKHLKGVTLIGNGKDILNNVEMVSDDLIL
GUT_GENOME121257_01589 361-451 SPQSLLSRVGSGMYVTEFMGLHTIDPVSGDFSIGAKGHRIINGEITTPVSGVTIASNLLEFMKNIAAAGNDLTYSYATAAPTLVVDNVVIA
GUT_GENOME258602_00268 349-444 LSMEQLCEKMGSGIIVTDLEGVFAGVNTINGSFSLLSKGMLVENGKPVKPFCEVTIAGNIYTMLEEIEALGDDPAPTAAGSQFVQTPSILLKGLTV
GUT_GENOME260024_00021 332-425 VKIAPYTFYIVPGTASQEELYQQAGDGILVTSVTGLHAGTNAITGDFSIQCGGYRIRDGKKAEAITSFTVSGNFLTLLKHIQAVGADLKFQMPF
GUT_GENOME044999_00016 348-420 TQEELFETMQEGIYITEISGLNSGINDQTLDFSLPCQGYLIEGGKIVKAVSMIICAGNLKELFESVVDVANDT
GUT_GENOME232008_02528 376-438 TGLIVTSFSGGGPRLINGDYSRSLRGFWAEGGQIRYAVDGVTVAGNLAEMFRNIVAIGTDVIT
GUT_GENOME113641_00949 344-429 DILSSIEEGILINRYSGGNAGASGELSGIAKNSFYIKNGKIAGAARETVISANLSDMLMNIKAISKEKVNDGGSELPYIAFDNVTI
GUT_GENOME224423_02326 354-447 EDLIKSTDHGLLIIDVQGLHSGANPVSGDYSLSAYGYEIENGNIKRPVNQITIAGNFYDTLKDIEKIGNDLKFALPGMGGQIGSPSIKIKKLSV
GUT_GENOME127643_00782 583-650 IIASVSKGIYVNEFSNGQVKIGEGDFTFFVQSGFLIENGRLTRPVNDVNVIGNGPQALKDIVAVGNDC
GUT_GENOME044828_00092 380-463 MLKKLDRGILVTELIGQGFNPVTGDYSRGAFGFWVENGRIDHAIEGFSIAGNILEMMMQLEAVGADRLTASERSCGSLLFSDLC
GUT_GENOME230902_02141 351-422 KEELFSLAGDGIYITELQGMHAGANPKTGDFSLGASGFLIKDGKKESCVKGFTVAGDFYKLLASIEAISNDL
GUT_GENOME113687_01365 349-441 LSRDEVLAQAGDGVYIVGLNGMHAGYSSVSGDFSFGAEAFEIKGGKLGRALNQITVSGNVYRLWNEIEAIGGDLAFNPSAFGAPTVFVRELSI
GUT_GENOME080236_01988 366-452 QSMMNEVGSGLVINSLMGQGTNIINGNYSRGASGFYFENGNRVHAVDEITIAGNLKDMFLSIAKLGNDLDERSRVQVGSILLPEITV
GUT_GENOME011366_01180 355-447 TQEELISQIKDGILVNSLTGTHAGVNSISGDFSLQAEGIKIENGKLSHTANPFIVSGNILDFLNNIEMLANDTDYHHSPIYTPSALIKELSFS
GUT_GENOME114403_00232 366-453 EIIKSTEKGLFAEKMGGGSVNPATGEFNFAVQVGYMIENGKITKPVKGATLVGSGKDVLLHIDMIGDNLSCGYGMCGSMSGSVPTIVG
GUT_GENOME237600_01250 351-433 SREELFSMAERGIFITGMKGFHAGANVVSGDFSIESEGFLIENGKKGRPVKSFTVSGNFFSLLKNVAALGNSIEDVLPAAHKI
GUT_GENOME141699_01090 356-444 EDIIKETQDGLFIIELHGTNAGINSVSGDFSLYAAGFLIEKGKLTHPVNQITVSGNIFKVMNNLDEIANDLIIKSSVTSPSVKVRSLSI
GUT_GENOME236868_01080 353-440 ELLRLFPRCLLVVKLEGNSGCSAISGDMSIGVQGLYAENGEVLHPVEGATLSANFIDLLETLVAIGSEYPDAFSGLQVPALAFPEISV
GUT_GENOME219403_01278 358-428 EIISSVKNGLYVVDFSGGSVDTASGNFNFSANQAYLIENGKITKPVRGAKLVGNSAEVLMNIDMVGNDFVM
GUT_GENOME096269_00775 348-429 DLIAELGDGLYVTDVTAYFHGAGINAASGEYSIPATGYVVRNGKIERPFDGVTIAGSLYDFMKNIKEVGSDLLFGLPTSFRP
GUT_GENOME133604_00841 797-895 FIIEGGSKPLADMIASVKRGLIVARFSGGEPSANGDFSGVAKNSFMIEDGKITDAAAETMISGNLADMLMNVGDISAETVADGVTLMPYISFDGITISG
GUT_GENOME097942_00730 352-438 LLEKMGDGLLITSIQGLHSGLNPVSGEFSVPCNGFLVESGKVKRAVNQIIMSSNIKEFLNSIVEIGSDSKITLDGSYVPSIMLENIN
GUT_GENOME123134_01198 343-422 YINPVKGSLDDLLSAAANGIYVTSVEGMHAGANAITGDFSLSSGGFMIENSKKGKPVKGFTISGNFFSLLKDISLIGEDL
GUT_GENOME096500_03436 355-426 TKEEIIENTEYGLFAKYINAGSVNPATGDFNFSLSEAYLIEGGKITKPVKGATLIGNGSKILQDVDMVGNNL
GUT_GENOME145988_00404 350-418 EIISRVSFGLYAQKMGGGETDPTTGDFVFNVEEAYLIEDGKITVPVKGAILVGNGPQVLKDILAVGSDL
GUT_GENOME096365_00237 352-449 MSLGERNFDAILASVQKGIWVTNFNGGNSNPTTGDFSFGIEGFLIENGKISTPLGEMNITGNMLSLWDSLTEAGNDPRETSSWRIPSLLFNNVNFSGL
GUT_GENOME033299_00595 465-531 MLKEMGTGLVVTELMGQGVSAITGDYSRGAAGFWVENGEIQYPVSEITIAGNLKDMWRNIVTVGNDI
GUT_GENOME015678_00200 437-505 ISPEDLMSDIKRGVYITGLFGQGVNPVTGDYSKGASGFMIENGRLTFPVHEITIAGNLKEMFSELRAAN
GUT_GENOME127152_00354 347-421 DVARAQLISRMNTGLLLTYSLDLFHSINVASGAFSIPCGGVYYENGRPIGTISQMTMVGSLNELWTNIEAVGSDL
GUT_GENOME233284_00532 398-467 QDIISSLDRGVYAVNFSGGQVDTASGNFTFTASEAYYVENGKIQYPVKDITLIGNGQETLKKVTMVGNDL
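
Each row of the list above carries three fields: a protein ID, a residue range, and the amino-acid sequence for that range. The following is a minimal Python sketch for splitting a row into those fields and checking that the range length matches the sequence length. It rests on two assumptions inferred from the rows themselves, not from any documented specification: that every ID has the form `GUT_GENOME` plus six digits, an underscore, and five digits, and that ranges are 1-based and inclusive. The names `parse_row` and `length_matches` are hypothetical helpers, and the pattern tolerates rows with or without whitespace between fields.

```python
import re

# Assumed row layout: ID, start-end range, sequence, with optional whitespace
# between fields. The fixed digit counts in the ID are an assumption inferred
# from the listed rows.
ROW = re.compile(r"^(GUT_GENOME\d{6}_\d{5})\s*(\d+)-(\d+)\s*([A-Z]+)$")


def parse_row(line):
    """Split one row into (protein_id, start, end, sequence)."""
    match = ROW.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognized row: {line!r}")
    protein_id, start, end, sequence = match.groups()
    return protein_id, int(start), int(end), sequence


def length_matches(start, end, sequence):
    """True if an inclusive, 1-based range spans exactly len(sequence) residues."""
    return end - start + 1 == len(sequence)


if __name__ == "__main__":
    row = ("GUT_GENOME046406_00132 360-427 "
           "GLFVKKMGGGEVDPTSGDFVFYVSEAYLIEKGKLGAPVRGAILTGNGPEALRNISALGNNLVMDPGMC")
    protein_id, start, end, sequence = parse_row(row)
    print(protein_id, start, end, len(sequence), length_matches(start, end, sequence))
```

A length mismatch on any row would suggest that the ID or range was split at the wrong position, so the check doubles as a sanity test on the parsing assumptions.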