UHGP-MC 99843


Information


Number of sequences (UHGP-50): 74
Average sequence length: 93±9 aa
Average transmembrane regions: 0
Low complexity (%): 2.3
Coiled coils (%): 0
Disordered domains (%): 0.35

Pfam dominant architecture: PF17042
Pfam % dominant architecture: 9595
Pfam overlap: 0.5
Pfam overlap type: reduced

Downloads

Seeds: MC99843.fasta
Seeds (0.60 cdhit): MC99843_cdhit.fasta
MSA: MC99843_msa.fasta
HMM model: MC99843.hmm

Sequence list (filtered at 60% identity)

Protein                 Range    AA
GUT_GENOME046284_00612  325-409  KEVRERVAKSLGCVLEGLISGGEPFSLLITGGDTLLACMERLAVVEMEPVREVGPGIVLSRFQYQGIWCQVISKSGGFGAETLLN
GUT_GENOME245203_01520  323-409  EETRVLIARRLGALLLRMLGMEEARDYTPMIIGGDTLIAFLTQAGFPRVVLEGEVSAGVVMFSMVFRGRTLRMLSKSGGFGSRTLLG
GUT_GENOME000737_02557  335-414  KNISELVVMVFKQVKLGSLAVFGGDSLYETMKRLGSKGIEPQMELSSGVVVSMSQVNDQPFCVISKSGGLGTLELAEQID
GUT_GENOME153026_03512  310-398  EDRRKKTAKRFGKLTADILEGMETYILLVVGGDTLYEVLKHLAIHILQPLYQIEPGIVVSVAKDSKGKTISVISKAGGFGDPETLTRVY
GUT_GENOME023087_00009  308-395  ISERIASNLGKAAAAVVEAGFQGTLFVFGGDTLACVLREIRAQRVEPLCETESGVVISEVSRDNGSTLFVISKSGSFGSVNVVENVCT
GUT_GENOME162155_00666  325-421  QPSGKQLHLQIADNLGNLTARVIRAARPELITVFGGDTLFGILRNLGVDTITPERELLPGVVQSVLSSSVDCVRVVSKAGGFGGPSLVEAILREYQL
GUT_GENOME243939_00915  326-422  TIRSRIPKSLGSLIQKRMAMNPKASLMIMGGDTLLGFFDLMKITEVEPIAELMPGTVLSSFWVDHKSYQVITKSGGFGKESLLVEIAEKLISRQEER
GUT_GENOME171681_01637  232-313  VEVMIAAVLHRLLDLGLDVPLFVVGGDTLIAFLKQIGCGQITPLYEVSEGIVYSTFRYRGKACRLLSKSGGFGGPDILPRLL
GUT_GENOME023863_00489  317-417  LTTEDLRVRISTQLAHLMKRLLDSGLDATILCTGGDTLLALMRTVGVAELTPVRELATGAVLTNFVYQGKTYHIISKSGGFGEPALFCGLAALVGAGNQRR
GUT_GENOME226406_01766  323-417  PLEEITQNVANNIGNLITFLIEKNDLKNIIIFGGDTLIGILKEMNITKIYPLIEITSGVVLAKVYFNDNELNIITKAGGFGNKDIIKDIEKFLEI
GUT_GENOME046406_01663  311-414  IKDRLKTEEIRQRVARRLGEIMYAWFGFGLDHTLVTIGGDTLAGFMKKAGCSELTPVCELSDGVVCSTLGLGGREVQIVSKSGGFGKSSVLTDVVSQLVTGGGK
GUT_GENOME184890_03814  319-410  RETGLSLDQLRERILQILGSISKELLNVNRDSILMITGGDTLIGLLRALDVHDMEPIEEIMPGIVLSSFIYRGEPYEIITKSGGFGEKDIFL
GUT_GENOME025292_01654  327-410  VADNVGRLATAVMDRAEPDTVVIFGGDTLASFLAVLGVRGIRPVRELLPGVVLSQMQRGGKTILLITKSGAFGERDTLVSLLRK
GUT_GENOME000445_04402  312-400  DTVAQRLGALAAQAIGASRAAGVVATGGDGARQVLLALGAGGIALVDEVIGGVPLGTLTGGTADGLPVVTKAGGFGAEDVLVRAVRAIR
GUT_GENOME003095_00804  317-413  LSKQDVRFRIGQSLGMLTKQLFRYGVNRTFLFTGGDTLFQSMSVLHIDQLRPIGEVTSGVVLAELDWRGKSIQVITKSGGFGPPDLFDSIVKLYKNY
GUT_GENOME218105_00470  59-178   VILKSADAPEDVEKVMAFANGGGIPAEALHQRIADSLGLLSARLSEHTAPSSLMVVVGGDTLFGFMNRINATQLVPEKEIQPGIVQSQLLAGGKQQHIVSKAGGFGSDATLREICAYYHV
GUT_GENOME260378_02017  328-412  IADSLGVIARRLMAGETKRCILMTGGDTLLGCIRKLKKAQLTPIGEFETGIVLSELNTSQGSTFVFTKSGGFGDEALLVNMGKEI
GUT_GENOME238898_00914  321-411  PQTVPARIATRLGALTAQVLARKKPAALIVFGGDTLLGIARSLGCTSVTPLAELLPGIVLSQMQTDTGALFVVTKAGGFGDEALMRLIEER
GUT_GENOME170529_01776  324-407  GAILKDLLDRGAVSRLMIIGGDTLLGFMDAIGCRELSPQREILQGVVESEYIYRGVRHSVLSKSGGFGQTNLLLRLESEAPVHA
GUT_GENOME105241_00816  343-434  AVRKKVLGFLGEMALAAADAVGLRGLVLFGGDTAVAVTRKLGAGGIEITREIEPYIPLGVLIGGEHTNVPVVTKSGGFGREDSLVSIVKKLE
GUT_GENOME143497_01139  320-417  SGMSKNEIRFAIVSSQGRIIKELIKRTDVNTILVTGGDTLMGFVKEIDGAQLVPICELNQGTVLSRIDMDGKPVQIISKSGGFGREELMEEIIEKIVL
GUT_GENOME096189_03406  333-426  LNHTEVSNEIVRAMGEICARLLEHGYFKGVSMTGGDTAKQICMQWSISGFELLDELEIGVPISKFLGIDDLYVVTKAGGFGKPDVFIHAIQTIK
GUT_GENOME172949_02362  323-402  EQVRFQISQTMGGVLERLLSLGLEATVLVTGGDTLLAFMQRIGQDKLIPLGELAPGVVLSQIEYCRKKYAILSKSGGFGS
GUT_GENOME283553_03603  339-440  MDTEAVRRRIPETLGIILKQMLDQGLNAVWLVTGGDTLLGFMKEIGQWELKPIRQIRPGCVLTKLDYNGREYHMITKSGGFGDENLLPELTEELKGKQKGEE
GUT_GENOME041883_00221  323-413  EDIRQKISCQMGRILKELIARGADACFMIIGGDTLLAFMDSIGCHELTPVCELQPGVVLSRIAYGGREYEIISKSGGFGEEDLLVTLKGET
GUT_GENOME218104_01196  310-411  DAGNYAESLCLDETALPARIAWKLGYVTQQIIQKQPPALLTVFGGDTLMGIAAALRCQNITPVTELLPGIVLSQMHTKHGCLDLITKAGGFGSEALLSEIEE
GUT_GENOME074180_00338  323-415  LDPAEISSRIASNIGLLVQHLLRRPVPITLIVFGGDTLLGIMQVIRCRYITPIRELRPGIVLSCAMCEGGALTLVTKAGGFGGTDTISYINQC
GUT_GENOME036194_01651  328-449  NEKLIVLIAGALDENDVTKSKECGQKLNIEFFNVGERMAKLMGDLMAALAPSFNAFIMTGGDTAVHACKEVGANSFKVLGEIEKGIPLCLIDSGIPQNCVLVTKAGALGTPQVFTKTVTNLF
GUT_GENOME190882_00126  326-416  EAIRSAVASCLGSVTARLVRAKGNCTVLVMGGDILQTFLMHMGIVELSIIGEPLTGVVMSEFTSNGAVVRVISKSGGFGSPDLFISLQEKL
GUT_GENOME160283_02576  22-98    GCIVNELVNQGLDLTILMTGGDTLMGYMKQIGCTQIEPICEIEQGVVVSKIKCGGHSIQVISKSGGFGTVDILCRIA
GUT_GENOME000963_01440  303-408  SFEEGKELDENTLQKEKMEETRFQVANIHGLIMQKLLETGFDATYVVTGGDTLKGIMDVIGCEKLNPVTEIEKGVVLSKMELKGREIQVISKSGGFGGTDIFCKIR
GUT_GENOME190568_00062  317-416  KISLLDARHYIANVMGNILKELISLGITSTILVTGGDTLMSFMDCINQSTIIPICELIPGVVLSQFNYNCQTYNLISKSGGFGEDDLLLNLTNIIHNSSY
GUT_GENOME089764_01291  307-383  RLTISRSLGYTAKWLLDQGIPHTILLTGGDTLLAFIHQLQVSELTPICELISGVVLFSVRYQGKTHYLLSKSGGFGA
GUT_GENOME258256_01966  317-401  ENLAELVVKICHKRLPDTLIINGGDTALSICRKLGAWGFDLIGEVDTGIPYGLLQNDWGQNLAVITKSGSFGDKCTLHKAILALR
GUT_GENOME147629_01510  325-412  KTFAQQLAALGKHSLKKRRFGALVLFGGDGSAALLHALGVTALRVLRPIAEGVPLATVRGGTYDGLVVITKSGGFGAPDLLCRMMADL
GUT_GENOME185758_01800  311-414  DTESVKNYRLRNNMTYEQCRFQIADTIGMIAIEMIRRGLNVTYSLTGGDTLMGLMRATGCKELIPVCEIGQGAVLNLMHWNGKCIQIISKSGGFGEKEIFNRMY
GUT_GENOME154245_01593  327-411  EEKRNVIAKRLGRIGREMVKSCQDTVVMAIGGDTLFGFLQMMNCRQIIPVQELEVGTVLSCIMDNGREMWIISKSGGFGDAALVS
GUT_GENOME270711_00053  320-407  KMLAHNLGLLVKALFDSGLKSTALITGGDALLGLMRALHINVLYPVCEIEPGVVLSRFHYQGKERTVISKAGGFGSPELFLKLTGQCQ
GUT_GENOME007620_01183  307-411  EETVEEPDFQAENVQKTKDIGEKVAKSIGCMLKALMEAGMDSTIMITGGDTLMGFLKNVSVSEITPLIEILPGVVLSTYLIGEKKYQIISKSGGFGDADILEKLA
GUT_GENOME083440_01355  296-394  VVVLDTGGAVENAGADKTADRIQIANTLGALLKRLLDDGLESTVLIMGGDCLLGLMRQLDVTEIMPLCEVAPGVVLSQFTYGGRQWNLVSKSGGFGAKT
GUT_GENOME105698_00966  336-415  IRRLLPRHRFGAVVLSGGDTAYEILQSCGADGMELLCELLPGIPLARIRPESGGELLVVTKSGSFGETETFEQIYELLSK
GUT_GENOME062600_00737  321-412  ENVRVKIAEQMGEIMNRLIMLGLSPTLMVIGGDTLFHFVRKVKCRQISLLSELEKGVVYSRMKIGGNFYGVISKSGGFGDEALLVRLIEKIN
GUT_GENOME264503_01880  321-417  IDTEEIRFSITKCHSIIVRYLIEKGVDYTIFMTGGDTLMGLMKTVENPEFIPVCELSQGVVLSRLKWKDKNLQIVSKSGGFGAENVLTEIADKLVEG
GUT_GENOME243467_00087  296-405  GRNKRLLLYGAPPEEGVGSAALEEKSRAVAGLLGELVRQLYRRGLRGVFAVTGGDILSGVLAHMKCSRMRPLCEIESGVVLSRAILPEGDVMIVSKSGGMGSREVFVNVA
GUT_GENOME220096_01733  328-414  VARALASILHSLLQRGVTGTIMITGGDCLFAFLQELQIDQICPLCEIQPGVVLSKLQYRQYDYFLLSKSGGFGKENLFDQVLAAMQN
GUT_GENOME237511_01180  317-400  EKTRKQIAQNLGSIVRTLVDQGLEATLLLTGGDTLLGFLEQVGVTRLTPVCEVMPGVVLSRFQYGGTRYVISKSGGFGDVRLME
GUT_GENOME282598_00322  326-413  SIAKNLAALTLYALNTSDFDSVIIFGGDGASAFLEQAGITELRLNCSVMPGVPMSTAVNGQHKGLQLITKSGGFGKEDLFDGILSHLS
GUT_GENOME010271_03000  316-412  MKSERMNLSDLRQRIPESLAVLTKKLLRIGDTRVLVVTGGDTLAGLLNEMGIGEMEPKYEVFPGSVLSRISYEGRDIHIISKSGGFGDPSLLVDIAA
GUT_GENOME061268_00248  321-417  VEIESVWAKVSFALGEFMEQIFSHGYIQNRTLMIIGGDTLLGFIRQMDLKEISPVCELNPGTVLSVIQIEGQKLWLISKSGGFGDENLLEEISHRLA
GUT_GENOME025404_00122  313-407  EKLGLSLAQMRCRISETMGLLLKTFLDDGLEATWMVTGGDTLMAFMHLAELHEMTPICEYTPGVVLTSVCMNGKTVYLLTKSGGFGEERLLLDLG
GUT_GENOME024905_00742  326-415  EKLTKIIADNTGRLVTNIIKENGVRNLIVFGGDTLIGILKNIECEYIIPIKEILPGVVFTKVVLKNKSFINVITKAGGFGEKNLITKVDI
GUT_GENOME140104_00726  324-411  ENIRECIVNRLGEFIDLWLGFGVDGALSLSGGDTVYAFLQRKGCKDIRPVCEIMPGAVLFETKVGERTMQVVSKSGGFGNESIFVEIA
GUT_GENOME010067_01156  103-203  DGETASQVGEEMGMDLEDLRQHISRTLGHVLRQLVERGVRSTMLVTGGDTLMGFVRAAGVSEIVPVRELLTGSVLSLINVGGTTYNIISKSGGFGERTLVS
GUT_GENOME046051_04245  325-420  KEQGITMEELRVRISSTLGYVVKELIQMGLDTTLLITGGDCLTGFMKHIGREEIAPVCEMAPGTVLSGIEIGDKTFAVISKSGGFGSSTLMADLAG
GUT_GENOME093691_00960  305-389  GSRMAASMGRCYATLLEAGFDSPFLIIGGDTLEACMDAACSRELIPLGEPLPGTVLCRALYQGNTRYLLSRAGGFGNSRALVDLL
GUT_GENOME186063_00580  329-433  RKKGISLEEVRTRIADAMGFVLGNMVERGLNSTLLITGGDTLMGFMNQIKVCEMEPVCEMAPGTVLSRFRISDHMYQAISKSGGFGSEDLLVQLADTILDKKERA
GUT_GENOME215096_02086  310-414  DGKSTDIEAARDMGLQIAARLGHIAGLILKSGVQKTLMVIGGDTLLGFCKANAYGDIQIVYELNDGMVLSRIEVQKQNFWLISKSGAFGGRELLVNLAKNQKIRR
GUT_GENOME225018_00596  316-416  RENHIPLEEARVTISKTLGEILKQLLEMGLEATFMIIGGDTLAGFITGMRCGEITIYQELEQGTVLSSIRTEGKEQWIISKSGGFGDRKLLMEVEQLVKHS
GUT_GENOME001309_00466  316-413  DRGLNLSDVRSRIPEDMGALIKGLLDRQVRTTLMVTGGDILMGFVQALGTYEITPIRELFTGTVLSTVQLDGKTQLLISKSGGFGDPDVILRLLDTLQ
GUT_GENOME062814_02281  311-396  ERSAERARIADRLGYITKRAAETFGYKDLAVFGGDTLLAVLNHLKLSELRPVEEIEPGVVLSETGGYRVISKAGGLGREDVVERML
GUT_GENOME023992_00886  351-453  RQMKVEEMRQCIADSIGRAAKYLIEADGPGENGTCRTLMITGGDVLIQTMGELGVAELHPICEMDPGVVLSGFTCKGQKRYVLSKSGGFGPSDQLLRILNKLN
GUT_GENOME131027_01020  321-416  QDVQNRISHTMGRLLHILIDRGIRATMMITGGDTLMGFMKDVHVAEMTPVKELIAGSVLSTVNVDGMPINVISKSGGFGKENLIAELSAIVLPENK
GUT_GENOME197279_00531  320-404  GKAVSSMIAEIVSRLAEWKRLTLVVVGGDTLMAVCGKLFGRRLEPCRQLLPGVVLSRTRTADGTEGWLISKAGSFGEENALRELL
GUT_GENOME010983_00554  308-415  DTQTDGESVEVTRGRVAKRLGEVIASLIHMRVDHRYLPFIIGGDTLMGFLRRMENPEVTLVDEIAPGVVMFHLRQQGREILMLSKSGGFGDDALLGEIGSGTLGKVSA
GUT_GENOME001292_03120  357-447  EDVRVRISETLGYLLKKLIDAGMEATYLITGGDTLIGFMKAIGVSELEPMNEIRPGCVLTSLNYQDKKHYVITKSGGFGQERLIEQLTRIL
GUT_GENOME096506_01452  328-428  AREQHLTAMEVAERISQALGDMTRELVEQVGDLRALVLTGGDTAKDVSKQLGAKGFRLTRQIEPGIPQGSLMGTDRLMEVVTKAGAFGSEKSIYRAIKALK
GUT_GENOME016248_00628  324-401  MARRVQEIADRAAVDTLIIFGGDTLAAVIRRMGIQSLQPVAQPAKGVVLCQVEYRGRSLRLITKSGGFGGENLVEIIQ
GUT_GENOME096723_00895  318-407  EQARALTAANLGAMAAKLATSTRVPTLLVMGGDVLLALLEGAGCACVRMEAQIAPGVVVSFVELEGMPLHIVSKSGGFGDRDLFTHVAEL
GUT_GENOME239428_01142  323-429  MDENQIRFSVCASHGKIVCYLWKHKVNMTVLMTGGDTLMGFMEEIGINQISPVGELEKGVVLSKLDWNGKQLQVISKSGGFGEEATLVEIAKQVLKKPKKWLKDEKT
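
The rows of the sequence list can be split into their three fields programmatically. The following is a minimal Python sketch, not part of the original record. It assumes that every protein ID has the form `GUT_GENOME` plus six digits, an underscore, and a five-digit gene number, followed by a `start-end` range and the amino-acid string, with or without whitespace between the fields. The function name `parse_row` and the dictionary keys are hypothetical. It also checks whether the range length (`end - start + 1`) agrees with the sequence length.

```python
import re

# Assumed row layout: GUT_GENOME<6 digits>_<5 digits>, then start-end, then residues.
# Whitespace between the fields is optional, so fused and spaced rows both parse.
ROW = re.compile(r"^(GUT_GENOME\d{6}_\d{5})\s*(\d+)-(\d+)\s*([A-Z]+)$")


def parse_row(line):
    """Split one row of the sequence list into protein ID, range, and sequence."""
    match = ROW.match(line.strip())
    if match is None:
        raise ValueError("row does not match the assumed layout: " + line[:40])
    start, end = int(match.group(2)), int(match.group(3))
    seq = match.group(4)
    return {
        "protein": match.group(1),
        "start": start,
        "end": end,
        "seq": seq,
        # True when the inclusive range spans exactly len(seq) residues.
        "length_matches": end - start + 1 == len(seq),
    }


# Example with one row from the list above.
example = parse_row(
    "GUT_GENOME172949_02362323-402"
    "EQVRFQISQTMGGVLERLLSLGLEATVLVTGGDTLLAFMQRIGQDKLIPLGELAPGVVLSQIEYCRKKYAILSKSGGFGS"
)
print(example["protein"], example["start"], example["end"], example["length_matches"])
```

The fixed five-digit gene number is what makes the split unambiguous when the ID and the range run together; if an ID in another family used a different width, the pattern would need adjusting.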