UHGP-MC 23622


Information


Number of sequences (UHGP-50): 101
Average sequence length: 78±7 aa
Average transmembrane regions: 0
Low complexity (%): 0.86
Coiled coils (%): 0
Disordered domains (%): 0.18

Pfam dominant architecture: PF03466
Pfam % dominant architecture: 7030
Pfam overlap: 0.35
Pfam overlap type: reduced

Downloads

Seeds: MC23622.fasta
Seeds (0.60 cdhit): MC23622_cdhit.fasta
MSA: MC23622_msa.fasta
HMM model: MC23622.hmm

Sequences list (filtered 60 P.I.)

Protein Range AA
GUT_GENOME059338_01093 214-294 ALAQAGLTRRVVLSVPHFLFVKSVLASTDLVAMLPLRLAIASPELQYIEPPMPIAGYEMSMLWHERSHRDPAHRWLRDFIV
GUT_GENOME156079_01836 243-319 PFFISTVSLLLETDLVYKLPLETAQKLTRFLPLSVLPLEDGVDMPWQPAVFWHRSTNDAPLMTWLRAKIVAYACRIS
GUT_GENOME083187_00487 243-319 RVAMDCGYYLASAFFLLETDFYVMLPTFTAKCLVKELPLAILPASPMANAVVWKARMIWHERSERDPALQWFRSVIA
GUT_GENOME110892_01334 213-302 ANIPHVGNGDAAIWTPYFLSVPTMLDSSDFLAIFPLHLANQMALRHRLHILGRPENAPIFSPTLLWHDRTHFDPASQWFRAKIISQCRAY
GUT_GENOME138977_00480 238-308 PHFLAAAFFLVDSDDLLTLPLETARFLERLLPVTHCEDPVFAHCPWDPIMLWHERTDKSPSHQWLRAVIAE
GUT_GENOME164731_01084 229-304 AVRTPYFFSAVQMVRRSDLCFTCSETLAHEVAGTDEFRILPLPEECSTKWVPKLVWHERGHSDPAMQWLRSVIISL
GUT_GENOME026681_01128 225-304 IVIKTPYFMGAAKIISESNLVMLLSDIMADWFISLGALRRIPVVSEELEHVQSKGFVTKLIWHERSHLDPAMQWMRGMIS
GUT_GENOME245879_02061 245-323 FLVVPGILEETDCVAVLPLLTARLFESEDRIAVMPLHPQTELGRGRETFWARLIWHERVDRDPAMAWLRGLIKANAERI
GUT_GENOME110892_01223 213-308 LRDKCFPSWHQFRSGIKTQYFLPFIRSVARSDQLLMVIPEKSAYALLNNTDLVIIPTQTKGLSDEPKMVWHQLTHHDYVMQVVRSIIFSCSQEEQR
GUT_GENOME037095_01208 223-302 ANPEIFLRTSFYSCGLAMLQGTSLLAMAPLQFVLASQKWLDISILGRPAKGFLHEPCVIWHDCRHNDPGNQWLRSVILSH
GUT_GENOME260051_00219 234-306 DTPYFLSIPFFIIESDSYGFLSRETAEYLAQFLPIEILEMDSALVRPWTPHLLWHVTSAGDYQLQWLRTTISN
GUT_GENOME256176_01440 236-315 KTVIELPYFFSSIQLVEKSDFTIVLPNFIGNFASTHQKIVGIPVADAPERIYQQTLVWHDRVHIDPGNQYLRSTIVNGAR
GUT_GENOME143124_04348 218-296 GLSRQTVVSVPSFLVAERVLRNTDFVSVFPTSLARGMMSTLKNYPLPYTLPKFEFAMAWHVRTHLSPLHRWVRDQIAEE
GUT_GENOME007781_00528 235-296 FLLQEDEVCTMPYQLACRLKNFFPLEILGAPADLEPYAVTLLWSEKVDKEVSHQWLRSLILA
GUT_GENOME000452_02896 217-299 GIRRRLRLTVPHFMAVGPVLQATDMIAVVPRRFADCACKPFGLATAPCPVKIPESVINVFWHARNHREPANQWLRQVVVEQFA
GUT_GENOME120773_01717 232-305 VAVWTPYFLAAAELARRTDCYLIAAEANARVWVREGKLAVLPTEPLARTFTPLLIWHDRLNGDPALQWLRGKIL
GUT_GENOME097747_01397 238-308 PHFLAAAFFLKDTDYWITLPKESAERLTHMDDFVILRDPVHGTLPWRPTLIWHERTDGSPLHQWVRSKIVE
GUT_GENOME143124_01185 225-298 RIAVRARHFGALPEMVMNSRMLAIVPTTYARNAQPRYGFKVWTLPYAPAYDVRLLWHASTARDPSLQWVRALVH
GUT_GENOME195440_01684 230-305 ALQCDYFLPSVFLLLESDCYARMPLNTARYLSRYLPIALLPPEIRRLPVWSGRMIWHERTDVDPLLLWVRRLFMNE
GUT_GENOME231923_03284 214-297 GLRRQVAVSVTHFLAVPEMIVVTDYCATLPRLICQQLAKDPRLKVLPAPVDLGTFPVEMGWHARYRDDPAHRWLRSLMMEVAQT
GUT_GENOME210225_01681 243-313 FFLAAPLALPGTDCYAVIPAAAARFALDPDRYAVLPFGPDAPLLTVRLGWHERTHADAAAQLIRSILIEAV
GUT_GENOME200155_02225 231-307 SPFFVSSAFACLVADVVVVLPETTARTLSQWLPITIYKTPQLGESPFCPTLYWHESKNADPANQWLRSIIISHTRSS
GUT_GENOME011632_01386 229-304 GRTQIRTAYILSEPLILARTDLVGMLPESITRQYQYAGLPIVSLCKVFGHSPHRPHLLWHERSDEDPVTAWIRALF
GUT_GENOME056490_00400 237-321 KVGLSVPYFAAAATILEKTDLTLLLPDETAQILRDVYNFPLAILPYGDLSEGLSYRARLVWHERTHREPVMQWFRGMFALYASRH
GUT_GENOME016305_00326 231-316 GECVLRIPYLLPALDLLSHSDMTMTLSWRAAERLLATDPRFVVLPLDEESPVYPVRIVWHEREEDDPECAWLRGLFRSVNHREEEE
GUT_GENOME098425_02140 223-311 AGGKTIVEVPYFLGAPYFLEGTDCTLALPRKTAEFFERHLKNVTAIPWTNENGENNARLIWHERCDRSPHMQWIRSLFATYAGSPDEEA
GUT_GENOME003350_00847 261-342 KVAFRTPYFLAGAMALQTTELTMELPSVTAHRLEALGLIRAFPMPKSKPSNIPKLIWHESSVSDPAVQWLRSVLVTGVADDT
GUT_GENOME248197_00106 223-304 PPDASQNIAIVSAFFVPAAFMLLETDFYVRLPVPTAERLKAMLPLETLPAGMRKLPDWEGKILWHARTDFDPALSWFRSRVA
GUT_GENOME158410_00705 13-87 RSAIRSYYFLSFIRVVLNTDLLMVLPDRTALYFAKNDTLTILPTQVKPQTHTPKMVWHNTTHKDPVLQWVRSMIF
GUT_GENOME143124_04905 213-291 GARDRIQVRMQQYLAAPHLVLNSDLIWTCPESLVVELCKYFPLAVKPVPLPLGHFEVALYWHDRFHKDPASQWFRAQVV
GUT_GENOME147611_00969 238-309 YFLAAAFAVMKTDFYVRLPAKTADLLAEYLPIVKLPEGFRRMPMWDGKLIWHERTDLDPALQWLRGIFIRTL
GUT_GENOME143124_01359 221-303 RVVVEVYHVMSLLPILESSDLIAVVPRDVAHTCARYANLRIVELPFAGPSITVHQFWHERFHKDPANCWLRSAIHEMCAALRD
GUT_GENOME025053_01610 225-308 TGIQVPYFMAAPFFLPDSDFTLFLPEPTAKFFAQILDLDIVTPPSDRLAYATRIVWHDRVYSDVESQWLRSLFVAYARPVAGDE
GUT_GENOME207953_02202 221-292 RVPFFQAAVEVLLRTDCLMTTPAHIAWQLSREHALSFCDLPFATRVQQYHLLWHQRHHLDPAHRWFRELAYP
GUT_GENOME143406_01783 230-300 PYFSSIPFLLKGSELTSVIPTRLARHFVKAASLVIIPSPSPPKKYNVRLIWHHRVHNNAANRWIRQLILSF
GUT_GENOME015816_01212 244-324 RAAMRTTQFLSLVPAVTASRMLLILPRRTAEVLARGGLLSVIPTVERSIEHRPQLIWHHRANDDLELQWVRSILVDSARRA
GUT_GENOME286625_01181 230-301 PYFNSAPQVLARNDYYMWCPAPTARLWRRMTDIAVINADAEGRYDFTPRLIWHERTHHDLVCQWLRGMIASC
GUT_GENOME180687_01626 232-322 LRESAFPTWAGAKTVARTAFFLPVLSFLHEAPLLAILPRRTAVFMAGEGRFTIVPTLTKSNSHQAQLIWHDRIHSDPLMQWVRSMIFTCAK
GUT_GENOME262268_01297 227-316 RTRIVVPYFIGAQYFLENTDLTLSLPKETAEKIAERNPYLTTIPSPLAKSRAIVPCLFWHERTHHDPAMQWIRAQFKAYAQTEEGAREAR
GUT_GENOME243565_01779 218-298 RGGIRNIVYEVNKVWSMPAIVGSTDLVCVLPRRFAELMAPLFGLEIYDSPVPISDQSYHMIWHEKNNDDPGHKWLRETLMR
GUT_GENOME064235_01078 233-315 TPYFVTLPFALLESDAYAVMTRELAERFAAWMPLKILDVKLDPDSPLRERWNPTVIWHSRSNDDHQLQWLRAAIVECFQERRR
GUT_GENOME109761_00377 227-305 ELRAAQTAVWTPYFLLLPMLLSRTDLIAVMPLQLANNFIRLGRKLVILGRSADAPIFEPVLVWHEKSEKEPALQWLRSH
GUT_GENOME032138_02016 249-328 YFLAVPALLERTNFTAILPKDTALALTTYLREPLTVLPYGPNGQEATLYYTRIIWHERTDRDPALVWLRGLFATYAGCDI
GUT_GENOME258867_00605 218-304 HAPVPGLAKQETSVTLPYFLAAPHLLAGNDSTLLLPTVSARVLARALPVSVLPTRGLGGPADFAARLVWHKSRGCNAAHQWLRAMIA
GUT_GENOME195571_01898 228-305 AVWTPYFISVLQIMSDTELWGAVPFQTFMRLRRWFPDLVILGRPASAKVFSPKLLWHERLHEDPASVWVRSVLLSAAK
GUT_GENOME179273_01110 231-301 YFLGAPYLLLGTRNTVLLPRSTAIFFEKHIVGLTSIPVTLDAPSPYTRLIWHERSDSSVEMQWIRSVIKTY
GUT_GENOME095640_01151 214-296 LLADGVPSVVVPFFNTTPFLALETDFVVWMPTPAAALWMKLGRFAVRDLPEALQCVFTPKLIWNNRSESDPLHQWVRSLIAAV
GUT_GENOME113629_00836 228-314 IEMPHFLGAPYLLKETMATLVLPDRTAEFFARMLPEELAIVPIPGEHRVSSTRLIWHERNHASREMQWIRSMFVTILRPGQEGEQAV
GUT_GENOME016358_00503 226-309 SQKTSVISPYFLGSFIFVETSDLTVTMPRETAEFVAERNPNIAILPFPSAKQGSIVTCLFWHDRVHNDPAMQWVRSQFKLNAQT
GUT_GENOME041858_01418 233-308 TPYLLTVPHMLTQCDLIGRLPESSVKELIGNLPLTILPKNLVADPAWTPVFLWHDRTHFDPAHQWFRSVLLRSLED
GUT_GENOME143124_00077 242-323 VRAEIALRVRSFLSVLPIVADTNLIAPVPSNFGRLAVRESNLEICESPVKFPVFDLNMYWHHRYHADPALTWLRESLVELFG
GUT_GENOME146006_00347 9-83 RVVLRTGHFLSLASIVEQTDLISVVPFEVANVLSHQAEIAIFPLPFTSPNFPIMQHWHERYHSDAFNQWLRGVVR
GUT_GENOME247295_00080 246-313 FFLAVPLAVQNTNYYSIVPKVTAEMAFSQDKLSLLPLGPSSPTLTTRMAWHERLHTDPGSQMIRSVLK
GUT_GENOME212162_03604 226-314 GLGYIDLALGKMGIQRKIALRSQHYLMASTVVQQTDMAMTVPERFARHHDLHYAVLPVSDVPKLETHLYWHESTDQDPANRWMREQIIE
GUT_GENOME219020_02429 260-332 IETPYFLSMPYLLAQSDAYALLPREGAELLSSHLPLAILETDPADIRGWNPQIIWHRRNNADFALQWVRSEIC
GUT_GENOME138977_01288 254-338 EAAFSMPFFLAVPYVLEQTDFTYMAPVITLMYFVKQSAYKLRMLPAPPEVAPFTPCIIWHHGSHTDPFLQWVRAVIADGCRNEAR
GUT_GENOME277263_00384 238-313 ACSTPYFTSVPNFLIHSDFTAFLPDDTAFLFKELYGADIEILPFTDSEKFTYHTRLIWHERTDNDMVMQWFRSLFA
GUT_GENOME207610_01705 81-144 FIVLPELLEQSDLLVVLTERLISNLPNLKRFEPPLEIQGFTKTLIWHERTHKDPAYRWVRELIE
GUT_GENOME096099_03332 235-316 YFLAAPMILTQTDFVLALPLQTARYFAGMAELTILPYPSAASPFYTRLIWHHRVHHDPAIEWLRDLFLRFAGEEQAAGESMA
GUT_GENOME145858_00336 246-314 FLGTGELVAESDMIAVVPEQVAQHITRRYPCRSWPLPFPLAKIAIRQVWHQRLHRDPGHIWLRSLIATL
GUT_GENOME032032_01675 235-304 PYFLAAPWIVLETDATAKILEKNARMLARYLPIEILPLPPGLAGRYERRVIWHENASGDPALMWLIAMLR
GUT_GENOME143124_00859 214-294 RGIERKIALRTSHYMSLGTLINDSDLIVVLPRAVAELHVRSANARIVEPPFPIRRYDLKQYWHRRFHDDPKSIWLRTLIQG
GUT_GENOME232606_01603 235-312 LGEPPVKCPYFISALLLTAQTDKVMVCAEVLAEMLASILPVQYRVLEDEPAWKPAMNWHQRTHRSPLHQWLRSMLHTR
GUT_GENOME129550_01288 238-314 QGHIDSPFFLSGPLITRCTDAVCICPEIIAEPWRQAGLISCLDADFLQQTFTLHLVWHERTHVLPEFQFMRSLLLSK
GUT_GENOME118348_01634 234-310 VETPFFSTMPFVLMESDAFAVLTRDLAEAYARIMPLTIVEDAKDCLPQVRFQWTPSIVWHARSSEDYALQWLRSMIV
GUT_GENOME146531_03195 294-370 SIAYQGMAMMSVLSVVSQTHLVAIAPRWLAEEFAESLELQVLPLPLKQNSRTCYLSWHEAAGRDKGHQWMEEQLVSI
GUT_GENOME247004_01618 471-547 PYFLACAWTLLDTDAYMVVPAPLTDLLAERLPVEAFEPADLFTPLWIPSFIWHARTDLDPLRQWVRAFLITSIREKW
GUT_GENOME065735_01760 234-304 YFLAAPYFLLHNDWTVVAPIVAMKKTLDPNLFSIFPVSEEAPKLTSRLVWHKRVHRDPAIQWLRSLFIACR
GUT_GENOME032138_02057 249-328 ALEVPYFLAGIAAVAHSDAYMLVPKRLAAVASAMGGYAALPLDFDGGSRWAPSLLWHDLTDLDPLHQWFRSKLVAFNREP
GUT_GENOME098425_00501 241-311 FFIPGAFLVLETDFIHLMPKETAVRLQKHLPLAILPSPDDVTWRPQLLWHRTKNASPLHVWVRSQIISCAR
GUT_GENOME203811_01391 215-305 LRHSVFPEFSRAASAVKSYYFISFIGATVGTELLMVLPENSARLFERLGQAVILPTAAKSVEHVTKMIWHETTHNDLVMQWVRAMMVSSAI
GUT_GENOME082931_00257 82-164 MGMKTLVCPYFQASTTYLLSSDAVQWIPETAAQAINRNNEFRIITLPKKYTVCFTPKLFWSVKTDKDPVNQWLRSLIIDAAAN
GUT_GENOME068652_00170 217-299 GVVRNIRLEVPHFVAVGHILQHSDLLATVPERFAASCEAPFGLSVMPPPVALPNIEINLFWHGRYNKDPANRWLRQLMFDLFS
GUT_GENOME235919_00636 218-305 PEAWRERSVAVRTPYFFSAVEMVKHSDLLLKASEMLADTVLPGGSLAAEPVPDVLHADFEPKLIWNRRTHTDPAAQWLRSLIAGSVSH
GUT_GENOME095640_01237 237-300 LLLEESDFVLAVPWRTAKRLARRMPLTALADIPEAPSLKPALIWHAHRSADPAFVWLRSLFLST
GUT_GENOME016817_01597 231-324 RIAVSMPYMLAAPALLEATDLTLLVPGKLALDFVRKAHLTAIPAPSMKSSFPRTLYWHRRTDKDPALQWIRGMILRFGAAETDETAAGRIFAEP
GUT_GENOME282900_00762 218-299 LPELKSSQTFIELPYFLGAPYFLKGTEYTLLMPTRTARFFARQIPGLIAIPYEGEYAENYTRLIWHERSDKSITMQWLRSMF
GUT_GENOME015510_01053 226-304 AVRVPFFFGAARIVAASNLVTLMSDVMCEWFLPPGLLHALPLKELDDCEHRFTVRLIWHDRSHSDPAMQWIRSVILESY
GUT_GENOME125058_02279 289-360 PYFIAGFFTILQNDCFLAAPLAVAQRLSNWLPIELLHFDEGSQFLWTPEILWHERTDASPLHQWVRSIVCSA
GUT_GENOME020307_01051 235-302 FFLAAPYFLEGSDALCVLPRALAELALDPGRFSLLPAPANAPTLTMRLAWHKRTDEDPYFQWVRGLFE
GUT_GENOME022884_01483 306-373 PYFVTVPLLLGAEATTCIPFQTAHALCRTTDLCILGRSKKSSVFTPSFIWSDRVDRDPAHQWLRSFLF
GUT_GENOME195724_01795 276-363 EQAAITMPYMLGASRIIRESDMVLRMTGRLALEYVRSGELVALPTPWVRPGIERALYWHQRTDRDPGLRWIRAMIRSVGVKISDEEAV
GUT_GENOME147523_01561 211-293 LRKLGKTRHIALQVPFFSAAVNVLIQSDYMMVVPEHIAVNLAKSQPICHYPLPFDTEIHQYWLMWHPKYDHDSAHRWVREKVT
GUT_GENOME036853_00801 247-321 PYFASAAMLVGIDGGVALIPYQTAKRLARILPIEILGAPKTAIPFQPFLIWDETREADVALEWLRALIVSGLQAS
GUT_GENOME143124_02395 229-308 RHAIRVPNILIVPAVVAATDMIATVPLRIAREAAKRLDIKAVQPPFKIANPDISMYWHELNHRDPAHIWFRQAIQTIAQA
GUT_GENOME076056_00533 234-310 AVWSPFFIPIAQILCQTNLWGAVPLQLAVQLQKLMPDLIILGRPKNAVVYGPTLIWHNRTHQDPASVWVRSVIVSAP
GUT_GENOME185929_01776 313-382 YFLTVPYVVARTDFTFEGPLITLRRFMMDSQFQFRILPMPEESGPFEPRLVWHHCSHTDPFLQWVRGLVV
GUT_GENOME098425_00194 236-310 AIYTAYFLSSPFFVLETDHVLVMPTPTAEYLSSFLPIQVMNLCPDSPPMDMYLIWHERVHASPALQWIRSFLVSK
GUT_GENOME015415_01838 229-311 ARGVRTLVVPYFNASPFLLLETNHFQWIPKVTARRWTAFGAFEAIDPPEGLAVTLGPNLIWSARSDRDPLHQWVRGVILAGAR
GUT_GENOME049458_02081 237-318 SVLRVPYFNSIAFALLDTDYIDWIPESAANYWENFPGLISFPLPEEISPAFTPRLIWSARSTHNPEHVWLRSLIISRARELY
GUT_GENOME010473_00572 237-314 SPFFLANAWSLLDSDGYTILPIPLAEILTERLPLAALKPKDIVIPPWVPHLYWSVRTDADPLRQWVRSFLISCARRRW
GUT_GENOME120366_01466 273-352 ETAVTVPYFLAVPSLLNATDFTAVMPMQTARLFEERDGIVAIPIHPDPESGRGRDAFNTRLIWHERVADEPALQWLRGLF
GUT_GENOME244286_01958 218-304 ALGRRRRVTLSICSFLVLPDILRQSDLISVVPRRLALNGGLIILPPPLSIPGFTKTLAWHERTHRHKGHQWLRQLLVSTASESTDLV
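
Each row of the sequences list gives a protein identifier, a residue range, and the amino-acid sequence for that range. The following is a minimal sketch of how such rows could be split into fields and sanity-checked. It assumes that identifiers always follow the pattern `GUT_GENOME` + six digits + `_` + five digits, as every identifier in this list does, and that the range is inclusive at both ends. The names `parse_row`, `Row` and `range_matches_length` are hypothetical helpers, not part of any UHGP tooling.

```python
import re
from typing import NamedTuple, Optional

# Assumption: identifier = "GUT_GENOME" + 6 digits + "_" + 5 digits.
# Whitespace between the three fields is optional, so the pattern accepts
# rows whether or not the columns are separated.
ROW_RE = re.compile(r"^(GUT_GENOME\d{6}_\d{5})\s*(\d+)-(\d+)\s*([A-Z]+)$")


class Row(NamedTuple):
    protein: str   # protein identifier
    start: int     # first residue of the range
    end: int       # last residue of the range
    sequence: str  # amino-acid sequence covering the range


def parse_row(line: str) -> Optional[Row]:
    """Split one row into its fields; return None for non-matching lines."""
    match = ROW_RE.match(line.strip())
    if match is None:
        return None
    protein, start, end, sequence = match.groups()
    return Row(protein, int(start), int(end), sequence)


def range_matches_length(row: Row) -> bool:
    """Check that an inclusive start-end range agrees with the sequence length."""
    return row.end - row.start + 1 == len(row.sequence)
```

Calling `parse_row` on each line of the list and filtering out `None` skips the header row, and `range_matches_length` can then flag any row whose range and sequence disagree.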