UHGP-MC 21487


Information


Number of sequences (UHGP-50):
90
Average sequence length:
86±10 aa
Average transmembrane regions:
0
Low complexity (%):
7.96
Coiled coils (%):
0
Disordered domains (%):
0.04

Pfam dominant architecture:
PF00528
Pfam % dominant architecture:
8889
Pfam overlap:
0.08
Pfam overlap type:
shifted

Downloads

Seeds:
MC21487.fasta
Seeds (0.60 cdhit):
MC21487_cdhit.fasta
MSA:
MC21487_msa.fasta
HMM model:
MC21487.hmm

Sequences list (filtered 60 P.I.)

Protein Range AA
GUT_GENOME007367_0220219-98IAIVFPVLVILFWQYASTHGLVKASLVPAPLTIAKTFLSYIESGKLWKNLSVSFGRVACGYVIGAMCGVAVGFLMGLFKP
GUT_GENOME095736_0355161-136WELLVWAGWSNGRLVPPPSKVFTTIAELARSGELTRHIVATLSRVGIGFGFGVAAGTLLGGICGYWGLARRLVDPT
GUT_GENOME141284_0256141-109LFGLWWVASNDHWMPAQILPTPQQTWQNFLQMSSQDLWWHLGISLKRLALGLFSGTIAGIVIGVLLGYS
GUT_GENOME127701_0145612-107SKERLLYLRRTRNEKAAVYSVRIFILVAIFGLWELAGNLGWIDPFIMSQPSRILNTIANLYAKGDLFPHIGISCLETVVGFLLGTVAGTLIASLLW
GUT_GENOME096493_0079918-93PIVLILIWQISSSKGIIPIYILPKPSKIGKTFMAMCLSGELFAHIAISIIRVLEGFFIGTILALIIGILCGLYEKI
GUT_GENOME096269_008545-102IISISSAVLFLALWIVLTLGGFVNKLYLPGPDQIFGVFMEDYSKLFRHLGFTLGRQFAGFFLGSALGILVGLLIASNRYVQAFMDPIIEVLRPIPPLA
GUT_GENOME096547_0185338-115VSPLAVVVLLQLGSTLGLIPERILPSPLKILAAGWEVISDGSLVQALAVSAQRVAVGFCIGAVAGVTLGLVVGFSRIF
GUT_GENOME253903_0340632-113KQLLTITSPILILILWEVFSRTGILDIRFFPPPTAIVSTFFELATSGLLWTHVSVSLYRIAMGFLLGVIPGVIIGLLMGLYA
GUT_GENOME257704_0061513-108GLVLPLLLIAAWEAASRQDASHAYAFVPLRRIGEGLAELLASGELWLSVAASLQRTCLGLLFGVAAGFVLGAAMALSRSAGRLLAPLFHGLRQVPI
GUT_GENOME096464_0047126-118MKWITPFLLPVLLIITWIFASKIISNQAVLPSPGKIVYNFIHSTDNFIGLGSLPRNIGYSLIRVVLGYGLGVLVAVPLGLLMGYFKVISNLFE
GUT_GENOME070390_0061610-96NMTIVPVILIVIWQLLASAGLIYEVILPSPAKCVVALGEIIKNGSLKIDLLVSLKRVLIGCFWGIGIGMIFGLIAGFSGIFERIIDP
GUT_GENOME260354_0051313-103LKRLSTDKRKTYAVRIAILVLFIAFWEIAASLEWIDPFIMSQPSRIVNTIVNLSQNGELFMHLGVSTVETVIGFVSGTILGTVIAILLWWS
GUT_GENOME158618_007649-104NKAKKIILSIAVILFWITAWHIASIWIANETILPSPLDVCKRLLVLGTEAEFWQITALSCVRIIVGFFIGCILGTILGIVTRLFVAVKYLLSPILT
GUT_GENOME120038_012964-103LHSVSTEQKHYLQAQKRNIRLIQASRILLLILFLGLWEITARLGILDSFIYSSPSQILDKMCKMISDGSLAKHVSVTLTETLISFALVTLLGIFCAVILW
GUT_GENOME236154_0105519-124RPSFIQRHRRTARQWLERSAALIVFFAVWELLPRLGIVGSAFLSPPSHVLQAIGQLMESGQLFKHVTASLQRSASGLLLAVAAGVALGLLMGVIRRVEAFLDPLLQ
GUT_GENOME188413_010599-93HRHLAYKTISILSFCVIIALWCLLTYGGMVTELFVPSPTKVLSTTVDMARDGSLWINCWMSVKRVMVGWFWSAVAALPIGMVMAR
GUT_GENOME141308_0302325-106ITFIVLWAIASKLNWVNPKLIPAPFEVVQIAIHQFQQQEFWTGLGASLFRDLTGFVLGSILAITFGIVVGVSRWANYLFLPS
GUT_GENOME033069_0044518-106LILFFAIWQIVTQMNQMHEWFNPKFLPAPTDIIKKAVEYAGDGTLWLHASVSIRRILLGFLIGGVLAITVGIIFANQKLVEAWFSPIIN
GUT_GENOME000598_0082916-120KKESNWVNKLIVFGRRSIALIAFFALWEIAPNIGLINKQFIPPISDILVYLVQMAADGDLFVHVGASLTRALEGFALAVLVGVPLGFLLGGWFKKFEEILDPLLQ
GUT_GENOME077018_0086113-90FMSLLIIVVIWIFLSNKINNSIYLPKISEIINEILRIIKEKNFLLNIINSLLRGFISFFIALIIAIVLGIVSGFNKFV
GUT_GENOME009466_018292-87QKIRRIAEVILSAAFWLVVWAALSYAVGKEILLPSPVRVFARLFELMGTGEYYGSVFFSLARISLGFIIGTLAGCLLAYFSFTSKT
GUT_GENOME214974_0023017-97FLAVIAPLTVLLIWALGSNLGLIRASILPSPQRVLSTLISLCTSGQMAEDLSISMLRVLRGFGLGAVCGIIIGSLMGFSKT
GUT_GENOME036221_0251915-110GLITWVLLLLIWQIGSLFYSDDFLPGPVTTFAGAGKLVASGDLFRDILISMQRVLKGWLLGILFAIPAGLCIGHFKKIGEIFEPFLNFFRFVPAIG
GUT_GENOME145698_0491830-112RSQFLTISVAVFVLLLLIWWLATSSGTIKPIFLPGIENVLLRMITLAQNGTLQSDIASSLYRMIIAFTISSLMAIIIGVLAGC
GUT_GENOME183774_0021428-105VMLLILWQVLSSKGLIKQSVLPSPSKIFLTLIDMIKSGELLRNLYVSVVRVIKGYCIGAALGIIFGIFIGLFRIAERA
GUT_GENOME093698_0082418-101FIVFLMIWQVVSVSGIIPEYMLPSPIKVVSALKTDFLLLLGHTGTTLTEAFWGLVCGIILGLITAILMDMCKILNKILYPIIIV
GUT_GENOME231320_016451-84MRRLQPLIYPLISLAVLVFIWDMAVRLFAIPDYLLPSPGAVFGALASGFQDNSLWPHIAVTLGETLSGFAVGSVLAITLGVALA
GUT_GENOME143124_019522-102VERLRAFARQAMAPLYPLSVLAVAWELLSRSGWVSPRLMPSIVKVGRAFVEGLANGDLVYHASISFSRALSGFGLAIVIGVLLGVLMARSRWFEWMVEPIF
GUT_GENOME183774_0029924-93VILILWQVLPEIGILDSQFIPSLSEVLVQLYEMCLESELLVHVAVSLWRVLIGLIIAAVIAIPLGFLLGG
GUT_GENOME141303_0051133-114IVLALVLPAIALAVWHYAFQKQWLPEQILPSPALVWQTTLELIDTHELQDNLWISLKRVAWSVLVGGSVGLFLGFLIGLSRI
GUT_GENOME212435_0310155-133VLIIWSLLTYTGVVDKLFLPTPTDTLKAALTMFSELGFFKDIMSTISIVMIGFVVSAIVAVPLGILIGTYKPFEAFFEP
GUT_GENOME095736_0066013-99VPLVAIVAVWQLASSIAIVNPVLFPSPAKVLLAAIDMFRSGVLLKDLLVSLRRAAAGFVVGASLGVTLGLLTSRVRLFSIGLSPLFN
GUT_GENOME142610_0533252-129AIIPAVTLVLWQLAGSTGLVSATFLPTPLSIARAFTDLLVTGELTHHLGVSMGRAGIGFLIGGVLGLLFGVLTGLFRS
GUT_GENOME142591_0514716-92LPFALALAIWYALSASGAVLELFLPSPAQTWDTFVALLRDGTLIESVGVSLRRLVVGWLIGVLAATLVGWALGMFDW
GUT_GENOME243814_0385616-119IADIINYIALPIGILIIWYWVTKNGEIPSLLLPSIGQVKDNFLIQLSNGQLWNDLSASVIIVAKGYLVGAALGIGFGIMMGAFSRVNKLFSVIFDTIRQIPGIA
GUT_GENOME004668_0271124-96IVLLLLWYLVTMSGEIIRPQILPNPINVLKAYPDLISNSALFTNTWYTVKLNLMGYFYALIIAIPLGLIIGLF
GUT_GENOME018037_008438-94EGMKKFCEHAAVLIFWLLVWSGISAAVGSDLLVPSPVSVVLKIAELCQTKSFWLGVATSLLRVTKGFLLGSVLGVLTAVLTAKSRIA
GUT_GENOME117552_0055090-178LFLFFCIWEFISYMNAQNGWFNPVFLPSPVTVLETAYDYMLDGTLFMHIGVSFYRMITGFVLGVAAALIIGIWVAMSRDADNILSPVLN
GUT_GENOME246268_001141-100MNKKLRTFVYPVCAFIVLLLLWQVCVMLSTSEIAFPTPVAVFKEFFYILTHKVGNYYTVVGHVLWSLSHVLWSLSRVLVGFSIAAVLGIILGLTMGWFDY
GUT_GENOME066938_00874440-522LPLWVLVAWHVATSTGLLSTVVLPKISSVASTLVQQLSTGTFKGDLAISILRILEGYAVAAVAGIFFGVAMGMSDTAHKFFSL
GUT_GENOME096530_0128446-131IPVALIVLWELLSRASVVPSNQLPAPTTILEKTISLGQDGSLWGHIWITTYRVFAGFIIGTAAAIVLGAVVGFYKKAEQLFDPMIQ
GUT_GENOME243144_0695049-125LVIAIWQLAGSAGWVNPLFLPAPSAIAVAIYKLATSGALWQHLSWSIMRIGTGWMIGTAAGVIVGFAIGLSTMARGV
GUT_GENOME065212_000772-103KVDKTKLKSHIYLVILLVAFVILWQFLAEKELINPMFFSSPKEVWADTVEMFSTGYILPHIGITLYAAFLGLFYGIVFGTLIALIVGNYKILAHIVEPIFVG
GUT_GENOME096381_0369055-128LVLLVALWETAPRLGLADRVFLPPFSEVAAAWWELLADGQLVDDARASLVRSLSGFGLAVALAVPLGLLIGWYK
GUT_GENOME088558_0098119-88LSIGVFVLIWWIYVTAKDVPAYILPAPDKVLKSLTEMFMDGSVYPHLWTTAYEVVLGFLIGSFLGILLGY
GUT_GENOME103727_0025333-103ILVLVLWQLLSTSGMVSQIFLPAPTAVVEAFVKSLMDGSLATDTWISFSRIMKGFLLAIVIGVPLGVLVGS
GUT_GENOME092839_0058320-94ISWVAIIALWGIGSLKYDEYFLPSPKETIDSMKELIKDGTLWSDIVASIGRVIKGWLLSIIFAVPMGLIVGNFRP
GUT_GENOME140365_0647128-105FLAPLVALVVIWQAVVTLFDVHSGIFPGVPAVFRAGIEAIRDGSLPMHVAASFARVLVGTGLALLFAVPLGIAMGVSP
GUT_GENOME090075_010741-99MKKLKEKKYVLLNIMSIVAFLLFWQGFSIYNQTAEIMNPKYLPGPIEVVQTVIEYIFDKGTLFQHIWTSLYRILGGFLLGSVVAVAVGYICSKSKIISN
GUT_GENOME143124_0462532-113DRLLGVLSPLVLLLIWEIAARTGFIDTRFFPAPTSIFQTLLVLAESGELWANSRASLQRLFWGFLVGGVPALIIGVLMGLYR
GUT_GENOME171691_0318552-126WELAANRLWIDPFFYSSPSGIVMRLYEWITEGTAEGSLWFNLWVTMEEALIGFFAGSIAGVLVGIGLGRNQFLSD
GUT_GENOME169960_0016027-114WLLIWFIASRAVGSELLLPSPAATLARLWELARQGDFWLAVLQSLLRIVAGCALGIAAGSVLGVLTAVSRVLYELFMPLLTVVRSTPV
GUT_GENOME158642_0079924-104LIFLFAWQLIALCNPEMFPTPAQTAVSLVKMLMPASGTSVLVHVWASLRRVLIAYVIACVIGIFLGILFGWSRICYDYVHP
GUT_GENOME085701_0042311-104DRKKYLLFAILSFVIVLAAWLIASESGAIKEIFLPKPQNVVNYYIESIKDGSLLQNTGISIYRITLGFVYAVLLGVPIGILVGTFKKAEAFIRP
GUT_GENOME040713_0161942-129RMLTVITIILFFAIWQFVVDNGIVDSRTLPSPLKIYEAFLMKMTKKAPDGNLLSVNIIASLQISLSGFFSAVIIGVPLGLVMGWYKVL
GUT_GENOME040780_011304-78KAARIALTALIWLCLWEIAALITGSELLLPGPAATAKSLFSLLVSGQFWASAAATLARISAGYILGMLAGAVLGA
GUT_GENOME183774_0106637-114WEIAPRLNLIDPVFIPPPSTIAYAFIKLIISGELIRAIIISMSRAICGFGIAMILAIPLGFFMGWFKKFEKYMDPLFS
GUT_GENOME140360_0241428-123RRKWEVVRLVSPLALLALWQLGSAIGVIAQDVLPAPSLILQAGVELTRNGQLADALHISTVRVVEGLALGGVIGIVAGAAVGLSRWVEATVDPPLQ
GUT_GENOME237512_0146064-139LAFAAIFGIWSYASYGLKVDDLFLPSPDQVLTSAINMFTQENFLSDIRMSVQRVLIGFVISAVVGIPLGLVIGTYA
GUT_GENOME142249_026628-93KFLRLVKSVLIFFIILLLWKITNYLGIWSDYILPSPEKVYSTFLNMISDGSIFINVYASMKRVLIGFAISTAIGVPLGIFFGIYSG
GUT_GENOME188400_0252213-99GKKLFYQFEKIAPVVILLVIWYLVTSQSNYSGVLFPKFTTMISTIWSKIVDGSLFINLGVSFGRVIKGYCLAAVFGISFGIIIGLSP
GUT_GENOME283775_012805-101KRKKQILPDNTATSVMISLILPVLILVFWLIGSKNDLFNPSIIPTPHKMWLQLTKMISDGSLLEHIIVSFRRVILGYFLGAVSGVILGVLTGLYTRL
GUT_GENOME096443_0219656-132IFIFLLWYVITKLNIFSEVFIPSPAKTWAAFTDLLVSGYKGNSLLFHLLDSLYRLSIAFFFALITAVPLGLISGYSS
GUT_GENOME139702_020537-105KKYTIVSIVSLIMFFIIWQMYAKLNVRMGWMNAAFLPAPTDVLKTFREFYEEGTLFENIGVSVERMLKGFFVGSVLGILLGYIMAKIPIFESVLQPIIN
GUT_GENOME142591_044725-98RKSFCALIGSLVLLLVWQLAVWIGNYEKSLLPAPLDVLRGLGELAASGALFAHLKVSLLRFAAGYLSASAAGIALGLALGRTRLLWAMIDPVVQ
GUT_GENOME242991_0107629-104VILLLIVWQIASTKVGIPLLLPTPKSTFTEFYSNVTNSQVMTNIGITMSRVVKGWLIAVLVGVPIGMAMGLSKIFE
GUT_GENOME097341_0053054-144ISVLVFLGIWELAVRIGWIDSKYLCAPTTVVKTFIQKLSDPKPDGAVLGVHIITSLKLVLVGYCSAVIVGVPLGLIMGYYHTFDKLVSPIF
GUT_GENOME067581_0056717-100SKKRYMASGILTFVIIFLIWIVLCASGVVKEMFLPSPAKVLKDIIETAKDGTLWANMGYSIFRITMGFIIASLVGIPLGILAGS
GUT_GENOME196553_0086037-113LIIWQLLALSVNISFIFPGPLDVMQALVGLIKTQEYYRIITASTLRIAVGFVISFILGVVLGIISGRFGIIREFIEP
GUT_GENOME158592_033144-117RDRTELEKKQQRDRRKFAAISACSLVVFFGVWYLATAIMHLMPDYCLPSPVQVLEAFVYKLNHKAPDGGTLFQHILASLKVALSGYLLGAVIGIPLGICMAWFKRIDLFVTPLF
GUT_GENOME153026_016807-124SSCNRHKRNERLDKMKKNKKYMVISAISMICFFVLWELVTDVFHLFPVYSMPSPIKTFQAFFIKLVNPSPDGSVLLVHVAASLQVALTGYVLGAIVGIPLGILMAWYKPVDLFVKPIF
GUT_GENOME195555_0086110-101MKRWQKSYTPAVSLASVAVLLLIWEGICRAGVVSSLFLPPPTLIVSSLIDMIKDGEIGVSLAASLYRILMGFFLGSAFGLLVGLVTGTSALA
GUT_GENOME044174_0094112-88ISFAVLIALWQIAYMAGGINEALFPSPLKAWDALLEMISDGSIFTNIGASMYRFVIGYISAILAAVLLGLFLGWFNI
GUT_GENOME143124_0419144-121WQLAASQGWMAPQILPAPEQVWASFVELLENGDIAANFAVSLHRIGMGFLLGAALGLATGIALGVSNTARDYLDPLLR
GUT_GENOME096448_0096531-108WETLSARRIINPAILPAPSSIFSTCWSMLTSGEIARHMRMSAFRVCSGFLLGSAGGLVLGILIGLFHKIDRALTLIIG
GUT_GENOME213427_0156125-115LKHPIRTTNLISAGVVLAILVVWMIVSQQGIVDEKIVPSVRSVWEALVSISVEGYKGSTLLQHLGASFFRLFTAFGLAIVTAVPLGLLSGQ
GUT_GENOME140601_036619-99DHTGWAARNIGSIAVVSFLAIWEILYRIGLINPIFLSSPLKIVQVCITLFQNGVLTEHLLFTLGHFTVGFGLAAVSAVVLGLVFGWYRPLG
GUT_GENOME096439_0371614-98IIFFIVLCEFVVQRGIISDLYLAAPSNVVVTLYELIISGELTGHFLTTVLEFAIGFGAAILLGVGLGFAMGLSKPLEGYFTPLLS
GUT_GENOME141303_0051547-123ILLFLAIWEIVPRIGLVDSAFLPAFSTVLLSGWGLIQNGQLLTHLQASLIRSLSGFIFAIIVAIPLGLAIGWYTRFS
GUT_GENOME096235_0139716-89ALLIILWQGVSYFGWVNPYLLPSPGDIIGAFTELLFTGEIFTHIKSSLFRLGVGMAITIGLAVPLGLFIGLFSR
GUT_GENOME207731_0394942-128LFFAGWQITSSQGWLNPAVIPPLDNIVIALWNGISSGALLDDIAISLQRAGLAFFSAVAIGIPLGLFMGQIRPLERALDPLLQLFRQ
GUT_GENOME208176_0040312-87KNLLVFFITLLIWEILYKIKIFSPIYLPGPFKVFQTFCAMVFDGTIFTNLLASLSRVFVGYFLALFVGFFLALFFY
GUT_GENOME224712_0213050-143KIYANLMIVPVLLLAAWQLLSSTGLLLEVVLPSPWKVLMGLETIILNGTLAIDLRDSGIRVLIGYFWGVVIGLTLGLASGLSKFIERLVGPVVD
GUT_GENOME096547_0103461-139ASGPVVLVILWVAASASGAVTSQQFPAPWEVGHAFAELARTGELWENMSVSLRRVAVGLVTGISVGLVLGIVSGLFRLG
GUT_GENOME140888_0257531-103LSAVPILVFLAVWQYAPSAGLISPTVLPAPTTIMREVAELTMTGELVEHIAISLGRTVAGLALAIMLAVPLGF
GUT_GENOME107769_0013228-106IAISIAALILTWWALAVVFDKSYFPTPPEVFEALIQSFDNVPGIGLSMSTQLFSSLERFVFGFLLALVTAVPVGLLLGH
GUT_GENOME105612_009655-108HVNEEKKSFISTGKNIFYAVFLPVLLLILWEWAADTGRINGNVIPPPSRLAETFVQLVTSGKLAQGLLISFQRVLIGFLIASVVGIVLGFLMGLFTPVNKMLSS