UHGP-MC 123222


Information


Number of sequences (UHGP-50):
216
Average sequence length:
80±7 aa
Average transmembrane regions:
0
Low complexity (%):
21.53
Coiled coils (%):
0
Disordered domains (%):
0.17

Pfam dominant architecture:
PF04060
Pfam % dominant architecture:
7639
Pfam overlap:
0.39
Pfam overlap type:
extended

Downloads

Seeds:
MC123222.fasta
Seeds (0.60 cdhit):
MC123222_cdhit.fasta
MSA:
MC123222_msa.fasta
HMM model:
MC123222.hmm

Sequences list (filtered 60 P.I.)

Protein Range AA
GUT_GENOME175641_012814-90IILSIIVVAVIGLILGICIAVAAKKFAVEVDPRIDEVTGMLPGANCGGCGFAGCSDLAKNIVEGNAPVNKCPVCTPEAVCRIAEFLG
GUT_GENOME114370_009807-80IAKTVLILAAIAFVLGVLLAVASKIFAVKKDEREEAVVGCLPGANCGACGFAGCSAYASAVVHDGVPTNKCVPG
GUT_GENOME038105_019894-90MIYPALVMGGLGVVFGSLLAFASKKFYVEVDQRQTDIRALLPGANCGGCGFPGCDGYAEACASGAAKLTLCAAAGPEVAAKIAEIMG
GUT_GENOME127408_000562-82LEQIWFPVALLTAIGVAAGLLLALASRFLTIPVGKKVEEARSFLPGINCGACGYPGCDQYAKAVVEEGAAPNRCVPGREAT
GUT_GENOME113571_0078017-100VIGIAAALALVFVALILIVSKFCAVKTDEKVAKIAEHLAGANCGGCGYAGCEGFAKALAEGKADLSACGPTPNENKAAIAEILG
GUT_GENOME058772_015636-95IYAVLVLGVIAVVFGLILSVAAKAFEVKVDERLPKIQACLAGANCGGRGEPPRAGGAMQPARAAPRQFLPVRLPSPPARPLAQRALPRLL
GUT_GENOME165783_007912-88ITAIIMMFVVGAVLGLILGIASNVFKVEVDERVEKVTAMLPGYNCGSCGFAGCSGMAEAMVAGEVATASACKPSKPDARAKIVEYLT
GUT_GENOME250136_009481-101MILIAVISLGAIGAIGAVFLYAASKKFEVYEDPRIAQVQEVLPGANCGGCGYPGCGGFATACVKADTLDGLLCPVGGAPVMGKVATILGKEAASAEPMVAV
GUT_GENOME258497_011684-91LLYTILTLCVLGILSAVILYVVAQKFRVEEDPRIDEVEKMLPGANCGGCGFAGCRGMADALVKQDDISSLFCPVGGGDTMKAVAAYLG
GUT_GENOME190435_015437-90LVLVAAGLVGATGLILGVFLGVASEKFKVEVDETEAKIRGALPGNNCGACGFPGCDGCAKAMALGEAPVTQCPVGGSPVAEKIA
GUT_GENOME037754_0197016-106TATVLAVASRVFHVEEDPRVQAVLEALPGANCGGCGYAGCEGYATAVATDPAVPANRCCAGGAETSIAVGELTGKTVGASDPLVSFRRCDK
GUT_GENOME238203_011963-76TILFSIFVIAFLGLLLGLFLGFTAKRFEVPEDKKVNQIIACLPGANCGGCGYAGCADFANAIVHEGAAPNRCAS
GUT_GENOME199674_007604-86ILPIGIFIVFGIVFGVLLTVISKVFAVKVDERAQKITETLPGANCGACGYAGCADYADAIVNKGAPMNACLPGGAAAASKIGE
GUT_GENOME258751_016888-87VMLVGGIGLLAGVILAAASKFFAVEVDERVTKIRECLPGANCGGCGFAGCDDYAANLIANPDLPTNLCPVGGAAVAQKIS
GUT_GENOME272665_004631-84MIGIIITTSIALILGLIIVFIDSITDSNNKESEYLELLPGYNCGACGFGSCSGMAHKMCENINNYKRCKPLKGEKLIKMEEYIE
GUT_GENOME239413_014421-75MNTIIFTVVIALLIGFVLGTLLGLFKKIFSVEVDPKVSHVREALPGANCGGCGYAGCDSFAVAVVKGEAPANGCV
GUT_GENOME204687_00421203-288MTGVIIVGVIGLVCSVLLVVASHFLSVPVDTRVEDIRAFLPGANCGACGYAGCDDYAKAVAGGSAKPNLCTPGGTSAAEQISEYLG
GUT_GENOME056441_013025-89FIAIGILAGLGLIFGFGLAIAAKVLYVKVDSRVEDITAILPNANCGACGYPGCAGFAEAIVAQTAENLSLCKPGLKSGKPEAIRK
GUT_GENOME219575_003026-80ILIAALVMSVLGLLFGALLGVTGRVFKVPVNEKAEALRECLPGANCGACGFPGCDGYAAAVADGKAEVGACAVGG
GUT_GENOME177955_002821-75MSYIIVSVLALGIIALIASSVLYVCSRKFAVKVDPRVGQINELLPKANCGGCGFAGCQALAEAMVKAIDNGEKDI
GUT_GENOME242957_010291-75MSILGPILILGIIGAVLGLLLGFSAKKFAVTEDSRVSLVREVLPGANCGGCGFSGCDAYARAVVSGAPLNKCGPG
GUT_GENOME110771_0081022-95IIAFTAKKFAVESDPRIEIVQDLLPGANCGGCGYAGCADMAQAIVLKGADPAKCPVSTEEARRKIAEAMGITLG
GUT_GENOME254344_002165-84ISVLIPVAIFAATAFVIGFLLAVASRVFSVKKDERIEKITELLPGANCGACGYSGCSDYAEAMVKNGAPADKCKVGDSEN
GUT_GENOME058044_010931-90MYILYALLILAGLSLILGILVTIFSKVLYVKEDTRIEEVTKMLPGYNCGACGYPGCSGMADALVNKKEPSPDKCKPSKKENKDKIVAYLK
GUT_GENOME134614_004421-88MSAILIAIGIMSLGAIVLGTALGFSAVRFKVEGDPLIEQVNAILPQTQCGQCGFAGCKPYAEAIIKGEAEINLCPPGGVDGMQRIAEL
GUT_GENOME237441_025574-75IIITIIVSFVIALVLGICLGIFQRLFAVPVDPKVQEIRKLLSGGNCGGCGFAGCDDFAKAVVEGRASADGCT
GUT_GENOME114426_0089021-95MGLAFVVSLLVSAASRQYSGRTRARSEERIVSLLPCENCGACGRPDCRAFAEDVLYRGERPAACTKLSVPAAAEI
GUT_GENOME231144_0047510-80IVAIFTVLIIIGVLLSLLANFTDSKNQSSADKRIEKLLPNINCGQCGYPGCQAYAEALSKGLAKPSLCKPA
GUT_GENOME122307_005614-83TILVAILFTGVIALLLGIIIGVVSKLFSVPSDPRVELVLGLLPGANCGGCGKAGCADFAKAVVAGECPPNKCPVSSSEQV
GUT_GENOME100879_005693-81MFLMAVSFAGISGLLAGLILGVAGKIFYVKTDERVEMIRNELPGNNCGGCGYAGCDDYASAIVEKSARPDMCTAGIGPE
GUT_GENOME054146_0008318-91LLYIISRKFHVEEDSRIDEIESILPGANCGGCGYAGCRNFAEQCTKVTSLDSLLCPVGGNEVMQKIAPILGLNA
GUT_GENOME130348_009761-80MVTTSILALFILGFVAAALLGVASKLLAVEEDPRVEAVVGVLPGANCGGCGYAGCENYAQAVVNDPNVPANLCVAGGEET
GUT_GENOME038955_013935-80IIWTIVVISLLGLVLAVVLYLVAEKFKVEEDPRIEQVEKVMPGANCGGCGFAGCHAFAEAAVKAPNLDNNFCPVGG
GUT_GENOME045204_003385-83LTAIMAVVILTCIGLVFGFVLASADKKFAMEENPLIGEVKEALPQGQCGACGFAGCAKYAEAVVLDPDVPPNLCVPGKA
GUT_GENOME236232_0129532-99FAVETDETVARITEALPNANCGSCGFSGCEGYAKAVAEGKAPTNKCKPGGSAAAAKLAEIMGTEALEA
GUT_GENOME125984_003325-83ILTPVVIVVVAGFIAALLLTIASKAFYVKVDERVTQVRAILPGANCGGCGFAGCDDYANAVVTDENQSCSACTVGGAAC
GUT_GENOME237840_014354-75LYAAIVLGSIGLVLSILIGICGKFLITQKDDIVEEIYDLLPKINCGGCGNPSCMQMAEKLLQGESIENCRPC
GUT_GENOME052443_004232940-3035INLICAIVTVLFGLLLLISKRHKNKDDEEEDETKDQTTTNEDEEKIRAVLPGNNCGACGYPGCDGLACAIAKKEAPINQCPVGGNEVAKKIGEILH
GUT_GENOME102095_0132017-92AAVLLFAAAKKFYVYEDPRIGEIEEILPGANCGGCGFSGCHAFAIECANASSMDGLNCTGVNSEGMKKIADIVGLA
GUT_GENOME062725_0066711-84IIYGAIALFVLGIVLSILARSKGGGDDEIANQLEKALPGAQCAQCGYPGCRAYAEAMANGTASCNKCTPGGQDT
GUT_GENOME205443_002082-74NIASAVILCTIVGAVGAIILVAAAKFMAVEEDPRIEEVASCLAGANCGGCGYAGCADYAKAVVMDGVACDKCA
GUT_GENOME253848_016424-92AILSAAACVAGIGLICAVVLVVAAKFMAVRENALERALRECLPGVNCGACGYTGCDGYAKALAEGAEKQANLCIPGGDAVSRQISEALG
GUT_GENOME141226_006494-95VIMAVVVLGVIAAIFGLILAYASKVFAVEKDPREEAIAGCLPGANCGGCGYAGCGGYAAAVVKGEAPVNKCAAGGESVAAQIAEIMGVAAGD
GUT_GENOME260024_000506-80IPFLILAVMGLLFGIGIGLVSKKFSVPQDPKFPLIRDALPGANCGGCGYAGCDAYAKAVVSGEAPINACAVGGEA
GUT_GENOME259553_007653-91VLYIILIIVGIMAVASGVVLLINHFSRKKKGLSTEAEFVYKMLPCVDCGMCGEKNCSEFAKRVAASQRNPEECKLIKPEVCEKIHKYFK
GUT_GENOME180609_000861-79MDTIWITVAVLGVTGIIAAVVLWFVARKFYVYEDPRIGEVEALLPGANCGSCGQSGCHAFATACVNASSLSGLSCPVGG
GUT_GENOME011645_007071-77MTAILNAFLIIAGLGAVLGLGLAIADRKLTSPKDSKLEELEGMMPGANCGGCGFAGCSAYADAVYQGKVLPGLCSPG
GUT_GENOME115017_0077714-78MGVIIVAVDNNLSKTKGNKEDEILKLLPNINCGGCGYVNCSNMALEIIKNKEAIYKCRPLKNKEE
GUT_GENOME111314_008371-93MEAIIWSLAVIGGLGILFGLLLGIASKKLSVEVDERVEQIRELLPGANCGGCGQPGCDGFASALVEGTVSVGNCSACSAENAQKIGQILGVSV
GUT_GENOME254370_003404-87GMITAVGICAAAGFFAAILFGKAEKDTSVRREVLREKILDCLPGENCGACGFGDCPAAALAVAEGRAAADVCPAGGTAAAQAIG
GUT_GENOME096574_011871-88MDFTQIFPPVAIMSILGGAFALLIGVVSKLTHIDVDPRITQIREALPGANCGACGYPGCDGCAAAMAEGKAPLNACVVGGAKVTDKIA
GUT_GENOME217842_003971-86MPILTAIIVLGGIGILGAVILYAVSKRFFVKEDPRIELIEQLLPGANCGSCGRSGCHDFACACAKASSLDSLVCPGAGNEAMAKIA
GUT_GENOME280822_0017828-109VGIVVAIMASIGLVFGLLIILISKFCVIKENPKIAEINAQLCGANCGGCGYPGCDGFARALVEGKASLDACGATSKENKIEI
GUT_GENOME097835_011563-92LWSILTPVLIFIGLGVLAGLLLSVFSKIFAVPSDEKVEEVRAVLPGLNCGVCGFSGCDNYANSIVHEGARTNRCVPGGDDTAKKISEILG
GUT_GENOME243844_0052420-103ISTAVLAIAGLALGLALVIVIVFHFFSVETDEREEQLLEILPGANCGGCGYSGCQGYASALASGKDKTFNRCTAGGAETAAQVA
GUT_GENOME212238_003825-80TILTPAIFLGIAGAFFGVILSIASKVFYVEVDPKVAEVRDALPGANCGACGFPGCDGLAEAIASGKAPVNGCVIGG
GUT_GENOME179445_0017324-105AIIIACVAGVLAFFLAYLGTKMTVERDARIDDVKRHLSGANCGGCGYPGCDAFAEALVKGETTLAKCNATSKQGKENIARIL
GUT_GENOME257076_0181916-105ILLFVLLLAGITSLLTAFFGKRYALRLDRTYCGRMSALLPGEDCGACGYSQCSEYAEAALYMDADADRCPKMTAPVRQQFGACRQEFQQL
GUT_GENOME254219_012995-88ISILKAIGIVLAISAILGLLLALASKYLSVKEDPRIAEVTKMLPGANCGGCGYAGCSGMAESIVSGANKNLSACRPSKPEAREK
GUT_GENOME232602_004648-82ISVFTFSVFILGAVLTYIAHRFKVEPKDPMQHLVEACLPQVQCGQCGYVGCAEYANAILLEGAPINLCPPGGDKT
GUT_GENOME248497_019344-90ILIPILAVTIIGLICGVGLAVASHVMAVKEDERFPAIRECLPGANCGACGFTGCDGYAKALLTPGTKTNLCVPGGAEAARKLAEVLG
GUT_GENOME011001_010635-85IISNILILGGTALAAAIVLYIVSQKFRVEENPKIAEIEALLPGANCGACGKAGCHAFAVACANSSKEQFKELFCTAGGKEV
GUT_GENOME183018_002765-77MLPIGIVSGMGLIAGVILSLATIIFHKPVDEKEEALREALPGINCGTCGFSGCDGYAKAMAQGQAGATNCTPG
GUT_GENOME054676_009591-85MTIVYAIAVLGVLAIVFGLILAVAAKVFEVEVDPRLPEIQACLAGANCGGCGYPGCGGCAEAILAGKAPVTACAPAGAEGAAKIA
GUT_GENOME177666_009073-82NEILIAVVILGLTGLAMGLFLAFASKKFEVKVDERVSAITGCLPGANCGGCGFPGCGGYADAVVNKGAKPNMCAPGGKAV
GUT_GENOME203577_006741-82MIGVIIFTIIAFIISIILVNVSNKVDKKIDKKEEIFNLLPHYNCGVCGFINCDNMALKILENKDNVLKCKPMKNKEEVINKI
GUT_GENOME079406_007551-79MQGIIYFTIIAFILSIIIVTLNNLLNKRNKKEEEILKMLPSYNCGSCGYLGCSDMASEILKDSSALEKCKIAKNKEEIL
GUT_GENOME096866_01072221-307IVIPIITVCVLGIVFALILSVASTVLAVPKDQKEEDIRAMLPGANCGACGFSGCDGYAAALAKGEAKPGLCAPGGAAVAKAVGDYLG
GUT_GENOME199275_001461-91MTGIIIAACAVGVTGLLLGLFLGFMGKKFAVEVDQKEIDVRAELPGNNCGGCGYAGCDALAKAIAAGEADCGACPVGGAPVAAKIAAIMGA
GUT_GENOME119008_0071324-94WNVVLIALGIMGGLAILLGLLIILVSKVFAVKTDSRISEIAALLPGANCGGCGYAGCQALAEAIVAGKATP
GUT_GENOME044386_016796-77VVLALGVLGVVGLIFGILLEVADKKLAVEVDPRISKVGEALGGANCGACGYAGCAALAEAIVSGEAKPNACP
GUT_GENOME211663_011851-95MNPILMAIVLVTVIGLIGAIILVAASIFMYVPVDERVEKITAVLAGANCGACGCAGCADYAKSIVENGNAINKCTPGGAKSVAAIAEIMGVQAEA
GUT_GENOME048490_007255-88AILLSAGVMVVLGVLFGVILTIADKKFKVEVDPRVEAVRACLGGANCGACGYAGCDAFADAVVAGKAPVNGCSPSGAKGAEAIA
GUT_GENOME097742_0126135-104AVANRFFSVQADPKQLKIRENLPGANCGGCGFPGCDAFAKAVAEGKAPPNGCPVSNAEQKAAVAAVMGVE
GUT_GENOME147939_005163-97AMIQPVFILGIMGAAFGILLGVAAKAFKVEKDERVEKVSDLLAGANCGGCGYAGCGAFAEALCAGKANLSDCPSTKAESKDRIAAILGVSTGAED
GUT_GENOME035418_015491-78MKEILYAVLVLGIMGAVFGAVLAIASKVFAVKTDERLPKLIEALPGANCGGCGFAGCQAYAQAVLEGRAEIGLCVAGG
GUT_GENOME248749_014534-87IIPALVFAAIGLLAGVLLTVCSKVFEVKTDERIDKVNEVLPQVNCGSCGFSGCAAYAEAIVKNGVATNLCNPGGSEVAKKVAEV
GUT_GENOME237868_018081-87MNTIVIAIVLVSVIGILAGILLSYASKIFAVKEDQLFIDLRAELPGANCGGCGFAGCDDYAHALADKSLNTPCNKCAVGGPAVAEKL
GUT_GENOME025387_002084-89IDILWAVLLIVLIAIVLAIALALAGKFLAVKEDKRAEEVMKNMPNANCGACGFPGCAGLVGAILAGDEKHIKKCAVIKDDKAQIIK
GUT_GENOME096561_003677-87WSIIVLGGLGAVFGIMLGVASKKFAVEKDETAVAVRECLPGANCGGCGFPGCDGLADALAKGKAEIAACAVLSQEQAKEIA
GUT_GENOME258607_0108228-97FAVKQDERAEKIREVLPGANCGACGYAGCDAYAEAVAKGAKTNACVPGGQKVAEKISAIMGVSAEAVQPK
GUT_GENOME112985_014675-84LFAVLVMAILGLVLGLVISLAGKAFYVRTDERVDRVRELLPGANCGACGYPSCNEYARNVVKKGEKTNRCTVGGDKTAAR
GUT_GENOME236439_005769-89IIWSALILLGTGFILSLLFFFAGSKTSGAEKRINNLLPGMNCGQCGYIGCAKYAEALIKGEAVPNLCRPGGPDLVQLLADA
GUT_GENOME265986_002681-87MNIFNITIIIVSVALAVFAVVFIVALIVKKSKPKNASVDLVTKLLPGVNCGACGKEYCKWLAEDIVEGKAKAEDCPLLSFENRAKIE
GUT_GENOME127134_0037614-101TILYSTLVLLVLALLFGFTLAVLGKKFAVREDERIKLVREKLSGANCGGCGYAGCDAFAKAVVEGKADINACAATSADNKKAIADIMG
GUT_GENOME235388_009061-92MVTSILVASISSALIALILAVLLIFSSKIFRVEVDERVTEVTEMLPGANCGGCGYPGCAQFAKALVADEAPVDGCSVGGAATAEAVAKYLGK
GUT_GENOME009775_00162181-260IEMTVFIFSVVLLAVLGILIGILLGVAGKVFAVETDERVEKVRECLPGNNCGGCGYPGCDGLAEAIVAGDAPVNGCPVGG
GUT_GENOME093517_008796-92YGLLIFAIIAAVLGLVLIIPLLFIRGENHEDIETTVRELLPGDDCGKCGYKNCDELASAVAKGECELDECVVIDSELAEEIDEELNS
GUT_GENOME283315_000495-78VLPAIIVAAVGLIAGIILTIAAKLMFVPVDERVAAIEEVLPGANCGACGFAGCSDYAKALGNDADVSTSLCPVG
GUT_GENOME170646_004895-89SILTAVAIVSGIALIAGLGLAIASKIFAVPVDEKAEAIRECLPGANCGACGFSGCDGYAAALSKGETTNTALCAPGGNDVAKAIG
GUT_GENOME258967_010534-81ILIALACFAVISGILGVILAIASKVFAVHTDERIEKITECLPGANCGGCGYTGCEALAKAIVEGKAPVNACNSATDEA
GUT_GENOME069343_012881-90MTIFIASICIVAGLGLILGLALALADKYIAVPLDEKQEQILNLLPGANCGSCGLAGCGAMAKALSEGKASLSQCPVVNQESRNKLAGILG
GUT_GENOME092027_016781-93MQILIAAVILGALGLIFGAVIFVASKYLSVPTDPKRDAVRECLPGANCGGCGFAGCDSYADAIAAGKAAPNLCPVGGDAVAGKIAEIIGVSAE
GUT_GENOME190132_013263-77TVVWTVALMAAVGLVGSVLLLIVSRKLAIGEDERLTYLMSILPGVNCGACGHPGCEQYAKAMMNGAPPNACTTGG
GUT_GENOME015077_001214-82ILWAGLIIGSIGLLGGILLCVVSKLCQTNKSNEKLEKIRAALPGANCGSCGFAGCDAYAEAVEQGNAEPGLCAPGGTQT
GUT_GENOME004399_002293-89ILNSILVLGVMGLIFGGILAYAAKKFAVKVDERVEAILEVLPGANCGGCGFPGCGGLASAIVDGSASVNGCPVGGSDCAAKIGEIMG
GUT_GENOME033024_012491-82MWKSVALVAGLGGMFGAILAVAGRKLAVHTDPRIEEITNLLPGANCGSCGYPGCGGLAEAIVAGRDGVSPCLACSPDSKKKI
GUT_GENOME113654_0046823-96FLSFIAKTSKQGDESTASKIEKILPGVQCAQCGFPGCSAYAEAVASGTALCNKCIPGGPDTVKEIAALLGIDPP
GUT_GENOME000537_024714-114IITPVILVVVIGAIAAVILTLAAKFMAVPVDETAVAVREVLPGANCGACGYAGCDDYAAALAADPKGVAPTLCIPGGSTSSSAISKILGVDAGSAVPVVARVRCSGISEKT
GUT_GENOME113664_004502-82MEILIPVLLVVAMGFVFAVILTIASKIFFVPVDETVTNLREELPGANCGACGFAGCDDYASALAADHSIGTARCPVGGAEV
GUT_GENOME018118_007809-86IAFAVLLGIGALLGVLLTIASKKLAVKEDPRIKEVEAMLPNANCGNCGYPGCHELAAALVSGEETKVSKCKVGNKDKN
GUT_GENOME208175_0723117-84YASRRFAVEDDPVVEKIDALLPQSQCGQCGYPGCRPYAEAVGTQGEKINCCAPGGEAVMLKIAELLNV