UHGP-MC 17948


Information


Number of sequences (UHGP-50):
57
Average sequence length:
111±12 aa
Average transmembrane regions:
0
Low complexity (%):
1.14
Coiled coils (%):
0
Disordered domains (%):
0.35

Pfam dominant architecture:
PF02156
Pfam % dominant architecture:
351
Pfam overlap:
0.13
Pfam overlap type:
shifted

Downloads

Seeds:
MC17948.fasta
Seeds (0.60 cdhit):
MC17948_cdhit.fasta
MSA:
MC17948_msa.fasta
HMM model:
MC17948.hmm

Sequences list (filtered 60 P.I.)

Protein Range AA
GUT_GENOME233268_01682664-774PIWTGSYSFIENSWGSVQLAADLLADVTVGQTFYVTVEGAESGAQISIKSMAAGWPSLESANEDDKWGVTNLSEGNSEFSFSLSAEDLASVKEFGMVVSGQNYIVTKIAIK
GUT_GENOME021542_02066493-618LKQVALSILETVLWTGSVDLGNWANGFQDLAWSGYDWTTVSVGQKLLVYFEQDTAADFWQLKLGQGNGWNTLPDFKDFAGGADAVDIAAGNTSLEYVLGARDVATLQDGGQGLIMQGANLIIKKVA
GUT_GENOME095416_02039146-261TIWEGESVFGSWADGFSIAADKFANAKAGDTIEFIYTADTEGDGTTKPTYWQIKTVYPDTEITLEGNANELNSYGCAPVSSGSTSYKITLTETDVTNLKAKGLFANGYFLVVTKVN
GUT_GENOME274708_00342152-262IWSGECVFGNWAEGFNVPAEKFADASAGDILEFVYTTDTNTKETWWQFKTIFSGTEETLSSNKGDLNEYGCASVASGSTSYKIILNAEDVAKLKEKGMFVNGHYVIVTAVN
GUT_GENOME003640_02449169-282EQVVWEGSSNFGTSWDGSLAVQISADKFASAKSGATFTFYYDCNSDADYSQIGICDGSWNTLSSAKDADPQWGTINVGGTTSYTTTLSDDDIKTIQSGGMVIKGYNTTLTKVTF
GUT_GENOME108547_01350670-786VQEIPQEKTIWKGSFAAGNWSGNQDLAWGGFDWSTVKAGQKLIFTLTQDTAQTYWQFSLRHGDGWGELPEKVVFEEMTAGQTRVEVVMTQTNLDDLIAKGGLVITGCNFTLTEVAVL
GUT_GENOME237877_0198821-138RDIVLLEGDYKMTWGPDCCVDPMKFAGLSGGDSILVSVKDVNGYAQCSVRDKQNGWKEVTPELSYFLVKGDFSFEVSDSLAHAMRHHTLMFSGTNFTITKVVLRTKRPADTREGYDDL
GUT_GENOME095002_0188522-130EQVVWTGEQVFTDWGDNIIVPASDVNFVADGDKVIVYFSEVKEGASIGVKTNVEGWPELKGTGFKNPAVGDAKAEWDLGAEAAEELKATGLIVQGLNMTVSSIAVLTAA
GUT_GENOME095119_01489537-628GPLDTGSWSSYAQIPSASFATAQVGQVITVVVSNLQSGAQGSFKNGSTWGAIAPGMEYFTIDGNFSLTITQDVLTQLQVGGLIVSGQNYTIE
GUT_GENOME236229_02235158-281VLWEGENDFGTSWAWNQTLSLSAMKFASAKEDKSPKLDFYFTEDVANADYWQLKTLIEGNQTALTSNANEINAYDCVELQAEATTYSITLNAEDLANLKAKGMRIQGYGVILNKVTLTNQSSDS
GUT_GENOME206559_00882152-291EVEVSTTLWAGEQAVVGWSGAQQFTAAMCSGFKAGDRIVVGVSAISGDQSQVDLRNGAGWANFTPGLNEDITGKDMPCAVTFTLTEEVAEIVRTNGMVVTGCNFTFTKVEHITTALTLPSDMKGNAARTVWEGDEVISWV
GUT_GENOME034989_01555655-769ISLETTIWSGAWDCGSWGGNQDLAWGGYDWSTAKPGQILRFYMTPTVAPGEWWCISLRHGDGWGNLPDPVPGQYDTPENNLLELTLTRGILDDLIANGGLVLTGQGYQLDKVTIE
GUT_GENOME247475_00427129-247SISVEPASEILWEGLKVINNTGWTVGVLESYLFANVKEGDVLFLSVEQTSEGAECVLKERSTWKNLHSAVPNGNSFRDFKSDGVSLFSFSLDADAVTSLKTHGMLVAGSKYNLLSVALG
GUT_GENOME272684_00710527-634IADGAAVVIYAGPAVNAGNWEGYVQLGSELFADAHVGSIITVYISNIGADAQGAFQNSSWSSIDPAYDGFALSGDSFTMTVTESILSQIQSGGLIVKGKNYTIESVTI
GUT_GENOME270792_00746331-444SAVLWEGSFAMGNWSGMMEYKNVAENDQWKATAMAGLEAGDKLVFSYSGVESGAQIQLATFAGSDWTWTELVPYDDIVGGQYIYTVTDDDVEMLSEHGFAVKGQKATLVKIELV
GUT_GENOME023340_00575738-841LFEGSTDMGTDWEKSETIAKFNAVKGGTITAEFEEGSTEYWQLKFMDGGWTPLTSPKTNKWDSVELGKNATLYSFDLNAADAKAVSSNGMVISGYGVTLKKLTY
GUT_GENOME233268_01181432-550RDVWTGSGVLNWDEGAQIKIAASEFANFSTESKIILTYSFDGSASYTNIQFCDQSWSGMTLSSTDGAFTGSDFDPASHDLTPGTYTTAFAVGQETVTKLKSGGLILQGYGVTLTKIALS
GUT_GENOME284158_0119125-160LWSGTLTCTDDWGGSQSIAADGLQQAVEGDVIAVTVSAVSQTASYPQVSLRKTAGWDQFEPAVGVILQKDAALPYEARMTLTADVVDEIKANGFVLTGCGFTATSIDLIHKQELAEGEKGNPVHNVWTGEKAIDWS
GUT_GENOME012456_0026730-122GPKTIGSAWKDKIVLEPRLFANVEAGDVLTVYTDSYKRSSQGTFQNPNNWQAIAEQYKYFGIAGPFRMAVTTDMVPLLKQYGVCIGGHNYRIL
GUT_GENOME085999_02728542-665SISFIPTERPETVIWEGTFTTTSSWEGNQDLGWNGYDWSSVEPGTVITVYYTLNTATTDWQLRLGGCGIGWAALPTIPAVSLEAGSTSFSATLTAEDLAVLSDGNNSGLVVTGCNFTMTKITLK
GUT_GENOME237724_03742274-380TQIWEGEIVMPGDWSGNVQMTDDAAKAVFADAQVGQVIRVAVKDVAAGAQGSFKNSGWSEIASGTDYFDISGDYTLVITEDVLKSLQEGGLIIGGHDYTAVAVYLEN
GUT_GENOME034925_0204248-137GGWKDNIVVLPQQFADTKVGDIIRVYTATAKSFAQGCFQNPKDWKPVAPEYAYFNIGRTFSLTVTDSILPQLREHGLAIGGHDYVIGKVT
GUT_GENOME242947_00158151-270YYNDVAAETATTIWTGSMEAGNWAGHTKLESSFFANCNAGDKLRVHVTELSTTDGKIWLQNGSWADFDPIVSYTFVATDNVPMDVEFTLTKDMLSTINSTGALIIKGQYYTMTEVTLISY
GUT_GENOME233264_0049753-145GPKTIGKGWQDKIWVEPAQFARCKAGDVLTLYTADACPWGQGALQHPKTGEPIAPQYAWFGVMEPVTVTLTEPMATMLQTEGLTIGGHEYTIV
GUT_GENOME241031_0159924-137TLWEGSTEFGTGWDKSVIIPASDLNAIGDDQATFTIEYTLDSSLGYWQYKLFTNATGWPALEYSEENGNDYSCIAFDAGSTTTSFTLGAADIATIKATGLGFQGYGYTLTKVST
GUT_GENOME034989_0166522-137EITLWEGSCNFGTSWSESFGVPASDLSVLDNESAVLTFHYTLDTDCSYWQYKPCSNGSGWTPLEAATELGNSYQCISVEAGSTKTDLPLGAKDIATIKENGLRLQGYGMTVTKVTY
GUT_GENOME236889_01019497-607IWEGTLVFDGGWGNNLQLTADKFANIKDGSKIYFYYTLDLTSSYWQLKPMDGSWTALSYNNIVDPQWQCIGMAADSSEFAMEVNAADVTALKSTGMVLNGCWLTVSKIAIK
GUT_GENOME236463_02098170-253LEASLFTQAKAGNKLRFSYSNLACGAQAHISSSTWGDITDATDYKSLSSSYYEYTITEAMLTELQKNGCIVNGIGYTLTSVDII
GUT_GENOME113000_00509247-363VWKGESVMPGDWSGNLQLTDDASKDFFSKAIEGATLTVMTKDRAAGCQGSLKNSSWSEFAKGLDYFNNDWDGAHPACIEWTDDYFALTLDSNMAEELRNNGIIVSGHDYTVTEVILT
GUT_GENOME035916_00366892-989PIDWNAGETNNWVSIPASQFRNAEAGCILRMCFSGLSMGAQGHVCRGNWSDLADAAEYLQLTGSSFSFEITEPMLAQLKADGLIVTGTGYTLETVEIV
GUT_GENOME255115_01481557-671IWTGNFDLGNWANGMQDLAWGGYDFSALQAGNTIIVYFTQDATASSWQLKLGRGSDWSTLPDFQEYAGGGDATDLTAGATSFSYQLGANDVNEILTNNGLIFQGANVTLTQISII
GUT_GENOME041946_01794220-328GNFDVGTSWGSSKGLAADNFADAKAGDTVTLDFTENSSEYYWQLKIMSAADGWPPLSGPSPLNAYQCVELAAGDMTFSFKLTEDDVATLKANGMMLSGYGVTYTKLTLA
GUT_GENOME218838_02386255-355FETGAWANNLQIAPTVTVEGQEEPVNIFADVTVGQSIQVTFTDAATDAQGSLKNSSWSQIADGTEYFDISGSSYTLAITQSVLDQLKEGGLIVGGQNYTIT
GUT_GENOME234137_01157906-1024IWSGETALGSWEASVGDLSWGGYDWTTAKSGDKLTVYFTVDESIGYTDLRFGNGSWTALPSTLADPATDKDGNYSGFAADATSKSIRLSQADVDVLVSEGGLVLCGAGLVIKAVELCHA
GUT_GENOME062369_00026153-249WQGEKVMPTDWGAWLTLPASKFADAKIGWTLRLMASDVASGAQAQLSSAAWKCLANKSFGGKYVDFELTQDILMELQQNGVNINGCGYTLTAVKLFD
GUT_GENOME212983_00790665-777ETTIWTGSFTVGDWDGGFDALSWDKYDWSTVKAGTVLRLYCTPTVADGEWWCVSMRHGQGWGALPEPCPGQIDTPAGGVAALELTQAILDDMVANGGLIVTGAFFTLTKVTLE
GUT_GENOME236229_02191144-236PVVLGNWKGECKVASDKFIDAKVGDKLLITFTDVESGAQIQIADGDYNAIVEYDDIVGTTYEFIITDEALASFQETGLVLKGQKATITDVAIC
GUT_GENOME049628_01036225-327PVVTGGWAASAQIANAKLPELKAGDVIRVYISDVQSGAQGSLKCTAAGWPGIDTDFEYFEITQQDIDAGYYMCTLTENAIANLSGNDLIVSGQNYTINKVSVF
GUT_GENOME233266_016193-149IRSFITSALATVSAAAILTAGVSAYDLNKDLGTFWSASVTVPSSEFEGITTDSQITVTFTTDDSLADVDGHSYWVIKPMVNDAGWPLISGISEMPASEDGSAYVVQPGDTSVSFTIPEDFIEHVQIAGIALMGHGVKLETLTVSNDA
GUT_GENOME100296_0098122-131VIWEGNVTTGSWGNQEGVSCVKIGNANFATASEGDEIVVTVSEVAAGTEYPKYILKNADGWADLPGSSTIDTPSAGTYSSELEAEAVEMVKANGMIIQGDGLTITKVELN
GUT_GENOME061897_0190928-141SLWKGGQECTDDWKGWQQVTADKCALAAEGDEIVINVSTISPTCQWPQVMLNNSSWSSLGDAKSFLLTGQSAPTTATFTLTPSMVTEMKAGGFIIKGCGYTFSEVILRHKIENG
GUT_GENOME021645_0019715-122FATAAERTLLIGPKTIGRGWKDNILVEARQFADVKSGDLLCLYVDNAKKSAQGAFQDIENWQGVAPEWGCFNVTGTVRMKVTQEILDKIQARGMIIGGHDYRILRLTH
GUT_GENOME249924_00213658-762ETLWQGEQVMGGWSGSVQMDASLFVNAKVGKTLVVSIKDLEPSVTYWQVGLKKNTAGWPDLKMVDLAADATGHEFIIDDAMLAELTTNGLIISGCNYTLTKVELK
GUT_GENOME208504_01776243-355PVVLYGGPDKVMPSDWSGNIQLTTQAAKDILAGASIGSRLAVKITDVKPGAQGSIKNSSWAGFVDEHGKNWDYFDISGDSYSMTLDQTTLNEMRANGLIIGGHDYTVTGVTVE
GUT_GENOME243710_00107120-243VVAVALVSEKTKPTVEKIIVSDPTVIENWNGLQIPADKFGDVKVGDKIKVYIASLKDNSQGSLKTMNSNWTQIAEGTEYFSISGESYSLDVTADILEKLQSTGLVISGQHYTIVKVCVISAAEN
GUT_GENOME234551_01769683-780VSVDSWSVYQKIEAAKFANAAVGDELAITIPSLNGSNHQLFLQNGNWKTLAGVDEKYVISEAPYTFKTTITEEMLAELQDKGIVVKGIGYDLSSVDIK
GUT_GENOME139105_00733388-496VLWSGSEDLKTDWSASVSIPAEKFADIKAGDTLVFTFTKGSADYFQIKIMDGDWTPLASPKTNEWDCVELSVSPYEVKISAADASALKAKGMVVSGYGVVLKSVAVKSA
GUT_GENOME236229_0220228-145REIVLLEGPFEADWSDKHCVDPMQMVGVATGDIIHVYTSNVQSYAVAYIRTKANGWAAISSEYDNFRITGDFECTLDETLLAVIKNQTLMFGGSGYTIEKVTLITERPEDMREGYDDL
GUT_GENOME250642_0014825-125VVYSGPAKVCGWEGVDLPAGVFADAGAGDVVRVYASGFLGADNSGTAGFMSSGSALFPDETEFKVYGDFTLIVSPSVLSKLRSGGLTVTGYNYTIDKITLE
GUT_GENOME266041_01918291-407PAQETVLWEGSVDTGSWNGQTVEASLFASLTADMTYTLYFETPYETGAQLTFKNPGDWTALLSATPSDPEWGCYTIQSGETSYSFQLSAEDLAKVQANGMYLSGQKVVITKFVVAAQ