UHGP-MC 73772


Information


Number of sequences (UHGP-50):
391
Average sequence length:
55±5 aa
Average transmembrane regions:
0
Low complexity (%):
11.4
Coiled coils (%):
0
Disordered domains (%):
0.07

Pfam dominant architecture:
PF12389
Pfam % dominant architecture:
384
Pfam overlap:
0.68
Pfam overlap type:
equivalent

Downloads

Seeds:
MC73772.fasta
Seeds (0.60 cdhit):
MC73772_cdhit.fasta
MSA:
MC73772_msa.fasta
HMM model:
MC73772.hmm

Sequences list (filtered 60 P.I.)

Protein Range AA
GUT_GENOME088589_008051-63MKKSTSTKRALTMSVVSLLVCAAVFIGSTFAWFTDSASVNVSSIRSGKLDIGLEMSEDGVTWT
GUT_GENOME130638_014351-58MEKFRRFFQEKFILSAVMCLLLALVLITATYGWYAINNSARAYGIDLNTGGIGGIKVA
GUT_GENOME283622_0058477-127KRKLKQSFGFAFVALIALIALGIAWFMSNSKVTSTGTSVSAQDDRLFELAS
GUT_GENOME215936_004662-63KKLNKRLMGSVFALVIALALATTSTFAWFTMNTTPTVEGFNLNVTAQDGLYISVSEVGGTNG
GUT_GENOME153052_004263-60KNKNLILSSFFMFAISLACLITASYAWFSMTGSVEVTGPEFKASAPENLQISNDGNTW
GUT_GENOME130638_014398-62KKVRGWKSKLLICLLYVAVLSTATYAWFTLNNKPKVYNLTLTAGGAADLLIADDL
GUT_GENOME283683_008307-62KNTSAKKKLIPAVAMLTTSAVMLSTATYAWFTMNKDATVTGLQLTATTSNSLELSL
GUT_GENOME256840_0130617-73KMRNKVIAVISMFALSAVMMTSATFAWVTLSQSPEVKGISTTVAANGSLEIALSDID
GUT_GENOME060638_008922-55KKKGLIISTVVMVVVLIASLTTATYAWFSAQASATVDDLSITTKAADGLQIAMT
GUT_GENOME257154_007607-68KKQKVLMPAGIRSKLMAAASMLMVGVIMMVSSTYAWFTLSTAPEVKNISTTVAGNGSLEIAL
GUT_GENOME009824_014372-56KKLTKKLFLSVFTALFAVVALGTSTFAWFSMNTTVQATGMQAQAKADASLAIAKT
GUT_GENOME114329_001951-56MKNFRKLIPAVCMLLLAAVMAGTSTFAWFSMNTTVTAGNMAVSATAPTNLRICATN
GUT_GENOME009824_014382-61KKYYKKIVFFLFATLLAFVTLSTSTLAWFSISQINYLDNLEVNIKSDVKMQISLDGQNYY
GUT_GENOME000467_0318328-86KQENPRQLRRRFTAAMVALFLTFLAVASATYAWYIYNTSRHTTNVRMAAGAGVNLQISN
GUT_GENOME165930_015911-56MKKFFSLLKRNKLIMPFVSLIVTGLMLVASSLAWFSMVDRVDGGGMTVGVDNRFIT
GUT_GENOME239006_018896-62TISMLKKQLIGAAAGALTALVALSSATYAWYVANNTVKATTNTVSAKTNGFVLQIND
GUT_GENOME118110_003942-56KAMRKILPALCMLLVSAVMLGSSTFAWFSSNSKVTASGMQISVSTPTSLFISTES
GUT_GENOME046130_0092117-64SLTLLLVAVISITTATYAWFTLSNSTSVQSMEVRVGTGTKLMVSTSDS
GUT_GENOME139859_016071-63MKTRKSTKRALLLSVCSILLCLVMLIGTTFAWFTDTASTAVNTIKSGTLDIVLEMKKADGAWE
GUT_GENOME254518_0004524-80QRRKRNKKDIRNAFVMLCVTIAMMSTATFAWFTMTDSPTVQGLKMTAATTGGLELKD
GUT_GENOME011245_0067820-69RKKVLMIFAMLLCSGILVGLATYAWFTLSSNPEVRGITATVGANGSLEIA
GUT_GENOME232412_012901-52MKRLILTLGLLLVSAALLGTSTFAWFSMNKDVAITGMKVKATTPAYVYISNA
GUT_GENOME019228_0190917-75MKTKSQKLKIQLMGAILSASVSAFALTSATYAWYVANNKVTSETSSIMAEANGFMLQIV
GUT_GENOME225803_014642-57KQTSRKKVLVSSVAMMMVATVSLGSATFAWFSQNTKATASGITAQTSQSSNIVLSE
GUT_GENOME000079_023081-60MNGKELKKSTYMALIGVLISAVALATATYAWFVNNQSVEVERVSFSSEASSDLLIAVHDG
GUT_GENOME282198_0102722-82KLGVAALGLVIAALVSVSATFAWITLSRAPEVTAIDTTLSANGALEIALNRGETPEDADID
GUT_GENOME162900_0088916-73VAVKQLKKQLIASILSVAVAFVALASSTYAWYVSNNTVDATTSTIAAKTDNFVLQVAK
GUT_GENOME236505_000281-51MKKIIPSICMLLVTAVLMGTSTYAWFSMNKKVEVTGMKVTASTSSNLVISE
GUT_GENOME009596_007649-65GLKKQLAAAIAMVLVAAISLGTSTYAWFVNNTKVTAGTTKVSATTANTLLIKNKDNE
GUT_GENOME012714_008173-52KKRKAFINLIISAVLLVCTLFVSVTSFGWFAFNDSVSGSGMGVVVKTDET
GUT_GENOME105006_000854-63SVKRLKVQLVGAALTVVVSAVALSATTYAWFVSNSRVHAATSTISAQANGMVLQIVEGNT
GUT_GENOME118413_001021-72MNKRSSTKKALTASIMSMALCTVLLIGATFAWFTDSVNSGTNKIQAGNLDVAFEYSKDGGTNWTEVTKDTDD
GUT_GENOME101989_007433-57VKQLKKQLFLSIIAVLITFSSLSSATYAWYAMNTVVTASGMKISANTEGLNFEIT
GUT_GENOME237439_007534-55RSKLLLSATCLLTISVAATATSAYAWFVSNRQASVVINNANVKTNATNLKIA
GUT_GENOME238996_004543-62KRNKSQQRRVKNLALVFTLTTIVLVTSTFAWFIGMRTVNVNQFDVTIAAVDGLSLSLDGE
GUT_GENOME036520_008803-57KNARKLIPAVAMLLVSASMLSTASYAWFSMNSQVTAGGMNVNVAAPANLMISTDK
GUT_GENOME266006_0075650-101NRKKILTAFLMLIVTAISLTTASYAWFTENTTVTVDSIDVNVSAANGIQVSV
GUT_GENOME139957_001503-53KRRILLALIMLIISGVSLTTATYAWFTANRAVTVEEIDVKASASGGIQISA
GUT_GENOME023864_001635-53KRQYHIKILASALLIVLSLSMLVMVTFAWYVMSTAPEVSGMQIKIGSDT
GUT_GENOME237439_007541-57MKTRSKLLLAAVSLLTVSVAATATSAYAWYTANRQVSGKVTQMAVQATSGTLAISHN
GUT_GENOME010548_0083917-62ILLLFLTIVMFSTATYAWFTANQTVTISTLDMHVETSDGLQISTNA
GUT_GENOME035267_0002011-68KAAWQKLMGAIAMLLVASIMMSATSYAWFVLSTAPEVTDLTTTAGANGALEIALQSTN
GUT_GENOME233454_017533-61KSLKRKLIMSSIAVATAIVGTTASTYAWFVSNTDVTSSVTGNVASSDSSLYISKDSSSF
GUT_GENOME007885_003814-55FRKLVPAFAMLLVSATVLASTTFAWFSMNNKVTATGMEVTAKANTQYLIVGQ
GUT_GENOME233266_0271910-62QSVLRRLVPAAAMLTVSAMMLSSATYAWFTMNKEVQMTGLKMSATAGEGIEIS
GUT_GENOME283611_011906-66SINSLKKQLVAAVAMVCVAAVALGSSTYAWFVANNKVQATGMTVQAQSEGGIVISNESKTD
GUT_GENOME127509_004892-59NSKKLKTQLLAAVAMTLVAVIALGSSTFAWFANNTEVKATGMNVVAKSNDTFLLISEE
GUT_GENOME131674_013194-57SKKSLLASGVSLLASAALLVGTTFAWFTDSVTNTGNKIQAGKLDVTLEKFDPAA
GUT_GENOME237487_009535-71RQKNSKRRLNSLILLVAFTAIMLIVSTYAWFSSQKNVTLTGLKGKVNVAEGLQISLDAEHWVNTINF
GUT_GENOME137183_0008031-91KGKSQKNKRKSELNAMFFIILIAAVLFIISTYAWFSTQKNVSITNLKGTVQVAEGLEISLD
GUT_GENOME030838_0149613-64AAKKIIPAVGMLALSATMLSTSTYAWFTMNKEVEMTGLSMTAAVGEGMEIAL
GUT_GENOME272669_000342-60QKSGKKNNRRKKNLVLLCAFTAIILTVSTYAWFIGMRTVNVSAFEVNIASTKSLLLSID
GUT_GENOME082130_012589-59RKRMMLSSLAMLLIATLTLGMATYAWFSMSKTTEVTGLNFTAEAANGIQIS
GUT_GENOME264398_008673-58MNAKRRMIPAIAMLAISAVVLSTASYAWFTMSRSVTASGITLKAVAPTNLLISNKV
GUT_GENOME237448_004755-58RKLTLSVLSLVAVVICFAATTYAWFDVNSEANVTGFNFTAISGEGFLVSVDGVN
GUT_GENOME069583_012691-48MVVVLIASLTTATYAWFTSADAVKVDSINLNVKSSAKVQVGVRVNETK
GUT_GENOME167458_008326-70KVSKRKEKEKNYRLFYMLVLLLVTAVSLSFSSYAWFTTNRIARVDLLNVNVRAQGGIEISVDGDK
GUT_GENOME282985_011602-72KKTQKSFRKKALLSSLSMLLVSTVAVGSATFAWFTSNKSVTASGMSVKAAAATGLQITKDNANEWAPSVTF
GUT_GENOME157360_000732-55KKSMLLTTIAMIVVVVIALSTATFAWFSAAGQSAVEGTMTVQAGAEFTIKVSDG
GUT_GENOME222393_0084617-70RTKLTAATVMLLIAAILMVSTTYAWFTLSTAPEVTGITTNVGANGNLEMLLLNK
GUT_GENOME095664_006364-60KALKKQLGAAIAMVLVAAIALGAATFAWFVNNNAVTATGVDVSTSSSVPNLYITSTG
GUT_GENOME237448_004733-66STKRKLILSSIMLAMTFTCLTSTTYAWFARNKEAWTEEFNFDIDNYDGLLISIDGKNFTASINN
GUT_GENOME134770_0094330-89KKHKKRIGLLILLLLFTGVMLGTSTYAWFTANKTVNVNKITVNVAAQNGIQISVDGTTWK
GUT_GENOME258266_005291-79MSSNKQTRRTLVASVLALVVCAAMLIGTTFAWFTDSVTSGRNTITAGNLDVELEYATVVDTQAGTLSEWKTVQDATDLF
GUT_GENOME171681_0093512-66KTLKGQLFGAVAMMLVAAIALGTSTYAWFVNNQTVQVDPMKLTVSTSTSLTVAVG
GUT_GENOME181233_012141-59MKTKAKLVLALSVLTAGTAVAGATGTFAWFTTNRSATLTYSKVTAQSSNAKLEVTMGSF
GUT_GENOME099237_014346-59LTKKRALISSVAMLLVAIIALGTATFAWFTKASVATAQGINVNTIKASELQIRS
GUT_GENOME009619_004531-57MKKSKIILPAVALLTISTVAAASSTVAWFTANRTVKVNTSTVSVYNPESDLKVTLGG
GUT_GENOME013280_0060814-70VRKRKRKIFLSILMILFTGIVLTMGTYAWFSANKTVTVDSINVNVSSSNGIQVSTDA
GUT_GENOME037803_012281-63MKTTSRKRLLISSVAMLLVAMLALGTATFAWFTTSTNPYADKFSAKTTKQSTLLLSDNSRSNW
GUT_GENOME222362_004099-64LKRIRIQSFLTCFLSLSTLFAAGSATFAWFSTNKRATAKYMNIVARDSSIVKTANY
GUT_GENOME008653_011022-58KTSSRKRLLVSSVAMLLVAMLALGTATYAWFTNTTSATAEKANVTVAAPSGLIIGAV
GUT_GENOME064818_007375-64LSIKRLKKQMAAAILSVCVAAVALSTSTYAWFISNKNVNGTTSSISATADGVVLQINAGD
GUT_GENOME025259_0143412-81RNSKPKEPSLYRSLCVSILCMFLSTVMLVGTTFAWFHSSATSAVSTITAGSLNVSTSLITQDSVESLNWV
GUT_GENOME011451_011251-60MKKFKKSSVIVPALARIAVTATASVSGTVAWFTANRAVSASLNSFTAYDENGSLTITATN
GUT_GENOME020894_001648-63RRAKKKSIATRIRLVLLFSVILIVNTYAWWSSTKDVKLSSLEADVTSWDVAYYVNE
GUT_GENOME013306_018061-59MKHKGLKTSLMLSILCLLVAVIALGAVTYAWFTFDPYTNVTPMEGKISGGKANLLISDK
GUT_GENOME000537_025592-54KNTKKQLFTAVAMLLVATVALGTATYAWFVNNAAVEVENMDFTASSSMALDIG
GUT_GENOME264398_008661-53MTKKLIVSAACLGMASLLLVYASWAWFSSARAVKATGITLHVETPNNIQISLT
GUT_GENOME014147_005571-65MNKITRKLILSISACAMTLVCLTSTTYAWFARNSVAWIDDFKLQIQQHDGLVISVDGINYSTSID
GUT_GENOME235539_006392-75KKNTTKRSLLASVLALVMCVTMLVGTTFAWFTDSASVSVSKIEAGKLDIDVFYADTADGSEGSNWTLLTKNSDP
GUT_GENOME020696_001301-60MKKKGLIISTVVMVVVLIASLTTATYAWFSVSSVTKIGEFNVSVVSDNAVNVGIKKTYEL
GUT_GENOME282269_002031-55MKGKSREKLKIASSLTMCLVCLFCIFAATFAWFAQNKNVGASGLQIGLGTEDGIL
GUT_GENOME191082_014241-55MKKFKKLIPAFCAMLVSAAMLGTSTYAWFSVNKKVEANGMSVTAQANTQYFVIST
GUT_GENOME271772_014544-57FSSKKALFLSVISMVICVSMLIGSTFAWFTDSATANVNTIQAGNLDVELVGKDG
GUT_GENOME109437_016591-54MKKFSKLIPAFCMLLISAMLMGTSTFAWFSMNTQVKATGMQIQAKSNETYLLIG
GUT_GENOME170739_002665-59IRQLKRELVGAIISLAITVLALASTTYAWYVANNKVDAITSTISATTNGFILQIA
GUT_GENOME170618_0020578-137EELLSEIEKDKKKAVRSLVLAITALIAMIAICIAWFVANNTVKSGTVAVSATDDARFVLA
GUT_GENOME014865_002214-57TFKKRAFISAIAMLIVSAIVLTSATYAWFSMAKRVEVESMELNVTSPEGIQISA
GUT_GENOME218104_014921-55MKATRKLIPALAMLVVAAVVMSTASFAWFSMSREVSALGMDVTVTAPNNLLIKGA
GUT_GENOME015626_008899-57KLIAAFLLLMLSLTLAAAGTYAWFTLSTAPEIGGMQLSIGGSNTIRIAP
GUT_GENOME156114_0054419-77AREKLVGAVFMFLIAIITTVSATYAWITLSTSPEVTSVDTTVTANGSLEIALANGTGDA
GUT_GENOME014147_005592-71KLRKKIMLGLWTLMLLAVTFTSTTYAWFKLNSHADVTFDFKVNGGLGFLVSVDNENFSNDITEDKLLMAY
GUT_GENOME008642_004681-74MNNKRATKRALLTSVMALVMCVVMLVGTTFAWFTDTASTGVNKIQAGNLDIEVEYRTTAGGDWKTLDNATDLFG
GUT_GENOME020465_009771-65MNETKNNSVKALKKQMAAAVAMVCVAAVALGSSTYAWFVTNNTVKATTSTISAQSNAAFMKIKYN
GUT_GENOME070915_006905-56HILKLTALALMVIMSLTVAVTSTYAWLTMSSSPVLEGLRVTVGGDNTIKVAA
GUT_GENOME243681_008061-76MSQSKHIRKALLASVLSILLCAAVLIGSTFAWFTDSATSGNNKIVAGNLDVELEYTTDFVTWNTVAGQTNLFEENT
GUT_GENOME037732_003267-70KNKRKSISYLVFVIALTAILLVMSTYAWFSTQRDVTITGLTGEVKVAEGLEISLDAENWGQTID
GUT_GENOME223386_0175819-73KKRKRLIVFVAIITLLSMTTATVAWFSVNTFAGVDKLDLNISLAAQLKVSMENHG
GUT_GENOME170464_0068415-72KQKRRLKTALIMSLMCIVLLSGATYAWFTIANKATVNRLALQVVSDGRLYIAGSRDNI
GUT_GENOME023488_009205-61FKKSKFVILPALATLVLTGVASVTGTVAWFTANRTVKASASQFKAEAEGGALSITTS