UHGP-MC 114239


Information


Number of sequences (UHGP-50):
152
Average sequence length:
65±5 aa
Average transmembrane regions:
0
Low complexity (%):
0.7
Coiled coils (%):
0.17
Disordered domains (%):
0.1

Pfam dominant architecture:
PF08843
Pfam % dominant architecture:
8882
Pfam overlap:
0.17
Pfam overlap type:
shifted

Downloads

Seeds:
MC114239.fasta
Seeds (0.60 cdhit):
MC114239_cdhit.fasta
MSA:
MC114239_msa.fasta
HMM model:
MC114239.hmm

Sequences list (filtered 60 P.I.)

Protein Range AA
GUT_GENOME147999_0097286-135EKDWQTNHGNFIEAFLLFLNKQTDQFVLKGGTALSLCYGLDRFSEDIDLD
GUT_GENOME234651_008954-75SKILIEKISHETGFIGSNIEKVIRLLDVLDFIFSKSSFQEKLVLKGGTAINLAYTNLARLSVDIDLDYHGSI
GUT_GENOME019128_0013218-83RVSESTHIPLAMVEKDFWVCFVLARIFSDAELRDALRFKGGTSLSKGYGLIKRFSEDLDLILDKSL
GUT_GENOME017802_000538-75LIDEVSFETGIAPSYIEKDWYLVLTLSLLKELNTADTKVIFAGGTSLSKAFGLISRFSEDVDFSVVGN
GUT_GENOME201157_011931-75MFSNANSFKAKIKNMAKDRGIPAQQLQQNFLIEQVLKLIAKSSYKDSFIVKGGYLIGQLIGLDKRTTMDLDVTLK
GUT_GENOME258425_0024520-78RRGIPAAVLEKDILLTDVLQAISEIRPRGFSLAFCGGTALAKAYKVIDRMSEDLDFKVT
GUT_GENOME008072_0307213-84KAIFIKAAFDIGIRPDMVEKDYWVSWTLNQLFADKKLGSIFLFKGGTSLSKAFHIIKRFSEDIDLLLDLGEV
GUT_GENOME014963_0038714-77ILIATSNALGIEMAIVEKDYYVSLLLKEINKNYPDIIFKGGTSLSKCYKIISRFSEDIDIGINA
GUT_GENOME193396_0113119-80VAPMMKMNEGIIEKDFYVVLILELLFHHSKFGKSFAFKGGTSLSKGYNIIKRFSEDIDLVMD
GUT_GENOME264202_0202211-93KALIFLTAKDMGILEFYIEKDYWVTYILKKLSSSSFKNDVVFKGGTSLSKGYNAINRFSEDIDLQLINSSLGDNQKKKLLKNI
GUT_GENOME101942_019334-77DENKDLFSQAIRAASNELNIQTQYIEKDYWISLVLRQLSQCEYADMTVFKGGTSLSKGYGLISRFSEDVDVAIL
GUT_GENOME206088_0007914-78INQISKEKGINEAIIEKDYFVSLILQEIAKENGNIVFKGGTSLSKCFGLINRFSEDIDLSCERKL
GUT_GENOME147482_0343513-75VSDALELGNPSITEKDYWVVSLLAMLESVESEHHQLVFSGGTALAKSNIKILRMSEDVDIKLI
GUT_GENOME143396_0371015-80VEIIQATADHLSIPAVYVEKDYWVTYILRSLSRSDYKERLIFKGGTALSKAYKLIRRFSEDIDLAA
GUT_GENOME206760_0071811-83KQFVNKISIETNIAMDILEKDYYVCCILQELSKKQDELQAYFKGGTAIYKILDTMNRFSEDIDLTVKINEELS
GUT_GENOME171447_0365124-87AQQHPSGLGASFLEKDLWVTEILWLLFNEDLLGDLSVAFKGGTALSKCWNVIERFSEDIDLSVH
GUT_GENOME080362_0305015-80VLETAAQMLGRPAYVLEKDIYVVWALGRIFSAPIGNHLTFKGGTSLSKVYRLIDRFSEDLDLTYDV
GUT_GENOME238202_0128712-80RAQILQQIAIARHLDATAIEKDWWVTMCLTALFQCRCADYINFKGGTSLSKCWHLIDRMSEDIDIAVDR
GUT_GENOME081295_0193512-75EVIYSAATDLNLPIPVVEKDYYVTMLLKQLAEKAPACVFKGGTSLSKCHHAIDRFSEDIDIAFT
GUT_GENOME258542_0124415-76LLASEAMRIDSGIIEKDYYVTMFLKSLVARQPQILFKGGTSLSKCYRLIKRFSEDIDLNLVC
GUT_GENOME231809_0104518-69VDYSFVEKDWFITRALNALANDPDLVFSGGTSLFKAHRLIERFSEDIDFLVI
GUT_GENOME274086_0053714-81IRQYSLATGIPETFVEKDIYVLKVLSVLANINYPDITIAFSGGTCLSKAYNKIKRFSEDLDFCIQTSI
GUT_GENOME243880_0011112-81VLIEAIHQKTGYREDVLEKDYYVTLILKELAEKQAQGLPAYFKGGTALYKALKTTNRFSEDIDLSVDTKE
GUT_GENOME030984_018689-80ELLRDIIVTVSERTGIDESIVEKDYYVTMILKELVQRNPDVVFKGGTSLSKAYHVIDRFSEDIDITFEEHLG
GUT_GENOME001710_0116813-75IIALAADHFGYEQSHVEKDYWVSKILRDISMSEYADKTYFKGGTSLSKAYGLIERFSEDLDLF
GUT_GENOME274569_0049813-80KALLLEVAHKANLEPHIVEKDFWVSWILGKIFSDKELNKILCFKGGTSLSKVFGLIERFSEDIDLILA
GUT_GENOME009424_0223810-77EWKEIIKTVAREQGRTELMVEKDTIQSMFLLELSKSELPFVFKGGTSLSKAYNLIDRFSEDIDLSMNR
GUT_GENOME152550_0261212-79TDILDRVSTELNIRQREAIEKDWWVTTVLRAIFSLPYAKHLSFKGGTSLSKCWHLIDRFSEDIDIAID
GUT_GENOME013746_0167914-81IIEAAKQVNLSEFIAEKDYWVTYLLKNLVKSEFANEFVFKGGTCLSKAYNLIERFSEDIDLLMIETDK
GUT_GENOME123726_017633-73DLTKLFPDVADALGIESVAIVEKDHYIVELLRLLQPLSFDTHQLVFAGGTALSKAGISLNRMSEDVDIKLV
GUT_GENOME040205_0031115-83LFRNTADKMGLNDAIVEKDFWVCFTLDYLFHRCPWKDSITFKGGTSLSKAFNLISRFSEDIDLILDWRV
GUT_GENOME226900_0352426-82ELSTGNICHERWGRYCVTMLLKPLSEKIPYIVFKGDTSLSKCHKVIKQFSEEIDITI
GUT_GENOME225930_0361325-106NGLPAFVAEKDVHVTDALRVLASLHIVHEAKLKGFDPRSKKVPNEPINIDLPVRFVFAGGTCLSKAYNLINRMSEDIDIKVI
GUT_GENOME096544_0082113-75AERFGVEMEQVRRDHLVSHVLGAIASGVPTDDIVFFGGTALSRTHLADARLSEDIDLIALAPR
GUT_GENOME100290_0065913-77AIQATSQELGMAQEFVEKDYWICQILQSLSRHPLNERIVWKGGTSLSKAYGLIRRFSSDVDFAVL
GUT_GENOME143619_0116721-81ARHFGKNSIVLEKDIWVCWVLKQIFEMPNRLSMAFKGGTSLSKIYKVIDRFSEDIDITLDY
GUT_GENOME152049_009214-71NYKKNIEKIAVRTGFIRSTLEKVERLLDILEWINNHEKLGRLLALKGGTAINTVIFNFPRLSIKTRLL
GUT_GENOME030734_0111313-72QVPWTETEQVEQDLLICRALTEIYKDPYLASHLAFRGGTALHKLFLSPQPRYSEDIDLVQ
GUT_GENOME278959_0071413-78ELIKIVSDEKHIPEDAVLMDYYIVYMLEKLSNSEYKDLCVFKGGTSLSKCYPESIERFSQDIDLTY
GUT_GENOME188454_0205412-77ELIRLASAHFKIVPAFIEKDYWITHVLKQLSNYQDANHVVFKGGTSLSKGYHLINRFSEDIDLAMM
GUT_GENOME095248_0115211-77FDQLLSVVADERGVDPVLVEKDYWIMHCLWGLQAQGFQFELKGGTSLSKGFGVIHRFSEDIDIRIEP
GUT_GENOME237448_0022711-81EITEVIEATSRKSGLASSIVEKDLWVCYILYYLFNRCDYKDYFEFKGGTSLSKAYDLIDRMSEDIDIVLNS
GUT_GENOME048404_0020313-70RKEPNPEMAEKDYLECLILDKLFSDAYICDNFVFAGGASLSKSYRMTNRIGQDIDLVC
GUT_GENOME126618_0163313-77VLLGTAAEFSMSEEFVAKDYWAMMMLAEAMKRSETLVFKGGTCLSKCYGVISRFSEDVDLGIPYE
GUT_GENOME227920_0130012-76KRLVNNTATQYGLRPDEVVKDYFMMLVLQQIVKIDPSIILKGGTSLSKGYGITNRFSEDLDLAVS
GUT_GENOME095248_0113510-81ELIEALVAEAAPGGITAGLLEKDEYLTDALRALFALQPEGMQLVFCGGTSLSKAYGLIERMSEDADLKVVIP
GUT_GENOME239314_0123310-74RQVIEGTAKELQMSRAIVEKDYYVTALLAEISKRTPDLVFKGDTSLSKCYKIIQRFSEDIDLNLS
GUT_GENOME267159_0079625-98LDVLEITSAKTHLPQLAVEKDWWVTMVLKALSATQHFELMSFKGGTSLSKGWNLINRFSEDIDIAMRREGKFSI
GUT_GENOME015476_0022812-78LQALSEELAIDPAFLEKDWYATHALQLLQETTDTTFECIFSGGTCLSKAYHLIKRFSEDLDFRVQGA
GUT_GENOME158444_0077313-76LVIQTAAFKHIPQNAVIKDYMICSILQKLSKSEYVNKCIFKGGTSLSKCYEGAIDRFSEDIDLT
GUT_GENOME047929_006706-78KRLALMQTAVKVGLPVEAVEKDLWVTAILQAVFSLPYSKMFVFKGGTSLSKVWKRIERFSEDIDLAIDRCQFK
GUT_GENOME096279_0506522-90SHPLGYPAYIIEKDFWVTQTLLAIYNHFAPSLSEKSKFPFIFKGGTSLSKCYGIINRMSEDIDLSIALD
GUT_GENOME252374_0122615-80LFRHIGEQLGITPSIVEKDFLVCRVLQILFGEESLSPYLCFRGGTSLSKAYKVIRRFSEDIDIALS
GUT_GENOME027519_0058015-80VVRTANALGLTTSFVVKDYFIFEMLRSIVGINPSVVFKGGTSLSKCHHVIDRFSEDIDLGLEVEHA
GUT_GENOME096089_012586-70NIADSYAFELGNTKDAVIKEILHYDILQSLSQSDIANDIVFQGGTSLRLCYGNNRHSEDLDFALK
GUT_GENOME138244_0128910-75EHLHDLHSQYSRKVDIHILERTVFAFGLLEALARTGLPFIFKGGTSLILLFGTLKRLSTDIDITVS
GUT_GENOME018727_004559-76DFTEAVQAVSRGLNISPALVEKDYYVTLVLKRLNEELSGLIFKGGTSLSKCHKAINRFSEDIDLTLDS
GUT_GENOME285286_0162615-87IVQNTALRTNIEDLAIEKDWWVTITLKALFSTSFSEFLLFKGGTSLSKGKWENIDLRRFSEDIDISLSRSWFT
GUT_GENOME238010_0085015-92AMLQQTEVGHPGVNQVAIEKDWWVTVTLKALFQTDCRDSLIFKGGTSLSKGFNIIERFSEDIDLAINHSFFGIEGTSK
GUT_GENOME213065_006954-71EDLRMLFAQVASTSGFPQHLIEKDYWLTRILSKVGYLSPDLVFKGGTCLNKIYFDYFRLSEDLDFIML
GUT_GENOME220728_0119217-81LKTSEATGKSVAIVEKDYWISYLLDYLFAKSSFNDMLVFKGGTSLSKGFNLINRMSEDIDLILNW
GUT_GENOME207623_060527-72ILAVAQQTSLTPHVVEKDYVLGWMLAGIYEHEELAESWIFKGGTCLKKCFFETYRFSEDLDFTLTK
GUT_GENOME103696_0171013-75VLAGAAGFLGIHDAIIEKDYYVTLILRILAKNVEEAVFKGGTSLSKCHHVISRFSEDIDIGFV
GUT_GENOME257279_0002313-78LLILNCAEKMGVPPAFVVKDYYITMLLKEVTSSNPSACFKGGTSLSKSHHIINRFSEDVDLGMERE
GUT_GENOME243070_0165020-85FDTTALRLGTASQNVEKDFWVCWTLDALFNGLKEGGPRLLFKGGTSLSKGFGLINRFSEDVDVTVF
GUT_GENOME001937_0111320-82KAAEGTTLPPQAIEKDWWVTKVLQAIRSLDYRDSVQFKGGTSLSKGWGLISRFSEDVDLSIDR
GUT_GENOME070821_001088-75ELIEEVAAEMGVNPSFVEKDWYAVQILKAIAPVKFPAPVIFTGGTSLSKGFSLIKRFSEDLDFKVSGG
GUT_GENOME130359_0178511-78FKEAAARTAEDTGFVAEAVEKDYYVSMILRGISESLPFSVFKGGTSLSKCYGVIKRFSEDIDLTTDIP
GUT_GENOME188911_0318220-84IESISKKKKVDKILIEKVIRALMLLEGLSSSGLDFVFKGGTALMLLLGTTKRLSIDIDIIVPNKS
GUT_GENOME244130_0136918-74FQRVFGVGPNQIHHDFIVSHVLDLLRMHKDELLFAGGTALARTYLKSHRFSEDIDLW
GUT_GENOME070503_0158213-74IINEISFEKKISTGIIEKDYYVTYFLKELLAIDKNFIFKGGTSLSKGFKIIERFSEDIDLNY
GUT_GENOME030105_013266-76NKENFQEMIELVSTDTGRAAAVIEKDYYVTLILRLLSEQLSNVVFKGGTSLSKGYHAINRFSEDIDITFDE
GUT_GENOME007828_0022311-73EELVVAASNELAISANVIEKDYYVTLILKAISEQMKDIVFKGGTSLTKCYQLLERFSEDIDLS
GUT_GENOME022812_0011311-80EQTILQVAQKSGIEAGIIEKDYYVTLLLRELTNALPSMIFKGGTSLSKCHKVIKRFSEDIDITLDENHLT
GUT_GENOME057382_0143417-85ASRPTTDGGLGISPLFIEKDYWVSRSLKLMAEHDKDGRAVFKGGTSLSKAYGIGARFSEDIDVAISDAW