UHGP-MC 22906


Information


Number of sequences (UHGP-50): 196
Average sequence length: 80±10 aa
Average transmembrane regions: 0.09
Low complexity (%): 5.07
Coiled coils (%): 30.39
Disordered domains (%): 8.89

Pfam dominant architecture: PF07669
Pfam % dominant architecture: 102
Pfam overlap: 0.12
Pfam overlap type: shifted

Downloads

Seeds: MC22906.fasta
Seeds (0.60 cdhit): MC22906_cdhit.fasta
MSA: MC22906_msa.fasta
HMM model: MC22906.hmm

Sequences list (filtered at 60% identity)

Protein Range AA
GUT_GENOME142590_00994 1083-1165 REFESKLEANLQHQERLAAGEGTPREKAAALKEAERLRKVLVELSEYEHDVLYPLASQQVAIDLDDGVKVNYPKFYPALKKIP
GUT_GENOME122315_01186 1088-1185 MHRYTPDLLARLRTEYVLPQQDRYRTQIDVIDDAMATADRREAADLCKRRKKLADQLAETVAYEEKVHHLADQMIEIDLDDGVKRNYALFQDVLAKIK
GUT_GENOME243234_00383 1079-1145 LENRINDSEGAEKIRLEKQRAIIKEKSTELLSYEEKIHSLADQMIDIDLDDGVKDNYEIFKDVLAKI
GUT_GENOME018532_00400 1164-1253 MHRMNAYTAERVRSKYLLPYIEHLEAEIDKLDARRAELSTKETKQLQALQKQLDECREYHERLQVVAEQAISFDLDDGVVVNYAKFGDIL
GUT_GENOME139760_00095 1147-1221 HIEFLRGRIAAESARGAELTAKERSRLKKMQQAMEECLEYDGRLHVVADRMQGIDLDDGVAVNYAKFGDVLAKLK
GUT_GENOME192169_02157 1112-1183 KYESEINRLDNDINSEILSTKEKTAAKKRKEKIQKQLAECKQYDQVIAHVAHQRIAIDLDDGVKVNYQKFQG
GUT_GENOME180343_00811 1099-1176 KLEAEKQNLESVLSTALDKSDANTKFKRIEEIKKIIADLNSYDSDIMYPLAQKNIQIDLDDGVKCNYIKFGNALEKIT
GUT_GENOME022791_00018 1110-1179 RLKNDMEDLNGRINSAEGRDKIRLEKERQKLAASYNEAIEYGQVLDHMANKYIAIDLDDGVKVNYAKFQG
GUT_GENOME095452_01320 1104-1168 EVKDLQAQLDSAARSQQAALRKRLKWVETALADVTAYIDDSLYPLSIRKVSIDLDDGVRVNYNKV
GUT_GENOME207903_01627 1093-1179 EVKRLDILIESDLSAREKAAAMKKKDSINKQILECLAYDQVIAHVANQKIDIDLDDGVAVNYDKFQDVQIPQGDGKKPLKANLLAKI
GUT_GENOME001416_00131 1133-1201 DNRIEWREQQIAASDNQKDINKWSKDMTKIQKQLKELRDYDEKLGHLALNRIELDLDDGVIVNYDKLQR
GUT_GENOME116569_00497 1142-1226 EKELKEIADKLNGDVDLTSKRELSKRQADISAKLQETNEYDEKISHIADQKIKIDLDDGVVVNYAKFSVKNPKTGKDESILAKIK
GUT_GENOME142490_00784 1082-1159 KYQNEIEMIDTRLANPSLSATDRRNLEKDKISYQKKIEELQEFDKHLAVYANEPIEIDLDDGVKVNYAKFDKVLAKIK
GUT_GENOME234999_01965 1155-1233 EVNRMQDVIDHSHSAHEVSVAEKRLEKMKKQIKECQDYDAKLGHLALDQIHLDLDDGVKINYRKAQTGRDGKFYEVLAD
GUT_GENOME033940_00809 1149-1234 TGYVYKQQELYKTRIETYKKRVDTAATQADKNKYKKELKRLQDQFDEIHAYEEKIHHYADLRQEIDLDDGVKHNYALFADVLAKIK
GUT_GENOME285390_00029 1147-1220 RLELLEREEAGAGSTSSRNALKKQRELLLKKQDELRHFDDLLRHYADQNIQLDLDDGVKVNYAKLADLLAESKT
GUT_GENOME211950_00354 1084-1180 RYTPDLIARMRTQYIHEQQARYRNQIEMLERQIDGDVSTSERVRLNKQLKKFKEQDEELRKYEEKIHHWADRMEPMDLDDGVKANYAKFQELLAKIK
GUT_GENOME170225_01307 1158-1246 TIVARVRTDYLHKTQKAIEQNLAHCDNIIANSSNKSEVSKATKDKTKYIKQLDEIKVYDEALRHVATQNIEIDLDDGVKVNYAKFQNVE
GUT_GENOME154959_00014 1153-1238 EKYENEVRSIDTMIQGMTDQRQISAEEKRREKLVKQIAEIKEYDEKLDHLAQERIEIDLDDGVKVNYEKIQTDRDGKKYQILAPIK
GUT_GENOME110332_00296 1131-1209 TTLSRIRKDYLHEFQNKLDNAIQRAENEGNVKLTSLYSKYQTECLEFDRKIKDLADEQIELDLDDGVKVNYAKFQGLLE
GUT_GENOME232102_01625 1117-1193 KVEKEIEITERASDSAQGAERNRLQKQGEQLRKALVELRDYESEVLYPLASRALPLDLDDGVLVNYLKMGPALAKIG
GUT_GENOME233179_00923 328-395 DEAKSASERVRANKDAVKYAKAKKELDEYERNVLYPLSTERIEIDLDDGVKVNYRKFASALAKVKALE
GUT_GENOME096493_01302 1151-1222 EREIHKLQLVLTSKEYTSKDKTLAKKKISKIVKQMEECRKYDEVATYLANKKIVIDLDDGVRVNYDKFQGIE
GUT_GENOME225305_00816 1114-1202 MHRMDKYTVQKIQRNYLYPHQEHIKREIEKLSENESNLSKQEIKRLEQLRNWEIECRDYNEVLKGLANQQIEFDLDDGVSVNYAKFAGA
GUT_GENOME029001_00530 221-294 LFRTERKTIEDKLSSPLVDDAMKRRLSKELENIIACQDELTLYGQALSHMADMNISIDLDDGVKVNYQKFQNVE
GUT_GENOME176354_00241 1118-1207 PKARLDYLHRVQTTYEKLLSDVNYRLTTELSMADKKETKNKQIDLMAKLQEIKEYDEKIAHIANQRISIDLDDGVKVNYEKFKDILAKIK
GUT_GENOME073454_02566 1110-1211 NADTTGNMRVEYLHRMQRVYEKEIIRMQEIIDNSRDNKEISNATKRKEKLQKQIKETKDYDAKVAHLALSRIDIDLDDGVKVNYEKVQTASDGKKMQILAKI
GUT_GENOME206259_01050 1133-1208 SPLLRTYAQQIVQENNVISASSDARTVSLAQKRLDKLKLQQAECEKYEKNIAHLAGLRIALDLDDGVKVNYAKAQT
GUT_GENOME025126_00504 1054-1151 IHRYTPDLLARVRTDYVHEQQERYRSRIAELERQRSDAARGQMGAIDRELRGLRAKLEETNGFEERLHHLADQMVEIDLDDGVKENYEKLRDVLADIR
GUT_GENOME262100_00447 1066-1136 ESALKNVDYTIANSTSAVDKAAATKKREKYIKQLNETRTYFQALSHVALQRIEIDLDDGVKTNYAKFQGIE
GUT_GENOME024772_01283 222-307 SRLRRNYLLDYKNKLDLKLAQLEKANESGKGASYKKIERIRKQTVDMQNYDASCLYQFAQDDISIDLDDGVKVNYPKFGTALAKIT
GUT_GENOME145807_04317 1120-1217 RYNEGTLSRMRTEYVTPLLGKYDAYAEQLEKQIETADSTSEANRFKKELDALIKKQVELREFDDKLKHYADMRISLDLDDGVKVNYGKFGDLLADVKA
GUT_GENOME097942_01524 1099-1173 RYKQELSRLEELTKEASGNERIQLQKRVDRLKKKIIESQQFEEKVQHIADSYVEIDLDDGVKENYKIFKDVLAKR
GUT_GENOME241315_01927 1089-1159 IEHLKNRILYLKGQPTTATVTRELTQHEAALKECLDYEARLHNIANQQIAFDLDDGVKVNYAKFGDVLAAI
GUT_GENOME006255_01448 1331-1419 RIRTDYLHEQQSRYRTAISDLQNRIDNAASTGVRVKLQKQLTAVQAQMEEARLYEEKIHHLADQMLEIDLDDGVKHNYAIFKDVLAKIK
GUT_GENOME058783_00930 1151-1212 QTSESLSAAETRQIAKWQTQLDECREYHDRLHQYSQKNISFDLDDGVVKNYALFGDVVAKLK
GUT_GENOME147137_03464 1100-1179 TSHKNHLEAVSISASSSQGEKTKALKEIEKITKMIAEMEDYEREVLYPLATAQVEINLDDGVKVNYPKLGAALKKIVGLD
GUT_GENOME178483_00421 1132-1201 ARVSADKVAQARKDKEKFMKQLEECRAYFTKISHLAQDYIKIDLDDGVKANYEKVQIDRNGEKVEILAKI
GUT_GENOME254880_00017 1135-1208 IEYLGTRIAEMESRAATLSTKERKDMTKLQKDLEECREYHDRLHLIADKQIDFDLDDGVVVNYAKFEDVVVKLK
GUT_GENOME175228_01025 1158-1260 RVRTDYLHRAQKYVETAMQSAQYTIDNASSASEKSKATKAVTKYTKQLAEMKIYDEAIAHIASKRIEIDLDDGVKVNYEKFQGVEVAQEGKKALKVDLLAKIK
GUT_GENOME202925_00958 1089-1172 SRMRNRYLHRMEKIYQQRIEECRNTIAADPSSPAAVKAKKEQEKFTAQLLECQAYDEKLGHLALSYITLDLDDGVKVNYEKAQT
GUT_GENOME212223_02478 1102-1195 LHRYDGDTMAMIRTSYLHTLQAAYEKRVTTLDTFIASETNTRQKNQLIKQRDHTRKQLEELVKYDAQLQHVANMHIAIDLDDGVVVNHQKVQAD
GUT_GENOME096497_00948 1104-1195 TTLSRIRTDYLHEVQTRLEAEKKELLDIIEGDYTTREINNAKRELKSLNKKIDELKVYDERLHHMADRQIEIDLDDGVKHNYELFIGLLAKM
GUT_GENOME088253_01173 1131-1204 IEVDQNLLDQETTASAKSKYRKAIATLNKQMNEIMKYDQILDHLSQSPVDLDLDDGVLVNHDKLQQGEKLLSKL
GUT_GENOME038070_00527 1121-1207 EILRLQAVAESTTLSAREKTQAKKNIDKINKKIEECKLYDQVVAHVANERISIDLDDGVKHNYGLFQGIKVPTDSGKEVKMDLLAKI
GUT_GENOME155973_00002 1112-1185 ISLDEQQMNHATSARDKNQYRKDTEKLRKMLDEIEKYDLKLQHQALAEIDIDLDDGVLANHAKVQGNEKLFSKI
GUT_GENOME161159_01219 1129-1208 VEQQLREAEEETLRDDLTQAQRNRALKLVNELKEKVKEVKEFEQELVEMASHRLTIDLDDGVKANYPKFYPLVEPIKGLE
GUT_GENOME164556_01420 1076-1149 EYIEKINAKIGQLDQSDRAADNRQADKYRAAVHELTDWERTVIYPLTNERVDIDLDDGVKVNYNKFPHALVKVT
GUT_GENOME183774_01251 649-734 VHRFDKNILSKIRTDYIRKLQSKLYEKKNALCLEKGNTAQKLSILHNQLDELREYDKKLHHMENMQVEIHLDSGIRANYEKFKLIL
GUT_GENOME109755_00488 1121-1207 MHRMNAYTAERIRTKYLLSHIEWLVQQQTEMEANAANLNARERKRLDSITRQIAECREYHDRLHTVADEQIAFDLDDGVVVNYAKFG
GUT_GENOME089625_01165 1182-1244 DDDISAKEKKNIEKQLKELDTLLKELREYANEVKHIAEQKISLDLDDGVNVNYEKLGAILKKR
GUT_GENOME038164_01252 1111-1183 LENMDKDIAATDSSAKQRQLNKEKEKLVKQYAELNKFDDALNHAINERITMDLDDGVKVNYAKFVPLVAESKK
GUT_GENOME170487_02400 837-910 KLQEQVERIDQTLVSAAITPADKARLTKDKKRLQDQLTEMQPYEEAMAHMAYQRIALDFDDGVKVNYDNFQHVT