UHGP-MC 19899


Information


Number of sequences (UHGP-50):
62
Average sequence length:
111±12 aa
Average transmembrane regions:
0.02
Low complexity (%):
0.2
Coiled coils (%):
0
Disordered domains (%):
1.86

Pfam dominant architecture:
PF03636
Pfam % dominant architecture:
8871
Pfam overlap:
0.44
Pfam overlap type:
reduced

Downloads

Seeds:
MC19899.fasta
Seeds (0.60 cdhit):
MC19899_cdhit.fasta
MSA:
MC19899_msa.fasta
HMM model:
MC19899.hmm

Sequences list (filtered 60 P.I.)

Protein Range AA
GUT_GENOME194079_0102623-119ESLLSLGNGFLGWRGALVTNDFSDDSYPGLYAAGVFNQTTTPVAGRQVVNEDLVNLPNLQRINIIIDEQPLRVNAETVKQLKQTLDFKTGTLTDSFR
GUT_GENOME254305_004761-94MEKIELGEWSVSVSRWDRAWQPQMESLFCQGNGYLGVRGAVDEPVPGQKRDTFIAGTYDSCFGEVSELPNLPDVMGKEITIDGELLQLSGRNHE
GUT_GENOME017166_00118222-345IKSYPISETSFVEDNFCLNDVKHIETVFALGNGYMGLRGTYDEEDEIGQINGMYINGIFAVKPYSHPALFKGFAKRDQFTVNLSDWRIINLYVDGEKACFLKHNIKDHFRELDFISGQLKRSFI
GUT_GENOME096465_0260610-118ELENWLISETAFSAAGLGKCEAILCLGNGYMGLRSSTEESYFQEKRNFFINGTFNRSQKNEVTELPNLADSIQMDIRVDGERFSLEVGETNDYLRQLNLKTAELSRSFI
GUT_GENOME213695_021354-117TNVKWDAWGIECSFFDADTLGKEQALFALGNGYMGVRSAVEEEYPQPPGNTSVFRYTIVSGTFDQNPEPNQSTELPNAVDFFAMDIRLNGEPLNLNTGTYREYRRRLDLKTGLL
GUT_GENOME096494_002284-118TLTSAKLSQNELLQEESLFFLGNGYLGVRGNFEEGYQVGFTSIRGTYMNAFHDTTKITYGEKLYGFPDVQQTMVNMTDAQGIRIAIGEEHFSLFSGDVLSYERKLHMDKGYAERL
GUT_GENOME015152_019677-128RYFQVHPWKIIEEGFDPDYSQVAESIFSLGNEYTGLRGYFDEGYSGPRLQGSYLNGVYERRMLPKSGYKGMLPFTEFMVNTVDWVYTRISSGEQVLDLAKCRFSDFRRELDLRTGALTRRFV
GUT_GENOME000729_0342719-122MEETEFDARNLGKYEAIFAQGNGYIGMRNALEERYVEEVRNTFITGTFNKAGDEEVTELPNIPDVTAMDIYVNGYRLNLQSGKVMEYSRVMNLKNGETTRKIVW
GUT_GENOME171682_030365-121YSWKISEDSFCLEDNLNNESLYTLANGYMGMRGDGPEQLPPRYSKRGTFINGFYETSTIPYGESAYGFAKNMQTMLNVCDAKLVTLTLEGETFSLLKGTVRSWHRELDMKNGLEIRD
GUT_GENOME096547_0210721-136AWHETRPARTDDELGQSETLFALSNGYLGMRGNPPEGRDSHEHGTFINGVHETWNIEHAEDAYGFAREGQTIVQVPDAKAMRLYIDDEPLRLGVAEVEDYGRTVDFREGVERRSFI
GUT_GENOME245840_0124030-128ESLFTLGNGSLGFRGDFEEGYPGAREGVYLNGFYDTAPIRYGETAYGYARNGQTMLNLANAKRIRIFLDGEELCLARGEVKNYRRVLSMKEGTLTREFE
GUT_GENOME267597_0031120-109ETLFAQANGFMGIRASEPIATAQATPGVFVNGFYEKHEIIYGENAYGYAKHHETMIKLFDLRRIDIEIDGEALEVLKNQERCLDMQTGLL
GUT_GENOME254154_0077220-130ALNEETLAKYESVMCQGNGYMCLRAAAEEKASDKIGRYTLVAGTFDHFPGYACNELPNLPDTTEQRITVNETPVTAAAADKDTYLRTLDLATGLLHRTFCQKVEKDSVKVD
GUT_GENOME165621_001109-129FTVDPWQVQEEGFDPNHARVAESIFSIANEYSGVRGYFEEGYSGDHLLGSYFNGVYENNPEDHSIIYAGISRQGHFMVNATDWLFTRIKVDDEQLDLATSDVDRFSRVLDFRNGVLERQFV
GUT_GENOME213368_0010044-153FQVDPWKVIEQGFDPAYARVSESIFSLGNESIGVRGCFDEGGHVDSLRGAYTNGIYDMEKLSRSYKGIIDKTHFMIPSAEWLMTTISVDGETLDLGQSQFSDFTRELNLH
GUT_GENOME254097_012348-140LTDPWLITEESYDLNVEANIATLFTTGNGYMGLRGSLEEFGSVRIQGAYIRGVIDRIIEIPAAFADNVYMKKYYFNEEGLRHFEKQDCIVNFADPLAVRIRIDGETFYPWEGELLSWRRTLDTRHGVLQREVR
GUT_GENOME066223_017187-113LPGWTVERRAFDPAHTAKTESIYAQGNGYINVRCAEDERYVGQTRGTFATLTFNKALPDEVTELPNLPDVTAIDLTVNGERFSLDRGTYRDYLSRFDIRTGEVERSC
GUT_GENOME019475_0391718-122SWMILEDHYDPEENLKYESLFCLSNGYLGTRGAYEERAAKSIPCTYINGVFDKSETFMRELANLPDWLGIKLYVEKELIGIETCRVLEFSRALDMKHALLAKRFV
GUT_GENOME028853_005531-128MEKNTYPVEEWKVTEEKFVKEWNYRNETTFVLSNGYIGTRGTFDEGYPFSVDEGLEGNFINGFYESEHIMYGEWNFGFPEKSQSLLNLPNLKKTTIEINGEMFDLRTGEIVEYSRSLLMNEGTVVRNV
GUT_GENOME276352_0045711-122VDPWKITETGFNPERGRVSESIFSLGNEYMGTRGYFDEVYSGDSLKGSYFNGVWEEKPITYFEHFKGLSERCCFMINANNWIYTRIFANGEELDLAKCKVEDFYRELDMKHG
GUT_GENOME000496_0051616-129KYYEGAWAQGNGYIQMRASFEEDLSGASQDEHYWRLPANVTLEEARNPVSKWGVYVPGIYGNHPILGEEIVNLPYPMGIQLYQDEERFDMGLSDYSDFAESLNLKNGVLTRHFV
GUT_GENOME096381_011054-124QSAYAVEPWTLRESELNLDVLPQSESVFALSNGHIGWRGNLDEGEPHGLPGSYLNGVHELHPLPYAEAGYGYPESGQTVINVTNGKVMRLLVNDEPFDVRYGKLRSHERVLDLRTGLLRRV
GUT_GENOME117767_0095416-124EQELLINETLFHNANGYFGVRGNFEERYPADRPTVRGQYVNGFYNFSDMKQPEKHYGFPEQKQTMVNTFDTQTMVLTLEGERVDLFTGKVLEFCRILDMKKGTTVRRFV
GUT_GENOME096230_038576-118HSWVLSCEEFCSETKLHTAPIYTIGNGLFCCRGFMEEEQEGIAGLGGIYMAGVFGHADYRPWRGEGRELVNIPNFFALRIFINEKPITITEETVTSFNMELDMQNGIFKRSYV
GUT_GENOME096467_0095735-117ETVFTIGNGAQCVRGSFEEGLAGEDPATFLHGVWDDEPVGMSELANLPRWIGVDVWVDGERVSLRRGVVEDYHRRLDLRTGLL
GUT_GENOME101188_001868-152LTEEKYDMAKERHHATLFATGNGYMGVRGSFEEYSTLGVQGLYVRGILDEITEICTPVPDNYYMKKYYFNEERLKDFEKTECGVNLADILTVRIVIGGELFQMHEGTVLSYRRELDFRTGTLVRRVRWKNASGDITDLRFERFAS
GUT_GENOME183568_0045315-132PWTISEPSFTQNNNKRNETIFTVGNGYIGMRGFAEETTPGLADYSYPGIFINGFYETAPIRYGEIAYGYAKNHQTILNVPNTLKIAILADGEPVNLCSGRIDHYFRRLHLKEGYMERG
GUT_GENOME189606_0096613-112LTNWLVEESSFDARYLGKCEAIFCQGNGYLGVRSALEEEYVGETRNLFVTGTFNKFDENEVTELPNIPDMTKIVLMIDGRRFSMEKGVLHDYKRTLNVKT
GUT_GENOME219234_0470214-122DEESCVLHETVFHNGNGYIGVRSNFEEGYPEEQKTIRGSYINGFYDFMKMPQAEKLCGFVEEKQTMVNVADTQTIRLKLGSEEFSLFEGTVLRFCRRLDMEKGVTERSM
GUT_GENOME211654_000572-120KKYLKTDEWNIIEDSFQADRMRWSESIFSLGNGRFGQRGHFEEPYSSDSYRGAFVAGITFLDKTRVGWWKNGFPQFYTRIPNAPDWSSISLRLIDEELDLAQWDVDSFNRRLDMKAGIS
GUT_GENOME142364_0364611-128LIEQGFSKDTIKSNETLFTLANGHLGVRGNIEESDFNNEYIDNMGTYVNGFYEESPITYGESAYGYAKQNQTICKLPNAHIVNFSIEGDCFDLSKGITSEHTRILDLNEGILKRSFIW
GUT_GENOME176144_0247218-123IEETSFNEKYTGKCESVFTQGNGYLGLRNSLEEQYVNTVRGMFITGTFNKASKDEVTELPNVSDIVNMEIELNGERFSMNENNIKEYSRTLNLYTGETCRNVLWKG
GUT_GENOME158414_0067027-117LYKYESVMCIGNGYMCMRASTEEQYPQQLRSTLIAGTFDRFGDSPTELPFCADVSPVDVFVDGKKLDLRKGVLSNYSVSVDLKNGLLSRSF
GUT_GENOME073019_002739-128FKEHPWQIIEEGFDPAYSRVAESIFSLANEYMGVRGYMEEGYSGESLTGSYFNGIYEQKKQEGPHYVGITDVTEFMVNSVNWLDTGLEADGERLDLGRSRVRSFCRWLNMKTGLLTRQFI
GUT_GENOME085913_00467232-344LVEKNFDKKEINHLESLFALGNGYLGMRGTYDEPTAGEVCGMYINGVFATKQYNHLVKFKGYATKNEFTVNLPDWRIFTVTVDGESAAFSDNNISEHERRLEFDTGRLTRNFV
GUT_GENOME000211_0285119-115WVIEESRFSRRELKKIEAIFAQGNGYLGQRAALEEVYSTETRGLYLAGLYDRFGEEEVTELPNLADFCNIRIFLDDEEFRMVQGKTVLYSRRLNLKD
GUT_GENOME188391_037697-127KYFQVDPWKIVEDGFDKNYSKVAESVYSLGNEYMGIRGYFEEGYSGDTLIGSYFNGVYESEKLEGSSYKGIVDQTDFMVNSVDWIYTRILVDGEYLDLNQSKIRNYQRILDMRTGILTRNL
GUT_GENOME096287_02147277-393IDPWRMVERRYSPDAVPQTETLFALANGYLGIRGSFEEGEPSSRPATLLNGFHETWPIVYPETAHGFATTGQTILPVPDGTVVRLLVDDEPVTCATHEVTEFERVLDMRRAALVRTV
GUT_GENOME096244_017711-110MSSLTVLKEPIFSPHTLNKYASLMAAGNGYLGLRACHEEDYTEQTRGMYLAGFYHRASPNEASELVNLPDVLQMRIEMDGEIFTLLSGEILSYQRELCFATGELRRSVLW
GUT_GENOME143497_0018014-108WILGEDEFCLRHQGKGESIFCLANGYMGIRSAHEEPYPFQTRGLFVSGCFNSSNNETVELPNGADTCEMLITLDGELFSMTTGKVTGYSRKLNLY
GUT_GENOME096494_0022118-114WVVTESAYHPKLVGKAEAIFCLGNGYMGQRAAAEERNLAETRNLFVAGTFNKFADNEVTELPNAADVLWMDIVLDGQDFNLEQGTIVSYERSLNLKE
GUT_GENOME000001_012322-126KRIFNKIDEWKIIQDKIEFEENRLAESIMSLGNGYMGMRGNYEEMYSKDSHRGSYIAGVWFPDKTRVGWWKNGYPEYFGKVPNSINYIGIDVLVNGKYLDLGKCEVENFYRELDMKNGILTRTFI
GUT_GENOME076741_0263417-117ESLLFNGNGYIGLRGNLEEDYYDHFSTNRETYINGFYETKELHYPEKMYGFTPTGETMISVIDGQTTLITIGGEQFSIMEGTISDSERYLDMEKGITVRNL
GUT_GENOME008325_0211114-128TKHYEGLFTQGSGYLHIRGSYEEGIMAAPQNEEYMRMPANVTIEKPRHPRSKFGTYIPGITGRHPLLNEEMVNLPYPFLISVYVDGEQFDVDESNVIKHERILDMRDGVLHRNFV
GUT_GENOME034932_01627122-223HTKRQYGQESMQTVGNGFLGLRGTYLEAKANVDNYPATYVAGVFNQLATPVNNRNVINEDLVNLPNAQYLSFKVDDGDFFKIDQKNIQESLRSLDLKTGTLT
GUT_GENOME022451_009167-133LTENSFDMSAEKKYGTLFTTSNGYMGIRGSLEENGTIGVQGGFVRGLIDEIQFCQNISIDSEYMRKFYINEDAAKDAQVQEGIINFADILFFQISIDEETFYPWTGKILSWKRTLDMKNNLLFRSVK
GUT_GENOME286681_008936-107ISENKFNADKLQKFESIACQGNGYIGVRNSLEEKYVHTHRNTFINGVFDAPHGEVTELAALPDVTNFELYINSDRFDMLTGTVNKYERSLNMKNGESVRRIT
GUT_GENOME096505_0442516-136REWSLGEDAFEDENNQRSESVFALGNGYIGMRGNFEEGYHGTAGTSVAGNYLNGFYDSEPIVYPEGAFGLPARNQSMLNVTDARIIELSIEGHTFRLDSGIVHRYRRWLDMKSGILHREVE
GUT_GENOME216297_021942-112KILKITPWTIGCATDSEDTLDFRESVFSQGNGYMGVRGYAPEGEKKHGFERSTFLSGFFEYIKPGTTDMVNQPDFSVFRIALNGSDVSAYTRTGYEETLNLKDGTFTRRYI