UHGP-MC 101740


Information


Number of sequences (UHGP-50):
66
Average sequence length:
119±13 aa
Average transmembrane regions:
0.02
Low complexity (%):
1.41
Coiled coils (%):
0
Disordered domains (%):
1.95

Pfam dominant architecture:
PF00206
Pfam % dominant architecture:
5000
Pfam overlap:
0.4
Pfam overlap type:
reduced

Downloads

Seeds:
MC101740.fasta
Seeds (0.60 cdhit):
MC101740_cdhit.fasta
MSA:
MC101740_msa.fasta
HMM model:
MC101740.hmm

Sequences list (filtered 60 P.I.)

Protein Range AA
GUT_GENOME128505_011931-130MAKLWGGRFTQSTDSFTDHFHSSISFDSRMYKEDITGSMAHAAMLGKQGIIPTDDAALIIKTLKEILADIEAGKVTFDEKAEDIHMNVETILISRIGDVGKRLHTGRSRNDQVALDIRMYTKKEIIAIKE
GUT_GENOME171648_045056-113VRLWGARFRTSPSPELTALSRSESSYFRLVPYDLTSSLSHARELVRAGILTEAETETVFAAIKGIGADYAAGAIEPSAADEDIHTFLERVLGERIGALGGKLRAGRSR
GUT_GENOME147522_034651-109MALWGGRFSQAADMRFKQFNDSLRFDYRLAEQDIVGSIAWSKALRSVGVLSEMEQQRLELALNEIKLAVMEDPEQILRSDAEDIHSWVEQQLIAKVGDLGKKLHTGRSR
GUT_GENOME194696_014002-137KLWKGRFQKEADPKTNDFNSSISIDSRMYKEDIEGSIAHATMLGAAGIIDKGESEKICAELEKIEKDIETGALNIDPDAEDIHTFIEGELTARIGDAGKRLHTARSRNDQVALDVRLTLRKECAGLMEQLKELINV
GUT_GENOME186146_007802-132KLWAGRFQKETDTLVNDFNSSIGFDARLYRQDIQGSIAHAQMLGRQGIIEAHEAEKIVEGLKAILADIEDDKVEFSLDNEDIHMNIEAMLTQRIGDAGKRLHTGRSRNDQVAVDTRLYVKEEIPVIIGQVL
GUT_GENOME276596_0054219-147WSGRFSEPVDAFVLRYTASVDFDKRMALVDIDGSVAHATMLEKVGVISAKDLEDIKRGMAQIREEIQSGKFEWQLELEDVHLNIEARLTALIGDAGKRLHTGRSRNDQVATDIRLYLRGEVDEIIKRVE
GUT_GENOME237438_014581-153MAKLWQKNYSIDLLMQHFTVRNDYILDQELLLSDCTASIAHARMLSSIGVLSDDELGKLTEGLRAIIEKKLTTGFEIREEDEDCHTAIEGYLSSVYGETGKRIHTGRSRNDQVQTALRLYMRESSVRIALETLSFAKELLAFSDKYKNVPMPG
GUT_GENOME243328_0013526-131VILDREFFLHDIAASAAHAQGLQHIGILSADELAGLLRELDILAQDFREGRFVLDTQYEDGHSAIEARLTERLGDAGRKIHTGRSRNDQILVATRLWLKEKLQRVA
GUT_GENOME055649_011011-136MEKLWAGRAHGALDKAADDFNSSISVDSRMYRQDIRGSMAHAAMLAKQGIISAAEGKALIEGLGGILADIDEGKLAIDPAAEDIHSFIEAVLTERLGDTGRKLHTARSRNDQVAVDTRLYLRDAAAETIELIAKLI
GUT_GENOME141741_02385409-509LWGGRFTEKAAHWVDAFGASIGFDQQMAKEDLEGSLAHVKMLGKTGIIPQADADTITAGLHHLQKELAAGKLHFTVENEDIHLNMEALLTAEIGPVAGKLH
GUT_GENOME236236_0124329-143LDLALAEVDCLGTAAHVTMLTRMNFQPTLFTPAQRNAVVAALGDIIAEARRGDFTITAADQDVHLAVERRLTERLGDLGKKVHTCRSRNDQVAVDIRLHIRNELNALQGEVADLA
GUT_GENOME096285_005821-126MANLWSGRFSKSMADFTQDYNESLSFDHVLYRYSILGSLAHVTMLYKQDIIDKKTYEEISRGLKIVRERLDKNEIELDIANEDIMMLIEAQLIEEIGEAGKALHTARSRNDQNALDETLFLRQSVV
GUT_GENOME055860_014711-133MPQLWQGRASKAVDSRVNDFNSSIRFDARMIEQDIQGSLVHSAMLGRQGIISRQDVDDIHKGLHSILDDLHSGALEIDPNAEDVHTFVEQTLTARVGDAGKRLHTGRSRNDQVALDIRLNLRAASEHIQGQIK
GUT_GENOME037258_0036623-148NTIAEDHLTFPYQLLDHLAHVLMLKEQGIISQEDAAQILGAVLKLRREGPDIVPHKPGLTDLYSNVEEWVIDQVGIQVGGKLHTGRSRNDMNPAVERMYIRRKLLEQMEAITFLMETLLVQAEVHK
GUT_GENOME049011_0024528-160MWVMWKGRFSKPTADLVQRYGESVSYDWRLFRQDIAGSIAHARAQLKAGLLSQEEFNAIESGLKDILKDIEAGNFSWSRELEDVHMNIESELTRRIGAPGAKLHTARSRNDQVATDTRLYCRAEIDEILGKVR
GUT_GENOME007070_000526-130WGGRFETQPEEWVDDFNASIDFDKNLIKQDVQGSIAHATMLAKQHIITDDEAQSIINELKNIQSDFEEGKLKFKASLEDIHLNIEHELIQRIGEAGGKLHTGRSRNDQVATDMHLYTKEQVQYII
GUT_GENOME214069_005912-136KYWQGKFNDGLDEAIGQFISTLPFDKRLYKHDIMGSIAHCTMLGEQGIIEEEETKIILKALTQIFYDITTGKLQPENTRDVYEFLDAQLYERVGALAEKLNVARTVCDRSTLNVRMYVKELCSDLNEELKKLIET
GUT_GENOME018858_0155763-193LWGGRFSAAPSGALERFGASLPFDKRMWRQDIRGSKAHAAMLAHQGVISQEDAAAIRAGLDEIAGEIERGEFTFDVDRDEDIHMAIERVLTERIGPAGGRLHTGRSRNDQTAVDTHMIARDLADEALAAIA
GUT_GENOME232847_003505-138QIRNGRLEGNRSEDCEIYLASMDADREIAECDIRVDMAHVLMLVQQNLIDKDAAKKILVALKGYMENGLPENAFDPAREDIHAGIEAQLMEDVGADAGGRMHLGRSRNDEVATCLRMRTRQYILDALSGIFELR
GUT_GENOME044099_00406136-265MAQKLWEKNVKVDASIDLFTVGRDREMDLYLARYDILGSIAHITMLESIGLLTREELDKLTQELRNIYAIAEKGDFLIEDGVEDVHSQVELMLTRSLGDIGKKIHSGRSRNDQVLVDLKLFIRSELQSVV
GUT_GENOME276851_0151213-134LWGGRFKSGPSPELARLSKSTQFDWRLADDDIAGSRAHARALGRAGLLTADELQRMEDALDELQRRVDSGAFAPIEDDEDEATALERGLLQIAGDELGGKLRAGRSRNDQIAALIRMWLRRH
GUT_GENOME023460_004951-129MMKLWKGKYFDEEDLIADEFCASIGFDQRLWKEDILCGIAHVKKLAAEGKLTEDEYDSIVVHLQEILWDAEAGRLHFTAEDNNIQTAVQRLLTRRIGDIAWKLQLDRDPALHSARTLRLYQKEAIQSIY
GUT_GENOME255396_0005412-109WGGRFSKGPSELMLRFSESVSFDRLLAEFDIQGSVAQAKMLAHTGIISKRESAQIVAGLSKILKKIRAGEFAWDESLEDVHMNIEQALTRDVPAAAKL
GUT_GENOME201908_016232-136NLRSGRLKTEMTDEAAEYTSSLEFDKIIFEADIKTNFAHTLMLKEENIIDDEIADNILGALDQLKEDGYNELVFDPSVEDIHMAIENYVTDKIGPTAGFMHTAKSRNDQVATDVRLVLRDKITETQIGILEFMEG
GUT_GENOME055249_003442-107KLWEGRFAAPTAADADAFNESLSFDKKLYKADITASLAHSEMLGRCGILSDEDRLAIQSGLKAILADIEAGTLEISGQEDIHSFVENELVSRIGDAGKRLQPQRSG
GUT_GENOME166767_014213-99KLWGGRFTKGTDKAVEDFTSSIAFDARMYAEDIAGSKAHATMLAKQNIISQEDCDAIVAGLTKIKGQIDKGEFPFSVALEDIHMNIEKRLTDDIGEG
GUT_GENOME018459_016201-135MAQLWGGRFTKETDKLVYNFNASISFDQKFYRQDIRGSIAHVTMLASSGILTDEERDQIIAGLKGILNDVENGSLQITSEYEDIHSFVEANLIDRIGDVGKKLHTGRSRNDQVALDMKLYVRDEIIELKELIYKF
GUT_GENOME045773_008371-106MWKGRFAQDTDEAVINFTQSLDLDWRMAAADIRGSIAHVRMLAHTGLLDAKEAETIEKNLREIAEEIKSGDFTPKVSLEDVHMNIESRLIEKCGATGARLHMGRSR
GUT_GENOME019305_0024821-131AVSIGKKLYKHDVMGSIAHVTALGELGLITSGDAEVLQRALTRIFYDVTSDKIAIPAEEDLFDFLDNELKTRVGELGEKVNVARTRDDRTALDVRMYVRDAAGEIAEYVKT
GUT_GENOME179220_001631-120MAKLWGGRFELESSALLDVFNASITFDQKLWRYDILGSKTHARMLGRIGVLDSSEVALIESGLEKIASEIEQGSFRFSLEQEDIHMAIESALIEHIGDVGKKLHTARSRNDQVALDCRMY
GUT_GENOME103718_0191526-137RFSGGPAREFLSSLAADAAIFEADLAVDRAHTVMLAEQGIVDSAVAGDILAALDDVEAAGHDALPDGEDVHEAIETAVIDRVGPDGGRMHTARSRNDEVATCIRYRLREDLL
GUT_GENOME236868_0053510-115MWDGRFDCGMAQSMIDLSFSLDFDKELLEEDIQGSLGHGKGLVESGVLSKADYAKISKGLKSILADIRAGKNLWLPTDEDVHMAVERVLTERIGDLGKKIHTGRSR
GUT_GENOME099801_00643404-530QMWGGRFAKSTDEMINEFQASINFDKRMYHEDIAGSIAHATMLCKVGILTEEDKNNIISGLKNILAKIEQGDFNFSVDLEDIHMNIEKRLTEAIGEAGGRLHTARSRNDQVALDTHMYVRRETVEVI