UHGP-MC 121014


Information


Number of sequences (UHGP-50):
52
Average sequence length:
105±9 aa
Average transmembrane regions:
0.02
Low complexity (%):
2.41
Coiled coils (%):
0
Disordered domains (%):
2.38

Pfam dominant architecture:
PF00571
Pfam % dominant architecture:
385
Pfam overlap:
0.05
Pfam overlap type:
shifted

Downloads

Seeds:
MC121014.fasta
Seeds (0.60 cdhit):
MC121014_cdhit.fasta
MSA:
MC121014_msa.fasta
HMM model:
MC121014.hmm

Sequences list (filtered 60 P.I.)

Protein Range AA
GUT_GENOME108554_0081827-133SLRRLFPPLPESSELTVECRRDAYSASAIARAVEDADAHLLNLNVTSADPGPDEVSVDLRISHRNTGSVARSLERYGFRVTGIREGYDADFEKMRRRIDELMTRIEL
GUT_GENOME110381_008708-128KTERERLFPYAEDSSTLVLSCPKHDFCASQLCRAIEDCDAQVLNLNIIDATIPGSSDGPDIPRFDIDYEDTPRTYIELRISHRNTEAAARSVARYGYTVEYTRDAIGTAKADIDSRLESLM
GUT_GENOME010959_01619122-218SSGAIVELEIPREDYSATELTRLVEDNDCRVMNLMIRPDEKTGIWKACLRIDREDAALVLRSLERFDYRVVSCYQLHGVMDERIEQRIKELMYYLEM
GUT_GENOME217842_0032859-172TKDTIISTCAGLFPQLQESTELTIVCPAGHYSASSIAHAVEDADAHLLNLNVTQGTLPNSATTVEIRVNHSRGSMVARSLMRYGYETIAMRSSSADPFDETTVQRANALLHLLD
GUT_GENOME012902_01321124-218EPGALIMIEMGVNDYSLGEIARLMEENNAKILSLMLCNKTESSRITICIKINLIDASAVIQTLRRFDYNVSGSFDNVGNDDLRERFESFMKFLDI
GUT_GENOME277206_0049463-176SLLEAFGRMIAPRDDCSVITVECRASDYSASLIARAIEDADAHLVDLWSAPALHDSDSADRVRVTVRVRLENPEAAVRSLTRYGFDVIGAYPRYAHDVTADNDRLSALQVYLNV
GUT_GENOME223235_0088378-186DKLEALLPQHADGCELLVACPPEEFSASAIARAVEDCDARLLALTVTAMRTPDDHPVVLVRADTRNADAISRSLARYGYQTVDTSARLTPEQQSEAMARINELLHYLDL
GUT_GENOME024737_01638128-222GCVITLRIAPIDYSVSEIARILESNHAILLSLLSTTLEPDGYLEVVLKINLEDPSAVIRSFERFNYTVTDISAKENLTGEIFQQRINEFIHYINI
GUT_GENOME007180_01027105-219IGYYDITDIIKFFHETPFLKEMGGIIIVEKNINDYSMSQISQIVESNNGKLLGMFVSSAEADKIQITVKIALGSLNDIIQTFRRYNYEIISEHQEDNYLNALKERSDYLDKYLNI
GUT_GENOME235709_0188636-143ELRRLMPPSDESSILRVAVDRRDYIASRLAMAVEDCDAHLLNLNVTSGSDSSSRLLVELRVNHRNPHLVVRSLERYGFEVVDFAAGYSTDDSETARRRIDELIHYLEI
GUT_GENOME175935_0117317-131EVRDWTRLLIGCRRDQYAASLVARAVEDCDAHLLNLNVTSPGALTGDGAPGDYAAGDYAEYPVQVELRVSHRNAESVTRSLERYGFTVLEAESSGDSGDPGRTSSNLSNLLNYLQ
GUT_GENOME277114_01121120-220NAEAAGSVIVLEMMPQDYSLTDIARLVEANNAHVLNLLSHQDKDTGRLLITLKIDLEDASPVIRSFERFNYTVLYHFMEKGMVDDMLRQRMNELLHYMNML
GUT_GENOME178201_0003762-167QALSVLLPHSDEYSELMVRTDTRSYSASALARAVEDTEASLISMLAYPAGDGDINVYLRINRADPSHAVRSLERYGYFVAYAHGAENSQAELTAERILELQHYLNI
GUT_GENOME034525_02444207-289RTQYAVEDCDAHLLNLNVMPGADYGFDMVVDLRVDLAAPYAAARSLERYGYAVIDAHADGDDPAGGDDTLRRRVDELMRYLNI
GUT_GENOME101855_0080721-129EEAEALLPRRSDSCELLVACLPDAYSASAMARAVEDCDAQLLALSVTSMRDAAGRPVVMLRVNTRNSASIERSLARYGYETIFSRGDLSEGEQTEARARINELLRYLEM
GUT_GENOME233280_0176964-158LGRQITPRDDSSLIEVECDPEQYSASRLAHAVEDADVHLVDLLSHPTPDGHIRVTLRIRCEDAEGAAHSLRRYGYEVVDVYQQPNLDQTAAYERI
GUT_GENOME007559_0031520-129AARLEALLPQQPDTSSLLLACKPHDYSASSIAKAVEDCDVQLLALSVTAMRDSLERTVVAVTVGASSADGVVRSLERYGYEVINVVSQGDSPHRSRDIDRVNELLHYLEM
GUT_GENOME132172_0043169-164KADSLVEVVCGADVYSASRLAMAVEDADAPLLGIWCRRSAGNRVEALLRIGSPDPSAACRSIRRYGYQAMPLGGKADADLLTAQRNVDALRMILEI
GUT_GENOME238255_0012723-125DRRFPVNIDATEMVIACDPDDYSAALIARAVEDCNAHVLNLNLTGERTDKGDLIVDIRVNHRNGDHIARSLERYGFEVLDYAGTDDDDDDTARERALEILKIL
GUT_GENOME254647_0077435-174LLPQFLESPDATVEVTGSDGGTVGVVDARAMLRCIGTMLGDSREASWVEATVAASAFSASSVARAVEDADANLLDMLTSADPADGNRLRISLRVSHADPSGVVRSLERYGYEVTAASGVVNEDYDKARRHLSELELYLNM
GUT_GENOME212983_0010218-120FLAPVAETSQLVVSCAAGDYSASRIAHAVEDCDAPLLNLNVAAGDAAGVSSRVTVSLRIGHRDPERVARSLERYGYEVVSASGSDTSDDRLRRNYDELMHLLH
GUT_GENOME237421_01882115-219EAYSTYTAANQQGDVIQIVLGYNDLYISEISRIIENEDARIINLFVVPISESTQIRLIIKLNKMGLCRVIRALERHNYKVEYFYKYMVEEDLSDNYGLLMKYLSM
GUT_GENOME096499_0032627-125SEAENSLIILEMPIKDYTLTEIARIVESNNAHVISLSTLPISGGSELLVSLKLDISDLTTILRSFERFNYNVTYFFMKEGEVTDQQKERLDELMYYLDM
GUT_GENOME244290_01302118-217YPLFSENGAILTVKTTGLHYSMTEISQIIESNNAKIYGMFINAIKEDGIEITLKISNENLSSIDETFERYGYTVLNKHYDDEKDELMKDRFGFFQKFLEI
GUT_GENOME212984_0123517-125LRIAHLMPPQQEASVLTVACWPDDFSASALARAVEDSDAHLINLNVTATRLDDGRITVELRTNHRHYDSTARSLERYGFEVIAADAPAADEPDEAFRLRAAELLHIINI
GUT_GENOME100339_0228788-191KMIAARDDCSVVTVECRPEDYSASLLAHAVEDSDAHLVDMLTTPADDGNIRVTLRVRHSDPTAAVHNLERYDFHVVESHGADGNSRDAEIAVERLLSLQTLLNV
GUT_GENOME078983_0034032-135IFPEIDEWSVLTVSCAYGHYSASRIAHAVEDSNCHLLNLNVTAGLCADTAQTVVELRVSTANPESVARSLERYGYEVIAVDSSSSSAVDDVTRERIEGLMRFLR
GUT_GENOME210386_012767-106LIPRDDCSLIRLQCVPADYSASILTHACEDVGAPVVDLLTSPSDGTRIAVTMRLRCDDPSQAIHNLERYGYEVTEAEGNSYADASIAAERILELQTILNV
GUT_GENOME257457_02596115-219LSVVCNTNQQGAVILLEMYPEDYSLSELSRLVEDNNFKVVNLLTYPYENTGMLRVNLKIDREDATPLLRSLERFNYKVVCCFQQQGFIDETLRQRLDELMYYLEM
GUT_GENOME022440_0070442-137ILRLMPENPETGRLTVACAPADYSASIIARAVEDVDAHLLNLNVTADSDAARVVTDLRVSHRNVGAVARSLMRYGYEILSADAPLAPPGPDSPDDT
GUT_GENOME103260_00250143-239QPGAIIEIAVRPEEYSATEICRLVEDNNSKVVSLFSFPDGESGMLTVAIKIDREDASSVLRSLERFNYRILRCYQHQDVIDERTEHRFRELMYYLEM
GUT_GENOME112873_0153052-161GVAQALADIFTPSPGSSLVRVECKASDFSASLLGRAVEDADIALLGLWRMPSDDSGMANVLLMLDATNPSAACRSLRRFGFETYPLQGTPDADMQTALEHISALKMFLEI
GUT_GENOME152089_0191062-160RALSETFTRSDEASVLTVECAREDYSASLLAHAVEDVDAHLLNLSVMPADGGRLNVSLRIGHRDPQAAIHSLERYGFNVTEAYAGSHASPTALDERLAA
GUT_GENOME040960_0115165-168MGRMIAARDDCSLITLECGAADYSASRISRAVEDTDAHLVDMWTVPAADGRLSVTLRVRREDPTQTVHSLERYGYEVVSASGREYADSEAAAMRFLELKTFLNV
GUT_GENOME212961_0145972-172ERLFPAVEECCELSVACLRADFSASRICRAVEDADAHVLNLNVTSETLPTGEIVVDLRVSHRDAGAVSRSLERYGYIVTGVNSGYDLMADEMADRVGELMA
GUT_GENOME244437_0169512-132DYFFPPVGESSRLLVGCRREDYSASRIARAVEDCDAHLLNLNVTSMGDTADVYNNLASDNARDGKLPVVFDLRVSHRNPSCITRSLERYGYTVLDTLSDDDSSDTAVARERIDQFFRFLEI
GUT_GENOME094321_0084115-119MEVFFPPNADSSRLLIGCSYDDYIASRIARAVEDVDARLLNLNVTSLEIEGQQVVAALRIDHRNPDSAARSLERYGFTVLNSDSPAPDDDTLRDRYEQLMKYLSL
GUT_GENOME206550_0046916-122EQLFAPAADTSSLLVACDRREYSASRLARAVEDADAHLLNLNVTSLRADDVPDGTVVVALRVGRRDPQPVARSIERYGYDVLDFAGDGSDDSYDTARRNADALLRLL
GUT_GENOME217850_0020714-128FYPPLPDSSRLLVGCGVADYSASRIARAVEDCDAHLLNLNVTALSNRHDAPLADETPETPFERFPVVVDLRVSHRNPAAIGRSLERYGYTVLDWEAPAGSADDEMAINSLNHLLR
GUT_GENOME041813_005689-112FPRSPDASSLMVACRPQDYSASVIARAVEDCGAHVINLNVLASHTSHGDVQVALRIDRRNPVPAARSLRRFGYEVVEIDAPEADDPYAEQARSRANELLRYLEI
GUT_GENOME216436_0197427-129EHLFPPAEESSELTVACYRSDYSASRIAHAAEDVDAQILNLNITSAVSDHGEILVDLRINRRNPYSVARSLERYGYRLVEIRNNGATDTSPLAERIGELLVHL
GUT_GENOME113855_0170632-139LLERLLPSRTDSSVLLVACSPADYSAGAIARAVEDCDAHLLSLSVTSDTTAGGQAVVFLRVSHRDPSSAARSLERYGYEVIAMRSDSSVSSDDEARLRALEVLHYLSV
GUT_GENOME103078_0032620-128KAEALLPPLPDGAGELLMSCRPEDYSASAIARAVEDCDVQLLALTVTGMRDGEGLPVVALRVGASDTAPVERSLRRYGYTPFYLGGPVDEGMRQRARDRAREILKMLEI
GUT_GENOME244447_0051918-127AAGRYDALVPESAEASEMLVAVDIVDYSASAVAKAVEDCDVALLGLAVTGMTSPSGRGMVWLRIGARNTHGVERSLARYGYETVFTMNADNTVVSDTDRDRINELLRYLE