UHGP-MC 28134


Information


Number of sequences (UHGP-50): 86
Average sequence length: 115±10 aa
Average transmembrane regions: 0.01
Low complexity (%): 1.16
Coiled coils (%): 0.18
Disordered domains (%): 0.95

Pfam dominant architecture: PF02498
Pfam % dominant architecture: 3023
Pfam overlap: 0.69
Pfam overlap type: extended

Downloads

Seeds: MC28134.fasta
Seeds (0.60 cdhit): MC28134_cdhit.fasta
MSA: MC28134_msa.fasta
HMM model: MC28134.hmm

Sequences list (filtered 60 P.I.)

Protein  Range  AA
GUT_GENOME198746_01691  148-273  IKELFEQFEAIANEYEGVECWSARELAQVLGYSKWERFEGVIERAKDACINAGEMVEYHFPGVGKMIPLAKGAQREVKDYMLTRYACYLIAQNGDPRKPQISFAQNYFAVQTRRAELVQKRLLEYE
GUT_GENOME113115_01639  10-146  KKFEEIKHIDNDGTEFWYARELSLCLEYAQWRNFAKVIDRAMLACQNSGFSINDHFVEVGKSVEMPTKPRKNQTEIGFAEVGKTKTKTIPDYKLTRYACYLIVQNGDPRKEVIALGQTYFAIQTRRQEVQEYFNNLD
GUT_GENOME213231_00960  6-135  VGQIKEQFDLVIHSDQEAHIEFWYARDLMPLLGYERWENFDKAIGRAIESCETSGIEVSDHFREVTKMITAGKGAQRSVKDYMLTRYACYLIAQNGDPKKEEIAFAQSYFAVQTRKQELIEERISLIERT
GUT_GENOME204014_01419  136-241  NMPSFWTTDNMFVYNDVVDAAGNVVVKAKEACRNAGEDVQNHFPDVGKMVSIGYGVEKQIDDILLTRYACYLIAQNGDSRKPQVAFAQTYFAVQTRKAEVIEQRLL
GUT_GENOME076783_03678  39-158  RPTFEEIRKVDGNGKEYWSSRDLCNAMGYSGYWKFQNVIDKAIKVASEKGMNIDEHFNHAVDMFKVGNGAFRKVEAFHLSRMACMIISENADSRKLLVKQARAYFSQSVSTTELMQNSLE
GUT_GENOME051355_00312  43-157  FDNIRHTDKNDTEYWSARELMPILGYKRWENFYKVIKQAMISCENSEYMVKNHFRELTKMVNIGSKTSRLIDDYKLSRYACYLIAQNGDAKKKSIALAQTYFAVQTRKIEILEKE
GUT_GENOME249178_02837  12-126  SDFEQIKKQNESGSEYWTSRDLCVALGYSTYQKFTRTINKAIAIANHKGLNTAEHFNHTVEMVKLGSGSFRKVENIHLSRMACLIIAENADGKKPQVQMAKEYFRQETPTTELLN
GUT_GENOME268596_00021  55-177  NIFENIKHIDEYENEYWYARELSKVLEYKDWRNFLKVLNKAKDACKNSGFDIDEQLVEVNRLSKRNNNATANIQDYKLSRYICYLIVQNADPSKEVVALGQTYFAIQTRKQEITEQEYDSLSD
GUT_GENOME090969_01825  70-192  TLDDLAHTDAESGVEFWYARDIMGFLGYTQWRNFEAAVKKAMVSAESSKTAGEHHFAGVSKMVELGSGSKRKVKDYKLTRYACYLIAQNGDPRKEEIAFAQSYSALQTRKQELIEERMRALAR
GUT_GENOME245569_02144  10-123  FEEIKRFNPEIGEFWYARSLATVLEYKDWRNFINVINKAKEACLNSKHNILDHFVDVTEMVELGSGAKRQIEDILLSRYACYLIIQNADPSKEIIALGQTYFATQTRKQELEEE
GUT_GENOME195746_01080  421-565  IKRAMIACENSGHNVMDDFAEVSKIVEAGATHKSTRDYELSRYACYLIVQNGDPRKEVIALGQTYFAIQTYRQEVADHFNELDEDNRRLVVRGDIKQWNQMLAETAHNAGVITNEEFAIFQNAGYMGLYGGLDVDDIHTRKQLEV
GUT_GENOME118338_00257  32-131  GITYWYASDLAMMLGYNDMQAILKAMNKAYAVCNNLNIPIAENFIQTQSPNTPSDFKMTRFACYLTVMNGNISNPKVAAAQAYFAKLADEIHTLCQSADE
GUT_GENOME057993_01065  6-134  IAEMIRRFNGISKITDEGVEYWLARDLMNVLGYNNWRSFENVVIRAMTACKTAGYEVENYFINTYKESSLGKTGIRNIPDYMLTRYACYLVAQNGDSKKDAIAFAQSYFALQTRKQEIIEDRLSLNKRM
GUT_GENOME136488_00630  13-134  LDSYANVEDGVEYWLARDLMEPMGYTRWENFAEVVKRAKVSCETNKTPVDSHFRDSTKMVTAGVAARAVKDYKLTRYACYLIAQNGDSNKSEIALAQAYFAVQTRRQELIEQRFAEIQRLQA
GUT_GENOME068610_01037  27-133  FEDCSHQNGICYWLDTDLMRMLGYEDMVTFKKVIIKAMNACTTLGEDPAEDFIQYKHVDEYEISGSYKLSRFACLLIAQNADVSKIQVKTAQYMLARMADVLLQKQD
GUT_GENOME096386_02359  119-237  SKESVFEQIRHIDENGIEYWSAREMAKVLEYSEYRHFLPVIEKAKEACANSNNNPLDHFEDILEMVSIGSGAKRPLESVKLSRYACYLIVQNADPGKEVVANGQTYFAVQTRIAEIKQM
GUT_GENOME286462_00986  27-133  QSPFEQLREVDADGREWWNSRKLARVMGYGKYWNFERVIAKAQAWASRKGYHLGEHFVEITEMAELGSGAVREVTSIRLSRAACMAVVMNADQKKEMVKVARQYFSS
GUT_GENOME091570_01841  13-134  LSTIAHTTEDSKVEYWYARELMTYMGYDRWENFSKAITRAKQACDNSGVSVESHFRDTTRDVTLGSGATRSIADVKLTRYACYLIAQNGDPKKEEVALLQSYFAVQTRKTEIIEQRMGEISR
GUT_GENOME231905_01538  14-124  SPFDAIKNTNPETGTEFWSARDLMPLMGYATWERFQNPLERARKSAENQGGGDQFRRSAKSPETGGRERIDFHLTRFAAYLVAMNGDPNKPEVAAAQAYFAIKTREAETAK
GUT_GENOME205748_00611  6-105  GIKHIDEKNEYWFARELQKNLGYKRWSSFAAVIKKAQTACLKSDNQINNHFIIYKNNNNVDYKLSRFACLLIIQNANPQIKEVALAQTYFADQTRKMELK
GUT_GENOME231503_01221  13-118  FEQVSCVLNDVECWSARDLCSLLGYKLWQNFTKVIDKAKEACVNVGQNVEDHFIDVNKMVVIGSGAERQIDDIMLTRYACYLVAQNGDPRKPQIAFAQTETVKMEQ
GUT_GENOME239081_00683  128-235  NEYNQEFWYARELQEVLEYSQWRYFYGVISKTIEACESSGLSVEDHFAEVRKMVVLGSGAEREVEDFMLTRYACYLIVMNGDSRKEVIALGQTYFAVKTRQQELIEDY
GUT_GENOME205427_01662  15-126  KSPFEKIRKFDGQGRSYWTSRELCAALGYSTYQKFSVPLAKAMRSAEADGINVDEHFNLMVEMVGVGSGARRSVENYHLTREACIFIARQVYAKKKEVQAALEYFSSVSIDF
GUT_GENOME231147_00610  33-142  ILETCSHQNGRRYWLAKEFMLRLGYSSWDQFRRVINRAIASCAKLELDVSEAFEQIRSDDGRLVDYKLNRFACFLIATYADDKKPEVQRIKIALSAFAETVIQAAIDSNG
GUT_GENOME114810_01464  14-127  FEQMRHCENGVKYWLASELGPFLGYAVWQELPIARAMKACENSGQNPDAHFEQIVKTVPLRSGGKHEIKDFRLSRYACYLLFQNGDPSKPAVAEGQSYLALQTRRQELQKPSSS
GUT_GENOME263197_02753  4-103  DGKSFWNSRSLSKTMGYSEYSKFKRVIDKAISVCQENNQRVEDHFAHASDMVEVGSGAFRQVDCYRLSQYACLLIAMQADGRKEQTLQAIQYFSGKVESD
GUT_GENOME143098_02042  16-117  DKFELSSHKNGTTYWLASDLAQMLEYDSVKSLNKAINKAISICSNIGHNILEHFTPINGGADYKLSRFACFLTSMNADSKKPAVAKAQAYLATIAEALHHYI
GUT_GENOME008415_01145  40-158  KETFESIKKIDENGTEWWSSRELARVLTYSNYKYFLEVMRKAWAAVSNSGLKPSDHFVVYNEMVPIGSNAERQVDTVKMTRFACYLTVQNADSSKTIVAQAQTYFAIQTRRAEKLLDAP
GUT_GENOME144559_04101  5-128  HQPFEEIRHYGTEGQEFWSARELAPLLDYRDWRNFQKVLARATQACEASNQAASDHFVETTKMVVLGSGAQRELEDVHLSRYACYLVVQNGDPAKPVIAAGQTYFAIQTRRQELADDEAFRQLR
GUT_GENOME120591_01298  31-110  NGIEFWYARELQKILEYKQWSRFESVIEKAKVACENSSNISSIEDFADVGKLSKRANNAEVEIRDYKLTRYECYLIAQNV
GUT_GENOME237192_01185  23-145  TEKMFEDIKHIDEEGNEYWLARELQIAFKYKEWRKFDGIINKSITACNNSNINANDQFVQVDKLIQHGKGGMRNIKDYKLSRYACYLIAQNGDSHMKVIALAQTYFAIQTRKQELSEKEYSML
GUT_GENOME237064_00373  66-191  FDDIRHIDEQENEYWEARELMKLLGYKEWRNFEKIIYKAMIACNNSKNTIVYDFVDVNKIVEAGASSKVIKDYKLSRYACYLIVQNGDSRKKPIALGQTYFAIQTRKMELTEIEYNNLSEDEKRLY
GUT_GENOME261969_01270  17-148  PETNTPNDNLFEKIKHIDDDGVEFWYARELQTVLGYSEWRNFNKVIDKAKIACKSSGYIVSSEFVDVNKLVYVGANLQREVQDIVLSRYACYLIAMNGDPRKETIALAQTYFTVKTREREISEHFDELTENK
GUT_GENOME011027_01542  13-132  SFEDFKNQNGITFWWASELMLMLGYDDMRVFHKVIDKATKAFMSLGINHYENIIYVEREREDKKFPDFKLTRFACYIVAMNSDPRKVEVARVQAYFAEQTRKFEVYLQGSDDMDRILFRE
GUT_GENOME018885_00707  18-131  NYESIFENQANSNGFTYWFARDLMNTLGYKDYNNFKKSINKAISICASLNIDVEDNFKKTKRIIDNKETEDYKLSRFACYLTTMNSDIKKPEVAKAQAYLAKYAEIIVTLQQEA
GUT_GENOME243952_01411  3-114  KIFDSIRQMDEAGQEYWSARELASVIGYADYRNFSKIIDKAKVLCVQNNKSVEDHFVEVNEMVTIGSEAKRKLKSYHLSRYACLLISMSLSSRKENALPALEYFSGKQNLSS
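
The rows of the sequence list can be split into their three columns and sanity-checked with a short script. The sketch below is illustrative and not part of the record. It assumes each protein identifier has the form GUT_GENOME, six digits, an underscore, then five digits, followed by a start-end range and the amino-acid string, with or without whitespace between the fields. The names `ROW_RE` and `parse_row` are hypothetical.

```python
import re

# Assumed row layout (an assumption, not documented in this record):
#   GUT_GENOME<6 digits>_<5 digits>  <start>-<end>  <amino-acid letters>
# Whitespace between the fields is optional, so fused rows also parse.
ROW_RE = re.compile(r"^(GUT_GENOME\d{6}_\d{5})\s*(\d+)-(\d+)\s*([A-Z]+)$")


def parse_row(line):
    """Split one sequence-list row into protein ID, range, and sequence."""
    match = ROW_RE.match(line.strip())
    if match is None:
        raise ValueError("row does not match the assumed layout: %r" % line)
    start, end = int(match.group(2)), int(match.group(3))
    seq = match.group(4)
    return {
        "protein": match.group(1),
        "start": start,
        "end": end,
        "seq": seq,
        # The range is inclusive, so it should span exactly len(seq) residues.
        "length_matches": end - start + 1 == len(seq),
    }


# Example using one row from the list above.
row = parse_row(
    "GUT_GENOME205748_00611 6-105 "
    "GIKHIDEKNEYWFARELQKNLGYKRWSSFAAVIKKAQTACLKSDNQINNHFIIYKNNNNV"
    "DYKLSRFACLLIIQNANPQIKEVALAQTYFADQTRKMELK"
)
print(row["protein"], row["start"], row["end"], row["length_matches"])
```

Comparing the inclusive range width with the sequence length is a cheap way to confirm that an identifier and its range were separated at the right position.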