UHGP-MC 39417


Information


Number of sequences (UHGP-50):
151
Average sequence length:
82±9 aa
Average transmembrane regions:
0.11
Low complexity (%):
2.73
Coiled coils (%):
0
Disordered domains (%):
11.32

Pfam dominant architecture:
PF18823
Pfam % dominant architecture:
132
Pfam overlap:
0.27
Pfam overlap type:
shifted

Downloads

Seeds:
MC39417.fasta
Seeds (0.60 cdhit):
MC39417_cdhit.fasta
MSA:
MC39417_msa.fasta
HMM model:
MC39417.hmm

Sequences list (filtered 60 P.I.)

Protein Range AA
GUT_GENOME114054_001711571-1671SSVVDGLRERAVELGRELGADVDVVSDVSEITDPNPEVQERKRRAKGWYDAKTGRVVINLAAHSDMADVEQTLLHEIVGHRGLRGLVGHENFNGFLDGIYE
GUT_GENOME125015_009932603-2689SSPEAMTEAARALSEQLGVPVEIVTDVDSITHPNEAVEVKRRGAKGWYDKRTGKVAIVVPNHLDVEDVAATVFHEVVAHRGLRELVG
GUT_GENOME152111_008644253-4348TYIIDSIHAEAAKLNIAVKVYTSLDQVPQGAARRAIERGLRVKGWHYKGDICVYLPYADGMEDIKATILHEGVAHFGLRKMVGEKNMNAFMDGIYA
GUT_GENOME285079_014443530-3599VELSERLGVDIEIIEDNSSLEGKRGRSKGWYSVKDGRISVVIGNHVSVADIEKTVLHEAVAHHGLRELLG
GUT_GENOME061833_001025535-5633VSDSNLYRQSETTPHAAISSESKSQAAQKATENLNLGGRVVVHESVEGLEGKEATAKGWYDTQTGQIHVVLSNNADAADVTQTILHEAVAHHGLRELFG
GUT_GENOME023735_002485696-5782TSPEAKQEYAERLSKKFNTPIRIVTDVNELTHENPEIQAAMRRHKGFYDVRTGEVVVVVPNNADVEDVAESVFHEVVAHKGLREIIG
GUT_GENOME278301_003992733-2817SSMVIDANQIAESLNVPVQVINSIDEVPEGAAKEAIQKGRKVKGWYDVKTGKVYVYLPNATSAEDVKQTILHEGIAHFGLRQLVG
GUT_GENOME243003_001991760-1857RITEERRKRAAAADMARTFGVPMKLHEDMSTLPEEVRAGFDETHFVKGWFYKGDNAVHIYLPANVDADDVRATFLHEAVGHYGCRELLGEVGYNQVLD
GUT_GENOME113426_00865547-627MDRPETESVVESHVQRLGDKLGVHVNMMWDVEDIPDPRVKADLEAGKKVAGWYDEKTGRVYLYMPNVRDTYTAEKTVWHEV
GUT_GENOME232734_010295167-5244TRRAVESVSDVARKLGLNVEVKTDTEGLTGRRARAKGWYDTKTGKIVIILPNHTSGQDVMRTILHEGVAHHGLRELFG
GUT_GENOME053365_001075181-5262NNAMAERVNELAERLHTPVRIIRTDEEVAALPSTRQRRMKGSFNPMTGEVTIVVPNNANMTDVENTFIHEVVGHDGLRVLFP
GUT_GENOME252025_001814176-4263TSQDIEAAATKLAESLGEKVRIVRDVNEIENQNKKVQENQRKSKGWFNTDTREIVVVLPNATGIEDVQATILHEAVGHKGLRELVGDD
GUT_GENOME258514_016823596-3676IEDAVLRLADALHTSVQIVRGEKQVADTETERRRRQTAGGWYDPRTGQVYIVPANLASVAEAEATVLHEVVGHKGLRNVLG
GUT_GENOME096415_017471994-2075IRNEIQSLTDKLNTHIRVVDNVWEIVDDNPKLQQRKRDSKGWFDSSTGEIVIVLPNASSVEDVQSTILHEVVAHKGLRDMFG
GUT_GENOME128080_004292959-3059VKSENEVRAKEEVREHAEEARKQKMRSAVEDMAGKLHLNNVDIVDKPAGRGAGRRERAKGMFERSTGRITINIGNCADKRDAVVTLLHEAVAHHGLRKLFG
GUT_GENOME082353_013764984-5057AVRHWADKMHLGDKVTVLTSTDGLQGKKARAKGWFVPKDGKITVVLPNHTDVGDVIRTLLHEGVAHYGLRQLFG
GUT_GENOME019585_013522363-2452TFTSKDDFIPAIESLANSIHIKVNIIKDINELPDNLIKRKIKEGNNVKGWFIPATNSVNMYLPHISSQEDAQRTFFHEAVAHYGLRKMFG
GUT_GENOME214482_01558639-721LIPVIESMSGALHTSVCIINDMDELPDGKVKRNIEAGHNIKGWFAPATDQVTVYLPNILDPDDARRTVFHEVVGHYGLRKMFG
GUT_GENOME223961_001474206-4282IQDAARETASKLGIGDRVTMMETADGLTGRRAKAKGWFDTETGKIVVVLGNHRNREDVMQTILHEGVAHYGLRQLFG
GUT_GENOME130507_017474694-4782RQQREFAQRERQRMAERVNELASRLHLDNIEAISDTSGMDPILATKKGFYNKRTGKISIILPNHLSAMDVEKTVLHEAVAHYGLRKLFG
GUT_GENOME244242_014404943-5041EALNAKVKRTMQEDRDMMRATVQQMGEKLHTPINIIEDVESITHPNAAVQERRRKSKGWYDTKTGQVNIVLANNRDVDDVKASVGHETIAHKGLRELVG
GUT_GENOME175935_0145922-100QRLAEALHTPVRFVEDVEEITDPNPVVQAQKRASKGWYDIATGEVVVVLANCDNVADVEATIFHEAVGHKGLRELVGDE
GUT_GENOME111254_027971681-1770IQEYIKREAGKLNIPVRIVGDVSQISPSEKNYTRKLKSQGWYDQTTGEIVIVAPNHGSIRDAQRTLLHEAVAHYGLPAMLGRENFDKLCD
GUT_GENOME101068_03784565-649LIATINEYATELHHTSIRIIHNVEELPEDSTPYRLIKAGRDIKAYFDPKSKEVAVYLPNIQNTEDAKRSVYHEVVAHYGLREMYG
GUT_GENOME096203_008032390-2482DNTSISNHEQVESEIEKLSDLLHVPVKMVRSLEELSDGMARRAIVNGRNVKGWFDTKTGEVVVYLPNATGEEDAKATFLHEIVGHKGLRALLG
GUT_GENOME039151_028432958-3043SAPMNTAVNELSESLHTPIEKITSEDQLPQGEARRRIESGANIKGWYSPKENKVYLYMPNTTSVEDAQATIFHEVVAHKGLRELFG
GUT_GENOME001158_03499883-960VSFLAGKLHVPVEVIRRADEIGSPDIRGLLSCGKDIRGWYDIPSQRVCLYLPHARGKADVERTLLHEGVAHYGLRKLA
GUT_GENOME101337_025123982-4072VSNLIRKGKENKIVSAINALSNALHSPVNIVRRFDDLPSDVRSNEKMVKGWSDLETGEISVYLPNATNVEDVQATVLHEVIGHRGLNEVFG
GUT_GENOME278928_000644251-4348IKTVEKVAKRTGGKVKMVNSVEEIENPKVRKDIENGKQVTGWYDENTGEVHLYMPNIHDTYTAEKTVWHETVGHKGMRGLLGDKFDSYMRSLWMDLDN
GUT_GENOME050035_000443817-3915DNLYREVSSVKSEIDYNENEAYSLAEKLNIPLQVVISPEQISDPSVKSAIESGRKIKGWFSVSEGKVYVYLPNASGIEDVKQTILHEGVAHYGLRKLVG
GUT_GENOME230586_017723581-3667DNVSSIESSINGWSNKLNTPVRVIHDVDDITDTDENMLARKRDSKGWYDTSTGEIVIVSPNSTSVGDAQRTFLHEVVGHHGLRELFG
GUT_GENOME245669_006104895-4997RDLRKRMDTDGTSMSIRVREIAEELHTPVELITTPEEIKQLPTVRHQNAKGWFKDGKVYVVVPNNTNVADVENTAIHEIVGHKGLRELVGEEHFNLFLDEIYD
GUT_GENOME164301_00984330-430TDYSNTAQLSDSEKAKHQAATDLAQKLHVEGDVEVVTTTKGLTGRQAKAKGWYDVKTGRITIVLPNHNGRADVVNTILHEAVGHYGLRELVGKEKMNEFLD
GUT_GENOME216739_004572832-2918VRRIARTLNTPVEIIDDLDAITDSDPLVQRRKRHSKGFYDPQTGQTFIVLPNITTLADAEATVLHEIVGHMGLRSLMGDRFGDFLDK
GUT_GENOME284693_007704564-4641MVMRIEELAGKLHLDKSRLDIVTDVSQLKGNRAKAKGFYNKRTGKITIVIPNNASTIDVEQTLLHEAVAHYGLRKLFG
GUT_GENOME129169_00334273-365VEQEEYDMQDLEEVATSMAEVLGDDVRVIHDTAEIEGRNESETNRMRGAKGWYDPKTGQVVVVLPNAESADDVEATILHEVVGHKGLQELVGK
GUT_GENOME115700_012663868-3954SSKFSQVAAIDELASSLHIPIHIIRDINDITDEDKDTQRKKRGSKGWYDMETGEVYLVLPNAENIADAQATVLHEVVAHKGLRGLLG
GUT_GENOME167108_006834725-4792IGEITSVYERIGDVPGNERFSKRRLRSKGWYDPETGRIAIVMNNHHSPQDVLQTILHEAVAHYGLRKL
GUT_GENOME270036_002694498-4582EPSDIRESISELAGRLGENIRIVEDVNELTDSNPKVQRRMRNSFGWYDTETGEIVIVLPNARSVADARATIFHEAVAHKGLREFV
GUT_GENOME159312_009152418-2506REVESREAVEAAAVRLAGALHTPVEVVRDVEAITDADAGRQKRKRTARGWYDVETGKVVLVLPNAESAADAEATVLHEVVGHMGLRAVL
GUT_GENOME214832_018144992-5072TVEAKRERVEQMAHKLGTPVEVVTDAGKLPENMRGRKAWYDTRTGKVVVVLPNHADAGDVAETVFHEVVGHKGLRELVGDR
GUT_GENOME276337_018132379-2458TAENLSRTLHTPIRIVRNVNDIHDDEKEALRKRRAYGWYDMETGEVVIVLPNCRDEAEVQATILHEVVGHKGLRELVGEE
GUT_GENOME219992_015143925-4029LSATKKVKNFENSKSSDVGRMSAKAREMAEKLHLDNVEFVTDTSVLTGRQKKAKGFFNPQTGKITIVLSNHTDASDVERTILHEAVAHYGLRQLFGERFDTFLDN
GUT_GENOME122368_001311860-1945EGKIVFSVDEMSDKLNTPIHIARSLNEIPDGSAKRAIEEGRKVKAWFAPKSNEVVLYLPNTTDVNDAIRSVLHEVVGHKGLRNLFG
GUT_GENOME217754_008612950-3053ASSEANILNMRDRVKELSEKSDIHVRVITDESELTKMSEDGKPRYSRRERRAKGWWSAKDDEVVIVLPNNRDVADVDNTFVHEVVGHKGLRALVGEERFDEFLG
GUT_GENOME243458_003812334-2411AEDALRKLGLEGVPVEVVSRDDVAEAYARDAMGWHDRKTGKVTIVAENNELEDIEATVLHEVVGHHGLEALLGKEGLA
GUT_GENOME132131_049762457-2550LRENSEDLHEINAAKVLAAITKLSDVLHTHIHMPRTVEELPDGAAKRAIERGRNIKAWYDTNTGEVSVYIPNLNNASDAVKSVLHEVVGHKGLR