UHGP-MC 58951


Information


Number of sequences (UHGP-50): 74
Average sequence length: 88±10 aa
Average transmembrane regions: 0.02
Low complexity (%): 3.06
Coiled coils (%): 0
Disordered domains (%): 2.34

Pfam dominant architecture: PF00710
Pfam % dominant architecture: 7973
Pfam overlap: 0.47
Pfam overlap type: reduced

Downloads

Seeds: MC58951.fasta
Seeds (0.60 cdhit): MC58951_cdhit.fasta
MSA: MC58951_msa.fasta
HMM model: MC58951.hmm

Sequences list (filtered 60 P.I.)

Protein Range AA
GUT_GENOME243297_00189 38-148 EVREAFCLPSPELSPADWLSLAREMEGCPDADGFVVTHGTDTMTSAAATLAFLGAGGARPVVLTGAMKAPSEPGADGPTNLKDAELFARELARRKIRGVFLAFGGSLLWGA
GUT_GENOME024196_01174 89-174 LVITHGTDTLEETAFFLYLTAKSPKPVVITGSMLPASAHSADGPGNLINAICTAASRESWGKGVIVCMCNRLLSARDAAKTSTYRL
GUT_GENOME097560_00166 77-146 RGVVLLHGSDTLTYTAALLAHLLPDKPLVLVAAGTPPEAPGSNALPNLDLAVQYLLSGAKDGVQIAYDGL
GUT_GENOME095287_03903 126-239 KRDDVDGVVVTHGTDTLEETAFFLDLTVRSDKPIVVVGSMRPATALSADGPLNLLDAVTVAASREAAGKGVLVTMNDQIHAGRDVTKRVNVVPSAFASQWGPLGMVVEGKTYFF
GUT_GENOME015162_02089 173-258 DADGCIITHGTDTMAYTAAALSFMLLDIDKPVILTGSQLPLDHAITDARVNLLEAAEAARSVSRGVYVCFHHKLIAGTRAVKMRTT
GUT_GENOME080352_00072 82-180 YDKDDSIDGIVITHGTDTLEETAFFLNLTLNCYKPVVLTGSMRPATSTSADGPMNLYQAVALACSTDAYDAGVLALFSDTIYSGRDLQKTNSYKTDAFK
GUT_GENOME021289_02610 91-175 IVITQGTDTLEETAFFLDLLWDADVPLVLTGAMHPSDHPGADGPANLLDASRVALDTQSRKRGVLVVMNGQIHQPLHVRKCSSIT
GUT_GENOME113646_00256 87-188 GVVVTHGTDTLEETSYFLDITHQSDKPIVLTAAMRGAGDVSPDGPANIYCAVQAAADCSSRDKGVMVCLNNLLHAAGEVMKTHSANCATFASPWWGPIGYVE
GUT_GENOME000217_03433 80-161 VIIHGTDTLAFTSSALSYMIQNSRKPIVVTGSQYPIDHEKTDGVKNLADSCIFAAESGMPGVYVVFNGKVMFGNHVKKIKSK
GUT_GENOME126926_00712 97-165 KGVIILHGSDTLAFTSAIIANAFADKNIVLVASDKPVENPLSNAKANFDAALAHLLSDKSGVYVSYNGI
GUT_GENOME219695_01775 1-72 MDAYNGFVITHGTDTLAYTSSMLSYLCGSLSKPVVLTGSQRTLDEPNSDAPVNLRDALLVALSGLSGIFAVF
GUT_GENOME000603_03705 174-260 VICHGTDTMAYTAAALSYLIQHSSKPIVLTGAQKPIGSEITDAKANLRDSILYAADPHSRGVQIVFDGKVILGTRAKKTKTLSFAAF
GUT_GENOME098570_02876 93-201 ELKNLIKKTLDRQDITGVIVTHGTDTLEETAYFLDLTIRHHKPVVVVGAMRNSSELGYDGSSNLAAAVCTAISPKAIDKGVLVVLNNEVNIASEVTKTNTLSLNTFQSP
GUT_GENOME103523_00767 79-185 DGFVVLHGTDTMAYATSALSFALASFGKPVVFTGSQLPARSAGSDAFSNMAGALSAVLSGRAEGVSLFFGRHLFRGNRVIKRSTWDFEGFESPCAAPLACAGAPWEW
GUT_GENOME000452_05954 89-188 GIVITHGTDTLEETAYFLSLVIRHDKPVVLVGAMRPATALGAEGPANLYNAVALARHPDARGRGPLVVMNEDVHYAREIQKIASAGINAFASPNRGRAGV
GUT_GENOME015880_01069 84-172 VVTHGTDTLGYSAAAVTYMLKGLGKPVVFTGSQLPIEHSDTDAKINLFDAVCFACENVSGVYVVFNGIVIYGTHARKIKTTSFDAFESA
GUT_GENOME096235_02600 82-162 IIITHGTDTLEETAYFLELTVKDSRPVILTASQRDASERDSDGPRNLHNSMRIAMDPHAKERGVLIALNEEIHAARDVRKL
GUT_GENOME119539_01714 81-165 VVVVQGTNLMEEMAFGLDILLRTDVPVVCTGAMRPATSVSADGPYNLIDAVAVAACDLCRGMGVLVVMNEKIHSAQYVRKEHGLC
GUT_GENOME103890_03652 62-165 SKDLTFDVWERLVARIRHWIDVERVDGVVITHGTDTLEETAMLLHLTQQTDTPIVLTAAMRPSTSLSADGPLNLLNAVRLAASPSARGRGVLVALNQRVHAARD
GUT_GENOME158674_00731 164-263 GYDGIVVTHGTDTMAYTASILSFMLHNIPIPIVLTGSQLPILHPLTDGVENLRCAFAMAASGAPGVFVAFNRKVILGARAVKVRTTGFDAFESVNAPYAA
GUT_GENOME164254_00256 102-187 IQKAVKNNEADAFVITHGTDTLEETAFFLDQVLQINNPVVLVGSTRPADSISADGPKNLLNAIRTAVDIDSFGRGVLVCYNERIFS
GUT_GENOME151500_00966 67-166 VDHWERLAATVRERLSQADVAGVVITHGTDSMEETAYFLHLTVASAKPVVITGAMRPATAISADGPLNLLQAVQTARSPQARGKGVLVVMNGQIHGAREV
GUT_GENOME096468_01084 75-175 REAVRRAVDEDDCDAVLVLHGTDTLAYSAAALSFQLIGLPAPVVFTGSMLPAGVPDSDAWENIEGALAALAQGLTPGVWLHFHGQTMLPTRCAKVRSSGRD
GUT_GENOME035741_01445 51-128 GRAAYEGIILTHGTDTLAYTAALLDQLLAGTPLPVVLVSSQLPLQEPAADGVDNLAAAVDFIADTHMPGIFAACRGED
GUT_GENOME142546_02042 83-173 FVVLHGTDTLAHTASALACLLAGLGKPVAVTGSMRPLFEAGSDAPANIRLAVSAVRQAGLCEVVVPFAGRVWRGCRVRKAHSLADAAFVAP
GUT_GENOME113453_00580 210-293 SDYDGFVITHGTDTMAYGAAALSCLIQNPDKPVVFTGSQLPMDEPGSDAPGNLTDAFRCAGQGSGGVWVCFNGRVISGRAARKV
GUT_GENOME063202_00259 18-98 VIAHGTDTLACTAALLHHLLPHIDRPVVVTGSMLPMGAEGSDAPGNLLDALRVATDGRRGVYAVLRGSILRGTNVLKVHST
GUT_GENOME000576_00966 99-191 VVITHGTDTLEETAYFLQLTIGAPVPIVLTGAMRSSNEVGSDGEFNLITALRVAVSETARDKGVLVVFNGEIHSAFNVTKTHTSSVDTFKSVH
GUT_GENOME238933_02349 80-168 GADETVDGIVITHGTDSLEETAYFLQLTAHTNKPIVLVGSMRPATAISADGPLNLLEAVQLAADERTGTFGVVVAMNGTICSARFVEKT
GUT_GENOME014048_00093 74-163 SNIDGVVITHGSDTLEESAFFLNLVIKSSKPIVLVAAMRPSDSVSADGLQNLYNALCLCADTSSRDRGVMVVMNDKIFGARDMTKTQTLN
GUT_GENOME212685_00449 79-175 CIEDLDRRDSADGFVITHGTDTLEETAYFLSLVLKTEKPVVLTGAMRPSTGLSADGPMNLAGAICLAASSEARAMGVLVLMNDTISSAFSVQKTNTT
GUT_GENOME208007_00749 78-177 FRGIVLTHGTDTLEETAFLLNRYWNLDTPLVVTGAMRPANAPGADGPANLHDAIVTAASDQARGLGVLAVFDSLVHAADRVTKVSSRSIDAFDSEPSGPL
GUT_GENOME140242_01628 84-191 YDGFVISHGTDTMAYTAAALSVMLQGLNKPVILTGSQLPMEAAGTDAKQNLYDAFLYGVRPEARGVSVVFNGKVIDGLCARKWKTESFDAFVSINRPLLATIENGRVK
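
Each row of the sequences list carries three fields: a protein identifier, a residue range, and the amino-acid sequence covering that range. The following is a minimal sketch of how such a row might be split and sanity-checked. It assumes, rather than documents, that identifiers follow the pattern `GUT_GENOME` plus six digits, an underscore, and five digits, and that the range is inclusive. The names `ROW_RE`, `Row`, `parse_row`, and `range_matches_sequence` are hypothetical helpers, not part of any published tool.

```python
import re
from typing import NamedTuple

# Assumed row layout: protein ID, "start-end" range, then the sequence.
# Whitespace between fields is optional, so rows whose fields run
# together without separators are parsed as well.
ROW_RE = re.compile(r"^(GUT_GENOME\d{6}_\d{5})\s*(\d+)-(\d+)\s*([A-Z]+)$")


class Row(NamedTuple):
    protein: str
    start: int
    end: int
    sequence: str


def parse_row(line: str) -> Row:
    """Split one row of the sequences list into its three fields."""
    m = ROW_RE.match(line.strip())
    if m is None:
        raise ValueError(f"unrecognised row: {line!r}")
    protein, start, end, sequence = m.groups()
    return Row(protein, int(start), int(end), sequence)


def range_matches_sequence(row: Row) -> bool:
    """Check that an inclusive start-end range spans len(sequence) residues."""
    return row.end - row.start + 1 == len(row.sequence)


# One row taken from the list above.
example = parse_row(
    "GUT_GENOME219695_01775 1-72 "
    "MDAYNGFVITHGTDTLAYTSSMLSYLCGSLSKPVVLTGSQRTLDEPNSDAPVNLRDALLVALSGLSGIFAVF"
)
```

For the example row, the range 1-72 agrees with the 72 residues in the sequence, which is consistent with reading the range as inclusive. A check like `range_matches_sequence` can flag rows where the identifier and range were split at the wrong position.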