UHGP-MC 120531


Information


Number of sequences (UHGP-50): 113
Average sequence length: 134±11 aa
Average transmembrane regions: 0
Low complexity (%): 6.9
Coiled coils (%): 0
Disordered domains (%): 0.14

Pfam dominant architecture: PF02554
Pfam % dominant architecture: 6283
Pfam overlap: 0.59
Pfam overlap type: reduced

Downloads

Seeds: MC120531.fasta
Seeds (0.60 cdhit): MC120531_cdhit.fasta
MSA: MC120531_msa.fasta
HMM model: MC120531.hmm

Sequences list (filtered 60 P.I.)

Protein Range AA
GUT_GENOME063311_00428 5-142 VYFLLAVAGLIVGYIIYGSIVAKIFGADENRPTPAKTMADGVDYVEMPMWKVWLIQLLNIAGVGPVFGPILGALYGPSALLWIVIGTIFAGAVHDYFSGMLSVRYKGANVPTIVGYNLGNVAKQVMRVFAVILLLPSC
GUT_GENOME050735_00046 8-148 ASAIILLAAYFTYGKFLANTVFKLDDKNPVPSVEMEDGVDYVPAKKGMLLGQHFSAIAAAGPINGPILAAVIYGWAPALLWVILGCIFIGGIHDMGSLVASVRHKAKSITEVVRLNVSQRAWILFMIFVWITLVYVIVAFT
GUT_GENOME096069_01990 1-146 MNTSILLIIGIVIYLVCYLWYGKSLERRVVGADNSRPTPAHSKFDDVDFVPSHPAVLFGHHFASIAGGAPILGPALAMAWGWWAGLLWIWFGNILIGAVHDYLAIMASVRHEGRSIQWIAGKMMRPRTSYLFQVFAYLTLVLALAA
GUT_GENOME225078_00588 49-177 VAYRYYSLYIAQKVMKLDPTRATPAVINNDGLNYVPTNRYVLFGHHFAAIAGAGPLVGPVLAAQMGYLPGTLWLLAGVVLAGAVQDFMVLFISSRRNGASLGEMIKEEMGPVPGTIALFGCFLIMIIIL
GUT_GENOME232852_01542 9-145 LGTLILIGAYFTYGKFLSKKVFELDNTRTAPSEEFNDGIDYVPTDAKYLAGQHFSAIAAAGPVTGPIIAGMTFGWFPTACWILIGTILIGGVHDMGALVASVRHKAAGIADTMKNYVSQRVWILFNVFIFFTLVMVI
GUT_GENOME237521_00272 8-147 LIAVAYFVGAYFLYGKFLSRVFKIDPDRTTPAHTMKDGVDYMPASPYVLFGHHFASIAGSGPILGPIFAAELGWGPALLWIFIGCVFVGAMHDFASLFLSVRNQGKSISYVIEKLIGYSGRMFFSIFCWATLALVVAVFG
GUT_GENOME238070_00556 1-142 MNAFILLLIGIVAFFIAYISYGSWLAKKWGIDPGKKTPAHTLTDGKDYVPTDAKVLLGHHFSSIAGAGPITGPIIAMMFGWLPVYLWIVLGCIFFGGVHDYGSLLASLRHEGKSIGEVIRANVGQKGKIMFTLYAFITICLV
GUT_GENOME205052_00641 14-151 FAALCVFAIGYRFYGLFIAAKVLNVDANRVTPAVKYADGQDYVKTDKFVLFGHHFAAIAAAGPLLGPVLAAQFGYLPGLLWILIGCVLAGGVHDMVVLFCSVRHKGRSLAYIATREIDPTTGFVASWAVLAILILTLA
GUT_GENOME100276_00581 9-134 VTVLALFTVGYLGYSRYLAQFVELDDSRETPAHKYEDGQEYVPAKKPVLLGHHYSSIAGGAPIVGPITAGVVWGWVPALAWIAIGNPLLGSVHDFVSLSSSLRHDGKSIGYIIGEYVGERGKNMLL
GUT_GENOME159085_01488 13-135 VGYLTYSRYVDAQFEPEDNVTAATEKYDGVDYVPIPCWKNMVIHLLNIAGMGPVLAAIQGVLFGPWVFIIVPLGCIFMGAVHDYMCGMVSIRTGGLQLTGMIKKFLGEKFFRFFMVAVTLMSF
GUT_GENOME243411_01057 2-141 ITFLCGIFILLVGGLWYSFYIERLFSPDGRITPAIARADGIDFTGMPCWKNALIALLSITGTGVILGSIQAAFFGPIAFLLIPLANVLGGSVHNYLSGMIAMRNRGMQMPRLVEKYLGLGVARVFLVLLMILLFFTGVIL
GUT_GENOME037890_00284 5-143 LVSICLLLAGYFTYGRFVERYFGVSAERDTPVKRLADGVDYQELKPWRMYVIQFLNIAGLGPIFGAIMGAAYGPMAYLWIVVGCIFMGAAHDFFSGMLSLRNDGKDLPEIVGTYLGKRMRQVLIVFAAFLLLAVGVSFV
GUT_GENOME194652_02431 4-153 LLLLALSMVALAAAYVIYGRYLEKTWGIDPKAKTPAVANEDGVDFVPSSKWEVFAHQFSSIAGAGPVTGPVMALMFGWVPTVLWIIVGGIFFGAVQDFGALYASVKSNGKSMGGIIEEYIGKTGKKLFFLFCWLFTLLVIAVFADMVAGT
GUT_GENOME264221_00074 7-144 FGISAVIFIIGYIVYGRFMANVYGLSDTNETPAVKFEDGIDYCPAHPAVLLGHHFASIAGAGPITGPIAAAMKFGWLPTILWCVIGSTFLGGPHDMGALVASLRHDGQSIGAVVEKWIGKTGKFLFLSFTILTLILVV
GUT_GENOME142597_01793 22-160 FLIGLAVLLGGGYLYGKFCERVFHPDRRTTPAYALADGVNFVPMKRWKNALIELLNIAGTGPVLGPIQGILFGPIAFVLIPIGCVLGGAVHDYFSGMLSLRNNGAQMPGLMQRFMGKGVYQIYNIFVCLLMFLVGVVFI
GUT_GENOME067860_01743 4-149 ILILVVGIALLALGYIFYGSWLAKKWGVNPDRPTPAHTKFDNKDFVPANPAVLMGHHFSSIAGAGPINGPIQAAVFGWLPVFLWVVIGGIFFGAMHDFGALFASIRHGGRSIGEVIKDNIGPKAYKLFVIFALLVLILVIASFTSV
GUT_GENOME171366_04844 72-204 ICILMIAYRLYGTFMAAKVLKLDDSKPTPAHELNDGKDYVPTNKWVTFGHHFAAIAAAGPLVGPILAAQFGYLPGLLWLLIGAVIGGAVHDAVVLFASMRKQGKSLSEVAKDELGPVAGFCTGLAMLFIITIT
GUT_GENOME027527_00414 10-146 SLALLLGGYFVYGAFVEKVFGADFRRLTPVKSRRDGVDYIELPRYKIFLIQLLNIAGLGPVLGPILGALYGPSALLWVVFGCIFGGAVHDYCSAMMSLRYGGASYPEIIGRNLGTGVRRFMEVFAIAFMIMVGAVFV
GUT_GENOME070452_01311 4-114 FFIGLAILFFGYIFYSKYIERQFSPEEREMPCNRLYNGVDYVPLPAWKNQLINLLDIAGMGPILSALQGMVFGPAVFIIVPVGCILMGCVHDYFSGMISARNDGLQITQIT
GUT_GENOME224777_00012 78-219 VVTSVCTYAIGYRFYALYIQKKIMRPFDLNATPAERVNNGKDFDPTNRVVLYGHHFAAIAGAGPLVGPVLAAQMGYLPGTLWIIFGVLLAGAVQDMLVLAFSMRRGGRSLGQMALDEIGKVGGAVAAIMILIMLMIVLAVLA
GUT_GENOME096506_00617 4-145 FVASIALLLTGYFVYAKVVERIFGIDDQQETPAYRNNDGFDYMPMSWWKASLIQLLNIAGLGPIFGAILGALYGPVAFIWIVVGSIFAGAVHDYFSGMLSLRHNGAQYPTLVGKYLGKHAKSIINLLSIALMILVAAAFTAG
GUT_GENOME047023_00453 8-148 VITILCFLVAYVTYGSWLAKQWDIDPKRKTPAHDFEDGVDYIPAKAPVLLGHHFASIAGAGPINGPIQAAVFGWVPVLLWIILGGIFFGAVQDFSAIVVSIRHKGKSLGEVIEENIGHRCKMLFTIFSWLVLLLVVAAFSD
GUT_GENOME206146_00600 5-153 LLCLAILIVGYFVYGKIVDNTFGPDDRETPAVRINDGVDYVVMPQWKLFLVQLLNIAGLGPIFGAMQGALWGPVVFLWITFGTIFAGGVHDYFSGMMSERNDGASIAEITGKYLGPVMQNVMRVFSVVLLIMVGTVFAVGPAGLIVELC
GUT_GENOME043712_00746 9-148 LGILILAAGYFTYGRVLEKIFRPDASRQTPAVACTDGVDYVVMPRWRVFLIQLLNIAGLGPIFGAVMGVLYGPAALLWIVIGCIFGGMVHDYFSGMISLRHKGENLPEILGRYLGSQAQWISRAVCIVFSVLVGVVFAVG
GUT_GENOME058778_00860 1-162 MNTLVIVLIAACCLVAGYIFYGRWLANKWGIDPKAKTPAVTKNDGQDYVPTDGWVVFAHQFSSIADAVVGLVVAHSTMNLPAYTGFHNEKSDDLFPILFVTVACGAVSGFHSLVSSGTSSKTISNEKDMPKVGFGAMLLESLLAVLALCVSALALTSLDSVA
GUT_GENOME000017_01974 18-138 MITFFSAIVILILGYFIYGRFVERTFGIDDSLKTPAITLEDGVDYVPMDWKKIFLIQFLNIAGLGPIFGAIQGALFGPAAFLWIVFGTIFAGGVHDFLSGYLSLKNKGVSASELVGIYLGE
GUT_GENOME172942_02151 4-145 IVILAVSAAILFLAYLVPGTRLAKRWGIYKETPGMGFRRRDDMDFIPVRTPVLLGHHFASVAGVGFLAGTIQAASFGWLPALLWILIGGIFIGALLDFSALFLSVRNQGKSVGTVMKQVVSGPSGVLVLVFSWLCLVVFSAY
GUT_GENOME096407_01004 7-143 GVVILIVAYLTYGKYLEKNFGIDNRETPALLKADGVDFVEMNWIKSFLVQFLNIAGLGPITGAVAGAMWGSSAFLWIVFGTIFAGAVHDYYTGMISMRNNGENMQELIGKYLGNRAKLFTKYFLIILLIIVGVAFIT
GUT_GENOME113381_02359 6-145 ICLLILVVGYFVYGRYVERVFGPEAKRPTPALTKADGVDYIPLPTWKIFMIQFLNIAGLGPIFGAIMGAKFGTASYLWIVLGSVLAGATHDYFSGMLSIRHGGESLPEIIGRYLGMTTKQVMRGFTVILMVLVGAVFVAG
GUT_GENOME096283_00227 78-218 IGMIVFALGYRFYSKWIAEKIYRLDPNYVTPAHQFEDGVDFVPTNKLVLWGHHFTSVAGAAPIVGPAIAVYWGWLPAFLWVILGTIFAAGVHDFGTIVLSVRNKGQSVGTITSKIIGSRAKNMFLFIILILVLMVNAVFAW
GUT_GENOME141417_00906 7-149 GVALLIVGYFTYGRYIEKNFQIDENRQTPAEALRDGYDFVPMPKWKNGMIELLNIAGTGPIFGPILGALYGPVAYIWIVLGCIFAGAVHDYMIGMISLRNNGAYLPELASRYLGKSMKHVINIFSMLLLILVATVFVVTPANL
GUT_GENOME238211_00709 1-148 MISFVIVIALICFGLAYKYYGRWLNQYFLTDKDEEMPSIYLRDDMDYFPAPFGMLFAHHFSSIAGAGAIIGPVVAGAMFGWVPALLWIVFGCIFVGGVHDISSLAMSLKHDGLSLAEVVQRTFGRRVHFLILVVFWISLVLVVAAFLD
GUT_GENOME096509_01351 6-137 LLIVSGILLLIAYFTYGKYLEKRLGVDPNRKTPAITIADGVDYVPASKPVLLGHHFATIAGGGPIVGPITAAVFGWIPAVIWILVGSIFLGGVHDYASLQASIRHKAQSIGSIIKEYIGKKGQTLFLIFSIA
GUT_GENOME011213_00455 5-147 IGGILLLVLGYLFYGKLVEKIVGADNRLTPADRISDGVDYIQLPHWKNMLIQLLNIAGVGPVIGVIIGIKFGTITFLIIPIGCIFAGAVHDYMSGMMSLRANGSNLPALVKNNLGSAYASFFAVFMSLLLILVVAVFINIPAN
GUT_GENOME284971_01133 6-144 VATLLLVLGYFVYGTFTEKVFGASSERRTPAVKYNDGLDFVPLPVWKIFLIQFLNIAGLGPVFGAILGAIYGPVCLLWIVFGSILAGGVHDYMTGMFSVRFKGCTMDYLVGKVLNKHFALLFMIFLVIILILVGAVFAL
GUT_GENOME200688_01305 4-152 FLVCLALLVTAYFTYGRYLERLVGIDPAARTPCSRLYDGVDYVPLPRWRIFLIQLLNIAGLGPIFGAVLGAAYGPVAFLWITFGGIFMSAAHDFIAGVISLRHDGASLPETAGVYLGGGMKIVMRLFSAGLMILVGAVFLSQPASLVAA