UHGP-MC 107408


Information


Number of sequences (UHGP-50):
83
Average sequence length:
114±22 aa
Average transmembrane regions:
0.02
Low complexity (%):
10.54
Coiled coils (%):
0
Disordered domains (%):
1.57

Pfam dominant architecture:
PF03389
Pfam % dominant architecture:
120
Pfam overlap:
0.34
Pfam overlap type:
shifted

Downloads

Seeds:
MC107408.fasta
Seeds (0.60 cdhit):
MC107408_cdhit.fasta
MSA:
MC107408_msa.fasta
HMM model:
MC107408.hmm

Sequences list (filtered 60 P.I.)

Protein Range AA
GUT_GENOME231259_023621-150MIYRKKILTGLFLFVTLILVQAMPVLAASITEADVQSVGKETAAGNVLVWFLCAVAFLKISQKIDSFMSSLGINVGHTGGSMMAELMIAAKGLTTAKNMAGGAMFRGGSFHGGSSAQHMSSASFMSGGLAGAVSRQFAQGAMQSATGQGG
GUT_GENOME254321_01692189-295MFASMCLMMVMNIIFLKLLISAMGYVPSGLGVLPWMLLIVGIARVARKIDSVVARIGLNPAITGDGLGRGLPGMVAFAAIRGLGMAVTRSASAASKGSGGAKPHGGH
GUT_GENOME078723_0134687-243LKKILFLALMVAALSCLFCQPAFAISEEEVQAQVDAVGKEAVSGNVLIWFMCAIGFLKVSQKIDSFMASLGVNVGHTGGSMLAEAMIAARGFSGFKSFASNHFAGGRSSHSSHVRANGGGKGGAGFGAGFASGGLAGVIKRNVTNNAVKTATTPPDA
GUT_GENOME009394_00038417-560AVVYCITMWCINITLFLIEKAPDKCAGMNNTLEEIVYFFIIIGFLKIAQKIDSYLKDMGMTVAGTGANLGRAALESMTLGYGMFRMGTSAYRMGKGALRTSGALHGSLNNTLFNATNGQAGKPSKGAKNEATLTQSANYAKRKI
GUT_GENOME025908_008581-96MNKKLKIVLLCTFMLCTVFAVNCYAITDADVKNAVDAGSKESVSGNLFIWFLCAIAFMKVSQKIDSFMNSLGISVGRTGGSMMGEAMIVAKTVGSA
GUT_GENOME096793_020613-136KKLVAVGGSSLMLSLICCTQAFAIEESDVESAIAASSNEAVAGNIFIWFLCSVAFLKISQKIDSFLAGLGINVGRTGGSMLGELLIAGRSLGTVAGGLGSAVGNIFNRNHANGNTTTNQAAGQAFTGGGNSLIG
GUT_GENOME095705_02760198-283MFGSMCVVMVSNVFFLKVILSALSTIPNTLTIIPWLIFVVGLCKVARRIDDILCRLGLNAAHTGRERMFPGAVTAMFLRSAANTVR
GUT_GENOME260268_00865199-315MFGSMCLLMATNVMFVKMLLSVLSYYPSGLDVLPWMVLVITIVKVAKKADSILARIGLNPAMTGDPLGRGFPGAMTMMVVRSLVSNAAHTIGRNSGQQRSGSGNPKPNAPTGPRTSG
GUT_GENOME021981_00404242-336VGCQLFLMICNVFFFKMFIYGFSRFNETMAAMSGINGGETTGMMIVWVLILFGILYVGQRVDAYMSTLGLNAAQTGRGMGAALVASALGAGRAIQ
GUT_GENOME185468_00586233-351MLVSELMILALNVWFVVVFRNAIIVNSMIAGEYEVNGHTVGSGILWCFIAIAFLKTAQKIDSHIATLGLTTAQLSSGIGATLLATGMGIGHMARGVSNGIKNARNAMSGRELAAAKKFD
GUT_GENOME074108_010959-143KPFIFATALFFILSCLGVVPAAALTEDEVQQQVATEGSAAVTGNIFIWFLCSIAFLKVSQKVDSFMSSLGINVGHTGGSMLGEAMVAMRGIGLAAKGITGKSFGSSGGGGGSASADSSGNVAFAGGLAGIVSRKF
GUT_GENOME046130_00151194-298GWARMYGSMCFMMVSSVIFLKMLISALGYVPAGLDAIPWAILIVAIARVARKVDEIITRIGLNPAITGSGLGRNLPGMLAYTVVRGMTSTISHTVGRSMGGGATQ
GUT_GENOME231183_023945-145RKIFSCVVCALTLATVLAMPAFAVSESDVQAAVAANGREAVSGNLFVWMLCAIAFLKASQKIDSFLSSLGINVGRTGGSLLGETAIALRALTMHGSRGGAAAHAAAPGSSAAPGSSRFLQGGLWGASSRQLTRAAVQSATG
GUT_GENOME136708_00224199-308MYGSMCLLMVMNVVFVKMLLSILSFHPSGLAVLPWIVMVLTVVKVAKKSDAIITRIGLNPAITGDSLGRSFPGVLTYTVARTMASRVVQTAGKNTAAGKTGAQTQTKSGS
GUT_GENOME001016_00708295-432LMVMNVFFIALFLKSVSSFSTSIKTIGENNNSTAKIAIVITWCIVEFALLYVAGQFDSYLNTMGFSTAETGAGMMASMVMDAIDIGTINPLKGRKGKAGGFIAKRREKNAESSPRTPSLSGPLSRLRRNSPIKRKEGQ
GUT_GENOME024418_01507208-311KAWFQMVISTVLLICFDVLFIRGANSAFAAFAVKGATTDSGDGFILLWVIAIIAFLKVGSNIDFQLKNMGLSAPQTGGRLLNSMIIDGYGISRIGKGVAQSFTG
GUT_GENOME094508_00744200-293YASMILLLISNVLFLKLILSAMGTMPTGLMVLPWTVLIVGLAKTARKADTLLSKIGLNPTFTGDPLDHGTGRFVAMLAARSVINSAMHTSGAKA
GUT_GENOME012310_0049712-144SVFAVIFSVSVFAVIFSVSVFAITENEVQNEVSRIGREGVTGNIFVWFLCAVAFLKVSQKIDSFMSSLGINVGHTGGSMLAEALITAKSIGSTFRGGGKSVSAGRSNGASVNTAAFKGGLSGMVGRGITGNAV
GUT_GENOME028199_00949211-300MFLGQCLLMLLNVWSVKMLMSILANGQSDVFLRFILAIAFCRVAQKFDTYLQSMGINAAHTGGSLADDLLALGGTLKSTAGGIITGAGNK
GUT_GENOME089404_01331242-345LQMFWSQCVLLILNIWVVGIARTALNNGLFGASNTEMVKWGLITYAFLKIAQRLDDMMQTAGLKITRTTGLDPISEASGVLRSIGNVFGGVASVAGHVAGVGKN
GUT_GENOME197990_014565-156MRKRFFLNCLLILTLVVVMSTSAFAISESDVESAVSASGKEAVSGNVLVWFLCAIAFLKVSQKIDSFLSSLGLNVGHTGGSMLSEAMIAMRAINTATSAVGSALGSRSRHGSAPASGKSGSGSAAAAGFFSGGLVGMASRKIASDAVRTATT
GUT_GENOME025789_00911269-374LLNIVFLYAYLSAVAYANANGKITLDFAQDALSGNNGVLVWYWATLAVLKVGRQIDQYLSSLGMSVASTASNLGNEALMAAGTMMAGLRTAGAGRQLVDKAARKAG
GUT_GENOME060255_00914218-338ELLILVLNLWFMVIFQSAIIDNPMEKSVTVNGYTCGGILWCFIALAFLKTAQSIDSHIAALGLTTAQLGQGVANTLLATGAGLRRGARAAQHAFGLNPATVFAGNASNGTTRTQRKAAQGL
GUT_GENOME222479_01247123-228GWCRMYGSMLVMMIMNIVFLKLIMSAMSQMAAGGVLIWLVFVVALTRVARKIDSHIGKIGLNPAQTGSGIGSRLPGMMTMMAVKVMSSTVSRSLAGAKGNTGKNGS
GUT_GENOME141051_03862210-320MVGSQLLLLVMNVWFLRGFNSSMGQYIGNGGALSTGQGSIFLWLFCALAFLKTAQKFDSYLAAMGLNVAQTGSSMGMELMMAARVISGVGGGVRNAGSMFHSTSTATGTGA
GUT_GENOME255591_0119613-156TRICVIVLILVCVCTLPALAISEADVENKVAAVGREQVTGSVLIWFLCAVAFLKVSQKIDSFMSSLGVNVGHTGGSMLAEVMIAAKTVSSVASGAGRFFGGRSHRGAGASGSKGADGSPGFLRGGLAGVVSRKITNDAVRSATS
GUT_GENOME104393_015022-145KKTLFTLLTAISLAFLYCPAVAVFALIKDETAAAVGAQGKDAVSGNLFLWLLCAIAFLKVVTKLDGILHSLSIGVSRSPGSMLSEVLLAFRGFEIGQAFMGLGLAKAAAATTNTKTTPGNIFASGLSGMVSRHVQQSAASSISW
GUT_GENOME096140_008266-153LFLFAALAACMVLFFSVPALAAKLTEADVEQAVASQGKETVTGNVFIWFLCAIAFLKVSQKIDSFMASLGINVGNTGGNMMAELMIAGKSLSAAVSSHGGSIGRSLGGGYQKTASPGAAAVGDSFLSGGLAGAVGRQVERSAVNAATG
GUT_GENOME001016_00709222-338WVRMVGSQLFLMLCNVIFFRLFMMGLGSYDGLIETYNQQVEAKAGIMASYNKGTVVIVWVLMMHGILAIATRVDSYLNTLGLSAAQTGRGLAGALVAAGMGVRRTVSSIKSGAGKAY
GUT_GENOME122439_010605-149MKYITGIVLSVLVICALTIPAFALTESDVQSQVSASGKEGVAGNLFVWFLCAVAFLKISQKIDSFMQGLGINVGHTGGSMLAEVMLAARSIAAARGVAGRGRSGNAGRAGSTGGSSGGGDSNTFLQGGLAGVVNRSFYNGAYKSA
GUT_GENOME260354_000302-148KRRYKILILMIVVIAMVSFTTIPAFALTESEVQDQVNAVGKEAVTGNVFIWFLCAIAFLKVSQKIDSFMSSLGINVGHTGGSMMAELLIAARGIGAAKSFAGRNGGGGGSRSGSGSSSGGGSTFMRGGLSGVISRKFTNDATKRATG
GUT_GENOME030650_01325181-278GWLRMFASMCTLMALNVMFVKMLLSAMSNSPTGVAIVPWVMLITGIVRTAKKIDSIILRIGLNPASTGDPLGHHHIPGMLSALMFHHAAEFIKNTISN