UHGP-MC 7501


Information


Number of sequences (UHGP-50):
82
Average sequence length:
77±4 aa
Average transmembrane regions:
0
Low complexity (%):
1.28
Coiled coils (%):
0.94
Disordered domains (%):
0.08

Pfam dominant architecture:
PF07934
Pfam % dominant architecture:
122
Pfam overlap:
0.56
Pfam overlap type:
reduced

Downloads

Seeds:
MC7501.fasta
Seeds (0.60 cdhit):
MC7501_cdhit.fasta
MSA:
MC7501_msa.fasta
HMM model:
MC7501.hmm

Sequences list (filtered 60 P.I.)

Protein Range AA
GUT_GENOME093114_0024510-87LCRKCIEEDIPEAQLAKYLDEYVEQLPADIRVSAAEFNRRLAICAGCEHLLKYTCRLCGCYAQVRAAKRMNRCPVPYA
GUT_GENOME023437_0039620-100QRICRRCLAKDMPDASYFDNMYEYIRQLDPEIKAEDSLYEERLSFCKECDHLLNGMCRICGCFVEMRAAIRKNHCPDIHGK
GUT_GENOME095707_0003610-80RFAQEFYARDMAEELTSYLESLPPEIKVDRQEHQRRLAICSKCDGNAGGLCRYCGCYIAARTVKKALACPH
GUT_GENOME253442_0173623-106EKKFCRKCLLADMEETDYLLHLKAYIAAYPLEKRVADSVYKARLEQCRSCEHLYQGMCRKCGCFVELRALKPHMDCPAVPPKWM
GUT_GENOME138434_003564-80CKRCLLLQAGGEVTLKEIMSQLNSVEKELRAADETYKNRLSECKKCDNLLAGMCTKCGCYVEYRAGIKNKDCPDCDN
GUT_GENOME107643_00160512-588IPCVRCLLREMDEQTALQQVLDFQRSVPKMEQADPSQYETRLAVCKACKWLNKGTCRKCGTYVEARAFRSDAHCPLG
GUT_GENOME113804_0199821-99VRFCRKCLTSELAEQADTYRTIKEHVDNLDPDVRADQEQYTQRLDTCRQCEWLLEGMCRSCGCYVELRAAVIKNVCPRK
GUT_GENOME204272_0063119-95KVQCRKCLLEDMDENDFLRDMRSHIAAYPADKKVSEEEYRRRLDFCKDCEKLVDGMCVLCGCYVELRALKIGMRCAD
GUT_GENOME134736_009341-72MAKCKRCGLKTVLSEDDIQKMVEQVTSMKSVRLVSSDVYENRFDICQICDDFMYGSTCGVCGCVMQVKRDYR
GUT_GENOME214180_0145717-98RECKKCLLLEAGKDKSYSQIGDLLSLADSAEKVDKSEYIRRLSLCRGCDFLTDGMCVKCGCYVEYRAAFINKNCADYENKKW
GUT_GENOME191958_0143211-87CEKCLFREMSDHEYFRYIQSYLDTMDQRSRVTDMEYGERLDKCKICQYLRNGMCRLCGCFVEIRCASKHRHCPDTTP
GUT_GENOME096550_046031-71MTEESIRRLLASPMFTSDHCVDDEVYAERLRICNGCAKLLDGVNCTVCGCIIPVAAKLKRKSCPLPGGALW
GUT_GENOME265702_000278-82CRRCLLRQADARAYETVAAYVAEIPPDQRADDALYEKRLAACLQCSYLSAGLCTKCGCYAEMRAAFREKSCADYD
GUT_GENOME028374_01264154-230VMMQDMADENDYYASVVRYRATMPRRVRTPDAEYTARLEHCRTCPSLNAGTCMQCGCYVEMRAARTDMHCPLDNAPW
GUT_GENOME149690_010478-85NLRVCKKCLTRDMMDKADYFKNMYDYIENLDQDIKTEDSLYEKRLATCKECERLADGMCRGCGCFVEMRGAGGSWAAV
GUT_GENOME188397_007106-86RVCRKCLPGQEKEAAYFEKLTRYIQRMDESVKVDQRTYESRLAVCAECEKQLNGMCRLCGCFVELRAAQRARKCPDIPARW
GUT_GENOME105900_0185312-94GRVCRRCLLEELGEGDYLESVRRYRARLPEKERTPDDEYEARLSACKDCAELVNATCNLCGCYVEIRAARRSSSCPAIPPRWS
GUT_GENOME141041_031213-68KGQIDIDFYIENDRVVSDELYEKRVAICMECSALLNNETCRFSGELVTYRAMIMEKGCPFPEIRKW
GUT_GENOME089759_007634-74RVCRRCLLRELDGEYFQSIYQYIQNLPAEWKVDKETYAARLERCRECPHLINGMCELCGCFVEVRAAKKSS
GUT_GENOME239690_0276810-89CKGCERDIALSEEQIARILNNMRPKMECVNDEVYEARLLACAQCEELMSGHTCGISGSIVRIRALAAAQNCPSYHGSRWQ
GUT_GENOME158607_0134213-94VLCRKCLEIPVGERELAEALDRYLEKLPAEVRVPPEVYEARLALCADCPHRLLYTCVKCGCYVQARAAKRGTVCPGEDRKWE
GUT_GENOME249693_0170610-90PPCRRCLLSDDVNRAAYATIADYIASLDPERKVDEAAYARRLAACRTCEHLHNALCAKCGCYVEVRCIKRALTCPDVPPRW
GUT_GENOME209602_0259522-98KYPCRKCLLREMDQNAYMESLYAYIARLEPEIKADEAVYEERLGICKGCDYLKEGLCGACGCFVELRAVIAKNVCPY
GUT_GENOME063562_010124-80CKRCSLKTVLTDEDVAPMVREVISSGMSLVDEDIYSSRLSKCLECDKLAYGTTCMLCGAVVQVRCLLERGKCPFPKE
GUT_GENOME083078_0049814-94PFCRKCLLSESGFAAEYARVADMVEALPPEKRVAEDEYRRRLELCRGCGQLGAGVCGECGCFVELRAAKRAMHCPAAERKW
GUT_GENOME115089_003934-83KPFCVRCLIREMTDRDALAQIRKYQSQVPDHEKADDELYEERLTVCKECKYLHLGTCLKCGGYVEARAYRTAQHCPLGER
GUT_GENOME043644_002504-83NCKRCLLFEASQNVTYNQIQDYIKTIDASEKVSDEVYSHRLSMCKKCDNLISGICLKCGCYVEVRAILKNKNCPNFDDKK
GUT_GENOME200667_0191423-98ICKKCLLREMAEADAAMIKKYVDAIKGEDRVPEKEYEARLAVCKACDRLNAGTCNACGCYVELRALTGVSHCPHKK
GUT_GENOME103071_005895-78RCRKCLFNQDSGELYQSIQELIASLSPEEKAGEEEYRRRLDVCQECEKLQNGMCLSCGCFVQVRAAKGKERCPW
GUT_GENOME265353_0123013-92QRFCKKCLLRDMDEDAYFLDLQSYISHLDEEVKVPEEEYERRLAVCSGCDWLLNGMCRKCGCFVELRAVMKKNHCADSDK
GUT_GENOME198940_0045531-112RFCRKCLTREMDQTEYFQNLHEYIAHLDKDLKVDDTVYEARLAHCKTCDDLYQGMCRICGCYVELRAAMKKNRCPQVHPRWN
GUT_GENOME117117_018288-86PVCKRCLLAEIDVDGLYKKVSELIELMPEDVKAPDEIYQKRLNICRSCESLENGICRECGCFVELRAASKKNYCPSVYK
GUT_GENOME200288_0588315-85DVKISDAKMARLVEIASRSRPTVQDEEYERRLSICSACPGLQYGTTCRHCGCLVQVRAKLSESTCPFPYES
GUT_GENOME001056_008334-72RCKRCLLREMANEDMYHRIQRTIDAIPPKLRCSKGEYDARLELCKECEKLIGGMCRVCGCFVEVRAAKK
GUT_GENOME205814_016823-82VCKRCLLSRMSAEAFQNIKENIELIPAEKKADEALYKKRLEICTECDCLINGMCSKCGCFVEMRAAFAINRCPHEDKLWE
GUT_GENOME236224_000216-87PFCRRCLLEDMPSQAALAASIRELIALLPEGKRAPREEAERRLKQCRACDQLRSGMCALCGCYVELRAAKARMGCPALPPRW
GUT_GENOME097609_011903-83HVCKKCLLLEAGEKASFEGVKSYLETIDSSLKVSPEVYKKRLEYCKNCDSLIAGMCIKCGCYCELRAALKNKACADYDNRK
GUT_GENOME216102_0229711-79ANLQKDMMTESIAAYVASLPAESIVDDDEYERRLAICSRCDDLVGGLTCSHCGCFVLARARKKQMDCPM
GUT_GENOME096435_006813-82CKGCYADNVMSIHDVKLLVEEQLALETDLVDDAVYKERVASCMTCPSLLNQTTCSLCGCFIHFRAKIAYKHCPHPTDFKW
GUT_GENOME218569_012265-84CPECELRAIQSSITKESLEREVSAMKYVQGITAPENQYKNRLDICEKCSALVSQIMCSECGAYVLFRAKNKKSSCPRAKW
GUT_GENOME170002_006503-78PFCKRCLLSEYDEKFYAETVSDYIAHIPEDKKCGSEEYSQRLDICKSCDELSNGMCGECGCFVEVRAAKSHMGCPI
GUT_GENOME206726_021784-83CRRCQWTKEMFDKELLKLEEYIERIPEEERVSDEEYERRLMLCDTCSFMRGGMCGRCGCYVALRAAKRQQYCPDVHKKWW
GUT_GENOME042510_006345-82RVCLRCLLRESGGADTLEDIRKRIEKIHAYDKADDEEYIRRLGICKECEALVSGTCMKCGCYPEFKAAFIRQRCPQKK
GUT_GENOME038385_004395-82KIPCIRCLLRDLDQKAALKQVTAYCERLPVQQRAKKELFEERLSICRDCKWLRLGVCQKSGYYVEASAYDKTKHCPLG
GUT_GENOME240865_016066-82RICKKCLLREMDEAGFFQNMYDYIARIPADDKAPEEEYERRLSICKECEKLLSGMCRMCGCYVEMRAAIALRDCPGK
GUT_GENOME234096_0107112-89KLRCVRCLLNAIDDKTALEIVNKYKADTPESDRALPDVYRQRLDTCLACKYLERGVCLKCGYYVEARMYRECGTCPLK
GUT_GENOME096513_026136-86CKGCREEYKVTEEQIQRILASPAFAPERCVPDEIYAQRLELCRACPKLQGGTTCLACGCIIPVVAKLKERECPLPGQRKWN
GUT_GENOME066069_012883-76CKGCDVKETAATVDVEALIEEQLAIENDLAALEVVQSRIKICEACPFRSNHTCTKCGCFYKFRANLSKKYCPAG
GUT_GENOME260024_014553-79CRICEELGWSVDEHVAEALRKLEERPQQLAEEKEYGRRLEICAQCKEKLPDGTCRMCGCYVVLRARLKTARCPYRDR
GUT_GENOME088986_0007310-86PCQRTGLSDREYEVILKNYLENMDEDMKTPEPAYRERMSSCSRCEFLRNGICRLCGCFAALRAAKTHNHCADTPSRW
GUT_GENOME233266_004609-83CTRCLLEEAGNRDLAQIIAERIAVMPAAEKADERLYRQRLEICLKCDALNGGTCGKCGCYVELRAARIKGYCPAA
GUT_GENOME113647_001785-81RLCETVFPTPAQLEAYLEAYVADLPDQERADETVMARRLALCRTCPHLRIATCSLCGCYVQARCAKVRLRCPATPPR
GUT_GENOME207750_012813-76CIGCELKEQVSQMDVDTLVQEQLSFETDLAMEEVRDQRLQICHQCEQLNQHTCGRCGCFVRFRVSLKQKSCPDR
GUT_GENOME099498_004806-78RCLLGDFPEGKELAELIADYVASLPEEFRAAPEEISRRLNVCRDCAELFDGTCRPQIVILFFVQQQGPLRQGG
GUT_GENOME134205_014426-81KPCLKCLLRDLDEEAYMKQLHRYIVQLDPDVKTAQQVYEKRLELCKACDYLEAGTCLACGCYVELRAAVKKNRCPY
GUT_GENOME033489_0215212-88ERYCRKCELAKQYGGGLEEYLLRYLAQLTPEEQAEDVTYARRLACCGKCTWYGEHMCRACGCYVQLRAAVKGQNVPM
GUT_GENOME096866_0154323-106RPPCRRCLLEDMTTEKKLYVTVREYLDAVPQNVKTPDAVYRSRLDACRRCQNLQNGMCRLCGCFVEMRAVKDANTCPDTPPHWR
GUT_GENOME258515_0068010-88CRRCAAQELPEPELLRYLDEYVSSLPEEMRASDETYARRLAACAVCPHRTRYTCTLCGCYVQARAAKRAMACPLPGAPR
GUT_GENOME251254_02371493-565CRCTLLESGQADMAKLVRDYVDSLSADEKTDEATYAARLNICRTCDDLHSGTCALCGCYVEARAAKKRQGCPK
GUT_GENOME066712_020243-83QDCKRCNIKTVLSPDDIERMVREVEAMKGVRLAEKEVYDARIAVCMACGQFEYGSTCMRCGCVMQVRARLLEGRCPEKKWE
GUT_GENOME142591_017996-89DGCKGCADSVRVSPDKLERLIAVALQGREAAAEAEAARRLAACRACSGLQYGTTCRYCGCLVDVRARLRGSACPHPGASRWRSE
GUT_GENOME241159_024757-88CKGCSASVNVTSEDIKAMILSIINSGNFKLVSDETYSKRIQKCANCKYIEYNTTCRQCGCIVQIRALQQEKDCPYPKNSMWE
GUT_GENOME255629_0069623-98CFRCLLSETDHTLYETVRAYIDSLSPDLRVPDALYRERLAACTSCKNLINGLCGLCGCFVEARAAKTGSRCPAAAP
GUT_GENOME135970_016717-82SSSLKTATGLAKSMAQIASDGFATVSDTELKRRKKICSECIYWDECGNLGAGKCRVCGCTPVKFKFASQHCPRKKF
GUT_GENOME108999_005824-84RECKRCLLKDMDTQEYYRTVSEYVESLDESLRAPKEEYERRLSLCRECDCLINGMCRLCGCFVEARAAKTSSYCPGTPRRW