UHGP-MC 3723


Information


Number of sequences (UHGP-50):
50
Average sequence length:
104±6 aa
Average transmembrane regions:
0.04
Low complexity (%):
5.03
Coiled coils (%):
0
Disordered domains (%):
4.18

Pfam dominant architecture:
PF01704
Pfam % dominant architecture:
8000
Pfam overlap:
0.16
Pfam overlap type:
shifted

Downloads

Seeds:
MC3723.fasta
Seeds (0.60 cdhit):
MC3723_cdhit.fasta
MSA:
MC3723_msa.fasta
HMM model:
MC3723.hmm

Sequences list (filtered 60 P.I.)

Protein Range AA
GUT_GENOME139886_000448-112KIQETLQKYGQEQLLVNYDKLDEQKKKTLLEEIKTIDFAQMEKLYENVGKTMESEDVKIDPIPYIDKAKIEDKERNIYFEKGANEIRNGKLAIVTMAGGQGTRLR
GUT_GENOME015135_009787-117AKKILEKYNQQQIISIADKMPKEKQEMLAEQVLKIDFEELKELYEKTFENIYVDLEQLEPIRGVNPNKLPKEKIEEYEQIASKIIKQNKFAVATMAGGQGTRLRTPTSKRY
GUT_GENOME095161_0034612-104LFKYNQEYVLPYFKKMNIMEKKNFINQINMVPWKKCLKALEDKNVLKNPLPIEYEDINTFSIDDRRKLEKIGEDNINRYAVLLLAGGQGTRLG
GUT_GENOME040496_012132-103EDKIIEKLNQMGQQHITKRLDNNENLRGQICKIDFDEIMKLYKGTKKNTPILEKYFEGIGYVEKDKLSMEEYGELEKIGNEVIKNGKYAVVTMAGGQGTRLG
GUT_GENOME170405_0068012-118LLENYGQEHIIEVLNQKNGEEKEKLIEQILSISFEQLADLYQKAKQIPELENDKIENIPYVDGQKLSKEDRENYEKIGTQEIKNGHYAVVTMAGGQRNKIRASWTKR
GUT_GENOME230048_0035118-118EAKQYLTAHGQEHLLRFYDTLTVSQQQHLLEQISALRLERPQFQDRNALVQKRGKFAPLGALTQEEMQNKKAEYRKIGIQAIRNGKVGAVLLAGGQGSRLG
GUT_GENOME018091_012845-110QATALLKKNGQEQVLRFWKKLSAKERKALLAQIESIDFKEVARCQAMLPGSGVAAETVKKGRPTAPKVAELKGAALKKAVAAGERELAAGHVGVLLVAGGQGSRLG
GUT_GENOME021899_000254-110NFKKAEDILTKYHQEHLLQFYDELSNEQQEYLVNQILSINFDEIINLYKKSKLTSTISTEYIEPISYYDKLNFTPNDIDIYNSIGQNCIKDGKIAVVTLAGGQGSRL
GUT_GENOME243949_002881-111MDSNENEIRRKLKKYNQEHLLTNYDNLDEEGKLKLLEQIEKIDFEQMQKLYNQAISEVDFENIEIEPVDYVDKSKMTNSEKEQYLEKGIESIKDKELAVVTMAGGQGTRLG
GUT_GENOME163073_0107549-144LNTHGQEQVAERLQALSGSQQEKLASQVSKIDWSVVESIANKESETGEKSVEPLTAVEQKDIETNKDKYSKAGLDAIKAGKVAAVLLAGGQGTRLG
GUT_GENOME103750_025315-112LNKVNKLLKEYNQEHLLKFYDELNEDDKELLLDEILEIDFELIKNTYKKIGDANNSNTKNEFNPMIAKVLNMYSKEEKENFYNMGIDALKDNKVAVCLVAGGQGTRLG
GUT_GENOME159409_013311-111MDNKLTKAKEILHHYNQEHLLYFYDELSEEQKELLLNQILGIDFNKILTLYKNSFKSTKLDLNTVSPLPHIEKDKISKEALEHYIAIGENIIKSGFLAIVTMAGGQGTRLG
GUT_GENOME057850_0061212-128MENELKSLMAEFDSAGQSQVFKYFGELGESQRQALLEQLSKVDLKELESLVDTLVLKESKADKLDAENLKPAAFIPLAKSKDSDPEWLNAKKVGEAAIRRGELAAFVVAGGQGTRLG
GUT_GENOME207133_000747-111EAIEILEKYNQHHVVKHMEKLDSNKKNEVIKQIEDINFEEMKDLYDKTKTNRNKKNCDIKPIKTIIADSIEQKQKNEYIELGEKVLKEGKLAVVTMAGGQGTRLR
GUT_GENOME283107_005014-105QTAYQKLEPYHQTHLLRFFDELTPPQQQHLLHQIEQLDLEPFAYLEHGKADAARGTLQPLGACTIEQIEAHRETYQKIGLDAIRAGKVGCVLLAGGQGSRLG
GUT_GENOME069882_006504-109LEEMKEYLKKENAEYLLRFYDELSEIEKAELFKQIEHIDFELMKKLYDSRNIPPIKNKKIENIAHEILEKIPQEERKKYCERGEELLRSNKVAVCQMAGGQGTRLG
GUT_GENOME018655_0102311-109LKRHNQEQLLNFYNELDEKEKADLLNQIDKINLEQVDELYAHRKDIPDANKKIENIPVVKKEKLSDGEKEEYSKIGEKIIKDKRIAVCQMAGGQGTRLG
GUT_GENOME096388_003663-93QWIPQYEKLREASKEKIDAQLNELDLERVKSVYEDVYVNQTAFDTSNVEEVPFVTEEELDVEVLERLAHTSVEDGKVAVLLMAGGQGTRLG
GUT_GENOME096269_028791-109MQTKLNEVRNKLQAYGQEHLLAFWPSLSDDSREKLLQQIDSIDFDLLQQLSGKAGKKEKPDSIEPISAEKWTEFDKSRQAELEQLGWSLLKQGKAAVLVVAGGQGTRLG
GUT_GENOME247621_0061019-113LANFGQQHLLKYYDELDEAQRAALLNQIEQIDFSVVSTNKQPEKRGRITPIGVTELAEIRTRGDEFFDAGMAELRAGHVGAVLLAGGMGTRLGSD
GUT_GENOME235600_002814-107ELHSKLKTFGQEHLLTGIETLTPSARQTYLAALNRLDLPYLTSLYRHADVSPTLSGTISPCRSVAADQLTPDESAAYLACGRQLIHDGAVAVVTMAGGQGTRLG
GUT_GENOME254158_013791-111MNEKYEEAKKLVKKYNQEHLLKFYDELDEEKKKELLDQILSIDFELMEKLYKDATKPLDLKDVTVEPIDYVDKSKLTATETKAYEEKGIEAIKANKFAVVTMAGGQGTRLG
GUT_GENOME051033_011873-109ILEKYNQEQLERFYKDLSKTEQSKLQKEIENIDFEQINSLYINSKKDEVIELKEIEPIKYYIKRKLSKSIIEEYSNLAKEILRKNKLCVITMAGGQGSRLGVNGPKG
GUT_GENOME272779_0026023-130MKIWEDDNQKYIKAMMEKNTTEQNGELTKRLEKIDFSVLEHIERKETVNERGVFAPLDAVEVGEIEARGEEFKETGLKAIREGKVGAVLLAGGQGTRLGLDRPKGTLN
GUT_GENOME095216_0121411-115LAPVHQEQLLRFWPELSPAQQEKLAGQLDEIDFNEMDQLIREYVLVRPKTRIPDDLGPAPYFPLVPKDEAQRALYEKAEARGAELLRAGKVSCLTVAGGQGTRLG
GUT_GENOME137644_010312-110EKMELAEQKLRKYHQEHILPFLQNSNEQKKEELIDQILQMDIEEIMHLYEQALQPIQYQNTTIEPVVSKNKETLSVEEKKTYTKLGEEVIKRGEYAVATMAGGQGTRLG
GUT_GENOME139644_011218-108LLKYNQKKVLEVYNSLCSSDKKTLENELSKVDFEKMTKLFSLTNNVTVSDISDISPLENVYVKYSLQENILNKWKSIGDSVIKDSHYAVVTMAGGQGTRLG
GUT_GENOME025996_013593-114EKYEKAKKLTEQYGQEHLLAFYGELTEAQQDRLLEQILSIDFLLVRKLYEHAKAEEAGETDTQGEITPIGCTEKALLSAEELKQYEEAGMAAMKAGLFAAVTMAGGQGTRLG
GUT_GENOME049930_008159-111KETLIKYNQEHLLAFWDELDEFQKQQLLTQLSHINFKKIDSLYKTLKNIPIPEGELIEPLDYYIKNEISTKERRHLENQGTEILKSGHYAVITMAGGQGTRLG
GUT_GENOME048409_0154964-162VQKLLEEKGQLHLLRYYDELKSDEQQALLSQIDQIDFSLIDMIGKNNSGNDSDIAPVAALQLDAINANHDTYLNAGIDTIKNGDLALVLLAGGQGTRLG
GUT_GENOME178871_004826-103IMQNLNSSKQTALIAYLKNTDLTTRKKIEQQLDTIDWHIFSQKNVNQNTLSNIAPMPITTQSSIKSKYTYYKNLGLNHMKEGKLTLVLLAGGQGSRLG
GUT_GENOME026080_013854-113EKKLAKAKDILKKNNQEKILMFLNKLDDGKKESLAEQILNLDFEQLNRLYAETKTEPEILEKKIEHTKYVDEYKISEEIREKYTKLGENVIKNNQYAVVTMAGGQGTRLG
GUT_GENOME026445_0123231-123KSRQEHILSYYETLDDTQKEKLKEQLSRMDLSVLAGLKHQAAEERGTFMPLGALTIDEIRSKEKFYMVQGLNALQNGKIGAVLLAGGQGTRLG
GUT_GENOME170043_0071514-106YNQEHILEEMKEMNNEQKNILFEQIDKIDFEKMKKLYNTVHKVEENEKITPIESINKKNLKDIETIEQKGIDIIKNDEYAVITMAGGQGTRLG
GUT_GENOME197150_000074-104KERKNILEGKQQVHLLRFLDQLNDDEKQILFSQIDAVDWGFEQAFHNDKKKEDTMIEEIDALHLDKIQADSERYRSIGLQAIKEGKLCLLLLAGGQGTRLG
GUT_GENOME064302_005796-106KAIKILEKYNITTIKELLEKYKNEELINQILSIDFEQIALINNNIKKEKTFSNDKIENIPYTNPCSVSFRELQHYTQIGEDIIKNGHYAVVTMAGGMRNKA
GUT_GENOME234281_0001426-141RHTKEQLLALLAPVGQEHLLRFWNEISDADRDLLGDQIESINWAEVLGWLDAIMKNGEAEQIPFEKLVPAPYVPLKPENESQALRQKGAIADGEELLRGGKVAGFTVAGGQGTRLG
GUT_GENOME118115_0063214-121KKQEAIDILKEYDQEHIIRLLEKLDSKKQEELIEQIQKIDFHQMLELYNNTKRKIEFKESKIEPLKYLDKAKLTEQQHEKFDKLGAKVVENGQYAVVTMAGGQGTRLG
GUT_GENOME096373_021847-108LEKYGQGHLNEYEKLMSTNEKERLNQKIDTLDLEAIHSLYEQVYVNRQTIKDVSSVQEINYEVKAQLQEETIELYKSKGIDAIREGQFAVLLLAGGQGTRLG
GUT_GENOME274349_007203-116KYRSVVKLLKKYNQEQVLAFYNTLPKEEKELLLDQVLNIDFEKLISLYNEFQKDVSYTDAVITPLEHVEETKLSSYIKSKYVKKGEEIIENGGLAVITLAGGQGTRLGLSLPKG
GUT_GENOME025996_000614-109KVQEITEKYGQQQLLLYWDTLSEQEKEQLTEDIFNIDFPLMDSLYKNVGSGAVTAGDISISPMPCTDATSMTEQEKERLCRKGMGILMKGKMAALTMAGGQGSRLG
GUT_GENOME119794_001137-98KIDNSHLLSYQNELSKEEYWQLKKEIFQIPFHHLRKIYQNSYKDEEYSAKDITPLKYYEQEKIDSFYSEIGSNYYNSYAVLTLSGGSGSRFG
GUT_GENOME042567_0022314-109CEQAGQQHLLAYYEELTKEDQEKLLAQIEKLDITLLDLLKKESKEVEKGKLEPLGAVTLDEIQKEKDQYRKMGCEAIRAGKVGAVLLAGGQGTRLG
GUT_GENOME265702_002203-109QEILEQVEQYKQTHILKFFNELSDEEKSGLIRQIERIDFSKILKPDDNVNTKDRAISPINVLDTETIQESYAAFMQIGMEALKSGSVGLVLLSGGQGSRLGFQQSKG