UHGP-MC 102799


Information


Number of sequences (UHGP-50): 158
Average sequence length: 79±6 aa
Average transmembrane regions: 0.15
Low complexity (%): 6.96
Coiled coils (%): 0
Disordered domains (%): 14.94

Pfam dominant architecture: PF18668
Pfam % dominant architecture: 81.65
Pfam overlap: 0.76
Pfam overlap type: extended

Downloads

Seeds: MC102799.fasta
Seeds (0.60 cdhit): MC102799_cdhit.fasta
MSA: MC102799_msa.fasta
HMM model: MC102799.hmm

Sequences list (filtered 60 P.I.)

Protein Range AA
GUT_GENOME145671_02182 36-144 DRFGVLRKTWHGMEMIFSRFIDYITGRGEQAVAAIGWQELGNWAVGLAVDNRQQIVYYNGSWYKYLGELEHVIAGDSPENDGGVWSAENPTGKWSNIGDAALRSNLGSG
GUT_GENOME224857_00722 212-297 WGYVLIDSFQLGANITTRFQALHWSLPDGNGEYYRWDGALPKIVDAGSIPTTTGGVGIGAWVSVGDASLRVELSSSADGKGSSLIA
GUT_GENOME227607_01065 74-145 GYTRMDSFQAGATLTLPNQILRETTTGLYYRWDGAFPKSVPAGSTPENSGGIGIGAWQLISTLGFEVPVDTD
GUT_GENOME272778_02755 58-144 ISQYGWIPVGTFQAGATLTLPNQILKDTTDGEYYRWDGSFQPSGKVVPNGSTPGATGGVGVGAWISVGDSALRSLLSSTSGAANVGT
GUT_GENOME232182_02273 62-138 FGYINLNGQNFTTGATVNLNELLLNPADNNYYHWTGTFPAGGKIVPPNSTPQSTGGTGPGKWLNVGDTALRGDLARP
GUT_GENOME096419_02899 72-146 AAGYVPAGAFQEGAGVVSRNGTVLWKLPDGDGDYYRWDGDLPKQVPAGSTPQSTGGIGKGAWVSVGDASLRSDII
GUT_GENOME171625_03265 204-281 ATGLWGYTLLDSFELGATLNYRYQALKKTSTGEYFRWDGALPKTVPAGSTPESTGGVGLGAWVSIGDAGLRSQLGTIF
GUT_GENOME131092_01123 73-156 GYITVDSFQQGAQLPDNEITQRNQILRDETTGEYYRWDGDLPKVVPAGSTPESTGGIGKGAWIDIGGANLRTDLKADNGFTFVG
GUT_GENOME127966_01349 63-131 KDSFQKGATLNSVRDEITYEGYRLVWTGDFPKTVAPFSTPQNTGGVGPGAWAYTSDAMLRGNLKSSAGY
GUT_GENOME095444_04134 145-215 TFQDGGELHSVLDRVSDGTYLYYWTGTYPVTVSPGSNINTLGGLGVGFWSVDGDALLRSALAQYFGFGNVG
GUT_GENOME144766_01729 135-199 SFESGATITTRNQALKFNTEDSYYMWNGALPKVIPTASTPSGTGGVGISAWVKLDDATVRFSTGL
GUT_GENOME143501_04622 62-146 YGFTILSGQTFATGATVNVNDILLDESTGEYFQWTGSFPEGGLIVPPGSTPVTSGGEGPGAWLSVGNAALRSMLSGADGRKYIGK
GUT_GENOME231635_03086 44-131 WLKQRYEEAFSGLGWAELGEWAVGLEVTTPSQIVHYQGYWYRYGGSLPHTITGASPSLDDNDNWFNLGNDVSLRANLGSGDGLKWVGK
GUT_GENOME232032_01682 62-135 GFKPVSGSFEKGGLIEEIWQTLLYEANGMFYQWTKELPKRVNQGSTPFKTDGTLVDGWVDRTDLTTRAELVKGQ
GUT_GENOME183862_03985 73-148 VGYITMDSFQDGKTLTLTNQALRWKLPDGDGDYYRWDGALPKVVPPGSTPTTSGGVDKGKWVNVSDGTARSDLAKP
GUT_GENOME147143_04398 317-397 GYSLRDRPESFGKGGTLTFASDVLLDEGSGKAYSSAGPYPRTVEKGTNPSSGGFTDRSKERLREMLASTGGAAMIGSQSGP
GUT_GENOME001883_00528 61-148 YGYVILTGKTFTTGATINNPNEVLLNTTDGEYYKWTGSFASGGKVVPANSTPAGTGGIGPGAWIGVGDASLRAALSSTEDGNGDELIG
GUT_GENOME231992_00443 50-138 DAANKSIAEKVSAVKTFAEGATLESPREEILFDGYRLVWTGEFPKTVLAGSTPQGAGGIGAGRWAYTSDAVIREDIASNEETLGAGMMG
GUT_GENOME231465_02133 58-145 ISSLGYITLDSFEDGNNLTLPNQVLRYEATGEYYRWDGELPKSVAPGSTPETSGGIGPGAWLSVGDASLRTDLASGEKASLVGYGSST
GUT_GENOME096244_03219 78-152 KSTFLNGATLESERDFIWDDNSKSWYYWTGAFSKEVPAASTPESTGGIGVGKWLSVGDASLRSELAQPDGSSMIG
GUT_GENOME064775_01816 75-152 IGDYTSGPLKIEEYNQLIRYNNELYKLTAATDIPFTTAGNTDETWTGTDAAHFVSVGDAALRQNLGSSEEGLGLSLVR
GUT_GENOME231951_02494 63-145 GLNPVGTFQGGAVINSAGDIIQDETTGAWYRWDDLTTLPKTVPPGSTPDSSGGTGVGKWLAVDVSDVLRRELALPTGADQIGY
GUT_GENOME002976_01052 203-276 SFAAGATLTSTKDFIYDQASNAYYYWTGTYGKVVPANSSPEDTGGIGSGAWSVVGDVVLRNSLASTSGASMIGV
GUT_GENOME211898_03243 61-144 FGWILKESFEDGATLTLPNDALLWRSNGEYYRWSGALPKTVPAGSTPESTGGIGQGAWVGISDAALKTMLASSAGASMIGMGAG
GUT_GENOME103766_03612 134-208 TFDTGATITERKQVLLNSDDGYWYRYNGTLPVSVPPNSSPDSSWSAVGYLNGYDIGDVRNWSNNLSDIALNDVNL
GUT_GENOME145119_02233 183-265 YGYNTKRSFEAGNTINYPNDVLLLESEGEYYRWDGPLPKEVPPGSTPDSTGGVAPGAWRGVGDATLRSELASDSGAEIVGTNS
GUT_GENOME231519_02285 178-261 WGYVPAIGSFEKGSLLTQRFEVLLWESTDEYWRWDGAMPKIVLPGSTPDTAGGRGKGKWLDVTDATLRSNLGSSEPGMGASLIA
GUT_GENOME145815_04842 50-126 LKSGYTFRDGAILETENDLIKYGDVLYAWTGAFNKEVPAGSSPESTGGIGEGAWKQVIDSNLRRDIASHDGFGNIGK
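
Each row of the list above carries a protein identifier, a residue range, and the amino-acid fragment for that range. The following is a minimal Python sketch for splitting one row into its three fields and checking that the range width matches the fragment length. It assumes identifiers follow the pattern `GUT_GENOME` + 6 digits + `_` + 5 digits, which is inferred from the rows shown here rather than from any documented specification. The names `ROW`, `parse_row`, and `range_matches_length` are hypothetical helpers introduced for illustration only.

```python
import re

# Assumed row layout: <protein id><start>-<end><sequence>, with or without
# whitespace between the fields. The fixed-width id pattern is an assumption
# inferred from the rows in this list.
ROW = re.compile(r"^(GUT_GENOME\d{6}_\d{5})\s*(\d+)-(\d+)\s*([A-Z]+)$")


def parse_row(line):
    """Split one row into (protein_id, start, end, sequence)."""
    m = ROW.match(line.strip())
    if m is None:
        raise ValueError(f"unrecognised row: {line!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)


def range_matches_length(start, end, seq):
    """True when the inclusive range start..end spans exactly len(seq) residues."""
    return end - start + 1 == len(seq)


row = (
    "GUT_GENOME227607_01065 74-145 "
    "GYTRMDSFQAGATLTLPNQILRETTTGLYYRWDGAFPKSVPAGSTPENSGGIGIGAWQLISTLGFEVPVDTD"
)
protein, start, end, seq = parse_row(row)
print(protein, start, end, len(seq), range_matches_length(start, end, seq))
```

Because the identifier has a fixed width under this assumption, the same pattern also splits rows in which the identifier, range, and sequence run together without separators. A failed length check is a useful signal that a row was split at the wrong position.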