UHGP-MC 3373


Information


Number of sequences (UHGP-50):
182
Average sequence length:
90±12 aa
Average transmembrane regions:
0.05
Low complexity (%):
10.36
Coiled coils (%):
0
Disordered domains (%):
4.73

Pfam dominant architecture:
PF13786
Pfam % dominant architecture:
1703
Pfam overlap:
0.37
Pfam overlap type:
shifted

Downloads

Seeds:
MC3373.fasta
Seeds (0.60 cdhit):
MC3373_cdhit.fasta
MSA:
MC3373_msa.fasta
HMM model:
MC3373.hmm

Sequences list (filtered 60 P.I.)

Protein Range AA
GUT_GENOME112042_007892-96IDKSNYDNIRIPDKLDSVVHSAIDEALSSKKRFTVLKKVLCTAAALFAAAVSMLNLSPSFAQAVSSIPVISDLCSVFTFREYHFEDDIKYIDVKI
GUT_GENOME253903_006939-86QTQYKDVPIPEELEIMIERNLQHRPHKKKRKLPIFLISAAAAAVLFTVGLNTSPTMAKTLADVPVIGPVVQVLTFTKY
GUT_GENOME188398_0152710-118KELETLKREYESIPIPEGLEFRVRSSIAQAKNAENTENAQNTQSTQARESHPAKKHRYLRKVFGTCAAAMLTVVVLANSGAGVAHAMEQVPLLGLITKVVTFRTYEDQR
GUT_GENOME252160_0090623-108VKNRIETVLSGLPDTPPVSCRIKKRIFPQIAAAAASLAFLTLFLLPNVSKVYAHAVENIPIIGDIVKVFTVKNYSYSDKFHEMSID
GUT_GENOME137556_00367158-246KLENLKEQYNNIEIPTELDSIISTAIDKKPNILSVYFRKISVVAASIIVIFTSVVNISPAFAQSMSNIPVVNSIVKLVNFKTYTAKNGN
GUT_GENOME096270_0254413-82NPQIPKSVDQRINETLRSLPNKKRVPRFYYPVAAVILVGFLLFGVSFMSPTLAETLKSVPVIGSIFKMVG
GUT_GENOME000202_020621-95MKNMKEKYEDIRIPNRLDQVVNETIRNFEREKKLNKNMMIMKRLGIGTTAAAMALTLGINTNQVFAENLESIPVVGGIVKVINFQNYTVNDGNKN
GUT_GENOME171367_026958-94LKKVYSDIKIPNALDDTISNSILRGKNYMKKNNRNKKRIINSFISVAAAVGIFTLSVNISPSFANSLEGIPVLGTLVNILQFNNGSS
GUT_GENOME009994_007707-99LGQAREAYESLRIPDELPDAVQDAVRQARREARRERSGNHFLRYAVCAAACLCVIFVVALNAVPVLAQEMYGVPVLGHMARIFTLNRFEESSE
GUT_GENOME105241_036857-98ARKIYAELPVPDTLSELVEHSIRQAEKEVKIKEKQRKLRIFRGCMLIPAACIACLVLVLNTSEAFAMGVYGIPVLGDIARVLTVRSYTDINR
GUT_GENOME105638_034171-93MNRIEDLKKEYEAYQAPASLKRKVKNAMARAKKERRNKAIIRITGRGILGVAAAMLTIVILVNSSPQIARAMGSVPILGALSELLTFRGYTER
GUT_GENOME103719_024989-78YQDIPIPDALGDVIDRAAARAKHQRRRRPLVRTGICAAAALCVLTVTANSPLAAQAVHIPVLGTVVQMLH
GUT_GENOME230902_0102625-115KKAYQQLEIPLDAVDKLRETIHEVKIRKKRKAWLQGSLTIIAAACLCILVLVNSSSTIAQAMSRIPGLGGLFEAVTFRHYENVTDDYYAKV
GUT_GENOME103756_026264-100QDLEQLKQQYQDTPIPKELDFVVRKALQENGGTTMENKRSHKGIKIAAASVAAAFVVITAGANISPVFAATMGNVPVVGSLVKVLTFREYTMDDGTY
GUT_GENOME246426_012287-92YNALEIPPELEKVVRDAIKQGLNGLKTARGGGRIDRMALRTAAVFALCCVTLLNLSPRFAAAAAELPLLGPLARVLTFREYHYEGE
GUT_GENOME139825_000397-94MKKLKEEYENVKCSDDLKERVETIMKTTVETRMKKGPARIRYALSGIAAAFVICTITLNAMPSAARAAMGVPFIEKIVNVLTFGRFEM
GUT_GENOME096235_028911-101MDDKQLSQLKDEYHKMPIPKELEQVVQKAIEDGKKKQQRSRLMMGRKIVATAAAAVVVFIGGINVSPTMASVIADVPGMKNIVEVLTFRDYQYDDGWHQAN
GUT_GENOME080115_0075212-96EAAHKEKTNIPEDFNIMLKDTLNKLPEKEARKDGASGRMAGRRILTAAAGVPAMLFVLLPNVSPAIAYAMEKLPVIGSVVKVITV
GUT_GENOME205996_0048813-104REAYNNMVIPSTLNSKVKEAVNMSPGRKHTNSRIIITVASTAAAIVILFIAGVNTSTAFAATLSDIPIIGSIVKVVTGRSFSEKNDNVTINV
GUT_GENOME012599_0147512-76KASDELRSSVKRKIAASRRRIRLRSAIAAVGSVAAAFVICMNVSEDMALAASRIPVLSNLVSVVT
GUT_GENOME200288_0530518-99RKTEYETMPVPDSAAYQAVQSGIRQAARKRKSRLRWYMSSISAAAFILLFTGCIRVSPAFASFVEQLPGMEGIVSMIRQDKG
GUT_GENOME142616_008891-107MKPDSKQRLRQARKCYEETPVPEALPYLVRSCLRERPQKTRWARVMAPALVCLCFLFVTLLNCSEVFAQTVSQVPLLGEVAKIFTFRSYSEKDPMKNIDVRVPALSQ
GUT_GENOME006942_0050616-100EDIKAPEKYLKSIDETLNNLQDTNEKIHVKPIWGYSLKFALAIALITFVVLPNLSSEVSYAMQKLPVIGNLIKVITIKNYFDKDG
GUT_GENOME081674_031132-116RQIEDARKKYEEMPIPQELSGRVRAAIVRSEEKRKSQGQKIPVAGIYRRNKIYRRNRIYRRNRIYRRNRIWGLCAGTAAALVLVFTTALNTNTAFADMAAGLPVIGAAARILTFR
GUT_GENOME195266_0102111-108RAREEPFPLPDSFVQRLRETCDSLEEPPRRKPERRQLSHWGAWFAAVAASLLVVLPNVSASAAEVLEQIPVLGSLVQVVTLRNYLYDDGHSFADVSVP
GUT_GENOME096045_003003-86RFEKAKRYYQSIEIPAELNDVIEDALSATPRKKKHRIASKVLLSAACLLIVFTTALNASEVFAQSVYNIPVLGEFARVITWREY
GUT_GENOME098572_0317132-166MEQMEREYKEIKVPTELKGRVLQAMQEGKDALGAEQEKETERVTEKKEVSKQVQDNGKQGRKKTASHYFLRTAQTAAAALLVVTVLANTSAETAYAMSRIPVLGAITKVVTFRTYEQQNENSEAKVEVPKIEAGE
GUT_GENOME246450_005164-85KSGYENIEIPGELDQVVQDALAEGLEQRRKNRVRDLSRRLGPMAAVFLLCTVTALNISPTFAAAACDLPVVGGLCQVFLFRE
GUT_GENOME000711_038403-99KKLEQLRKAYQNVPIPKELDQVVETALQHKPKKKRRLVVWPASTLVAAAVLLAAVVNINPGAAQAMSRLPVIGKVVKAITFTEIKQKDGESSIDVKT
GUT_GENOME137258_005713-104FKKAKNFYKSTKIPKELNQKISNITHKYESNKVNNNQDILGKINKKLLTKQSKIILSTAMSFCLLFVILLNTNVTFAKTAKNIPIIGKVAEILTWADYHDET
GUT_GENOME000338_022455-84LKDLREEYLKTPIPKELNDVVQAALNEKPRRKPRVGRNILVSAAAALLIVTASVNISPAVAKAMSDIPIVDKVIEVITFV
GUT_GENOME213735_002137-101LKKLKSQYIGVEVPDDEKMKLNEVLEYAKIDKKRKRKKVIIARWMISVAALLLVVIIPNTNETLASSLKNIPLVGNFFTVITVRKYDKDKNIKVN
GUT_GENOME045758_027235-82FNKEKIIYKNIEIPEDLDFLVAKSIIEGNKKTSRLVLISKIVAASFIAFIITLNLFPKFSIVAQNLPIIGKLAQLLTR
GUT_GENOME200288_027946-87EQLRKEYQEIPIPDELDSIVNQSIHKFSKEGRRGRGSKYRWLAVSSVAAFMFLIAINVSPPVAKALSSIPGVDRVIQVLTFK
GUT_GENOME096443_035649-96LKNLYDNIQVPIELLDKAIFTGLEKAKAEEKQQQTRRRKWLVLSTTVAVILIGFFTSVRLSPAFAHYISAFPGMEKVIELIAADKGRM
GUT_GENOME274355_018073-96NKLEKLKIDYENTKIPKELDFMVRKTIRDTKNKHRFGSAKCAGTVASVIAIAFTVAVNVSPAIAAGFADVPVLGGIVRVINVRAYNVNENGAEV
GUT_GENOME113641_014968-83KKSYESIKAPEGLYDKISGKIIKMEKRRAIWRKCSGIAACFALFAVVSACFNPDIAHAVQDVPIVGDIVNVITGGR
GUT_GENOME137369_009009-85RVRHESTPVPEELEFAVASALRAGQRRLRGRRALRRSVSGVLAGCACFVLLVNASPAFARAVSDVPVLGGLARVVTV
GUT_GENOME207236_009033-84KYDEIKMPDNIDDITNTAISKGRKQKNLRKYRNVMVASITTLVIGCGVMGVINPSLADSIPIVKNIVNYFDNNKDSRFKGDK
GUT_GENOME147776_0075412-92DQTPTPPLLAGRVENALNATGPKPRRQLVWLRYTALATAFLCGVFVLSVNLSPAFASSVYAIPVLGEVARVVTFTEYSHQD
GUT_GENOME217812_011041-104MSKKIDDFKKQYENIKATEELIMKANKEIKKSKAKRGFAAVAGSAAAFVIAFGVAANVSPTFAYAMSDVPVLSGIVKVVTMGKYENKDNGYEIKVETPKIEGLL
GUT_GENOME199813_0025317-100KKAYQETPVPQDLSRRMERAIAVGLAPKPRQNWSKWAGGMAAGLCACFLLALNTAPAFAQAVYQVPVLGELCRVVTFRSYQQQD
GUT_GENOME000607_001231-92MKSLEDMKKEYDNIKIPDNLLAVIEEGTKRAEKEKKSMKMKKTFQKFAIGAAAAAAVCVILPNTNASIAYAMGDIPVIGNIIKIVTFRDYQV
GUT_GENOME113666_0181016-104KEEYDQIPVPQETRDRIEAGIMRARLEKKRSDRMKNMKRTGVTAAALVLTFGIAVNASPVVAQAMDGIPVIGSIARVVTIRNYNESTNN
GUT_GENOME096465_026931-97MNGKENSIKEALENIEVPEARLDAIIEESFWETSAAPVKEKRRNRWMPAVATAVLVAGISAATLSATPALAHYMAQLPVIGSVFSIFAENPRGAAAF
GUT_GENOME214302_026904-85NEIWDNIVVPKEMENVIKNAVTMGYQEVKTYKEKNWLNIAITIGGMAAAFTLFVGAGFLNPMIANAYSEIPVIGKIFSYLYE
GUT_GENOME216297_017181-88MNRLQDGKELYQSLALPDGLGGAVQEAVRHARRETAIPLSRRVGWAAACAGACLCLVFAAALNAFPALAKEVYDIPVLGQVAKVVTFW
GUT_GENOME252036_008172-117KDKELTEMKNDYENIPIPKELKERVINSIDAAKQDLNEEASEKRRSFRFRKLAVRTAGAAAAALIAISIMANSGEDIAYAMEEIPFLNAIARVVTFREYTHKENQMEASIKVPNIR
GUT_GENOME172924_018856-91RLDKLKEEYHKIPIPKKLDTIINHEKIENIYYEKNKKVNRLRFKVAIAFACMFTVLVNISPVFADNFSKIPIIGGIVEVITIKNYS
GUT_GENOME130357_025655-74RKNYESIQIPMELSGVLVKTVERKRKATARKFAGIVMAFAVLLVSANIPTVYAALSEIPVIGDVVKILHI
GUT_GENOME096465_033349-89REAYKNVPIPKELDEIILKTIPQKKRKKHLFMWPVAAALLLTATVNFSPDVANAMSKVPIMKGIVEVLTFNELNEAANNTS
GUT_GENOME108771_011925-99KEEYENIKIPEQLNSIVQDAIEEGLTTESTGMSDQKPVNIYTRRKKHILRYAGEAVAAMFLIFVVLLNVSPTFAKAAADTPFIGPVCQVFTFREY
GUT_GENOME155739_015852-98NPMEDAKKRYDEVPIPEELSKRVQWEVEKADWRRKAENASVRKRRRHLYGWQKGLAAAAAAMAVFVTALNTSTVFAQSMGELPVIGAIAKVLTFRSY
GUT_GENOME256840_012598-101KEAYRSIQPPEELEETIRAGVRTAERIQRERRQKSRPVTRYAVCGAAALCMIFVVAVNASPVLAKNLYDIPVVGNVARVFTFVQMEKDEGADTL
GUT_GENOME096295_025456-86LDDEKLRYENIEIPEELDFMVRKTLKEGRRKRNLKKFYKYGSGIACTFLLFVVLVNIFPNVAYAMSKVPGLDKLIELVTFD
GUT_GENOME000147_007032-87NKLKNTKKIYDQIEVPDNLSEIIQQTVNETSPPRSTRHFFKLLVPLTLAMTTVFVILLNTSPVLASNLSDIPLLTQIVKVLTFRNY
GUT_GENOME237874_002008-123ILQKSKKIYDNIEIPEELSQVVDESIRHMDEKRKAGERNSEADRRKIAGSKENFRNMDRVKKKKGLGPTKVLWRTLSTAAAVLICFSIGLNSNQAFAMEMSQLPIVGRLAKVLTVR
GUT_GENOME096443_010195-84LEELKAEYKNLPIPENLDDVVQQALTEQPQQKKTFRYKALSGLVAAAAIFAVSVNVSPALAKSISDIPVIGSIVKVITFT
GUT_GENOME130636_019191-102MKRGKEVYDSIEIPVELSGRVNEAIAAVDRRQAEKKKRALVRSRKLRAFVKTTGTLAAAFFICLTVGVNSSYVFAKEVGSIPVIGVLARVLTVRSYSGQEND
GUT_GENOME143282_021713-58QKHCMKSRKKKRFVWPMTSVAAATVFVASVNLSPAAADTLSKIPGVKELVEVVTFE
GUT_GENOME096270_019952-81SKHFPEFHRELENIPVPIEKLDSIIQQTISETRKQKPKRKKFVLTTIGTVAAGFFLFIGSATLSPAMAKVASEIPLVGTF
GUT_GENOME096550_016574-82KERYENIEIPSQLSDVIEKATQRGRAHQRNRRMIRGTSTLVAACAALMIAVNVPGVAMALSDVPVVGSIVKVLQVGGGG
GUT_GENOME243814_011851-83MDRMEEYKKLMSELDNVPMELSHTLTRAKSRFSKKRIFNWIAVPICSVISAAFLFVLMVNISPTFAYACEKIPFLNDLASAVK
GUT_GENOME245980_014566-102NRLQQARQQYESIPVPSELSARLEQTIAEHPFVEATAPQNEYNIKKAHKKRRPFRPFVSLAAACTLLFFVTVNINPAIAEPVNEIPLLSNITRVFTG
GUT_GENOME208951_003515-88NDAREVFEQTPIPDSLGETVTAAIRTGRQTARRQRRSYGWYIAACACLAFIVTLNTVPVFALSIQEIPVLGNIARVFTFRQYSE
GUT_GENOME037405_007714-110ENGKKTYESIDIPKSLNETVMQTIASKKKEELKMKYESNENKKASETKKSYRPVWRGCVAAAAAVLVAGTIGLNTSPVFAENMSKVPVVGQLAQVLTFRSFEGTEGD
GUT_GENOME071328_018774-97LDEMKERYDQTAIPEELNVRIRQEIEKSRNKQAEKRGKGRSRRIKVIIRSMEAAAAAVCILFIAALNTSPVFAKEAGQLPVIGGLARVLTFRSY
GUT_GENOME236871_0117524-105YQSIEMSEEQLNQLKIKMKEAKMTGKINKKHRYVRKIITVIASCAAAFLVAIIVLPNTSGKVAYAMERIPVFGQLVKVITFR
GUT_GENOME207529_0303717-100IVQYSEHIDIPEELDFIVKRAIKEGKAAPKQKHFNFNKWSKKVAAVASIAVASFVGMTNLSPTFAKSMSDVPILGALIKVISFT
GUT_GENOME096458_029129-90LKEISKDINSINPPDNIDEYIKRGLDTGINKRRHRKVRRISFTAAVFMCVLFVISIRTSTVFASYVRRIPGFNRIVDLINYD
GUT_GENOME046065_0161110-101LNTALEDHLNSIPIPEELDLVVQRAIKESKNSSKRMINMKKLTKNIGATAAALAIGFIGMANFSPTFAKSMSELPIIGDIVNVVTFTQFDYH
GUT_GENOME255855_011569-91LQKIYQSVKIPKELPYEVNKAIKNSKKKRLSFAAKTTIKVCTTLATCFIVFVVMINVNPNFVSAVENIPVLSQLVEWLQIREI
GUT_GENOME188058_027461-76MNNNIKDIIKIPKELDNAVLKGFEKGKREKKRAKQKVIFKRSAIAAGIIVAGTTMAGMINPELVSAIPIVGDVFEY
GUT_GENOME212283_029735-86DKNLKKLKDTENIVIPDSLRNKVNEAYSKIQEKEETVISQYVKQSLKVACILIVIIVGINFVNPILAEEVPLFGPVIKVLNE
GUT_GENOME207523_005198-106KEIYDHIKIPGELRQIVDNTLNEQVGKVNSGINKFKDNLVVHIFKYTITAAAVLIICFTVALNTNEVFAKGLDNIPVINSLAKVLTIRSFKNVNDNQSI
GUT_GENOME119656_0046820-104EQIVIPSAVREKIELALAKLPETEKDTKRKDAKLHVSLPLRRFAAAAACLVFLTLFLLPNISVNYAEAMEKIPVIGKIVKVVTIR
GUT_GENOME078476_032155-87DKLRQEYEEIQTPQDLDSFMDTCIDQAQKKQSRFTFYRLWKPALSAALIGYLIVLNTVPAFAQSVFAIPLLGDVSRILCVREY
GUT_GENOME022451_006672-101KKFDKIKQEYEAIKAPEELKSKIADTVKRQNARARLYTRLITGTLSAAAAFVFAFNFVPGLAYATADIPFLNSIVKVVTFGRYEAKDHGYEAKIVTPKIE
GUT_GENOME096494_020171-102MDPKLDRLKEEYMNISIPKELDQVVAQALHVGITINGAQKKKKKRKTWMAYVCSAIAAAALLLVSVVNISPAAAKSLAEVPLIGKLVEVLAFREYSYNDGQF
GUT_GENOME237445_011864-80GKEQYDNIPIPARLDQVILEAIDKAEQDNRRIRLRRWVAGAAAVFCMLFLSANITPVYAYASQIPVIGTVVRVLHIG
GUT_GENOME188398_007155-89GKRRYEEIPIPDDLEWIVARACRRSSHRRIVKKWLTGFAAVFTLLLICANITPLYASAAEVPILGQMVRVLRIGSGGHATDDVFV
GUT_GENOME049473_010661-85MNRMDEYTALLAELEQAPEELNHTVNKALNRQNTLQKKRRFLGTSLGGLAACFAAFVLLVNLSTPFARACGRIPLLADLAKAVSW
GUT_GENOME062780_012961-105MNEFKRAREEYESTPIPEELDARVRAGIRQGRTSSRNPGYTGSPGGRPGRSRGYKVVRRTAGSVAACLAVLMAGLNVSPTFAAAAADVPVLGGLFQVLTIRDYET
GUT_GENOME208505_0024713-98YESIPVPPELEARVRRTLKETEKAPEKKRRSGWNAWTKTITSLAAVFALIVVLANSSATLNTAMARVPVLGPITKVVTFRHFRDQT
GUT_GENOME142591_0201748-120SAPDSVDRAILQGIERGKKLKSRRAGKRRKRMAASAATVLLVAACLFTIRVSPVFAAIVREIPGMEAFVDLIG
GUT_GENOME140601_0314621-92LPAPEQLDQAIERGMQRAKRKRPVKSWSLAVSAAVCVLCLLLAVSIRVSPAFATLVSELPGMSYIVKLISYD
GUT_GENOME142591_046584-83WERELNHLVNASLPEQVDRRIHLTLDQLKRTRKKSFGLRFGTTAAAAVLVLSFGLTALSPAFAEAMKSIPVIGSVFELVG
GUT_GENOME092360_006119-102KRDYHAVQPPENGIQEVRKQMEQAKREQKRRATRRTRWIVGVAAALAVCIAIPNVSPTAAAYLSNIPVIGSIVRVVTLNRYQFDNGHYQADVKT
GUT_GENOME075625_0165111-105ARQQYGQIPVPPQLNAKLEQVIAQHPYQKPAAPKRPFYRSISTIAAACLVAFLTILNVNPSIAQAVEHIPLLGTITRVCTVQTWFSQTDNNTISV
GUT_GENOME208053_015323-89KNIFEEEKRNYENIEIPEELNFRVKKAIIIGKKERLKKRIYNVGKSVAALFIIFVASVNISATAAEAFSSIPGFKKLVELVRFNGGD
GUT_GENOME143400_004501-112MNQKMEELKRQYEEIPMPEELKGKVEEAIKRGQEAERREKYSKKRGGWLLILKGGGAVAAAALLSIVVLSNTSAETAYALENIPVIGAISRVVTLSTFADKQGDYEASIDTP
GUT_GENOME108178_000843-124RLDDIKEAYDNIQVPAELKERVLQSMERGKKDACGEKKESGKAFAGKSERNQKKGHLLPFVRVAEAVAAGGGGVGFWVYCCAKEAAAVAVRVISVNVSPTVANAMEQVPVLGAIVKIVNFTT
GUT_GENOME057177_014027-94KAKKVYESMEIPEQLPNVIDQAFVRASSRRRIAWYRPVLSAAAAVCILFVALLNLSSTFASAMAEVPVLGNVAKVLTFRSYSIETDSE
GUT_GENOME038774_008656-103GKKAYDNIEIPKELSEVVNRAIASQNKEQIREKSRKKRRRDSTVRIFRYVASAAAAFVICMTIGVNTSEAFAKEMSDIPVLGSLARVLTVRSYHGTDG
GUT_GENOME102426_0015513-96KDEVQDLPKHVDDAIKKTLEALPERKRKSTYKQKHYSRAWSKVALIAAAVVGIFIALPNISPDIALAMKNIPVLGAVAEVITFR
GUT_GENOME096498_030422-88NRQMERARARYKEVELPAELDFAVASAIRAGDRKRRSRDGLRRTAAVLASCCACFVLLLNVSPTFAQAVEAVPVLGALARVVTVREY
GUT_GENOME103867_013345-107KIDKLKENYNNIEIPKELDDVINDAFNESENKKLENNKKDWRRNMKNMKKWYASAAAVGLIIVSVNASSTFAKSLENIPVIGNIIRVVNFNNYRIDKDGMDVS
GUT_GENOME000496_0332015-103NEMKNDYEKCEVPLELKNRIEESINRAEREKGRSKVIRFVKGAGMTAVAAMLVMTILVNTNQNVAMAMSEVPVLGSIVRAVTFRTYQEQ
GUT_GENOME212150_009591-79MINENFNNIEIPNNIDEAIDMGIERIVNEQKRNNRKKVIGGIVAGLAIIVSIGMCTPTFAKNIPLLKEIFSFLEHNKSE
GUT_GENOME142610_034115-96LERLQQAYNEIEIPKELSFASRKGIERGKIYKQKNSSSKRTLKWSGSVAAAVLISFTVGVNTVPAFANSMDQIPVLGKLVSVLQFTEGSAGG
GUT_GENOME096497_002497-89KELMKSKTHYHDIQIPDEIDTYIYSGIEHGKRTKRRIHHTKLSTFTIAALLFLSIISIRVSPVFASLVSQVPGMERIIQFIQG
GUT_GENOME103719_0288112-80VEIPEELDAAILRGVQNGKRIMNHKKYVQRTVAGAAAVFAIFVGSINASPALAANLEDVAVLGSLVRVF
GUT_GENOME258823_010055-96LRQAKKNYDAIEIPGELSPAVQDAIRRGRLQKIGRHFSLGKIASCVAACACIVFMVTLNAFPVLAMDLYDLPVIGGLTKVLTFYRFEAADES
GUT_GENOME220584_008938-91RRAYQAVEIPPELESVVERALRQGRTCRRWRVRPLWSAAAMLLLGFIALLNVSRAFAELVGEVPLLGPLARLCTIREYSRSNGD
GUT_GENOME188419_009765-88QDAKKVYDAIKIPEELSSKVKEAISKGSVTPKRRHFRWYQTVVCTAACIFLMFVGALNAFPAFAMQVYDIPVLGNIAKVFTFRE
GUT_GENOME064426_024612-110MSRMEEAKRRYDEIQVPEELHERLEKLIADSAPERVKEAAADGKRGKVVWMKGCAWAAAAVVALFTLALNTNTAFAQEMGEIPVLGAVARVLTFRSYVKEDADMGIAVE
GUT_GENOME213703_014619-91DGKEAYDALRVPEELPAAVREAVRHARCAERRDARLPWGRVAVCALASAVVVFLVTLNAVPALAQELYGMPVLGQMARVFTFW
GUT_GENOME175197_002846-89KAKEAYDSIRVPEELEEKLERAFHTGRARWKKRRIHTKIAQSCTCFCILVCAAFITLVNTNAVFAEGVSSIPVLGSLARVCTFV
GUT_GENOME282991_007459-100LEQLKNDYQQIPIPKELDMMIQNTLERDANERRRQKRLHTWRNIALTAAAVAALFVGSVNLSPTVATALANVPGLSTLVEVCTFGGYHVEEN
GUT_GENOME065623_000522-81SNKKFNDIPIPQNIDFVIEEGVRKAMVEKQRNQTVRHYPKSKKTLGVIAASLAVVLGVGISNPAMASKLPLVGNVFKAIE
GUT_GENOME065600_004121-103MNTLQNAKKRYEEIEIPEELSVRISEEIARADKRRWKGRLSRRRVKNIAAAAAAAAVVFTTALNTSTAFAESVGELPVIGAVARVLTFRSYETSDEDLKISVE
GUT_GENOME073388_018087-89WESAREEFSSVPLPPDLEEHLREGIRQGKKRRKMQTIRRTLGSCAACFLLMVGVLNLSPTVANAAADLPVVGSLFQVLTVRQF
GUT_GENOME243814_040474-90LKEAKQKYDAIEIPDELSEVVNQSIHKMSDRKVIPVKTNYKRVVGGIFAASFLCAVVTLNTNEAFAKTLHEIPVIGNVAKVLTIRSY
GUT_GENOME118410_0067645-153DENKLLEALKKEYMETQIPEEGVENMKKRMEQAKWEKAHIKRKKWLRRVGACAAAAVALTILLPNMNAGVAMAMEKIPVLGGIVRVVTFGRYSFEDENHNAQVEIPQVE
GUT_GENOME005249_0114135-116LRKEYLAQKMSVEQIEKMQKEMEAAKRKRRRGKQLVFFRKFAGAAAVLAVTFIALPNTSLGVARAMENIPLLGKLVNVVTFR
GUT_GENOME142587_030974-78KIEFDQIPIPENIDDVIDRGVTRAVKIKKRQRRKKFITTTTGIAAAVAVFSLFCVSNPALAAKIPLIGHIFERVE
GUT_GENOME096551_011165-101KLDDIHTDYKNIAIPEDLRRRVEETIKQAKEETAMNKSKNKVFKIMKGTAGTAVAAMLAITVLANADYTIANAMNQIPVIGNIAKVVTFRTYEKNEG
GUT_GENOME207909_024238-77NSIDVPTNELNNIIKDSIKRGEKYKYRRKNVRLKKIGGGIAAVFVLVSVIGIMNPKAVSAIPLIGSIFSH
GUT_GENOME199120_034037-95GKRTFEQIEIPEELKETVERAVHSVDKKKSAARYRQRRLVRAARNIGVAAAAVLLCMTVGVNTNEVLAKELGQLPVIGSLVRVLTITSY
GUT_GENOME139860_016595-105QKMQDCRREYEAVEVPQEAKERILMGIERAKKEQEEKEGAKCGRRAGGRIWRYAGIAAAACFGAIVILANSGQSVALAMEKIPVIGAIAKVVTFRTYEDQT
GUT_GENOME009557_016245-89KQMDMLKKEYDQVEVPEQALAAVQTGIQKAKQEKRRSAQWKHWGRLAATAAVVTVVALPNLSPQVAMAVADVPVLNKIVQIVTFD
GUT_GENOME188395_0332913-110LNTSKQGYNQIPMPEQLDEMVNDVFNDYEKNQKKKEMDYMKNNVVYKTCNKTKKIFIRTGVSIAACFSVLIIGLNSSESFAKGVENIPFIGTVAKVLT
GUT_GENOME061534_011716-98LGEMKKEYESIEIPAELKSRLENTIHQAKEENKRVRFFKKLPKKIGYGAVAAVLAITILTNYNEQMAYAMAEVPVLGAISKVVTFREYVKNDG
GUT_GENOME096525_0325581-160KSHYENTSLPSGLAGVVQSGMARAAKRRQTKIRSRLAAWTAAAVLLLFIGSVRVSPVFASYVSRIPGMEGFVEFFSQDKG
GUT_GENOME000598_024616-91KNGFEKITISNRLDDVIKKSISKAKRDKKIRNIKLRLIKINVAIASLIIIFISSVNFIPVFAESLSDVPVISSIANAVQFHYDKNI
GUT_GENOME246457_017626-126LEERLLDTKKIYEQIEIPAELNERMRLALGEMSKVTEDKEEKPEVTQTRRDEKRVSHMKRRWNWKKTTGLVAACAVFCFTAGLNISPTFATDMQANPVLGGISRVLTFRSYEISDGDKTIS
GUT_GENOME137644_001752-88KNLNKAKEVYQSIHIPKHLSYVVNKAIGKNGKKEHHPWNFWKPMVSTISCVFLAFVFMLNVSPSFATTVSEIPILGEVAKVFTIEEY
GUT_GENOME231691_033067-99AKKIYESIKIPNELSLIIDKCIKNEEGKNKVVPMKKKNYIKYLTTIAAAFVLAILIGVNTSEVFADTMQNIPIIGNVMKVLTVRSYEDKNKDR
GUT_GENOME262134_011385-93RDAKDIYDAIEVPQELGETVRVAARSASRKPQLVRRHAVRYTACAAAACLVLFVTALNTIPAFASGVSEIPVLGRIAQVLTFAEYERQG
GUT_GENOME075775_009388-126REIGRTKKIYESMETPMALDDVVRGALKQETWEQRDMRRRVSETGTQGKRRGKAAVPGKKSLHGKIGLAAAALAVCFITALNTNEAFAATAGRLPVIGAVSRVLTFRNYDTSDKDKEIH
GUT_GENOME257007_0040718-97EELSRRVEEAIRRQRPRPRMRGWQKSLTGLAACFALFVVSVNLSPAFAASLQDVPLVGDVAQLVTFDHFHQVDAAKDIVV
GUT_GENOME000677_018658-87KKIYDETPIPKELTDVVNEAIYASAQGSRPINSRSPFIAIFLSISTVFILLLNTSPTFAASLSDIPIISQLAKIFTISSY
GUT_GENOME000338_053951-83MNNQIPDIKEAIEKIEVPIDKLDKTIDVAIKRAKTKHKKPKRKLYPFIGVASLATCVLICSAFVSPAMAKVLSSIPVLNSVFE
GUT_GENOME220529_013067-93AKQIYQNIQIPDRLETVVREAFEEPEEPAHGRIRWYQPVLCSLASLCVLFVVLLNSSQVFAASLENIPVLGSIARVFTFRQYQEEDT
GUT_GENOME228897_0033612-85GQDVPMDKLSAVIDQAVVKTAPKRKHLNWWQTSLIGVGAVAAAFTLAVNTLYSFASAAERMPVVRDAVKVVLFR
GUT_GENOME000573_020546-87FDKAKVQYNSIEMPNDLDTSLMIGMKSGIRKRKMRSRVPIYSIVVMVIGFVLCINTSVVFAQAMRNIPIIKDFADIVIINEG
GUT_GENOME112768_011801-101MKKNVLSNMKKAYDNIEIPEELQERVKQGIKTGKQKEGKNTTVCMKWSFRIGSVIAAAVLAFGLLLHFNEKAAYALSKVPVLGNIVKIVTFRDFEDNTGEM
GUT_GENOME113977_0246512-93RREYLETEIPPELPGIVHAPFLKRKRSRLPQQMLCLAACLCLIFVGALNLNPAFAAAAAQVPVIGAVAQVLTFSEYHVSTDR
GUT_GENOME008113_013175-103KKEYEEIKVPENMKERMEASIARAKKDKRKVKKVKLWKTCTSAAAVLAIVLILPNTSQTAAAAMQQIPLLGNLFKITTVREYQVDEERNMANVKVPQVE
GUT_GENOME097905_008822-84KNINDLKKEYMDIKIPENLDDVVKESIKKVDRKSIVNKKIIGSAAVLAIIFGINISPAFADVVSDIPIVGNIVKLVTVKNYTL
GUT_GENOME000259_0117412-87EAQTPPQELAERINLEIAKSKKRRKAQSGYAGFKIAGAMAACLAIFIILLNTNQGFAMAASRIPVVGSVARVFTFR
GUT_GENOME207750_028125-92LEDMKNEYLSIPVPDELKKRTLKSIKRHRTLKRISHNAIFLAASILLLITTVNISPAAASTLADIPFLNKVIKVVTIAEWKETKENSQ
GUT_GENOME261122_022241-86MRKIFAKEKLVYKNIDIPEELEFIAIKSINEGKKKNNNYIKILSKVLLAFLITFFILLNISPRFSTVSQKLPIVGKLAEILTINKG
GUT_GENOME094967_018759-128LESLKKEYKSIPVPPEAKERIRQGIRQAHEASAGEGTKEKPKTAHRRLLRILRPTAATAAAAAAAITILANVSPVTAQAMEQIPVISSIAKVVTFRTFEDEQNGYQANISIPQVTDGGAG
GUT_GENOME128679_019127-106KELEKLKREYQEMKIPVEGYQKMKDAIDRAKMEKRRRARKNMWKSGMTVAAAMLMMVALPNVNDNMAYACEQVPVLGAFFRVVTVKNYQYSSENRSMNVA
GUT_GENOME087248_0254310-90LKDNYMNIEIPDNLDKVVNDALNSKSIKTKRKTISKWSSVSASVCLVVGAINLSPAFANTLEEIPVIGNVIKIINFRNYSI
GUT_GENOME094792_014621-88MNAGKVKYDAIEIPAGLEDAIDSGIKRAGRQRPMRALRRTAAGAAAAVCVLFAGANIMPVYSFAADLPVLGSIVRVLHVGSGGEVTDG
GUT_GENOME056098_002073-86DKSDYLNIEIPDELDAIVDGAILEGKRQRKGGGTAGFFKKTAATAAALVISFVTLLNVSPTLAQAAYQIPVLGDLCRVVTFREY
GUT_GENOME152588_008268-82SISIPDDLDDVILRGMNQGKTRAAHRRRLKRTAACIACSFALVAGLFLGGVYTSPSFAASVEQIPVLGQLVHIFY
GUT_GENOME016866_015583-78GKDRYEDIEIPAQLTDVIHHAQKRATARKNSSRMIHYASVIAACAAFLFVVNIPSVANAMSKIPVVGSIVQVLQFG
GUT_GENOME085197_000154-92EPWKKAREEYQNQPIPPQLEARVKQAIGEGRQSSPRQHWLKKGLISLAAAAACFTLMMHGSPVFAQTVANLPVIGEFLRIVTGVQIEKE
GUT_GENOME282543_009124-77RKIDDLIEDYMNIKISDKLEDKVNEAIKESKKIKKRGINMKKGFIGVAASVAVIIGMLNVSPAFADAVKEVPIV
GUT_GENOME032999_013045-87DDLKSEYEEIEIPKDLDAFIDQAIHTASRSRKRRQIFIGTGCAAAAIVLSFGVAVNTSKVFAQSVFKLPIIGDAAKIICIRNY
GUT_GENOME000104_0170812-93QNYNDIKIPENLNEVVNDAINSKHRNKRINKKWIVAAASVCVVIGAVNINKSFAQSLEEIPVIGSVIKIINFSNYQIKEDGY
GUT_GENOME044535_007691-87MNKKDKQIRQRLSNENMEIPSTVKSKIEQTLMQLPKQEQQPVYSNRKRYAFACIAFIMLFLLPNVSTTYAKTLEQIPIIGELVKVVT
GUT_GENOME207525_005071-100MKKLYDLKNDYLKVQIPDELEFLVKKSIMKKEKNMKRKRKFLTCTASVAAAAAVFVGTINISPVAAQAMSNVPVLKDLVKVVTFREYHYEDDSHELDIKV
GUT_GENOME206730_0190012-86FDCIEIPESLDSLMQQTLRQGRRKGRLTRIMKYASSIAAVFVLATVTLLNTSPVFARAAQDVPVLGGLCRVFTFR
GUT_GENOME173857_016737-87AKKAYEDIPLPEELGGRVWAGICQGRANRRRARRRRVLRTVGTFAACFVLLVGGLNLFPGFASAAAEIPVLGSFFQVLTVR
GUT_GENOME141727_002903-91KEEFNHHINDIPIPEDRLIQREKTAILQAKKNQYRRTKTIKYSSLVACGICVSLLGFGFISPTMAKTLSSVPVIGPIYAQFNDIASDKI
GUT_GENOME201412_009901-88MNKQDRKIRRLLKKEKIVMPESIAQRFDFTVAWLQKNTERKPHFFTIGIRTTAVLLLLAFFILPNVSPEIAYAMQEIPVVDKIVRVIT
GUT_GENOME038060_01786169-298ERSHAHETPMDKLKSDYESIPVPTEAKERMLAGIAQAKKEQKGVIIMKFAKKTGGAAAAAMIAITVLANATPALANAMEQIPVIGSIAKVVTFRTYEDKKDNFEADIKVPQVTIEGTEGAQVPANKSIED
GUT_GENOME060293_025781-96MKDKLKDEYENIKIPDDYHQKVQKSIVYGLEKQSSRQRRIHLSMKLALSCFLIFIVFLNTSTVFAKTVSQIPYLGQICKVLTFTNQDEIDKTKSIH
GUT_GENOME096448_0153211-90KVYYREIPTPAELQDRIAAAYRSFRPETAREMRVLRWIAVSAASMFAMFVILLNSSPVFARSTHGIPVVGDIARVFTFRE
GUT_GENOME065332_016974-78NFDEIPIPSDKLDNIVSENLDAIKKQSRKKKQHRIFAGGAAAAVLITAFSAFCISNPVLASKIPLIGSIFKAVQD
GUT_GENOME120067_015824-93LEDAKKDYEDIPIPQELSERIMMEVKKADKRRKKQMIKRISRYGMTAAASLTILFTVGLNTSVAFANAAENIPVIGAMAKVLTFRSYQTQ
GUT_GENOME143709_028635-101LNNLRQEYEKTEIPAELDTVIRSAMERANQHLKEHKGGEQQMNKRSGKWFLKGLAGAAAVVILFTAAVNTLPGFAAAASHWPVVGKLVKILQFNDGQ
GUT_GENOME007015_018826-96KLDHLKQQYEDIQIPKDLKQKVKSSIEAGKSENESRRKKARFTGIAAKLGISAAAALFVITVAANANQNMAYAMTKVPVLKNIVRVVTFHS
GUT_GENOME064080_026471-91MNNSKYNEINIPDNLDERIDEGVKNANLQKIKNNRRKRNRAIGTIAASLVAVTTLGIVNPALAAKLPIVGSVFESIEKNIHFPGNYSQYAT
GUT_GENOME103725_011004-93QEVKIPEDLHNIVQNSIQTGLTRRKARQRQNRLTAISAAAVCCTFLFFFQTNAVFAKTVMDLPIVSRIARLFLASDSQTEEISYIADIKI
GUT_GENOME110728_011536-101LSDMKREYEQIPVPAELEFRVRASIQQAKLARKKERTVMTNVKRIIIGIGATAAAFMVGITGLANTNASIAHAMEQIPVLGAITRVVTFRNYDDDR
GUT_GENOME074111_0117518-90QLIDQTKEKMRERAVLNRTCRYSFGRQKRFFSRAAVVMAAVSCVFVVSVNASPAFAASIQNLPILGTLSQVLT
GUT_GENOME128637_005529-87TLRQELETVPEAMETIAERALAREKTCRKRKRVWGVPAGSLAACFALFLLMVNCFPTFAAACEGLPVLGELAEALRFDK
GUT_GENOME007061_0086212-131RQLENLRQEYASLPVPLEARERILKGIALGKSASHSKEFPPLPPQKGVIFMKLIKRTGMTAAAAMMAITILANIDPTIANAMEQIPVIGPISKVVTFRTYENNTDNFEANVQIPQVEAAP