The Keio Collection (2006) is a set of 3,985 mutants of Escherichia coli strain K-12 BW25113, which has played a key role in understanding systematic genomics.[1] The purpose of this collection was to determine the function of all nonessential genes in Escherichia coli.[2] The Keio Collection gets its’ name from Keio University in Tokyo, Japan.
History/Timeline
In the year 2000, Researchers Kirill Datsenko and Barry Wanner worked on creating method called λ Red which was a recombination method. Development of λ Red allowed for specific deletion of genes within the E.coli Genome. In 2006 the paper “Construction of Escherichia coli K-12 in-frame, single-gene knockout mutants: the Keio collection” was published. By late 2010 the collection had been set as standard for biotechnology, the paper garnered upwards of 3800 citations.[3] As of 2025, the Keio Collection has also been a corner-stone in the development of new bioinformatics tool in the EcoCyc database that allows for a comprehensive catalog of each part of the E.coli system that works as a metabolic flux model.[4]
Process
Escherichia coli was chosen for this collection because it is a well-studied organism that is easy to genetically modify.[5] 4,288 genes were selected for single gene deletion and were replaced with a kanamycin resistance marker to select for mutant growth.[2] Gene deletions were performed through homologous recombination, ensuring nearby genomic region were not disrupted. 3,985 of these genes were successfully deleted and were determined to be nonessential genes.
Applications
The aim of the study is to understand how genes influence E. coli as a cellular system. Through proper examination, researchers have been able to identify genes responsible for cell morphology,[6] interactive metabolites,[7] and antibiotic sensitivity.[8] These genes have been essential in the development of antibiotics,[9] allowing scientist to focus on specific encoded parts of the cell. The collection is also widely used in systems biology to examine gene networks and interactions in bacteria. In addition to that, high-throughput screening using the Keio Collection allows rapid identification of gene functions across various environmental conditions.[10]The Keio Collection has also been used to study metabolic pathways and identify genes involved in cellular metabolism under different nutrient conditions.[5]
Research advancement
The Keio collection has changed the way researchers perform genomic tests. The compiling of this complete set of Escherichia coli’s gene knockout strains has made testing individual genes less time and labor intensive compared to traditional methods.[1] Researchers could perform high-throughput studies, examining “more than 65,000 data points” from close to 4,000 mutants.[6] The Keio collection becoming more popularized led to results that were replicable and could be compared between different project teams and institutions. This uniformity resulted in the development of standardized techniques and shared protocols for large phenotypic screenings. This standardization and efficiency in screening was very important in advancing Functional Genomics, the study of individual gene function and the effect on more complex systems and overall cell behavior. Genes function differently when environmental conditions vary. Due to this, genes needed to be tested multiple times in different environmental conditions to determine if they were functional or not, resulting more numerous screenings. The efficiency and standardization of these protocols enable data to be compared confidently between institutions.[2] This increased the speed that accurate gene function predictions could be uploaded into genome and pathway databases such as EcoCyc.[4]
Modern updates
Due to advancements in Gene Sequencing and constant study it was discovered that there’s actually 4,296 ORFs regions in 2008.[11] 37 incorrect mutant strands were discovered due to gene duplications. Additionally, 299 ORFs were found as likely candidates for essential genes during the study. The Keio Collection has also allowed researchers to develop new methods of screening by using the model E.coli as a foundation for new studies on other organisms.[7] Sequential technology advancements have also improved the accuracy of identifying essential and non essential genes in E.coli. [12]
Limitations
The Keio Collection was published February 21, 2006.[2] These mutants have been sub-cultured to perform a variety of different biological research. Spontaneous mutations should strongly be considered with regards to accuracy for these mutants. Frame-shift and nonsense mutations could lead to drastic biological changes in these mutants, which can be wrongly attributed to the knock-out gene.[13] Another limitation is that gene deletions may cause polar effects, affecting the expression of downstream genes in operons. Environmental factors may also affect the designation of a gene, essentially, resulting in variability depending on growth conditions. [14]
References
- ^ a b Datsenko, K. A., & Wanner, B. L. (2000). One-step inactivation of chromosomal genes in Escherichia coli K-12 using PCR products. Proceedings of the National Academy of Sciences, 97(12), 6640–6645. doi:10.1073/pnas. 120163297.PMID 10829079.
- ^ a b c d Baba, Tomoya; Ara, Takeshi; Hasegawa, Miki; Takai, Yuki; Okumura, Yoshiko; Baba, Miki; Datsenko, Kirill A; Tomita, Masaru; Wanner, Barry L; Mori, Hirotada (21 February 2006). “Construction of Escherichia coli K-12 in-frame, single-gene knockout mutants: the Keio collection”. Molecular Systems Biology. 2 (1) 2006.0008. doi:10.1038/msb4100050. ISSN 1744-4292. PMC 1681482. PMID 16738554.
- ^ Aedo, Sandra J.; Ma, Helena R.; Brynildsen, Mark P. (2019), “Checks and balances with use of the Keio collection for phenotype testing”, Methods in Molecular Biology, Humana Press Inc., pp. 125–138, PMID 30788789, retrieved 2026-04-11
- ^ a b Karp, Peter D.; Paley, Suzanne; Caspi, Ron; Kothari, Anamika; Krummenacker, Markus; Midford, Peter E.; Moore, Lisa R.; Subhraveti, Pallavi; Gama-Castro, Socorro; Tierrafria, Víctor H.; Lara, Paloma; Muñiz-Rascado, Luis; Bonavides-Martinez, César; Santos-Zavaleta, Alberto; Mackie, Amanda (2025-04-30). “The EcoCyc database (2025)”. EcoSal Plus. 13 (1): eesp–0019–2024. doi:10.1128/ecosalplus.esp-0019-2024. PMC 12707148. PMID 40304522.
- ^ a b Pontrelli, Sammy; Chiu, Tsan-Yu; Lan, Ethan I.; Chen, Frederic Y.-H.; Chang, Peiching; Liao, James C. (22 April 2018). “Escherichia coli as a host for metabolic engineering”. Metabolic Engineering. 50: 16–46. Bibcode:2018MbEng..50…16P. doi:10.1016/j.ymben.2018.04.008. PMID 29689382.
- ^ a b French, Shawn; Côté, Jean-Philippe; Stokes, Jonathan M.; Truant, Ray; Brown, Eric D. (2017-05-03). Dunman, Paul (ed.). “Bacteria Getting into Shape: Genetic Determinants of E. coli Morphology”. mBio. 8 (2) e01977-16. doi:10.1128/mBio.01977-16. ISSN 2161-2129. PMC 5340871. PMID 28270582.
- ^ a b Heo, Jina; Cho, Kichul; Kim, Urim; Cho, Dae-Hyun; Ko, Sora; Tran, Quynh-Giao; Lee, Yong Jae; Ryu, Choong-Min; Kim, Hee-Sik (2020-06-30). “Genome-wide high-throughput screening of interactive bacterial metabolite in the algal population using Escherichia coli K-12 Keio collection”. Scientific Reports. 10 (1). doi:10.1038/s41598-020-67322-w. ISSN 2045-2322. PMC 7327039. PMID 32606320.
- ^ Tamae, Cindy; Liu, Anne; Kim, Katherine; Sitz, Daniel; Hong, Jeeyoon; Becket, Elinne; Bui, Ann; Solaimani, Parrisa; Tran, Katherine P.; Yang, Hanjing; Miller, Jeffrey H. (September 2008). “Determination of Antibiotic Hypersensitivity among 4,000 Single-Gene-Knockout Mutants of Escherichia coli”. Journal of Bacteriology. 190 (17): 5981–5988. Bibcode:2008JBact.190.5981T. doi:10.1128/jb.01982-07. PMC 2519525. PMID 18621901.
- ^ Dague, Darryl (2009-01-01). “Analysis of Potential Essential Genes Using the Escherichia Coli K-12 Keio Mutant Collection”. Theses & ETDs.
- ^ Goodman, Andrew L.; McNulty, Nathan P.; Zhao, Yue; Leip, Douglas; Mitra, Robi D.; Lozupone, Catherine A.; Knight, Rob; Gordon, Jeffrey I. (2009). “Identifying Genetic Determinants Needed to Establish a Human Gut Symbiont in Its Habitat”. Cell Host & Microbe. 6 (3): 279–289. doi:10.1016/j.chom.2009.08.003. PMC 2895552. PMID 19748469.
- ^ Yamamoto, Natsuko; Nakahigashi, Kenji; Nakamichi, Tomoko; Yoshino, Mihoko; Takai, Yuki; Touda, Yae; Furubayashi, Akemi; Kinjyo, Satoko; Dose, Hitomi; Hasegawa, Miki; Datsenko, Kirill A.; Nakayashiki, Toru; Tomita, Masaru; Wanner, Barry L.; Mori, Hirotada (2009). “Update on the Keio collection of Escherichia coli single-gene deletion mutants”. Molecular Systems Biology. 5 335. doi:10.1038/msb.2009.92. ISSN 1744-4292. PMC 2824493. PMID 20029369.
- ^ Goodman, Andrew L.; McNulty, Nathan P.; Zhao, Yue; Leip, Douglas; Mitra, Robi D.; Lozupone, Catherine A.; Knight, Rob; Gordon, Jeffrey I. (2009). “Identifying Genetic Determinants Needed to Establish a Human Gut Symbiont in Its Habitat”. Cell Host & Microbe. 6 (3): 279–289. doi:10.1016/j.chom.2009.08.003. PMC 2895552.
- ^ Sen, Oishi; Liu, Xianghui; Kjelleberg, Staffan; Rice, Scott A.; Seviour, Thomas (2025-03-31). “Potential confounding mutations in Keio knockout strains: implications for research accuracy”. Microbiology Spectrum. 13 (5): e02036–24. doi:10.1128/spectrum.02036-24. PMC 12054083. PMID 40162750.
- ^ Cardinale, Stefano; Arkin, Adam Paul (2012). “Contextualizing context for synthetic biology – identifying causes of failure of synthetic biological systems”. Biotechnology Journal. 7 (7): 856–866. doi:10.1002/biot.201200085. ISSN 1860-6768. PMC 3440575.