Illumina Launches the Billion Cell Atlas for Enhanced AI-Driven Drug Discovery
Illumina Launches the Billion Cell Atlas for Enhanced AI-Driven Drug Discovery
On January 13, 2026, Illumina, Inc. (NASDAQ: ILMN) announced the introduction of the Billion Cell Atlas, marking the largest genome-wide genetic perturbation dataset ever created. This ambitious project aims to significantly enhance drug discovery processes, utilizing advanced artificial intelligence across the pharmaceutical landscape.
The Billion Cell Atlas is the initial phase of an expansive initiative to construct a 5 billion cell atlas over the next three years. This monumental project is anticipated to yield the most detailed mapping of human disease biology thus far.
Under a strategic partnership with industry giants like AstraZeneca, Merck, and Eli Lilly, the Atlas is being curated from a specific selection of cell lines. It will serve as a vital resource for validating drug targets, fostering the development of large-scale AI models, and delving into research areas previously considered unattainable.
Jacob Thaysen, CEO of Illumina, stated, "The cell atlas is a pivotal innovation that will allow us to substantially scale AI for drug discovery. We are creating an unmatched resource for training the forthcoming generation of AI models for precision medicine and drug target identification. This, in turn, will illuminate the biological pathways behind some of the deadliest diseases worldwide."
Industry Impact and Partnerships
Merck, one of the founding participants, plans to adopt the Atlas for accelerating its precision medicine initiatives throughout its drug discovery pipelines. The data generated from the Atlas will support Merck's proprietary AI and machine learning (ML) models, driving the development of virtual cell models aimed at enhancing disease indication predictions. Iya Khalil, Vice President and Head of Data, A.I. Genome Sciences at Merck, highlighted that by utilizing comprehensive genomic datasets, scientists can construct AI models that reflect genuine biological variation, thus translating insights into impactful targets and precision biomarkers.
The Atlas will meticulously document how 1 billion individual cells react to genetic modifications through CRISPR technology, encompassing over 200 disease-relevant cell lines. These cell lines have been carefully selected due to their significance in understanding diseases that have previously been challenging to decode, such as various cancers, immune disorders, and specific metabolic and neurological diseases.
Transforming Drug Discovery