← Back to News

AI Just Graduated From Writing Essays to Writing DNA — Meet Evo 2, the Foundation Model for All Life

Evo 2 foundation model infographic showing 1-million nucleotide context window, zero-shot pathogenicity prediction, and generative biology capabilities

While the world is focused on LLMs writing code and emails, a landmark paper just published in Nature (March 2026) proves AI is now capable of writing the Code of Life itself.

The "GPT Moment" for Biology

For decades, genomics has been a "Read-Only" field. We spent billions reading DNA, but we struggled to understand the "syntax." Evo 2 changes this by treating the genome not as a string of letters, but as a complex programming language with a 1-million-token context window.

Published by the Arc Institute in collaboration with Salk Institute and Stanford, Evo 2 is a 7-billion parameter model trained on the entire genomic diversity of the planet — 9.3 trillion nucleotide tokens spanning bacteria, archaea, and eukaryotes.

The Technical Breakthroughs

  • 1-Million Nucleotide Context Window: Previous models could only "see" ~1,000 base pairs at a time — like understanding a massive codebase by reading three lines. Evo 2 sees how a mutation in a non-coding region on one side of a chromosome affects a gene 500,000 base pairs away.
  • Zero-Shot Pathogenicity Prediction: It can predict if a mutation is dangerous (like BRCA1 variants) without ever seeing that specific mutation before — simply by understanding the "grammar" of the surrounding genome. Accuracy: 90%+.
  • Generative Biology: Evo 2 doesn't just predict; it designs. Researchers used it to generate synthetic chromosomes and functional protein structures that have never existed in nature.
  • StripedHyena 2 Architecture: Bypasses the quadratic scaling limits of traditional Transformers — the only way to reach a 1-million token window without melting a server farm.

The Performance Metrics

Discovery Rate+40% accelerated biological discovery
Pathogenicity Prediction90%+ accuracy on zero-shot variant classification
Drug Design2x faster protein and enzyme design pipeline
LicenseOpen Source (Apache 2.0) — fully available to the research community

The Economic Impact: From "In-Vitro" to "In-Silico"

This is where the financial disruption hits. Traditional companies like Illumina (ILMN) and Thermo Fisher (TMO) rely on physical lab kits and sequencing machines to test for genetic variants.

If an AI model can predict the effect of a mutation with 90%+ accuracy from a digital file, the need for expensive, repetitive "wet lab" testing drops dramatically. We are seeing capital shift away from "Hardware-Heavy" sequencing toward AI-Biotech companies like Recursion (RXRX), Schrodinger (SDGR), and Absci (ABSI).

The "Compiler for Life"

Biology is no longer just a study of nature — it is now a programmable field. If DNA is the software, Evo 2 is the first true IDE (Integrated Development Environment) for life. We have the compiler. Now we just need to write the code.

And the most remarkable part? It's fully open source. Unlike proprietary pharma models locked behind corporate walls, Evo 2 is available to every researcher, every university, and every startup on the planet.

We are moving from "Reading" life to "Programming" it.

Sources: Nature: Genome modelling and design across all domains of life with Evo 2, Arc Institute

Source: Nature ↗