Each failure compounds the next. Stitched together, they produce predictions that look credible and aren't.
16% of top-20 oncology targets flip when names are revealed.
Cuccarese (Recursion VP), 2026 · public disclosure
Up to 87% of medical pre-training content persists through fine-tuning.
Memorization in LLMs in Medicine, 2025.
Phenotype-driving cells: <1% of a sample. Bulk -omics averages them out.
Van de Sande et al., Nature Reviews Drug Discovery, 2023.
Reading thousands of papers at depth breaks on cost, time, and orchestration debt.
Each component is replicable. The intersection is what holds.
Every claim points to a paper figure, in-silico run, or experiment ID. The model never paraphrases from memory — it reads what it cites.
Reasoning agents chase specific questions through the graph — surfacing the low-magnitude signals at cell and sub-population scale where disease-driving biology sits.
A fact extracted in cancer is reusable in autoimmune, neuro, ageing. Graph grows ~88 arm-level facts per discovery cycle.
McKinsey · 2024 ~32% of pharma have scaled gen AI; ~5% report consistent differentiation.
Verifiable output at scale is what unblocks the 95%.
Claim-level freedom-to-operate.
Auditable claim validation.
What testable hypotheses fall out of this dataset?
What in here is novel and defendable?
What does the raw cohort data reveal at single-cell resolution?
If I modify a target — add, inhibit, or combine — what happens downstream?
In active developmentFive iterative steps from data to therapy. We push verifiable AI into every step of the way.
Three live proof points — already in customers' hands.
Working at the intersection of biology, AI, and architecture.

