Scaling High-Throughput Experimentation Unlocks Robust Reaction-Outcome Prediction
Michał Sadowski · Lukasz Sztukiewicz · Maria Wyrzykowska · Tadija Radusinović · Piotr Byrski · Paweł Włodarczyk-Pruszyński · Bartosz Matysiak · Jan Kulczycki · Filip Ulatowski · Ruard van Workum · Pawel Dabrowski-Tumanski · Paulina Wach · Filip Chmielewski · Jan Rzymkowski · Mateusz Bruno-Kamiński · Jan Busz · Artur Chołuj · Mateja Duda · Tomasz Dybowski · Marco Farinone · Tomasz Jeliński · Alicja Karczewska · Paweł Kowalczyk · Marek Pietrzak · Łukasz Szczupak · Aleksander Szkółka · Grzegorz Wojciechowski · Stanislaw Jastrzebski
Abstract
Organic chemistry underpins small-molecule drug discovery, yet—unlike structural biology—it lacks large, unbiased datasets for training broadly generalizable models. We report the largest microliter-scale high-throughput experimentation (HTE) campaign to date: $200{,}000$ reactions spanning three workhorse classes (Amide Coupling, Suzuki Coupling, Buchwald–Hartwig Coupling) involving $30{,}000$ products—over $4\times$ larger than the largest publicly disclosed dataset to date. This scale and diversity enable reaction-outcome predictors that generalize to unseen substrates. We introduce UniReact, a molecule-attention Transformer built on pretrained molecular encoders. Across the three reaction classes, our models achieve PR-AUC $2$--$3\times$ over random and ROC-AUC in the $70$--$86\%$ range. We further establish scaling laws for reaction-outcome prediction spanning three orders of magnitude of HTE data, and for one class up to $100{,}000$ reactions—\emph{to our knowledge}, the broadest HTE scaling study to date. In a human study on Suzuki coupling prioritization, our models outperform PhD-level chemists (precision $87.1\%$ at $50\%$ recall vs.~$60.8\%$). Finally, we show the first, to our best knowledge, demonstration of zero-shot transfer to an external HTE dataset. Taken together, these results support scaled HTE as a viable path to broadly applicable predictors of chemical reactivity that surpass human intuition and ultimately help discover novel chemistry.
Chat is not available.
Successful Page Load