Benchmarking diffusion models for predicting perturbed cellular responses
Abstract
Predicting cellular responses to perturbations is important for understanding biological processes. Although several benchmarks exist, the performance of diffusion models in perturbation prediction has not been systematically studied. Here, we compare the VAE-based model scGen against five diffusion models to assess their suitability for perturbation prediction. Our benchmark covers the prediction of known perturbations, unseen conditions, and stress-test evaluations. The results show that scGen outperforms diffusion models in most settings, while diffusion models demonstrate robustness to noisy data. We also find that encoder design strongly influences model stability and that evaluation metrics can lead to different conclusions. Diffusion models capture the responses of differentially expressed genes, but perform less well on non-DE genes. Overall, our study provides insights into the strengths and limitations of diffusion models, informing their future application in perturbation prediction.