Timezone: »

 
Protein structure generation via folding diffusion
Kevin Wu · Kevin Yang · Rianne van den Berg · James Zou · Alex X Lu · Ava Soleimany

The ability to computationally generate novel yet physically foldable protein structures could lead to new biological discoveries and new treatments targeting yet incurable diseases. Despite recent advances in protein structure prediction, directly generating diverse, novel protein structures from neural networks remains difficult. In this work, we present a new diffusion-based generative model that designs protein backbone structures via a procedure that mirrors the native folding process. We describe protein backbone structure as a series of consecutive angles capturing the relative orientation of the constituent amino acid residues, and generate new structures by denoising from a random, unfolded state towards a stable folded structure. Not only does this mirror how proteins biologically twist into energetically favorable conformations, the inherent shift and rotational invariance of this representation crucially alleviates the need for complex equivariant networks. We train a denoising diffusion probabilistic model with a simple transformer backbone and demonstrate that our resulting model unconditionally generates highly realistic protein structures with complexity and structural patterns akin to those of naturally-occurring proteins. As a useful resource, we release the first open-source codebase and trained models for protein structure diffusion.

Author Information

Kevin Wu (Stanford University)
Kevin Yang (Microsoft)
Rianne van den Berg (Microsoft Research)
James Zou (Stanford University)
Alex X Lu (Microsoft Research)

I’m a Senior Researcher at Microsoft Research New England, in the BioML group. I’m interested in how machine learning can help us discover new insights from biological data, by finding patterns that are too subtle or large-scale to identify unassisted. I primarily focus on biological images, and my research often designs self-supervised learning methods, as I believe these methods are unbiased by prior knowledge.

Ava Soleimany (Microsoft Research)

More from the Same Authors