Diffusion Models for Protein Structure Design: From Backbone Generation to Atomic-Resolution Enzyme Design

dc.contributor.advisorBaker, David
dc.contributor.authorAhern, Woody
dc.date.accessioned2025-08-01T22:19:31Z
dc.date.available2025-08-01T22:19:31Z
dc.date.issued2025-08-01
dc.date.submitted2025
dc.descriptionThesis (Ph.D.)--University of Washington, 2025
dc.description.abstractThe field of protein structure modeling has been revolutionized by the introduction of deep learning methods, particularly AlphaFold2, which has achieved near-experimental accuracy in predicting protein structures from amino acid sequences. This dissertation explores the application of diffusion models to create general solutions to protein design tasks. We introduce RFdiffusion, a model that generates protein structures as a series of backbone frames, which achieves state of the art performance on unconditional generation, motif scaffolding, and protein-protein binder design. We then leverage a broadened molecular vocabulary to predict general biomolecular structures including nucleic acids, small molecules, post-translational modifications, metals, and ions with RoseTTAFoldAA. Using the RoseTTAFoldAA architecture we finetune a diffusion model capable of generating proteins which bind small molecules. Finally, we present RFdiffusion2, a flow-matching model trained from random weight initializations capable of unindexed atomic motif scaffolding, enabling the design of enzymes with complex active sites. In all cases we validate the design capabilities of the models \textit{in vitro}. Our work demonstrates the potential of diffusion models to advance the field of protein design and opens new avenues for enzyme engineering.
dc.embargo.termsOpen Access
dc.format.mimetypeapplication/pdf
dc.identifier.otherAhern_washington_0250E_28351.pdf
dc.identifier.urihttps://hdl.handle.net/1773/53498
dc.language.isoen_US
dc.rightsCC BY
dc.subjectArtificial intelligence
dc.subjectComputational chemistry
dc.subjectBiochemistry
dc.subject.otherComputer science and engineering
dc.titleDiffusion Models for Protein Structure Design: From Backbone Generation to Atomic-Resolution Enzyme Design
dc.typeThesis

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Ahern_washington_0250E_28351.pdf
Size:
68.68 MB
Format:
Adobe Portable Document Format