Exploring the Trinity of Protein Science: Structure, Stability, and Function Through the Lens of Machine Learning

dc.contributor.advisorBeck, David A. C.
dc.contributor.authorAlanzi, Humood
dc.date.accessioned2024-09-09T23:05:08Z
dc.date.issued2024-09-09
dc.date.submitted2024
dc.descriptionThesis (Master's)--University of Washington, 2024
dc.description.abstractMachine learning and deep learning are revolutionizing protein science by enabling the prediction of complex, emergent biophysical properties. This thesis presents two novel computational models that leverage these technologies to predict protein thermostability and function, illustrating how they can serve as powerful hypothesis generators within iterative "design, build, test, and learn" cycles. Chapter 2 details NOMELT, a generative model trained as a neural machine translator between mesophilic and thermophilic protein domains, which uses a vast new dataset of homologous protein pairs to enhance the stability of generated thermophilic sequences. Chapter 3 introduces the PairProphet pipeline, which integrates diverse sequence and structural data to predict functional similarities between protein pairs with high accuracy, highlighting the importance of sequence-based features and the potential limitations of current structural analysis techniques. The thesis suggests that integrating ecological information and pangenomic analyses could further enhance the predictive power of these models, pointing to these approaches as promising areas for future research. This work contributes to a deeper understanding of protein behaviors under diverse environmental conditions and suggests pathways to more effectively design proteins for therapeutic and industrial applications.
dc.embargo.lift2025-09-09T23:05:08Z
dc.embargo.termsRestrict to UW for 1 year -- then make Open Access
dc.format.mimetypeapplication/pdf
dc.identifier.otherAlanzi_washington_0250O_27198.pdf
dc.identifier.urihttps://hdl.handle.net/1773/51823
dc.language.isoen_US
dc.rightsCC BY
dc.subjectMachine Learning
dc.subjectProtein Design
dc.subjectProtein Function
dc.subjectThermostability
dc.subjectChemical engineering
dc.subjectBioinformatics
dc.subjectBioengineering
dc.subject.otherChemical engineering
dc.titleExploring the Trinity of Protein Science: Structure, Stability, and Function Through the Lens of Machine Learning
dc.typeThesis

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Alanzi_washington_0250O_27198.pdf
Size:
7.47 MB
Format:
Adobe Portable Document Format