Machine Learning-Based Determination of Protein Secondary Structures

dc.contributor.advisor: Overney, Rene M.
dc.contributor.author: Lee, Huan-Jui
dc.date.accessioned: 2021-08-26T18:07:39Z
dc.date.issued: 2021-08-26
dc.date.submitted: 2021
dc.description: Thesis (Master's)--University of Washington, 2021
dc.description.abstract: The main purpose of this thesis is to construct a machine learning model that predicts protein secondary structures from sequences and circular dichroism (CD) spectra, and to test the contribution of each part. This effort is motivated by the desire to reduce the cost and time involved in state-of-the-art approaches, which rely on elaborate instrumentation such as nuclear magnetic resonance (NMR) and X-ray powder diffraction (XRD). Conformational analysis based on current experimental methods requires preparation and analytical processes that are often hampered by sample impurities and aging, and by limitations originating from crystal cultures. A well-developed machine learning algorithm based on existing conformational data provides an easier and faster way to predict unknown protein conformations. In this research, we make use of CD spectra and improvements to the machine learning model. The algorithm used in this thesis is based on Long Short-Term Memory (LSTM) and Convolutional Neural Network (CNN) architectures; we analyzed the performance of single and stacked models. The results indicate that both the stacked model and the CD spectra help improve prediction accuracy.
dc.embargo.lift: 2023-08-16T18:07:39Z
dc.embargo.terms: Restrict to UW for 2 years -- then make Open Access
dc.format.mimetype: application/pdf
dc.identifier.other: Lee_washington_0250O_23200.pdf
dc.identifier.uri: http://hdl.handle.net/1773/47377
dc.language.iso: en_US
dc.rights: none
dc.subject: circular dichroism
dc.subject: machine learning
dc.subject: protein
dc.subject: secondary structure
dc.subject: Chemical engineering
dc.subject: Bioinformatics
dc.subject.other: Chemical engineering
dc.title: Machine Learning-Based Determination of Protein Secondary Structures
dc.type: Thesis

Files

Original bundle

Name: Lee_washington_0250O_23200.pdf
Size: 513.49 KB
Format: Adobe Portable Document Format