Machine Learning of Amino Acid Composition Models for Protein Redesign

dc.contributor.advisorBaneyx, François
dc.contributor.advisorBeck, David A.C
dc.contributor.authorXiao, Sijia
dc.date.accessioned2019-08-14T22:30:31Z
dc.date.available2019-08-14T22:30:31Z
dc.date.issued2019-08-14
dc.date.submitted2019
dc.descriptionThesis (Master's)--University of Washington, 2019
dc.description.abstractProteins from thermophiles can preserve their basic structures and original functions at high temperature. However, most of the mesophilic proteins are vulnerable to such extreme condition due to their different amino acid composition. Improve the thermostability of thermophilic proteins will reduce the cost of from storage and production in industrial process. Nowadays, machine learning becomes a powerful method for data-intensive computation. This work provides a well-tuned network, called Thermalizer, which applies Recurrent Neural Network (RNN) to encode the mesophilic proteins to thermostable proteins which is predicted to be able to perform expected function in higher temperature. The toolkit also provides workflow from gene and amino acid sequence preprocessing to encoder and decoder construction, model training and evaluation, and translation window. The project is accessible on GitHub: https://github.com/BeckResearchLab/thermalizer.
dc.embargo.termsOpen Access
dc.format.mimetypeapplication/pdf
dc.identifier.otherXiao_washington_0250O_20291.pdf
dc.identifier.urihttp://hdl.handle.net/1773/44090
dc.language.isoen_US
dc.rightsnone
dc.subjectChemical engineering
dc.subject.otherChemical engineering
dc.titleMachine Learning of Amino Acid Composition Models for Protein Redesign
dc.typeThesis

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Xiao_washington_0250O_20291.pdf
Size:
1.59 MB
Format:
Adobe Portable Document Format