Development of a machine learning pipeline to analyze biological multiple particle tracking datasets

dc.contributor.advisorNance, Elizabeth
dc.contributor.authorSCHIMEK, NELS
dc.date.accessioned2022-09-23T20:43:52Z
dc.date.available2022-09-23T20:43:52Z
dc.date.issued2022-09-23
dc.date.submitted2022
dc.descriptionThesis (Master's)--University of Washington, 2022
dc.description.abstractMultiple Particle Tracking (MPT) has been demonstrated as an important tool for understanding changes to biological environments. MPT studies are capable of generating gigabytes of data across hundreds to thousands of trajectories, making MPT datasets an interesting candidate for machine learning applications. To begin understanding the scope of biological questions that can be answered by coupling MPT datasets with machine learning techniques, an end-to-end data science pipeline is developed building off of recent work in the Nance Lab and applied to three unique datasets. To begin, Principal Components Analysis is applied in order to visualize the spread and distribution of the high dimensional MPT data. Next, a boosted decision tree model, XGBoost, is applied to determine the predictable capability of each dataset, and SHAP values are used to understand model predictions and find the statistical feature driving accurate predictions. Finally, XGBoost models are trained on trajectories from specific diffusion modes to determine any increase in accuracy. Overall, the pipeline presented demonstrates the capability to provide information across multiple biological questions.
dc.embargo.termsOpen Access
dc.format.mimetypeapplication/pdf
dc.identifier.otherSCHIMEK_washington_0250O_24336.pdf
dc.identifier.urihttp://hdl.handle.net/1773/49289
dc.language.isoen_US
dc.rightsCC BY-NC-ND
dc.subjectextracellular matrix
dc.subjectmachine learning
dc.subjectmicroscopy
dc.subjectmultiple particle tracking
dc.subjectChemistry
dc.subjectChemical engineering
dc.subject.otherChemistry
dc.titleDevelopment of a machine learning pipeline to analyze biological multiple particle tracking datasets
dc.typeThesis

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
SCHIMEK_washington_0250O_24336.pdf
Size:
696.66 KB
Format:
Adobe Portable Document Format

Collections