Development of a machine learning pipeline to analyze biological multiple particle tracking datasets
| dc.contributor.advisor | Nance, Elizabeth | |
| dc.contributor.author | SCHIMEK, NELS | |
| dc.date.accessioned | 2022-09-23T20:43:52Z | |
| dc.date.available | 2022-09-23T20:43:52Z | |
| dc.date.issued | 2022-09-23 | |
| dc.date.submitted | 2022 | |
| dc.description | Thesis (Master's)--University of Washington, 2022 | |
| dc.description.abstract | Multiple Particle Tracking (MPT) has been demonstrated as an important tool for understanding changes to biological environments. MPT studies are capable of generating gigabytes of data across hundreds to thousands of trajectories, making MPT datasets an interesting candidate for machine learning applications. To begin understanding the scope of biological questions that can be answered by coupling MPT datasets with machine learning techniques, an end-to-end data science pipeline is developed building off of recent work in the Nance Lab and applied to three unique datasets. To begin, Principal Components Analysis is applied in order to visualize the spread and distribution of the high dimensional MPT data. Next, a boosted decision tree model, XGBoost, is applied to determine the predictable capability of each dataset, and SHAP values are used to understand model predictions and find the statistical feature driving accurate predictions. Finally, XGBoost models are trained on trajectories from specific diffusion modes to determine any increase in accuracy. Overall, the pipeline presented demonstrates the capability to provide information across multiple biological questions. | |
| dc.embargo.terms | Open Access | |
| dc.format.mimetype | application/pdf | |
| dc.identifier.other | SCHIMEK_washington_0250O_24336.pdf | |
| dc.identifier.uri | http://hdl.handle.net/1773/49289 | |
| dc.language.iso | en_US | |
| dc.rights | CC BY-NC-ND | |
| dc.subject | extracellular matrix | |
| dc.subject | machine learning | |
| dc.subject | microscopy | |
| dc.subject | multiple particle tracking | |
| dc.subject | Chemistry | |
| dc.subject | Chemical engineering | |
| dc.subject.other | Chemistry | |
| dc.title | Development of a machine learning pipeline to analyze biological multiple particle tracking datasets | |
| dc.type | Thesis |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- SCHIMEK_washington_0250O_24336.pdf
- Size:
- 696.66 KB
- Format:
- Adobe Portable Document Format
