Sparse Partial Least Squares Methods and Extensions for Modeling Heart-Healthy Diets

dc.contributor.advisor: McClelland, Robyn
dc.contributor.author: Gasca, Natalie
dc.date.accessioned: 2021-10-29T16:18:49Z
dc.date.available: 2021-10-29T16:18:49Z
dc.date.issued: 2021-10-29
dc.date.submitted: 2021
dc.description: Thesis (Ph.D.)--University of Washington, 2021
dc.description.abstract: When investigating the link between diet and cardiovascular disease (CVD), nutritional epidemiologists often use unsupervised methods to construct dietary patterns using hundreds of foods. We posit that diet summaries can be better tailored to CVD by incorporating outcome data and sparsity. Partial least squares (PLS) is an appealing supervised method because its patterns are correlated with a continuous response while also capturing covariate variability. However, its statistical and modeling assumptions are not well characterized for non-continuous outcomes. In this dissertation, we clarify the implications of incorporating PLS into linear and Cox models, with the aim of constructing parsimonious patterns to facilitate hypothesis generation. First, we identify an advantageous sparse PLS procedure (SPLS) that targets variable selection for continuous data. We propose using SPLS after fitting the Cox or approximate Cox model to analyze a right-censored survival outcome. To enable proper adjustment for covariates that do not require dimension reduction, we demonstrate that various scientific premises and goals require different types of adjustment when using PLS. These contributions are verified by simulation studies, analytic results, and applications to CVD-related endpoints. Our findings allow for more informed method selection and deeper insight connecting least squares and PLS regression coefficients.
dc.embargo.terms: Open Access
dc.format.mimetype: application/pdf
dc.identifier.other: Gasca_washington_0250E_23367.pdf
dc.identifier.uri: http://hdl.handle.net/1773/47948
dc.language.iso: en_US
dc.rights: CC BY
dc.subject: Data integration
dc.subject: Dimension reduction
dc.subject: Partial least squares
dc.subject: Sparsity
dc.subject: Survival
dc.subject: Variable selection
dc.subject: Statistics
dc.subject.other: Biostatistics
dc.title: Sparse Partial Least Squares Methods and Extensions for Modeling Heart-Healthy Diets
dc.type: Thesis

Files

Original bundle

Name: Gasca_washington_0250E_23367.pdf
Size: 3.23 MB
Format: Adobe Portable Document Format
