Using non-linear, machine learning methodology to assess the potential metabolomic-based biomarkers of total fat and percentage fat intake using a controlled feeding study

dc.contributor.advisorNeuhouser, Marian
dc.contributor.authorNondin, Caroline Lea
dc.date.accessioned2024-02-12T23:42:02Z
dc.date.available2024-02-12T23:42:02Z
dc.date.issued2024-02-12
dc.date.submitted2023
dc.descriptionThesis (Master's)--University of Washington, 2023
dc.description.abstractBackground: Understanding and identifying objective dietary biomarkers is a crucial component of nutrition research today. By investigating the relationship between biomarker profiles and dietary intake using machine learning methodologies, there could be a way to more objectively assess study participant nutrient profiles and better understand the relationship between nutrient intake and disease. Our aim in this thesis is to assess the utility of non-linear tree-based models in predicting daily intake of total fat and the percent of energy from fat from serum and 24-h urine high dimensional metabolites. Methods: Our analysis used the dataset from a 2-week controlled feeding study mimicking the participants’ habitual diets among 153 post-menopausal women from the Nutrition and Physical Activity Assessment Study Feeding study, conducted in the Women’s Health Initiative (WHI). Fasting serum metabolite profiles, urine metabolites, as well as demographic and Food Frequency Questionnaire (FFQ) data, were used to predict total fat and percent energy from fat using four cross-validated tree-based machine learning models. A LASSO model for regression was used as a way to compare the models to a linear model. Results: The highest cross-validated multiple correlation coefficients (CV-R2) for total fat intake and percent energy intake were 10.2% and 10.4%, respectively. None of the models had a CV-R2 of over 36%. There were no significant differences found between the performance of linear and non-linear models in predicting fat intake. Conclusion: Both linear and non-linear models were shown to be unable to predict total fat and dietary fat intake using serum and urinary metabolites accurately and reliably. Variable importance suggests that tree-based, machine-learning models have the potential to help understand non-linear interactions between biomarkers and dietary intake.
dc.embargo.termsOpen Access
dc.format.mimetypeapplication/pdf
dc.identifier.otherNondin_washington_0250O_26457.pdf
dc.identifier.urihttp://hdl.handle.net/1773/51228
dc.language.isoen_US
dc.rightsnone
dc.subjectBiomarkers
dc.subjectFat
dc.subjectMachine Learning
dc.subjectNutrition
dc.subject.otherNutritional sciences
dc.titleUsing non-linear, machine learning methodology to assess the potential metabolomic-based biomarkers of total fat and percentage fat intake using a controlled feeding study
dc.typeThesis

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Nondin_washington_0250O_26457.pdf
Size:
1.33 MB
Format:
Adobe Portable Document Format