Statistical Issues in Microbiome Data Analysis: Batch Effects and Multi-Omics Analysis

dc.contributor.advisorWu, Michael
dc.contributor.authorFu, Weijia
dc.date.accessioned2019-08-14T22:29:55Z
dc.date.issued2019-08-14
dc.date.submitted2019
dc.descriptionThesis (Master's)--University of Washington, 2019
dc.description.abstractProgress in high throughput sequencing has facilitated the conduct of large scale microbiome profiling studies which have already begun to elucidate the role of microbes in many disorders and clinical outcomes. Despite the many successes, statistical analysis of data from these studies continues to pose a challenge. In the thesis, we proposed methods to study two specific challenges: batch effects and integrative analysis of microbiome and other omics data. Both issues are increasingly relevant problems. As studies get larger, batching becomes inevitable and integrative analysis is imperative for gaining clues as to the mechanisms underlying discovered associations. The thesis is composed of two projects. In the first project, we compared six existing batch correction methods for microarray data when applied to microbiome data. Two real microbiome data sets were used to evaluate the performance using data visualization and several evaluation metrics. Our results suggest that an empirical bayes approach (ComBat), when applied appropriately, can outperform other methods. In the second project, we proposed a robust microbiome regression-based kernel association test (MiRKAT-R) to screen a large number of genomic markers for association with microbiome profiles. This approach utilizes a recently developed robust kernel machine test. We further propose to incorporate an omnibus test that simultaneously considers different models so as to allow for different relationships between the individual markers and microbiome composition. Systematic simulations and applications to real data show that the MiRKAT-R improves both type I error control and power.
dc.embargo.lift2020-08-13T22:29:55Z
dc.embargo.termsRestrict to UW for 1 year -- then make Open Access
dc.format.mimetypeapplication/pdf
dc.identifier.otherFu_washington_0250O_20443.pdf
dc.identifier.urihttp://hdl.handle.net/1773/44069
dc.language.isoen_US
dc.rightsnone
dc.subjectAssociation Test
dc.subjectBatch Effects
dc.subjectMicrobiome
dc.subjectMulti-Omics Analysis
dc.subjectBiostatistics
dc.subject.otherBiostatistics
dc.titleStatistical Issues in Microbiome Data Analysis: Batch Effects and Multi-Omics Analysis
dc.typeThesis

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Fu_washington_0250O_20443.pdf
Size:
1.02 MB
Format:
Adobe Portable Document Format

Collections