Statistical Methods to Estimate Evolutionary and Technical Parameters Using Whole Genome Sequence Data

dc.contributor.advisorBrowning, Sharon R
dc.contributor.authorMasaki, Nobuaki
dc.date.accessioned2025-10-02T16:05:36Z
dc.date.available2025-10-02T16:05:36Z
dc.date.issued2025-10-02
dc.date.submitted2025
dc.descriptionThesis (Ph.D.)--University of Washington, 2025
dc.description.abstractWhole genome sequence data are widely used in humans and other species to reveal evolutionary patterns and recent demographic history. In this dissertation, we introduce three new statistical methods that can be used to estimate technical parameters such as genotype error rates, as well as parameters related to genome evolution and recent demographic history, using whole genome sequence data from humans and SARS-CoV-2. In our first method, we propose a model that calculates the likelihood of observed parent-offspring trio genotypes, adjusting for both genotype errors and uncalled deletions. We fit our model to SNVs in 77 White British trios identified in the UK Biobank whole genome sequence data, obtaining estimates for the genotype error and uncalled deletion rates in this dataset. In our second method, we formulate a model to estimate the mean length of gene conversion tracts. Our model uses a separate per-site allele conversion rate for each observed tract. We fit this model to gene conversion tracts detected from the UK Biobank whole autosome sequence data and infer the mean length of gene conversion tracts in humans. Finally, in our third method, we propose a hidden Markov model that accounts for mutations and genotype errors to detect recombinant SARS-CoV-2 sequences.
dc.embargo.termsOpen Access
dc.format.mimetypeapplication/pdf
dc.identifier.otherMasaki_washington_0250E_28852.pdf
dc.identifier.urihttps://hdl.handle.net/1773/53924
dc.language.isoen_US
dc.rightsCC BY-NC-ND
dc.subjectGenetics
dc.subjectStatistics
dc.subjectEpidemiology
dc.subject.otherBiostatistics
dc.titleStatistical Methods to Estimate Evolutionary and Technical Parameters Using Whole Genome Sequence Data
dc.typeThesis

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Masaki_washington_0250E_28852.pdf
Size:
5.21 MB
Format:
Adobe Portable Document Format

Collections