Molecular tagging to overcome limitations of massively parallel sequencing

dc.contributor.advisorShendure, Jayen_US
dc.contributor.authorHiatt, Josephen_US
dc.date.accessioned2013-04-17T17:57:44Z
dc.date.available2013-10-15T11:06:13Z
dc.date.issued2013-04-17
dc.date.submitted2012en_US
dc.descriptionThesis (Ph.D.)--University of Washington, 2012en_US
dc.description.abstractMassively parallel technologies have dramatically reduced the cost and increased the throughput of DNA sequencing, transforming the study of patterns of genetic variation in human and model organisms, the genetic basis of disease, and the organization, regulation, and function of genomes. However, the cost and throughput gains of the most widely used massively parallel sequencing platforms are offset by substantial drawbacks with respect to read length and base-calling accuracy, limiting their utility for many exciting and potentially important research and clinical applications. These include de novo assembly of genomes and metagenomes, diversity profiling of metagenomic communities and viral populations, synthetic functional assays to interrogate long genetic elements, and sensitive and highly multiplexed cancer-related gene sequencing. This dissertation describes a paradigm, "molecular tagging," which I developed to overcome the read length and error rate limitations of massively parallel sequencing, in the context of its use for three distinct applications. I first describe the development of a molecular tagging-based method called "Subassembly" and its application to de novo genome and metagenome assembly. Complexity-bottlenecked libraries of DNA fragments at least ~350 nucleotides in length were amplified, re-fragmented, and subjected to massively parallel sequencing such that we could identify groups of reads derived from the same longer progenitor fragment. Groups of reads were then locally assembled to generate highly accurate haploid consensus sequences that effectively corresponded to single input molecules, and these long consensus sequences were used to improve de novo genome assembly for single bacterial and metagenomic samples. Next, I describe the development and application of a massively parallel assay to dissect transcriptional enhancer elements with single nucleotide resolution. We applied this method to determine the functional consequences of all possible single nucleotide changes in three mammalian enhancers; this assay involved an optimized version of the Subassembly experimental protocol and analytical strategy. Finally, I describe the characterization and application of a multiplex assay for ultra-sensitive targeted sequencing of cancer-related genes in clinical tumor samples. I integrated the molecular tagging paradigm with the molecular inversion probe multiplex capture strategy to develop a method that is simultaneously simple, rapid, cost-effective, and ultra-sensitive for sub-clonal variation in genetically heterogeneous tissue samples. Molecular tagging is therefore a broadly applicable strategy to overcome key limitations of massively parallel sequencing that is expanding the utility of this already transformative group of technologies to a number of important research and clinical applications.en_US
dc.embargo.termsDelay release for 6 months -- then make Open Accessen_US
dc.format.mimetypeapplication/pdfen_US
dc.identifier.otherHiatt_washington_0250E_11165.pdfen_US
dc.identifier.urihttp://hdl.handle.net/1773/22450
dc.language.isoen_USen_US
dc.rightsCopyright is held by the individual authors.en_US
dc.subject.otherGeneticsen_US
dc.subject.othergeneticsen_US
dc.titleMolecular tagging to overcome limitations of massively parallel sequencingen_US
dc.typeThesisen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Hiatt_washington_0250E_11165.pdf
Size:
6.61 MB
Format:
Adobe Portable Document Format

Collections