Data-Aware Complexity Analysis and Program Optimization

dc.contributor.advisorSuciu, Dan
dc.contributor.advisorBalazinska, Magda
dc.contributor.authorDeeds, Kyle
dc.date.accessioned2025-10-02T16:07:27Z
dc.date.available2025-10-02T16:07:27Z
dc.date.issued2025-10-02
dc.date.submitted2025
dc.descriptionThesis (Ph.D.)--University of Washington, 2025
dc.description.abstractThis dissertation explores the problem of analyzing and optimizing data-dependent programs from a theoretical and practical perspective. The performance of these programs depends in a complex manner on the distribution of the input data, and they arise in many contexts, e.g. databases, sparse tensor programming, and graph analytics. By definition, these programs cannot be optimized by considering the code alone, so an optimizer for them must consider information about the data distribution. This dissertation presents two new theoretical approaches for analyzing data-dependent programs by bounding the size of their intermediate results: the degree sequence bound and partition constraints. It then describes two practical systems for producing these bounds: SafeBound and COLOR. Lastly, we present a state-of-the-art optimizer for sparse tensor programs, Galley, that demonstrates the value of data-aware optimization.
dc.embargo.termsOpen Access
dc.format.mimetypeapplication/pdf
dc.identifier.otherDeeds_washington_0250E_28848.pdf
dc.identifier.urihttps://hdl.handle.net/1773/53971
dc.language.isoen_US
dc.rightsCC BY
dc.subjectComplexity Analysis
dc.subjectDatabase Systems
dc.subjectDatabase Theory
dc.subjectJoin Algorithms
dc.subjectProgram Optimization
dc.subjectSparse Tensor Programming
dc.subjectComputer science
dc.subject.otherComputer science and engineering
dc.titleData-Aware Complexity Analysis and Program Optimization
dc.typeThesis

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Deeds_washington_0250E_28848.pdf
Size:
3.63 MB
Format:
Adobe Portable Document Format