Effective Model Deployment and Data Curation for Foundation Model Development

dc.contributor.advisor: Krishna, Ranjay
dc.contributor.advisor: Ratner, Alexander J.
dc.contributor.author: Hsieh, Cheng-Yu
dc.date.accessioned: 2025-10-02T16:07:22Z
dc.date.available: 2025-10-02T16:07:22Z
dc.date.issued: 2025-10-02
dc.date.submitted: 2025
dc.description: Thesis (Ph.D.)--University of Washington, 2025
dc.description.abstract: While scaling, in both model size and dataset volume, has driven many of the recent breakthroughs in AI, this increasingly large-scale development trajectory faces emergent challenges not seen in traditional small-scale supervised learning settings. On the model side, the exponential growth in parameter counts has rendered these highly capable but massive models prohibitively expensive to deploy or adapt for many practical applications. On the data side, although training on massive datasets improves performance on standard benchmarks, scaling alone does not guarantee the emergence of desirable model behaviors beyond traditional metrics. This thesis develops techniques to address core challenges along both the model and data axes of modern AI development. Specifically, it proposes strategies for the efficient deployment and adaptation of Transformer-based large language models, and introduces principled methods for curating reliable and effective data to evaluate and improve modern vision-language models beyond standard accuracy metrics. Collectively, these contributions aim to make large-scale AI systems more effective and accessible across diverse real-world scenarios.
dc.embargo.terms: Open Access
dc.format.mimetype: application/pdf
dc.identifier.other: Hsieh_washington_0250E_28690.pdf
dc.identifier.uri: https://hdl.handle.net/1773/53968
dc.language.iso: en_US
dc.rights: CC BY
dc.subject: Artificial intelligence
dc.subject: Computer science
dc.subject.other: Computer science and engineering
dc.title: Effective Model Deployment and Data Curation for Foundation Model Development
dc.type: Thesis

Files

Original bundle

Name: Hsieh_washington_0250E_28690.pdf
Size: 24.96 MB
Format: Adobe Portable Document Format