Algorithmic data efficient learning in the era of large model.

dc.contributor.advisor: Jamieson, Kevin KJ
dc.contributor.advisor: Du, Simon SD
dc.contributor.author: Chen, Yifang
dc.date.accessioned: 2025-10-02T16:07:16Z
dc.date.available: 2025-10-02T16:07:16Z
dc.date.issued: 2025-10-02
dc.date.submitted: 2025
dc.description: Thesis (Ph.D.)--University of Washington, 2025
dc.description.abstract: In the race towards Artificial General Intelligence, data is the fuel that powers our most advanced models. Vision-Language Models like LLaVA and CLIP are trained on billions of image-text pairs, while Large Language Models (LLMs) like GPT and Claude may process trillions of text samples. Despite the abundance of data, ensuring its quality and curating it effectively remains more of an art than a science. Curation must handle real-world data that is multimodal, noisy, and lacks a guaranteed relationship to target tasks. The difficulty is compounded by the complex training dynamics of neural networks, where the value of each data point depends heavily on the evolving state of model training. Without principled guidance, these challenges often create systematic blind spots, and their impact remains unclear due to a lack of theoretical understanding. My research aims to develop theoretical foundations for data curation by designing theory-inspired algorithms under realistic assumptions and by establishing systematic empirical evaluation frameworks to understand the limitations of existing methods. It covers four areas: (1) target-aware data curation in pretraining, (2) label-efficient finetuning, (3) inference-efficient data synthesis, and (4) interactive learning theory.
dc.embargo.terms: Open Access
dc.format.mimetype: application/pdf
dc.identifier.other: Chen_washington_0250E_28664.pdf
dc.identifier.uri: https://hdl.handle.net/1773/53957
dc.language.iso: en_US
dc.rights: none
dc.subject: Active Learning
dc.subject: Data selection
dc.subject: Data-centric AI
dc.subject: Interactive Learning
dc.subject: Large Models
dc.subject: Reinforcement Learning
dc.subject: Computer science
dc.subject.other: Computer science and engineering
dc.title: Algorithmic data efficient learning in the era of large model.
dc.type: Thesis

Files

Original bundle

Name: Chen_washington_0250E_28664.pdf
Size: 8.21 MB
Format: Adobe Portable Document Format