Toward Efficient Machine Learning Systems with Sampling and Compression.

dc.contributor.advisor: Ceze, Luis
dc.contributor.author: Lin, Chien-Yu
dc.date.accessioned: 2025-08-01T22:19:27Z
dc.date.available: 2025-08-01T22:19:27Z
dc.date.issued: 2025-08-01
dc.date.submitted: 2025
dc.description: Thesis (Ph.D.)--University of Washington, 2025
dc.description.abstract: The rapid growth of machine learning, in terms of model size, dataset volume, and task complexity, has created significant computational and memory efficiency challenges. This thesis addresses these challenges by developing sampling and compression techniques across a variety of machine learning applications. Specifically, we introduce sampling strategies for efficient graph neural networks (CacheSample), accelerated 3D image rendering (FastSR-NeRF), and optimized retrieval-augmented generation systems (TeleRAG). We further propose novel compression methods targeting convolutional neural networks (SPIN), large language models (Atom), and their key-value caches (Palu). Collectively, these techniques substantially reduce computational and memory requirements while preserving model accuracy, facilitating the scalability and accessibility of machine learning systems. Finally, we present our vision for future efficiency innovations to ensure continued scalability and robustness as machine learning models grow in complexity.
dc.embargo.terms: Open Access
dc.format.mimetype: application/pdf
dc.identifier.other: Lin_washington_0250E_28591.pdf
dc.identifier.uri: https://hdl.handle.net/1773/53496
dc.language.iso: en_US
dc.rights: none
dc.subject: efficient machine learning
dc.subject: machine learning system
dc.subject: Computer science
dc.subject.other: Computer science and engineering
dc.title: Toward Efficient Machine Learning Systems with Sampling and Compression.
dc.type: Thesis

Files

Original bundle

Name: Lin_washington_0250E_28591.pdf
Size: 14.72 MB
Format: Adobe Portable Document Format