Toward Efficient Machine Learning Systems with Sampling and Compression.
| dc.contributor.advisor | Ceze, Luis | |
| dc.contributor.author | Lin, Chien-Yu | |
| dc.date.accessioned | 2025-08-01T22:19:27Z | |
| dc.date.available | 2025-08-01T22:19:27Z | |
| dc.date.issued | 2025-08-01 | |
| dc.date.submitted | 2025 | |
| dc.description | Thesis (Ph.D.)--University of Washington, 2025 | |
| dc.description.abstract | The rapid growth of machine learning - in terms of model size, dataset volume, and task complexity - has created significant computational and memory efficiency challenges. This thesis addresses these challenges by developing sampling and compression techniques across a variety of machine learning applications. Specifically, we introduce sampling strategies for efficient graph neural networks (CacheSample), accelerated 3D image rendering (FastSR-NeRF), and optimized retrieval-augmented generation systems (TeleRAG). We further propose novel compression methods targeting convolutional neural networks (SPIN), large language models (Atom), and their key-value caches (Palu). Collectively, these techniques substantially reduce computational and memory requirements while preserving model accuracy, facilitating the scalability and accessibility of machine learning systems. Finally, I present my vision for future efficiency innovations to ensure continued scalability and robustness as machine learning models continue to grow in complexity. | |
| dc.embargo.terms | Open Access | |
| dc.format.mimetype | application/pdf | |
| dc.identifier.other | Lin_washington_0250E_28591.pdf | |
| dc.identifier.uri | https://hdl.handle.net/1773/53496 | |
| dc.language.iso | en_US | |
| dc.rights | none | |
| dc.subject | efficient machine learning | |
| dc.subject | machine learning system | |
| dc.subject | Computer science | |
| dc.subject.other | Computer science and engineering | |
| dc.title | Toward Efficient Machine Learning Systems with Sampling and Compression. | |
| dc.type | Thesis |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- Lin_washington_0250E_28591.pdf
- Size:
- 14.72 MB
- Format:
- Adobe Portable Document Format
