Towards Adaptive Intelligence

dc.contributor.advisorFarhadi, Ali
dc.contributor.advisorKakade, Sham
dc.contributor.authorKusupati, Aditya
dc.date.accessioned2024-09-09T23:06:29Z
dc.date.available2024-09-09T23:06:29Z
dc.date.issued2024-09-09
dc.date.submitted2024
dc.descriptionThesis (Ph.D.)--University of Washington, 2024
dc.description.abstractLiving beings, including humans, are highly adaptive, especially in terms of context and compute (resources). While intelligent machine learning systems are ubiquitous today, their current rigid design hinders adaptation as they struggle with ever-changing data, use cases, and deployment settings, requiring dedicated efforts to function properly. In this thesis, I present my work towards enabling adaptive machine learning solutions for flexible and seamless deployment across widely changing scenarios. First, I present Matryoshka information packing for adaptive data representations to handle growing data size and task-specific usage seamlessly. Then, I build a web-scale search system, AdANNS, leveraging matryoshka representations to enable adaptive search across data. Next, I extend these principles to the neural networks, crafting MatFormer models. This next-generation Transformer architecture adapts its computational footprint based on input and device with minimal overhead during deployment. Along the way, I worked on the first end-to-end learnable sparsity solution to solve the problem of optimal compute allocation across layers of neural networks. Further, to address the inherent rigidity in the design of web-scale intelligent systems, I worked on differentiable search solutions, fundamentally rethinking how large-scale AI pipelines harness data for continuous improvement. Finally, I conclude with the impact these works had in real-world deployments and present future works directed towards adaptive contextual and continual intelligence across disciplines.
dc.embargo.termsOpen Access
dc.format.mimetypeapplication/pdf
dc.identifier.otherKusupati_washington_0250E_26796.pdf
dc.identifier.urihttps://hdl.handle.net/1773/51880
dc.language.isoen_US
dc.rightsCC BY
dc.subjectAdaptive Compute
dc.subjectDeep Learning
dc.subjectMachine Learning
dc.subjectSearch
dc.subjectComputer science
dc.subject.otherComputer science and engineering
dc.titleTowards Adaptive Intelligence
dc.typeThesis

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Kusupati_washington_0250E_26796.pdf
Size:
10.03 MB
Format:
Adobe Portable Document Format