Beyond Scaling: Frontiers of Retrieval-Augmented LMs

dc.contributor.advisorHajishirzi, Hannaneh
dc.contributor.authorAsai, Akari
dc.date.accessioned2025-08-01T22:19:40Z
dc.date.available2025-08-01T22:19:40Z
dc.date.issued2025-08-01
dc.date.submitted2025
dc.descriptionThesis (Ph.D.)--University of Washington, 2025
dc.description.abstractLanguage Models (LMs) have made significant progress by scaling training data and model sizes. However, they still face key limitations, including hallucinations and outdated knowledge, which undermine their reliability especially in expert domains like scientific research and software development. In this thesis, I argue that overcoming these challenges requires moving beyond monolithic LMs toward Augmented LMs: systems that are designed, trained, and deployed alongside complementary modules to improve reliability and efficiency. Specifically, my work has pioneered the field of Retrieval-Augmented LMs, which precisely locate relevant knowledge from large-scale text data and incorporate them at inference time. I begin by analyzing the limitations of current LMs and demonstrate how retrieval augmentation provides a more reliable, adaptable, and efficient path forward. I then introduce our work on establishing new foundations for Retrieval-Augmented LMs, moving beyond simple post-hoc combinations of off-the-shelf models to tackle challenges driven by broader adoption. Finally, I highlight the real-world impact of Retrieval-Augmented LMs through applications in domains such as scientific literature synthesis. Our fully open \osmodel system are now used by over 30,000 researchers. I conclude by outlining our vision for the future of Augmented LMs, including better handling of heterogeneous modalities, flexible integration with diverse components, and rigorous interdisciplinary evaluation.
dc.embargo.termsOpen Access
dc.format.mimetypeapplication/pdf
dc.identifier.otherAsai_washington_0250E_28476.pdf
dc.identifier.urihttps://hdl.handle.net/1773/53506
dc.language.isoen_US
dc.rightsCC BY
dc.subjectComputer science
dc.subject.otherComputer science and engineering
dc.titleBeyond Scaling: Frontiers of Retrieval-Augmented LMs
dc.typeThesis

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Asai_washington_0250E_28476.pdf
Size:
7.66 MB
Format:
Adobe Portable Document Format