Reading to Learn

dc.contributor.advisorZettlemoyer, Luke
dc.contributor.authorZhong, Victor Yuan
dc.date.accessioned2023-08-14T17:03:41Z
dc.date.available2023-08-14T17:03:41Z
dc.date.issued2023-08-14
dc.date.submitted2023
dc.descriptionThesis (Ph.D.)--University of Washington, 2023
dc.description.abstractTraditional machine learning systems are trained on vast quantities of annotated data or experience. These systems often do not generalize to new, related problems that emerge after training, such as conversing about new topics or interacting with new environments. This thesis introduces Reading to Learn, a new class of algorithms that improve generalization by learning to read language specifications, without requiring any actual experience or labeled examples. This includes, for example, reading FAQ documents to learn to answer questions about new topics and reading manuals to learn to play new games. This thesis discusses new algorithms and data for Reading to Learn applied to a broad range of tasks, including policy learning in grounded environments and data synthesis for code generation, while also highlighting open challenges for this line of work. Ultimately, the goal of Reading to Learn is to democratize AI by making it accessible for low-resource problems where the practitioner cannot obtain annotated data at scale, but can instead write language specifications that models read to generalize.
dc.embargo.termsOpen Access
dc.format.mimetypeapplication/pdf
dc.identifier.otherZhong_washington_0250E_25751.pdf
dc.identifier.urihttp://hdl.handle.net/1773/50305
dc.language.isoen_US
dc.rightsCC BY
dc.subjectMachine learning
dc.subjectNatural language processing
dc.subjectComputer science
dc.subject.otherComputer science and engineering
dc.titleReading to Learn
dc.typeThesis

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Zhong_washington_0250E_25751.pdf
Size:
40.08 MB
Format:
Adobe Portable Document Format