Building Blocks for Data-Driven Theories of Language Understanding
| dc.contributor.advisor | Zettlemoyer, Luke | |
| dc.contributor.author | Michael, Julian | |
| dc.date.accessioned | 2023-08-14T17:03:33Z | |
| dc.date.available | 2023-08-14T17:03:33Z | |
| dc.date.issued | 2023-08-14 | |
| dc.date.submitted | 2023 | |
| dc.description | Thesis (Ph.D.)--University of Washington, 2023 | |
| dc.description.abstract | I propose a paradigm for scientific progress in natural language processing, centered around the development of data-driven theories of language understanding. The central idea is to collect data in tightly scoped, carefully defined ways which allow for exhaustive annotation of a behavioral phenomenon of interest. With such data, we can use machine learning to construct explanatory theories of these phenomena which can be used as building blocks for intelligible AI systems. After laying some conceptual groundwork for the idea, I describe a series of investigations into the development of data and theory for representations of shallow semantic structure in natural language — in particular, using Question-Answer driven Semantic Role Labeling (QA-SRL), a simple schema for annotating verbal predicate-argument structure using highly constrained question-answer pairs. While this just scratches the surface of the complex language behaviors of interest in AI, I outline principles for data collection and theoretical modeling which can inform future scientific progress. | |
| dc.embargo.terms | Open Access | |
| dc.format.mimetype | application/pdf | |
| dc.identifier.other | Michael_washington_0250E_25907.pdf | |
| dc.identifier.uri | http://hdl.handle.net/1773/50300 | |
| dc.language.iso | en_US | |
| dc.rights | CC BY | |
| dc.subject | Crowdsourcing | |
| dc.subject | Pragmatism | |
| dc.subject | QA-SRL | |
| dc.subject | Semantic Roles | |
| dc.subject | Artificial intelligence | |
| dc.subject | Linguistics | |
| dc.subject.other | Computer science and engineering | |
| dc.title | Building Blocks for Data-Driven Theories of Language Understanding | |
| dc.type | Thesis |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- Michael_washington_0250E_25907.pdf
- Size:
- 2.03 MB
- Format:
- Adobe Portable Document Format
