Span-based Neural Structured Prediction

dc.contributor.advisorZettlemoyer, Luke S
dc.contributor.authorLee, Kenton
dc.date.accessioned2018-01-20T00:59:58Z
dc.date.issued2018-01-20
dc.date.submitted2017
dc.descriptionThesis (Ph.D.)--University of Washington, 2017
dc.description.abstractA long-standing goal in artificial intelligence is for machines to understand natural language. With ever-growing amounts of data in the world, it is crucial to automate many aspects of language understanding so that users can make sense of this data in the face of information overload. The main challenge stems from the fact that the surface form of language, either as speech or text, is unstructured. Without programmatic access to the semantics of natural language, it is challenging to build general, robust systems that are usable in practice. Towards achieving this goal, we propose a series of neural structured-prediction algorithms for natural language processing. In particular, we address a challenge common to all such algorithms: the space of possible output structures can be extremely large, and inference in this space can be intractable. Despite the seeming incompatibility of neural representations with dynamic programs from traditional structured prediction algorithms, we can leverage these rich representations to learn more accurate models while using simpler or lazier inference. We focus on algorithms that model the most basic substructure of language: spans of text. We present state-of-the-art models for tasks that require modeling the internal structure of spans, such as syntactic parsing, and modeling structure between spans, such as question answering and coreference resolution. The proposed techniques are applicable to many problems, and we expect that they will further push the limits of neural structured prediction for natural language processing.
dc.embargo.lift2019-01-20T00:59:58Z
dc.embargo.termsRestrict to UW for 1 year -- then make Open Access
dc.format.mimetypeapplication/pdf
dc.identifier.otherLee_washington_0250E_18155.pdf
dc.identifier.urihttp://hdl.handle.net/1773/40871
dc.language.isoen_US
dc.rightsnone
dc.subject
dc.subjectArtificial intelligence
dc.subject.otherComputer science and engineering
dc.titleSpan-based Neural Structured Prediction
dc.typeThesis

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Lee_washington_0250E_18155.pdf
Size:
685.4 KB
Format:
Adobe Portable Document Format