Learning To Understand Entities In Text

Choi, Eunsol

Learning To Understand Entities In Text

dc.contributor.advisor	Zettlemoyer, Luke S.
dc.contributor.advisor	Choi, Yejin
dc.contributor.author	Choi, Eunsol
dc.date.accessioned	2019-10-15T22:57:02Z
dc.date.available	2019-10-15T22:57:02Z
dc.date.issued	2019-10-15
dc.date.submitted	2019
dc.description	Thesis (Ph.D.)--University of Washington, 2019
dc.description.abstract	Real world entities such as people, organizations and countries play a critical role in text. Reading text offers rich information about these entities, both explicit, such as historical facts and scientific findings, and implicit, such as social relationships and personal opinions. Automatically extracting rich entity information promises exciting opportunities, such as question answering systems which can reason across facts mentioned in different documents, and analytic models which can help us understand constantly evolving social relations. This dissertation studies how machines can read natural language text, gather rich entity information, and map this information to a structured format. We present three independent studies, each focusing on different aspects of entity centric text understanding. First, we introduce a semantic parser populating entity attributes embedded in noun phrases to a large scale knowledgebase. Our method addresses the challenges arise from incompleteness in schema (i.e., existing ontologies cannot represent the meaning of many English phrases) and in KB (i.e., most KB misses many facts). Second, we study rich entity categories that can be inferred from the sentence that the entity occurs in. Our new formulation with virtually unrestricted types allows us to expand the standard KB-based training methodology with typing information from Wikipedia definitions and naturally-occurring head-word supervision. Lastly, we introduce a document-level model to infer dynamic entity-entity relationships. Unlike prior work which mostly focused on factual relationships, our work considers sentiment relationship between a pair of entities and presents a model which considers the document and social context jointly. For each of these studies, we address the limited annotated data challenge via crowdsourcing and/or harvesting large-scale naturally occurring weak supervision. Each study presents a new model and learning framework exploiting new sources of supervision to organize entity information. Together, this thesis expands the scope of information that can be learned about entities from text and points towards future work for entity centric document understanding.
dc.embargo.terms	Open Access
dc.format.mimetype	application/pdf
dc.identifier.other	Choi_washington_0250E_20669.pdf
dc.identifier.uri	http://hdl.handle.net/1773/44767
dc.language.iso	en_US
dc.rights	none
dc.subject
dc.subject	Computer science
dc.subject.other	Computer science and engineering
dc.title	Learning To Understand Entities In Text
dc.type	Thesis

Files

Original bundle

Now showing 1 - 1 of 1

Name:: Choi_washington_0250E_20669.pdf
Size:: 1.83 MB
Format:: Adobe Portable Document Format

Download

Collections

Computer science and engineering