Graph-based Algorithms for Lexical Semantics and its Applications

Wu, Wei

dc.contributor.advisor	Ostendorf, Mari	en_US
dc.contributor.author	Wu, Wei	en_US
dc.date.accessioned	2012-09-13T17:25:03Z
dc.date.available	2013-03-13T11:04:54Z
dc.date.issued	2012-09-13
dc.date.submitted	2012	en_US
dc.description	Thesis (Ph.D.)--University of Washington, 2012	en_US
dc.description.abstract	Lexical semantics, the study of word meaning, is a useful tool for automatic natural language processing (NLP). This thesis explores graph-based algorithms to learn and apply distributional lexical semantics in NLP applications. One theory in lexical semantics holds that semantic relations among words can be extracted from their textual context in natural languages. Based on this theory, we propose using graphs to represent natural language text according to the contextual relations of words in higher-level language units (e.g., sentences, definitions, or documents). In these graphs, words and/or higher-level language units are represented as nodes, and edges are added between them according to their textual context to indicate their observed relatedness in a dictionary or a collection of documents. We explore two types of graph representations: the word-word graph, which is used for modeling the semantic relations among words, and the instance-word bipartite graph, which uses words as a medium to study the relatedness among higher-level language units. In this way, we can embed the semantic relations among words, and optionally higher-level language units, into the graph structure. We design algorithms that propagate semantic information through the graphs in order to recover the unobserved relatedness among words or higher-level language units. The recovered relatedness is then used in designing unsupervised, semi-supervised, or active learning algorithms that reduce human supervision in NLP applications for harvesting or analyzing text data resources. Specifically, we design graph-based algorithms either for quantitatively assessing lexical semantic similarity, or for developing representativeness and diversity criteria for selecting a characteristic subset of terms, which is useful for problems such as keyword summarization and query design for active learning.
In particular, we focus on designing graph-based algorithms to apply lexical semantics in three NLP applications: Wiktionary lexical semantic similarity extraction, Twitter user interest extraction, and active learning for semantic orientation classification.	en_US
dc.embargo.terms	Delay release for 6 months -- then make Open Access	en_US
dc.format.mimetype	application/pdf	en_US
dc.identifier.other	Wu_washington_0250E_10632.pdf	en_US
dc.identifier.uri	http://hdl.handle.net/1773/20590
dc.language.iso	en_US	en_US
dc.rights	Copyright is held by the individual authors.	en_US
dc.subject	Graph-based Algorithms; Lexical Semantics; NLP; Semantic Similarity; Sentiment Classification; Twitter	en_US
dc.subject.other	Electrical engineering	en_US
dc.subject.other	Computer science	en_US
dc.title	Graph-based Algorithms for Lexical Semantics and its Applications	en_US
dc.type	Thesis	en_US
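
The word-word graph and the propagation idea described in the abstract can be illustrated with a minimal sketch. This is not the thesis's algorithm: the function names `build_word_graph` and `propagate`, the sentence-level co-occurrence weighting, and the use of a random walk with restart (with an assumed restart probability of 0.5) are all illustrative assumptions chosen to show how scores can spread from one word to words it was never directly paired with.

```python
from collections import defaultdict
from itertools import combinations


def build_word_graph(sentences):
    """Build an undirected weighted word-word graph.

    Assumption: the edge weight is the number of sentences in which
    both words occur (one simple notion of shared textual context).
    """
    graph = defaultdict(lambda: defaultdict(float))
    for sentence in sentences:
        words = sorted(set(sentence.lower().split()))
        for u, v in combinations(words, 2):
            graph[u][v] += 1.0
            graph[v][u] += 1.0
    return graph


def propagate(graph, seed, steps=3, restart=0.5):
    """Spread relatedness from `seed` with a random walk with restart.

    Assumption: `steps` and `restart` are hypothetical settings, not
    values taken from the thesis. Scores always sum to 1.
    """
    scores = {seed: 1.0}
    for _ in range(steps):
        nxt = defaultdict(float)
        nxt[seed] += restart  # part of the mass returns to the seed
        for node, mass in scores.items():
            neighbours = graph.get(node, {})
            total = sum(neighbours.values())
            if total == 0:
                # Isolated node: keep its share in place.
                nxt[node] += (1 - restart) * mass
                continue
            for nbr, weight in neighbours.items():
                nxt[nbr] += (1 - restart) * mass * weight / total
        scores = dict(nxt)
    return scores


if __name__ == "__main__":
    corpus = ["cat chases mouse", "dog chases cat", "stock market falls"]
    graph = build_word_graph(corpus)
    for word, score in sorted(propagate(graph, "cat").items()):
        print(f"{word}: {score:.4f}")
```

In this toy corpus, words that share sentences with "cat" receive nonzero scores, while words from the unrelated third sentence are never reached.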

Files

Original bundle

Name: Wu_washington_0250E_10632.pdf
Size: 576.93 KB
Format: Adobe Portable Document Format

Collections

Electrical engineering