Using External Knowledge to Improve Brown Clustering

dc.contributor.advisorLevow, Gina-Anne
dc.contributor.authorMiljanic, Veljko
dc.date.accessioned2021-03-19T22:56:07Z
dc.date.available2021-03-19T22:56:07Z
dc.date.issued2021-03-19
dc.date.submitted2020
dc.descriptionThesis (Master's)--University of Washington, 2020
dc.description.abstractIn recent years, semi-supervised learning methods that rely on using low-dimensional word representation gained interest in NLP, due to their ability take advantage of vastly available unlabeled data and reduce dependence on large, labeled datasets. In this thesis we propose two methods which allow integration of domain specific knowledge to one of the most popular methods for low dimensional word representations – Brown clustering. First, we propose changing the order in which words are clustered so that words more relevant for the task are given priority. Then, we also propose modifying the clustering objective so that it ensures relevance of induced clusters for downstream supervised task. Experiments show that both methods improve performance of NER system using cluster features.
dc.embargo.termsOpen Access
dc.format.mimetypeapplication/pdf
dc.identifier.otherMiljanic_washington_0250O_22373.pdf
dc.identifier.urihttp://hdl.handle.net/1773/46828
dc.language.isoen_US
dc.relation.haspartapproval-form.pdf; pdf; Approval Form.
dc.rightsCC BY
dc.subjectbrown clustering
dc.subjectconstrained clustering
dc.subjectner
dc.subjectsemi-supervised learning
dc.subjectword representations
dc.subjectLinguistics
dc.subjectComputer science
dc.subjectArtificial intelligence
dc.subject.otherLinguistics
dc.titleUsing External Knowledge to Improve Brown Clustering
dc.typeThesis

Files

Original bundle

Now showing 1 - 2 of 2
Loading...
Thumbnail Image
Name:
Miljanic_washington_0250O_22373.pdf
Size:
1.23 MB
Format:
Adobe Portable Document Format
Loading...
Thumbnail Image
Name:
approval-form.pdf
Size:
276.14 KB
Format:
Adobe Portable Document Format

Collections