Learning Board Game Rules from an Instruction Manual

dc.contributor.advisorLevow, Gina-Anneen_US
dc.contributor.authorMills, Chaden_US
dc.date.accessioned2013-07-25T17:51:16Z
dc.date.available2013-07-25T17:51:16Z
dc.date.issued2013-07-25
dc.date.submitted2013en_US
dc.descriptionThesis (Master's)--University of Washington, 2013en_US
dc.description.abstractBoard game rulebooks offer a convenient scenario for extracting a systematic logical structure from a passage of text since the mechanisms by which board game pieces interact must be fully specified in the rulebook and outside world knowledge is irrelevant to gameplay. A representation was proposed for representing a game's rules with a tree structure of logically-connected rules, and this problem was shown to be one of a generalized class of problems in mapping text to a hierarchical, logical structure. Then a keyword-based entity- and relation-extraction system was proposed for mapping rulebook text into the corresponding logical representation, which achieved an f-measure of 11% with a high recall but very low precision, due in part to many statements in the rulebook offering strategic advice or elaboration and causing spurious rules to be proposed based on keyword matches. The keyword-based approach was compared to a machine learning approach, and the former dominated with nearly twenty times better precision at the same level of recall. This was due to the large number of rule classes to extract and the relatively small data set given this is a new problem area and all data had to be manually annotated. This provided insufficient training data the machine learning approach, which performs better on large data sets with small numbers of extraction classes. The keyword-based approach was subsequently improved with a set of domain-specific filters using context information to remove likely false positives. This improved precision over the baseline system by 296% at an expense of an 11% drop in recall, an f-measure improvement from 16% to 49% on the rule extraction subtask which is the system's bottleneck. The overall system's f-measure improved from 11% to 30%, providing a substantial improvement though leaving plenty of opportunities for future work.en_US
dc.embargo.termsNo embargoen_US
dc.format.mimetypeapplication/pdfen_US
dc.identifier.otherMills_washington_0250O_11704.pdfen_US
dc.identifier.urihttp://hdl.handle.net/1773/23470
dc.language.isoen_USen_US
dc.rightsCopyright is held by the individual authors.en_US
dc.subjectboard game rules; computational linguistics; entity extraction; logical system extraction; natural language processing; relation extractionen_US
dc.subject.otherLinguisticsen_US
dc.subject.otherComputer scienceen_US
dc.subject.otherlinguisticsen_US
dc.titleLearning Board Game Rules from an Instruction Manualen_US
dc.typeThesisen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Mills_washington_0250O_11704.pdf
Size:
1.33 MB
Format:
Adobe Portable Document Format

Collections