Speech to Text to Semantics: A Sequence-to-Sequence System for Spoken Language Understanding

dc.contributor.advisor: Levow, Gina-Anne
dc.contributor.author: Dodson, John Ryan
dc.date.accessioned: 2020-08-14T03:32:09Z
dc.date.issued: 2020-08-14
dc.date.submitted: 2020
dc.description: Thesis (Master's)--University of Washington, 2020
dc.description.abstract: Spoken language understanding entails both the automatic transcription of a speech utterance and the identification of one or more semantic concepts being conveyed by the utterance. Traditionally these systems are domain specific and target industries like travel, entertainment, and home automation. As such, many approaches to spoken language understanding solve the task of filling predefined semantic slots, and cannot generalize to identify arbitrary semantic roles. This thesis addresses the broader question of how to extract predicate-argument frames from a transcribed speech utterance. I describe a sequence-to-sequence system for spoken language understanding through shallow semantic parsing. Built using a modification of the OpenSeq2Seq toolkit, the system is able to perform speech recognition and semantic parsing in a single end-to-end flow. The proposed system is extensible and easy to use, allowing for fast iteration on system parameters and model architectures. The system is evaluated through two experiments. The first experiment performs a speech to text to semantics transformation and uses n-best language model rescoring to generate the best transcription sequence. The second experiment executes the same transformation process, but generates transcriptions through shallow language model fusion. Both experiments evaluate several combinations of speech recognition models and semantic parsers.
dc.embargo.lift: 2021-08-14T03:32:09Z
dc.embargo.terms: Restrict to UW for 1 year -- then make Open Access
dc.format.mimetype: application/pdf
dc.identifier.other: Dodson_washington_0250O_21292.pdf
dc.identifier.uri: http://hdl.handle.net/1773/46082
dc.language.iso: en_US
dc.rights: none
dc.subject: Computer science
dc.subject: Linguistics
dc.subject.other: Linguistics
dc.title: Speech to Text to Semantics: A Sequence-to-Sequence System for Spoken Language Understanding
dc.type: Thesis

Files

Original bundle

Name: Dodson_washington_0250O_21292.pdf
Size: 751.46 KB
Format: Adobe Portable Document Format
