An Independent Assessment of Phonetic Distinctive Feature Sets used to Model Pronunciation Variation

dc.contributor.advisorLevow, Gina-Anneen_US
dc.contributor.authorRolston, Leanne Elizabethen_US
dc.date.accessioned2014-04-30T16:19:27Z
dc.date.available2014-04-30T16:19:27Z
dc.date.issued2014-04-30
dc.date.submitted2014en_US
dc.descriptionThesis (Master's)--University of Washington, 2014en_US
dc.description.abstractIt has been consistently shown that Automatic Speech Recognition (ASR) performance on casual, spontaneous speech is much worse than on carefully planned or read speech by as much as double the word error rate, and that variation in pronunciation is the main reason for this degradation of performance. Thus far, any attempts to mitigate this have fallen well below expectations. Phonetic Distinctive Features show promise from a theoretical standpoint, but have thus far not been fully incorporated into an end-to-end ASR system. Work incorporating distinctive features into ASR is widespread and varied, and each project uses a unique set of features based on the authors' linguistic intuitions, so the results of these experiments cannot be fully and fairly compared. In this work, I attempt to determine which style of distinctive feature set is best suited to model pronunciation variation in ASR based on measures of surface phone prediction accuracy and efficiency of the decision tree model. Using a non-exhaustive, representative set of phonetic distinctive feature sets, decision trees were trained, one per canonical base form phone, under two experimental conditions: words in isolation, and words in sequence. These models were tested against a comparable held-out test set, and an additional data set of canonical pronunciations used to simulate formal speech. It was found that a multi-valued articulatory-based feature set provided a far more compact model that yielded comparable accuracy results, while in a comparison of binary feature sets, the model with feature redundancy provided a far more robust model, with slightly higher accuracy and, where it predicted an incorrect phone, it was closer to the actual gold standard phone than the other feature sets' predictions.en_US
dc.embargo.termsNo embargoen_US
dc.format.mimetypeapplication/pdfen_US
dc.identifier.otherRolston_washington_0250O_12824.pdfen_US
dc.identifier.urihttp://hdl.handle.net/1773/25371
dc.language.isoen_USen_US
dc.rightsCopyright is held by the individual authors.en_US
dc.subjectASR; Distinctive Features; Pronunciation Modelingen_US
dc.subject.otherLinguisticsen_US
dc.subject.otherlinguisticsen_US
dc.titleAn Independent Assessment of Phonetic Distinctive Feature Sets used to Model Pronunciation Variationen_US
dc.typeThesisen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Rolston_washington_0250O_12824.pdf
Size:
454.35 KB
Format:
Adobe Portable Document Format

Collections