Histogram Matching to Reduce Acoustic Mismatch in Automatic Speech Recognition

dc.contributor.advisorAtlas, Les
dc.contributor.authorFey, Cuinn Rios
dc.date.accessioned2021-03-19T22:54:08Z
dc.date.available2021-03-19T22:54:08Z
dc.date.issued2021-03-19
dc.date.submitted2020
dc.descriptionThesis (Master's)--University of Washington, 2020
dc.description.abstractWith motivation from histogram matching in image processing used to redistribute pixel probabilities in each color channel of an image, a new approach with an old technique is used for reducing acoustic mismatch between audio signals. Mel-frequency-dependent histogram matching with a silence threshold used in the log Mel-spectrogram domain is implemented before the decoding step in an automatic speech recognition system. The technique is shown to be effective within a system built to recognize low-resource, noisy, compressed, and distorted air traffic control communications. The algorithm has been shown to be robust to high acoustic variance and capable of reducing acoustic mismatch between training, validation, and test data. Additionally, it can decrease the word error rate with a statistically significant chance of confidence improvement. After tuning the algorithm’s silence threshold on the validation dataset, we were able to lower the word error rate when decoding on the test dataset from 50.4% to 46.8% with a 99.9% chance of confidence improvement.
dc.embargo.termsOpen Access
dc.format.mimetypeapplication/pdf
dc.identifier.otherFey_washington_0250O_22351.pdf
dc.identifier.urihttp://hdl.handle.net/1773/46780
dc.language.isoen_US
dc.rightsCC BY
dc.subjectacoustic mismatch
dc.subjectautomatic speech recognition
dc.subjectdeep learning
dc.subjecthistogram matching
dc.subjectmachine learning
dc.subjectsignal processing
dc.subjectElectrical engineering
dc.subjectAcoustics
dc.subjectComputer science
dc.subject.otherElectrical engineering
dc.titleHistogram Matching to Reduce Acoustic Mismatch in Automatic Speech Recognition
dc.typeThesis

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Fey_washington_0250O_22351.pdf
Size:
2.95 MB
Format:
Adobe Portable Document Format