Histogram Matching to Reduce Acoustic Mismatch in Automatic Speech Recognition
| dc.contributor.advisor | Atlas, Les | |
| dc.contributor.author | Fey, Cuinn Rios | |
| dc.date.accessioned | 2021-03-19T22:54:08Z | |
| dc.date.available | 2021-03-19T22:54:08Z | |
| dc.date.issued | 2021-03-19 | |
| dc.date.submitted | 2020 | |
| dc.description | Thesis (Master's)--University of Washington, 2020 | |
| dc.description.abstract | With motivation from histogram matching in image processing used to redistribute pixel probabilities in each color channel of an image, a new approach with an old technique is used for reducing acoustic mismatch between audio signals. Mel-frequency-dependent histogram matching with a silence threshold used in the log Mel-spectrogram domain is implemented before the decoding step in an automatic speech recognition system. The technique is shown to be effective within a system built to recognize low-resource, noisy, compressed, and distorted air traffic control communications. The algorithm has been shown to be robust to high acoustic variance and capable of reducing acoustic mismatch between training, validation, and test data. Additionally, it can decrease the word error rate with a statistically significant chance of confidence improvement. After tuning the algorithm’s silence threshold on the validation dataset, we were able to lower the word error rate when decoding on the test dataset from 50.4% to 46.8% with a 99.9% chance of confidence improvement. | |
| dc.embargo.terms | Open Access | |
| dc.format.mimetype | application/pdf | |
| dc.identifier.other | Fey_washington_0250O_22351.pdf | |
| dc.identifier.uri | http://hdl.handle.net/1773/46780 | |
| dc.language.iso | en_US | |
| dc.rights | CC BY | |
| dc.subject | acoustic mismatch | |
| dc.subject | automatic speech recognition | |
| dc.subject | deep learning | |
| dc.subject | histogram matching | |
| dc.subject | machine learning | |
| dc.subject | signal processing | |
| dc.subject | Electrical engineering | |
| dc.subject | Acoustics | |
| dc.subject | Computer science | |
| dc.subject.other | Electrical engineering | |
| dc.title | Histogram Matching to Reduce Acoustic Mismatch in Automatic Speech Recognition | |
| dc.type | Thesis |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- Fey_washington_0250O_22351.pdf
- Size:
- 2.95 MB
- Format:
- Adobe Portable Document Format
