The system will be down for regular maintenance from 8:00-10:00am PDT on April 3rd, 2024.
Linguistics: Recent submissions
Now showing items 1-20 of 143
-
On the diverse language experiences of humans and machines
Human experience is characterized by remarkable linguistic and sociocultural diversity. At the same time, much of this diversity is neglected in the language-related fields of linguistics, cognitive science, and natural ... -
A Grammar of Lushootseed: Phonetics, Phonology, Morphology
In this dissertation, I document the phonetics, phonology, and morphology of Lushootseed from a corpus of recordings recovered from the archives of the University of Washington’s Burke Museum. There are several goals to ... -
Language Dataset Documentation Design: Learning from Deaf and Indigenous Communities
This dissertation investigates how engaging with stakeholder groups, namely natural language processing (NLP) practitioners and language communities, can contribute to the development of documentation toolkits that are ... -
Developing a Framework for Metrics and Evaluation of the Impact of Acoustic-Prosodic Features in Synthesized Speech on Listener Perception in Dyadic Interactions
Acoustic-prosodic properties of conversational speech have been shown in prior research to impact the perceptions that listeners have towards the speakers in dyadic interactions. While correlations between the two have ... -
Automatically Inferring Grammar Specifications for Valence-changing Verbal Morphology from Interlinear Glossed Text
This work builds upon the AGGREGATION project and extends it by adding an inference module targeting valence-changing verbal morphology. This module automatically collects information that answers the questionnaire of the ... -
Transfer Learning Using L2 Speech to Improve Automatic Speech Recognition of Dysarthric Speech
Dysarthria is a class of speech disorders associated with impairments to a person’s motor system. Dysarthric speech is diverse but is broadly characterized by reduced prosodic, phonation, and articulatory precision (Rowe ... -
The Weighted Möbius Score: A Unified Framework for Feature Attribution
Feature attribution aims to explain the reasoning behind a black-box model's prediction by identifying the impact of each feature on the prediction. Recent work has extended feature attribution to interactions between ... -
A Comparative Analysis of Transcription Errors from Major Commercial Automatic Speech Recognition Systems on Speakers of Four Ethnic Backgrounds in the Pacific Northwest
Major commercial ASR systems have demonstrated higher transcription error rates for non-white American English speakers, particularly for African American speakers, and there is evidence that sociophonetic features are ... -
An Investigation Into Supervision for Seq2Seq Techniques for Natural Language to Code Translation
This thesis examines the role of supervised data using small-scale datasets for the natural language to code task. The primary angles of inquiry are from analyzing the balance between unsupervised learning and supervised ... -
Sociolinguistic and Phonetic Perception of Second Language Mandarin Chinese
Perception of second language (L2) speakers and their speech is known to be influenced both by phonetic and by sociolinguistic factors. The existing body of scholarly research on L2 speech perception, however, is overwhelmingly ... -
Simplifying Multimodal Emotion Recognition with Single Eye Movement Modality
Multimodal emotion recognition has long been a popular topic in affective computing since it significantly enhances the performance compared with that of a single modality. Among all, the combination of electroencephalography ... -
Automatically Inferring Grammar Specifications for Adnominal Possession from Interlinear Glossed Text
This thesis presents an update to the AGGREGATION grammar inference project: namely, the ability to automatically infer information about adnominal possession for a given lan- guage. Specifically, I contribute code that ... -
Ethnic History and Language Typology in Western China: The Cases of Xining, Daohua and Bai
The following dissertation examines the language history of areas historically lying along the China-Tibet frontier, namely Amdo, Kham and the Dali region of northwest Yunnan. It draws from a wide and diverse literature ... -
Modals in Natural Language Optimize the Simplicity/Informativeness Trade-Off
The meanings expressed by the world’s languages have been argued to support efficient communication. Evidence for this hypothesis has drawn on cross-linguistic analyses of vocabulary in semantic domains of both content ... -
"Obama never said that": Evaluating fact-checks for topical consistency and quality
This thesis examines topical consistency between claims and fact-checks in the Birdwatch dataset published by Twitter. The dataset has tweets (the claims), notes (context-adding annotations written by Birdwatch users), and ... -
Resourceful at Any Size: A Predictive Methodology Using Linguistic Corpus Metrics for Multi-Source Training in Neural Dependency Parsing
Multilingual modeling comes up in natural language processing at any scale. High-resource language corpora train high-performing models, and can be combined with other language corpora of all sizes to make better models ... -
Comparing Methods for Automatic Identification of Mislabeled Data
This thesis compares three methods for identifying mislabeled examples in datasets: Dataset Cartography (Swayamdipta et al. [2020]), Cleanlab, (Northcutt et al. [2021b]), and Ensem- bling (Brodley and Friedl [1999], Reiss ... -
Considerations for the social impact of natural language processing
Natural language processing (NLP) technologies have transformed how people access information and communicate with one another. It has thus become critical to take stock of the social impact of natural language processing ... -
Latent Compositional Representations for English Function Word Comprehension
This paper investigates whether biasing natural language models toward tree-compositional structure and systematic token representation can improve performance on tasks that require the use of function words. The method ... -
The Spatiality of Perceptual Dialectology
A criticism that has been leveled against modern sociolinguistic research is that “space [has been] carefully controlled out of” studies and that "spatial variation [... is] not examined" (Britain 2010b, p. 3). This ...