Browsing Linguistics by Title
-
Challenges in Automated Debiasing for Toxic Language Detection
Biased associations have been a challenge in the development of classifiers for detecting toxic language, hindering both fairness and accuracy. As potential solutions, we investigate recently introduced debiasing methods ... -
Classifying COVID-19 News on Sina Weibo
This thesis addresses the classification of Sina Weibo news related to the COVID-19 pandemic by using sentiment analysis. The design is a comparison study, involving four different systems. The systems were chosen after ... -
Clausal case marking in Korean
(1998) This dissertation investigates morphological features of Korean complementizers -ko and -nun. I propose that -ko and -nun are the realizations of accusative and genitive, respectively. The configurations in which the ... -
Comparative Analysis of DeepBank and the Penn Treebank
(2014-02-24) I examined the differences between the DeepBank and Penn Treebank and the effect of hand-created and grammar-derived annotations on dependency representations. The dependencies comparison involved transforming the DeepBank ... -
Comparing Methods for Automatic Identification of Mislabeled Data
This thesis compares three methods for identifying mislabeled examples in datasets: Dataset Cartography (Swayamdipta et al. [2020]), Cleanlab (Northcutt et al. [2021b]), and Ensembling (Brodley and Friedl [1999], Reiss ... -
Considerations for the social impact of natural language processing
Natural language processing (NLP) technologies have transformed how people access information and communicate with one another. It has thus become critical to take stock of the social impact of natural language processing ... -
Contextual Scripture Recommendation for Writers
Recommendation of book passages, quotes, or citations based on a given text can aid writing, research, literary analysis, and the incorporation of legal references (e.g. laws, previous cases). Each of these applications ... -
Counterfactuals in Context: Felicity conditions for counterfactual conditionals containing proper names
(2013-11-14) This thesis provides felicity conditions for counterfactual conditionals containing proper names in which essential changes to an individual are counterfactually posited using contrastive focus in either the antecedent or ... -
Cross-Linguistic Acoustic Characteristics of Phonation: A Machine Learning Approach
Phonation, the process of producing a quasi-periodic sound wave through vocal fold vibration, plays different roles in different languages. Phonation types, or voice qualities, are produced by adjusting the length, thickness, ... -
Defining, Extracting, and Applying Events in NLP Tasks for Clinical Corpora
This dissertation explores defining, extracting, and applying clinical events in three studies of applied clinical natural language processing (NLP)---pneumonia report classification, acquired lung injury (ALI) report ... -
Dependency Parsing for Tweets
This thesis concentrates on the problem of dependency parsing for Twitter texts. Twitter texts, also called tweets, are a typical kind of web domain language with many informal and specific linguistic phenomena (Eisenstein, ... -
Detecting Adverse Events in Clinical Trial Free Text
(2013-11-14) Introduction: In pharmacotherapy cancer clinical trials patients receive frequent outpatient evaluation and monthly inpatient evaluation, as required by the protocol or institutional guidelines. Detection of ... -
Detection of Agreement and Disagreement: An investigation of linguistic coordination and conversational features
The focus of this thesis is detection of agreement and disagreement in multiparty conversations using existing transcripts from the ICSI corpus. We use an unsupervised lexicon-based method to create our baseline and then ... -
Developing a Framework for Metrics and Evaluation of the Impact of Acoustic-Prosodic Features in Synthesized Speech on Listener Perception in Dyadic Interactions
Acoustic-prosodic properties of conversational speech have been shown in prior research to impact the perceptions that listeners have towards the speakers in dyadic interactions. While correlations between the two have ... -
Dialogical Signals of Stance Taking in Spontaneous Conversation
This is one of the first computational studies to investigate dialogical aspects of stance taking in spontaneous, spoken dialogue with a focus on lexical similarities. In any dialogic interaction, each speaker influences ... -
East Uvean: A condensed grammar
East Uvean (EUV), also called Faka’Uvea or le wallisien, is a Polynesian language of the Austronesian family, spoken on Wallis Island (‘Uvea) in the French collectivity of Wallis and Futuna, as well as by populations in ... -
Endangered languages, technology and learning: A Yakama/Yakima Sahaptin case study
Efforts to support Indigenous and endangered language education continue to utilize technology in a variety of ways. As the vitality of many languages around the world continues to be threatened, it is important to reassess ... -
Enriching Scientific Paper Embeddings with Citation Context
Amid a profusion of scientific literature, methods to organize and search available papers are quite valuable. Embedded representations of papers have the potential to be used as input to a variety of tasks related to research ... -
Ethnic History and Language Typology in Western China: The Cases of Xining, Daohua and Bai
The following dissertation examines the language history of areas historically lying along the China-Tibet frontier, namely Amdo, Kham and the Dali region of northwest Yunnan. It draws from a wide and diverse literature ... -
Evaluating Transformer's Ability to Learn Mildly Context-Sensitive Languages
Transformer models perform well on NLP tasks, but recent theoretical studies suggest their ability to model certain regular and context-free languages is limited. This creates a disparity given their success in modeling ...