Towards Efficient, Customizable, and Communal Natural Language Processing

dc.contributor.advisor: Smith, Noah A
dc.contributor.author: Kasai, Jungo
dc.date.accessioned: 2023-09-27T17:19:14Z
dc.date.available: 2023-09-27T17:19:14Z
dc.date.issued: 2023-09-27
dc.date.submitted: 2023
dc.description: Thesis (Ph.D.)--University of Washington, 2023
dc.description.abstract: I advocate for efficient, customizable, and communal approaches to natural language processing (NLP) and artificial intelligence (AI), where people with diverse skill levels and research backgrounds can: build, use, analyze, and evaluate models; collaborate to solve research problems; and accelerate advances in NLP and AI. AI and NLP have made remarkable progress from recent, large-scale training on massive datasets. These technologies are being developed and used by many cross-disciplinary researchers and practitioners. People with scant computer science training—including physicians, translators, and historians—now rely on AI models for work problems that can be solved by using massive amounts of data. This thesis discusses my key contributions to ways to make AI and NLP more accessible to researchers, practitioners, and users. How can we encourage model builders and practitioners to work as a community to broaden the appeal and utility of NLP and AI models across disciplines? How can we make it easier for them to formulate and answer complex real-world questions using these technologies and ensure these models are robustly evaluated? I first introduce and empirically demonstrate efficient architectures and learning paradigms for state-of-the-art NLP models. More efficient methods will lower the cost of developing and using these models, making them deployable to less-well-funded fields or institutions. I then present an algorithm for flexible and customizable language generation in the areas of collaborative inference between diverse models. This inference method avoids the computationally (and thus financially and environmentally) expensive training process of large models. Lastly, I propose methodologies and interfaces to make model evaluations more transparent, consistent, and reliable. I present a collaborative platform that bridges the modeling and evaluation research communities to enable robust evaluation of AI models.
dc.embargo.terms: Open Access
dc.format.mimetype: application/pdf
dc.identifier.other: Kasai_washington_0250E_26022.pdf
dc.identifier.uri: http://hdl.handle.net/1773/50758
dc.language.iso: en_US
dc.rights: CC BY
dc.subject: Artificial Intelligence
dc.subject: Machine Learning
dc.subject: Natural Language Processing
dc.subject: Computer science
dc.subject.other: Computer science and engineering
dc.title: Towards Efficient, Customizable, and Communal Natural Language Processing
dc.type: Thesis

Files

Original bundle

Name: Kasai_washington_0250E_26022.pdf
Size: 4.18 MB
Format: Adobe Portable Document Format