mSCAN - a Multilingual Dataset for Compositional Generalization Evaluation

dc.contributor.advisorSteinert-Threlkeld, Shane
dc.contributor.authorReymond, Amélie Thu Tâm
dc.date.accessioned2025-08-01T22:26:06Z
dc.date.issued2025-08-01
dc.date.submitted2025
dc.descriptionThesis (Master's)--University of Washington, 2025
dc.description.abstractLanguage models achieve remarkable results on a variety of tasks, yet still struggle on compositional generalization benchmarks. The majority of these benchmarks evaluate performance in English only, leaving open the question of whether these results generalize to other languages. As an initial step to answering this question, we introduce mSCAN, a multilingual adaptation of the SCAN dataset covering Mandarin Chinese, French, Hindi and Russian. It was produced by a rule-based translation, developed in cooperation with native speakers. We then showcase this dataset on some in-context learning experiments on multiple open-source multilingual models.
dc.embargo.lift2027-07-22T22:26:06Z
dc.embargo.termsRestrict to UW for 2 years -- then make Open Access
dc.format.mimetypeapplication/pdf
dc.identifier.otherReymond_washington_0250O_28510.pdf
dc.identifier.urihttps://hdl.handle.net/1773/53674
dc.language.isoen_US
dc.rightsnone
dc.subjectCompositional generalization
dc.subjectCross Linguistic Evaluation
dc.subjectLarge Language Models Evaluation
dc.subjectLinguistics
dc.subjectComputer science
dc.subjectArtificial intelligence
dc.subject.otherLinguistics
dc.titlemSCAN - a Multilingual Dataset for Compositional Generalization Evaluation
dc.typeThesis

Files

Collections