Language Models can Generalize from Indirect Evidence: Evidence from Filtered Corpus Training (FICT)

dc.contributor.advisorSteinert-Threlkeld, Shane
dc.contributor.authorPatil, Abhinav
dc.date.accessioned2024-09-09T23:12:02Z
dc.date.available2024-09-09T23:12:02Z
dc.date.issued2024-09-09
dc.date.submitted2024
dc.descriptionThesis (Master's)--University of Washington, 2024
dc.description.abstractThis thesis introduces Filtered Corpus Training, a method that trains language models (LMs) on corpora with certain linguistic constructions filtered out from the training data, and uses it to measure the ability of LMs to perform linguistic generalization on the basis of indirect evidence. Applying the method to both LSTM and Transformer LMs, of roughly comparable size, we develop corpora filtered of direct evidence for a wide range of linguistic phenomena. Our results show that while transformers are better qua LMs (as measured by perplexity), both models perform equally and surprisingly well on linguistic generalization measures, suggesting that they are capable of generalizing from indirect evidence. This adds to a growing body of evidence on the limitations of perplexity as an evaluation metric, while also showing that direct attestation may be not strictly be necessary for learners to develop the appropriate linguistic generalizations.
dc.embargo.termsOpen Access
dc.format.mimetypeapplication/pdf
dc.identifier.otherPatil_washington_0250O_27109.pdf
dc.identifier.urihttps://hdl.handle.net/1773/52074
dc.language.isoen_US
dc.rightsCC BY-SA
dc.subjectFiltered Corpus Training
dc.subjectInductive Biases
dc.subjectLanguage Model Evaluation
dc.subjectLinguistic Generalization
dc.subjectPoverty of the Stimulus
dc.subjectTargeted Syntactic Evaluations
dc.subjectLinguistics
dc.subjectArtificial intelligence
dc.subjectComputer science
dc.subject.otherLinguistics
dc.titleLanguage Models can Generalize from Indirect Evidence: Evidence from Filtered Corpus Training (FICT)
dc.typeThesis

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Patil_washington_0250O_27109.pdf
Size:
2.12 MB
Format:
Adobe Portable Document Format

Collections