AGGREGATION

Loading...
Thumbnail Image

Date

Authors

Bender, Emily M.
Howell, Kristen
Xia, Fei
Zamaraeva, Olga
Goodman, Michael Wayne
Crowgey, Joshua
Packard, Woodley
Lockwood, Michael Wayne
Lepp, Haley
Ramaswamy, Swetha

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

The AGGREGATION Project aims to bring the benefits of grammar engineering to language documentation without requiring field linguists to become grammar engineers. We achieve this by automatically creating precision grammars on the basis of analyses and annotations already produced by field linguists together with a typologically-grounded cross-linguistic grammar resource (the LinGO Grammar Matrix) and natural language processing techniques developed for high-resource languages. Precision grammars are machine-readable encodings of mutually-consistent linguistic hypotheses, in our case, concerning morphotactics, morphosyntax and the syntax-semantics interface. They can be used to automatically process text, assigning structures to input strings and strings to input semantic representations. Text processed in this way can then be searched for sentences or word forms with structures of interest or items that are not covered by the grammar (i.e. fall outside current hypotheses).

Description

This archive is associated with the AGGREGATION project, which seeks to automatically generate HPSG grammars on the basis of Interlinnear Glossed Text data. For a detailed description of this project see Chapter 3 of Inferring Grammars from Interlinear Glossed Text: Extracting Typological and Lexical Properties for the Automatic Generation of HPSG Grammars, PhD thesis by Kristen Howell 2020. This archive includes the following: The AGGREGATION/BASIL syntactic inference repository from https://git.ling.washington.edu/agg/aggregation The MOM morphological inference repository from https://git.ling.washington.edu/agg/mom The Xigt framework for eXtensible Interlinear Glossed Text release 1.1 from https://github.com/xigt/xigt The Grammar Matrix Customization system http://matrix.ling.washington.edu/index.html Code, dependencies and sample data for running the AGGREGATION pipeline end to end.

Citation

DOI