Modelling Language Acquisition through Syntactico-Semantic Pattern Finding

Jonas Doumen, Katrien Beuls, Paul Van Eecke

Research output: Conference paper

Abstract

Usage-based theories of language acquisition have extensively documented the processes by which children acquire language through communicative interaction. Notably, Tomasello (2003) distinguishes two main cognitive capacities that underlie human language acquisition: intention reading and pattern finding. Intention reading is the process by which children try to continuously reconstruct the intended meaning of their interlocutors. Pattern finding refers to the process that allows them to distil linguistic schemata from multiple communicative interactions. Even though the fields of cognitive science and psycholinguistics have studied these processes in depth, no faithful computational operationalisations of these mechanisms through which children learn language exist to date. The research on which we report in this paper aims to fill part of this void by introducing a computational operationalisation of syntactico-semantic pattern finding. Concretely, we present a methodology for learning grammars based on similarities and differences in the form and meaning of linguistic observations alone. Our methodology is able to learn compositional lexical and item-based constructions of variable extent and degree of abstraction, along with a network of emergent syntactic categories. We evaluate our methodology on the CLEVR benchmark dataset and show that the methodology allows for fast, incremental and effective learning. The constructions and categorial network that result from the learning process are fully transparent and bidirectional, facilitating both language comprehension and production. Theoretically, our model provides computational evidence for the learnability of usage-based constructionist theories of language acquisition. Practically, the techniques that we present facilitate the learning of computationally tractable, usage-based construction grammars, which are applicable for natural language understanding and production tasks.
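
As a rough illustration of what learning from "similarities and differences in the form and meaning of linguistic observations" could look like, the toy Python sketch below compares two form-meaning pairs and splits them into a shared item-based pattern with one slot plus two lexical pairings. It is not the authors' implementation: the `Observation` type, the representation of meaning as a flat set of predicate names, the `"?X"` slot marker, and the function names are all assumptions made for this example.

```python
from typing import FrozenSet, List, NamedTuple, Optional, Tuple

Form = Tuple[str, ...]
Meaning = FrozenSet[str]


class Observation(NamedTuple):
    form: Form        # tokenised utterance
    meaning: Meaning  # toy meaning: a flat set of predicate names (assumption)


def diff_sequences(a: Form, b: Form) -> Tuple[Form, Form, Form, Form]:
    """Return (shared prefix, middle of a, middle of b, shared suffix)."""
    shortest = min(len(a), len(b))
    i = 0
    while i < shortest and a[i] == b[i]:
        i += 1
    j = 0
    # The suffix may not overlap with the prefix already consumed.
    while j < shortest - i and a[len(a) - 1 - j] == b[len(b) - 1 - j]:
        j += 1
    return a[:i], a[i:len(a) - j], b[i:len(b) - j], a[len(a) - j:]


def find_pattern(
    o1: Observation, o2: Observation
) -> Optional[Tuple[Tuple[Form, Meaning], List[Tuple[Form, Meaning]]]]:
    """Generalise over two observations that differ in one contiguous span.

    Returns an item-based pattern (form with a "?X" slot, shared meaning)
    and the two lexical pairings that can fill the slot, or None when the
    observations share no meaning or do not differ in form.
    """
    prefix, mid1, mid2, suffix = diff_sequences(o1.form, o2.form)
    shared = o1.meaning & o2.meaning
    if not mid1 or not mid2 or not shared:
        return None
    item_based = (prefix + ("?X",) + suffix, shared)
    lexical = [(mid1, o1.meaning - shared), (mid2, o2.meaning - shared)]
    return item_based, lexical


# Hypothetical CLEVR-style observations (invented for illustration).
obs1 = Observation(("how", "many", "cubes", "are", "there"),
                   frozenset({"count", "filter", "cube"}))
obs2 = Observation(("how", "many", "spheres", "are", "there"),
                   frozenset({"count", "filter", "sphere"}))
pattern = find_pattern(obs1, obs2)
```

In this sketch the two questions yield one pattern with a slot between "how many" and "are there", while "cubes" and "spheres" are each paired with the part of the meaning that the other observation lacks. The paper itself describes constructions of variable extent and an emergent categorial network, which this toy example does not attempt to model.
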
Original language: English
Title: Findings of the Association for Computational Linguistics: EACL 2023
Publisher: Association for Computational Linguistics
Pages: 1347–1357
Number of pages: 11
ISBN (electronic): 9781959429470
Status: Published - 2023
Event: The 17th Conference of the European Chapter of the Association for Computational Linguistics - Dubrovnik, Croatia
Duration: 2 May 2023 to 4 May 2023
Conference number: 17
https://2023.eacl.org

Publication series

Name: EACL 2023 - 17th Conference of the European Chapter of the Association for Computational Linguistics, Findings of EACL 2023

Conference

Conference: The 17th Conference of the European Chapter of the Association for Computational Linguistics
Abbreviated title: EACL
Country/Territory: Croatia
City: Dubrovnik
Period: 2/05/23 to 4/05/23

Bibliographical note

Funding Information:
The research reported on in this paper received funding from imec’s Smart Education research programme, with support from the Flemish government, the European Union’s Horizon 2020 research and innovation programme under grant agreement no. 951846, and the Research Foundation Flanders (FWO) through a postdoctoral grant awarded to Paul Van Eecke (75929).

Publisher Copyright:
© 2023 Association for Computational Linguistics.

Copyright:
Copyright 2023 Elsevier B.V., All rights reserved.
