Connectionist models of sentence production

This page will have code related to the implementation of the Dual-path models of sentence production. To learn more about connectionist models and the Dual-path model, there is a tutorial here.

Chara Tsoukala has implemented the Dual-Path model in python. Code is available here.

Chang (2002) introduced the Dual-path architecture in order to get variable-based generalization of words in a connectionist model. Chang, Dell, & Bock (2006) applied that model to structural priming phenomena and syntax acquisition phenomena. Chang (2009) has extended the model to incremental production phenomena like heavy NP shift and accessibility effects in English and Japanese. Hartmut Fitz has extended the model to multiple embedded clauses and addressed a wide range of phenomena with his Recursive Dual-path model.

Dual-path model during language acquisition (Quicktime movie)

Dual-path model during production of sentence "the cat is hit by the girl" (Quicktime movie)

Processing code for drawing Dual-path and Prod-SRN models

Summaries of work with the Dual-path model

Chang, F. (2015) The role of learning in theories of English and Japanese processing. In M. Nakayama (Ed), Handbook of Japanese Psycholinguistics. (pp. 353-386) Berlin:De Gruyter draft pdf publisher

Chang, F. and Fitz, H. (2014) Computational models of sentence production: A Dual-path approach. In V. Ferreira, M. Goldrick, and M. Miozzo (Eds), The Oxford Handbook of Language Production, New York: Oxford University Press pdf online

Dell, G.S. & Chang, F. (2014). The P-Chain: Relating sentence production and its disorders to comprehension and acquisition. Philosophical Transactions of the Royal Society B, 369: 20120394, 1471-2970 abstract pdf audio of talk

Application of the Dual-path model to various issues (click on authors to go to separate page with model code)

Janciauskas, M. & Chang, F. (2017) Input and age-dependent variation in second language learning: A connectionist account

Fitz, H. & Chang, F. (2017) Meaningful questions: The acquisition of auxiliary inversion in a connectionist model of sentence production. Cognition

Chang. F., Baumann, M., Pappert, S., and Fitz, H. (2015). Do lemmas speak German?: A verb position effect in German structural priming, Cognitive Science, 1-18 doi: 10.1111/cogs.12184 pdf

Twomey, K., Chang, F., & Ambridge, B. (2013). A distributional learning account of the acquisition of the locative alternation: Corpus analysis and modeling. Proceedings of the Cognitive Science Society, Berlin pdf

Fitz, H., Chang, F., and Christiansen, M. H. (2011). A connectionist account of the acquisition and processing of relative clauses. In E. Kidd (Ed.), The Acquisition of Relative Clauses (Vol 8), Amsterdam: John Benjamins, 39–60 pdf

Fitz, H. (2009). Neural Syntax. Amsterdam. ILLC publication series.

Hartmut Fitz has completed his dissertation on the Recursive Dual-path model. It addresses recursion in neural networks, poverty of the stimulus, systematicity in neural networks, messages with multiple propositions, learning of polar interrogatives, and the accessibility hierarchy for relative clause constructions.

Chang, F. (2009). Learning to order words: A connectionist model of heavy NP shift and accessibility effects in Japanese and English. Journal of Memory and Language, 61, 374-397.

Languages differ from one another and must therefore be learned. Processing biases in

word order can also differ across languages. For example, heavy noun phrases tend to be shifted

to late sentence positions in English, but to early positions in Japanese. Although these language

differences suggest a role for learning, most accounts of these biases have focused on processing

factors. This paper presents a learning-based account of these word order biases in the form of a

connectionist model of syntax acquisition that can learn the distinct grammatical properties of

English and Japanese while, at the same time, accounting for the cross-linguistic variability in

processing biases in sentence production. This account demonstrates that the incremental nature

of sentence processing can have an important effect on the representations that are learned in

different languages.

Chang, F., Dell, G. S., & Bock, K. (2006). Becoming syntactic. Psychological Review, 113, 2, 234-272

Psycholinguistic research has shown that the influence of abstract syntactic knowledge on performance is shaped by particular sentences that have been experienced. To explore this idea, the authors applied a connectionist model of sentence production to the development and use of abstract syntax. The model makes use of (a) error-based learning to acquire and adapt sequencing mechanisms and (b) meaning–form mappings to derive syntactic representations. The model is able to account for most of what is known about structural priming in adult speakers, as well as key findings in preferential looking and elicited production studies of language acquisition. The model suggests how abstract knowledge and concrete experience are balanced in the development and use of syntax.

Chang, F. (2002) Symbolically speaking: A connectionist model of sentence production. Cognitive Science, 26(5), 609-651.

The ability to combine words into novel sentences has been used to argue that humans have symbolic language production abilities. Critiques of connectionist models of language often center on the inability of these models to generalize symbolically (Fodor & Pylyshyn, 1988; Marcus, 1998). To address these issues, a connectionist model of sentence production was developed. The model had variables (role-concept bindings) that were inspired by spatial representations ( Landau & Jackendoff, 1993). In order to take advantage of these variables, a novel dual-pathway architecture with event semantics is proposed and shown to be better at symbolic generalization than several variants. This architecture has one pathway for mapping message content to words and a separate pathway that enforces sequencing constraints. Analysis of the model’s hidden units demonstrated that the model learned different types of information in each pathway, and that the model’s compositional behavior arose from the combination of these two pathways. The model’s ability to balance symbolic and statistical behavior in syntax acquisition and to model aphasic double dissociations provided independent support for the dual-pathway architecture.

Here is another figure for the model