When does a word end and another one begin?
Language acquisition is one of the most complex tasks that humans face. The difficulty of this task can be appreciated if we consider, as adults, how puzzling it is to hear a conversation in a totally unknown foreign language. Since speech is a continuous flow of sounds, swe have no idea how many words have been spoken. Spoken words are rarely produced in isolation, and contrary to written words, they are not separated by silences but are more commonly integrated within sentences. How can young children succeed at extracting words from the running speech? How do they tackle this challenging task and build their lexicon?
From early on, infants have been shown to use different strategies to successfully identify word boundaries. Their experiences are crucial, since segmentation cues are complex and language dependent. With Rushen Shi, I have investigated how French-learning infants and adults resolve lexical ambiguities when word boundaries are unclear due to a potential liaison.
In French, we find a phonological process called liaison that leads to the intrusion of a set of consonants before vowel-initial words. In liaison, the underlying consonant of a word surfaces as the syllabic onset of the following vowel-initial word, as in, petit /t/éléphant, un /n/éléphant (“small elephant”, “an elephant”). The non-alignment of the word boundary and the syllable boundary yields a potential challenge for the learner. Is it a néléphant, a téléphant, a zéléphant, or ...an éléphant?
As of today, I have published 5 journal papers and 3 conference proceedings on this topic. Follow this link to access my thesis (written in French).
Part of my work has investigated how learners deal with conflicting cues to word boundaries when encountering novel words in liaison contexts. I found that, before the age of two, infants are guided by a syllabic-alignment bias, interpreting the liaison consonant as the onset of a novel word (e.g. gros /z/onches "big onches"-> infants misinterpret 'zonche' as the word; Babineau & Shi, 2014; 2011). Soon after, sensitivity to the distributional patterns of liaison develops; enabling infants to parse the correct form from variable liaison contexts (e.g., parse the novel word onche after hearing it in various contexts such as un /n/onche, petit /t/onche, ses /z/onches). When exposed to regular minimal pair words, 24-month-olds do not parse a vowel-initial form, e.g. when the variable consonants surfacing as onsets are not linked to liaison (e.g. guonche, vonche, chonche) or when the syntactic context does not trigger these particular liaison consonants (e.g. un zonche, gros nonche, premier tonche; Babineau, Emond, & Shi, 2023). This shows that through extensive exposure to liaison, and with the help of a growing lexicon, infants become aware of the fact that a small group of liaison consonants surface in front of vowel-initial words (e.g. zarbre/narbre for “tree”, zours/nours for “bear”). These consonants gain a special status. Yet, children have a careful approach: they take into account the larger context in which these special consonants appear when they interpret the phonological form of novel words.
When do children master liaison? This is a hard question, as there are a lot of interindividual variations. Despite of this, we have found that the process of building a representation for a new word is more advanced around 30 months of age, when toddlers can clearly take into account its syntactic context (Babineau & Shi, 2013; 2016). Around the age of three, toddlers have more precise expectations regarding the surfacing of certain liaison consonants (e.g. the singular determiner un "a" has an underlying /n/). Nonetheless, revisiting the phonological representation of words takes time. In Babineau, Legrand & Shi (2021), we found that 30-month-old French-learning children’s lexicons contain multiple variants for familiar words subject to liaison, especially frequent variants (e.g., the word elephant can be accessed through different forms such as éléphant, téléphant, zéléphant). Liaison-onset variants, which were most likely encoded at an early age, remain in their lexicon for some time.
What about adults? Can they tell apart homophonous sequences linked to liaison (e.g., un /n/air "a melody" vs un nerf "a nerve") or interpret novel words correctly? In Babineau, Shi and Achim (2017; see also Babineau & Shi, 2013), we found that segmentation of liaison ambiguous phrases remains challenging for adult native speakers of French, who are influenced by both the syntactic context (determiner vs adjective) as well as the frequency of the onset consonant as a liaison.
Babineau, M., Emond, E., & Shi, R. (2023). Uncovering the limits of sub-syllabic statistical word segmentation: The case of French liaison. Infancy, 28(2), 301-321. https://onlinelibrary.wiley.com/doi/10.1111/infa.12510
Babineau, M., Legrand, C., & Shi, R. (2021). Variable forms in French-learning toddlers’ lexical representations. Developmental Psychology, 57(4), 457-470. https://doi.org/10.1037/dev0001157
Babineau, M., Shi, R., & Achim, A. (2017). Contextual factors in lexical processing: the case of French liaison. Language, Cognition and Neuroscience, 32, 457-470. DOI: 10.1080/23273798.2016.1239022
Babineau, M., & Shi, R. (2016). Development of liaison representation and its top-down influence on word processing in infants. Language Learning and Development, 12, 482-498. DOI: 10.1080/15475441.2016.1170604
Babineau, M., & Shi, R. (2014). Distributional cues and the onset bias in early word segmentation. Developmental Psychology, 50(12), 2666-2674. DOI: 10.1037/a0038105
Babineau, M., & Shi, R. (2013). Ambiguity related to French liaisons: The role of bottom-up and top-down processes. In Proceedings of the 165th Meeting of the Acoustical Society of America. Vol. 19, 060123, 1-8. DOI: 10.1121/1.4805872 (pdf)
Babineau, M., & Shi, R. (2013). Acoustical cues versus top-down bias in infants’ parsing. In Proceedings of the 165th Meeting of the Acoustical Society of America. Vol. 19, 060073, 1-7. DOI: 10.1121/1.4805623 (pdf)
Babineau, M., & Shi, R. (2011). Processing of French liaisons in toddlers. In BUCLD 35: Proceedings of the 35th annual Boston University conference on language development. Boston, MA: Cascadilla Press, Vol. 1, 25-37. (pdf)