Wim Pouw, Mark Dingemanse & Asli Özyürek

Gesture network analysis reveals structural properties of change in evolving languages in the lab

We introduce gesture network analysis (Pouw & Dixon, 2019) as a quantitative, reproducible, automatable method for understanding how communicative bodily expressions undergo structural changes during simulated language evolution.

Iterated learning paradigms have been used to understand the constraints of evolving communication systems (e.g., Motamedi, Schouwstra, Smith, Culbertson, & Kirby, 2019; Raviv, Meyer, & Lev-Ari, 2019; Sato, Schouwstra, Flaherty, & Kirby, 2019). Such experimental paradigms aim to simulate evolving language dynamics that naturally occur in larger populations and on longer time scales, e.g., the evolution of sign language over generations of users (e.g., de Vos and Nyst, 2018; Senghas, Kita, & Özyürek, 2004; Sandler, Aronoff, Meir & Padden, 2011).

The promise of iterated learning paradigms is exemplified by experiment 1 of Motamedi et al. (2019), wherein pairs of participants learn to associate a set of concepts (e.g., hand-cuffs, prison etc.) with silent gestures. The first generation of participants learns these silent gestures from seed participants asked to spontaneously produce silent gestures for a set of concepts. After learning the gesture, the pair belonging to the first generation is asked to interact in a director-matcher task to solidify their gestures to be communicated to a next generation of participants. This procedure is repeated up to the 5th generation for 5 unique chains of participants. Systematicity, measured in terms of Shannon entropy of hand-coded gesture content, decreased over generations, indicating simpler gesture content being used as the communication system evolved.

Recently Pouw & Dixon (2019) have showcased the potential of combining time series analysis with network analysis so as to gain understanding how kinematics of different gestures interrelate in structural ways on the level of a gesture ensemble (a larger set of gestures). The method uses dynamic timewarping (e.g., Sato et al., 2019) to measure the kinematic similarity of gesture trajectories, and based on this, builds distance matrices in which each gesture is compared to each other gesture. It then uses network visualization and topology and complexity measures to probe structural properties of gesture networks.

We used gesture network analysis to reanalyze the data of Motamedi et al. )2019; experiment 1). We extracted kinematic positional data for both hands and the head with Openpose (Cao, Simon, Wei, & Sheikh, 2017) and performed dependent multivariate Dynamic Time Warping (Giorgino, 2009) on each gesture pair, retrieving a distance measure of how well the trajectory of each gesture matches the trajectory of another gesture. Then we constructed distance matrices for networks of gestures within a generation and probed whether the structure of these networks changed over the generations. Through hierarchical clustering analysis we find that gesture networks become less cluttered (i.e., agglomerative clustering coefficient increases, p = .012), suggesting that gestures become less similar to their nearest neighbors over the generations. Additionally, we find that Shannon entropy of the network connections go down, suggesting that the gestures kinematic relations become more similar to each other (p = .008), i.e., increases in systematicity.

Our analyses show how gestural kinematics evolve in structural ways in language evolution. As a fully automated, reproducible method, gesture network analysis can be scaled to very large datasets, supplementing hand-coded procedures. More importantly however, gesture network analysis provides new ways to show how structure may emerge within the interrelationships of communicative tokens. We will discuss further analyses in our presentation, as well as shortcomings and promising avenues of application. Shortcomings of the current analysis are that it will judge a gesture to be different from another gesture if the order of otherwise identical segments are reversed in another gesture. A human rater might judge such gesture phrases to be similar in form. Additionally, this analysis is only sensitive to changes in gesture form (kinematic trajectories), not content.

Note Figure.Example networks are shown for generation 1 versus 5 (for chain 5) wherein each node in a network represents a gesture for a token (e.g., haircut) from a participant with the token superimposed in blue text, and edgedistance representing dissimilarity between gestures. Several data trends are shown. For each chain 1-5 the changes in the clustering and average entropy are shown. These gesture networks show a decrease in entropy, which follows patterns that were obtained for the hand-coded gesture semantic analysis (Motamedi et al., 2019). The decrease of entropy over generations is strongly related to degree of clustering in gesture networks. Clutter decreases when distance between gesture forms increases. These decorrelations of gesture’s form may suggest a drive for optimal discriminability (Dingemanse, Blasi, Lupyan, Christiansen, Monaghan, 2015).

ReferencesDingemanse, M., Blasi, D. E., Lupyan, G., Christiansen, M. H., & Monaghan, P. (2015). Arbitrariness, iconicity and systematicity in language. Trends in Cognitive Sciences, 19(10), 603–615. doi: 10.1016/j.tics.2015.07.013Cao, Z., Simon, T., Wei, S.-E., & Sheikh, Y. (2017). Realtime Multi-Person 2D Pose Estimation Using Part Affinity Fields. 7291–7299. Retrieved fromhttp://openaccess.thecvf.com/content_cvpr_2017/html/Cao_Realtime_Multi- Person_2D_CVPR_2017_paper.htmlde Vos, C. & Nyst, V. (2018). The Time Depth and Typology of Rural Sign Languages. Sign Language Studies, 18(4), 477–487.Giorgino, T. (2009). Computing and Visualizing Dynamic Time Warping Alignments in R: The dtw Package. Journal of Statistical Software, 31(7).https://doi.org/10.18637/jss.v031.i07 Motamedi, Y., Schouwstra, M., Smith, K., Culbertson, J., & Kirby, S. (2019). Evolving artificial sign languages in the lab: From improvised gesture to systematic sign. Cognition, 192, 103964. https://doi.org/10.1016/j.cognition.2019.05.001Pouw, W., & Dixon, J. A. (2019). Gesture networks: Introducing dynamic time warping and network analyses for the kinematic study of gesture ensembles. Discourse Processes. https://doi.org/10.1080/0163853X.2019.1678967Raviv, L., Meyer, A., & Lev-Ari, S. (2019). Larger communities create more systematic languages. Proceedings of the Royal Society B: Biological Sciences, 286(1907), 20191262. https://doi.org/10.1098/rspb.2019.1262Sato, A., Schouwstra, M., Flaherty, M., & Kirby, S. (2019). Do all aspects of learning benefit from iconicity? Evidence from motion capture. Language and Cognition, 1–20. https://doi.org/10.1017/langcog.2019.37Sandler, W., Aronoff, M., Meir, I. & Padden, C. (2011). The gradual emergence of phonological form in a new language. Natural Language & Linguistic Theory 29, 503–543.Senghas, A., Kita, S., & Özyürek, A. (2004). Children creating core properties of language: Evidence from an emerging sign language in Nicaragua. Science, 305(5691), 1779–1782. https://doi.org/10.1126/science.1100199