Criticality in the human voice
Speech is a distinctive complex feature of human capabilities. In order to understand the physics underlying speech production, we empirically analyse the statistics of large human speech datasets ranging several languages. We first show that during speech, the energy is unevenly released and power-law distributed, reporting a universal robust Gutenberg–Richter-like law in speech. We further show that such ‘earthquakes in speech’ show temporal correlations, as the interevent statistics are again power-law distributed. As this feature takes place in the intraphoneme range, we conjecture that the process responsible for this complex phenomenon is not cognitive, but it resides in the physiological (mechanical) mechanisms of speech production. Moreover, we show that these waiting time distributions are scale invariant under a renormalization group transformation, suggesting that the process of speech generation is indeed operating close to a critical point. These results are put in contrast with current paradigms in speech processing, which point towards low dimensional deterministic chaos as the origin of nonlinear traits in speech fluctuations. As these latter fluctuations are indeed the aspects that humanize synthetic speech, these findings may have an impact in future speech synthesis technologies. Results are robust and independent of the communication language or the number of speakers, pointing towards a universal pattern and yet another hint of complexity in human speech.
Key papers
Scaling and universality in the human voice
Jordi Luque, Bartolo Luque and Lucas Lacasa
Journal of the Royal Society Interface 12, 20141344 (2015)
Origin of Linguistic laws
Linguistic laws constitute one of the quantitative cornerstones of modern cognitive sciences and have been routinely investigated in written corpora, or in the equivalent transcription of oral corpora. This means that inferences of statistical patterns of language in acoustics are biased by the arbitrary, language-dependent segmentation of the signal, and virtually precludes the possibility of making comparative studies between human voice and other animal communication systems. Here we bridge this gap by proposing a method that allows to measure such patterns in acoustic signals of arbitrary origin, without needs to have access to the language corpus underneath. The method has been applied to sixteen different human languages, recovering successfully some well-known laws of human communication at timescales even below the phoneme and finding yet another link between complexity and criticality in a biological system. These methods further pave the way for new comparative studies in animal communication or the analysis of signals of unknown code.
On the other hand, physical manifestations of linguistic units include sources of variability due to factors of speech production which are by definition excluded from counts of linguistic symbols. We also examine whether linguistic laws hold with respect to the physical manifestations of linguistic units in spoken English. The data we analyse come from a phonetically transcribed database of acoustic recordings of spontaneous speech known as the Buckeye Speech corpus. First, we verify with unprecedented accuracy that acoustically transcribed durations of linguistic units at several scales comply with a lognormal distribution, and we quantitatively justify this ‘lognormality law’ using a stochastic generative model. Second, we explore the four classical linguistic laws (Zipf’s Law, Herdan’s Law, Brevity Law and Menzerath–Altmann’s Law (MAL)) in oral communication, both in physical units and in symbolic units measured in the speech transcriptions, and find that the validity of these laws is typically stronger when using physical units than in their symbolic counterpart. Additional results include (i) coining a Herdan’s Law in physical units, (ii) a precise mathematical formulation of Brevity Law, which we show to be connected to optimal compression principles in information theory and allows to formulate and validate yet another law which we call the size-rank law or (iii) a mathematical derivation of MAL which also highlights an additional regime where the law is inverted. Altogether, these results support the hypothesis that statistical laws in language have a physical origin.
Key papers
Ivan Gonzalez Torre, Bartolo Luque, Lucas Lacasa, Jordi Luque and Antoni Hernandez-Fernandez
NPG Scientific Reports 7, 43862 (2017)
Featured in Research News Portal, Technical University of Madrid
Ivan G. Torre, Bartolo Luque, Lucas Lacasa, Christopher T. Kello, Toni Hernandez-Fernandez
Royal Society Open Science 6, 8 (2019)
Antoni Hernandez-Fernandez, Ivan G. Torre, Juan Maria Garrido, Lucas Lacasa
Entropy 21, 12 (2019)
Collective dynamics of orthographic norm adoption
Social conventions, such as shaking hands or dressing formally, allow us to coordinate smoothly and, once established, appear to be natural. But what happens when a new convention replaces an old one? This question has remained largely unanswered so far, due to the lack of suitable data. Here, we investigate the process of norm change by looking at 2,541 linguistic norm shifts occurring over the last two centuries in English and Spanish. We identify different patterns of norm adoption depending on whether the change is spontaneous or driven by a centralized institution, and we propose a simple model that reproduces all of the empirical observations. These results shed light on the cultural evolution of linguistic norms and human collective behavior.
Key papers
The dynamics of norm change in the cultural evolution of language
Roberta Amato, Lucas Lacasa, Albert Diaz-Guilera, Andrea Baronchelli
PNAS 115, 33 (2018)
Featured in City University Press QMUL Press Release Le Scienze (italian version of Scientific American) Madri+D