Publications

Ludusan, B. and Wagner, P. (2023). The effect of conversation type on entrainment: Evidence from laughter, in Proceedings of the Annual Meeting of the Special Interest Group on Discourse and Dialogue, pp. 168-174, Prague, Czechia.

Ludusan, B. (2023). The usefulness of phonetically-motivated features for automatic laughter detection, in Proceedings of the Disfluency in Spontaneous Speech (DiSS) Workshop, pp. 33-37, Bielefeld, Germany.

Ludusan, B., Schröer, M., Rossi, M., and Wagner, P. (2023). The co-use of laughter and head gestures across speech styles, in Proceedings of the Annual Conference of the International Speech Communication Association – INTERSPEECH, pp. 3592-3596, Dublin, Ireland.

de Seyssel, M., Lavechin, M., Titeux, H., Thomas, A., Virlet, G., Santos Revilla, A., Wisniewski, G., Ludusan, B., and Dupoux, E. (2023). ProsAudit, a prosodic benchmark for self-supervised speech models, in Proceedings of the Annual Conference of the International Speech Communication Association – INTERSPEECH, pp. 2963-2967, Dublin, Ireland.

Ludusan, B., Heldner, M. and Włodarczak, M. (2023). Exploring the role of formant frequencies in the classification of phonation type, in Proceedings of the International Congress of Phonetic Sciences, pp. 1726-1730, Prague, Czechia.

Rossi, M., Schröer, M., Ludusan, B., and Zellers, M. (2023). A multimodal account of listener feedback in face-to-face interactions, in Proceedings of the International Congress of Phonetic Sciences, pp. 4120-4124, Prague, Czechia.

Włodarczak, M., Ludusan, B., Sundberg, J., and Heldner, M. (2022). Classification of voice quality using neck-surface acceleration: Comparison with glottal flow and radiated sound, Journal of Voice.

Ludusan, B., Schröer, M., and Wagner, P. (2022). Investigating phonetic convergence of laughter in conversation, in Proceedings of the Annual Conference of the International Speech Communication Association – INTERSPEECH, pp. 1332-1336, Incheon, South Korea.

Ludusan, B. and Schuppler, B. (2022). To laugh or not to laugh? The use of laughter to mark discourse structure, in Proceedings of the Annual Meeting of the Special Interest Group on Discourse and Dialogue, pp. 76-82, Edinburgh, United Kingdom.

Ludusan, B. and Schuppler, B. (2022). An analysis of prosodic boundaries across speaking styles in two varieties of German, Speech Communication, 141, 93-106.

Ludusan, B. and Wagner, P. (2022). ha-HA-hha? Intensity and voice quality characteristics of laughter, in Proceedings of the International Conference on Speech Prosody, pp. 560-564, Lisbon, Portugal.

de Seyssel, M., Wisniewski, G., Dupoux, E., and Ludusan, B. (2022). Investigating the usefulness of i-vectors for automatic language characterization, in Proceedings of the International Conference on Speech Prosody, pp. 460-464, Lisbon, Portugal.

Ludusan, B., Cristia, A., Mazuka, R. and Dupoux, E. (2022). How much does prosody help word segmentation? A simulation study on infant-directed speech, Cognition, 219, 104961.

Ludusan, B. and Wagner, P. (2022). Laughter entrainment in dyadic interactions: Temporal distribution and form, Speech Communication, 136, 42-52.

Ludusan, B., Mori, M., Minagawa, Y. and Dupoux, E. (2021). The effect of different information sources on prosodic boundary perception, JASA Express Letters, 1(11), 115203.

Ludusan, B., Wagner, P. and Włodarczak, M. (2021). Cue interaction in the perception of prosodic prominence: the role of voice quality, in Proceedings of the Annual Conference of the International Speech Communication Association - INTERSPEECH, pp. 1006-1010, Brno, Czechia.

Ludusan, B., Mazuka, R. and Dupoux, E. (2021). Does infant-directed speech help phonetic learning? A machine learning investigation, Cognitive Science, 45(5), e12946.

Ludusan, B. and Wagner, P. (2021). Knock-knock! Who's there? The laughter-enhanced virtual real-estate agent, in Proceedings of the Conference on Electronic Speech Signal Processing, pp. 281-288, Berlin, Germany.

Ludusan, B. and Wagner, P. (2020). An evaluation of manual and semi-automatic laughter annotation, in Proceedings of the Annual Conference of the International Speech Communication Association - INTERSPEECH, pp. 621-625, Shanghai, China.

Ludusan, B., Wesemann, M. and Wagner, P. (2020). A distributional analysis of laughter across turns and utterances, in Proceedings of the Laughter and Other Non-Verbal Vocalisations Workshop, pp. 28-31, Bielefeld, Germany.

Schuppler, B. and Ludusan, B. (2020). An analysis of prosodic boundary detection in German and Austrian German read speech, in Proceedings of the International Conference on Speech Prosody, pp. 990-994, Tokyo, Japan.

Ludusan, B. and Wagner, P. (2020). Speech, laughter and everything in between: A modulation spectrum-based analysis, in Proceedings of the International Conference on Speech Prosody, pp. 995-999, Tokyo, Japan.

Ludusan, B. and Wagner, P. (2019). Laughter dynamics in dyadic conversations, in Proceedings of the Annual Conference of the International Speech Communication Association - INTERSPEECH, pp. 524-528, Graz, Austria.

Ludusan, B., Jorschick, A. and Mazuka, R. (2019). Nasal consonant discrimination in infant- and adult-directed speech, in Proceedings of the Annual Conference of the International Speech Communication Association - INTERSPEECH, pp. 3584-3588, Graz, Austria.

Ludusan, B. and Wagner, P. (2019). No laughing matter: An investigation into the acoustic cues marking the use of laughter, in Proceedings of the International Congress of Phonetic Sciences, pp. 2179-2182, Melbourne, Australia.

Wagner, P., Bryhadyr, N., Schröer, M. and Ludusan, B. (2019). Does information-structural acoustic prosody change under different visibility conditions?, in Proceedings of the International Congress of Phonetic Sciences, pp. 1575-1579, Melbourne, Australia.

Guevara-Rukoz, A., Cristia, A., Ludusan, B., Thiollière, R., Martin, A., Mazuka, R. and Dupoux, E. (2018). Are words easier to learn from infant- than adult-directed speech? A quantitative corpus-based investigation, Cognitive Science, 42(5), 1586-1617.

Ludusan, B., Mazuka, R., Bernard, M., Cristia, A. and Dupoux, E. (2017). The role of prosody and speech register in word segmentation: A computational modelling perspective, in Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), pp. 178-183, Vancouver, Canada.

Ludusan, B., Cristia, A., Martin, A., Mazuka, R. and Dupoux, E. (2016). Learnability of prosodic boundaries: Is infant-directed speech easier?, The Journal of the Acoustical Society of America, 140(2), 1239-1250. 

Ludusan, B. and Dupoux, E. (2016). The role of prosodic boundaries in word discovery: Evidence from a computational model, The Journal of the Acoustical Society of America, 140(1), EL1-EL6.

Ludusan, B. and Dupoux, E. (2016). Automatic syllable segmentation using broad phonetic class information, in Proceedings of the Workshop on Spoken Language Technologies for Under-Resourced Languages, pp. 101-106, Yogyakarta, Indonesia.

Ludusan, B., Origlia, A. and Dupoux, E. (2015). Rhythm-based syllabic stress learning without labelled data, in Proceedings of the International Conference on Statistical Language and Speech Processing, pp. 185-196, Budapest, Hungary. 

Ludusan, B., Caranica, A., Cucu, H., Buzo, A., Burileanu, C. and Dupoux, E. (2015). Exploring multi-language resources for unsupervised spoken term discovery, in Proceedings of the International Conference on Speech Technology and Human-Computer Dialogue, pp. 1-6, Bucharest, Romania. 

Ludusan, B., Seidl, A., Dupoux, E. and Cristia, A. (2015). Motif discovery in infant- and adult-directed speech, in Proceedings of the Workshop on Cognitive Aspects of Computational Language Learning, pp. 93-102, Lisbon, Portugal.

Ludusan, B. and Dupoux, E. (2015). A multilingual study on intensity as a cue for marking prosodic boundaries, in Proceedings of the International Congress of Phonetic Sciences, paper 982, Glasgow, UK.

Wagner, P., Origlia, A., Avesani, C., Christodoulides, G., Cutugno, F., D’Imperio, M., Escudero Mancebo, D., Gili Fivela, B., Lacheret, A., Ludusan, B., Moniz, H., Ní Chasaide, A., Niebuhr, O., Rousier-Vercruyssen, L., Simon, A-C., Šimko, J., Tesser, F. and Vainio, M. (2015). Different parts of the same elephant: A roadmap to disentangle and connect different perspectives on prosodic prominence, in Proceedings of the International Congress of Phonetic Sciences, paper 202, Glasgow, UK.

Ludusan, B., Synnaeve, G. and Dupoux, E. (2015). Prosodic boundary information helps unsupervised word segmentation, in Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp. 953-963, Denver, USA. 

Ludusan, B., Versteegh, M., Jansen, A., Gravier, G., Cao, X-N., Johnson, M. and Dupoux, E. (2014). Bridging the gap between speech technology and natural language processing: an evaluation toolbox for term discovery systems, in Proceedings of the International Conference on Language Resources and Evaluation, pp. 560-567, Reykjavik, Iceland.

Ludusan, B., Gravier, G. and Dupoux, E. (2014). Incorporating Prosodic Boundaries in Unsupervised Term Discovery, in Proceedings of the International Conference on Speech Prosody, pp. 939-943, Dublin, Ireland.

Ludusan, B., Ziegler, S. and Gravier, G. (2014). Is Syllable Stress Information Robust for ASR in Adverse Conditions?, in Proceedings of the International Conference on Speech Prosody, pp. 207-211, Dublin, Ireland. 

Ludusan, B. and Dupoux, E. (2014). Towards low-resource prosodic boundary detection, in Proceedings of the Workshop on Spoken Language Technologies for Under-Resourced Languages, pp. 231-237, Saint Petersburg, Russia.

Ludusan, B. (2013). UNINA System for the EVALITA 2011 Forced Alignment Task, in Evaluation of Natural Language and Speech Tools for Italian, pp. 330-337, Springer Berlin Heidelberg.

Ziegler, S., Ludusan, B. and Gravier, G. (2012). Towards a New Speech Detection Approach for Landmark-Driven Speech Recognition, in Proceedings of IEEE Workshop on Spoken Language Technology, pp. 342-347, Miami Beach, USA. 

Ludusan, B., Ziegler, S. and Gravier, G. (2012). Integrating Stress Information in Large Vocabulary Continuous Speech Recognition, in Proceedings of the Annual Conference of the International Speech Communication Association - INTERSPEECH, pp. 2642-2645, Portland, USA.

Ziegler, S., Ludusan, B. and Gravier, G. (2012). Using Broad Phonetic Classes to Guide Search in Automatic Speech Recognition, in Proceedings of the Annual Conference of the International Speech Communication Association - INTERSPEECH, pp. 1023-1026, Portland, USA.

Cutugno, F., Leone, E., Ludusan, B. and Origlia, A. (2012). Investigating Syllabic Prominence with Conditional Random Fields and Latent-Dynamic Conditional Random Fields, in Proceedings of the Annual Conference of the International Speech Communication Association - INTERSPEECH, pp. 2402-2405, Portland, USA.

Leone, E., Origlia, A. and Ludusan, B. (2012). Conditional random fields come strumento di indagine per la rilevazione automatica di prominenze sillabiche, in Proceedings of the National Conference of the Associazione Italiana di Scienze della Voce, pp. 107-113, Rome, Italy.

Ludusan, B., Origlia, A. and Cutugno, F. (2011). On the use of the rhythmogram for automatic syllabic prominence detection, in Proceedings of the Annual Conference of the International Speech Communication Association - INTERSPEECH, pp. 2413-2416, Florence, Italy.

Origlia, A., Abete, G., Cutugno, F., Alfano, I., Savy, R. and Ludusan, B. (2011). A divide et impera algorithm for optimal pitch stylization, in Proceedings of the Annual Conference of the International Speech Communication Association - INTERSPEECH, pp. 1993-1996, Florence, Italy.

Cangemi, F., Cutugno, F., Ludusan, B., Seppi, D. and van Compernolle, D. (2011). Automatic speech segmentation for Italian: tools, models, evaluation and applications, in Proceedings of the National Conference of the Associazione Italiana di Scienze della Voce, Lecce, Italy.

Origlia, A., Galatà, V. and Ludusan, B. (2010). Automatic classification of emotions via global and local prosodic features on a multilingual emotional database, in Proceedings of the International Conference on Speech Prosody, paper 213, Chicago, USA.

Ludusan, B., Origlia, A. and Cutugno, F. (2010). Syllable classification using static matrices and prosodic features, in Proceedings of the International Conference on Speech Prosody, paper 830, Chicago, USA.

Abete, G., Cutugno, F., Ludusan, B. and Origlia, A. (2010). Pitch behavior detection for automatic prominence recognition, in Proceedings of the International Conference on Speech Prosody, paper 2001, Chicago, USA.

Piccolino-Boniforti, M.A., Ludusan, B., Hawkins, S. and Norris, D. (2010). Same phonemic sequence, different acoustic pattern and grammatical status. A model, in Proceedings of the National Conference of the Associazione Italiana di Scienze della Voce, pp. 279-292, Naples, Italy.

Cutugno, F., Ludusan, B., Origlia, A. and Soldo, S. (2009). Connected Digits Recognition Using a Syllable-Based ASR System, in Poster and Workshop Proceedings of the Conference of the Italian Association for Artificial Intelligence, ISBN 978-88-903581-1-1, Reggio Emilia, Italy.

Ludusan, B. and Soldo, S. (2009). Sonority-based syllable segmentation, in Proceedings of the National Conference of the Associazione Italiana di Scienze della Voce, pp. 699-706, Zurich, Switzerland.

Soldo, S. and Ludusan, B. (2009). Statico vs dinamico, un possibile ruolo della sillaba nel riconoscimento automatico del parlato, in Proceedings of the National Conference of the Associazione Italiana di Scienze della Voce, pp. 707-714, Zurich, Switzerland.