Publications
2024
â– Â The First VoicePrivacy Attacker Challenge Evaluation PlanÂ
Natalia Tomashenko, Xiaoxiao Miao, Emmanuel Vincent, Junichi Yamagishi
â– Â Anonymizing Speaker Voices: Easy to Imitate, Difficult to Recognize?Â
Jennifer Williams, Karla Pizzi, Natalia Tomashenko, Sneha Das
ICASSP 2024
[IEEE Xplore]
â– Â The VoicePrivacy 2022 Challenge: Progress and Perspectives in Voice AnonymisationÂ
Michele Panariello*, Â Natalia Tomashenko*, Xin Wang*, Xiaoxiao Miao, Pierre Champion, Hubert Nourtel, Massimiliano Todisco, Nicholas Evans, Emmanuel Vincent, and Junichi Yamagishi (*-equal contribution)
IEEE Transactions on Audio, Speech and Language Processing
[arxiv][IEEE Xplore ]
â– Â The VoicePrivacy 2024 Challenge Evaluation PlanÂ
Natalia Tomashenko, Xiaoxiao Miao, Pierre Champion, Sarina Meyer, Xin Wang, Emmanuel Vincent, Michele Panariello, Nicholas Evans, Junichi Yamagishi, Massimiliano Todisco
2023
â– Â Federated learning for ASR based on wav2vec 2.0Â
Tuan Nguyen, Salima Mdhaffar, Natalia Tomashenko, Jean-François Bonastre, Yannick Estève
ICASSP 2023
[arxiv][IEEE Xplore]
â– Â Speaker anonymization using orthogonal Householder neural networkÂ
Xiaoxiao Miao, Xin Wang, Erica Cooper, Junichi Yamagishi, Natalia Tomashenko
IEEE Transactions on Audio, Speech and Language Processing
[arxiv][IEEE Xplore ]
â– Â Evaluating the effects of task design on unfamiliar Francophone listener and automatic speaker identification performance Â
Benjamin O'Brien, Christine Meunier, Natalia Tomashenko,  Alain Ghio, Jean-François Bonastre
Multimedia Tools and Applications Journal
â– Â LeBenchmark 2.0: a Standardized, Replicable and Enhanced Framework for Self-supervised Representations of French Speech Â
Titouan Parcollet, Ha Nguyen, Solene Evain, Marcely Zanon Boito, Adrien Pupier, Salima Mdhaffar, Hang Le, Sina Alisamir, Natalia Tomashenko, Marco Dinarelli, Shucong Zhang, Alexandre Allauzen, Maximin Coavoux, Yannick Esteve, Mickael Rouvier, Jerome Goulian, Benjamin Lecouteux, Francois Portet, Solange Rossato, Fabien Ringeval, Didier Schwab, Laurent Besacier
Computer Speech and Language (journal, published in 2024)
[Computer Speech & Language] [arxiv]Â
â– Â Metrics for anonymization of unstructured datasetsÂ
Lydia Belkadi, Martine De Cock, Natasha Fernandes, Katherine Lee, Christina Lohr, Olga Ohrimenko, Andreas Nautsch, Laurens Sion, Natalia Tomashenko, Marc Tommasi, Peggy Valcke, Emmanuel Vincent
Report from Dagstuhl Seminar 21052: Privacy in Speech and Language Technology
[report]
2022
â– Â Privacy attacks for automatic speech recognition acoustic models in a federated learning frameworkÂ
Natalia Tomashenko, Salima Mdhaffar, Marc Tommasi, Yannick Estève, Jean-François Bonastre
ICASSP 2022
[arxiv][poster][IEEE Xplore]
â– Â Retrieving Speaker Information from Personalized Acoustic Models for Speech RecognitionÂ
Salima Mdhaffar, Jean-François Bonastre, Marc Tommasi, Natalia Tomashenko, Yannick Estève
ICASSP 2022
[arxiv]
â– The VoicePrivacy 2022 Challenge evaluation plan
Natalia Tomashenko, Xin Wang, Xiaoxiao Miao, Hubert Nourtel, Pierre Champion, Massimiliano Todisco, Emmanuel Vincent, Nicholas Evans, Junichi Yamagishi, Jean-François Bonastre
[arxiv]
â– Language-Independent Speaker Anonymization Approach using Self-Supervised Pre-Trained ModelsÂ
Xiaoxiao Miao, Xin Wang, Erica Cooper, Junichi Yamagishi, Natalia Tomashenko
[arxiv]
â– Â Analyzing Language-Independent Speaker Anonymization Framework under Unseen ConditionsÂ
Xiaoxiao Miao, Xin Wang, Erica Cooper, Junichi Yamagishi, Natalia Tomashenko
Interspeech 2022
[arxiv]
â– Â A Study of Gender Impact in Self-supervised Models for Speech-to-Text SystemsÂ
Marcely Zanon Boito, Laurent Besacier, Natalia Tomashenko, Yannick Estève
Interspeech 2022
[arxiv]
â– Â Multi-lingual Speech to Speech Translation for Under-Resourced LanguagesÂ
Anthony Larcher, Yannick Estève, Mickael Rouvier, Natalia Tomashenko, Jarod Duret, et al.
ESPERANTO/JSALT workshop report
[hal]
■ Sur la vérification du locuteur à partir de traces d’exécution de modèles acoustiques personnalisés
Natalia Tomashenko, Salima Mdhaffar, Marc Tommasi, Yannick Estève, Jean-François Bonastre
Journées d'Études sur la Parole - JEP2022
[hal]
■Extraction d'informations liées au locuteur depuis un modèle acoustique personnalisé
Salima Mdhaffar, Jean-François Bonastre, Marc Tommasi, Natalia Tomashenko, Yannick Estève
Journées d'Études sur la Parole - JEP2022
■ Modèles neuronaux pré-appris par auto-supervision sur des enregistrements de parole en français
Solène Evain, Ha Nguyen, Hang Le, Marcely Zanon Boito, Salima Mdhaffar, Sina Alisamir, Ziyi Tong, Natalia Tomashenko, Marco Dinarelli, Titouan Parcollet, Alexandre Allauzen, Yannick Estève, Benjamin Lecouteux, François Portet, Solange Rossato, Fabien Ringeval, Didier Schwab and Laurent Besacier
Journées d'Études sur la Parole - JEP2022
■ LeBenchmark, un référentiel d'évaluation pour le français oral
Hang Le, Sina Alisamir, Marco Dinarelli, Fabien Ringeval, Solène Evain, Ha Nguyen, Marcely Zanon Boito, Salima Mdhaffar, Ziyi Tong, Natalia Tomashenko, Titouan Parcollet, Allauzen Alexandre, Yannick Estève, Benjamin Lecouteux, François Portet, Solange Rossato, Didier Schwab and Laurent Besacier
Journées d'Études sur la Parole - JEP2022
2021
â– Â The VoicePrivacy 2020 Challenge: Results and findings
Natalia Tomashenko, Xin Wang, Emmanuel Vincent, Jose Patino, Brij Mohan Lal Srivastava, Paul-Gauthier NoĂ©, Andreas Nautsch, Nicholas Evans, Junichi Yamagishi, Benjamin O'Brien, AnaĂŻs Chanclu, Jean-François Bonastre, Massimiliano Todisco, Mohamed Maouche.Â
Computer Speech and Language (journal, published in 2022)
[Computer Speech & Language ][arxiv]
â– Â Supplementary material to the paper: The VoicePrivacy 2020 Challenge: Results and findings
Natalia Tomashenko, Xin Wang, Emmanuel Vincent, Jose Patino, Brij Mohan Lal Srivastava, Paul-Gauthier NoĂ©, Andreas Nautsch, Nicholas Evans, Junichi Yamagishi, Benjamin O'Brien, AnaĂŻs Chanclu, Jean-François Bonastre, Massimiliano Todisco, Mohamed Maouche.Â
Report
[pdf]
â– Â Anonymous speaker clusters: Making distinctions between anonymised speech recordings with clustering interfaceÂ
Benjamin O'Brien, Natalia Tomashenko, Anaïs Chanclu, Jean-François Bonastre
Interspeech 2021
â– Â Towards a unified assessment framework of speech pseudonymisationÂ
Paul-Gauthier Noé, Andreas Nautsch, Nicholas Evans, Jose Patino, Jean-François Bonastre, Natalia Tomashenko, Driss Matrouf
Computer Speech & LanguageÂ
[CSL]
â– Â Benchmarking and challenges in security and privacy for voice biometricsÂ
Jean-Francois Bonastre, Hector Delgado, Nicholas Evans, Tomi Kinnunen, Kong Aik Lee, Xuechen Liu, Andreas Nautsch, Paul-Gauthier Noe, Jose Patino, Md Sahidullah, Brij Mohan Lal Srivastava, Massimiliano Todisco, Natalia Tomashenko, Emmanuel Vincent, Xin Wang, Junichi Yamagishi
ISCA Symposium on Security and Privacy in Speech CommunicationÂ
â– Â LeBenchmark: A Reproducible Framework for Assessing Self-Supervised Representation Learning from SpeechÂ
Solene Evain, Ha Nguyen, Hang Le, Marcely Zanon Boito, Salima Mdhaffar, Sina Alisamir, Ziyi Tong, Natalia Tomashenko, Marco Dinarelli, Titouan Parcollet, Alexandre Allauzen, Yannick Esteve, Benjamin Lecouteux, Francois Portet, Solange Rossato, Fabien Ringeval, Didier Schwab, Laurent Besacier
Interspeech 2021
â– Â Task Agnostic and Task Specific Self-Supervised Learning from Speech with LeBenchmarkÂ
Solene Evain, Ha Nguyen, Hang Le, Marcely Zanon Boito, Salima Mdhaffar, Sina Alisamir, Ziyi Tong, Natalia Tomashenko, Marco Dinarelli, Titouan Parcollet, Alexandre Allauzen, Yannick Esteve, Benjamin Lecouteux, Francois Portet, Solange Rossato, Fabien Ringeval, Didier Schwab, Laurent Besacier
NeurIPS Thirty-fifth Annual Conference on Neural Information Processing Systems 2021: Datasets and Benchmarks Track
[paper]Â [poster]
â– Â Speaker anonymisation using the McAdams coefficient
Jose Patino, Natalia Tomashenko, Massimiliano Todisco, Andreas Nautsch, Nicholas Evans.Â
Interspeech 2021
[ISCA archive] [arxiv]
â– Â Privacy and utility of x-vector based speaker anonymization
Brij M. L. Srivastava, Mohamed Maouche, Md Sahidullah, Emmanuel Vincent, AurĂ©lien Bellet, Marc Tommasi, Natalia Tomashenko, Xin Wang, Junichi Yamagishi.Â
IEEE/ACM Transactions on Audio, Speech, and Language Processing
■ ON-TRAC’ systems for the IWSLT 2021 low-resource speech translation and multilingual speech translation shared tasks
Le, H., Barbier, F., Nguyen, H., Tomashenko, N., Mdhaffar, S., Gahbiche, S., Fethi, B., Lecouteux, B., Schwab, D. and Estève, Y.Â
International Conference on Spoken Language Translation (IWSLT)
[paper]Â
2020
â– Â Introducing the VoicePrivacy initiative
Natalia Tomashenko, Brij Mohan Lal Srivastava, Xin Wang, Emmanuel Vincent, Andreas Nautsch, Junichi Yamagishi, Nicholas Evans, Jose Patino, Jean-François Bonastre, Paul-Gauthier NoĂ©, Massimiliano Todisco. Â
Interspeech 2020
[arxiv] [slides] [ISCA archive] [video] [code]
â– Â Design choices for x-vector based speaker anonymization
Brij Mohan Lal Srivastava, Natalia Tomashenko, Xin Wang, Emmanuel Vincent, Junichi Yamagishi, Mohamed Maouche, Aurélien Bellet, Marc Tommasi.
Interspeech 2020
[arxiv] [slides] [ISCA archive] [video] [code]
â– Â The Privacy ZEBRA: Zero Evidence Biometric Recognition Assessment
Andreas Nautsch, Jose Patino, Natalia Tomashenko, Junichi Yamagishi, Paul-Gauthier Noe, Jean-Francois Bonastre, Massimiliano Todisco, Nicholas Evans.
Interspeech 2020
[arxiv] [slides] [ISCA archive] [video] [code]
â– Â Speech Pseudonymisation Assessment Using Voice Similarity Matrices
Paul-Gauthier Noé, Jean-François Bonastre, Driss Matrouf, Natalia Tomashenko, Andreas Nautsch, Nicholas Evans.
Interspeech 2020
[arxiv] [slides] [ISCA archive] [video] [code]
â– Â Investigating Self-supervised Pre-training for End-to-end Speech Translation
Ha Nguyen, Fethi Bougares, Natalia Tomashenko, Yannick Estève, Laurent Besacier.
Interspeech 2020
[arxiv] [slides] [ISCA archive] [video]
â– Â Dialogue history integration into end-to-end signal-to-concept spoken language understanding systems
Natalia Tomashenko, Christian Raymond, Antoine Caubrière, Renato De Mori, Yannick Estève.
ICASSP 2020
[arxiv] [slides] [IEEE Xplore] [video]
â– Â Error Analysis Applied to End-to-End Spoken Language Understanding
Antoine Caubrière, Sahar Ghannay, Natalia Tomashenko, Renato De Mori, Antoine Laurent, Emmanuel Morin, Yannick Estève.Â
ICASSP 2020
[pdf] [slides] [IEEE Xplore] [video]
â– Â ON-TRAC Consortium for End-to-End and Simultaneous Speech Translation Challenge Tasks at IWSLT 2020
Maha Elbayad, Ha Nguyen, Fethi Bougares, Natalia Tomashenko, Antoine Caubrière, Benjamin Lecouteux, Yannick Estève, Laurent Besacier.Â
IWSLT 2020 (ACL)
â– Â Exploring Gaussian mixture model framework for speaker adaptation of deep neural network acoustic models
Natalia Tomashenko, Yuri Khokhlov, Yannick Estève.Â
preprint
[arxiv]
â– Â The VoicePrivacy 2020 Challenge Evaluation Plan
Natalia Tomashenko, Brij Mohan Lal Srivastava, Xin Wang, Emmanuel Vincent, Andreas Nautsch, Junichi Yamagishi, Nicholas Evans, Jean-François Bonastre, Paul-Gauthier NoĂ©, Massimiliano Todisco, Jose Patino.Â
[pdf]
2019
â– Â Investigating adaptation and transfer learning for end-to-end spoken language understanding from speech
Natalia Tomashenko, Antoine Caubrière, Yannick Estève.
Interspeech 2019
[arxiv] [ISCA archive] [poster] [video]
â– Â Curriculum-based transfer learning for an effective end-to-end spoken language understanding and domain portability
Antoine Caubrière, Natalia Tomashenko, Antoine Laurent, Emmanuel Morin, Nathalie Camelin, Yannick Estève.
Interspeech 2019
[arxiv] [ISCA archive] [poster] [video]
â– Â Recent advances in end-to-end spoken language understanding
Natalia Tomashenko, Antoine Caubrière, Yannick Estève, Antoine Laurent, Emmanuel Morin.
International Conference on Statistical Language and Speech Processing (SLSP) 2019
[arxiv] [Springer link] [slides]Â
â– Â ON-TRAC Consortium End-to-End Speech Translation Systems for the IWSLT 2019 Shared Task
Ha Nguyen, Natalia Tomashenko, Marcely Zanon Boito, Antoine Caubrière, Fethi Bougares, Mickael Rouvier, Laurent Besacier, Yannick Estève.
IWSLT 2019
[arxiv] [slides]Â
■ Curriculum d'apprentissage: reconnaissance d'entités nommées pour l'extraction de concepts sémantiques
Antoine Caubrière, Natalia Tomashenko, Yannick Estève, Antoine Laurent, Emmanuel Morin.
TALN 2019
[pdf]Â
2018
â– Â Speaker Adaptive Training and Mixup Regularization for Neural Network Acoustic Models in Automatic Speech Recognition
Natalia Tomashenko, Yuri Khokhlov, Yannick Estève.
Interspeech 2018
[ISCA archive] [poster] [code]
â– Â An Investigation of Mixup Training Strategies for Acoustic Models in ASR
Ivan Medennikov, Yuri Y Khokhlov, Aleksei Romanenko, Dmitry Popov, Natalia Tomashenko, Ivan Sorokin, Alexander Zatvornitskiy.
Interspeech 2018
[ISCA archive] [poster] [code]
â– Â Evaluation of feature-space speaker adaptation for end-to-end acoustic models
Natalia Tomashenko, Yannick Estève.
International Conference on Language Resources and Evaluation (LREC) 2018
[pdf] [poster]Â
â– Â TED-LIUM 3: twice as much data and corpus repartition for experiments on speaker adaptation
François Hernandez, Vincent Nguyen, Sahar Ghannay, Natalia Tomashenko, Yannick Estève.
International Conference on Speech and Computer (SPECOM) 2018
[arxiv] [Springer link]Â
■ Impact des techniques d’adaptation au locuteur dans l’espace des paramètres pour des modèles acoustiques purement neuronaux
Natalia Tomashenko, Yannick Estève.
IXXXIIe Journées d’Études sur la Parole (JEP) 2018
[pdf] [poster]Â
2017
â– Â Speaker adaptation of deep neural network acoustic models using Gaussian mixture model framework in automatic speech recognition systems
Natalia Tomashenko.
PhD thesis
[pdf] [slides]Â
â– Â Fast and accurate OOV decoder on high-level features
Yuri Khokhlov, Natalia Tomashenko, Ivan Medennikov, Alexei Romanenko
Interspeech 2017
[pdf] [poster]Â
â– Â The STC Keyword Search System for OpenKWS 2016 Evaluation
Yuri Y Khokhlov, Ivan Medennikov, Aleksei Romanenko, Valentin Mendelev, Maxim Korenevsky, Alexey Prudnikov, Natalia Tomashenko, Alexander Zatvornitsky
Interspeech 2017
[pdf]Â
â– Â Acoustic modeling in the STC keyword search system for OpenKWS 2016 evaluation
Ivan Medennikov, Aleksei Romanenko, Alexey Prudnikov, Valentin Mendelev, Yuri Khokhlov, Maxim Korenevsky, Natalia Tomashenko, Alexander Zatvornitskiy
International Conference on Speech and Computer (SPECOM) 2017
[pdf] [Springer link]
2016
â– Â LIUM ASR systems for the 2016 Multi-Genre Broadcast Arabic challenge
Natalia Tomashenko, Kévin Vythelingum, Anthony Rousseau, Yannick Estève
2016 IEEE Spoken Language Technology Workshop (SLT)
[pdf] [IEEE Xplore]
â– Â On the Use of Gaussian Mixture Model Framework to Improve Speaker Adaptation of Deep Neural Network Acoustic Models
Natalia Tomashenko, Yuri Khokhlov, Yannick Estève
Interspeech 2016
[ISCA archive]Â [poster]
â– Â A new perspective on combining GMM and DNN frameworks for speaker adaptation
Natalia Tomashenko, Yuri Khokhlov, Yannick Estève
International Conference on Statistical Language and Speech Processing (SLSP) 2016
[Springer link] [slides]Â
â– Â Exploring GMM-derived features for unsupervised adaptation of deep neural network acoustic models
Natalia Tomashenko, Yuri Khokhlov, Anthony Larcher, Yannick Estève
International Conference on Speech and Computer (SPECOM) 2016
[pdf] [Springer link]Â
■ Exploration de paramètres acoustiques dérivés de GMMs pour l'adaptation non supervisée de modèles acoustiques à base de réseaux de neurones profonds
Natalia Tomashenko, Yuri Khokhlov, Anthony Larcher, Yannick Estève
Journées d’Études sur la Parole (JEP'16)
[pdf]
2015
â– Â GMM-derived features for effective unsupervised adaptation of deep neural network acoustic models
Natalia Tomashenko, Yuri Khokhlov
Interspeech 2015
â– Â A bilingual Kazakh-Russian system for automatic speech recognition and synthesis
Olga Khomitsevich, Valentin Mendelev, Natalia Tomashenko, Sergey Rybin, Ivan Medennikov, Saule Kudubayeva
International Conference on Speech and Computer (SPECOM) 2015
[pdf][Springer link]
â– Â Speaker verification using spectral and durational segmental characteristics
Elena Bulgakova, Aleksei Sholohov, Natalia Tomashenko, Yuri Matveev
International Conference on Speech and Computer (SPECOM) 2015
[pdf][Springer link]
2014
â– Â Speaker adaptation of context dependent deep neural networks based on MAP-adaptation and GMM-derived feature processing
Natalia Tomashenko, Yuri Khokhlov
Interspeech 2014
â– Â Speaking rate estimation based on deep neural networks
Natalia Tomashenko, Yuri Khokhlov
International Conference on Speech and Computer (SPECOM) 2014
â– Â Automated closed captioning for Russian live broadcasting
Kirill Levin, Irina Ponomareva, Anna Bulusheva, G Chernykh, Ivan Medennikov, Nickolay Merkin, Alexey Prudnikov, Natalia Tomashenko
Interspeech 2014
â– Â State level control for acoustic model training
German Chernykh, Maxim Korenevsky, Kirill Levin, Irina Ponomareva, Natalia Tomashenko
International Conference on Speech and Computer (SPECOM) 2014
[pdf][Springer link]