Research

Publications

2025:

NAVER LABS Europe Submission to the Instruction-following Track. [PDF]

B Lee, MZ Boito,L Besascier, I Calapodescu. IWSLT 2025. (BEST SHORT SUBMISSION)

From TOWER to SPIRE: Adding the Speech Modality to a Text-Only LLM. [PDF]

K Ambilduke, B Peters, S Sannigrahi, A Keshwani, TK Lam, B Martins, MZ Boito, AFT Martins. Findings of EMNLP 2025.

2024:

mHuBERT-147: A Compact Multilingual HuBERT Model. [PDF]

MZ Boito, V Iyer, N Lagos, L Besacier, I Calapodescu. INTERSPEECH 2024.

Multilingual DistilWhisper: Efficient Distillation of Multi-task Speech Models via Language-Specific Experts. [PDF]

TP Ferraz, MZ Boito, C Brun, V Nikoulina. ICASSP 2024.

LeBenchmark 2.0: a Standardized, Replicable and Enhanced Framework for Self-supervised Representations of French Speech. [PDF]

T Parcollet, H Nguyen, S Evain, MZ Boito, APupier,S Mdhaffar,H Le, S Alisamir, N Tomashenko, M Dinarelli, S Zhang, A Allauzen, M Coavoux, Y Esteve, M Rouvier, J Goulian, B Lecouteux, F Portet, S Rossato, F Ringeval, D Schwab, L Besacier. Elsevier Computer Speech and Language Journal.

2023:

NAVER LABS Europe’s Multilingual Speech Translation Systems for the IWSLT 2023 Low-Resource Track. [PDF]

E Gow-Smith, A Berard,MZ Boito, I Calapodescu. IWSLT 2023. (WINNING SUBMISSION TAQ-FRA; WINNER SUBMISSION QUE-ES)

2022:

A Study of Gender Impact in Self-supervised Models for Speech-to-Text Systems. [PDF]

MZ Boito, L Besacier, N Tomashenko, Y Estève. INTERSPEECH 2022.

Unsupervised Word Segmentation from Discrete Speech Units in Low-Resource Settings. [PDF]

MZ Boito, B Yusuf, L Ondel, A Villavicencio, L Besacier. SIGUL 2022.

Speech Resources in the Tamasheq Language. [PDF]

MZ Boito, F Bougares, F Barbier, S Gahbiche, L Barrault, M Rouvier, Y Estève. LREC 2022.

FINDINGS OF THE IWSLT 2022 EVALUATION CAMPAIGN. [PDF]

A Anastasopoulos, L Barrault, L Bentivogli, MZ Boito, O Bojar, R Cattoni, A Currey, G Dinu, K Duh, M Elbayad, Y Estève, M Federico, C Federmann, S Gahbiche, H Gong, R Grundkiewicz, B Haddow, B Hsu, D Javorský, V Kloudová, SM Lakew, X Ma, P Mathur, P McNamee, K Murray, M N ̆adejde, S Nakamura, M Negri, J Niehues, X Niu, J Ortega, J Pino, E Salesky, J Shi, S Stuker, K Sudoh, M Turchi, Y Virkar, A Waibel, C Wang, S Watanabe. IWSLT 2022.

ON-TRAC Consortium Systems for the IWSLT 2022 Dialect and Low-resource Speech Translation Tasks. [PDF]

MZ Boito, J Ortega, H Riguidel, A Laurent, L Barrault, F Bougares, F Chaabani, H Nguyen, F Barbier, S Gahbiche, Y Estève. IWSLT 2022. (WINNER SUBMISSION TAQ-FRA)

Promises and Limitations of Self-supervised Learning for Automatic Speech Processing. [PDF]

L Maison, MZ Boito, Y Estève. CAID 2022: Conference on Artificial Intelligence for Defense .

LeBenchmark, un référentiel d’évaluation pour le français oral. [PDF] (French only)

H Le, S Alisamir, M Dinarelli, F Ringeval, S Evain, H Nguyen, MZ Boito, S Mdhaffar,, Z Tong, N Tomashenko, T Parcollet, A Allauzen, Y Estève, B Lecouteux, F Portet, S Rossato,, D Schwab, L Besacier. JEP 2022 .

Modèles neuronaux pré-appris par auto-supervision sur des enregistrements de parole en français. [PDF] (French only)

S Evain, H Nguyen, H Le, MZ Boito, S Mdhaffar, S Alisamir, Z Tong, N Tomashenko, M Dinarelli, T Parcollet, A Allauzen, Y Estève, B Lecouteux, F Portet, S Rossato, F Ringeval, D Schwab, L Besacier. JEP 2022.

2021:

Task Agnostic and Task Specific Self-Supervised Learning from Speech with LeBenchmark. [PDF]

LeBenchmark: A Reproducible Framework for Assessing Self-Supervised Representation Learning from Speech. [PDF]

Investigating Alignment Interpretability for low-resource NMT. [PDF]

MZ Boito, A Villavicencio, L Besacier. Machine Translation Journal: Special Issue on Machine Translation for Low-resource Languages. SPRINGER NETHERLANDS.

2020:

Investigating Language Impact in Bilingual Approaches for Computational Language Documentation. [PDF]

MZ Boito, A Villavicencio, L Besacier. SLTU-CCURL WORKSHOP: LREC 2020.

MaSS: A large and Clean Multilingual Corpus of Sentence-aligned Spoken Utterances Extracted from the Bible. [PDF]

MZ Boito, W Havard, M Garnerin, E Le Ferrand, L Besacier. LREC 2020.

2019:

ON-TRAC Consortium End-to-End Speech Translation Systems for the IWSLT 2019 Shared Task. [PDF]

H Nguyen, N Tomashenko, MZ Boito, A Caubriere, F Bougares, M Rouvier, L Besacier, Y Esteve. IWSLT 2019.

How Does Language Influence Documentation Workflow? Unsupervised Word Discovery Using Translations in Multiple Languages. [PDF]

MZ Boito, A Villavicencio, L Besacier. LIFT WORKSHOP 2019.

Empirical Evaluation of Sequence-to-Sequence Models for Word Discovery in Low-resource Settings. [PDF]

MZ Boito, A Villavicencio, L Besacier. INTERSPEECH 2019.

2018:

Unsupervised Word Segmentation From Speech With Attention. [PDF]

P Godard, MZ Boito, L Ondel, A Berard, F Yvon, A Villavicencio, L Besacier. INTERSPEECH 2018.

A small Griko-Italian speech translation corpus. [PDF]

MZ Boito, A Anastasopoulos, M Lekakou, A Villavicencio, L Besacier. SLTU WORKSHOP: INTERSPEECH 2018.

A very low resource language speech corpus for computational language documentation experiments. [PDF]

P Godard, G Adda, M Adda-Decker, J Benjumea, L Besacier, J Cooper-Leavitt, GN Kouarata, L Lamel, H Maynard, M Muller, A Rialland, S Stuker, F Yvon, MZ Boito. LREC 2018.

2017:

Unwritten Languages Demand Attention Too! Word Discovery with Encoder-Decoder Models. [PDF]

MZ Boito, A Berard, A Villavicencio, L Besacier. ASRU WORKSHOP: IEEE 2017.

Unsupervised Word Discovery Using Encoder-Decoder Models. [PDF]

MZ Boito, L Besacier, A Villavicencio. WiNLP WORKSHOP: ACL 2017.

2014:

Size does not matter. Frequency does. A study of features for measuring lexical complexity. [PDF]

R Wilkens, A Dalla Vecchia, MZ Boito, M Padro, A Villavicencio. IBERAMIA 2014.

Uma análise do perfil de entropia das estruturas sintáticas do português. [PDF]

MZ Boito, L Hagemann, R Wilkens, A Villavicencio. ToRPorEsp WORKSHOP: PROPOR 2014.

PhD Thesis (2021):

Models and Resources for Attention-based Unsupervised Word Segmentation. [PDF]

Master Thesis (2017):

Unsupervised Word Discovery Using Attentional Encoder-Decoder Models. [PDF]

Extended abstract in Portuguese (first 20 pages), full thesis in English.

Work in Conferences

Reviewing:
- PC: LREC 2020, ACL 2020, SLTU-CCURL 2020, EMNLP 2020, EACL 2021, EMNLP 2021, ACL 2022, LREC 2022, SIGUL 2022, NAACL 2022, EACL 2022, GITT 2023, ILLC-NLP 2024, INTERSPEECH 2024, ICASSP 2024, EMT 2025 Thesis award, INTERSPEECH 2025, ARR May 2025
- External Reviewer: SBAC-PAD 2018, HRI 2026
Communications/Social Media/Website:
- - Website Chair: CoNLL 2019
  - Social Media and Communications Chair: PROPOR 2018
  - Internal Communications Chair: ACL2022
  - Publicity and Social Media Chair: EACL2026
Organization:
- - Local Organization Comittee - LTT 2018, TALN2022, RÉCITAL 2022
  - Task organizer: IWSLT 2022 (low-resource track).
  - Co-chair RÉCITAL 2022, SASB 2023, SASB 2024

Supervisions

6 months master projects:

Thomas Palmeira Ferraz - Multilingual DistilWhisper [PDF]

6 months PhD projects:

Edward Gow-Smith - Multimodal architecture for speech translation (IWSLT 23) [PDF]
Vivek Iyer - Benchmarking SSL models: mHuBERT-147 project [PDF]
Biswesh Mohapatra
Hemant Yadav

Google Sites

Report abuse