Software

Framework for assessing self-supervised learning (SSL) representations for speech processing

https://github.com/LeBenchmark

LeBenchmark (http://lebenchmark.com) is a reproducible benchmark for evaluating speech SSL models for different speech  tasks in French:

Models: https://huggingface.co/LeBenchmark

Papers:


Privacy preserving speech processing software

Implementation of two different baseline voice anonymization systems:

Metrics to assess anonymization integrated in the setup:

Papers:

2020:

2022:


Automatic speech recognition software

Mixup

Implementation of the mixup (between class learning) technique for ASR training and its extension for sequence-trained neural networks on lattice-free MMI (in Kaldi).

Papers:

Data

Paper: TED-LIUM 3: twice as much data and corpus repartition for experiments on speaker adaptation

Contents:

Two corpus distributions:

Related software links: