Speaker and Language Recognition
Niko Brummer
Fully Bayesian stuff
- Here is a very simple, but practical, fully Bayesian, generative, multiclass pattern classifier: derivation, MATLAB mplementation.
- Some (in-progress) notes analysing various aspects of the problem of forensic likelihood-ratio calibration, with the aim of working towards more Bayesian solutions:
- Integrating out model parameters in generative and discriminative classifiers.
- What is the ‘relevant population’ in Bayesian forensic inference?
- Tutorial for Bayesian forensic likelihood ratio.
- Fully Bayesian Score Calibration assuming Gaussian Distributions.
- See also 'Bayesian PLDA below.
BOSARIS Toolkit
This is the successor to the FoCal Toolkit. The BOSARIS Toolkit provides MATLAB code for calibrating, fusing and evaluating scores from (automatic) binary classifiers. It was developed
to provide solutions for automatic speaker recognition, but we envision that much of the code will have wider applicability for other biometric and/or forensics problems, where the calibration of likelihood-ratios is of interest.
- The BOSARIS Toolkit User Guide: Theory, Algorithms and Code for Binary Classifier Score Processing.
- The code and user manual are here.
Selected Papers
- Niko Brummer, "Application-Independent Evaluation of Speaker Detection", Odyssey 2004.
- Niko Brummer and Johan du Preez, "Application Independent Evaluation of Speaker Detection", Computer Speech and Language, 2006.
- Niko Brummer and David van Leeuwen, "On calibration of language recognition scores", Odyssey 2006.
- David van Leeuwen and Niko Brummer, "Channel-dependent GMM and Multi-class Logistic Regression", Odyssey 2006.
- Niko Brummer, Lukas Burget, et al. ''Fusion of Heterogenous Speaker Recognition Systems in the STBU Submission for the NIST Speaker Recognition Evaluation 2006'', IEEE TASLP, vol.15, no.7, Sept. 2007.
- Niko Brummer et al., "Discriminative Acoustic Language Recognition via Channel-Compensated GMM Statistics'', Interspeech 2009. [Paper: pdf][Presentation:pdf]
- Niko Brummer and Edward de Villiers, "The Speaker Partitioning Problem", Odyssey 2010. PDF.
- Jesus Villalba and Niko Brummer, "Towards Fully Bayesian Speaker Recognition: Integrating Out the Between-Speaker Covariance", accepted for Interspeech 2011. PDF.
Ph.D.
- Niko Brummer, Measuring, refining and calibrating speaker and language information extracted from speech, Ph.D. dissertation, University of Stellenbosch, October 2010.
- Ph.D. Oral defense presentation: defence.pdf.
- (Also online: here or here.)
Book chapter
- David van Leeuwen and Niko Brümmer, An Introduction to Application-Independent Evaluation of Speaker Recognition Systems, in Speaker Classification I: Fundamentals, Features, and Methods, Christian Müller (Ed.),
Springer 2007.
Talk
- Here are some slides of a talk at ATVS UAM about Calibration of Binary (Speaker Recognition) and Multiclass (Language Recognition) Statictical Pattern Recognizers.
Notes
- Some notes written during the BOSARIS workshop:
- Calculus of likelihood ratios: PDF.
- Bayesian PLDA: My original notes, and Jesus's impelementation and our Interspeech 2011 paper.
- Some things which are good to know when computing first and second order partial derivatives for large-scale numerical optimization: PDF.
- THE EM ALGORITHM AND MINIMUM DIVERGENCE:
- General theory: PDF.
- Applied to JFA-style GMM modeling: EM4JFA.PDF.
- The PAV Algorithm optimizes binary proper scoring rules.
- Incomplete technical report, describing a precursor to the two-covariance and PLDA speaker recognition models: Farewell SVM: Bayes Factor Speaker Detection in Supervector Space, 2006.
- Calculus of likelihood ratios: PDF.
- Bayesian PLDA: My original notes, and Jesus's impelementation and our Interspeech 2011 paper.
- General theory: PDF.
- Applied to JFA-style GMM modeling: EM4JFA.PDF.
Software
- FoCal Toolkit: MATLAB code for Evaluation, Fusion and Calibration of Statistical Pattern Recognizers. Includes tools for logistic regression, Cllr and APE-curves.
- Some new tools for ROCCH-DET curves are here: http://focaltoolkit.googlepages.com/rocch.
- Also see: http://sites.google.com/site/bosaristoolkit.