2024:
V.Mingote, A.Ortega, A.Miguel, E.Lleida, “Audio-Visual Speaker Diarization: Current Databases, Approaches and Challenges” Under peer review.
2023:
V.Mingote, P.Gimeno, L.Vicente, S.Khurana, A.Laurent, J.Duret, “Direct Text to Speech Translation System using Acoustic Units” Published in IEEE Signal Processing Letters, vol. 30, pp. 1262-1266, 2023.
V.Mingote, A.Miguel, A.Ortega, E.Lleida, “Class Token and Knowledge Distillation for Multi-head Self-Attention Speaker Verification Systems” Published in Digital Signal Processing, vol. 133, pp. 103859, 2023.
S.Khurana, N.Dawalatabad, A.Laurent, L.Vicente, P.Gimeno, V.Mingote, J.R.Glass, “Improved Cross-Lingual Transfer Learning For Automatic Speech Translation” Published in arXiv, 2023.
2022:
V.Mingote, A.Miguel, D.Ribas, A.Ortega, E.Lleida, “aDCF Loss Function for Deep Metric Learning in End-to-End Text-Dependent Speaker Verification Systems” Published in IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 30, pp. 772-784, 2022.
V.Mingote, I.Viñals, P.Gimeno, A.Miguel, A.Ortega, E.Lleida, “Multimodal Diarization Systems by Training Enrollment Models as Identity Representations” Published in Applied Sciences, Volume 12, Issue 2, pp. 1141, 2022.
2021:
P.Gimeno, V.Mingote, A.Miguel, A.Ortega, E.Lleida, “Generalising AUC Optimisation to Multiclass Classification for Audio Segmentation with Limited Training Data” Published in IEEE Signal Processing Letters, vol. 28, pp. 1135-1139, 2021.
2020:
V.Mingote, A.Miguel, A.Ortega, E.Lleida, “Optimization of the Area Under the ROC Curve using Neural Network Supervectors for Text-Dependent Speaker Verification” Published in Computer Speech & Language, vol. 63, pp. 101078, 2020.
2019:
V.Mingote, A.Miguel, A.Ortega, E.Lleida, “Supervector Extraction for Encoding Speaker and Phrase Information with Neural Networks for Text-Dependent Speaker Verification” Published in Applied Sciences, Volume 9, Issue 16, pp. 3295, 2019.