A complete list of publications is available from Google Scholar.
Z. Jin, Y. Tu, C. X. Gan, M.-W. Mak, and K. A. Lee, “Adversarially adaptive temperatures for decoupled knowledge distillation with applications to speaker verification,” Neurocomputing, vol. 624, pp. 1–9, Apr. 2025, Art. no. 129481.
R. Wang, L. Chen, K. A. Lee, and Z.-H. Ling, “Asynchronous voice anonymization by learning from speaker-adversarial speech,” IEEE Signal Process. Lett., vol. 32, pp. 1905–1909, Apr. 2025.
M. Jing, V. Sethu, B. Ahmed, and K. A. Lee, “Quantifying prediction uncertainties in automatic speaker verification systems,” Comput. Speech Lang., vol. 94, pp. 1–21, Apr. 2025, Art. no. 101806.
X. Wang, H. Delgado, H. Tak, J. Jung, H. Shim, M. Todisco, I. Kukanov, X. Liu, M. Sahidullah, T. Kinnunen, N. Evans, K. A. Lee, et al., “ASVspoof 5: Design, collection and validation of resources for spoofing, deepfake, and adversarial attack detection using crowdsourced speech,” Comput. Speech Lang., vol. 95, pp. 1–27, May 2025, Art. no. 101825.
L. Zhang, B. Niu, K. A. Lee, and L. Wang, “Make full use of your data: On copy-based augmentation in speech anti-spoofing,” Neurocomputing, vol. 649, pp. 1–10, Jun. 2025, Art. no. 130799.
L. Chen, C. Guo, R. Wang, K. A. Lee, and Z.-H. Ling, “Any-to-any speaker attribute perturbation for asynchronous voice anonymization,” IEEE Trans. Inf. Forensics Security, vol. 20, pp. 7736–7747, Jul. 2025.
Y. Tu, M.-W. Mak, K. A. Lee, and W. Lin, “ConFusionformer: Locality-enhanced Conformer through multi-resolution attention fusion for speaker verification,” Neurocomputing, vol. 644, Sep. 2025, Art. no. 130429.
C. X. Gan, Y. Tu, Z. Jin, M.-W. Mak, and K. A. Lee, “Grouped knowledge distillation with adaptive logit softening for speaker recognition,” in Proc. IEEE Int. Conf. Acoust., Speech Signal Process. (ICASSP), Apr. 2025, pp. 1–5.
H. T. Luong, H. Li, L. Zhang, K. A. Lee, and E. S. Chng, “LlamaPartialSpoof: An LLM-driven fake speech dataset simulating disinformation generation,” in Proc. IEEE Int. Conf. Acoust., Speech Signal Process. (ICASSP), Apr. 2025, pp. 1–5.
H. Zeinali, K. A. Lee, J. Alam, and L. Burget, “Text-dependent speaker verification challenge 2024: Exploring shared and user-defined passphrases,” in Proc. IEEE Int. Conf. Acoust., Speech Signal Process. (ICASSP), Apr. 2025, pp. 1–5.
J. Li, K. A. Lee, and M.-W. Mak, “MoMuSE: Momentum multi-modal target speaker extraction for real-time scenarios with impaired visual cues,” in Proc. IEEE Int. Conf. Multimedia Expo (ICME), Jul. 2025, pp. 1–5.
J. Li, M.-W. Mak, J. Rohdin, K. A. Lee, and H. Hermansky, “Bayesian learning for domain-invariant speaker verification and anti-spoofing,” in Proc. Interspeech, 2025, pp. 1123–1127.
R. Zuo, K. A. Lee, Z. Huang, and M.-W. Mak, “The sub-3sec problem: From text-independent to text-dependent corpus,” in Proc. Interspeech, 2025, pp. 4003–4007.
C.-X. Gan, Z. Li, Z. Jin, Z. Huang, M.-W. Mak, and K. A. Lee, “IDIR: Identifying and distilling informative relations for speaker verification,” in Proc. Interspeech, 2025, pp. 5758–5762.
J. Meng, H. B. Sailor, Q. Wang, T. Liu, K. A. Lee, and X. Wang, “Exploring audio-visual fusion methods in foundation model-based deception detection,” in Proc. APSIPA ASC, 2025, pp. 1964–1968.
R. Wang, L. Chen, K. A. Lee, Z. Zha, and Z. Ling, “Investigation of perception inconsistency in speaker embedding for asynchronous voice anonymization,” in Proc. APSIPA ASC, 2025, pp. 2074–2079.
S. Tang, Z. Liu, L. Chen, K. A. Lee, T. Toda, and Z. Ling, “A preliminary study on sectional voice anonymization and detection,” in Proc. APSIPA ASC, 2025, pp. 2229–2234.
H.-T. Luong, I. Rimon, H. Permuter, K. A. Lee, and E. S. Chng, “Robust localization of partially fake speech: Metrics and out-of-domain evaluation,” in Proc. APSIPA ASC, 2025, pp. 2205–2210.
S. Qin, K. A. Lee, M.-W. Mak, P. Lisena, and M. Todisco, “Variational regularization for end-to-end speech deepfake detection,” in Proc. APSIPA ASC, 2025, pp. 2241–2246.
L. Chen, K. A. Lee, Z.-H. Ling, X. Wang, R. K. Das, T. Toda, and H. Li, “Speaker privacy and security in the big data era: Protection and defense against deepfake,” in Proc. APSIPA ASC, 2025, pp. 2570–2575.
T. Tayir, L. Li, B. Li, J. Liu, and K. A. Lee, “Encoder-decoder calibration for multimodal machine translation,” IEEE Trans. Artif. Intell., vol. 5, no. 8, pp. 3965–3973, Jan. 2024.
X. Liu, M. Sahidullah, K. A. Lee, and T. Kinnunen, “Generalizing speaker verification for spoof awareness in the embedding space,” IEEE/ACM Trans. Audio Speech Lang. Process., vol. 32, pp. 1261–1273, Jan. 2024.
Q. Wang and K. A. Lee, “Cosine scoring with uncertainty for neural speaker embedding,” IEEE Signal Process. Lett., vol. 31, pp. 845–849, Mar. 2024.
T. Liu, K. A. Lee, Q. Wang, and H. Li, “Golden Gemini is all you need: Finding the sweet spots for speaker verification,” IEEE/ACM Trans. Audio Speech Lang. Process., vol. 32, pp. 2324–2337, Apr. 2024.
T. H. Kinnunen, K. A. Lee, H. Tak, N. Evans, and A. Nautsch, “t-EER: Parameter-free tandem evaluation of countermeasures and biometric comparators,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 46, no. 5, pp. 2622–2637, May 2024.
Q. Wang, H. B. Sailor, K. A. Lee, K. Ma, K. H. Goh, and W. F. Boh, “Using Twitter dataset for social listening in Singapore,” IEEE Access, vol. 12, pp. 100015–100025, Jul. 2024.
S. Wang, Z. Chen, K. A. Lee, Y. Qian, and H. Li, “Overview of speaker modeling and its applications: From the lens of deep speaker representation learning,” IEEE/ACM Trans. Audio Speech Lang. Process., vol. 32, pp. 4971–4998, Nov. 2024.
L. Chen, W. Gu, K. A. Lee, W. Guo, and Z.-H. Ling, “Pseudo-speaker distribution learning in voice anonymization,” IEEE/ACM Trans. Audio Speech Lang. Process., pp. 272–285, Dec. 2024.
D. T. Truong, R. Tao, J. Q. Yip, K. A. Lee, and E. S. Chng, “Emphasized non-target speaker knowledge in knowledge distillation for automatic speaker verification,” in Proc. IEEE Int. Conf. Acoust., Speech Signal Process. (ICASSP), Apr. 2024, pp. 10336–10340.
L. Zhang, K. A. Lee, L. Zhang, L. Wang, and B. Niu, “CPAUG: Refining copy-paste augmentation for speech anti-spoofing,” in Proc. IEEE Int. Conf. Acoust., Speech Signal Process. (ICASSP), Apr. 2024, pp. 10996–11000.
L. Chen, K. A. Lee, W. Guo, and Z.-H. Ling, “Modeling pseudo-speaker uncertainty in voice anonymization,” in Proc. IEEE Int. Conf. Acoust., Speech Signal Process. (ICASSP), Apr. 2024, pp. 11601–11605.
Y. Ma, K. A. Lee, V. Hautamäki, M. Ge, and H. Li, “Gradient weighting for speaker verification in extremely low signal-to-noise ratio,” in Proc. IEEE Int. Conf. Acoust., Speech Signal Process. (ICASSP), Apr. 2024, pp. 11311–11315.
S. Chen, L. Chen, J. Zhang, K. A. Lee, Z. Ling, and L. Dai, “Adversarial speech for voice privacy protection from personalized speech generation,” in Proc. IEEE Int. Conf. Acoust., Speech Signal Process. (ICASSP), Apr. 2024, pp. 11411–11415.
D. T. Truong, R. Tao, T. Nguyen, H. T. Luong, K. A. Lee, and E. S. Chng, “Temporal-channel modeling in multi-head self-attention for synthetic speech detection,” in Proc. Interspeech, Sept. 2024, pp. 537–541.
X. Wang, T. Kinnunen, K. A. Lee, P. G. Noé, and J. Yamagishi, “Revisiting and improving scoring fusion for spoofing-aware speaker verification using compositional data analysis,” in Proc. Interspeech, Sept. 2024, pp. 1110–1114.
Z. Huang, M.-W. Mak, and K. A. Lee, “MM-NodeFormer: Node transformer multimodal fusion for emotion recognition in conversation,” in Proc. Interspeech, Sept. 2024, pp. 4069–4073.
R. Wang, L. Chen, K. A. Lee, and Z.-H. Ling, “Asynchronous voice anonymization using adversarial perturbation on speaker embedding,” in Proc. Interspeech, Sept. 2024, pp. 4443–4447.
C. Guo, L. Chen, K. A. Lee, Z.-H. Ling, and W. Guo, “Investigation into the impact of speaker adversarial perturbation on speech recognition,” in Proc. NCMMSC, Dec. 2024, pp. 191–199.
J. Li, K. Zhang, S. Wang, H. Li, M.-W. Mak, and K. A. Lee, “On the effectiveness of enrollment speech augmentation for target speaker extraction,” in Proc. IEEE Spoken Lang. Technol. Workshop (SLT), Dec. 2024, pp. 325–332.
H. T. Luong, D. T. Truong, K. A. Lee, and E. S. Chng, “Room impulse responses help attackers to evade deep fake detection,” in Proc. IEEE Spoken Lang. Technol. Workshop (SLT), Dec. 2024, pp. 623–629.
C. Guo, L. Chen, Z. Li, K. A. Lee, Z.-H. Ling, and W. Guo, “On the generation and removal of speaker adversarial perturbation for voice-privacy protection,” in Proc. IEEE Spoken Lang. Technol. Workshop (SLT), Dec. 2024, pp. 1179–1184.
T. Liu, I. Kukanov, Z. Pan, Q. Wang, H. B. Sailor, and K. A. Lee, “Towards quantifying and reducing language mismatch effects in cross-lingual speech anti-spoofing,” in Proc. IEEE Spoken Lang. Technol. Workshop (SLT), Dec. 2024, pp. 1185–1192.
X. Wang, J. Meng, K. A. Lee, B. Li, and J. Liu, “Two-stage semi-supervised speaker recognition with gated label learning,” in Proc. Int. Joint Conf. Artif. Intell. (IJCAI), Aug. 2024, pp. 6495–6503.
H. Zhang, L. Wang, K. A. Lee, M. Liu, J. Dang, and H. Meng, “Meta-generalization for domain-invariant speaker verification,” IEEE/ACM Trans. Audio Speech Lang. Process., vol. 31, pp. 1024–1036, Feb. 2023.
R. Tao, K. A. Lee, R. K. Das, V. Hautamäki, and H. Li, “Self-supervised training of speaker encoder with multi-modal diverse positive pairs,” IEEE/ACM Trans. Audio Speech Lang. Process., vol. 31, pp. 1706–1719, Apr. 2023.
X. Liu, X. Wang, M. Sahidullah, J. Patino, H. Delgado, T. Kinnunen, M. Todisco, J. Yamagishi, N. Evans, A. Nautsch, and K. A. Lee, “ASVspoof 2021: Towards spoofed and deepfake speech detection in the wild,” IEEE/ACM Trans. Audio Speech Lang. Process., vol. 31, pp. 2507–2522, Jun. 2023.
Q. Wang, K. Okabe, K. A. Lee, and T. Koshinaka, “Generalized domain adaptation framework for parametric back-end in speaker recognition,” IEEE Trans. Inf. Forensics Security, vol. 18, pp. 3936–3947, Jun. 2023.
J. Y. Lee, K. A. Lee, and W. S. Gan, “A dual latent variable personalized dialogue agent,” SN Comput. Sci., vol. 4, no. 2, Art. no. 159, Mar. 2023.
M. Liu, K. A. Lee, L. Wang, H. Zhang, C. Zeng, and J. Dang, “Cross-modal audio-visual co-learning for text-independent speaker verification,” in Proc. IEEE ICASSP, 2023.
Q. Wang, K. A. Lee, and T. Liu, “Incorporating uncertainty from speaker embedding estimation to speaker verification,” iin Proc. IEEE ICASSP, 2023.
X. Liu, M. Liu, L. Wang, K. A. Lee, H. Zhang, and J. Dang, “Leveraging positional-related local-global dependency for synthetic speech detection,” in Proc. IEEE ICASSP, 2023.
Y. Sun, H. Zhang, L. Wang, K. A. Lee, M. Liu, and J. Dang, “Noise-disentanglement metric learning for robust speaker verification,” in Proc. IEEE ICASSP, 2023.
R. Tao, K. A. Lee, Z. Shi, and H. Li, “Speaker recognition with two-step multi-modal deep cleansing,” iin Proc. IEEE ICASSP, 2023.
A. Sholokhov, N. Kuzmin, K. A. Lee, and E. S. Chng, “Probabilistic back-ends for online speaker recognition and clustering,” in Proc. IEEE ICASSP, 2023.
H. Chen, H. Zhang, L. Wang, K. A. Lee, M. Liu, and J. Dang, “Self-supervised audio-visual speaker representation with co-meta learning,” in Proc. IEEE ICASSP, 2023.
X. Liu, M. Sahidullah, K. A. Lee, and T. Kinnunen, “Speaker-aware anti-spoofing,” in Proc. Interspeech, 2023, pp. 2498–2502.
Y. Liang, M. Shi, F. Yu, Y. Li, S. Zhang, Z. Du, Q. Chen, L. Xie, Y. Qian, J. Wu, Z. Chen, K. A. Lee, Z. Yan, and H. Bu, “The second multi-channel multi-party meeting transcription challenge (M2MeT 2.0): A benchmark for speaker-attributed ASR,” in Proc. IEEE ASRU, 2023.
T. Liu, K. A. Lee, Q. Wang, and H. Li, “Disentangling voice and content with self-supervision for speaker recognition,” in Adv. Neural Inf. Process. Syst. (NeurIPS), vol. 36, Dec. 2023, pp. 50221–50236.
J. Y. Lee, K. A. Lee, and W. S. Gan, “DLVGen: A dual latent variable approach to personalized dialogue generation,” in Proc. International Conference on Agents and Artificial Intelligence (ICAART), 2022, vol. 2, pp. 193-202.
T. Liu, R. K. Das, K. A. Lee, and H. Li, “Neural acoustic-phonetic approach for speaker verification with phonetic attention mask,” IEEE Signal Processing Letters, vol. 29, pp. 782-786, 2022.
H. Zhu, K. A. Lee, and H. Li, “Discriminative speaker embedding with serialized multi-layer multi-head attention,” Speech Communication, vol. 144, pp. 89-100, Oct. 2022.
J. Y. Lee, K. A. Lee, and W. S. Gan, “Improving contextual coherence in variational personalized and empathetic dialogue agents,” in Proc. IEEE ICASSP, 2022, pp. 7052-7056.
R. Tao, K. A. Lee, R. K. Das, V. Hautamaki, and H. Li, “Self-supervised speaker recognition with loss-gated learning,” in Proc. IEEE ICASSP, 2022, pp. 6142-6146.
H. Zhang, L. Wang, K. A. Lee, M. Liu, J. Dang, and H. Chen, “Learning domain-invariant transformation for speaker verification,” in Proc. IEEE ICASSP, 2022, pp. 7177-7181.
T. Liu, R. K. Das, K. A. Lee, H. Li, “MFA: TDNN with multi-scale frequency-channel attention for text-independent speaker verification with short utterances,” in Proc. IEEE ICASSP, 2022, pp. 7517-7521.
F. Yu, S. Zhang, P. Guo, Y. Fu, Z. Du, S. Zheng, W. Huang, L. Xie, Z.-H. Tan, D. Wang, Y. Qian, K. A. Lee, Z. Yan, B. Ma, X. Xu, and H. Bu, “Summary on the ICASSP 2022 multi-channel multi-party meeting transcription grand challenge,” in Proc. IEEE ICASSP, 2022, pp. 9156-9160.
H. Shim, H. Tak, X. Liu, H. Heo, J. Jung, J. Chung, S. Chung, H. Yu, B. Lee, M. Todisco, H. Delgado, K. A. Lee, M. Sahidullah, T. Kinnunen, and N. Evans, “Baseline systems for the first spoofing-aware speaker verification challenge: score and embedding fusion,” in Proc. Odyssey Workshop, 2022, pp. 330 – 337.
Q. Wang, K. A. Lee, and T. Liu, “Scoring of Large-Margin Embeddings for Speaker Verification: Cosine or PLDA?” in Proc. Interspeech, 2022, pp. 600 – 604.
M. Liu, L. Wang, J. Dang, K. A. Lee, and S. Nakagawa, “Replay attack detection using variable-frequency resolution phase and magnitude features,” Computer Speech & Language, vol. 66, 101161, Mar. 2021.
A. Nautsch, X. Wang, N. Evans, T. H. Kinnunen, V. Vestman, M. Todisco, H. Delgado, M. Sahidullah, J. Yamagishi, and K. A. Lee, "ASVspoof 2019: Spoofing Countermeasures for the Detection of Synthesized, Converted and Replayed Speech," in IEEE Transactions on Biometrics, Behavior, and Identity Science, vol. 3, no. 2, pp. 252-265, Apr. 2021.
K. A. Lee, V. Vestman, and T. Kinnunen, "ASVtorch toolkit: Speaker verification with deep neural networks," SoftwareX, vol. 14, 2021, 100697, ISSN 2352-7110.
K. A. Lee, Q. Wang, and T. Koshinaka, “Xi-vector embedding for speaker recognition,” IEEE Signal Processing Letters, vol. 28, pp. 1385-1389, June 2021.
M. Liu, L. Wang, K. A. Lee, X. Chen, and J. Dang, “Replay-attack detection using features with adaptive spectro-temporal resolution,” in Proc. ICASSP, 2021, pp. 6374-6378.
H. Zhang, L. Wang, K. A. Lee, M. Liu, J. Dang, and H. Chen, “Meta-learning for cross-channel speaker verification,” in Proc. ICASSP, 2021, pp. 5839-5843.
L. Li, K. Hu, Y. Zheng, J. Liu, and K. A. Lee, “COOPNet: Multi-Modal Cooperative Gender Prediction in Social Media User Profiling,” in Proc. ICASSP, 2021, pp. 4310-4314.
H. Zhu, K. A. Lee, and H. Li, “Serialized multi-layer multi-head attention for neural speaker embedding,” in Proc. INTERSPEECH, 2021, pp. 106-110.
Y. Wu, L. Wang, K. A. Lee, M. Liu, and J. Dang, “Joint feature enhancement and speaker recognition with multi-objective task-oriented network,” in Proc. INTERSPEECH, 2021, pp. 1089-1093.
L. Zhang, Q. Wang, K. A. Lee, L. Xie, and H. Li, “Multi-level transfer learning from near-field to far-field speaker verification,” in Proc. INTERSPEECH, pp. 1094-1098, 2021.
J. Y. Lee, K. A. Lee, and W. S. Gan, “Generating personalized dialogue via multi-task meta-learning,” SemDial, 2021.
Q. Wang, K. A. Lee, T. Koshinaka, K. Okabe, and H. Yamamoto, "Task-aware Warping Factors in Mask-based Speech Enhancement," in Proc. European Signal Processing Conference (EUSIPCO), 2021, pp. 476-480.
M. Liu, L. Wang, K. A. Lee, H. Zhang, C. Zeng, and J. Dang, “DeepLip: A benchmark for deep learning-based audio-visual lip biometrics,” in Proc. IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 2021, pp. 122-129.
Y. Ma, K. A. Lee, V. Hautamäki and H. Li, “PL-EESR: Perceptual loss based end-to-end robust speaker representation extraction,” in Proc. IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 2021, pp. 106-113.
K. A. Lee, O. Sadjadi, H. Li, and D. Reynolds, “Two decades into Speaker Recognition Evaluation - are we there yet?” Computer Speech & Language, vol. 61, 101058, 2020.
A. Sholokhov, T. Kinnunen, V. Vestman, and K. A. Lee, "Voice biometrics security: Extrapolating false alarm rate via hierarchical Bayesian modeling of speaker verification scores,” Computer Speech & Language, vol. 60, 101024, 2020.
K. A. Lee, H. Yamamoto, K. Okabe, Q. Wang, L. Guo, T. Koshinaka, J. Zhang, and K. Shinoda, “NEC-TT System for Mixed-Bandwidth and Multi-Domain Speaker Recognition,” Computer Speech & Language, vol. 61, 101033, May 2020.
X. Wang, J. Yamagishi, M. Todisco, H. Delgado, A. Nautsch, N. Evans, M. Sahidullah, V. Vestman, T. Kinnunen, K. A. Lee, L. Juvela et al, “ASVspoof 2019: a large-scale public database of synthetized, converted and replayed speech,” Computer Speech & Language, vol. 64, 101114, 2020.
I. Kukanov, T. N. Trong, V. Hautamäki, S. M. Siniscalchi, V. M. Salerno, and K. A. Lee, “Maximal Figure-of-Merit Framework to Detect Multi-Label Phonetic Features for Spoken Language Recognition,” IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 28, pp. 682-695, 2020.
T. Kinnunen, H. Delgado, N. Evans, K. A. Lee, V. Vestman, A. Nautsch, M. Todisco, X. Wang, M. Sahidullah, J. Yamagishi, and D. A. Reynolds, “Tandem assessment of spoofing countermeasures and automatic speaker verification: Fundamentals,” IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 28, pp. 2195-2210, 2020.
Q. Wang, K. Okabe, K. A. Lee, and T. Koshinaka, “A generalized framework for domain adaptation of PLDA in speaker recognition,” in Proc. IEEE ICASSP, 2020.
H. Zeinali, K. A. Lee, J. Alam, and L. Burget, “SdSV Challenge 2020: large-scale evaluation of short-duration speaker verification,” in Proc. INTERSPEECH, 2020, pp. 731-735.
H. Zhang, L. Wang, Y. Zhang, M. Liu, K. A. Lee, and J. Wei, “Adversarial separation network for speaker recognition,” Proc. INTERSPEECH, 2020, pp. 951-955.
D. Zhou, L. Wang, K. A. Lee, Y. Wu, M. Liu, J. Dang, and J. Wei, “Dynamic margin softmax loss for speaker verification,” Proc. INTERSPEECH, 2020, pp. 3800-3804.
K. A. Lee, H. Yamamoto, K. Okabe, Q. Wang, L. Guo, T. Koshinaka, J. Zhang, and K. Shinoda, “NEC-TT speaker verification system for SRE’19 CTS Challenge,” in Proc. INTERSPEECH, 2020, pp. 2227-2231.
K. Akimoto, S. P. Liew, S. Mishima, R. Mizushima, and K. A. Lee, "POCO: A voice spoofing and liveness detection corpus based on pop noise," Proc. INTERSPEECH 2020, pp. 1081-1085.
L. Chen, K. A. Lee, L. He, F. Soong, “On early-stop clustering for speaker diarization,” in Proc. Odyssey 2020: The Speaker and Language Recognition Workshop, 2020, pp. 110-116.
Q. Wang, K. A. Lee, and T. Koshinaka, “Using multi-resolution feature maps with convolutional neural networks for anti-spoofing in ASV,” in Proc. Odyssey 2020: The Speaker and Language Recognition Workshop, 2020, pp. 138-142.
P. Garcia Perera, J. Villalba, H. Bredin, J. Du, D. Castan, A. Cristia, L. Bullock, L. Guo, K. Okabe, P.S. Nidadavolu, S. Kataria, S. Chen, L. Galmant, M. Lavechin, L. Sun, M. Gill, B. Ben-Yair, S. Abdoli, X. Wang, W. Bouaziz, H. Titeux, E. Dupoux, K.A. Lee, and N. Dehak, "Speaker detection in the wild: Lessons learned from JSALT 2019," in Proc. Odyssey 2020 The Speaker and Language Recognition Workshop, pp. 415-422.