Hagai Aronowitz: Publication List
2024
A. Turetzky, N. Shabtay, S .Shechtman, H. Aronowitz, D. Haws, R. Hoory, A. Dekel, "Continuous Speech Synthesis using per-token Latent Diffusion", arXiv preprint arXiv:2410.16048.
2023
E. Morais, M. Damasceno, H. Aronowitz, A. Satt, R. Hoory, "Modeling Turn-Taking in Human-To-Human Spoken Dialogue Datasets Using Self-Supervised Features", in Proc. ICASSP, 2023.
2022
Z. Kons, H. Aronowitz, E. Morais, M. Damasceno, H.K. Kuo and S. Thomas, "Extending RNN-T-based speech recognition systems with emotion and language classification", in Proc. Interspeech, 2022.
H. Aronowitz, I. Gat, E. Morais, W. Zhu and R. Hoory, "TOWARDS A COMMON SPEECH ANALYSIS ENGINE ", in ICASSP, 2022.
I. Gat, W. Zhu, E. Morais, R. Hoory and H. Aronowitz, , "SPEAKER NORMALIZATION FOR SELF-SUPERVISED SPEECH EMOTION RECOGNITION", in ICASSP, 2022.
E. Morais, W. Zhu, I. Gat, M. Damasceno, R. Hoory and H. Aronowitz "Speech emotion recognition using Self-Supervised features ", in ICASSP, 2022.
2020
H. Aronowitz, W. Zhu, M. Suzuki, G. Kurata and R. Hoory, "New Advances in Speaker Diarization", to appear in Interspeech, 2020.
S. Rozenberg, H. Aronowitz, R. Hoory, "Siamese x-vector reconstruction for domain adapted speaker recognition", to appear in Interspeech, 2020.
H. Aronowitz, W. Zhu, "Context modeling for online speaker change detection", in Proc. ICASSP, 2020.
2018
A. Aides, D. Dov, H. Aronowitz, "Robust Audiovisual Liveness Detection for Biometric Authentication Using Deep Joint Embedding and Dynamic Time Warping", in Proc. ICASSP, 2018.
2017
K.A. Lee et al., "The I4U Mega Fusion and Collaboration for NIST Speaker Recognition Evaluation", to appear in Interspeech, 2017.
H. Aronowitz, "A Deep Dive into Biometrics and Multimedia Analysis". Keynote address, the 14th Bar-Ilan Symposium on the Foundations of Artificial Intelligence (BISFAI), 2017.
H. Aronowitz, "Text dependent speaker recognition". Invited talk, Afeka Conference on Speech Processing, 2017.
H. Aronowitz, "Inter Dataset Variability Modeling for Speaker Recognition", in Proc. ICASSP, 2017.
H. Aronowitz, "Speaker Recognition using Common Passphrases in RedDots", in Proc. ICASSP, 2017.
2016
A. Aides, H. Aronowitz, "Text-Dependent Audiovisual Synchrony Detection for Spoofing Detection in Mobile Person Recognition", in Proc. Interspeech, 2016. [Talk].
Y. Solewicz, H. Aronowitz, T. Becker, "Reducing Noise Bias in the i-Vector Space for Speaker Recognition", in Proc. Speaker Odyssey, 2016.
H. Aronowitz, "Speaker Recognition using Matched Filters", in Proc. ICASSP, 2016.
O.Plchot, L. Burget, H. Aronowitz, P. Majetka, "Audio Enhancing with DNN Autoencoders for Speaker Recognition", in Proc. ICASSP, 2016.
2015
H. Aronowitz, "Exploiting Supervector Structure for Speaker Recognition Trained on a Small Development Set", in Interspeech, 2015.
H. Aronowitz, "Score Stabilization for Speaker Recognition Trained on a Small Development Set", in Interspeech, 2015.
K.A. Lee, A. Larcher, G.Wang, P. Kenny, N. Brummer, D. van Leeuwen, H. Aronowitz, M. Kockmann, C. Vaquero, B. Ma, H. Li, T. Stafylakis, J. Alam, A. Swart, J. Perez, "The RedDots Data Collection for Speaker Recognition", in Interspeech, 2015.
2014
H. Aronowitz, M. Li, O. Toledo-Ronen, S. Harary, A. Geva, S. Ben-David, A. Rendel, R. Hoory, N. Ratha, S. Pankanti, D. Nahamoo, "Multi-Modal Biometrics for Mobile Authentication", in Proc. IJCB, 2014.
H. Aronowitz, "Tutorial: Recent Advances in Speaker Diarization", in Interspeech, 2014.
H. Aronowitz, A. Rendel, "Domain Adaptation for Text Dependent Speaker Recognition", in Proc. Interspeech, 2014.
H. Aronowitz, "Compensating Inter-Dataset Variability in PLDA Hyper-Parameters for Robust Speaker Recognition", in Proc. Speaker Odyssey, 2014. [presentation]
H. Aronowitz,"Inter Dataset Variability Compensation for Speaker Recognition",in Proc. ICASSP, 2014.
2013
O. Barkan, J. Weill, L. Wolf and H. Aronowitz, "Fast high dimensional vector multiplication based face recognition", in ICCV, 2013.
H. Aronowitz,O. Barkan, "On Leveraging Conversational Data for Building a Text Dependent Speaker Verification System", in Proc. Interpseech, 2013.
Z. Kons, H. Aronowitz, "Voice Transformation-based Spoofing of Text-Dependent Speaker Verification Systems",in Proc. Interspeech, 2013.
O. Barkan, H. Aronowitz, "Diffusion Maps for PLDA-based Speaker Verification", in Proc. ICASSP, 2013.
O. Toledo-Ronen, H. Aronowitz, "Confidence for Speaker Diarization using PCA Spectral Ratio", in Proc. Interspeech, 2012.
H. Aronowitz, Y. Solewicz, O. Toledo-Ronen, "Online Two Speaker Diarization", in Proc. Speaker Odyssey, 2012.
H. Aronowitz, "Text Dependent Speaker Verification Using a Small Development Set", in Proc. Speaker Odyssey, 2012.
H. Aronowitz, O. Barkan, "Efficient approximated i-vector extraction", in Proc. ICASSP, 2012.
2011
H. Aronowitz, H. Hoory, J. Pelecanos, D. Nahamoo, "New Developments in Voice Biometrics for User Authentication", in Proc. Interspeech, 2011. [pps]
O. Toledo-Ronen, H. Aronowitz, "Towards Goat Detection in Text-Dependent Speaker Verification", in Proc. Interspeech, 2011.
H. Aronowitz, "Speaker Diarization using A Priori Acoustic Information", in Proc. Interspeech, 2011. [pps]
Y. Solewicz, H. Aronowitz, "Implicit Segmentation in Two-Wire Speaker Recognition", in Proc. Interspeech, 2011. [pps]
H. Aronowitz, O. Barkan,"New Developments in Joint Factor Analysis for Speaker Verification", in Proc. Interspeech, 2011. [pps]
A. Sorin, H. Aronowitz, J. Mamou, O. Toledo-Ronen, R. Hoory, M. Kuritzky, Y. Erez, B. Ramabhadran and A. Sethy, "Speech processing and retrieval in a personal memory aid system for the elderly", in Proc. ICASSP, 2011.
H. Aronowitz, "The integral of a product of three Gaussians", March, 2011.
2010
H. Aronowitz, "Unsupervised Compensation of Intra-Session Intra-Speaker Variability for Speaker Diarization", in Odyssey, 2010. [presentation].
H. Aronowitz, V. Aronowitz, "Efficient score normalization for speaker recognition", in Proc. ICASSP, 2010. [presentation].
2009
Y.A. Solewicz, H. Aronowitz, "Two-Wire Nuisance Attribute Projection", in Proc. Interspeech 2009.
2008
S. Chu, H.K. Kuo, L. Mangu, Y. Liu, S. Qin, Q. Shi, S.L. Zhang, H. Aronowitz, "Recent advances in the IBM GALE Mandarin transcription system”, in Proc. ICASSP, 2008.
H. Aronowitz, "Online Vocabulary Adaptation Using Contextual Information and Information Retrieval," in Proc. Interspeech, 2008.
H. Aronowitz and Y.A. Solewicz, "Speaker Recognition in Two Wire Test Sessions," in Proc. Interspeech, 2008. [presentation].
H. Aronowitz, "Speaker Recognition in Two-Wire Test Sessions - Extended Version" - Technical Report - on construction.
2007
H. Aronowitz, “Segmental modeling for speech segmentation”, in Proc. ICASSP 2007.
H. Aronowitz and D. Burshtein, “Efficient Speaker Recognition Using Approximated Cross Entropy (ACE)”, in IEEE Trans. on Audio, Speech & Language Processing, September 2007.
H. Aronowitz, “Speaker Recognition using Kernel-PCA and Intersession Variability Modeling”, in Proc. Interspeech, 2007.
H. Aronowitz, “Trainable Speaker Diarization”, in Proc. Interspeech, 2007.
2006
E. Noor and H. Aronowitz, "Efficient language Identification using Anchor Models and Support Vector Machines", in Proc. Odyssey, 2006.
Y. Qin, Q. Shi, Y.Y. Liu, H. Aronowitz, S. M. Chu, H-K. Kuo, and G. Zweig, “Advances in Mandarin Broadcast Speech Transcription at IBM under the DARPA GALE Program”, in Proc. ISCSLP, 2006.
2005
Aronowitz H., Burshtein D., Amir A., "Speaker indexing in audio archives using Gaussian mixture scoring simulation ", in “Machine learning for multimodal interaction: first international workshop, MLMI'04, revised selected papers”, pp. 243-250, 2005.
Aronowitz H., Burshtein D., Amir A., "A session-GMM generative model using test utterance Gaussian mixture modeling for speaker verification", in Proc. ICASSP 2005.
Aronowitz H. and Irony D., “Modeling intra-speaker variability for improved speaker recognition”, in SLSF’, 2005.
Goldberger J. and Aronowitz H., "A distance measure between GMMs based on the unscented transform and its application to speaker recognition" , in Proc. Interspeech 2005.
Aronowitz H., Irony D., Burshtein D., "Modeling Intra-Speaker Variability for Speaker Recognition", in Proc. Interspeech 2005.
Aronowitz H. and Burshtein D., "Efficient Speaker Identification and Retrieval", in Proc. Interspeech 2005.
2004
Aronowitz H., Burshtein D., Amir A., "Speaker indexing in audio archives using test utterance Gaussian mixture modeling", in Proc. ICSLP, pp. 609-612, 2004.
Aronowitz H., Burshtein D., Amir A., "Text independent speaker recognition using speaker dependent word spotting ", in Proc. ICSLP, pp. 1789-1792, 2004.
2008