9:00 - 9:10
9:10 - 10:30
[HSCMA] DISCRIMINATING REAL AND SYNTHETIC SUPER-RESOLVED AUDIO SAMPLES USING EMBEDDING-BASED CLASSIFIERS
Mikhail Silaev, Konstantinos Drossos, Tuomas Virtanen
[HSCMA] Eigenbeam-Feature-Based Multi-Order Encoder for Geometry-Agnostic Speech Enhancement
Dongzhe Zhang, Alessandro Ilic Mezza, Federico Miotello, Jianfeng Chen, Mou Wang, Fabio Antonacci, Alberto Bernardini
[HSCMA] Target-speaker voice activity detection with chunk-level speaker queries
Naohiro Tawara, Shota Horiguchi
[HSCMA] An Analysis on the Influence of Array Element Directivity on the Performance of Differential Beamformers
Federico Miotello, Davide Albertini, Alberto Bernardini
Coffee Break 10:30 - 11:00
11:00 - 12:00
Creating super-hearing capabilities with real-time AI
Shyam Gollakota
12:00 - 12:30
[MCoRec] CHiME-9 Task 1 - MCoRec: Multi-Modal Context-aware Recognition
MCoRec Task Organizers
[ECHI] CHiME-9 Task 2 - ECHI: Enhancing Conversations to address Hearing Impairment
ECHI Task Organizers
Lunch Break 12:30 - 2:30
2:30 - 3:10
[MCoRec] The USTC-NERCSLIP Systems for the CHiME-9 MCoRec Challenge
Ya Jiang, Ruoyu Wang, Jingxuan Zhang, Jun Du, Yi Han, Zihao Quan, Hang Chen, Yeran Yang, Kongzhi Zheng, Zhuo Chen, Yanhui Tu, Shutong Niu, Changfeng Xi, Mengzhi Wang, Zhongbin Wu, Jieru Chen, Henghui Zhi, Weiyi Shi, Shuhang Wu, Genshun Wan, Jia Pan, Jianqing Gao
[MCoRec] BUT system description for CHiME-9 MCoRec Challenge
Dominik Klement, Alexander Polok, Nguyen Hai Phong, Prachi Singh, Lukáš Burget
[ECHI] A Low-Latency Multi-Stage MIMO System with Cross-beam Interaction for CHiME-9 Task 2 (ECHI)
Yanhui Tu, Rui He, Lele Xu, Xiao Wang, Changyin Sun, Yi Fang
[ECHI] Perceptually Motivated Low-Latency Target Speaker Extraction for CHiME-9 ECHI
Prachi Sharma, Ferdinand Campe, Dorothea Kolossa
3:10 - 4:10
[HSCMA] On the Role of Spatial Features in Foundation-Model-Based Speaker Diarization
Marc Deegen, Tobias Gburrek, Tobias Cord-Landwehr, Thilo von Neuman, Jiangyu Han, Lukáš Burget, Reinhold Haeb-Umbach
[HSCMA] Optimization of High Directivity Beamforming and WPE Method for Improved Speech Dereverberation
Kang Chen, Hanchen Pei, Gongping Huang
[MCoRec] CHiME-9 Task 1 - MCoRec Overview
MCoRec Task Organizers
[MCoRec] The USTC-NERCSLIP Systems for the CHiME-9 MCoRec Challenge
Ya Jiang, Ruoyu Wang, Jingxuan Zhang, Jun Du, Yi Han, Zihao Quan, Hang Chen, Yeran Yang, Kongzhi Zheng, Zhuo Chen, Yanhui Tu, Shutong Niu, Changfeng Xi, Mengzhi Wang, Zhongbin Wu, Jieru Chen, Henghui Zhi, Weiyi Shi, Shuhang Wu, Genshun Wan, Jia Pan, Jianqing Gao
[MCoRec] The SUSTech AILab System Description for CHiME-9 MCoRec Challenge
Tongtao Ling, Pengjie Shen, Zhong-Qiu Wang
[MCoRec] Conversation Clustering by Mutual Gaze Estimation And AV-ASR by Dual Model Output Fusion
Zhengyang Li, Aziz Hakiri, Zehang Wu, Thomas Graave, Ernst Seidel, Yihui Fu, Björn Möller, Patrick Blumenberg, Tim Fingscheidt
[ECHI] CHiME-9 Task 2 - ECHI Overview
ECHI Task Organizers
[ECHI] A Three-Stage System for CHiME-9 ECHI: Self-Interference Suppression, Target Speaker Extraction, and Post-Processing
Fei Zhao, Changjiang Zhao, Zhenlong Guo, Wenzheng Zhang, Yongjie Yan, Xueliang Zhang
[ECHI] Reinforcement Learning for Multi-Channel Speech Enhancement
Afrooz Haghbin, Rodney Vaughan
[ECHI] A Low-Latency Multi-Stage MIMO System with Cross-beam Interaction for CHiME-9 Task 2 (ECHI)
Yanhui Tu, Rui He, Lele Xu, Xiao Wang, Changyin Sun, Yi Fang
[Non-archival] Differentiable Optimization of Linear Differential Microphone Arrays: A Joint Geometry and Filter Design Framework
Siminfar Samakoush Galougah, Ramani Duraiswami
[Non-archival] Prompting Whisper for Joint Speech Transcription and Diarization
Mariia Zamyrova, Henk van den Heuvel
Coffee Break 4:10 - 4:40
4:40 - 5:40
[HSCMA] Blind Direction-Dependent Acoustic Parameter Estimation using Smart Glasses
Philipp Götz, Sebastià V. Amengual, Paul Calamia, Ishwarya Ananthabhotla, Andrew Francl, Carl Schissler, Emanuël A. P. Habets
[HSCMA] Geneses: Unified Generative Speech Enhancement and Separation
Kohei Asai, Wataru Nakata, Yuki Saito, Hiroshi Saruwatari
[MCoRec] BUT system description for CHiME-9 MCoRec Challenge
Dominik Klement, Alexander Polok, Nguyen Hai Phong, Prachi Singh, Lukáš Burget
[MCoRec] Science Tokyo CHiME-9 MCoRec System Description
Roland Hartanto, Daichi Nitsu, Nhu Minh Phuong Dinh, Koichi Shinoda
[MCoRec] The NJU-AALAB Systems for the CHiME-9 MCoRec Challenge
Zeyan Song, Yushi Wang, Jing Lu
[MCoRec] The AUVIS system for the CHiME-9 Multi-Modal Context-aware Recognition (MCoRec) Challenge
Sean Ackermann, Michael Becker, Erol Celik, Magdalena Eggers, Johannes Anton Kaiser, Hendrik Leddin, Adrian Nowak, Alexander Reimann, Dagmar Schönenberg, Johanna Schulze, Alison Vanzetta, Alexandra Wasilkow, Stefan Goetze
[ECHI] A Unified Latency-Flexible Framework with a Causal Mamba Core for Multichannel Speech Enhancement
Rong Chao, Zeen Jhou, You-Jin Li, Sung-Feng Huang, Moreno La Quatra, Sabato Marco Siniscalchi, Wen-Huang Cheng, Szu-Wei Fu, Yu Tsao
[ECHI] Training Low-Latency Target Speech Extraction for Wearable Devices Using Phase- and Amplitude-Aligned Data in the CHiME-9 ECHI Task
Dengxiang Hu, Tomohiro Nakatani, Naoyuki Kamo, Marc Delcroix, Tsubasa Ochiai, Shoji Makino
[ECHI] A Multichannel Band-split Recurrent Architecture for Real-time Speaker-conditioned Speech Enhancement
Zeen Jhou, Rong Chao, You-Jin Li, Yu Tsao
[ECHI] Perceptually Motivated Low-Latency Target Speaker Extraction for CHiME-9 ECHI
Prachi Sharma, Ferdinand Campe, Dorothea Kolossa
[Non-archival] Generic Speech Enhancement with Self-Supervised Representation Space Loss
Hiroshi Sato, Tsubasa Ochiai, Marc Delcroix, Takafumi Moriya, Takanori Ashihara, Ryo Masumura
[Non-archival] End-to-End Multi-Task Learning for Adjustable Joint Noise Reduction and Hearing Loss Compensation
Philippe Gonzalez, Vera Margrethe Frederiksen, Torsten Dau, Tobias May
5:40 - 6:00