AAAI 2026 Workshop 📍 Singapore 📅 26 January 2026
Topics of Interest
The topics of interest include, but are not limited to:
• Robust audio and multimodal scene analysis under real-world constraints (asynchronous microphones, reverberant environments, video-audio desynchronization)
• Data augmentation and synthetic generation for spatial audio and video-audio learning
• Multimodal large-scale models (LLMs, VLMs) for audio-language(-video) retrieval and understanding
• Perceptual representation learning for 3D sound event localization, detection, and audio-visual grounding
• Foundation models and adaptation for audio, speech, and video-audio tasks
• Speech, audio, and video-conditioned generation (avatars, dubbing, cross-modal synthesis)
• Audio-visual scene understanding and reasoning
• Audio and speech quality assessment: evaluation, metrics, and perception-driven benchmarks
• Multimodal safeguards and robustness in audio-video-language modeling
• Efficient evaluation frameworks for reasoning, generation, and multimodal integration (LLM + audio + video)
• Benchmarking dataset creation and sharing across audio, speech, and video modalities
• Applications of microphone array processing and virtual microphone techniques in multimodal systems
• Real-world applications: entertainment & media (karaoke, video avatars), manufacturing (process monitoring), sustainability (forest restoration, biodiversity monitoring), education (classroom video-audio analysis), healthcare (elderly care monitoring), security (robotic patrolling, surveillance with video-audio fusion)
Submission Requirements
Please prepare your submission using the AAAI template available at https://aaai.org/conference/aaai/aaai-26/main-technical-track-call/. Full papers are limited to 8 pages and short papers to 4 pages, excluding references in both cases. Papers may be submitted to either the archival track or the non-archival track. Archival track submissions must present original work that has not been published or submitted elsewhere. Non-archival track papers will not be included in the proceedings and may consist of previously published work or preliminary studies.
The Microsoft CMT service was used for managing the peer-reviewing process for this conference. This service was provided for free by Microsoft and they bore all expenses, including costs for Azure cloud services as well as for software development and support.
Submission Portal
Workshop Format
The workshop is a one-day event designed to balance depth with participant engagement. It includes invited talks, paper presentations, and discussions.
Invited Speakers
Wenwu Wang, Full Professor, University of Surrey, UK.
Tsubasa Takahashi, Principal Researcher, Tutoring Inc, USA.
Björn Schuller, Full Professor, Technical University of Munich, Germany.
Hung-yi Lee, Full Professor, National Taiwan University.
Important Dates
24 October 2025: Paper Submission
5 November 2025: Paper Notification
16 November 2025: Early Bird Registration
26 January 2026: Workshop Program
Workshop Committee
Nancy F. Chen, A*STAR Institute for Infocomm Research (A*STAR I²R), Singapore, nfychen@a-star.edu.sg
Nobutaka Ono, Tokyo Metropolitan University, Japan, onono@tmu.ac.jp
Xiaoxue Gao, A*STAR Institute for Infocomm Research (A*STAR I²R), Singapore, Gao_Xiaoxue@a-star.edu.sg
Keisuke Imoto, Kyoto University, Japan, keisuke.imoto@ieee.org
Tatsuya Komatsu, LY Corporation, Japan, komatsu.tatsuya@lycorp.co.jp