AAAI 2026 Workshop 📍 Singapore 📅 26 January 2026
Topics of Interest
The topics of interest include, but are not limited to:
• Robust audio and multimodal scene analysis under real-world constraints (asynchronous microphones, reverberant environments, video-audio desynchronization)
• Data augmentation and synthetic generation for spatial audio and video-audio learning
• Multimodal large-scale models (LLMs, VLMs) for audio-language(-video) retrieval and understanding
• Perceptual representation learning for 3D sound event localization, detection, and audio-visual grounding
• Foundation models and adaptation for audio, speech, and video-audio tasks
• Speech, audio, and video-conditioned generation (avatars, dubbing, cross-modal synthesis)
• Audio-visual scene understanding and reasoning
• Audio and speech quality assessment: evaluation, metrics, and perception-driven benchmarks
• Multimodal safeguards and robustness in audio-video-language modeling
• Efficient evaluation frameworks for reasoning, generation, and multimodal integration (LLM + audio + video)
• Benchmarking dataset creation and sharing across audio, speech, and video modalities
• Applications of microphone array processing and virtual microphone techniques in multimodal systems
• Real-world applications: entertainment & media (karaoke, video avatars), manufacturing (process monitoring), sustainability (forest restoration, biodiversity monitoring), education (classroom video-audio analysis), healthcare (elderly care monitoring), security (robotic patrolling, surveillance with video-audio fusion)
Submission Requirements
Please follow the AAAI template at https://aaai.org/conference/aaai/aaai-26/main-technical-track-call/ for full papers (up to 8 pages, excluding references) and short papers (up to 4 pages, excluding references). The review process will be double-blind. The Microsoft CMT service was used for managing the peer-reviewing process for this conference. This service was provided for free by Microsoft and they bore all expenses, including costs for Azure cloud services as well as for software development and support.
Submission Portal
Attendance
We expect to attract around 50 attendees and 20 submissions.
Workshop Format
The workshop is a one-day event designed to balance depth with participant engagement. It includes invited talks, paper presentations, and discussions.
Important Dates
24 October 2025: Paper Submission
5 November 2025: Paper Notification
16 November 2025: Early Bird Registration
26 January 2026: Workshop Program
Workshop Committee
Nancy F. Chen, A*STAR Institute for Infocomm Research (A*STAR I²R), Singapore, nfychen@i2r.a-star.edu.sg
Nobutaka Ono, Tokyo Metropolitan University, Japan, onono@tmu.ac.jp
Xiaoxue Gao, A*STAR Institute for Infocomm Research (A*STAR I²R), Singapore, Gao_Xiaoxue@i2r.a-star.edu.sg
Keisuke Imoto, Kyoto University, Japan, keisuke.imoto@ieee.org
Tatsuya Komatsu, LY Corporation, Japan, komatsu.tatsuya@lycorp.co.jp