Publications

Journal Papers

M. Yasuda, N. Harada, Y. Ohishi, S. Saito, A. Nakayama, and N. Ono, "Guided Masked Self-Distillation Modeling for Distributed Multimedia Sensor Event Analysis," ACM Transactions on Multimedia Computing, Communications, and Applications, vol. 22, no. 2, pp. 1-18, 2026. arXiv: https://arxiv.org/abs/2404.08264
M. Yasuda, N. Harada, S. Saito, and N. Ono, "Spatial Annotation-free Sound Event Localization and Detection via Spatial Instance Classification," IEEE Access, vol. 13, pp. 171613-171625, 2025. https://ieeexplore.ieee.org/document/11162519
K. Nagatomo, M. Yasuda, K. Yatabe, S. Saito, and Y. Oikawa, "On-line sound event localization and detection for real-time recognition of surrounding environment," Applied Acoustics, vol. 199, 108961, 2022. https://www.sciencedirect.com/science/article/pii/S0003682X22003358

Peer-Reviewed International Conference Proceedings

M. Yasuda, S. Saito, N. Sato, and N. Harada, "Spatial Annotation-free Training for Sound Event Localization and Detection," in Proc. of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2025), 2025.
N. Sato, M. Yasuda, S. Saito, and N. Harada, "Sound Source Distance Estimation Utilizing Physics-informed Prior for Sound Event Localization and Detection," in Proc. of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2025), 2025.
B. T. Nguyen, M. Yasuda, D. Takeuchi, D. Niizumi, Y. Ohishi, and N. Harada, "Baseline Systems and Evaluation Metrics for Spatial Semantic Segmentation of Sound Scenes," in Proc. of European Signal Processing Conference (EUSIPCO 2025), 2025. https://arxiv.org/abs/2503.22088
D. Niizumi, D. Takeuchi, M. Yasuda, B. T. Nguyen, Y. Ohishi, and N. Harada, "Towards Pre-training an Effective Respiratory Audio Foundation Model," in Proc. of Interspeech 2025, 2025. https://arxiv.org/abs/2505.15307
D. Takeuchi, B. T. Nguyen, M. Yasuda, Y. Ohishi, D. Niizumi, and N. Harada, "CLAP-ART: Automated Audio Captioning with Semantic-rich Audio Representation Tokenizer," in Proc. of Interspeech 2025, 2025. https://arxiv.org/abs/2506.00800
D. Niizumi, D. Takeuchi, M. Yasuda, B. T. Nguyen, Y. Ohishi, and N. Harada, "Assessing the Utility of Audio Foundation Models for Heart and Respiratory Sound Analysis," in Proc. of EMBC 2025, 2025. https://arxiv.org/abs/2504.18004
M. Yasuda, B. T. Nguyen, N. Harada, R. Serizel, M. Mishra, M. Delcroix, S. Araki, D. Takeuchi, D. Niizumi, Y. Ohishi, T. Nakatani, T. Kawamura, and N. Ono, "Description and Discussion on DCASE 2025 Challenge Task 4: Spatial Semantic Segmentation of Sound Scenes," in Proc. of the 10th Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE 2025), pp. 170-174, 2025. https://arxiv.org/abs/2506.10676
M. Yasuda, Y. Ohishi, S. Saito, A. Nakayama, and N. Harada, "6DoF SELD: Sound Event Localization and Detection Using Microphones and Motion Tracking Sensors on self-motioning human," in Proc. of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2024), 2024. https://arxiv.org/abs/2403.01670
D. Niizumi, D. Takeuchi, Y. Ohishi, N. Harada, M. Yasuda, S. Tsubaki, and K. Imoto, "M2D-CLAP: Masked Modeling Duo Meets CLAP for Learning General-purpose Audio-Language Representation," in Proc. of Interspeech 2024, 2024. https://arxiv.org/abs/2406.02032
N. Harada, D. Niizumi, Y. Ohishi, D. Takeuchi, and M. Yasuda, "First-Shot Anomaly Sound Detection for Machine Condition Monitoring: A Domain Generalization Baseline," in Proc. of European Signal Processing Conference (EUSIPCO 2023), 2023. https://arxiv.org/abs/2303.00455
M. Yasuda, Y. Ohishi, and S. Saito, "Echo-aware Adaptation of Sound Event Localization and Detection in Unknown Environments," in Proc. of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2022), 2022. https://arxiv.org/abs/2202.09121
M. Yasuda, Y. Ohishi, S. Saito, and N. Harada, "Multi-view and Multi-modal Event Detection Utilizing Transformer-based Multi-sensor Fusion," in Proc. of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2022), 2022. https://arxiv.org/abs/2202.09124
K. Nagatomo, M. Yasuda, K. Yatabe, S. Saito, and Y. Oikawa, "Wearable SELD dataset: Dataset for sound event localization and detection using wearable devices around head," in Proc. of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2022), 2022. https://arxiv.org/abs/2202.08458
T. Tanaka, K. Yatabe, M. Yasuda, and Y. Oikawa, "APPLADE: Adjustable Plug-and-play Audio Declipper Combining DNN with Sparse Optimization," in Proc. of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2022), 2022. https://arxiv.org/abs/2202.08028
N. Harada, D. Niizumi, D. Takeuchi, Y. Ohishi, M. Yasuda, and S. Saito, "ToyADMOS2: Another Dataset of Miniature-Machine Operating Sounds for Anomalous Sound Detection under Domain Shift Conditions," in Proc. of the 6th Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE 2021), 2021.
M. Yasuda, Y. Ohishi, Y. Koizumi, and N. Harada, "Crossmodal Sound Retrieval based on Specific Target Co-occurrence Denoted with Weak Labels," in Proc. of Interspeech 2020, 2020.
Y. Koizumi, R. Masumura, K. Nishida, M. Yasuda, and S. Saito, "A Transformer-based Audio Captioning Model with Keyword Estimation," in Proc. of Interspeech 2020, 2020. https://arxiv.org/abs/2007.00222
Y. Koizumi, Y. Kawaguchi, K. Imoto, T. Nakamura, Y. Nikaido, R. Tanabe, H. Purohit, K. Suefusa, T. Endo, M. Yasuda, and N. Harada, "Description and Discussion on DCASE2020 Challenge Task2: Unsupervised Anomalous Sound Detection for Machine Condition Monitoring," in Proc. of the 5th Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE 2020), 2020. https://arxiv.org/abs/2006.05822
Y. Koizumi, M. Yasuda, S. Murata, S. Saito, H. Uematsu, and N. Harada, "SPIDERnet: Attention Network for One-shot Anomaly Detection in Sounds," in Proc. of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2020), 2020.
M. Yasuda, Y. Koizumi, S. Saito, H. Uematsu, and K. Imoto, "Sound Event Localization based on Sound Intensity Vector Refined by DNN-based Denoising and Source Separation," in Proc. of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2020), 2020. https://arxiv.org/abs/2002.05994
K. Imoto, N. Tonami, Y. Koizumi, M. Yasuda, R. Yamanishi, and Y. Yamashita, "Sound Event Detection by Multitask Learning of Sound Events and Scenes with Soft Scene Labels," in Proc. of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2020), 2020. https://arxiv.org/abs/2002.05848
L. Mazzon, Y. Koizumi, M. Yasuda, and N. Harada, "First Order Ambisonics Domain Spatial Augmentation for DNN-based Direction of Arrival Estimation," in Proc. of the 4th Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE 2019), 2019. https://arxiv.org/abs/1910.04388

First Author Domestic Workshop Proceedings (in Japanese)

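M. Yasuda, Y. Koizumi, S. Saito, H. Uematsu, and K. Imoto, "Sound Event Localization Using Deep-Learning-based Time-Frequency Masks and Sound Intensity Vectors," 2020 Spring Meeting of the Acoustical Society of Japan, 2020.
M. Yasuda, Y. Ohishi, Y. Koizumi, and N. Harada, "Crossmodal Sound Retrieval Based on Specific Co-occurrence Relationships Denoted with Weak Labels," 2020 Autumn Meeting of the Acoustical Society of Japan, 2020.
M. Yasuda, Y. Ohishi, S. Saito, and Y. Koizumi, "Multimodal Scene Classification Utilizing Spatial Position Information of Distributed Microphones and Cameras," 2021 Spring Meeting of the Acoustical Society of Japan, 2021.
M. Yasuda, Y. Ohishi, S. Saito, and N. Harada, "Self-Attention-based Multi-sensor Fusion for Event Detection Using Distributed Cameras and Microphones," 2021 Autumn Meeting of the Acoustical Society of Japan, 2021.
M. Yasuda, Y. Ohishi, and S. Saito, "Adaptation of Sound Event Localization to Unknown Environments Using Echo Information," 2022 Spring Meeting of the Acoustical Society of Japan, 2022.
M. Yasuda, N. Harada, Y. Ohishi, A. Nakayama, S. Saito, and N. Ono, "Masked Self-distillation Modeling for Distributed Sensor-based Event Analysis," 2023 Autumn Meeting of the Acoustical Society of Japan, 2023.
M. Yasuda, S. Saito, N. Sato, A. Nakayama, and N. Harada, "Sound Event Localization Trainable without Source Direction Supervision," 2024 Autumn Meeting of the Acoustical Society of Japan, 2024.
M. Yasuda, B. T. Nguyen, N. Harada, D. Takeuchi, D. Niizumi, and Y. Ohishi, "Detection and Separation of Sound Events in Spatial Audio Signals," 2025 Autumn Meeting of the Acoustical Society of Japan, 2025.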
