Integrating Generative and Contrastive Approaches for Human Action Recognition
Pablo Cervantes, Yusuke Sekikawa, Ikuro Sato, Koichi Shinoda
IEEE Access, vol. 13, pp. 100095-100104, 2025, https://doi.org/10.1109/ACCESS.2025.3575707
ContextualCoder: Adaptive In-context Prompting for Programmatic Visual Question Answering
Ruoyue Shen, Nakamasa Inoue, Dayan Guan, Rizhao Cai, Alex C. Kot, Koichi Shinoda
IEEE Transactions on Multimedia, Feb. 17, 2025, https://doi.org/10.1109/TMM.2025.3543043
SepVAC: Multitask Learning of Speaker Separation, Speaker Localization, Microphone Array Localization, and Room Acoustic Parameter Estimation in Various Acoustic Conditions
Roland Hartanto, Sakriani Sakti, Koichi Shinoda
Proc. Interspeech 2025, Aug. 17-21, 2025, Rotterdam, Netherlands, pp. 2480-2484
Diffusion-based Generative Regularization for Supervised Discriminative Learning
Takuya Asakura, Nakamasa Inoue, Koichi Shinoda
Proc. IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), Feb. 28-Mar. 4, 2025, pp. 8915-8926
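A Single-Channel Speech Separation Method Based on Knowledge Distillation Using a Multi-Channel Model (マルチチャンネルモデルを用いた知識蒸留による単一チャンネル音声分離手法)
二通大地, Roland Hartanto, Koichi Shinoda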
ASJ Autumn Meeting, Sep. 10-12, 2025
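A Knowledge Distillation Method Using a Multi-Channel Model for Single-Channel Speech Separation (単一チャンネル音声分離のためのマルチチャンネルモデルを用いた知識蒸留手法)
二通大地, Roland Hartanto, Koichi Shinoda
IEICE Technical Report, SP, vol. 125, no. 74, pp. 10-15, Jun. 2025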
Speaker Adaptation in Speech Recognition Using Phoneme-Level Speaker Information (音韻レベルの話者情報を用いた音声認識における話者適応)
伊藤光一, Koichi Shinoda
ASJ Spring Meeting, Mar. 17-19, 2025
Multitask Training of Multi-channel Speaker Separation and Room Acoustic Parameter Estimation
Roland Hartanto, Sakriani Sakti, Koichi Shinoda
ASJ Spring Meeting, Mar. 17-19, 2025