Publications
Journal
Y. Masuyama, K. Yamaoka, T. Kawamura, and N. Ono, "Efficient joint optimization of sampling rate offsets using entire multichannel signal," IEEE/ACM Trans. Audio, Speech, Lang. Process., vol. 32, pp. 1816-1828, 2024.
Y.-J. Lu, X. Chang, C. Li, W. Zhang, S. Cornell, Z. Ni, Y. Masuyama, B. Yan, R. Scheibler, Z.-Q. Wang, Y. Tsao, Y. Qian, and S. Watanabe, "Software Design and User Interface of ESPnet-SE++: Speech Enhancement for Robust Speech Processing," J. Open Source Softw., 2023.
Y. Masuyama, K. Yamaoka, Y. Kinoshita, T. Nakashima, and N. Ono, "Causal and relaxed-distortionless response beamforming for online target source extraction," IEEE/ACM Trans. Audio, Speech, Lang. Process., vol. 32, pp. 310-324, 2024. [Project page]
Y. Masuyama, K. Yatabe, K. Nagatomo and Y. Oikawa, "Online phase reconstruction via DNN-based phase differences estimation," IEEE/ACM Trans. Audio, Speech, Lang. Process., vol. 31, pp. 163-176, 2023.
K. Kobayashi, Y. Masuyama, K. Yatabe and Y. Oikawa, "Phase-recovery algorithm for harmonic/percussive source separation based on observed phase information and analytic computation," Acoust. Sci. & Tech., vol. 42, no. 5, pp. 261-269, 2021.
Y. Bando, Y. Masuyama, Y. Sasaki and M. Onishi, "Robust auditory functions based on probabilistic integration of MUSIC and CGMM," IEEE Access, vol. 9, pp. 38718-38730, 2021.
Y. Masuyama, K. Yatabe, Y. Koizumi, Y. Oikawa and N. Harada, "Deep Griffin-Lim iteration: Trainable iterative phase reconstruction using neural network," IEEE J. Sel. Top. Signal Process., vol. 15, no. 1, pp. 37-50, 2021. (IEEE SPS Tokyo Joint Chapter Student Journal Paper Award) [Project page]
Y. Masuyama, T. Kusano, K. Yatabe and Y. Oikawa, "Modal decomposition of musical instrument sounds via optimization-based non-linear filtering," Acoust. Sci. & Tech., vol. 40, no. 3, pp. 186-197, 2019.
Letters
Y. Bando, K. Sekiguchi, Y. Masuyama, A. A. Nugraha, M. Fontaine and K. Yoshii, "Neural full-rank spatial covariance analysis for blind source separation," IEEE Signal Process. Lett., vol. 28, pp. 1670-1674, Aug. 2021. [Project page]
Y. Masuyama, K. Yatabe, K. Nagatomo and Y. Oikawa, "Joint amplitude and phase refinement for monaural source separation," IEEE Signal Process. Lett., vol. 27, pp. 1939-1943, Oct. 2020. [MATLAB CODE]
Y. Masuyama, K. Yatabe and Y. Oikawa, "Griffin-Lim like phase recovery via alternating direction method of multipliers," IEEE Signal Process. Lett., vol. 26, no. 1, pp. 184-188, Jan. 2019. [Project page] [MATLAB CODE]
Tutorial Paper
K. Yatabe, Y. Masuyama, T. Kusano and Y. Oikawa, "Representation of complex spectrogram via phase conversion," Acoust. Sci. & Tech., vol. 40, no. 3, pp. 170-177, May 2019. [MATLAB CODE]
K. Yatabe, Y. Masuyama, T. Kusano and Y. Oikawa, "Representation of complex spectrogram via phase conversion," J. Acoust. Soc. Jpn., vol. 75, no. 3, pp. 147-155, Mar. 2019. (in Japanese)
International Conference
Y. Masuyama, G. Wichern, F. Germain, Z. Pan, S. Khurana, C. Hori, and J. Le Roux, "NIIRF: Neural IIR Filter Field for HRTF Upsampling and Personalization," ICASSP 2024. (Accepted)
Z. Pan, G. Wichern, Y. Masuyama, F. Germain, S. Khurana, C. Hori, and J. Le Roux, "Scenario-Aware Audio-Visual TF-GridNet for Target Speech Extraction," ASRU 2023. (Accepted)
K. Yamada, Y. Masuyama, K. Yamaoka, and N. Ono, "Fundamental Frequency Estimation Based on Finite-Order Harmonic Constraint Differential Equation," Asia-Pacific Signal Inf. Process. Assoc. Annual Summit Conf. (APSIPA ASC), 2023. (Accepted)
Y. Masuyama*, X. Chang*, W. Zhang, S. Cornell, Z.-Q. Wang, N. Ono, Y. Qian, and S. Watanabe, "Exploring the Integration of Speech Separation and Recognition with Self-Supervised Learning Representation," WASPAA 2023. (Accepted)
Y. Masuyama, N. Ueno, and N. Ono, "Signal Reconstruction from Mel-spectrogram Based on Bi-level Consistency of Full-band Magnitude and Phase," WASPAA 2023. (Accepted)
Y. Bando, Y. Masuyama, A. A. Nugraha, and K. Yoshii, "Neural Fast Full-Rank Spatial Covariance Analysis for Blind Source Separation," Eur. Signal Process. Conf. (EUSIPCO), 2023. (Accepted)
S. Cornell, Z.-Q. Wang, Y. Masuyama, S. Watanabe, M. Pariente, N. Ono, and S. Squartini, "Multi-Channel Speaker Extraction with Adversarial Training: The WavLab Submission to The Clarity ICASSP 2023 Grand Challenge," IEEE Int. Conf. Acoust., Speech Signal Process. (ICASSP), June 2023.
Y. Masuyama, X. Chang, S. Cornell, S. Watanabe, and N. Ono, "End-to-End Integration of Speech Recognition, Dereverberation, Beamforming, and Self-Supervised Learning Representation," Spok. Lang. Tech. Workshop (SLT), Jan. 2023. (Best Student Paper Award)
S. Cornell, Z.-Q. Wang, Y. Masuyama, S. Watanabe, M. Pariente, and N. Ono, "Multi-Channel Target Speaker Extraction with Refinement: The WavLab Submission to The Second Clarity Enhancement Challenge," Clarity Challenge, Dec. 2022.
K. Yamada, Y. Masuyama, Y. Wakabayashi, and N. Ono, "Simultaneous frequency estimation for three or more sinusoids based on sinusoidal constraint differential equation," Asia-Pacific Signal Inf. Process. Assoc. Annual Summit Conf. (APSIPA ASC), Nov. 2022.
Y. Masuyama, K. Yamaoka, and N. Ono, "Joint optimization of sampling rate offsets based on entire signal relationship among distributed microphones," Interspeech, Aug. 2022.
Y.-J. Lu, X. Chang, C. Li, W. Zhang, S. Cornell, Z. Ni, Y. Masuyama, B. Yan, R. Scheibler, Z.-Q. Wang, Y. Tsao, Y. Qian, and S. Watanabe, "ESPnet-SE++: Speech enhancement for robust speech recognition, translation, and understanding," Interspeech, Aug. 2022.
Y. Masuyama, K. Yamaoka, Y. Kinoshita, and N. Ono, "Causal distortionless response beamforming by alternating direction method of multipliers," Asia-Pacific Signal Inf. Process. Assoc. Annual Summit Conf. (APSIPA ASC), Dec. 2021.
Y. Masuyama, T. Tanaka, K. Yatabe, T. Kusano and Y. Oikawa, "Simultaneous declipping and beamforming via alternating direction method of multipliers," Eur. Signal Process. Conf. (EUSIPCO), Aug. 2021.
M. Togami, Y. Masuyama, T. Komatsu, K. Yoshii and T. Kawahara, "Computer-resource-aware deep speech separation with a run-time-specified number of BLSTM layers," Asia-Pacific Signal Inf. Process. Assoc. Annual Summit Conf. (APSIPA ASC), Dec. 2020.
Y. Masuyama, Y. Bando, K. Yatabe, Y. Sasaki, M. Onishi and Y. Oikawa, "Self-supervised neural audio-visual sound source localization via probabilistic spatial modeling," IEEE/RSJ Int. Conf. Intell. Robots Syst. (IROS), Oct. 2020. (IEEE RAS Japan Chapter Young Award)
Y. Masuyama, M. Togami and T. Komatsu, "Consistency-aware multi-channel speech enhancement using deep neural networks," IEEE Int. Conf. Acoust., Speech Signal Process. (ICASSP), May 2020.
Y. Masuyama, K. Yatabe, Y. Koizumi, Y. Oikawa and N. Harada, "Phase reconstruction based on recurrent phase unwrapping with deep neural networks," IEEE Int. Conf. Acoust., Speech Signal Process. (ICASSP), May 2020. [Project page]
Y. Koizumi, K. Yatabe, M. Delcroix, Y. Masuyama and D. Takeuchi, "Speech enhancement using self-adaptation and multi-head self-attention," IEEE Int. Conf. Acoust., Speech Signal Process. (ICASSP), May 2020.
M. Togami, Y. Masuyama, T. Komatsu and Y. Nakagome, "Unsupervised training for deep speech source separation with Kullback-Leibler divergence based probabilistic loss function," IEEE Int. Conf. Acoust., Speech Signal Process. (ICASSP), May 2020.
Y. Masuyama, M. Togami and T. Komatsu, "Multichannel loss function for supervised speech source separation by mask-based beamforming," Interspeech, Sep. 2019.
T. Kusano, Y. Masuyama, K. Yatabe and Y. Oikawa, "Designing nearly tight window for improving time-frequency masking," Int. Congr. Acoust. (ICA), Sep. 2019.
Y. Masuyama, K. Yatabe, Y. Koizumi, Y. Oikawa and N. Harada, "Deep Griffin-Lim iteration," IEEE Int. Conf. Acoust., Speech Signal Process. (ICASSP), May 2019. (IEEE SPS Tokyo Joint Chapter Student Conference Paper Award)
Y. Masuyama, K. Yatabe and Y. Oikawa, "Low-rankness of complex-valued spectrogram and its application to phase-aware audio processing," IEEE Int. Conf. Acoust., Speech Signal Process. (ICASSP), May 2019.
Y. Masuyama, K. Yatabe and Y. Oikawa, "Phase-aware harmonic/percussive source separation via convex optimization," IEEE Int. Conf. Acoust., Speech Signal Process. (ICASSP), May 2019.
Y. Masuyama, K. Yatabe and Y. Oikawa, "Model-based phase recovery of spectrograms via optimization on Riemannian manifolds," Int. Workshop Acoust. Signal Enhanc. (IWAENC), Sep. 2018.
K. Yatabe, Y. Masuyama and Y. Oikawa, "Rectified linear unit can assist Griffin-Lim phase recovery," Int. Workshop Acoust. Signal Enhanc. (IWAENC), Sep. 2018.
Y. Masuyama, T. Kusano, K. Yatabe and Y. Oikawa, "Modal decomposition of musical instrument sound via alternating direction method of multipliers," IEEE Int. Conf. Acoust., Speech Signal Process. (ICASSP), Apr. 2018.
Domestic Conferences (in Japanese)
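Y. Masuyama, "Trends in source separation and speech enhancement at ICASSP 2024," IEICE Technical Committee on Signal Processing, May 2024.
Y. Masuyama, K. Yamaoka, Y. Kinoshita, and N. Ono, "Online implementation of the causal MPDR beamformer and evaluation of the effect of tap length," Proc. Acoust. Soc. Jpn. Meeting, Sep. 2022.
Y. Masuyama, K. Yamaoka, and N. Ono, "Initialization-robust blind synchronization by progressively widening the frequency band used for likelihood computation," IEICE Technical Committee on Engineering Acoustics, Aug. 2022.
Y. Masuyama, K. Yamaoka, and N. Ono, "Blind synchronization of multiple asynchronous recordings via the auxiliary function method," Proc. Acoust. Soc. Jpn. Meeting, Mar. 2022.
K. Yamada, Y. Masuyama, Y. Wakabayashi, and N. Ono, "Simultaneous frequency estimation of multiple sinusoids based on differential equations," Proc. Acoust. Soc. Jpn. Meeting, Mar. 2022.
Y. Bando, Y. Masuyama, Y. Sasaki, and M. Onishi, "Sound source map generation in crowded environments," 58th JSAI SIG on AI Challenges, Nov. 2021. (JSAI Incentive Award)
Y. Masuyama, K. Yamaoka, Y. Kinoshita, and N. Ono, "Design of causal MPDR beamformers via proximal splitting optimization," Proc. Acoust. Soc. Jpn. Meeting, Sep. 2021.
Y. Masuyama, Y. Bando, Y. Sasaki, M. Onishi, K. Yatabe, and Y. Oikawa, "Self-supervised learning of sound source localization and sound activity detection based on audio-visual integration," 83rd Natl. Conv. IPSJ, Mar. 2021. (Student Encouragement Award)
Y. Masuyama, K. Yatabe, K. Nagatomo, and Y. Oikawa, "Phase recovery based on the generalized KL divergence for monaural source separation," Proc. Acoust. Soc. Jpn. Meeting, Mar. 2021.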
Y. Bando, K. Kudo, Y. Masuyama, Y. Sasaki, and M. Onishi, "Robot audition based on the MUSIC method and complex Gaussian mixture model," 38th Annu. Conf. Robot. Soc. Jpn. (RSJ), Oct. 2020.
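Y. Masuyama, Y. Bando, Y. Sasaki, M. Onishi, K. Yatabe, and Y. Oikawa, "Self-supervised learning of an audio-visual DNN that jointly estimates the number and locations of sound sources," Proc. Acoust. Soc. Jpn. Meeting, Sep. 2020.
Y. Masuyama, K. Yatabe, Y. Koizumi, N. Harada, and Y. Oikawa, "Deep Griffin-Lim phase recovery using complex-valued DNNs," Proc. Acoust. Soc. Jpn. Meeting, Sep. 2020.
K. Nagatomo, Y. Masuyama, K. Yatabe, Y. Oikawa, D. Takeuchi, and Y. Koizumi, "DNN-based speech enhancement using multi-resolution spectrograms," Proc. Acoust. Soc. Jpn. Meeting, Sep. 2020.
Y. Masuyama, Y. Bando, M. Onishi, K. Yatabe, and Y. Oikawa, "Self-supervised deep sound source localization using omnidirectional images and multichannel audio signals," Proc. Acoust. Soc. Jpn. Meeting, Mar. 2020.
K. Nagatomo, Y. Masuyama, D. Takeuchi, K. Yatabe, and Y. Oikawa, "DNN-based singing voice separation using multi-resolution spectrograms," Proc. Acoust. Soc. Jpn. Meeting, Mar. 2020.
Y. Masuyama, K. Yatabe, Y. Koizumi, N. Harada, and Y. Oikawa, "DNN-based phase recovery based on phase derivatives," Proc. Acoust. Soc. Jpn. Meeting, Sep. 2019. [Corrected manuscript]
Y. Masuyama, K. Yatabe, and Y. Oikawa, "Sparse and low-rank modeling of complex spectrograms," Proc. Acoust. Soc. Jpn. Meeting, Sep. 2019.
K. Yatabe, Y. Masuyama, T. Kusano, and Y. Oikawa, "MATLAB Instantaneous Frequency Toolbox," Proc. Acoust. Soc. Jpn. Meeting, Sep. 2019.
K. Nagatomo, Y. Masuyama, D. Takeuchi, K. Yatabe, and Y. Oikawa, "Phase-aware harmonic/percussive source separation," Proc. Acoust. Soc. Jpn. Meeting, Sep. 2019.
Y. Masuyama, K. Yatabe, Y. Koizumi, N. Harada, and Y. Oikawa, "DeGLI: Deep Griffin-Lim phase recovery," Proc. Acoust. Soc. Jpn. Meeting, Mar. 2019.
Y. Masuyama, K. Yatabe, and Y. Oikawa, "Low-rank modeling of complex spectrograms based on instantaneous frequency," Proc. Acoust. Soc. Jpn. Meeting, Mar. 2019.
T. Kusano, Y. Masuyama, K. Yatabe, and Y. Oikawa, "Window functions that improve time-frequency masking performance," Proc. Acoust. Soc. Jpn. Meeting, Mar. 2019.
Y. Masuyama, K. Yatabe, and Y. Oikawa, "Griffin-Lim-type phase recovery using ADMM," Proc. Acoust. Soc. Jpn. Meeting, Sep. 2018.
Y. Masuyama, K. Yatabe, and Y. Oikawa, "Phase recovery via a sinusoidal model and optimization on manifolds," Proc. Acoust. Soc. Jpn. Meeting, Sep. 2018.
T. Kusano, Y. Masuyama, K. Yatabe, and Y. Oikawa, "Effect of the synthesis window of the inverse short-time Fourier transform on acoustic signal processing," Proc. Acoust. Soc. Jpn. Meeting, Sep. 2018.
K. Yatabe, Y. Masuyama, and Y. Oikawa, "Can ReLU assist the Griffin-Lim algorithm?," Proc. Acoust. Soc. Jpn. Meeting, Sep. 2018.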
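Y. Masuyama, T. Kusano, K. Yatabe, Y. Oikawa, K. Oishi, Y. Miyagi, and K. Takahashi, "Analysis of musical instrument sounds by modal decomposition using the alternating direction method of multipliers," Proc. Acoust. Soc. Jpn. Meeting, Mar. 2018.
Y. Masuyama, T. Kusano, K. Yatabe, Y. Oikawa, Y. Miyagi, and K. Oishi, "Modal decomposition of musical instrument sounds via optimization with a data-fidelity constraint," Tech. Rep. Musical Acoust., Acoust. Soc. Jpn., MA2017-36, Oct. 2017.
Y. Masuyama, T. Kusano, K. Yatabe, Y. Oikawa, Y. Miyagi, and K. Oishi, "Modal decomposition of musical instrument sounds using constrained optimization," Proc. Acoust. Soc. Jpn. Meeting, Sep. 2017. (Student Excellent Presentation Award)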