I'm a 2nd-year PhD student in the Okumura-Funakoshi Lab at the Institute of Science Tokyo, Japan.
I'm also a founding ML engineer at Kotoba Technologies, building an AI-powered simultaneous translation system and speech foundation models.
Google Scholar: Aru Maekawa
GitHub: arumaekawa
LinkedIn: Aru Maekawa
Email: maekawa@lr.pi.titech.ac.jp / amaekawa@kotoba.tech
X: @arumaekawa
Dataset Distillation, Dataset Condensation: [NAACL2024 Findings], [ACL2023]
RST Discourse Parsing: [EACL2024], [NLP2023]
Continual Learning, Lifelong Learning: [EACL2023]
Simultaneous Speech Translation: [DOTSU]
Automatic Speech Recognition (ASR)
Text-to-Speech Synthesis (TTS)
January 2024 - present
Kotoba Technologies, Inc.
Founding member & Research engineer – Building Simultaneous Translation, Speech Foundation Models
May 2022 - September 2024
Tokyo Institute of Technology
Research assistant – Joint Research with NTT Communication Science Laboratories, Topic: RST Discourse Parsing
April 2024 - present
Institute of Science Tokyo (formerly Tokyo Institute of Technology)
Ph.D. in Engineering – Department of Information and Communications Engineering, School of Engineering
April 2022 - March 2024
Tokyo Institute of Technology
Master of Engineering – Department of Information and Communications Engineering, School of Engineering
April 2019 - March 2022 (graduated one year early)
Tokyo Institute of Technology
Bachelor of Engineering – Department of Information and Communications Engineering, School of Engineering
Journal of Natural Language Processing, Vol.32, No.1, pp.1340-7619, 2025 [paper]
DiLM: Distilling Dataset into Language Model for Text-level Dataset Distillation
Aru Maekawa, Satoshi Kosugi, Kotaro Funakoshi, and Manabu Okumura
Journal of Natural Language Processing, Vol.32, No.1, pp.283-299, 2025 [paper]
Dataset Distillation with Attention Labels for Fine-tuning BERT
Aru Maekawa, Naoki Kobayashi, Kotaro Funakoshi, and Manabu Okumura
ACL 2025 Demo., July 2025 [paper]
Live Football Commentary System Providing Background Information
Yuichiro Mori, Chikara Tanaka, Aru Maekawa, Satoshi Kosugi, Tatsuya Ishigaki, Kotaro Funakoshi, Hiroya Takamura, Manabu Okumura
NAACL 2024 Findings, June 2024 [paper] [code]
DiLM: Distilling Dataset into Language Model for Text-level Dataset Distillation
Aru Maekawa, Satoshi Kosugi, Kotaro Funakoshi, and Manabu Okumura
EACL 2024, March 2024 [paper] [code]
Can we obtain significant success in RST discourse parsing by using Large Language Models?
Aru Maekawa, Tsutomu Hirao, Hidetaka Kamigaito, and Manabu Okumura
ACL 2023, July 2023 [paper] [code]
Dataset Distillation with Attention Labels for Fine-tuning BERT
Aru Maekawa, Naoki Kobayashi, Kotaro Funakoshi, and Manabu Okumura
EACL 2023, May 2023 [paper] [code]
Generative Replay Inspired by Hippocampal Memory Indexing for Continual Language Learning
Aru Maekawa, Hidetaka Kamigaito, Kotaro Funakoshi, and Manabu Okumura
NLP 2025, March 2025 [paper]
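Shift-Reduce Constituency Parsing with Large Language Models (大規模言語モデルを用いたシフト還元型句構造解析)
中根 稜介, Aru Maekawa, Hidetaka Kamigaito, Tsutomu Hirao, and Manabu Okumura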
NLP 2024, March 2024 [paper] (Young Researcher Award)
Dataset Distillation Using Text Generation Models (テキスト生成モデルを利用したデータセット蒸留)
Aru Maekawa, Satoshi Kosugi, Kotaro Funakoshi, and Manabu Okumura
NLP 2024, March 2024 [paper]
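Imitating Shift-Reduce Rhetorical Structure Parsing with Large Language Models (大規模言語モデルによるシフト還元修辞構造解析の模倣)
Aru Maekawa, Tsutomu Hirao, Hidetaka Kamigaito, and Manabu Okumura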
NLP 2024, March 2024 [paper] (Committee Special Award)
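Examining Live Football Commentary from the Perspective of Providing Additional Information (サッカー実況中継を付加的情報の提供という側面から見る)
Yuichiro Mori, Aru Maekawa, Satoshi Kosugi, Kotaro Funakoshi, Hiroya Takamura, and Manabu Okumura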
NLP 2024, March 2024 [paper]
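R2T: Training-free Intermediate Sentence Generation by Manipulating Language Model Probabilities (R2T: 言語モデルの確率操作による学習なし中間文生成)
城戸 晴輝, Aru Maekawa, Satoshi Kosugi, Kotaro Funakoshi, and Manabu Okumura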
NLP 2023, March 2023 [paper] (Committee Special Award)
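Distilling Few-shot Data with Attention Labels for Pre-trained Transformers (事前学習済みTransformerのための注意教師付きFew-shotデータの蒸留)
Aru Maekawa, Naoki Kobayashi, Kotaro Funakoshi, and Manabu Okumura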
NLP 2023, March 2023 [paper]
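Improving Inter-sentential Rhetorical Structure Parsing through Data Augmentation with Back-translation (逆翻訳を利用したデータ拡張による文間の修辞構造解析の改善)
Aru Maekawa, Naoki Kobayashi, Tsutomu Hirao, Hidetaka Kamigaito, and Manabu Okumura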
NLP 2022, March 2022 [paper]
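Continual Learning of Language Processing Tasks via Replay Inspired by Hippocampal Memory Indexing (海馬の記憶インデックスに着想を得たリプレイによる言語処理タスクの継続学習)
Aru Maekawa, Hidetaka Kamigaito, Kotaro Funakoshi, and Manabu Okumura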
April 2024 - September 2024 [link]
Research Fellowship for Young Scientists (DC1), Japan Society for the Promotion of Science (JSPS)
200,000 yen / month + 800,000 yen / year
March 2024 [link]
Outstanding Student Award (Master), Department of Information and Communications Engineering, School of Engineering, Tokyo Institute of Technology
March 2024 [link]
Young Researcher Award, 30th Annual Meeting of the Association for Natural Language Processing (NLP2024)
Dataset Distillation Using Text Generation Models (テキスト生成モデルを利用したデータセット蒸留)
March 2024 [link]
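Committee Special Award, 30th Annual Meeting of the Association for Natural Language Processing (NLP2024)
Examining Live Football Commentary from the Perspective of Providing Additional Information (サッカー実況中継を付加的情報の提供という側面から見る) (co-authored)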
March 2023 [link]
Committee Special Award, 29th Annual Meeting of the Association for Natural Language Processing (NLP2023)
Distilling Few-shot Data with Attention Labels for Pre-trained Transformers (事前学習済みTransformerのための注意教師付きFew-shotデータの蒸留)
December 2022 [link]
Award for Research Plan Presentation (Master), Department of Information and Communications Engineering, School of Engineering, Tokyo Institute of Technology
March 2022 [link]
Outstanding Student Award (Bachelor), Department of Information and Communications Engineering, School of Engineering, Tokyo Institute of Technology