Masanari Oi

Graduate Student of Computer Science at Institute of Science Tokyo.

Member of Inoue Laboratory (Natural Language Processing / Multi-Modal AI)
Member of TokyoTech-LLM (Developing Large Language Models)
AI developer at 3keigo.com (Natural Language Generation)
Research Engineer at CoeFont (Speech Synthesis)

News

Feb 2026: A paper is accepted at LREC2026 @ Mallorca 🇪🇸
Jan 2026: A paper is accepted at ICLR2026 @ Rio de Janeiro 🇧🇷
Nov 2025: A paper is accepted at AAAI2026 @ Singapore 🇸🇬
Sep 2025: We released swallow-evaluation-instruct, a comprehensive framework for evaluating the Japanese and English capabilities of post-trained LLMs.
July 2025: A paper is accepted at MELT Workshop (held in conjunction with COLM 2025) @ Montréal 🇨🇦
July 2025: A paper is accepted at COLM2025 @ Montréal 🇨🇦
Jan 2025: A paper is accepted at ICLR2025 @Singapore 🇸🇬
Nov 2024: We released swallow-evaluation, a comprehensive framework for evaluating the Japanese and English capabilities of pre-trained LLMs.
July 2024: Two papers are accepted at COLM2024 @ Pennsylvania 🇺🇸
July 2024: We released Llama3 Swallow, a series of large language models that enhance Japanese capability of Llama 3 8B and 70B.
May 2024: A paper is accepted at ACL2024 findings @Bangkok 🇹🇭
Feb 2024: Our preprint: "Likelihood-based Mitigation of Evaluation Bias in Large Language Models" is now on arXiv.
Dec 2023: We released Swallow, a large language model.

Research Interest

Natural Language Processing
- Spatial Reasoning of multi-modal AI
- Post-training of LLMs: test-time scaling, RL
- Meta-Evaluation
Speech Synthesis

Work Experience

Aug 2024 - Present:

Research Assitant

The National Institute of Advanced Industrial Science and Technology (AIST)

Apr 2023 - Present:

AI Engineer

3keigo.com

Oct 2021 - Present:

Research Engineer

CoeFont

Publication

NOTE: Some of my earlier publications appear under the name Masanari Ohi, which was a previous romanization of my name. Both refer to myself, Masanari Oi.

International Conference (peer-reviewed)

Masanari Oi, Masahiro Kaneko, Naoki Okazaki, Nakamasa Inoue. Multi-modal, Multi-task, Multi-criteria Automatic Evaluation Using a Vision Language Model. LREC2026. [paper]
Kazuki Fujii, Yukito Tajima, Sakae Mizuki, Masaki Kawamura, Hinari Shimada, Taihei Shiotani, Koshiro Saito, Masanari Oi, Taishi Nakamura, Takumi Okamoto, Shigeki Ishida, Kakeru Hattori, Youmi Ma, Hiroya Takamura, Rio Yokota, Jun Sakuma, Naoaki Okazaki. Rewriting Pre-Training Data Boosts LLM Performance in Math and Code. ICLR2026. [paper]
Nakamasa Inoue, Kanoko Goto, Masanari Oi, Martyna Gruszka, Mahiro Ukai, Takumi Hirose, Yusuke Sekikawa. DISCODE: Distribution-Aware Score Decoder for Robust Automatic Evaluation of Image Captioning. AAAI2026. [paper]
Youmi Ma, Sakae Mizuki, Kazuki Fujii, Taishi Nakamura, Masanari Ohi, Hinari Shimada, Taihei Shiotani, Koshiro Saito, Koki Maeda, Kakeru Hattori, Takumi Okamoto, Shigeki Ishida, Rio Yokota, Hiroya Takamura, Naoaki Okazaki. Building Instruction-Tuning Datasets from Human-Written Instructions with Open-Weight Large Language Models. COLM2025.
[paper]
Yuto Nishimura, Takumi Hirose, Masanari Ohi, Hideki Nakayama, Nakamasa Inoue. HALL-E: Hierarchical Neural Codec Language Model for Minute-Long Zero-Shot Text-to-Speech Synthesis. ICLR2025.
[paper]
Nakamasa Inoue, Shinta Otake, Takumi Hirose, Masanari Ohi, Rei Kawakami. ELP-Adapters: Parameter Efficient Adapter Tuning for Various Speech Processing Tasks. IEEE/ACM TASLP.
[paper]
Kazuki Fujii, Taishi Nakamura, Mengsay Loem, Hiroki Iida, Masanari Ohi, Kakeru Hattori, Shota Hirai, Sakae Mizuki, Rio Yokota, Naoaki Okazaki. Continual Pre-Training for Cross-Lingual LLM Adaptation: Enhancing Japanese Language Capabilities. COLM2024.
[paper]
Naoaki Okazaki, Kakeru Hattori, Shota Hirai, Hiroki Iida, Masanari Ohi, Kazuki Fujii, Taishi Nakamura, Mengsay Loem, Rio Yokota, Sakae Mizuki. Building a Large Japanese Web Corpus for Large Language Models. COLM2024.
[paper]
Masanari Ohi, Masahiro Kaneko, Ryuto Koike, Mengsay Loem, Naoaki Okazaki. Likelihood-based Mitigation of Evaluation Bias in Large Language Models. ACL2024 (short, Findings).
[paper][code]

Journal (peer-reviewed)

Masanari Ohi, Masahiro Kaneko, Ryuto Koike, Mengsay Loem, Naoaki Okazaki. 大規模言語モデルにおける評価バイアスの尤度に基づく緩和. (Likelihood-based Mitigation of Evaluation Bias in Large Language Models.) 自然言語処理, 32(2):480–496, Jun 2025.
[paper]

Workshop

Koshiro Saito, Sakae Mizuki, Masanari Ohi, Taishi Nakamura, Taihei Shiotani, Koki Maeda, Youmi Ma, Kakeru Hattori, Kazuki Fujii, Takumi Okamoto, Shigeki Ishida, Hiroya Takamura, Rio Yokota, Naoaki Okazaki. Why We Build Local Large Language Models: An Observational Analysis from 35 Japanese and Multilingual LLMs. Multilingual and Equitable Language Technologies (MELT), held in conjunction with COLM 2025.
[paper]

Preprint (non-peer-reviewed)

Masanari Oi, Koki Maeda, Ryuto Koike, Daisuke Oba, Nakamasa Inoue, Naoaki Okazaki. From Correspondence to Actions: Human-Like Multi-Image Spatial Reasoning in Multi-modal Large Language Models. arXiv preprint, 2026. [paper] [project page]
Masanari Oi, Mahiro Ukai, Masahiro Kaneko, Naoaki Okazaki, Nakamasa Inoue. Autoregressive Direct Preference Optimization. arXiv preprint, 2026. [paper] [project page]

Domestic Conference and Symposium (non-peer-reviewed)

Taihei Shiotani, Masahiro Kaneko, Ayana Niwa, Yuki Maruyama, Daisuke Oba, Masanari Oi, Naoaki Okazaki. JUBAKU: 日本文化における偏見評価のための敵対的ベンチマーク.(Adversarial Benchmark for Evaluating Stereotypes in Japanese Culture.) The 39th Annual Conference of the Japanese Society for Artificial Intelligence, 2025, Osaka, Japan. (in Japanese). Annual Conference Award.
Masanari Oi, Masahiro Kaneko, Naoki Okazaki, Nakamasa Inoue. 複数タスク・複数項目に跨ったマルチモーダル自動評価手法. (Multi-modal, Multi-task, Multi-criteria Automatic Evaluation Using a Vision Language Model.) The 31st Annual Meeting of the Association of Natural Language Processing (NLP2025), Nagasaki, Japan. (in Japanese). Committee Special Award.
Youmi Ma, Sakae Mizuki, Kazuki Fujii, Taishi Nakamura, Masanari Ohi, Hinari Shimada, Taihei Shiotani, Koshiro Saito, Koki Maeda, Kakeru Hattori, Takumi Okamoto, Shigeki Ishida, Rio Yokota, Hiroya Takamura, Naoaki Okazaki. 模倣学習による大規模言語モデルの指示チューニング. The 31st Annual Meeting of the Association of Natural Language Processing (NLP2025), Nagasaki, Japan. (in Japanese).
Kakeru Hattori, Sakae Mizuki, Kazuki Fujii, Taishi Nakamura, Taihei Shiotani, Kokoro Ueki, Takuro Niitsuma, Akira Kawabata, Hideaki Tamori, Youmi Ma, Koki Maeda, Masanari Ohi, Koshiro Saito, Takumi Okamoto, Shigeki Ishida, Rio Yokota, Hiroya Takamura, Naoaki Okazaki. 新聞記事からつくる時事と社会に強い日本語LLM. The 31st Annual Meeting of the Association of Natural Language Processing (NLP2025), Nagasaki, Japan. (in Japanese).

Kakeru Hattori, Naoaki Okazaki, Sakae Mizuki, Kazuki Fujii, Taishi Nakamura, Masanari Ohi, Taihei Shiotani, Koshiro Saito, Youmi Ma, Koki Maeda, Takumi Okamoto, Shigeki Ishida, Rio Yokota, Hiroya Takamura. Swallowコーパスv2: 教育的な日本語ウェブコーパスの構築. The 31st Annual Meeting of the Association of Natural Language Processing (NLP2025), Nagasaki, Japan. (in Japanese).
Koshiro Saito, Sakae Mizuki, Masanari Ohi, Takashi Nakamura, Taihei Shiotani, Koki Maeda, Ma Youmi, Kakeru Hattori, Kazuki Fujii, Takumi Okamoto, Shigeki Ishida, Hiroya Takamura, Rio Yokota, Naoaki Okazaki . LLMに日本語テキストを学習させる意義. (Advantages of Training LLMs on Japanese Text.) The 261st NL Research Presentation. Best Research Award.
Masanari Ohi, Masahiro Kaneko, Ryuto Koike, Mengsay Loem, Naoaki Okazaki. 大規模言語モデルにおける評価バイアスの尤度に基づく緩和. (Likelihood-based Mitigation of Evaluation Bias in Large Language Models.) The 30th Annual Meeting of the Association of Natural Language Processing (NLP2024), Hyogo, Japan. (in Japanese). Young Researcher Award.
Naoaki Okazaki, Kakeru Hattori, Shota Hirai, Hiroki Iida, Masanari Ohi, Kazuki Fujii, Taishi Nakamura, Mengsay Loem, Rio Yokota, Sakae Mizuki. Swallowコーパス: 日本語大規模ウェブコーパス. (Building a Large Japanese Web Corpus for Large Language Models.) The 30th Annual Meeting of the Association of Natural Language Processing (NLP2024), Hyogo, Japan. (in Japanese). Best Paper Award.
Sakae Mizuki, Hiroki Iida, Kazuki Fujii Taishi Nakamura, Mengsay Loem, Masanari Ohi, Kakeru Hattori, Shota Hirai, Rio Yokota, Naoaki Okazaki. 大規模言語モデルの日本語能力の効率的な強化: 継続事前学習における語彙拡張と対訳コーパスの活用. The 30th Annual Meeting of the Association of Natural Language Processing (NLP2024), Hyogo, Japan. (in Japanese)
Kazuki Fujii, Taishi Nakamura, Mengsay Loem, Hiroki Iida, Masanari Ohi, Kakeru Hattori, Shota Hirai, Sakae Mizuki, Rio Yokota, Naoaki Okazaki. 継続事前学習による日本語に強い大規模言語モデルの構築. The 30th Annual Meeting of the Association of Natural Language Processing (NLP2024), Hyogo, Japan. (in Japanese). Best Paper Award.

Resources / Tools

Swallow-evaluation-instruct: a comprehensive framework for evaluating the Japanese and English capabilities of post-trained LLMs.

Swallow-evaluation: a comprehensive framework for evaluating the Japanese and English capabilities of pre-trained LLMs.

Awards

Annual Conference Award (Top 35 / 1178 = 3.0%)
The 39th Annual Conference of the Japanese Society for Artificial Intelligence, 2025.
Committee Special Award (Top 63 / 726 = 8.7%)

The 31st Annual Meeting of the Association of Natural Language Processing (NLP2025), Mar. 2025.

Language Resource Award (Top 1 / 42 = 2.4%)

The 31st Annual Meeting of the Association of Natural Language Processing (NLP2025), Mar. 2025.

Best Research Award (Top 1 / 15 = 6.7%)

The 261st NL Research Presentation, Sep. 2024.

Young Researcher Award (Top 18 / 427 = 4.2%)

The 30th Annual Meeting of the Association of Natural Language Processing (NLP2024), Mar. 2024.

Best Paper Award (Top 13 / 599 = 2.1%)

The 30th Annual Meeting of the Association of Natural Language Processing (NLP2024), Mar. 2024.

Best Paper Award (Top 13 / 599 = 2.1%)

The 30th Annual Meeting of the Association of Natural Language Processing (NLP2024), Mar. 2024.

Service

Reviewer

IEEE/ACM TASLP.

Education

Apr 2024 - Present: Master of Engineering (Computer Science) Institute of Science Tokyo (Formerly known as Tokyo Institute of Technology)

Apr 2020 - Mar 2024: Bachelor of Engineering (Computer Science) Tokyo Institute of Technology

Others

Aug 2023 - Sep 2023: Short-term overseas programs in Sweden 🇸🇪
Aug 2021 - Present: Scholarship by Takenaka Scholarship Foundation

Page updated

Google Sites

Report abuse