Publications

*:Co-first authors; †:Co-corresponding authors

2026

[P2] SpeedAug: Policy Acceleration via Tempo-Enriched Policy and RL Fine-Tuning

Taewook Nam, Junmo Cho, Youngsoo Jang, Sung Ju Hwang

Preprint

[W9] LLM-PriorCB: Textual Contextual Bandits with LLM-Induced Priors

Geon-Hyeong Kim, Yu Jin Kim, June Yong Yang, Woohyung Lim, Youngsoo Jang†, Moontae Lee†

ICML Workshop on Decision-Making from Offline Datasets to Online Adaptation: Black-Box Optimization to Reinforcement Learning, 2026

[W8] CAMEL: Learning Community-Aligned Metrics and Weights for LLM Evaluation

Ji Yong Cho, Bumsoo Kang, June Yong Yang, Youngsoo Jang, Chang Liu, Moontae Lee

ACL Workshop on Natural Language Generation, Evaluation, and Metrics, 2026

[C19] A Regret Minimization Framework on Preference Learning in Large Language Models

Suhwan Kim*, Taehyun Cho*, Geon-Hyeong Kim, Yu Jin Kim, Youngsoo Jang†, Moontae Lee†, Jungwoo Lee†

ICML 2026 (Spotlight Paper, Top 2.2%)

[C18] Efficiently Learning To Reason or Not to Reason: Root-token Policy Optimization for Adaptive Thinking

Taehyeon Kim, Hyunsoo Lee, Youngsoo Jang†, Moontae Lee†

ACL 2026 (Oral Presentation, Top 4.0%)

[C17] SafeDPO: A Simple Approach to Direct Preference Optimization with Enhanced Safety

Geon-Hyeong Kim, Yu Jin Kim, Byoungjip Kim, Honglak Lee, Kyunghoon Bae, Youngsoo Jang†, Moontae Lee†

ICLR 2026 (Oral Presentation, Top 1.1%)

[C16] IRPO: Implicit Policy Regularized Preference Optimization

Youngsoo Jang, Yu Jin Kim, Geon-Hyeong Kim, Honglak Lee, Moontae Lee

EACL Findings 2026 (Oral Presentation, Top 8.5%)

2025

[C15] Online Pre-Training for Offline-to-Online Reinforcement Learning

Yongjae Shin, Jeonghye Kim, Whiyoung Jung, Sunghoon Hong, Deunsol Yoon, Youngsoo Jang, Geon-Hyeong Kim, Jongseong Chae, Youngchul Sung, Kanghoon Lee, Woohyung Lim

ICML 2025

2024

[P1] Reinforcement Learning from Reflective Feedback (RLRF): Aligning and Improving LLMs via Fine-Grained Self-Reflection

Kyungjae Lee*, Dasol Hwang*, Sunghyun Park*, Youngsoo Jang, Moontae Lee

Preprint

[C14, W7] Prospector: Improving LLM Agents with Self-Asking and Trajectory Ranking

Byoungjip Kim, Youngsoo Jang, Lajanugen Logeswaran, Geon-Hyeong Kim, Yu Jin Kim, Honglak Lee, Moontae Lee

EMNLP Findings 2024

NeurIPS Foundation Models for Decision Making Workshop, 2023

[C13] Semantic Skill Grounding for Embodied Instruction-Following in Cross-Domain Environments

Sangwoo Shin*, SeungHyun Kim*, Youngsoo Jang, Moontae Lee, Honguk Woo

ACL Findings 2024

[C12, W6] Degeneration-free Policy Optimization: RL Fine-Tuning for Language Models without Degeneration

Youngsoo Jang, Geon-Hyeong Kim, Byoungjip Kim, Yu Jin Kim, Honglak Lee, Moontae Lee

ICML 2024

ICLR Workshop on Generative Models for Decision Making, 2024

[W5] Show, Think, and Tell: Thought-Augmented Fine-Tuning of Large Language Models for Video Captioning

Byoungjip Kim, Dasol Hwang, Sungjun Cho, Youngsoo Jang, Honglak Lee, Moontae Lee

CVPR Workshop on Multi-Modal Foundation Models, 2024

2023

[C11] SafeDICE: Offline Safe Imitation Learning with Non-Preferred Demonstrations

Youngsoo Jang, Geon-Hyeong Kim, Jongmin Lee, Sungryull Sohn, Byoungjip Kim, Honglak Lee, Moontae Lee

NeurIPS 2023

[C10] Information-Theoretic State Space Model for Multi-View Reinforcement Learning

HyeongJoo Hwang, Seokin Seo, Youngsoo Jang, Sungyoon Kim, Geon-Hyeong Kim, Seunghoon Hong, Kee-Eung Kim

ICML 2023 (Oral Presentation, Top 2.3%)

2022

[C9] LobsDICE: Offline Imitation Learning from Observation via Stationary Distribution Correction Estimation

Geon-Hyeong Kim*, Jongmin Lee*, Youngsoo Jang, Hongseok Yang, Kee-Eung Kim

NeurIPS 2022

[C8] GPT-Critic: Offline Reinforcement Learning for End-to-End Task-Oriented Dialogue Systems

Youngsoo Jang, Jongmin Lee, Kee-Eung Kim

ICLR 2022

2021

[C7] Monte-Carlo Planning and Learning with Language Action Value Estimates

Youngsoo Jang, Seokin Seo, Jongmin Lee, Kee-Eung Kim

ICLR 2021

2020

[C6] Variational Inference for Sequential Data with Future Likelihood Estimates

Geon-Hyeong Kim, Youngsoo Jang, Hongseok Yang, Kee-Eung Kim

ICML 2020

[C5, W4] End-to-End Neural Pipeline for Goal-Oriented Dialogue System using GPT-2

Donghoon Ham*, Jeong-Gwan Lee*, Youngsoo Jang, Kee-Eung Kim

ACL 2020

AAAI DSTC8 Workshop, 2020 (Oral Presentation)

1st place on 8th Dialog System Technology Challenge (DSTC8) Multi-domain Task Completion Track, 2019

[C4, W3] Bayes-Adaptive Monte-Carlo Planning and Learning for Goal-Oriented Dialogues

Youngsoo Jang, Jongmin Lee, Kee-Eung Kim

AAAI 2020

NeurIPS Conversational AI Workshop, 2019 (Oral Presentation)

2019

[C3] Trust Region Sequential Variational Inference

Geon-Hyeong Kim, Youngsoo Jang, Jongmin Lee, Wonseok Jeon, Hongseok Yang, Kee-Eung Kim

ACML 2019

[C2] PyOpenDial: A Python-based Domain-Independent Toolkit for Developing Spoken Systems with Probabilistic Rules

Youngsoo Jang*, Jongmin Lee*, Jaeyoung Park*, Kyeng-Hun Lee, Pierre Lison, KeeEung Kim

EMNLP 2019

2018

[J1] Cross-language Neural Dialog State Tracker for Large Ontologies using Hierarchical Attention

Youngsoo Jang, Jiyeon Ham, Byung-Jun Lee, Kee-Eung Kim

IEEE/ACM Transactions on Audio, Speech, and Language Processing (TASLP), 2018

2017

[C1, W2] Constrained Bayesian Reinforcement Learning via Approximate Linear Programming

Jongmin Lee, Youngsoo Jang, Pascal Poupart, Kee-Eung Kim

IJCAI 2017

ECML-PKDD Workshop on Scaling-Up Reinforcement Learning (SURL), 2017

2016

[W1] Neural Dialog State Tracker for Large Ontologies by Attention Mechanism

Youngsoo Jang*, Jiyeon Ham*, Byung-Jun Lee, Youngjae Chang, Kee-Eung Kim

IEEE Workshop on Spoken Language Technology (SLT), 2016

3rd place on 5th Dialog State Tracking Challenge (DSTC), 2016

Page updated

Google Sites

Report abuse