*:Co-first authors; †:Co-corresponding authors
2026
[C17] SafeDPO: A Simple Approach to Direct Preference Optimization with Enhanced Safety
Geon-Hyeong Kim, Yu Jin Kim, Byoungjip Kim, Honglak Lee, Kyunghoon Bae, Youngsoo Jang†, Moontae Lee†
ICLR 2026 (Oral Presentation)
[C16] IRPO: Implicit Policy Regularized Preference Optimization
Youngsoo Jang, Yu Jin Kim, Geon-Hyeong Kim, Honglak Lee, Moontae Lee
EACL Findings 2026 (Oral Presentation)
2025
[C15] Online Pre-Training for Offline-to-Online Reinforcement Learning
Yongjae Shin, Jeonghye Kim, Whiyoung Jung, Sunghoon Hong, Deunsol Yoon, Youngsoo Jang, Geon-Hyeong Kim, Jongseong Chae, Youngchul Sung, Kanghoon Lee, Woohyung Lim
ICML 2025
2024
[P1] Reinforcement Learning from Reflective Feedback (RLRF): Aligning and Improving LLMs via Fine-Grained Self-Reflection
Kyungjae Lee*, Dasol Hwang*, Sunghyun Park*, Youngsoo Jang, Moontae Lee
Preprint
[C14, W7] Prospector: Improving LLM Agents with Self-Asking and Trajectory Ranking
Byoungjip Kim, Youngsoo Jang, Lajanugen Logeswaran, Geon-Hyeong Kim, Yu Jin Kim, Honglak Lee, Moontae Lee
EMNLP Findings 2024
NeurIPS Foundation Models for Decision Making Workshop, 2023
[C13] Semantic Skill Grounding for Embodied Instruction-Following in Cross-Domain Environments
Sangwoo Shin*, SeungHyun Kim*, Youngsoo Jang, Moontae Lee, Honguk Woo
ACL Findings 2024
[C12, W6] Degeneration-free Policy Optimization: RL Fine-Tuning for Language Models without Degeneration
Youngsoo Jang, Geon-Hyeong Kim, Byoungjip Kim, Yu Jin Kim, Honglak Lee, Moontae Lee
ICML 2024
ICLR Workshop on Generative Models for Decision Making, 2024
[W5] Show, Think, and Tell: Thought-Augmented Fine-Tuning of Large Language Models for Video Captioning
Byoungjip Kim, Dasol Hwang, Sungjun Cho, Youngsoo Jang, Honglak Lee, Moontae Lee
CVPR Workshop on Multi-Modal Foundation Models, 2024
2023
[C11] SafeDICE: Offline Safe Imitation Learning with Non-Preferred Demonstrations
Youngsoo Jang, Geon-Hyeong Kim, Jongmin Lee, Sungryull Sohn, Byoungjip Kim, Honglak Lee, Moontae Lee
NeurIPS 2023
[C10] Information-Theoretic State Space Model for Multi-View Reinforcement Learning
HyeongJoo Hwang, Seokin Seo, Youngsoo Jang, Sungyoon Kim, Geon-Hyeong Kim, Seunghoon Hong, Kee-Eung Kim
ICML 2023 (Oral Presentation)
2022
[C9] LobsDICE: Offline Imitation Learning from Observation via Stationary Distribution Correction Estimation
Geon-Hyeong Kim*, Jongmin Lee*, Youngsoo Jang, Hongseok Yang, Kee-Eung Kim
NeurIPS 2022
[C8] GPT-Critic: Offline Reinforcement Learning for End-to-End Task-Oriented Dialogue Systems
Youngsoo Jang, Jongmin Lee, Kee-Eung Kim
ICLR 2022
2021
[C7] Monte-Carlo Planning and Learning with Language Action Value Estimates
Youngsoo Jang, Seokin Seo, Jongmin Lee, Kee-Eung Kim
ICLR 2021
2020
[C6] Variational Inference for Sequential Data with Future Likelihood Estimates
Geon-Hyeong Kim, Youngsoo Jang, Hongseok Yang, Kee-Eung Kim
ICML 2020
[C5, W4] End-to-End Neural Pipeline for Goal-Oriented Dialogue System using GPT-2
Donghoon Ham*, Jeong-Gwan Lee*, Youngsoo Jang, Kee-Eung Kim
ACL 2020
AAAI DSTC8 Workshop, 2020 (Oral Presentation)
1st place on 8th Dialog System Technology Challenge (DSTC8) Multi-domain Task Completion Track, 2019
[C4, W3] Bayes-Adaptive Monte-Carlo Planning and Learning for Goal-Oriented Dialogues
Youngsoo Jang, Jongmin Lee, Kee-Eung Kim
AAAI 2020
NeurIPS Conversational AI Workshop, 2019 (Oral Presentation)
2019
[C3] Trust Region Sequential Variational Inference
Geon-Hyeong Kim, Youngsoo Jang, Jongmin Lee, Wonseok Jeon, Hongseok Yang, Kee-Eung Kim
ACML 2019
[C2] PyOpenDial: A Python-based Domain-Independent Toolkit for Developing Spoken Systems with Probabilistic Rules
Youngsoo Jang*, Jongmin Lee*, Jaeyoung Park*, Kyeng-Hun Lee, Pierre Lison, KeeEung Kim
EMNLP 2019
2018
[J1] Cross-language Neural Dialog State Tracker for Large Ontologies using Hierarchical Attention
Youngsoo Jang, Jiyeon Ham, Byung-Jun Lee, Kee-Eung Kim
IEEE/ACM Transactions on Audio, Speech, and Language Processing (TASLP), 2018
2017
[C1, W2] Constrained Bayesian Reinforcement Learning via Approximate Linear Programming
Jongmin Lee, Youngsoo Jang, Pascal Poupart, Kee-Eung Kim
IJCAI 2017
ECML-PKDD Workshop on Scaling-Up Reinforcement Learning (SURL), 2017
2016
[W1] Neural Dialog State Tracker for Large Ontologies by Attention Mechanism
Youngsoo Jang*, Jiyeon Ham*, Byung-Jun Lee, Youngjae Chang, Kee-Eung Kim
IEEE Workshop on Spoken Language Technology (SLT), 2016
3rd place on 5th Dialog State Tracking Challenge (DSTC), 2016