I am an assistant professor at the Department of AI at Yonsei University. Previously, I was a postdoc with Yiming Yang at CMU. I earned my Ph.D. at KAIST under the supervision of Prof. Jinwoo Shin. During Ph.D., I also worked closely with Prof. Dongyeop Kang at the University of Minnesota. I am a recipient of Qualcomm Innovation Fellowship Korea (2021, 2022) for three of my papers. I also receive Silver prize from Samsung Humantech Paper Awards.
Contact: jaehyungk@yonsei.ac.kr (or wogudehowl@gmail.com )
Google Scholar, CV (update on 25.01)
Apr 2025: Jaehyung will serve as an Area Chair for NeurIPS 2025!
Jan 2025: Fermi is accepted to NAACL 2025!
Jan 2025: SPA is accepted to ICLR 2025 as Oral presentation (207/11672=1.77%)!
Sep 2024: Two papers (OCTree and MAC) are accepted to NeurIPS 2024!
Sep 2024: CoBB🪽 is accepted to EMNLP 2024! See you in Miami 🌴!
Sep 2024: I'm joining the Department of Artificial Intelligence at Yonsei University as an Assistant Professor!
My research goal is to enhance machine learning (ML) and NLP frameworks to be more accurate and reliable in real-world scenarios, by designing proper algorithms. Recently, I’ve been mostly interested in large language models (LLMs), especially in their alignment, adaptation (e.g., personalization), and development. While my recent focus has mainly been on NLP and LLMs, I’m also interested in improving ML frameworks in other domains.
(C: Conference, W: Workshop, P: Preprint, *: Equal contribution)
Jaehyung Kim, and Yiming Yang
Annual Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics (NAACL) 2025 (Long, Main)
Dongyoung Kim, Kimin Lee, Jinwoo Shin, and Jaehyung Kim
International Conference on Learning Representations (ICLR) 2025 (Oral Presentation, 207/11672=1.77%)
Hamin Koo, and Jaehyung Kim
Arxiv Preprint 2025
Hyunseok Lee*, Seunghyuk Oh*, Jaehyung Kim, Jinwoo Shin, and Jihoon Tack
International Conference on Learning Representations (ICLR) 2025, Reasoning and Planning for LLMs Workshop
Jaehyun Nam*, Kyuyoung Kim*, Seunghyuk Oh, Jihoon Tack, Jaehyung Kim, and Jinwoo Shin
Jihoon Tack, Jaehyung Kim, Eric Mitchell, Jinwoo Shin, Yee Whye Teh, and Jonathan Richard Schwarz
Jaehyung Kim, Dongyoung Kim, and Yiming Yang
Conference on Empirical Methods in Natural Language Processing (EMNLP) 2024 (Long, Main)
[C13, W3] Tabular Transfer Learning via Prompting LLMs [pdf][code]
Jaehyun Nam, Woomin Song, Seong Hyeon Park, Jihoon Tack, Sukmin Yun, Jaehyung Kim, Kyu Hwan Oh, and Jinwoo Shin
Conference on Language Modeling (COLM) 2024
ICML Workshop on Efficient Systems for Foundation Models (ES-FoMo) 2023
[C12] Hierarchical Context Merging: Better Long Context Understanding for Pre-trained LLMs [pdf][code]
Woomin Song*, Seunghyuk Oh*, Sangwoo Mo, Jaehyung Kim, Sukmin Yun, Jung-Woo Ha, and Jinwoo Shin
International Conference on Learning Representations (ICLR) 2024
[C11] SuRe: Summarizing Retrievals using Answer Candidates for Open-domain QA of LLMs [pdf][code][slide][poster]
Jaehyung Kim, Jaehyun Nam, Sangwoo Mo, Jongjin Park, Sang-Woo Lee, Minjoon Seo, Jung-Woo Ha, and Jinwoo Shin
International Conference on Learning Representations (ICLR) 2024
Yunhui Jang, Jaehyung Kim, and Sungsoo Ahn
Neural Information Processing Systems (NeurIPS) 2024, AIDrugX Workshop
Hyosoon Jang, Yunhui Jang, Jaehyung Kim, and Sungsoo Ahn
Arxiv Preprint 2024
Ritik Sachin Parkar*, Jaehyung Kim*, Jong Inn Park, and Dongyeop Kang
Arxiv Preprint 2024
[P2] Under the Surface: Tracking the Artifactuality of LLM-Generated Data [pdf][code][project]
Debarati Das*, Karin De Langis*, Anna Martin*, Jaehyung Kim*, Minhwa Lee*, Zae Myung Kim*, Shirley Hayati, Risako Owan, Bin Hu, Ritik Parkar, Ryan Koo, Jonginn Park, Aahan Tyagi, Libby Ferland, Sanjali Roy, Vincent Liu, and Dongyeop Kang
Arxiv Preprint 2024
[W1] Meta-Crafting: Improved Detection of Out-of-distributed Texts via Crafting Metadata Space
Ryan Koo, Yekyung Kim, Dongyeop Kang, and Jaehyung Kim
AAAI Conference on Artificial Intelligence (AAAI) Student Abstract and Poster Program 2024
[C10] RoAST: Robustifying Language Models via Adversarial Perturbation with Selective Training [pdf][code][slide][poster]
Jaehyung Kim, Yuning Mao, Rui Hou, Hanchao Yu, Davis Liang, Pascale Fung, Qifan Wang, Fuli Feng, Lifu Huang, and Madian Khabsa
Conference on Empirical Methods in Natural Language Processing (EMNLP) 2023 (Long, Findings)
[C9] A Universal Framework for Dataset Characterization with Multidimensional Meta-information [pdf][code][slide][poster]
Jaehyung Kim, Yekyung Kim, Karin Johanna Denton de Langis, Jinwoo Shin, and Dongyeop Kang
Annual Meeting of the Association for Computational Linguistics (ACL) 2023 (Long, Main)
[C8] Prefer to Classify: Improving Text Classifiers via Auxiliary Preference Learning [pdf][code][slide][poster]
Jaehyung Kim, Jinwoo Shin, and Dongyeop Kang
Ruyuan Wan, Jaehyung Kim, and Dongyeop Kang
AAAI Conference on Artificial Intelligence (AAAI) 2023 (Oral Presentation)
Sukmin Yun, Jaehyung Kim, Dongyoon Han, Hwanjun Song, Jung-Woo Ha, and Jinwoo Shin
Winners, Qualcomm Innovation Fellowship Korea 2022
Sukmin Yun, Hankook Lee, Jaehyung Kim, and Jinwoo Shin
Conference on Computer Vision and Pattern Recognition (CVPR) 2022 (Oral Presentation)
Jaehyung Kim, Dongyeop Kang, Sungsoo Ahn, and Jinwoo Shin
International Conference on Learning Representations (ICLR) 2022
Silver Prize, Samsung Humantech Paper Awards 2021
Winners, Qualcomm Innovation Fellowship Korea 2022
Junhyun Nam, Jaehyung Kim, Jaeho Lee, and Jinwoo Shin
International Conference on Learning Representations (ICLR) 2022
Jaehyung Kim, Youngbum Hur, Sejun Park, Eunho Yang, Sung Ju Hwang, and Jinwoo Shin
Jaehyung Kim*, Jongheon Jeong*, and Jinwoo Shin
Conference on Computer Vision and Pattern Recognition (CVPR) 2020
Kimin Lee, Jaehyung Kim, Song Chong, and Jinwoo Shin
ArXiv Preprint 2017
Collaborator, Naver AI, Jeongja, South Korea - with Sang-Woo Lee, Minjoon Seo, and Jung-Woo Ha (Nov. 2022 - May. 2023)
Research Intern, Meta AI, Seattle, WA - with Madian Khabsa (May. 2022 - Aug. 2022)
Visiting Student, University of Minnesota, Minneapolis, MN - with Dongyeop Kang (Feb. 2022 - May. 2022)
Visiting Student, Samsung Advanced Institute of Technology, Suwon, South Korea - with Eunho Yang and Sung Ju Hwang (Jan. 2020 - Mar. 2020)
Senior Program Committee
Area Chair, Neural Information Processing Systems (NeurIPS): 2025
Conference Reviewer
AAAI Conference on Artificial Intelligence (AAAI): 2021, 2022, 2023
Association for Computational Linguistics (ACL) Rolling Review: 2022-2025
Conference on Computer Vision and Pattern Recognition (CVPR): 2023
Conference on Empirical Methods in Natural Language Processing (EMNLP): 2022, 2023
International Conference on Learning Representations (ICLR): 2022-2025
International Conference on Machine Learning (ICML): 2021-2025
Neural Information Processing Systems (NeurIPS): 2021-2024
Journal Reviewer
Transactions on Machine Learning Research (TMLR)
Transactions on Pattern Analysis and Machine Intelligence (TPAMI)
IEEE Transactions on Neural Networks and Learning Systems (TNNLS)
Beyond the Era of Transformer-based AI Foundation Models
Samsung Future Technology Research (Oct. 2024)
Designing New Effective Inference Algorithms for Large Language Models
Amazon Artificial General Intelligence (AGI) (Mar. 2024)
Ulsan National Institute of Science and Technology (UNIST) (May. 2024)
Deep Learning with Imbalanced Datasets
Samsung Electronics Data & Information Technology Center (Oct. 2021)
Multi-aspect Analysis on Data Informativeness
Summer 2021 Presentation Minnesota NLP Group (Aug. 2021)
Distribution Aligning Refinery of Pseudo-label for Imbalanced Semi-supervised Learning
NeurIPS 2020 Social ML in Korea (Dec. 2020)