Jaeyoung Do
Assistant Professor
Department of Electrical and Computer Engineering
Seoul National University
Email: jaeyoung.do@snu.ac.kr
Short Bio: Jaeyoung Do currently serves as an assistant professor in the Department of Electrical and Computer Engineering at Seoul National University (SNU). Prior to joining SNU, he worked as a Senior Applied Scientist at Amazon Alexa AI from 2021 to 2023, and as a Senior Researcher at Microsoft Research (MSR) from 2016 to 2021. Before that, he held positions as a Senior Scientist at Microsoft Jim Gray Systems Lab (GSL) from 2014 to 2016, and as a Research Engineer at Microsoft Database Lab from 2012 to 2014. He received his Ph.D. in Computer Science from University of Wisconsin-Madison in 2012 under the supervision of Professor Jigensh M. Patel. He obtained his Master's degree from the same university in 2009 and earned his Bachelor's degree from Korea Advanced Institute of Science and Technology (KAIST).
I am currently looking for motivated undergraduate and graduate students interested in AI-powered big data management systems, generative AI based on Large Language Models (LLMs) for healthcare, large-scale deep learning training and inference, natural language/vision processing through multi-modal AI, algorithm-system co-design for ML/AI applications, and high-performance large-scale AI data analysis and processing using next-generation memory and cutting-edge hardware technologies.
If you are interested in working with me, please feel free to email me! 😀
Publications
[C]: Conference, [J]: Journal, [P]: Patent
2024
[C] AscleAI: A LLM-based clinical note managmenet system for enhancing clinician productivity
Jiyeon Han, Jimin Park, Jinyoung Huh, Uran Oh, Daehee Kim, Jaeyoung Do
Proceedings of the CHI Conference on Human Factors in Computing Systems (CHI), 2024
2023
[C] Scalable and Safe Remediation of Defective Actions in Self-Learning Conversational Systems
Sarthak Ahuja, Mohammad Kachuee, Fatemeh Sheikholeslami, Weiqing Liu, Jaeyoung Do
Proceedings of the 61st Annual Meeting Of The Association For Computational Linguistics (ACL), 2023[C] Large-scale lifelong learning of in-context instructions and how to tackle it
Jisoo Mok, Jaeyoung Do, Sungjin Lee, Tara Taghavi, Seunghak Yu, Sungroh Yoon
Proceedings of the 61st Annual Meeting Of The Association For Computational Linguistics (ACL), 2023[C] Weakly supervised referring image segmentation with intra-chunk and inter-chunk consistency
Jungbeom Lee, Sungjin Lee, Jinseok Nam, Seunghak Yu, Jaeyoung Do, Tara Taghavi
Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2023[C] Grounding counterfactual explanation of image classifiers to textual concept space
Siwon Kim, Jinoh Oh, Sungjin Lee, Seunghak Yu, Jaeyoung Do, Tara Taghavi
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023[j] Extending and programming the NVMe I/O determinism interface for Flash Arrays
Huaicheng Li, Martin L Putra, Ronald Shi, Fadhil I Kurnia, Xing Lin, Jaeyoung Do, Achmad Imam Kistijantoro, Gregory R Ganger, Haryadi S Gunawi
ACM Transactions on Storage (ToS), 2023[j] Accelerating Large-Scale Graph-Based Nearest Neighbor Search on a Computational Storage Platform
Ji-Hoon Kim, Yeo-Reum Park, Jaeyoung Do, Soo-Young Ji, Joo-Young Kim
IEEE Transactions on Computers (ToC), 2023[P] Storage device and memory system
Soo Young Ji, Joo Young Kim, Ji Hoon Kim, Jae Young Do, Yeo Reum Park
US Patent (No: 17696586), 2023
2022
[C] Debiasing neighbor aggregation for graph neural network in recommender systems
Minseok Kim, Jinoh Oh, Jaeyoung Do, Sungjin Lee
Proceedings of the 31st ACM International Conference on Information & Knowledge Management (CIKM), 2022[C] A Dual-Mode Similarity Search Accelerator based on Embedding Compression for Online Cross-Modal Image-Text Retrieval
Yeo-Reum Park, Ji-Hoon Kim, Jaeyoung Do, Joo-Young Kim
IEEE 30th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM), 2022
2021
[C] Computational storage: Where are we today?
Antonio Barbalace, Jaeyoung Do
11th Conference on Innovative Data Systems Research (CIDR), 2021[C] Programming an SSD controller to support batched writes for variable-size pages
Jaeyoung Do, Chen Luo, David Lomet
IEEE 37th International Conference on Data Engineering (ICDE), 2021[C] Accelerating large-scale nearest neighbor search with computational storage device
Ji-Hoon Kim, Yeo-Reum Park, Jaeyoung Do, Soo-Young Ji, Joo-Young Kim
IEEE 29th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM), 2021[J] Better database cost/performance via batched I/O on programmable SSD
Jaeyoung Do, Ivan Luiz Picoli, David Lomet, Philippe Bonnet
The VLDB Journal (VLDB J.), 2021
2020
[C] Lessons learned from the early performance evaluation of intel optane dc persistent memory in dbms
Yinjun Wu, Kwanghyun Park, Rathijit Sen, Brian Kroth, Jaeyoung Do
Proceedings of the 16th International Workshop on Data Management on New Hardware (DaMoN), 2020[C] ALEX: an updatable adaptive learned index
Jialin Ding, Umar Farooq Minhas, Jia Yu, Chi Wang, Jaeyoung Do, Yinan Li, Hantian Zhang, Badrish Chandramouli, Johannes Gehrke, Donald Kossmann, David Lomet, Tim Kraska Proceedings of the ACM SIGMOD International Conference on Management of Data (SIGMOD), 2020[J] Cost-effective, energy-efficient, and scalable storage computing for large-scale AI applications
Jaeyoung Do, Victor C Ferreira, Hossein Bobarshad, Mahdi Torabzadehkashi, Siavash Rezaei, Ali Heydarigorji, Diego Souza, Brunno F Goldstein, Leandro Santiago, Min Soo Kim, Priscila MV Lima, Felipe MG França, Vladimir Alves
ACM Transactions on Storage (TOS), 2020[P] Application cache replication to secondary application(s)
Nikhil Teletia, Jae Young Do, Kwanghyun Park, Jignesh M Patel
US Patent (No: 9684596), 2020
2019
[C] Improving CPU I/O performance via SSD controller FTL support for batched writes
Jaeyoung Do, David Lomet, Ivan Luiz Picoli
Proceedings of the 15th International Workshop on Data Management on New Hardware (DaMoN), 2019[J] Programmable solid-state storage in future cloud datacenters
Jaeyoung Do, Sudipta Sengupta, Steven Swanson
Communications of the ACM (CACM), 2019[P] Replicating a primary application cache within a secondary application cache
Nikhil Teletia, Jae Young Do, Kwanghyun Park, Jignesh M Patel
US Patent (No: 10204048), 2019[P] Application-driven storage systems for a computing system
Sudipta Sengupta, Jae Young Do
US Patent (No: 10289568), 2019
2018
[P] Automatic recovery of application cache warmth
Nikhil Teletia, Jae Young Do, Kwanghyun Park, Jignesh M Patel
US Patent (No: 10114765), 2018
2016
[C] Aggressive buffer pool warm-up after restart in SQL Server
Kwanghyun Park, Jaeyoung Do, Nikhil Teletia, Jignesh M Patel
IEEE 32nd International Conference on Data Engineering Workshops (ICDEW), 2016
2014
[J] Query Processing on Smart SSDs.
Kwanghyun Park, Yang-Suk Kee, Jignesh M Patel, Jaeyoung Do, Chanik Park, David J Dewitt
IEEE Data Engeering Bulletin (Data Eng. Bull.), 2014
2013
[C] Query processing on smart ssds: Opportunities and challenges
Jaeyoung Do, Yang-Suk Kee, Jignesh M Patel, Chanik Park, Kwanghyun Park, David J DeWitt
Proceedings of the ACM SIGMOD International Conference on Management of Data (SIGMOD), 2013[C] Fast peak-to-peak behavior with SSD buffer pool
Jaeyoung Do, Donghui Zhang, Jignesh M Patel, David J DeWitt
IEEE 29th International Conference on Data Engineering (ICDE), 2013
2011
[C] Turbocharging DBMS buffer pool using SSDs
Jaeyoung Do, Donghui Zhang, Jignesh M Patel, David J DeWitt, Jeffrey F Naughton, Alan Halverson
Proceedings of the ACM SIGMOD International Conference on Management of data (SIGMOD), 2011
2009
[C] Join processing for flash SSDs: remembering past lessons
Jaeyoung Do, Jignesh M Patel
Proceedings of the Fifth International Workshop on Data Management on New Hardware (DaMoN), 2009[J] Fast statistical alignment
Robert K Bradley, Adam Roberts, Michael Smoot, Sudeep Juvekar, Jaeyoung Do, Colin Dewey, Ian Holmes, Lior Pachter
PLoS computational biology (PLoS Comput. Biol.), 2009
Honors and Awards
UW-Madison Computer Science Scholarship, 2010
Best Paper Award at DaMoN, 2009
Samsung Scholarship for Graduate Study, 2007-2012
KAIST Undergraduate Research Fellowship, 2006
Ministry of Information and Communication Scholarship, 2004
Korea Science and Engineering Foundation Scholarship, 2003-2007
Teaching
Spring 2023: Large Language Models - Applications and Use Cases
Student Advising
Yooju Shin (Ph.D. Student, KAIST) - Intern at Amazon Alexa AI, 2023
Dehong Xu (Ph.D. Student, University of California, Los Angeles) - Intern at Amazon Alexa AI, 2023
Siwon Lee (Ph.D. Student, Seoul National University) - Intern at Amazon Alexa AI, 2022
Jisoo Mok (Ph.D. Student, Seoul National University) - Intern at Amazon Alexa AI, 2022
Jungbeom Lee (Ph.D. Student, Seoul National University) - Intern at Amazon Alexa AI, 2022
Ting Wei Wu (Ph.D. Student, Georgia Tech) - Intern at Amazon Alexa AI, 2022
Minseok Kim (Ph.D. Student, KAIST) - Intern at Amazon AI, 2021
Minsoo Kim (Ph.D. Student, University of California - Irvine) - Intern at Microsoft Research, 2020
Huaicheng Li (Ph.D. Student, University of Chicago) - Intern at Microsoft Research, 2019
Chen Luo (Ph.D. Student, University of California - Irvine) - Intern at Microsoft Research, 2019
Ivan Picoli (Ph.D. Student, IT University of Copenhagen) - Intern at Microsoft Research, 2018
Kwanghyun Park (Ph.D. Student, UW-Madison) - Research Assistant at Microsoft GSL, 2015