Ph.D. Student, ViLab (Visual Intelligence Lab), Department of Artificial Intelligence, College of Computing, Yonsei University. Research Advisor: Prof. Yongjung Uh
Co-founder of CineLingo
Running VeritasiumKorean (250k subscribers), 3Blue1Brown한국어 (100k subscribers)
Email: kwonmingi@yonsei.ac.kr (Personal, Research) / mingikwon@giverny.ai (Business)
CV / Google Scholar / Semantic Scholar / GitHub / LinkedIn / YouTube1 / YouTube2
As a Ph.D. student
I am currently a Ph.D. student at the Dept. of Artificial Intelligence, Yonsei University. My research interests lie in generative AI, especially image/video generation and editing. I have developed diffusion-based image editing methods. Recently, my research focus has extended to video generation and editing, including talking-head problems.
Previously, I completed an internship at Adobe Research (May-Dec 2023), advised by SeoungWug Oh, JoonYoung Lee, Yang Zhou, Difan Liu, Feng Liu, and other great team members.
As a YouTuber
I currently run two YouTube channels, Veritasium Korean and 3Blue1Brown Korean, which focus on math and science stories. Since 2021, I have been officially collaborating with the original Veritasium channel. The channels currently have nearly 250k and 100k subscribers, respectively.
As a Co-founder
I am currently a co-founder of CineLingo, a startup launched in 2024 that provides AI services for translating videos. It builds on my years of experience as a translation YouTuber. Since 2022, I have been researching and working on this as a side project with Jaeseok Jeong, who has greatly supported me.
❤️ [2025] "JAM-Flow: Joint Audio-Motion Synthesis with Flow Matching" was arxived.
🧡 [2025] "TTS-CtrlNet: Time varying emotion aligned text-to-speech generation with controlnet" will be arxived soon.
🧡 [2025] "Syncphony: Synchronized Audio-to-Video Generation with Diffusion Transformers" will be arxived soon.
🧡 [2025] "Balanced Conic Rectified Flow" will be arxived soon.
❤️ [2025] "TCFG: Tangential Damping Classifier-free Guidance" was accepted in CVPR2025!
❤️ [2024] "Harnessing Text-to-Image Models for Video Generation" was accepted in ECCV2024!
🧡 [2024] "Motion Customization Video Diffusion Model" was accepted in ECCV2024!
❤️ [2024] "Attribute Based Interpretable Evaluation Metrics for Generative Models" was accepted in ICML2024!
🧡 [2024] "Plug-and-Play Diffusion Distillation" was accepted in CVPR2024!
❤️ [2024] "Training-free Style Transfer Emerges from h-space in Diffusion models" was accepted in WACV2024!
❤️ [2023] "Understanding the Latent Space of Diffusion Models through the Lens of Riemannian Geometry" was accepted in NeurIPS2023!
❤️ [2023] "Unsupervised Discovery of Semantic Latent Directions in Diffusion Models" was arxived.
❤️ [2023] "Diffusion Models already have a Semantic Latent Space" was accepted in ICLR2023 as a notable-top-25%!
🧡 [2022] "FurryGAN: High Quality Foreground-aware Image Synthesis" was accepted in ECCV2022!
❤️ : First author or co-first author 🧡 : My favorite co-working papers!
Mar 2021 - Present, Graduate Student Researcher, ViLab (Visual Intelligence Lab), Research Advisor: Prof. Yongjung Uh
June 2025 - September 2025, Atmanity research internship, working with Yi-Hsuan Tsai, Yumin Suh, and other great team members.
May 2023 - Dec 2023, Adobe summer internship (extended as a full‐time internship), advised by SeoungWug Oh, JoonYoung Lee, Yang Zhou, Difan Liu, Feng Liu, and other great team members.
Won the Intra-University Merit Paper Award
Gold prize in Samsung Humantech Paper Award, 2022 (1st prize in the category ‐ Computer Science; $8000 (Authors) + $4000 (Corresponding Author))
Won the Best Graduation Project Award across my entire undergraduate university - Received the University President's Award
Received the Dean's Award from the College of Engineering at Korea University
Gold Medal at the National Research Council of Thailand (Patent Number: 4‐2019‐084383‐5)
P12986-US: MOTION CUSTOMIZATION FOR DIGITAL VIDEOS
8828-361-NP1_P12631-US: VIDEO GENERATION USING FRAME-WISE TOKEN EMBEDDINGS
Reviewer for CVPR, ICLR, NeurIPS, ECCV, AAAI