Se Jin Park
I'm a final-year Ph.D. student at KAIST, Integrated Vision and Language Lab (IVLLab), advised by Professor Yong Man Ro. My research focuses on advancing multimodal human-AI interactions—integrating audio, vision, and text—based on Large Language Models. Specifically, I have worked on multimodal integration, unbounded generation, generation fidelity, and full-duplex behaviors for realistic and engaging human-AI dialogues.
Email: jinny960812@kaist.ac.kr
CV | Google Scholar | LinkedIn