My name is Wan-Cyuan Fan (Chris Fan), and I am a Ph.D. student working with Prof. Leonid Sigal on the topics of multimodal learning at the University of British Columbia.

I previously worked as a Research Assistant in the Vision and Learning Lab (VLL) at National Taiwan University (NTU), guided by Prof. Yu-Chiang Frank Wang during my Master's and Research Assistant journey. Additionally, I collaborated with Yen-Chun Chen, DongDong Chen, Yu Cheng, and Lu Yuan as a student intern at Microsoft Research for six months. I received my Bachelor of Science degree in Electrical Engineering from NTU in 2020, during which I served as a student research intern for a year at the Institute of Information Science, Academia Sinica, under the guidance of Prof. Tyng-Luh Liu.

My research focuses on two interconnected areas: (1) the fundamental study of multimodal large language models (MLLMs), including customization, synthetic data generation for training, benchmarking, and training strategies; and (2) the application of MLLMs in agentic frameworks, specifically designing LLM-based agents to more effectively solve general vision-and-language tasks and exploring collaborative dynamics within multi-agent systems.