My name is Wan-Cyuan (Chris) Fan, and I am a Ph.D. student working with Prof. Leonid Sigal on the topics of cross-modal learning, especially on vision-language synthesis and editing, at the University of British Columbia.

I previously worked as a Research Assistant in the Vision and Learning Lab (VLL) at National Taiwan University (NTU), guided by Prof. Yu-Chiang Frank Wang during my Master's and Research Assistant journey. Additionally, I collaborated with Yen-Chun Chen, DongDong Chen, Yu Cheng, and Lu Yuan as a student intern at Microsoft Research for six months. I received my Bachelor of Science degree in Electrical Engineering from NTU in 2020, during which I served as a student research intern for a year at the Institute of Information Science, Academia Sinica, under the guidance of Prof. Tyng-Luh Liu.

My research interests include Deep Learning (DL) and Computer Vision (CV), with a focus on multi-modal learning, x-to-image synthesis, text-guided image manipulation and editing, video generation, and object detection/segmentation.

EmailLinkGitHubLinkLinkLink
IIS, Academia SinicaStudent Research InternSep. 19 - Sep. 20 
National Taiwan UniversityB.S. in EEJan. 20 
Azure Computer Vision Research at MicrosoftResearch InternMarch 22 - Sep. 22 
National Taiwan UniversityM.S. in ECEJan. 22 
Vector Institute for AI, CanadaPhD studentSep. 23 - present
University of British ColumbiaPhD student in CSSep. 23 - present

News                                                                                                    Red: academic activity; Green: internship activity


Publications     (please wait a moment for loading gif.)

M3T: Multi-Scale Memory Matching for Video Object Segmentation and Tracking

Raghav Goyal*, Wan-Cyuan Fan*, Mennatullah Siam, Leonid Sigal

Under submission

[Paper] [Project Page]


IOU-Aware Multi-Expert Cascade Network for Long-tailed Object Detection

Wan-Cyuan Fan*, Cheng-Yao Hong*, Yen-Chi Hsu, Tyng-Luh Liu

ICASSP, 2023

[Paper]


Frido: Feature Pyramid Diffusion for Complex Scene Image Synthesis

Wan-Cyuan Fan, Yen-Chun Chen, Dongdong Chen, Yu Cheng, Lu Yuan, Yu-Chiang Frank Wang

AAAI, 2023

[Paper] [Project page] [Code]


Target-free Text-guided Image Manipulation

Wan-Cyuan Fan, Cheng-Fu Yang, Chiao-An Yang, Yu-Chiang Frank Wang

AAAI, 2023

[Paper (ArXiv pre-print)] [Code (coming soon)]


Paraphrasing Is All You Need for Novel Object Captioning

Cheng-Fu Yang, Yao-Hung Hubert Tsai, Wan-Cyuan Fan, Yu-Chiang Frank Wang, Louis-Philippe Morency, Ruslan Salakhutdinov

Thirty-Sixth Annual Conference on Neural Information Processing Systems, 2022

[Paper]


Scene Graph Expansion for Semantics-Guided Image Outpainting

Chiao-An Yang, Cheng-Yo Tan, Wan-Cyuan Fan, Cheng-Fu Yang, Meng-Lin Wu, Yu-Chiang Frank Wang

IEEE International Conference on Computer Vision and Pattern Recognition (CVPR), 2022

[Paper] [Code (coming soon)]


Cross-Modal Mutual Learning for Audio-Visual Speech Recognition and Manipulation

Chih-Chun Yang, Wan-Cyuan Fan, Cheng-Fu Yang, Yu-Chiang Frank Wang

Thirty-Sixth AAAI Conference on Artificial Intelligence (AAAI), 2022

[Paper] [Code]


LayoutTransformer: Scene Layout Generation with Conceptual and Spatial Diversity

Cheng-Fu Yang*, Wan-Cyuan Fan*, Fu-En Yang, Yu-Chiang Frank Wang

IEEE International Conference on Computer Vision and Pattern Recognition (CVPR), 2021

[Paper] [Code]


Multi-expert heads of Long-tailed Instance Segmentation

Wan-Cyuan Fan, Cheng-Yao Hong, Yen-Chi Hsu, Tynh-Luh Liu

Tech. Report of IEEE European Conference on Computer Vision Workshop (ECCVW), 2020

[Paper]


Pre-prints and Miscellaneous 

ACE: Adaptive Confusion Energy for Natural World Data Distribution

Yen-Chi Hsu, Cheng-Yao Hong, Wan-Cyuan Fan, Ming-Sui Lee, Davi Geiger, Tyng-Luh Liu

under submission (Journal), 2021

[Paper (Arxiv pre-print)]


Auto-drawer: Generating and Modifying Images Continually byVisual-Relational Knowledge Graph

Wan-Cyuan Fan

Submitted to Second Workshop on Computer Vision for Fashion, Art and Design, ICCVw 2019

[Paper]


Awards

UBC 4YF Fellowship

The University of British Columbia, Canada, 2023

only 2-3 students each year in the CS department 

NSERC PGSD/CGSD Award

Natural Sciences and Engineering Research Council of Canada, 2023

Honorable Master Thesis Award

Chinese Image Processing and Pattern Recognition Society, Taiwan, 2022

only 11 recipients in Taiwan

IPPR web 

Research Scholarship

Novatek Foundation, Taiwan, 2021

only 3 recipients in NTU EECS (500+ students)

Novatek 

7th place

Large Vocabulary Instance Segmentation Challenge, ECCV workshop, 2020

Top 1 among non-industrial teams

CVF Paper     LVIS leaderboard (team: Argus)      Eval.AI

Professional Activities


Reviewer

AAAI 2021, CVPR 2021, ICCV 2021, CVPR 2022, AAAI 2022, NeurIPS 2022