Chia-Wen Kuo (郭佳文)
Research Focus: Vision and Language; Knowledge Augmentation; Semi-Supervised Learning; Self-Supervised Learning; Zero-Shot Learning
Contact: albert.cwkuo[AT]gatech.edu
Chia-Wen Kuo (郭佳文) has successfully completed his Ph.D. at Georgia Tech and is now embarking on a new journey as a research scientist at ByteDance. He joined the RobotIcs Perception and Learning (RIPL) lab in 2017, led by Dr. Zsolt Kira. During his Ph.D., Chia-Wen focused on deep learning in the field of vision-and-language (VL) research. His Ph.D. dissertation was centered on developing efficient and effective ways to integrate external knowledge into VL models and tasks.
Prior to joining Georgia Tech, he worked with Dr. Yu-Chiang Frank Wang on 3D shape generation and pose estimation. Before that, he earned his Master's and Bachelor's degrees in Electrical Engineering at National Taiwan University.
News
(02/26/2024) Research scientist at ByteDance
(12/15/2023) 👨🎓 Graduate from Georgia Tech
(11/29/2023) 🎓 Pass Ph.D. Defense
(07/10/2023) Code release for HAAV [GitHub]
(06/18-22/2023) Attend CVPR-23 in Vancouver
(04/18/2023) 🎓 Pass Ph.D. Proposal Exam
(02/27/2023) 📜 HAAV paper accepted in CVPR-23
Publications
Structure-Encoding Auxiliary Tasks for Improved Visual Representation in Vision-and-Language Navigation
[WACV 2023] Chia-Wen Kuo, Chih-Yao Ma, Judy Hoffman, Zsolt Kira
We propose SEA to improve the visual representation through a series of self-supervised tasks for vision-and-language navigation.
Unbiased Teacher for Semi-Supervised Object Detection
[ICLR 2021] Yen-Cheng Liu, Chih-Yao Ma, Zijian He, Chia-Wen Kuo, Kan Chen, Peizhao Zhang, Bichen Wu, Zsolt Kira, Peter Vajda
We propose Unbiased Teacher using focal loss and teacher-student training to reduce the bias in pseudo-labeling for semi-supervised object detection.
Data-Efficient Graph Embedding Learning for PCB Component Detection
[WACV 2019] Chia-Wen Kuo, Jacob D. Ashmore, David Huggins, Zsolt Kira
We improve few-shot PCB component detection by leveraging the similarity relationship between detected components and component prototypes via graph neural network.
Research Experiences
Aug '22 - Aug '23
Microsoft Research
Role: Research intern
Advisor: Chunyuan Li; Collaborator: Jianwei Yang
Project: Knowledge augmentation for large-scale pre-trained vision-and-language models.
May '21 - Aug '21
Rekognition team @ Amazon AWS AI
Role: Applied scientist intern
Advisor: Yuting Zhang
Project: Leveraging cross-modal pre-trained models for vision-and-language tasks.
May '20 - Aug '20
Computer Vision team @ Facebook AI & Applied Research
Role: Research intern
Advisor: Zeki Yalniz
Project: Learned data augmentation for self-supervised learning.
Jan '17 – Aug '17
Research Center for Information Technology Innovation @ Academia Sinica
Role: Research assistant
Advisor: Yu-Chiang Frank Wang
Project: 3D object reconstruction and pose estimation via generative models.
Education
Aug '17 - Dec '23
Ph.D. @ Georgia Institute of Technology (Atlanta, USA)
Major: Robotics, working on computer vision and deep learning
Advisor: Dr. Zsolt Kira
Research interests: vision and language; knowledge augmentation; learning with less supervision.
Sep '13 - Jun '15
M.S. @ National Taiwan University (Taipei, Taiwan)
Major: Electrical Engineering
Advisor: Prof. Ren C. Luo
Thesis title: Robot Integrated 3D Object Recognition and Fetching System for Factory Automation
Sep '09 - Jun '13
B.S. @ National Taiwan University (Taipei, Taiwan)
Major: Electrical Engineering