Welcome

Xiaoliang Dai

About Me

I am a research scientist working at Facebook since Sept. 2019. I received the B.S. degree from Peking University in 2014, and the Ph. D. degree from Princeton University in July 2019.

Education

2014 - 2019, PhD, Princeton University, NJ, U.S.
- Advisor: Niraj K. Jha Thesis title: Synthesis of Efficient Neural Networks
2010 - 2014, Bachelor of Science, Peking University, Beijing, China

Work Experience

09/2019 - current, Research Scientist, Mobile Computer Vision, Facebook, CA, U.S.
05/2018 - 01/2019, Research Intern, Mobile Computer Vision, Facebook, CA, U.S.
05/2016 - 08/2016, DSP Software Engineer Intern, Tensilica, San Jose, CA, U.S.

Research Interests

Generative AI
Efficient deep neural network
Data efficient learning

Selected Publications

J. Tian, X. Dai, C. Ma, Z. He, Y. Liu, Z. Kira, "Trainable Projected Gradient Method for Robust Fine-tuning", CVPR 2023.
J. Hou, X. Dai, Z. He, A. Dai, M. Nießner, "Mask3D: Pre-training 2D Vision Transformers by Learning Masked 3D Priors", CVPR 2023.
H. You, Y. Xiong, X. Dai, B. Wu, P. Zhang, H. Fan, P Vajda, Y. Lin, "Castling-ViT: Compressing Self-Attention via Switching Towards Linear-Angular Attention During Vision Transformer Inference", CVPR 2023.
F. Liang, B. Wu, X. Dai, K. Li, Y. Zhao, H. Zhang, P. Zhang, P. Vajda, and D. Marculescu, "Open-Vocabulary Semantic Segmentation with Mask-adapted CLIP", CVPR 2023.
D. Bolya, C.-Y. Fu, X. Dai, P. Zhang, C. Feichtenhofer, J. Hoffman, "Token Merging: Your ViT But Faster", ICLR 2023.
T. Zhang, D. Cheng, Y. He, Z. Chen, X. Dai, L. Xiong, F. Yan, H. Li, Y. Chen, W. Wen, "NASRec: Weight Sharing Neural Architecture Search for Recommender Systems", WWW 2023.
Y. Liu, C. Ma, X. Dai, J. Tian, P. Vajda, Z. He, Z. Kira, “Open-Set Semi-Supervised Object Detection”, ECCV, 2022.
D. Bolya, C.-Y. Fu, X. Dai, P. Zhang, J. Hoffman, "Hydra attention: Efficient attention with many heads", ECCV CADL Workshop (Best Paper Award), 2022.
Y. Li, X. Dai, C. Ma, Y. Liu, K. Chen, B. Wu, Z. He, K. Kitani, P. Vajda"Cross-Domain Adaptive Teacher for Object Detection", CVPR, 2022.
X. Dai, A. Wan, P. Zhang, B. Wu, Z. He, Z. Wei, K. Chen, Y. Tian, M. Yu, P. Vajda, and J. Gonzalez, “FBNetV3: Joint Architecture-Recipe Search using Neural Acquisition Function,” CVPR, 2021.
Z. Yan, X. Dai, P. Zhang, Y. Tian, B. Wu, M. Feiszli, “FP-NAS: Fast Probabilistic Neural Architecture Search”, CVPR, 2021.
B. Wu, C. Xu, X. Dai, A. Wan, P. Zhang, Z. Yan, M. Tomizuka, K. Keutzer, P. Vajda, “Visual Transformers: Where Do Transformers Really Belong in Vision Models?” ICCV, 2021.
A. Wan, X. Dai, P. Zhang, Z. He, Y. Tian, S. Xie, B. Wu, M. Yu, T. Xu, K. Chen, P. Vajda, and J. Gonzatez, “FBNetV2: Differentiable Neural Architecture Search for Spatial and Channel Dimensions,” CVPR, 2020.
X. Dai, P. Zhang, B. Wu, H. Yin, F. Sun, Y. Wang, M. Dukhan, Y. Hu, Y. Wu, Y. Jia, P. Vajda, M. Uyttendaele, and N. K. Jha, “ChamNet: Towards Efficient Network Design through Platform-Aware Model Adaptation,” CVPR, 2019.
B. Wu, X. Dai, P. Zhang, Y. Wang, Y., F. Sun, Y. Wu, Y. Tian, P. Vajda, Y. Jia, and K. Keutzer, “FBNet: Hardware-Aware Efficient ConvNet Design via Differentiable Neural Architecture Search,” CVPR, 2019.
X. Dai, H. Yin, and N. K. Jha, “NeST: A Neural Network Synthesis Tool Based on a Grow-and-prune Paradigm,” IEEE Trans. on Computers, 2019.
X. Dai, H. Yin, and N. K. Jha, “Grow and Prune Compact, Fast, and Accurate LSTMs,” IEEE Trans. on Computers, 2019.
X. Dai, H. Yin, and N. K. Jha, “Incremental Learning Using a Grow-and-Prune Paradigm with Efficient Neural Networks,” IEEE Trans. on Emerging Topics in Computing, 2021.
A. Mosenia, X. Dai, P. Mittal and N. K. Jha, “PinMe: Tracking a Smartphone User around the World,” IEEE Trans. on Multi-scale Computing Syst., Aug. 2017.

Cross-Domain Adaptive Teacher for Object Detection

Yu-Jhe Li, Xiaoliang Dai, Chih-Yao Ma, Yen-Cheng Liu, Kan Chen, Bichen Wu, Zijian He, Kris Kitani, Peter Vajda

IEEE Conf. CVPR, 2022

paper code

FBNetV3: Joint Architecture-Recipe Search using Predictor Pretraining

Xiaoliang Dai, Alvin Wan, Peizhao Zhang, Bichen Wu, Zijian He, Zhen Wei, Kan Chen, Yuandong Tian, Matthew Yu, Peter Vajda, Joseph E. Gonzalez

Proc. IEEE Conf. CVPR, 2021

paper

FP-NAS: Fast Probabilistic Neural Architecture Search

Zhicheng Yan, Xiaoliang Dai, Peizhao Zhang, Yuandong Tian, Bichen Wu, Matt Feiszli

Proc. IEEE Conf. CVPR, 2021

paper

Visual Transformers: Token-based Image Representation and Processing for Computer Vision

Bichen Wu , Chenfeng Xu , Xiaoliang Dai , Alvin Wan , Peizhao Zhang , Masayoshi Tomizuka , Kurt Keutzer , Peter Vajda

Proc. IEEE Conf. ICCV, 2021

paper

FBNetV2: Differentiable Neural Architecture Search for Spatial and Channel Dimensions

Alvin Wan, Xiaoliang Dai, Peizhao Zhang, Zijian He, Yuandong Tian, Saining Xie, Bichen Wu, Matthew Yu, Tao Xu, Kan Chen, Peter Vajda, Joseph E. Gonzalez

Proc. IEEE Conf. CVPR, 2020

paper / code

ChamNet: Towards Efficient Network Design through Platform-Aware Model Adaptation

Xiaoliang Dai, Peizhao Zhang, Bichen Wu, Hongxu Yin, Fei Sun, Yanghan Wang, Marat Dukhan, Yunqing Hu, Yiming Wu, Yangqing Jia, Peter Vajda, Matt Uyttendaele, and Niraj K. Jha,

Proc. IEEE Conf. CVPR, 2019

paper / code