Publications

FBNetV5: Neural architecture search for multiple tasks in one run: B Wu, C Li, H Zhang, X Dai, P Zhang, M Yu, J Wang, Y Lin, P Vajda, 2021

[Full Text]

Image2Point: 3D point-cloud understanding with pretrained 2D convnets: Chenfeng Xu, Shijia Yang, Bohan Zhai, Bichen Wu, Xiangyu Yue, Wei Zhan, Peter Vajda, Kurt Keutzer, Masayoshi Tomizuka, 2021

[Full Text]

Unbiased teacher for semi-supervised object detection: YC Liu, CY Ma, Z He, CW Kuo, K Chen, P Zhang, B Wu, Z Kira, P Vajda, 2021

[Full Text]

FBNetV3: Joint architecture-recipe search using predictor pretraining: Xiaoliang Dai, Alvin Wan, Peizhao Zhang, Bichen Wu, Zijian He, Zhen Wei, Kan Chen, Yuandong Tian, Matthew Yu, Peter Vajda, Joseph E Gonzalez, CVPR 2021

[Full Text]

SqueezeSegV3: Spatially-adaptive convolution for efficient point-cloud segmentation: C Xu, B Wu, Z Wang, W Zhan, P Vajda, K Keutzer, M Tomizuka, ECCV 2020

[Full Text]

One shot 3D photography: Johannes Kopf, Kevin Matzen, Suhib Alsisan, Ocean Quigley, Francis Ge, Yangming Chong, Josh Patterson, Jan-Michael Frahm, Shu Wu, Matthew Yu, Peizhao Zhang, Zijian He, Peter Vajda, Ayush Saraf, Michael Cohen, ACM Transactions on Graphics (TOG), 2020

[Full Text]

Visual transformers: Token-based image representation and processing for computer vision: Bichen Wu, Chenfeng Xu, Xiaoliang Dai, Alvin Wan, Peizhao Zhang, Zhicheng Yan, Masayoshi Tomizuka, Joseph Gonzalez, Kurt Keutzer, Peter Vajda, 2020

[Full Text]

FBNetV2: Differentiable neural architecture search for spatial and channel dimensions: Alvin Wan, Xiaoliang Dai, Peizhao Zhang, Zijian He, Yuandong Tian, Saining Xie, Bichen Wu, Matthew Yu, Tao Xu, Kan Chen, Peter Vajda, Joseph E Gonzalez, CVPR, 2020

[Full Text]

FBNet: Hardware-aware efficient convnet design via differentiable neural architecture search; Bichen Wu, Xiaoliang Dai, Peizhao Zhang, Yanghan Wang, Fei Sun, Yiming Wu, Yuandong Tian, Peter Vajda, Yangqing Jia, Kurt Keutzer, CVPR oral 2019

[Full Text]

ChamNet: Towards efficient network design through platform-aware model adaptation; Xiaoliang Dai, Peizhao Zhang, Bichen Wu, Hongxu Yin, Fei Sun, Yanghan Wang, Marat Dukhan, Yunqing Hu, Yiming Wu, Yangqing Jia, Peter Vajda, Matt Uyttendaele, Niraj K Jha, CVPR 2019

[Full Text]

Efficient Segmentation: Learning Downsampling Near Semantic Boundaries; Dmitrii Marin, Zijian He, Peter Vajda, Priyam Chatterjee, Sam Tsai, Fei Yang, Yuri Boykov, ICCV 2019

[Full Text]

Machine Learning at Facebook: Understanding Inference at the Edge; Carole-Jean Wu, David Brooks, Kevin Chen, Douglas Chen, Sy Choudhury, Marat Dukhan, Kim Hazelwood, Eldad Isaac, Yangqing Jia, Bill Jia, Tommer Leyvand, Hao Lu, Yang Lu, Lin Qiao, Brandon Reagen, Joe Spisak, Fei Sun, Andrew Tulloch, Peter Vajda, Xiaodong Wang, Yanghan Wang, Bram Wasti, Yiming Wu, Ran Xian, Sungjoo Yoo, Peizhao Zhang

[Full Text]

Precision Highway for Ultra Low-Precision Quantization; E Park, D Kim, S Yoo, P Vajda

[Full Text]

Mixed Precision Quantization of ConvNets via Differentiable Neural Architecture Search; B Wu, Y Wang, P Zhang, Y Tian, P Vajda, K Keutzer

[Full Text]

"Value-aware Quantization for Training and Inference of Neural Networks", ECCV, 2018.

[Full Text]

Song Han, Huizi Mao, Enhao Gong, Shijian Tang, William J. Dally, Jeff Pool, John Tran, Bryan Catanzaro, Sharan Narang, Erich Elsen, Peter Vajda, Manohar Paluri; "DSD: dense-sparse-dense training for deep neural networks", ICLR, 2017.

[Full Text]

A. Araujo, D. Chen, P. Vajda, and B. Girod; "Real-time query-by-image video search system", ACM Multimedia (MM), November 2014.

[Full Text] [Video] [Demo]

M. Yu, P. Vajda, D. Chen, M. Daneshi, S. Tsai, A. Araujo, H. Chen, and B. Girod; "EigenNews: A personalized news video delivery platform", ACM Multimedia (MM), October 2013.

[Full Text] [Demo]

D. Chen, P. Vajda, S. Tsai, M. Daneshi, M. Yu, H. Chen, A. Araujo, B. Girod; "Analysis of Visual Similarity in News Videos with Robust and Memory-Efficient Image Retrieval", IEEE Workshop on Media Fragment Creation and Remixing (MMIX), July 2013.

[Full Text] [Slides]

I. Ivanov, P. Vajda, J. S. Lee and T. Ebrahimi. In Tags We Trust: Trust modeling in social tagging of multimedia content, in IEEE Signal Processing Magazine, Special Issue on Signal and Information Processing for Social Learning and Networking, vol. 29, num. 2, p. 98-107, 2012.

Detailed record - Full Text - View at publisher

P. Vajda, T. Ebrahimi (Dir.). Object Duplicate Detection. EPFL, Lausanne, 2011.

Detailed record - View at publisher

L. Goldmann, T. Adamek, P. Vajda, M. Karaman and R. Mörzinger et al. Towards Fully Automatic Image Segmentation Evaluation. Advanced Concepts for Intelligent Vision Systems (ACIVS), Juan-les-Pins, Lecture Notes in Computer Science, 2008.

Detailed record - Full Text - View at publisher