Deep Object Detection

R-CNN to Faster R-CNN

  • R. Girshick, et al. "Rich feature hierarchies for accurate object detection and semantic segmentation." Proc. CVPR 2014. [pdf] [code]
  • K. He, et al. "Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition." TPAMI 2015. [pdf] [code]
  • R. Girshick. "Fast R-CNN." Proc. ICCV 2015. [pdf] [code]
  • S. Ren, et al. "Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks." Proc. NIPS 2015. [pdf] [code]
  • A. Shrivastava, et al. "Training Region-based Object Detectors with Online Hard Example Mining." Proc. CVPR 2016. [pdf] [code]
  • J. Dai, et al. "R-FCN: Object Detection via Region-based Fully Convolutional Networks." Proc. NIPS 2016. [pdf] [code]
  • J. Huang, et al. "Speed/accuracy trade-offs for modern convolutional object detectors." arXiv 2016. [pdf]
  • L. Zhang, et al. "Is Faster R-CNN Doing Well for Pedestrian Detection?" Proc. ECCV 2016. [pdf] [code] [my review]
  • K. He, et al. "Mask R-CNN." arXiv 2017. [pdf]
  • W. Liu, et al. "SSD: Single Shot MultiBox Detector." Proc. ECCV 2016. [pdf] [code]
  • J. Redmon, et al. "You Only Look Once: Unified, Real-Time Object Detection." Proc. CVPR 2016. [pdf] [code]
  • J. Redmon, et al. "YOLO9000: Better, Faster, Stronger." Proc. CVPR 2017. [pdf]
  • Z. Li, C., G. Yu, X. Zhang, Y. Deng, J. Sun, "Light-Head R-CNN: In Defense of Two-Stage Object Detector." arXiv 2017. [pdf] [my review]

Vehicle detection

  • Y. Xiang, et al. "Subcategory-aware Convolutional Neural Networks for Object Proposals and Detection." Proc. WACV 2017. [pdf]
  • Z. Cai, et al. "A Unified Multi-scale Deep Convolutional Neural Network for Fast Object Detection." Proc. ECCV 2016. [pdf] [code]
  • Y. Zhou, et al. "DAVE: A Unified Framework for Fast Vehicle Detection and Annotation." Proc. ECCV 2016. [pdf] [dataset]

Object detection in videos

  • K. Wang, et al. "Object Detection from Video Tubelets with Convolutional Neural Networks." Proc. CVPR 2016. [pdf] [code]
  • K. Wang, et al. "T-CNN: Tubelets with Convolutional Neural Networks for Object Detection from Videos." arXiv 2016. [pdf] [code]

Face detection

  • C. Zhu, et al. "CMS-RCNN: Contextual Multi-Scale Region-based CNN for Unconstrained Face Detection." arXiv 2016. [pdf]
  • J. Yu, et al. "UnitBox: An Advanced Object Detection Network" ACM MM 2016. [pdf]
  • S. Zhang, et al. "S3FD: Single Shot Scale-invariant Face Detector." ICCV 2017. [pdf] [my review]
  • M. Najibi, et al. "SSH: Single Stage Headless Face Detector." ICCV 2017. [pdf] [my review]