Luo Z, Mishra A, Achkar A, Eichel J, Li S-Z, Jodoin P-M, “Non-Local Deep Features for Salient Object Detection”, IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2017). [pdf]
Saliency detection aims to highlight the most relevant objects in an image. Conventional methods struggle whenever salient objects appear on top of a cluttered background, while deep neural networks suffer from excess complexity and slow evaluation speeds. In this paper, we propose a simplified convolutional neural network that combines local and global information through a multi-resolution 4×5 grid structure. Instead of enforcing spatial coherence with a CRF or super-pixels, as is usually the case, we implement a loss function inspired by the Mumford-Shah functional which penalizes errors on the boundary. We trained our model on the MSRA-B dataset and tested it on six saliency benchmark datasets. Results show that our method is on par with the state of the art while reducing computation time by a factor of 18 to 100, enabling near real-time, high-performance saliency detection.
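As a rough illustration of the boundary-penalizing idea described above, the sketch below combines a pixel-wise cross-entropy term with an IoU-style overlap term computed on boundary maps. It is a minimal PyTorch sketch under stated assumptions, not the paper's implementation: the boundaries are assumed to be extracted with fixed Sobel filters, the overlap is a soft Dice/IoU ratio, and the weighting `lam` is a hypothetical balancing factor.

```python
import torch
import torch.nn.functional as F

def soft_boundary(mask):
    """Approximate object boundaries as the gradient magnitude of a
    (soft) saliency map, computed with fixed Sobel filters (assumption)."""
    sobel_x = torch.tensor([[-1., 0., 1.], [-2., 0., 2.], [-1., 0., 1.]],
                           device=mask.device).view(1, 1, 3, 3)
    sobel_y = sobel_x.transpose(2, 3)
    gx = F.conv2d(mask, sobel_x, padding=1)
    gy = F.conv2d(mask, sobel_y, padding=1)
    return torch.sqrt(gx ** 2 + gy ** 2 + 1e-8)

def boundary_aware_loss(pred, target, lam=1.0):
    """Cross-entropy on the saliency map plus an IoU-style penalty on the
    boundaries of the predicted and ground-truth maps.

    pred, target: (N, 1, H, W) tensors in [0, 1]; `lam` is a hypothetical
    weight, not a value taken from the paper.
    """
    bce = F.binary_cross_entropy(pred, target)
    bp = soft_boundary(pred)    # boundary map of the prediction
    bg = soft_boundary(target)  # boundary map of the ground truth
    inter = (bp * bg).sum(dim=(1, 2, 3))
    union = bp.sum(dim=(1, 2, 3)) + bg.sum(dim=(1, 2, 3))
    iou_loss = 1.0 - (2.0 * inter + 1e-8) / (union + 1e-8)  # soft 1 - 2|A∩B|/(|A|+|B|)
    return bce + lam * iou_loss.mean()
```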
Figure 1. Architecture of our 4×5 grid-CNN network for salient object detection.
Figure 2. Saliency maps produced by the GS, MR, wCtr*, LEGS, BSCA, MDF, MC and DCL methods compared to our NLDF method. The NLDF maps provide clear salient regions and exhibit good uniformity compared to the saliency maps from the other deep learning methods (LEGS, MC, MDF and DCL). Our method is also more robust to background clutter than the non-deep-learning methods (GS, MR, wCtr* and BSCA).
Table 2. Quantitative performance of our model on six benchmark datasets compared with the GS, MR, wCtr*, LEGS, BSCA, MDF, MC and DCL models. LEGS, MDF, MC and DCL are deep learning methods; the others are not. The F and MAE metrics are defined in the text.
Figure 3. Precision-recall curves for our model compared to GS, MR, wCtr*, LEGS, BSCA, MDF, MC and DCL, evaluated on the MSRA-B, HKU-IS, DUT-OMRON, PASCAL-S, ECSSD and SOD benchmark datasets. Our NLDF model delivers state-of-the-art performance on all six datasets.