BlazeFace

Sub-millisecond Neural Face Detection on Mobile GPUs

Abstract

We present BlazeFace, a lightweight and well-performing face detector tailored for mobile GPU inference. It runs at a speed of 200–1000+ FPS on flagship devices. This super-realtime performance enables it to be applied to any augmented reality pipeline that requires an accurate facial region of interest as an input for task-specific models, such as 2D/3D facial keypoint or geometry estimation, facial features or expression classification, and face region segmentation. Our contributions include a lightweight feature extraction network inspired by, but distinct from MobileNetV1/V2, a GPU-friendly anchor scheme modified from Single Shot MultiBox Detector (SSD), and an improved tie resolution strategy alternative to non-maximum suppression.

Model Card

Model summary following M. Mitchell et al., "Model Cards for Model Reporting", FAT* '19: Conference on Fairness, Accountability, and Transparency, January 29–31, 2019, Atlanta, GA, USA.

Model Card.pdf

Paper

Published in the proceedings of the Third Workshop on Computer Vision for AR/VR, June 17, 2019, Long Beach, CA.

Paper.pdf

Spotlight Slides

As presented at the workshop.

BlazeFace Spotlight

Poster

As presented at the workshop.

Poster.pdf