
MediaPipe: A Framework for Building Perception Pipelines

Camillo Lugaresi, Jiuqiang Tang, Hadon Nash, Chris McClanahan, Esha Uboweja, Michael Hays, Fan Zhang, Chuo-Ling Chang, Ming Guang Yong, Juhyun Lee, Wan-Teh Chang, Wei Hua, Manfred Georg and Matthias Grundmann

Open-sourced at https://github.com/google/mediapipe


See demos at the CVPR Expo Hall (Google booth) and get free MediaPipe t-shirts!

Tuesday, 18 Jun 2019, 10:15 - 11:15 PDT

Thursday, 20 Jun 2019, 15:20 - 16:20 PDT

Abstract

Building an application that processes perceptual inputs involves more than running an ML model. Developers have to harness the capabilities of a wide range of devices; balance resource usage and quality of results; run multiple operations in parallel and with pipelining; and ensure that time-series data is properly synchronized. The MediaPipe framework addresses these challenges. A developer can use MediaPipe to easily and rapidly combine existing and new perception components into prototypes and advance them to polished cross-platform applications. The developer can configure an application built with MediaPipe to manage resources efficiently (both CPU and GPU) for low-latency performance, to handle synchronization of time-series data such as audio and video frames, and to measure performance and resource consumption. We show that these features enable a developer to focus on algorithm or model development and to use MediaPipe as an environment for iteratively improving their application, with results reproducible across different devices and platforms. MediaPipe has been open-sourced at https://github.com/google/mediapipe.
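
To make the pipeline-as-graph model concrete, here is a minimal C++ sketch along the lines of the framework's "hello world" example: a graph config in protobuf text format wires a single PassThroughCalculator between an input stream and an output stream, and the host code pushes string packets through it with explicit timestamps. The exact headers and status macros have shifted across MediaPipe releases, so treat this as illustrative rather than version-exact.

    #include <string>

    #include "mediapipe/framework/calculator_graph.h"
    #include "mediapipe/framework/port/logging.h"
    #include "mediapipe/framework/port/parse_text_proto.h"
    #include "mediapipe/framework/port/status.h"

    // Builds a one-node graph (PassThroughCalculator), streams ten string
    // packets through it, and prints whatever arrives on the output stream.
    mediapipe::Status RunPassThroughGraph() {
      // The pipeline is declared as data: a graph config in protobuf text
      // format naming the streams and the calculators that connect them.
      mediapipe::CalculatorGraphConfig config =
          mediapipe::ParseTextProtoOrDie<mediapipe::CalculatorGraphConfig>(R"(
            input_stream: "in"
            output_stream: "out"
            node {
              calculator: "PassThroughCalculator"
              input_stream: "in"
              output_stream: "out"
            }
          )");

      mediapipe::CalculatorGraph graph;
      MP_RETURN_IF_ERROR(graph.Initialize(config));

      // Pull packets from the graph's output stream with a poller.
      ASSIGN_OR_RETURN(mediapipe::OutputStreamPoller poller,
                       graph.AddOutputStreamPoller("out"));
      MP_RETURN_IF_ERROR(graph.StartRun({}));

      // Every packet carries an explicit timestamp; the framework uses these
      // to keep time-series streams (e.g. audio and video) synchronized.
      for (int i = 0; i < 10; ++i) {
        MP_RETURN_IF_ERROR(graph.AddPacketToInputStream(
            "in", mediapipe::MakePacket<std::string>("Hello MediaPipe!")
                      .At(mediapipe::Timestamp(i))));
      }
      MP_RETURN_IF_ERROR(graph.CloseInputStream("in"));

      mediapipe::Packet packet;
      while (poller.Next(&packet)) {
        LOG(INFO) << packet.Get<std::string>();
      }
      return graph.WaitUntilDone();
    }

Because the pipeline is declared as data rather than hard-coded, the same graph definition can be reused across the devices and platforms the abstract mentions, with only the calculators' implementations varying per target.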

Third Workshop on Computer Vision for AR/VR, Long Beach, CA, 2019