Frontiers of Monocular 3D Perception

CVPR 2021 Workshop

Toyota Technological Institute at Chicago (TTIC) & Toyota Research Institute (TRI)

25th June 2021, Virtual

Overview (live stream: https://youtu.be/cjMK9fO1jAA)

This workshop considers challenges and opportunities at the frontier of 3D reasoning from monocular visual input, the main modality common to most mobile phones, robots, and cars. Our main focus is 3D perception of scenes and objects directly from images and videos. The workshop aims to identify, characterize, and investigate how to qualitatively go beyond the current limitations of monocular 3D perception. It consists of invited talks, panel discussions, a description of the Dense Depth for Autonomous Driving (DDAD) challenge, and presentations of the winning methods. The total challenge prize is $5,000.

Update: DDAD challenge deadline extended to 18th June 2021 (11:59 PM PST).

Dense Depth for Autonomous Driving (DDAD) challenge

DDAD is a new autonomous driving benchmark from TRI (Toyota Research Institute) for long-range, dense depth estimation in challenging and diverse urban conditions. DDAD includes point clouds generated by high-resolution, long-range LiDAR sensors and images from six calibrated cameras, time-synchronized at 10 Hz. In addition to the raw data, DDAD contains high-quality panoptic segmentation for a select set of validation and test frames.

The DDAD depth challenge consists of two tracks: self-supervised and semi-supervised monocular depth estimation. We will evaluate all methods against the ground-truth LiDAR depth, and we will also compute and report depth metrics per semantic class. All evaluations will be done only on images from the front camera, and the winner will be chosen based on the abs_rel (absolute relative error) metric. In addition, a jury prize will be awarded to a third competitor, based on the novelty of the method.
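
For reference, abs_rel is commonly defined as the mean of |predicted - ground truth| / ground truth over pixels that have a valid ground-truth depth. The following is a minimal sketch of that common definition and of a per-class breakdown. It is not the official challenge evaluation code. The function names, the flat-list inputs, the convention that a ground-truth depth of 0 marks a pixel without a LiDAR return, and the example class labels are all assumptions made for illustration; the official protocol may additionally cap the depth range or rescale predictions.

```python
from collections import defaultdict


def abs_rel(pred, gt):
    """Mean absolute relative error over pixels with valid ground truth.

    pred, gt: equal-length sequences of depths in meters (flattened maps).
    Assumption: gt <= 0 means "no LiDAR return" and the pixel is skipped.
    """
    errors = [abs(p - g) / g for p, g in zip(pred, gt) if g > 0]
    if not errors:
        raise ValueError("no valid ground-truth pixels")
    return sum(errors) / len(errors)


def abs_rel_per_class(pred, gt, labels):
    """abs_rel computed separately for each semantic class label.

    labels: one (hypothetical) class name per pixel, aligned with pred/gt.
    """
    per_class = defaultdict(list)
    for p, g, c in zip(pred, gt, labels):
        if g > 0:  # same validity assumption as in abs_rel
            per_class[c].append(abs(p - g) / g)
    return {c: sum(e) / len(e) for c, e in per_class.items()}


# Toy example: four pixels, one of them without ground truth (gt == 0).
pred = [11.0, 18.0, 5.0, 50.0]
gt = [10.0, 20.0, 0.0, 50.0]
labels = ["car", "car", "road", "road"]

overall = abs_rel(pred, gt)
by_class = abs_rel_per_class(pred, gt, labels)
print(f"abs_rel overall: {overall:.4f}")
for name, value in sorted(by_class.items()):
    print(f"abs_rel {name}: {value:.4f}")
```

Lower values are better. Because LiDAR ground truth is sparse, the validity check matters: averaging over pixels without a return would divide by zero or distort the score.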

DDAD dataset: https://github.com/TRI-ML/DDAD
Challenge start date: 5th April 2021
Challenge end date: 18th June 2021 (11:59 PM PST)

Program Details

Live stream link: https://youtu.be/cjMK9fO1jAA

The timezone of the conference is PDT (GMT-7).

8:00am - 8:05am -- Introductory remarks
8:05am - 8:25am -- Cordelia Schmid - "Reconstructing unconstrained hand-object interactions"
8:25am - 8:45am -- Matthias Nießner
8:45am - 9:05am -- Anelia Angelova - "From an Image to 3D"
9:05am - 9:35am -- Panel: "Do we need monocular 3D perception?" - Andreas Geiger, Jiajun Wu, Daniel Cremers, Katerina Fragkiadaki
9:35am - 9:45am -- Coffee Break
9:45am - 10:05am -- Noah Snavely
10:05am - 10:25am -- Angjoo Kanazawa
10:25am - 10:45am -- Andrea Vedaldi
10:45am - 11:20am -- DDAD challenge results + winner presentations
11:20am - 11:25am -- Coffee Break
11:25am - 11:55am -- Panel: "Are we making meaningful progress on monocular 3D perception?" - Georgia Gkioxari, Noha Radwan, Kris Kitani, Vincent Sitzmann
11:55am - 12:00pm -- Concluding remarks