NCVPRIPG23

Overview:

Digitizing humans with utmost realism is the holy grail of Metaverse-style immersive platforms, enabling a large set of telepresence applications such as digital gaming, sports analytics, content creation for multimedia/animation, and 3D virtual try-on. This is a challenging task because the body shape geometry evolves over time, yielding a large space of complex body poses as well as shape variations. In addition, there are several other challenges, such as self-occlusions by body parts, obstructions due to free-form clothing, background clutter (in-the-wild setup), sparse sets of cameras with non-overlapping fields of view (multi-view setup), and sensor noise. Traditionally, image-based techniques for 3D human digitization use stereo/multi-view setups (including RGB and depth cameras) that typically require studio environments with controlled lighting and multiple synchronized and calibrated cameras. With the advent of learning-based methods in 3D computer vision, in-the-wild human digitization has become possible. Parametric models such as SCAPE and SMPL represent the human 3D surface by parameterizing the body shape and the 3D joint locations and orientations. However, model-based reconstruction techniques fail to retain accurate geometric detail over the body surface (both body parts and garments) and are typically applicable only to tight-clothing scenarios. In this tutorial, we plan to introduce the following fundamentals to students and researchers working in 3D computer vision, computer graphics, deep learning, and related fields.

Sensing and 3D representation of the human body
Parametric 3D Human model fitting using images, monocular videos, RGB-D sequence, etc.
3D digitization of clothed human body.
Current research challenges and applications of 3D human digitization.

Tentative Schedule:

Introduction to 3D Representation and Sensing (30 minutes): Overview of popular acquisition technologies and computational representations of 3D shapes.
Parametric 3D Human Model Fitting (75 minutes): 3D human body representation models such as SCAPE and SMPL. Algorithms for fitting these models to reconstruct 3D humans.
Digitizing Clothed Humans (75 minutes): Bird's-eye-view introduction to various learning-based solutions for in-the-wild clothed human digitization.

Presenters:

Dr. Avinash Sharma is a faculty member at the International Institute of Information Technology Hyderabad (IIIT-H), India, where he is affiliated with the Centre for Visual Information Technology (CVIT). Previously, he worked as a Research Scientist at Xerox Research Center India. He received his PhD in Applied Mathematics from INP Grenoble and INRIA Rhone-Alpes, supervised by Prof. Radu Horaud. His current research interests include 3D computer vision with a focus on digitizing humans, large-scale 3D asset analysis, and content generation for AR/VR applications.

Dr. Rajendra Nagar is an Assistant Professor in the Department of Electrical Engineering at the Indian Institute of Technology Jodhpur, India. He received his B.Tech. degree from the Indian Institute of Technology Jodhpur in 2013, where he was awarded a gold medal for the best academic performance in B.Tech. Electrical Engineering. He received his Ph.D. degree in Electrical Engineering from the Indian Institute of Technology Gandhinagar and was awarded the NCC 2019 Best PhD Thesis Award. His research interests include 3D computer vision, digital geometry processing, and deep learning.