Overview:

Digitizing humans with utmost realism is the holy-grail of Metaverse kind of immersive platform enabling a large set of tele-presence applications, namely, digital gaming, sports analytics, content creation for multimedia/animation, 3D virtual try-on, etc. This is a challenging task as the body shape geometry evolves over time, yielding a large space of complex body poses as well as shape variations. In addition to this, there are several other challenges such as self-occlusions by body parts, obstructions due to free form clothing, background clutter (in-the-wild setup), sparse set of cameras with non-overlapping fields of views (multi-view setup), sensor noise, etc. Traditionally, image-based techniques for 3D human digitization uses stereo/multi-view (including RGB and depth cameras) setup that typically require studio environments with controlled lighting and multiple synchronized and calibrated cameras. With the advent of learning based methods in 3D Computer Vision, in-the-wild human digitization has become possible. The models such as SCAPE and SMPL models the human 3D surface by parameterizing the body shape and the 3D joint locations and orientation. Model based reconstruction techniques fail to capture accurate geometrical information over the body surface (both body parts and garments) is not retained and are typically applicable only for tight clothing scenarios. In this tutorial, we plan to introduce the following fundamentals to the students/researchers working in the field of 3D computer vision, computer graphics, deep learning, etc.


Tentative Schedule:  

Presenters: