CS 395T Spring 2024

Learning for 3D Humans

Time & Location: MW 3:30-5 pm GDC 2.210 

Quick Links: Syllabus | Ed Discussion | Canvas | Self-report Participation Form | Paper Review Template | Presentation Template

Course Description

If we want to develop intelligent agents that can interact, help and collaborate with humans, we need to endow said agents with the ability to perceive and understand humans from visual observations. These observations are not independent streams of pixels - they are a depiction of the underlying 3D structure. In this course, we will study the inference of the dynamic structure of humans from visual observations. We will cover a wide range of topics related to human body modeling, inference of 3D human structure and motion from images and video, as well as the use of these tools in related applications, e.g., in graphics, robotics, mixed reality, and biometrics. Most topics will be approached in the context of the recent advances in (deep) learning and the impact they have had in the progress of the area.

The format of this course will be a mix of lectures and student presentations/ seminar-style discussions. Students will be responsible for paper readings, class presentation, class participation, and completing a final project.

Instructor: Georgios Pavlakos 

Email: pavlakos@cs.utexas.edu 

Office Hour: After class, 5:00-6:00 pm, Monday/Wednesday
GDC 4.810

Website

TA: Hanwen Jiang

Email: hwjiang@utexas.edu 

Office Hour: 10 - 11 am Monday GDC 4.718

Website

Online Platforms:

Prerequisites: The course assumes a good knowledge of computer vision and deep learning. We will not enforce any strict prerequisites, but intro courses to Computer Vision or Machine Learning would be very helpful.