Visual Learning and Recognition

Class Logistics

  • When : Monday/Wednesday/Friday 12:00 PM - 1:20PM
  • Where : GHC 4307
  • Instructor : Abhinav Gupta
  • Office Hours : By Appointment


  • 2019/01/14: Class website is online!

Class Overview

Summary: A graduate course in Computer Vision with emphasis on representation and reasoning for large amounts of data (images, videos and associated tags, text, gps-locations etc) toward the ultimate goal of Image Understanding. We will be reading an eclectic mix of classic and recent papers on topics including: Theories of Perception, Mid-level Vision (Grouping, Segmentation, Poselets), Object and Scene Recognition, 3D Scene Understanding, Action Recognition, Contextual Reasoning, Image Parsing, Joint Language and Vision Models, etc. We will be covering a wide range of supervised, semi-supervised and unsupervised approaches for each of the topics above.

Prerequisites: While there are no formal prerequisites, this course assumes familiarity with computer vision (16-720 or similar) and machine learning (10-601 or similar). If you have not taken courses covering this material, consult with the instructor.


Abhinav Gupta


EDSH 121

Rohit Girdhar


EDSH 224

Kenny Marino


EDSH 222

Chen-Hsuan Lin


EDSH 212

Samantha Powers


EDSH 100

Tao Chen


EDSH 100