Instructor: Asst Prof. Mike Shou
Description:
This course briefly recaps deep learning basics, covers robotic perception systems (audio, language, and vision), and then dives into learning across modalities, e.g. vision-language grounding and vision-language navigation.
No specific textbook
Assessment:
20% open-book take-home assignments
20% project
60% closed-book final exam
TAs:
Joya Chen
Ziteng Gao
Weijia Mao
Jinheng Xie
Yuchao Gu
Topics we will cover:
Review of deep learning, NumPy, and PyTorch
Robotic auditory system
Robotic vision system
Spoken dialogue system
Range and touch sensors
Applications