EE4309: Robot Perception [course website: 2025 Fall, 2024 Fall, 2023 Fall, 2022 Fall, 2022 Spring]
This course briefly recaps deep learning basics, covers robotic perception systems spanning audio, language, and vision, and then dives into learning across modalities, e.g., vision-language grounding, vision-language navigation, audio-visual speaker verification, and large-scale multi-modal pre-training.