Yuke Zhu is an Associate Professor in the Department of Computer Science at the University of Texas at Austin and the director of the Robot Perception and Learning (RPL) Lab. He is also a Director and Distinguished Research Scientist at NVIDIA Research, where He co-leads the Generalist Embodied Agent Research (GEAR) group. His goal is to build algorithms and systems for autonomous robots and embodied agents that reason about and interact with the real world. His research lies at the intersection of robotics, machine learning, and computer vision. He focuses on developing methods and principles of perception and decision-making to realize general-purpose robot autonomy in the wild.
Talk: "Building Self-Improving and Continual Learning VLAs"
Abstract - [TBD]
Ziwei Wang is currently an assistant professor in School of Electrical and Electronic Engineering, Nanyang Technological University and the director of Perception and embodied INtElligence (PINE) Lab. Before joining NTU, he was a postdoc fellow in Robotics Institute, Carnegie Mellon University, with Prof. Changliu Liu. He received the Ph.D and the B.S degrees from the Department of Automation, Tsinghua University in 2023 and the Department of Physics, Tsinghua University in 2018 respectively. His research goal is to design foundation models (FMs) for robotic manipulation. He has published over 50 scientific papers in TPAMI, IJCV, RAL, CVPR, ICCV, ECCV, NeurIPS, CoRL, IROS and ICRA. He serves as a regular reviewer member for a variety of conferences and journals.
Talk: "Accelerating Sample-Efficient Reinforcement Learning for Real-World Robotic Manipulation"
Real-world robotic reinforcement learning is fundamentally constrained by expensive and risky data collection, makingsample efficiency, robustness, and reasoning abilityessential for practical deployment. However, existing approaches often suffer from inefficient exploration, limited state coverage, and unstable policy updates when interacting with complex physical environments. To address these challenges, we develop a unified research line that improves real-world learning throughinformative sample selection, diversity-aware exploration, data augmentation, reasoning-enhanced decision making, and structured policy optimization. Concretely, our works include: (1) entropy-guided sample prioritization and diversity-driven exploration to increase effective data utilization, (2) exploratory data augmentation for manipulation robustness, (3) online reasoning mechanisms for Vision-Language-Action models, and (4) explicit manifold alignment to stabilize online policy learning. These efforts collectively advance practical robotic learning systems that learn faster, generalize better, and operate more reliably in real-world scenarios.
Chao Yu(于超 received her Ph.D. from the Department of Electronic Engineering at Tsinghua University in 2023. She is currently an Assistant Professor (Distinguished Research Fellow) at the Embodied Decision Intelligence Lab (EDI Lab) at Tsinghua Shenzhen International Graduate School (SIGS). She also serves as the chairman of the Tsinghua Shenzhen International Graduate School - AgiBot Joint Research Center for Embodied Cognition and Decision Systems (JCES) 清华-智元联合研究中⼼主任. She has been selected for the Youth Talent Support Program of the Chinese Institute of Electronics. Her research has long focused on reinforcement learning–based decision intelligence.
Talk: "RLinf: A highly flexible, scalable, large-scale reinforcement learning framework for embodied intelligence"
Abstract - [TBD]
Will is a third year PhD student at UC Berkeley, advised by Sergey Levine. His research focuses on Robots that Reason -- how embodied reasoning can be leveraged to make robotic foundation models more generalizable, steerable, and amenable to improvement. Prior to Berkeley, he received his BS and MEng at MIT, advised by Luca Carlone and Jacob Andreas. His research there focused on how to apply semantic knowledge in language models for robotic scene understanding. He was also a robotics intern at NASA's Jet Propulsion Laboratory.
Talk: "Steering Robotic Generalists with Semantic Reasoning"
Robotic generalist policies are trained on large-scale, diverse datasets, thus internalizing many useful behaviors. We posit that semantic knowledge can elicit appropriate behaviors to solve new tasks, even if directly commanding the generalist policy fails to do so. We present two mechanisms for this: First, by training Steerable Policies: vision-language-action models (VLAs) that can follow a much wider range of language commands, spanning a range of abstractions -- from subtask-level plans to pixel coordinates. In accepting this spectrum of "steering commands," the policy more readily makes use of vision-language models' skills for planning and in-context reasoning. It also enables Semantic Action Reinforcement Learning (SARL) -- wherein we learn which prompts and abstractions reliably cause the VLA to solve the task, thus treating generalist policy prompts as an action space. Alternatively, we can use semantic reasoning to pick out the "reasonable" skills from the generalist's wide action prior. Specifically, we propose Flow Reversal Steering (FRS): a method for refining rough sketches of good robot behaviors into similar precise actions by passing them through generalist flow policies in reverse. We show how, while VLM reasoners struggle with directly outputting low-level actions, FRS allows them to focus on making high-level semantic inferences, which the generalist "projects" into good actions. In turn, FRS can bootstrap RL by exploring semantically-reasonable interactions, thereby improving and accelerating learning in tasks that base policies completely fail at. With both Steerable Policies and FRS, we hope to show a new paradigm for generalist reinforcement learning, wherein policy improvement is guided with both rewards and reasoned semantic knowledge.