Speakers

Yuke Zhu

Yuke Zhu is an Associate Professor in the Department of Computer Science at the University of Texas at Austin and the director of the Robot Perception and Learning (RPL) Lab. He is also a Director and Distinguished Research Scientist at NVIDIA Research, where He co-leads the Generalist Embodied Agent Research (GEAR) group. His goal is to build algorithms and systems for autonomous robots and embodied agents that reason about and interact with the real world. His research lies at the intersection of robotics, machine learning, and computer vision. He focuses on developing methods and principles of perception and decision-making to realize general-purpose robot autonomy in the wild.

Talk: "RL for VLA Models in Three Worlds"

Abstract - [TBD]

Ziwei Wang

Ziwei Wang is currently an assistant professor in School of Electrical and Electronic Engineering, Nanyang Technological University and the director of Perception and embodied INtElligence (PINE) Lab. Before joining NTU, he was a postdoc fellow in Robotics Institute, Carnegie Mellon University, with Prof. Changliu Liu. He received the Ph.D and the B.S degrees from the Department of Automation, Tsinghua University in 2023 and the Department of Physics, Tsinghua University in 2018 respectively. His research goal is to design foundation models (FMs) for robotic manipulation. He has published over 50 scientific papers in TPAMI, IJCV, RAL, CVPR, ICCV, ECCV, NeurIPS, CoRL, IROS and ICRA. He serves as a regular reviewer member for a variety of conferences and journals.

Talk: "Data-efficient Human-in-the-loop RL for Performant Robotic Manipulation"

Human-in-the-loop reinforcement learning (HiL-RL) for robotic foundation models is fundamentally constrained by expensive and risky physical interaction, making sample efficiency, stable policy improvement, and reduced human effort essential for practical deployment. However, existing approaches often suffer from redundant data sampling, biased credit assignment over mixed-quality trajectories and intervention-intensive exploration. To address these challenges, we develop a unified research line that improves HiL-RL through informative sample scheduling, credit-aware policy optimization, and agentic intervention. Concretely, our works include: (1) entropy-guided sampling to reduce redundant real-world interaction and improve effective data utilization, (2) structured credit reassignment to correct overestimated learning targets and stabilize actor-critic updates, (3) value-aware agentic recovery mechanisms to reduce human intervention while preserving exploration efficiency. These efforts collectively advance HiL-RL systems that learn faster, require less human effort, generalize to broader manipulation states, and operate more reliably in complex physical environments.

Chao Yu

Chao Yu（于超 received her Ph.D. from the Department of Electronic Engineering at Tsinghua University in 2023. She is currently an Assistant Professor (Distinguished Research Fellow) at the Embodied Decision Intelligence Lab (EDI Lab) at Tsinghua Shenzhen International Graduate School (SIGS). She also serves as the chairman of the Tsinghua Shenzhen International Graduate School - AgiBot Joint Research Center for Embodied Cognition and Decision Systems (JCES) 清华-智元联合研究中⼼主任. She has been selected for the Youth Talent Support Program of the Chinese Institute of Electronics. Her research has long focused on reinforcement learning–based decision intelligence.

Talk: "RLinf: A highly flexible, scalable, large-scale reinforcement learning framework for embodied intelligence"

Abstract - [TBD]

William Chen

Will is a third year PhD student at UC Berkeley, advised by Sergey Levine. His research focuses on Robots that Reason -- how embodied reasoning can be leveraged to make robotic foundation models more generalizable, steerable, and amenable to improvement. Prior to Berkeley, he received his BS and MEng at MIT, advised by Luca Carlone and Jacob Andreas. His research there focused on how to apply semantic knowledge in language models for robotic scene understanding. He was also a robotics intern at NASA's Jet Propulsion Laboratory.

Talk: "Steering Robotic Generalists with Semantic Reasoning"

Robotic generalist policies are trained on large-scale, diverse datasets, thus internalizing many useful behaviors. We posit that semantic knowledge can elicit appropriate behaviors to solve new tasks, even if directly commanding the generalist policy fails to do so. We present two mechanisms for this: First, by training Steerable Policies: vision-language-action models (VLAs) that can follow a much wider range of language commands, spanning a range of abstractions -- from subtask-level plans to pixel coordinates. In accepting this spectrum of "steering commands," the policy more readily makes use of vision-language models' skills for planning and in-context reasoning. It also enables Semantic Action Reinforcement Learning (SARL) -- wherein we learn which prompts and abstractions reliably cause the VLA to solve the task, thus treating generalist policy prompts as an action space. Alternatively, we can use semantic reasoning to pick out the "reasonable" skills from the generalist's wide action prior. Specifically, we propose Flow Reversal Steering (FRS): a method for refining rough sketches of good robot behaviors into similar precise actions by passing them through generalist flow policies in reverse. We show how, while VLM reasoners struggle with directly outputting low-level actions, FRS allows them to focus on making high-level semantic inferences, which the generalist "projects" into good actions. In turn, FRS can bootstrap RL by exploring semantically-reasonable interactions, thereby improving and accelerating learning in tasks that base policies completely fail at. With both Steerable Policies and FRS, we hope to show a new paradigm for generalist reinforcement learning, wherein policy improvement is guided with both rewards and reasoned semantic knowledge.

Page updated

Google Sites

Report abuse