Let's Talk About Language! Investigating Linguistic Diversity in Embodied AI Datasets (Oral)
Selma Liliane Wanna, Agnes Luhtaru, Ryan Barron, Jonathan Salfity, Juston Moore, Cynthia Matuszek, Mitch Pryor
Run-time Observation Interventions Make Vision-Language-Action Models More Visually Robust (Oral)
Asher James Hancock, Allen Z. Ren, Anirudha Majumdar
Robot Utility Models: General Policies for Zero-Shot Deployment in New Environments (Oral)
Haritheja Etukuru, Norihito Naka, Zijin Hu, Seungjae Lee, Chris Paxton, Soumith Chintala, Lerrel Pinto, Nur Muhammad Mahi Shafiullah
Adapting Diffusion Policies to Human Preferences via Reward-Guided Fine-Tuning (Oral)
Yuxin Chen, Devesh K. Jha, Masayoshi Tomizuka, Diego Romeres
Probing a Vision-Language-Action Model for Symbolic States and Integration into a Cognitive Architecture (Spotlight)
Hong Lu, Matthias Scheutz
Human-in-the-loop Foundation Model Failure Recovery for Robot-Assisted Bite Acquisition (Spotlight)
Krishna Palempalli, Rohan Banerjee, Sarah Dean, Tapomayukh Bhattacharjee
Interleave-VLA: Enhancing Robot Manipulation with Interleaved Image-Text Instructions (Spotlight)
Cunxin Fan, Xiaosong Jia, Yihang Sun, Yixiao Wang, Jianglan Wei, ZiYang Gong, Xiangyu Zhao, Masayoshi Tomizuka, Xue Yang, Junchi Yan, Mingyu Ding
KitchenVLA: Iterative Vision-Language Corrections for Robotic Execution of Human Tasks (Spotlight)
Kai Lu, Chenyang Ma, Chiori Hori, Diego Romeres
Towards Safe Robot Foundation Models Using Inductive Biases (Spotlight)
Maximilian Tölle, Theo Gruner, Daniel Palenicek, Tim Schneider, Jonas Günster, Joe Watson, Davide Tateo, Puze Liu, Jan Peters
Semantically Safe Robot Manipulation: From Semantic Scene Understanding to Motion Safeguards (Spotlight)
Lukas Brunke, Yanni Zhang, Ralf Römer, Jack Naimer, Nikola Staykov, SiQi Zhou, Angela P. Schoellig
Adaptive Energy Regularization for Autonomous Gait Transition and Energy-Efficient Quadruped Locomotion (Spotlight)
Boyuan Liang, Lingfeng Sun, Xinghao Zhu, Bike Zhang, Ziyin Xiong, Yixiao Wang, Chenran Li, Koushil Sreenath, Masayoshi Tomizuka
Versatile Legged Locomotion Adaptation through Vision-Language Grounding (Spotlight)
I Made Aswin Nahrendra, Seunghyun Lee, Dongkyu Lee, Hyun Myung
Vision Foundation Model Embedding-Based Semantic Anomaly Detection (Spotlight)
Max Peter Ronecker, Matt Foutter, Amine Elhafsi, Daniele Gammelli, Ihor Barakaiev, Marco Pavone, Daniel Watzenig
Residual Policy Gradient: A Reward View of KL-regularized Objective (Spotlight)
Pengcheng Wang, Xinghao Zhu, Yuxin Chen, Chenfeng Xu, Masayoshi Tomizuka, Chenran Li
CREStE: Scalable Mapless Navigation with Internet Scale Priors and Counterfactual Guidance (Spotlight)
Arthur Zhang, Harshit Sikchi, Amy Zhang, Joydeep Biswas
OpenLex3D: A New Evaluation Benchmark for Open-Vocabulary 3D Scene Representations (Spotlight)
Christina Kassab, Sacha Morin, Martin Büchner, Matias Mattamala, Kumaraditya Gupta, Abhinav Valada, Liam Paull, Maurice Fallon