Deductive Chain-of-Thought Augmented Socially-aware Robot Navigation World Model
Anonymous Authors
Anonymous University
Submitted to ICRA 2026
Anonymous Authors
Anonymous University
Submitted to ICRA 2026
Abstract
Social robot navigation increasingly relies on large language models for reasoning and path planning, enabling movement in dynamic human spaces. However, this approach can be unpredictable, difficult to control, and sometimes unsafe due to inherent LLM limitations. In this work, we propose NaviWM, a socially-aware robot Navigation World Model that represents human-robot interaction (HRI) features with respect to robot inner understanding paradigm. Meanwhile, the deductive CoT (Chain-of-Thoughts) is leveraged by NaviWM to enhance the reasoning ability of LLMs, thereby supporting socially-aware navigation with logical proofs. Prior works mainly use prompting or fine-tuning for social navigation, but their outputs often fail due to methodological limits and incomplete task control. NaviWM consists of two main components: (1) a dynamically populated world model derived from the environment, designed to model entities within the space, and (2) a deductive reasoning CoT approach that enables language models to systematically reason about the action space. We conducted extensive experiments using NaviWM to compare its performance against other baselines.
Core Contributions of NaviWM
1. We propose a novel LLM-based social navigation planner, named NaviWM to address the lack of physical knowledge on existing LLM, NaviWM leverages a world model to either capture complex environmental dynamics and HRI features as a spatial-temporal graph within a human-designed environment, or to exemplify the inner reasoning paradigm of LLM.
2. Converting comprehensive decision-making problem of social navigation tasks into a multi-step logical reasoning procedure. NaviWM introduces a natural deductive multi-step CoT (Chain-of-Thought) framework to enhance LLM reasoning ability, involving logical inference, verification, and other operations.
3. Incorporating world model and deductive CoT to enhance the integration of LLM and physical environments, addressing the multi-variable optimization problem. Our approach achieves strong performance with notable improvements over other baselines or ablation methods.
Architecture of NaviWM
The world model illustration of social robot navigation scenario: NaviWM constructs the world model from local observation to capture both agent vertex information and environmental semantic information with respect to spatial-temporal HRI features.
The architecture of NaviWM: (1). NaviWM constructs a world model to represent environmental description from local observation; (2). The deductive CoT algorithm is encoded as prompt engineering for LLM reasoning guidance; (3). The inference chains are generated step-by-step with respect to logical procedure and self-validation step; (4). Final robot action is obtained in the final step of the deductive CoT.
Simulation Scenario
We design a gym-based simulator environment illustrates a socially-aware navigation scenario in which a mobile robot (shown as a yellow marker) must move from its starting position toward a designated destination (the red star). Along the way, the robot must account for the presence of multiple pedestrians, represented as green circles with directional arrows indicating their walking trajectories.
Inspired by [1], we introduce a new representation of social distance based on observed human activities. In our environments, typical activities that are uniformly generated in the simulation are categorized as walking (default distance = 0.35 m), using a personal device (preferred distance = 0.85 m, requiring greater separation), and group interaction (default distance = 0.5 m), as shown in right figure.
[1] Thepsychophysical representation of proxemics by Hall (1963), which predicts the sensations an agent would likely experience in different physical proxemic configurations within the psychological distance zones.
NaviWM: Socially-aware Robot Navigation World Model Illustration
Other baseline without world model illustration
Comparison Simulation Experiments and Trajectory Illustrations
LLM Inference Procedure (NaviWM)
LLM Inference Procedure (Baselines)
LLM Inference Procedure (Ablation Models)
Experiment Results
References
[1]. Mead, Ross, and Maja J. Matarić. "Autonomous human–robot proxemics: socially aware navigation based on interaction potential." Autonomous Robots 41.5 (2017): 1189-1201.