Paper Details Please Check OpenReview
World Models as Execution Simulators for Automated Program Repair
Authors: Mysore supreeth, Atik Faysal, Manish Mehta, Sunil Kothari
ENACT: Evaluating Embodied Cognition with World Modeling of Egocentric Interaction
Authors: Qineng Wang, Wenlong Huang, Yu Zhou, Hang Yin, Tianwei Bao, Jianwen Lyu, Weiyu Liu, Ruohan Zhang, Jiajun Wu, Li Fei-Fei, Manling Li
stable-worldmodel-v1: Reproducible World Modeling Research and Evaluation [Tiny Paper]
Authors: Lucas Maes, Quentin Le Lidec, Dan Haramati, Nassim Massaudi, Damien Scieur, Yann LeCun, Randall Balestriero
Computer-Using World Model
Authors: Yiming Guan, Rui Yu, John Zhang, Lu Wang, Chaoyun Zhang, Liqun Li, Bo Qiao, Si Qin, He Huang, Fangkai Yang, Pu Zhao, Lukas Wutschitz, Samuel Kessler, Huseyin A Inan, Robert Sim, Saravan Rajmohan, Qingwei Lin, Dongmei Zhang
Coherence-Validated Causal World Models for Multi-Scale Alzheimer’s Disease Progression and Pharmacologic Reversal
Authors: David Scott Lewis, Enrique Zueco
Ctrl-World: A Controllable Generative World Model for Robot Manipulation
Authors: Yanjiang Guo, Lucy Xiaoyang Shi, Jianyu Chen, Chelsea Finn
Tree of Options: Temporally Extended World Modeling, Planning, and Execution with Large Language Models
Authors: Xiaoling Zeng, Dingyang Chen, Qi Zhang
[Tiny Paper] GEST-Engine: Controllable Multi-Actor Video Synthesis with Perfect Spatiotemporal Annotations
Authors: Nicolae Cudlenco, Mihai Masala, Marius Leordeanu
Hierarchical World Models for Strategic AI Agents: Bridging Simulation and Reality through Multi-Fidelity Learning
Authors: Mysore supreeth, Atik Faysal, Manish Mehta, Sunil Kothari
Grounding Generated Videos in Feasible Plans via World Models
Authors: Christos Ziakas, Amir Bar, Alessandra Russo
VFMF: Dense Forecasting by Generating Foundation Model Features
Authors: Gabrijel Boduljak, Yushi Lan, Christian Rupprecht, Andrea Vedaldi
Speedup Patch: Learning a Plug-and-Play Policy to Accelerate Embodied Manipulation
Authors: Zhichao Wu, Junyin Ye, Zhilong Zhang, Yihao Sun, Haoxin Lin, Haoxiang Ren, Jiaheng Luo, Lei Yuan, Yang Yu
Safe Context Switching for Agents in the Wild: Mitigating Subspace Interference via Orthogonal Adaptation
Authors: Akash Das, Ishan Roy
Mnemo: Policy Learning Accelerated by Experience
Authors: Xingrui Gu, Chuyi Jiang
World Action Verifier: Self-Improving World Models via Forward-Inverse Asymmetry
Authors: Yuejiang Liu, Fan Feng, Lingjing Kong, Weifeng Lu, Jinzhou Tang, XiangCheng Zhang, Kun Zhang, Kevin Murphy, Chelsea Finn, Yilun Du
Uncertainty-Aware Robotic World Model Makes Offline Model-Based Reinforcement Learning Work on Real Robots
Authors: Chenhao Li, Andreas Krause, Marco Hutter
Model Space Reasoning as Search in Feedback Space for Planning Domain Generation
Authors: James Oswald, Daniel Obolensky, Volodymyr Varha, Vasilije Dragovic, Kavitha Srinivas, Harsha Kokel, Michael Katz, Shirin Sohrabi
Parallel Stochastic Gradient-Based Planning for World Models
Authors: Michael Psenka, Michael Rabbat, Aditi S. Krishnapriyan, Yann LeCun, Amir Bar
[Tiny Paper] Probabilistic Dreaming for World Models
Authors: Gavin Y. Wong
FluIDWorld: Fluid-like Interactive Dynamics for 4D Worlds
Authors: Hyeongju Mun, In-Hwan Jin, Sohyeong Kim, Kyeongbo Kong
Beyond Patient Invariance: Learning Cardiac Dynamics via Action-Conditioned JEPAs
Authors: Jose Geraldo Fernandes, Luiz Facury de Souza, Pedro Robles Dutenhefner, Wagner Meira Jr.
Multi-scale Predictive Representations for Goal-conditioned Reinforcement Learning
Authors: Valliappan CA, David Meger, Sai Rajeswar, Pietro Mazzaglia
What Drives Compositional Generalization? The Importance of Continuous Training Objectives in Visual Generative Models
Authors: Karim Farid, Rajat Sahay, Yumna Alnaggar, Simon Schrodi, Volker Fischer, Cordelia Schmid, Thomas Brox
PhysLang: a Small Diagnostic Framework for Language-Grounded World Modeling
Authors: Noor Mairukh Khan Arnob, Azmine Toushik Wasi
Towards Practical World Model-based Reinforcement Learning for Vision-Language-Action Models
Authors: Zhilong Zhang, Haoxiang Ren, Yihao Sun, Yifei Sheng, Haonan Wang, Zhichao Wu, Haoxin Lin, Pierre-Luc Bacon, Yang Yu
Learning Navigable World Models via Latent Energy Shaping
Authors: Luiz Facury de Souza, Jose Geraldo Fernandes, Pedro Robles Dutenhefner, Wagner Meira Jr.
Motion Attribution for Video Generation
Authors: Xindi Wu, Despoina Paschalidou, Jun Gao, Antonio Torralba, Laura Leal-Taixé, Olga Russakovsky, Sanja Fidler, Jonathan Lorraine
CausalPhysics: Unifying Semantic Reasoning, Physical Dynamics, and Counterfactual Simulation in World Models
Authors: Mysore supreeth, Manish Mehta
Planning with Unified Multimodal Models
Authors: Yihao Sun, Zhilong Zhang, Yang Yu, Pierre-Luc Bacon
Consistent Video World Model With Geometry-Aware Rotary Position Embedding
Authors: Chendong Xiang, Jiajun Liu, Jintao Zhang, Xiao Yang, Zhengwei Fang, Shizun Wang, Zijun Wang, Yingtian Zou, Hang Su, Jun Zhu
Dreamer-CDP: Improving Reconstruction-free World Models Via Continuous Deterministic Representation Prediction [Tiny Paper]
Authors: Michael Hauri, Friedemann Zenke
World Action Models are Zero-shot Policies
Authors: Seonghyeon Ye, Yunhao Ge, Kaiyuan Zheng, Shenyuan Gao, Sihyun Yu, George Kurian, Suneel Indupuru, You Liang Tan, Chuning Zhu, Jiannan Xiang, Ayaan Naveed Malik, Kyungmin Lee, William Liang, Nadun Ranawaka Arachchige, Jiasheng Gu, Yinzhen Xu, Guanzhi Wang, Fengyuan Hu, Avnish Narayan, Johan Bjorck, Jing Wang, Gwanghyun Kim, Dantong Niu, Ruijie Zheng, Yuqi Xie, Jimmy Wu, Qi Wang, Danfei Xu, Yilun Du, Ryan Julian, Yevgen Chebotar, Scott Reed, Jan Kautz, Yuke Zhu, Linxi Fan, Joel Jang
Visual Generation Unlocks Human-Like Reasoning through Multimodal World Models
Authors: Jialong Wu, Xiaoying Zhang, Hongyi Yuan, XiangCheng Zhang, Tianhao Huang, Changjing He, Chaoyi Deng, Renrui Zhang, Youbin Wu, Mingsheng Long
Seemingly Simple Planning Problems are Computationally Challenging: The Countdown Game
Authors: Michael Katz, Harsha Kokel, Sarath Sreedharan
PREDICTING CAMERA POSE FROM PERSPECTIVE DESCRIPTIONS FOR SPATIAL REASONING
Authors: Xuejun Zhang, Aditi Tiwari, Zhenhailong Wang, Heng Ji
[Tiny Paper] Modular Training-Free Construction of Executable 3D Worlds from Narrative Text
Authors: Sanchit Singh
Reinforcement Learning with World Models for Optimizing Alzheimer’s Disease Treatment Timing and Dosing
Authors: David Scott Lewis, Enrique Zueco
Dyna-Think: Synergizing Reasoning, Acting, and World Model Simulation in AI Agents
Authors: Xiao Yu, Baolin Peng, Ruize Xu, Michel Galley, Hao Cheng, Suman Nath, Jianfeng Gao, Zhou Yu
A Lightweight Library for Energy-Based Joint-Embedding Predictive Architectures
Authors: Basile Terver, Randall Balestriero, Megi Dervishi, David Fan, Quentin Garrido, Tushar Nagarajan, Koustuv Sinha, Wancong Zhang, Michael Rabbat, Yann LeCun, Amir Bar
EvoWorld: Evolving Panoramic World Generation with Explicit 3D Memory
Authors: Jiahao Wang, Luoxin Ye, Taiming Lu, Junfei Xiao, Jiahan Zhang, Yuxiang Guo, Xijun Liu, Rama Chellappa, Cheng Peng, Alan Yuille, Jieneng Chen
DexSIM: Real-time Dexterous Simulation with Unified Causal Video Diffusion
Authors: Adam Lee
[Tiny Paper] Safe Streaming Flow Planning by Aligning Generation with Execution
Authors: Seunghwan Jang, Jeongyong Yang, Siddharth Ancha, SooJean Han
EGO-FLIGHT: Egocentric Grounding of Order for Frame-Level Inference in General Human Timelines
Authors: Jiahang He, Anya Singh, Jai Relan, Varun Nair
[TINY PAPER] Temporal Reversal Asymmetry: A Physics-Inspired Metric for Evaluating World Models
Authors: Kanpat Vesessook, Kevin Yang
What Do World Models Learn in RL? Probing Latent Representations in Learned Environment Simulators
Authors: Xinyu Zhang
WestWorld: A Knowledge-Encoded Scalable Trajectory World Model for Diverse Robotics
Authors: Yuchen Wang, Jiangtao Kong, Sizhe Wei, Xiaochang Li, Haohong Lin, Hongjue Zhao, Tianyi Zhou, Lu Gan, Huajie Shao
Toward World Models for Epidemiology
Authors: Zeeshan Memon, Yiqi Su, Christo Kurisummoottil Thomas, Walid Saad, Liang Zhao, Naren Ramakrishnan
Next Embedding Prediction Makes World Models Stronger
Authors: George Bredis, Nikita Balagansky, Daniil Gavrilov, Ruslan Rakhimov
Understanding Early Collapse in Predictive World-Model Pretraining
Authors: Sofiane ENNADIR, Levente Zólyomi, Oleg Smirnov
Latent Imagination Thinking: Beyond Recursive Models for Reasoning
Authors: Karim Farid, Jelena Bratulić, Sudhanshu Mittal, Cordelia Schmid, Thomas Brox
[Tiny Paper] Toward Pixel-Grounded World Models for Powered Descent: A Rocket Landing Benchmark and Expert Baseline
Authors: Charles Duong, Aviral Vaidya, Aditya Iyer, Lucas Maes, Aidan LaBella, Randall Balestriero
BlockMamba: Efficient Scalable Structured Sparsity for Mamba
Authors: Harshvardhan Mestha, Khaleelulla Khan Nazeer, David Kappel, Anand Subramoney
LaMo: A Latent Motion World Model for Long-Horizon Prediction
Authors: Azwar Abdulsalam, Christopher Hoang, Mengye Ren
Model-Based Meta-Learning for Algorithm Discovery
Authors: Theo Wolf, Alexander David Goldie, Jarek Luca Liesen, Uljad Berdica, Mattie Fellows, Jakob Nicolaus Foerster
H-WM: Robotic Task and Motion Planning Guided by Hierarchical World Model
Authors: Wenyuan Chen, Jinbang Huang, Oscar Pang, Zhiyuan Li, Xiao Hu, Lingfeng Zhang, Zhanguang Zhang, Mark Coates, Tongtong Cao, Xingyue Quan, Yingxue Zhang
SpaRRTa: A Synthetic Benchmark for Evaluating Spatial Intelligence in Visual Foundation Models
Authors: Turhan Can KARGIN, Wojciech Jasiński, Adam Pardyl, Bartosz Michał Zieliński, Marcin Przewięźlikowski
[Tiny Paper] Shortcut World Models: Learning to Leap, Not Step
Authors: Pranav Lakshmanan, Paras Chopra
Physical Informed Driving World Models
Authors: Zhuoran Yang, Yanyong Zhang
Instance-Dependent Continuous-Time Reinforcement Learning via Maximum Likelihood Estimation
Authors: Runze Zhao, Yue Yu, Ruhan Wang, Chunfeng Huang, Dongruo Zhou
World-Gymnast: Training Robots with Reinforcement Learning in a World Model
Authors: Ansh Kumar Sharma, Yixiang Sun, Ninghao Lu, Yunzhe Zhang, Jiarao Liu, Sherry Yang
Bootstrapped Mixed Rewards for RL Post-Training: Injecting Canonical Action Order
Authors: Prakhar Gupta, Vaibhav Gupta
Identity-GRPO: Optimizing Multi-Human Identity-preserving Video Generation via Reinforcement Learning
Authors: Xiangyu Meng, Zixian Zhang, Zhenghao Zhang, Junchao Liao, Long Qin, Weizhi Wang
Structure from Diffusion: Taming Video Diffusion Models for Camera Pose Estimation in Dynamic Videos
Authors: Sihan Liu, Zhuoyuan Wu, Heng Yu, Jun Gao, Jose M. Alvarez
Lifting Ego World Models for Planning and Control
Authors: Alex N Wang, Trevor Darrell, Pavel Izmailov, Yutong Bai, Amir Bar
Rethinking Video Generation Model for the Embodied World
Authors: Yufan Deng, Zilin Pan, Hongyu Zhang, Xiaojie Li, Huruoqing, Yufei Ding, Yiming Zou, Yan Zeng, Daquan Zhou
Model Predictive Control with Differentiable World Models for Offline Reinforcement Learning
Authors: Rohan Deb, Stephen J. Wright, Arindam Banerjee
Reward-Forcing: Autoregressive Video Generation with Reward Feedback
Authors: Jingran Zhang, Ning Li, Yuanhao Ban, Andrew Bai, Justin Cui
CausalSliders: Graph-Guided LoRA Interventions for Causally Consistent Image Editing
Authors: Aditi Tiwari, Akshit Bhalla, Darshan Ganesh Prasad, Heng Ji
Cognitive Digital Twin Framework: Modeling and Real-Time Decision Making
Authors: Yangyang Zhang, Mengtong Li, Xinyu Wang, Zhihao Lin, Xiang Luo, Ernie Tian, Ning Lyu, Zhiguo Tao, Xiaotong Ding, Chuanzhen Wang
Hierarchical Latent Action Model
Authors: Hanjung Kim, Lerrel Pinto, Seon Joo Kim
A Behavioural and Representational Evaluation of Goal-Directedness in Language Model Agents
Authors: Raghu Arghal, Phoebe Chen, Niall Dalton, Evgenii Kortukov, Calum McNamara, Angelos Nalmpantis, Moksh Nirvaan, Gabriele Sarti, Mario Giulianelli
RigidBench: Evaluating Rigid-Body Physics in Video Generation Models
Authors: Swarnim Jain, Shangzhe Wu
the Mouth is Not the Brain: Bridging Energy-Based World Models and Language Generation
Authors: Junichiro Niimi
Do LLMs Build Spatial World Models? Evidence from Grid-World Maze Tasks
Authors: Weijiang Li, Yilin Zhu, Rajarshi Das, Parijat Dube
Spiking Neural Networks for Continuous Control: Neuromorphic Reinforcement Learning in Conventional Computing
Authors: Jessica Hunter, Krishna Roy, Md Maruf Hossain Shuvo
Learning Evolving Latent Strategies for Multi-Agent Language Systems without Model Fine-Tuning
Authors: Wenlong Tang
MULTI-COMPONENT OUTCOME PREDICTION FOR ENTERPRISE ROUTING VIA HIERARCHICAL CREDIT ASSIGNMENT
Authors: Mysore supreeth, Atik Faysal, Manish Mehta, Sunil Kothari, Tao Liu
[Tiny Paper] Intrinsic-Energy Joint Embedding Predictive Architectures Induce Quasimetric Spaces
Authors: Anthony Kobanda, Waris Radji
VisGym: Diverse, Customizable, Scalable Environments for Multimodal Agents
Authors: Zirui Wang, Junyi Zhang, Jiaxin Ge, Long Lian, Letian Fu, Lisa Dunlap, Ken Goldberg, XuDong Wang, Ion Stoica, David M. Chan, Sewon Min, Joseph E. Gonzalez
Action Shapley: A training data selection metric for Training World Models for Reinforcement Learning
Authors: Rajat Ghosh, Debojyoti Dutta
CausalSpatial: A Benchmark for Object-Centric Causal Spatial Reasoning
Authors: Wenxin Ma, Chenlong Wang, Ruisheng Yuan, Hao Chen, Nanru Dai, Yijun Yang, Chengxin Qian, Zhao-Yang Wang, Alan Yuille, Jieneng Chen
Evidential Latent World Models for Safe Model-based Reinforcement Learning
Authors: Alisson Henrique Kolling, Junior Costa De Jesus, Victor Augusto Kich, Ricardo Bedin Grando, Matheus Gonçalves Mateus, Rodrigo da Silva Guerra, Paulo L. J. Drews-Jr
Simulation Distillation: Pretraining World Models in Simulation for Rapid Real-World Adaptation
Authors: Jacob Levy, Tyler Westenbroek, Kevin Huang, Fernando Palafox, Patrick Yin, Shayegan Omidshafiei, Dong-Ki Kim, Abhishek Gupta, David Fridovich-Keil
Robustness in the Face of Partial Identifiability in Reward Learning Problems
Authors: Filippo Lazzati, Alberto Maria Metelli
Active World-Model with 4D-informed Re- trieval for Exploration and Awareness
Authors: Elaheh Vaezpour, Amirhosein Javadi, Tara Javidi
Compositional Planning with Jumpy World Models
Authors: Jesse Farebrother, Matteo Pirotta, Andrea Tirinzoni, Marc G Bellemare, Alessandro Lazaric, Ahmed Touati
PointWorld: Scaling 3D World Models for In-The-Wild Robotic Manipulation
Authors: Wenlong Huang, Yu-Wei Chao, Arsalan Mousavian, Ming-Yu Liu, Dieter Fox, Kaichun Mo, Li Fei-Fei
ProgressLM: Towards Progress Reasoning in Vision-Language Models
Authors: Jianshu Zhang, Chengxuan Qian, Haosen Sun, Haoran Lu, Dingcheng Wang, Letian Xue, Han Liu
LatentGS: Probabilistic Densification for Efficient, Compact, and Faster 3D Gaussian Splatting
Authors: Shuja Khalid, Mohamed Ibrahim, Yang Liu
[Tiny Paper] Integrating Simulation and Chain-of-thought Reasoning in Multimodal-Language Models For Physical Reasoning
Authors: YingQiao Wang, Eric Bigelow, Tomer Ullman, Yujin Tang, Sebastian Risi
Neural Computers
Authors: Mingchen Zhuge, Changsheng Zhao, Haozhe Liu, Zijian Zhou, Shuming Liu, Wenyi Wang, Ernie Chang, Gael Le Lan, Junjie Fei, Wenxuan Zhang, Yasheng SUN, Yunyang Xiong, Zechun Liu, Zhipeng Cai, Yining Yang, Yuandong Tian, Yangyang Shi, Vikas Chandra, Jürgen Schmidhuber
Environment Maps: Structured Environmental Representations for Long-Horizon Agents
Authors: Yenchia Feng, Chirag Sharma, Karime Maamari
Cross-View World Models
Authors: Rishabh Sharma, Gijs Hogervorst, Wayne Mackey, David Heeger, Stefano Martiniani
GridWM-Judge: Evaluating Vision-Language Model Judges in Grid Worlds via World Model Deficits
Authors: Qinan Zhang, Qihang Jin