Fast Trajectory Planner with a Reinforcement Learning-Based Controller for Robotic Manipulators
Yongliang Wang, Hamidreza Kasaei
Department of Artificial Intelligence, Bernoulli Institute, University of Groningen
The ability to quickly generate obstacle avoidance trajectories in unstructured and obstructed environments remains a significant challenge for robotic manipulators. This paper highlights the strong potential of model-free reinforcement learning methods over model-based approaches for obstacle-free trajectory planning in joint space. We propose a fast trajectory planning system for manipulators that integrates vision-based path planning in task space with reinforcement learning-based obstacle avoidance in joint space. The paper introduces enhancements to the Proximal Policy Optimization (PPO) algorithm, namely Action Ensembles (AE) and Policy Feedback (PF), which significantly improve precision and stability for goal-reaching and obstacle avoidance in joint space. These enhancements make PPO more adaptable to a variety of robotic tasks, thereby boosting performance. Additionally, we integrate the Fast Segment Anything (FSA) model with B-spline-optimized kinodynamic path searching to develop a vision-based trajectory planner in task space. Experimental results demonstrate the effectiveness of the PPO enhancements, Sim-to-Sim transfer for model robustness, and planner efficiency in complex scenarios. Together, these components allow the robot to perform obstacle avoidance and real-time trajectory planning in obstructed environments.
We aim to enhance the capability of manipulators for safe and efficient motion planning in environments with both static and dynamic obstacles. The primary contribution is the development of an integrated vision-based trajectory planner coupled with an enhanced RL Joint Space Controller, enabling manipulators to achieve goal-oriented motion with obstacle avoidance.
We first describe vision-based trajectory planning. We then outline the construction of the improved PPO algorithm and its role in enhancing the performance of the reaching task with obstacle avoidance for manipulators. Finally, these elements are combined to form a fast trajectory planner.
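As a rough illustration of the B-spline step in the vision-based planner, the sketch below fits a uniform cubic B-spline to a set of path points and samples a smooth trajectory from it. The waypoints and sampling density are hypothetical; this is a minimal sketch of generic B-spline smoothing, not the paper's exact kinodynamic path-searching pipeline.

```python
import numpy as np

def cubic_bspline(ctrl, n_samples=50):
    """Sample a uniform cubic B-spline defined by control points.

    ctrl: (n, d) array of control points (n >= 4), e.g. 3D waypoints
    from a path search. Returns (n_samples, d) smoothed trajectory.
    """
    ctrl = np.asarray(ctrl, dtype=float)
    m = len(ctrl) - 3  # number of cubic segments
    samples = []
    for t in np.linspace(0.0, m, n_samples, endpoint=False):
        i = int(t)       # active segment index
        u = t - i        # local parameter in [0, 1)
        # Standard uniform cubic B-spline basis functions
        basis = np.array([
            (1 - u) ** 3,
            3 * u**3 - 6 * u**2 + 4,
            -3 * u**3 + 3 * u**2 + 3 * u + 1,
            u**3,
        ]) / 6.0
        samples.append(basis @ ctrl[i:i + 4])
    return np.array(samples)

# Hypothetical task-space waypoints from a kinodynamic search
waypoints = [[0.0, 0.0, 0.2], [0.1, 0.2, 0.3], [0.3, 0.3, 0.4],
             [0.5, 0.2, 0.4], [0.6, 0.0, 0.3]]
trajectory = cubic_bspline(waypoints)  # shape (50, 3)
```

Because the basis functions are non-negative and sum to one, every sampled point stays inside the convex hull of the active control points, which keeps the smoothed trajectory close to the searched path.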
Vision-based Trajectory Planning in Task Space
RL-based Joint Space Controller for Obstacle Avoidance
PPO with various AE methods
PPO with PF method
PPO performance with both AE and PF
PPO_PF_AEL with different alpha
PPO_PF_AEP with different alpha
PPO_PF_AEB with different alpha
PPO_PF_AEE with different alpha
PPO_PF_AEW with different alpha
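The figures above compare action-ensemble variants under different values of the mixing coefficient alpha. As an illustrative guess at how an action ensemble with such a coefficient might operate (the exact AE formulations are not reproduced here), the sketch below blends the deterministic mean of a Gaussian policy with the average of several sampled actions.

```python
import numpy as np

rng = np.random.default_rng(0)

def ensemble_action(mean, std, k=5, alpha=0.5):
    """Blend the policy mean with an ensemble of k sampled actions.

    alpha weights the ensemble average against the deterministic
    mean. This is a hypothetical interpretation of an action
    ensemble, not the paper's exact AE variants (AEL/AEP/AEB/AEE/AEW).
    """
    mean = np.asarray(mean, dtype=float)
    std = np.asarray(std, dtype=float)
    # Draw k actions from the Gaussian policy N(mean, std^2)
    samples = rng.normal(mean, std, size=(k, mean.shape[0]))
    return alpha * samples.mean(axis=0) + (1 - alpha) * mean

# Example: a 7-DoF joint-velocity action
action = ensemble_action(np.zeros(7), 0.1 * np.ones(7), k=5, alpha=0.3)
```

Averaging several samples reduces the variance of the executed action while alpha preserves a tunable amount of exploration relative to the deterministic policy output.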
Comparison of accumulated reward over 5 random seeds for our method and the baselines.
Comparison of success rates on the reaching task with obstacles
Comparison of training time across methods
(*The videos show a single random trial, while the statistics are derived from averages across 100 trials.)
Task 1: No obstacles other than the robot's head and body
Task 2: Two obstacles in addition to the head and body
Task 3: An environment cluttered with obstacles
Task 4: A moving obstacle near the goal
Task 5: The goal is changed