PCAP

Learning to interact with deformable tree branches with minimal damage is challenging due to their intricate geometry and inscrutable dynamics. Furthermore, traditional vision-based modeling systems suffer from implicit occlusions in dense foliage, severely changing lighting conditions, and limited field of view, in addition to having a significant computation burden preventing real-time deployment.In this work, we simulate a procedural forest with realistic, self-similar branching structures derived from a parametric L-system model, actuated with crude spring abstractions, mirroring real-world variations with domain randomisation over the morphological and dynamic attributes. We then train a novel Proprioceptive Contact-Aware Policy (PCAP) for a reach task using reinforcement learning, aided by a whole-arm contact detection classifier and reward engineering, without external vision, tactile, or torque sensing. The agent deploys novel strategies to evade and mitigate contact impact, favouring a reactive exploration of the task space. Finally, we demonstrate that the learned behavioural patterns can be transferred zero-shot from simulation to real, allowing the arm to navigate around real branches with unseen topology and variable occlusions while minimising the contact forces and expected ruptures.

Simulation Experiments

sim_baseline_comparison.mov

PCAP vs PPO: Baseline Comparison

Comparison between the baseline policy (PPO with no rewards for avoiding contact) and our contact-aware policy (PCAP) in simulation.

sim_novel_strategies.mov

PCAP: Simulation Novel Strategies

The agent deploys novel and unexpected strategies (i.e, non-obvious side effects of our reward formulation) in both simulation and real to evade contacts as well as to reduce the contact impact forces.

Note: For all videos, use HD on the right bottom of the video and view full screen for clarity. Settings (Gear Icon) >> Quality >> 1080p HD

Real Experiments

real_baseline_comparison.mov

PCAP vs PPO: Baseline Comparison

Comparison between the baseline policy (PPO with no rewards for avoiding contact) and our contact-aware policy (PCAP) in real with actual branches.

real_emergent-strategies.mov

PCAP: Real Novel Strategies

Strategies exhibited of our Proprioceptive Contact-Aware policy in the real world operating on multiple tree branches of varying species.

real_baseline_comparison_ruler_1.mov

PCAP vs PPO: Basic Comparison

A simple comparison of the two policies, with nominal contact with the help of a deformable ruler in real.

Parametric L-System

Ternary L-System Formulation

We extend parametric L-system rules[1] from turtle graphics to simulation to generate realistic, self-similar, branching structure to model occlusion patterns found in real-world.

[1]: P. Prusinkiewicz and A. Lindenmayer. The algorithmic beauty of plants. Springer Science & Business Media, 2012.

Ternary L-System Simulation

While a variety of morphological models can be generated with the L-system formalism and our procedural forest generator, we experiment with 4 classes (a, b, c, d) of ternary branching structures. Above are the implementations in Isaac Gym Simulation

training_with_randomisation.mov

Forest Generator & Training

During the policy training phase, we randomise the L-system formal parameters to vary the branching structures, the dynamics parameters (stiffness/damping), the reach target, the part of the tree the robot has access to, and the measured contact impact forces.

BibTeX

Questions?

Contact [jjac4485@sydney.edu.au ]to get more information about the project

Page updated

Google Sites

Report abuse

Gentle Manipulation of Tree-Branches: A Contact-Aware Policy Learning Approach

Abstract

Simulation Experiments

PCAP vs PPO: Baseline Comparison

PCAP: Simulation Novel Strategies

Real Experiments

PCAP vs PPO: Baseline Comparison

PCAP: Real Novel Strategies

PCAP vs PPO: Basic Comparison

Parametric L-System

Ternary L-System Formulation

Ternary L-System Simulation

Forest Generator & Training

BibTeX

Questions?

Gentle Manipulation of Tree-Branches:
A Contact-Aware Policy Learning Approach