Abstract
Humans master complex motor skills such as walking and running through a sophisticated blend of learning and adaptation. Replicating this level of skill acquisition in musculoskeletal humanoid systems with traditional Reinforcement Learning (RL) methods is challenging due to their intricate control dynamics and over-actuation. Inspired by human developmental learning, we address these challenges with a double-curriculum approach: a three-stage task curriculum (balance, walk, run) combined with a morphology curriculum of up to three stages (4-year-old, 12-year-old, adult) that mimics physical growth. This combination enables the agent to efficiently learn robust gaits that adapt to varying velocities and perturbations. Extensive analysis and ablation studies show that our method outperforms state-of-the-art exploration techniques for musculoskeletal systems. Our approach is agnostic to the underlying RL algorithm and requires no reward tuning, demonstrations, or knowledge of the specific muscular architecture, marking a notable advance in the field.
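The double curriculum described above can be sketched as a nested stage schedule. This is a minimal illustration, not the paper's implementation: the stage names come from the abstract, but the nesting order (tasks inside each morphology), the `train_stage` callback, and all function names are assumptions.

```python
# Hypothetical stage lists taken from the abstract; the actual environments,
# transition criteria, and RL algorithm are not specified there.
MORPHOLOGY_STAGES = ["4-year-old", "12-year-old", "adult"]
TASK_STAGES = ["balance", "walk", "run"]


def double_curriculum_schedule(morph_stages, task_stages):
    """Yield (morphology, task) pairs: for each body model, progress
    through the full task curriculum before "growing" the morphology.
    The nesting order is an assumption for illustration."""
    for morph in morph_stages:
        for task in task_stages:
            yield morph, task


def train(policy, schedule, train_stage):
    """Fine-tune a single policy across all curriculum stages.

    `train_stage(policy, morph, task)` is a placeholder for one stage of
    training with any underlying RL algorithm, returning the updated policy.
    """
    for morph, task in schedule:
        policy = train_stage(policy, morph, task)
    return policy
```

Because the same policy is threaded through every stage, skills learned on the simpler bodies and tasks initialize learning on the harder ones, which is the intuition behind the curriculum.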
A single policy performs balance, walking, and running. It also generalizes to perturbations, obstacles, and varying target velocities within an episode, even though none of these conditions were encountered during training.
Demonstrated conditions: balance; walk; run; increasing velocity; increasing and decreasing velocity; walk and run with perturbations along the x-axis; walk and run with perturbations along the z-axis; stairs; gaps.