EMS®: A Massive Computational Experiment Management System towards Data-driven Robotics

Abstract

We propose EMS®, a cloud-enabled massive computational experiment management system supporting high-throughput computational robotics research. Compared to existing systems, EMS® features a sky-based pipeline orchestrator which allows us to exploit heterogeneous computing environments painlessly (e.g., on-premise clusters, public clouds, edge devices) to optimally deploy large-scale computational jobs (e.g., with more than millions of computational hours) in an integrated fashion. Cornerstoned on this sky-based pipeline orchestrator, this paper introduces three abstraction layers of the EMS® software architecture: (i) Configuration management layer focusing on automatically enumerating experimental configurations; (ii) Dependency management layer focusing on managing the complex task dependencies within each experimental configuration; (iii) Computation management layer focusing on optimally executing the computational tasks using the given computing resource. Such an architectural design greatly increases the scalability and reproducibility of data-driven robotics research leading to much-improved productivity. To demonstrate this point, we compare EMS® with more traditional approaches on an offline reinforcement learning problem for training mobile robots. Our results show that EMS®outperforms more traditional approaches in two magnitudes of orders (in terms of experimental high throughput and cost) with only several lines of code change. We also exploit EMS® to develop mobile robot, robot arm, and bipedal applications, demonstrating its applicability to numerous robot applications.

Notes: EMSR installation tutorial, usage tutorial(02:56), mnist tutorial(05:06) and real-world experiments(07:36).

EMS® documentation: link

Robot Arm Experiment in EMS®

robot arm 1

robot arm 2

Bipedal Walking Experiment in EMS®

bipedal walking 1

bipedal walking 2

bipedal walking 3

bipedal walking 4

Turtlebot Exeperiment in EMS®

deployment1

deployment2

deployment3

Deployment on real robot

We exploited EMS® to develop three data-driven robotics applications. Developing collision avoidance policy of turtlebot robot. This pipeline consists of training collision avoidance policy in Stage simulation with PPO algorithm, testing policy in stage, and deployment on a real robot.

Toy Examples in EMS®

fourrooms

corridor

doorkey

Training Curves in EMS®

Turtlebot Training curves of 64 pipelines

Bipedal Training curves of 64 pipelines

Page updated

Google Sites

Report abuse