The success of machine learning for real-world robotic systems has created a new form of intellectual property: the trained policy. This raises a pressing need for methods that verify ownership and detect unauthorized, potentially unsafe use. While watermarking is well established in other domains, physical policies pose a unique challenge: remote detection. Existing methods assume access to the robot's internal state, but auditors are often limited to external observations (e.g., video footage). This Physical Observation Gap means the watermark must be detected from signals that are noisy, asynchronous, and filtered by unknown system dynamics.
The policy auditor, who aims to identify the policy running on the robot, can access only glimpses of the policy's behavior through remote sensing, such as a camera feed; these glimpses are passed through a detection function to identify the policy (a toy sketch of this pipeline follows the results below). In our experiments, we consider the following glimpse modalities.
Results of our CoNoCo method watermarking a policy trained for a navigation task on the real-world RoboMaster platform, using only Remote Motion Capture glimpses for detection.
Results of our CoNoCo method watermarking a policy trained for MuJoCo HalfCheetah control, using the Ground-Truth Actions, Noisy Onboard Sensors, and Remote Camera Feed glimpse modalities for detection.
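To make the setting concrete, here is a minimal, self-contained sketch of the remote-auditing pipeline described above: a secret key signal is embedded in the policy's actions, the auditor sees only noisy, subsampled glimpses of the resulting state, and a detection function scores the glimpse sequence against the key. This illustrates the problem setting only, not the CoNoCo method; all names (`policy_action`, `glimpses`, `detect`, `KEY_SEED`) and the dynamics and noise models are hypothetical stand-ins.

```python
import numpy as np

rng = np.random.default_rng(0)          # observation-noise randomness
T = 2000                                # number of control steps
KEY_SEED = 42                           # secret shared by owner and auditor
key = np.random.default_rng(KEY_SEED).choice([-1.0, 1.0], size=T)

def policy_action(t, watermarked, eps=0.1):
    """Nominal task action plus, optionally, a small key-driven perturbation."""
    nominal = np.sin(0.01 * t)          # stand-in for the task policy
    return nominal + (eps * key[t] if watermarked else 0.0)

def rollout(watermarked):
    """Dynamics unknown to the auditor: state low-pass filters the actions."""
    x, states = 0.0, np.empty(T)
    for t in range(T):
        x = 0.9 * x + 0.1 * policy_action(t, watermarked)
        states[t] = x
    return states

def glimpses(states, stride=4, sigma=0.02):
    """Auditor's view: subsampled (asynchronous) and noisy state observations."""
    idx = np.arange(0, T, stride)
    return idx, states[idx] + sigma * rng.standard_normal(idx.size)

def detect(idx, obs):
    """Normalized correlation between glimpse increments and the secret key.

    Differencing roughly undoes the low-pass dynamics; under the null
    (no watermark) the score is ~ N(0, 1/n), so large values flag the key.
    """
    inc = np.diff(obs)
    k = key[idx[1:]]                    # key samples aligned to the glimpses
    return float(inc @ k / (np.linalg.norm(inc) * np.sqrt(k.size)))

for wm in (True, False):
    idx, obs = glimpses(rollout(watermarked=wm))
    print(f"watermarked={wm}: detection score = {detect(idx, obs):+.3f}")
```

Differencing the glimpses approximately inverts the slow dynamics, so a matched-filter correlation against the key separates watermarked from clean rollouts even though the auditor never observes the actions directly.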
@misc{amir2025remotelydetectablerobotpolicy,
  title={Remotely Detectable Robot Policy Watermarking},
  author={Michael Amir and Manon Flageat and Amanda Prorok},
  year={2025},
  eprint={2512.15379},
  archivePrefix={arXiv},
  primaryClass={cs.RO},
  url={https://arxiv.org/abs/2512.15379},
}