Remotely Detectable Robot Policy Watermarking

ICLR 2026 Conference SubmissionAnonymous Authors

OpenReview Score: 7.0 Download Report PDF

watermarkingroboticsstochastic policies

The success of machine learning for real-world robotic systems has created a new form of intellectual property: the trained policy. This raises a critical need for novel methods that verify ownership and detect unauthorized, possibly unsafe misuse. While watermarking is established in other domains, physical policies present a unique challenge: remote detection. Existing methods assume access to the robot’s internal state, but auditors are often limited to external observations (e.g., video footage). This “Physical Observation Gap” means the watermark must be detected from signals that are noisy, asynchronous, and filtered by unknown system dynamics. We formalize this challenge using the concept of a glimpse sequence, and introduce Colored Noise Coherency (CoNoCo), the first watermarking strategy designed for remote detection. CoNoCo embeds a spectral signal into the robot’s motions by leveraging the policy’s inherent stochasticity. To show it does not degrade performance, we prove CoNoCo preserves the marginal action distribution. Our experiments demonstrate strong, robust detection across various remote modalities—including motion capture and side-way/top-down video footage—in both simulated and real-world robot experiments. This work provides a necessary step toward protecting intellectual property in robotics, offering the first method for validating the provenance of physical policies non invasively, using purely remote observations.

Abstract:

Disclaimer

This report is AI-GENERATED using Large Language Models and WisPaper (A scholar search engine). It analyzes academic papers' tasks and contributions against retrieved prior work. While this system identifies POTENTIAL overlaps and novel directions, ITS COVERAGE IS NOT EXHAUSTIVE AND JUDGMENTS ARE APPROXIMATE. These results are intended to assist human reviewers and SHOULD NOT be relied upon as a definitive verdict on novelty.

NOTE that some papers exist in multiple, slightly different versions (e.g., with different titles or URLs). The system may retrieve several versions of the same underlying work. The current automated pipeline does not reliably align or distinguish these cases, so human reviewers will need to disambiguate them manually.

If you have any questions, please contact: mingzhang23@m.fudan.edu.cn

Overview

Overall Novelty Assessment

The paper introduces a watermarking framework for robot policies that enables remote detection through external observations, addressing what the authors term the 'Physical Observation Gap.' Within the taxonomy, this work occupies the 'Colored Noise Coherency Watermarking' leaf under 'Frequency-Based Watermarking for Robot Policies.' Notably, this leaf contains only the original paper itself, with no sibling papers identified. The taxonomy as a whole comprises just two papers across two leaves, suggesting this is an emerging and sparsely populated research direction rather than a crowded subfield.

The taxonomy structure reveals that the broader category of 'Frequency-Based Watermarking for Robot Policies' contains one neighboring leaf focused on 'Frequency-Based Replay Attack Detection,' which addresses security concerns in robotic arms using frequency analysis. This neighboring work targets a different problem (replay attacks) and application context (robotic arms with multiple degrees of freedom), while the original paper focuses on ownership verification and misuse detection across general robotic systems. The taxonomy's scope notes clarify that non-frequency watermarking approaches and non-robotic watermarking fall outside this branch, positioning the work within a specific intersection of signal processing and robot policy protection.

Among the three contributions analyzed, the literature search examined only one candidate paper total, finding no clear refutations for any contribution. Specifically, the theoretical guarantee of marginal action distribution preservation was examined against one candidate, which was classified as non-refutable or unclear. The formalization of glimpse sequences and the CoNoCo strategy itself were examined against zero candidates. Given this extremely limited search scope—one candidate paper across all contributions—the analysis provides minimal evidence about prior work overlap. The absence of refutable candidates may reflect either genuine novelty or insufficient literature coverage.

Based on the single-paper taxonomy and minimal literature search (one candidate examined), the work appears to occupy a nascent research area with limited documented prior art. However, the analysis explicitly acknowledges its scope limitations: the search was not exhaustive, relying on top-K semantic matching. The sparse taxonomy and zero-sibling-paper finding suggest either that this specific formulation is genuinely novel or that related work exists under different terminology or in adjacent communities not captured by the search methodology.

Taxonomy

Core-task Taxonomy Papers

Claimed Contributions

Contribution Candidate Papers Compared

Refutable Paper

Research Landscape Overview

Core task: remotely detectable robot policy watermarking using frequency domain analysis. This emerging area addresses the challenge of verifying ownership or authenticity of deployed robot policies by embedding detectable signatures in their behavior. The taxonomy centers on frequency-based watermarking techniques, which exploit the spectral properties of control signals or trajectories to insert imperceptible markers. Within this single top-level branch, the field explores methods that modulate robot actions in the frequency domain—such as injecting colored noise patterns—so that an observer can later extract and verify the watermark without direct access to the policy's internal parameters. Representative works like Robot Policy Watermarking[0] demonstrate how coherency analysis of frequency components can reveal embedded signatures while maintaining task performance. A key theme across this line of work is the trade-off between watermark robustness and the subtlety of behavioral perturbations: stronger frequency signatures improve detectability but risk degrading control quality or becoming noticeable to adversaries. Robot Policy Watermarking[0] sits within the colored noise coherency watermarking cluster, emphasizing spectral coherence measures to achieve remote detection without requiring model access. This approach contrasts with potential alternatives that might embed watermarks in time-domain statistics or rely on cryptographic hashes of policy weights. The focus on frequency domain analysis reflects a broader interest in leveraging signal processing tools to balance imperceptibility, verifiability, and resilience against policy modifications or adversarial removal attempts.

Claimed Contributions

Formalization of remotely detectable policy watermarking using glimpse sequences

0 retrieved papers

The authors introduce a formal framework for robot policy watermarking that must be detected from remote observations only. They define glimpse sequences to model the Physical Observation Gap and identify three core challenges: synchronization uncertainty, system dynamics filtering, and interference plus noise.

0 retrieved papers

Colored Noise Coherency (CoNoCo) watermarking strategy

0 retrieved papers

The authors propose CoNoCo, a watermarking method that embeds spectral signatures by replacing white Gaussian noise with colored Gaussian noise in the policy's exploration, and detects these signatures using spectral coherency. This approach is designed specifically to enable remote detection despite unknown system dynamics and asynchronous sensing.

0 retrieved papers

Theoretical guarantee of marginal action distribution preservation

1 retrieved paper

The authors provide a theoretical proof (Theorem 4.1) demonstrating that their watermarking approach preserves the statistical distribution of actions at any single time step, ensuring the watermarked policy behaves identically to the original policy in terms of marginal action probabilities.

1 retrieved paper

Core Task Comparisons

Comparisons with papers in the same taxonomy category

Within the taxonomy built over the current TopK core-task papers, the original paper is assigned to a leaf with no direct siblings and no cousin branches under the same grandparent topic. In this retrieved landscape, it appears structurally isolated, which is one partial signal of novelty, but still constrained by search coverage and taxonomy granularity.

Contribution Analysis

Detailed comparisons for each claimed contribution

Contribution

Formalization of remotely detectable policy watermarking using glimpse sequences

Contribution

Colored Noise Coherency (CoNoCo) watermarking strategy

Contribution

Theoretical guarantee of marginal action distribution preservation

[2] Watermarks for Deep Reinforcement Learning PDF

Cannot Refute

Remotely Detectable Robot Policy Watermarking

Overview

Overall Novelty Assessment

Taxonomy

Research Landscape Overview

Claimed Contributions

Core Task Comparisons

Contribution Analysis

Formalization of remotely detectable policy watermarking using glimpse sequences

Colored Noise Coherency (CoNoCo) watermarking strategy

Theoretical guarantee of marginal action distribution preservation

[2] Watermarks for Deep Reinforcement Learning PDF

Table of Contents