Watermarking AI Game-Playing Agents in Perfect-Information Games

publication · 2026-05-16

A new research paper on arXiv (2605.14283) introduces a method for watermarking game-playing agents in perfect-information extensive-form games, adapting the KGW watermarking technique originally developed for large language models (LLMs). The watermark embeds hidden information into the agent's strategy, enabling detection of unauthorized use, such as cheating in online chess. The authors demonstrate that the watermark can be detected via statistical testing and that the degradation in strategy quality, measured by expected utility, can be bounded. However, they note a tradeoff between detectability and quality. The study addresses model misuse in gaming platforms, similar to concerns with LLMs.

Key facts

arXiv paper 2605.14283 introduces watermarking for game-playing agents.
Adapts KGW watermark from LLMs to perfect-information extensive-form games.
Watermark enables detection of unauthorized AI use, e.g., cheating in online chess.
Detection uses a statistical test.
Quality degradation (expected utility) can be bounded.
Tradeoff exists between detectability and quality.
Addresses model misuse in gaming platforms.
Published on arXiv.

Watermarking AI Game-Playing Agents in Perfect-Information Games

Key facts

Entities

Institutions

Sources