Semantic Supply-Chain Attacks on AI Agent Skill Registries

ai-technology · 2026-05-13

A recent investigation published on arXiv (2605.11418) indicates that autonomous AI agents are susceptible to semantic supply-chain attacks via their SKILL.md files. These documents outline the conditions under which agents employ modular skills and can be exploited to affect skill discovery, selection, and governance. By utilizing actual ClawHub skills and realistic registry systems, the researchers demonstrated that brief textual prompts during the Discovery phase could manipulate embedding-based retrieval, achieving a pairwise win rate of 86% and an 80% Top-10 placement for adversarial skills. In the Selection phase, framing based solely on descriptions led agents to favor functionally similar adversarial variants in 77.6% of trials. The research also points out semantic evasion methods in Governance, where natural-language metadata can circumvent security measures, highlighting new risks in AI agent ecosystems.

Key facts

arXiv paper 2605.11418 studies semantic supply-chain attacks on AI agent skill registries.
Attacks target SKILL.md files used by autonomous AI agents.
Discovery stage: text triggers achieve 86% pairwise win rate and 80% Top-10 placement.
Selection stage: adversarial variants selected in 77.6% of paired trials.
Governance stage: semantic evasion bypasses security checks.
Study uses real ClawHub skills and realistic registry mechanisms.
Natural-language metadata can manipulate embedding-based retrieval.
Research highlights new risks in AI agent ecosystems.

Semantic Supply-Chain Attacks on AI Agent Skill Registries

Key facts

Entities

Institutions

Sources