ARTFEED — Contemporary Art Intelligence

New Benchmark Exposes Security Vulnerabilities in LLM-Based Autonomous Agents

ai-technology · 2026-05-23

Researchers have introduced A3S-Bench, a benchmark comprising 2,254 real-world test cases to evaluate security vulnerabilities in LLM-based autonomous agents like OpenClaw. The study identifies three novel evasion vectors: temporal evasion, which fragments malicious payloads across multiple interaction turns; spatial evasion, which hides payloads within complex external artifacts that bypass standard LLM parsing; and semantic evasion, which conceals malicious intents under benign contextual noise. Current vulnerability analyses focus on single-turn, stateless behaviors, overlooking risks from stateful, multi-turn interactions and dynamic tool invocations. The framework aims to systematically quantify these threats as autonomous agents gain deep system-level privileges.

Key facts

  • A3S-Bench includes 2,254 real-world test cases
  • Three evasion vectors: temporal, spatial, semantic
  • OpenClaw is an example of an autonomous agent
  • Current analyses focus on single-turn, stateless behaviors
  • Agents operate with deep system-level privileges
  • Temporal evasion fragments payloads across turns
  • Spatial evasion uses complex external artifacts
  • Semantic evasion uses benign contextual noise

Entities

Institutions

  • arXiv

Sources