ACE: Self-Evolving LLM Code Generation via Adversarial Testing

ai-technology · 2026-05-20

Researchers have introduced ACE, a framework for self-evolving code generation tailored for large language models (LLMs), which employs a solver-adversary structure. In this innovative setup, a single LLM alternates between crafting candidate programs and generating adversarial unit test inputs aimed at triggering execution failures, such as runtime errors or non-termination. This method overcomes the shortcomings of conventional solver-verifier systems, where tests produced by verifiers become less effective as solvers advance. ACE operates exclusively on execution-derived supervision, removing the necessity for extensive annotated solutions. The framework emphasizes active failure discovery to facilitate ongoing self-improvement in code generation. This research is documented in arXiv preprint 2605.16299.

Key facts

ACE is a self-evolving code generation framework for LLMs.
It uses a solver-adversary architecture.
A single LLM generates both candidate programs and adversarial test inputs.
Adversarial tests aim to cause execution failures (runtime errors, exceptions, non-termination).
Supervision is derived solely from execution outcomes.
It addresses degradation of solver-verifier frameworks as solvers improve.
The framework does not require large-scale annotated solutions.
The paper is available on arXiv (2605.16299).

ACE: Self-Evolving LLM Code Generation via Adversarial Testing

Key facts

Entities

Institutions

Sources