ACE: Self-Evolving LLM Code Generation via Adversarial Testing
Researchers have introduced ACE, a framework for self-evolving code generation tailored for large language models (LLMs), which employs a solver-adversary structure. In this innovative setup, a single LLM alternates between crafting candidate programs and generating adversarial unit test inputs aimed at triggering execution failures, such as runtime errors or non-termination. This method overcomes the shortcomings of conventional solver-verifier systems, where tests produced by verifiers become less effective as solvers advance. ACE operates exclusively on execution-derived supervision, removing the necessity for extensive annotated solutions. The framework emphasizes active failure discovery to facilitate ongoing self-improvement in code generation. This research is documented in arXiv preprint 2605.16299.
Key facts
- ACE is a self-evolving code generation framework for LLMs.
- It uses a solver-adversary architecture.
- A single LLM generates both candidate programs and adversarial test inputs.
- Adversarial tests aim to cause execution failures (runtime errors, exceptions, non-termination).
- Supervision is derived solely from execution outcomes.
- It addresses degradation of solver-verifier frameworks as solvers improve.
- The framework does not require large-scale annotated solutions.
- The paper is available on arXiv (2605.16299).
Entities
Institutions
- arXiv