ARTFEED — Contemporary Art Intelligence

AutoReproduce: AI Framework Automates Research Paper Reproduction

ai-technology · 2026-04-27

Researchers have introduced AutoReproduce, a multi-agent framework that autonomously reproduces experimental code from research papers. The system uses a novel 'paper lineage' algorithm to mine implicit knowledge from cited literature, enabling end-to-end code reproduction. A sampling-based unit testing strategy ensures executability. The team also developed a benchmark called AutoBench with verified implementations and metrics for evaluating reproduction and execution fidelity. Evaluations on PaperBench and AutoBench show AutoReproduce consistently outperforms existing methods. The work aims to address the labor-intensive nature of reproducing increasingly complex research methods.

Key facts

  • AutoReproduce is a multi-agent framework for autonomous reproduction of experimental code.
  • The paper lineage algorithm mines implicit knowledge from cited literature.
  • A sampling-based unit testing strategy ensures code executability.
  • AutoBench is a benchmark with verified implementations and comprehensive metrics.
  • Evaluations were conducted on PaperBench and AutoBench.
  • AutoReproduce consistently surpasses existing methods.
  • The work aims to accelerate scientific progress by reducing reproduction effort.
  • The paper is available on arXiv with ID 2505.20662.

Entities

Institutions

  • arXiv

Sources