Ratchet: Self-Evolving Skill Library for LLM Agents

ai-technology · 2026-05-23

A new paper introduces Ratchet, a single-agent loop that enables frozen LLMs to autonomously manage their own skill libraries through writing, retrieval, curation, and retirement of natural-language skills. The system addresses the bottleneck in self-evolving skill libraries, where LLM-authored skills previously showed zero improvement over no-skill baselines while human-curated skills delivered +16.2pp. Ratchet integrates four hygiene mechanisms: outcome-driven retirement, bounded active-cap, meta-skill authoring guidance, and pattern canonicalisation. On the MBPP+ hard-100 benchmark using Claude Opus 4.7, Ratchet improved held-out pass@1 from a baseline of 0.258 to a peak of 0.658, a rolling-mean gain of +0.328, compared to a no-skill control drift of +0.002. The paper is available on arXiv under ID 2605.22148.

Key facts

Ratchet is a single-agent loop for self-evolving LLM skill libraries.
Frozen LLMs write, retrieve, curate, and retire their own skills.
Four hygiene mechanisms: outcome-driven retirement, bounded active-cap, meta-skill authoring guidance, pattern canonicalisation.
On MBPP+ hard-100 with Claude Opus 4.7, pass@1 rose from 0.258 to peak 0.658.
Rolling-mean gain of +0.328 vs no-skill control drift of +0.002.
Paper ID: arXiv:2605.22148.
Self-evolving skill libraries were pioneered by Voyager.
LLM-authored skills previously showed +0.0pp improvement over no-skill baselines.

Ratchet: Self-Evolving Skill Library for LLM Agents

Key facts

Entities

Institutions

Sources