STARS Framework Stabilizes Recurrent Dynamics in Looped Language Models

other · 2026-05-27

A new training framework called STARS (STAbility-driven Recurrent Scaling) addresses the instability of test-time scaling in Looped Language Models (LoopLMs). LoopLMs use depth recurrence for latent reasoning but often see performance peak then collapse with further iteration. STARS constrains latent states to approach asymptotically stable fixed points via Jacobian Spectral Radius Regularization with random loop sampling. Experiments on arithmetic tasks show STARS achieves stable and effective scaling, enabling longer reasoning chains without degradation. The work conceptualizes reasoning as uncertainty reduction and targets convergence to stable fixed points.

Key facts

LoopLMs enable efficient latent reasoning through depth recurrence.
Performance in LoopLMs often peaks at a certain iteration depth then collapses.
STARS stands for STAbility-driven Recurrent Scaling.
STARS constrains latent states to approach asymptotically stable fixed points.
STARS uses Jacobian Spectral Radius Regularization with random loop sampling.
The framework maximizes effectiveness while ensuring rigorous stability.
Experiments were conducted on arithmetic tasks.
The work proposes reasoning as uncertainty reduction.

Entities

—

Sources

arXiv cs.AI — 2026-05-27